Source author record

Peng Li

Peng Li appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Catalog footprint

What is connected

76works

39topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

GIFT: Guided Fine-Tuning and Transfer for Enhancing Instruction-Tuned Language Models

A promising paradigm for adapting instruction-tuned language models is to learn task-specific updates on a pretrained base model and subsequently merge them into the instruction-tuned model. However, existing approaches typically treat the instruction-tuned model as a passive target that is only involved at the final merging stage, without guiding the training process. We propose GIFT (Guided Fine-Tuning and Transfer), a simple and efficient framework that incorporates guidance from the instruction model into task adaptation. GIFT fine-tunes a low-rank adapter on the pretrained base model using confidence signals derived from the instruction-tuned model. The learned adapter is then merged into the instruction-tuned model, yielding task-specialized models that preserve general instruction-following behavior. We evaluate GIFT on mathematical and knowledge-intensive benchmarks across multiple model families and scales. Results show that GIFT consistently outperforms direct fine-tuning and representative transfer-based baselines, while maintaining robust generalization and favorable test-time scaling behavior.

preprint2025arXiv

A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and Beyond

Recent Large Reasoning Models (LRMs), such as DeepSeek-R1 and OpenAI o1, have demonstrated strong performance gains by scaling up the length of Chain-of-Thought (CoT) reasoning during inference. However, a growing concern lies in their tendency to produce excessively long reasoning traces, which are often filled with redundant content (e.g., repeated definitions), over-analysis of simple problems, and superficial exploration of multiple reasoning paths for harder tasks. This inefficiency introduces significant challenges for training, inference, and real-world deployment (e.g., in agent-based systems), where token economy is critical. In this survey, we provide a comprehensive overview of recent efforts aimed at improving reasoning efficiency in LRMs, with a particular focus on the unique challenges that arise in this new paradigm. We identify common patterns of inefficiency, examine methods proposed across the LRM lifecycle, i.e., from pretraining to inference, and discuss promising future directions for research. To support ongoing development, we also maintain a real-time GitHub repository tracking recent progress in the field. We hope this survey serves as a foundation for further exploration and inspires innovation in this rapidly evolving area.

preprint2024arXiv

Pulse shape discrimination based on the Tempotron: a powerful classifier on GPU

This study utilized the Tempotron, a robust classifier based on a third-generation neural network model, for pulse shape discrimination. By eliminating the need for manual feature extraction, the Tempotron model can process pulse signals directly, generating discrimination results based on prior knowledge. The study performed experiments using GPU acceleration, resulting in over 500 times faster compared to the CPU-based model, and investigated the impact of noise augmentation on the Tempotron performance. Experimental results substantiated that Tempotron serves as a formidable classifier, adept at accomplishing high discrimination accuracy on both AmBe and time-of-flight PuBe datasets. Furthermore, analyzing the neural activity of Tempotron during training shed light on its learning characteristics and aided in selecting its hyperparameters. Moreover, the study addressed the constraints and potential avenues for future development in utilizing the Tempotron for pulse shape discrimination. The dataset used in this study and the GPU-based Tempotron are publicly available on GitHub at https://github.com/HaoranLiu507/TempotronGPU.

preprint2024arXiv

Random-coupled Neural Network

Improving the efficiency of current neural networks and modeling them in biological neural systems have become popular research directions in recent years. Pulse-coupled neural network (PCNN) is a well applicated model for imitating the computation characteristics of the human brain in computer vision and neural network fields. However, differences between the PCNN and biological neural systems remain: limited neural connection, high computational cost, and lack of stochastic property. In this study, random-coupled neural network (RCNN) is proposed. It overcomes these difficulties in PCNN's neuromorphic computing via a random inactivation process. This process randomly closes some neural connections in the RCNN model, realized by the random inactivation weight matrix of link input. This releases the computational burden of PCNN, making it affordable to achieve vast neural connections. Furthermore, the image and video processing mechanisms of RCNN are researched. It encodes constant stimuli as periodic spike trains and periodic stimuli as chaotic spike trains, the same as biological neural information encoding characteristics. Finally, the RCNN is applicated to image segmentation, fusion, and pulse shape discrimination subtasks. It is demonstrated to be robust, efficient, and highly anti-noised, with outstanding performance in all applications mentioned above.

preprint2023arXiv

Set Prediction Guided by Semantic Concepts for Diverse Video Captioning

Diverse video captioning aims to generate a set of sentences to describe the given video in various aspects. Mainstream methods are trained with independent pairs of a video and a caption from its ground-truth set without exploiting the intra-set relationship, resulting in low diversity of generated captions. Different from them, we formulate diverse captioning into a semantic-concept-guided set prediction (SCG-SP) problem by fitting the predicted caption set to the ground-truth set, where the set-level relationship is fully captured. Specifically, our set prediction consists of two synergistic tasks, i.e., caption generation and an auxiliary task of concept combination prediction providing extra semantic supervision. Each caption in the set is attached to a concept combination indicating the primary semantic content of the caption and facilitating element alignment in set prediction. Furthermore, we apply a diversity regularization term on concepts to encourage the model to generate semantically diverse captions with various concept combinations. These two tasks share multiple semantics-specific encodings as input, which are obtained by iterative interaction between visual features and conceptual queries. The correspondence between the generated captions and specific concept combinations further guarantees the interpretability of our model. Extensive experiments on benchmark datasets show that the proposed SCG-SP achieves state-of-the-art (SOTA) performance under both relevance and diversity metrics.

preprint2022arXiv

A Computational Framework of Cortical Microcircuits Approximates Sign-concordant Random Backpropagation

Several recent studies attempt to address the biological implausibility of the well-known backpropagation (BP) method. While promising methods such as feedback alignment, direct feedback alignment, and their variants like sign-concordant feedback alignment tackle BP's weight transport problem, their validity remains controversial owing to a set of other unsolved issues. In this work, we answer the question of whether it is possible to realize random backpropagation solely based on mechanisms observed in neuroscience. We propose a hypothetical framework consisting of a new microcircuit architecture and its supporting Hebbian learning rules. Comprising three types of cells and two types of synaptic connectivity, the proposed microcircuit architecture computes and propagates error signals through local feedback connections and supports the training of multi-layered spiking neural networks with a globally defined spiking error function. We employ the Hebbian rule operating in local compartments to update synaptic weights and achieve supervised learning in a biologically plausible manner. Finally, we interpret the proposed framework from an optimization point of view and show its equivalence to sign-concordant feedback alignment. The proposed framework is benchmarked on several datasets including MNIST and CIFAR10, demonstrating promising BP-comparable accuracy.

preprint2022arXiv

A Roadmap for Big Model

With the rapid development of deep learning, training Big Models (BMs) for multiple downstream tasks becomes a popular paradigm. Researchers have achieved various outcomes in the construction of BMs and the BM application in many fields. At present, there is a lack of research work that sorts out the overall progress of BMs and guides the follow-up research. In this paper, we cover not only the BM technologies themselves but also the prerequisites for BM training and applications with BMs, dividing the BM review into four parts: Resource, Models, Key Technologies and Application. We introduce 16 specific BM-related topics in those four parts, they are Data, Knowledge, Computing System, Parallel Training System, Language Model, Vision Model, Multi-modal Model, Theory&Interpretability, Commonsense Reasoning, Reliability&Security, Governance, Evaluation, Machine Translation, Text Generation, Dialogue and Protein Research. In each topic, we summarize clearly the current studies and propose some future research directions. At the end of this paper, we conclude the further development of BMs in a more general view.

preprint2022arXiv

A Simple but Effective Pluggable Entity Lookup Table for Pre-trained Language Models

Pre-trained language models (PLMs) cannot well recall rich factual knowledge of entities exhibited in large-scale corpora, especially those rare entities. In this paper, we propose to build a simple but effective Pluggable Entity Lookup Table (PELT) on demand by aggregating the entity's output representations of multiple occurrences in the corpora. PELT can be compatibly plugged as inputs to infuse supplemental entity knowledge into PLMs. Compared to previous knowledge-enhanced PLMs, PELT only requires 0.2%-5% pre-computation with capability of acquiring knowledge from out-of-domain corpora for domain adaptation scenario. The experiments on knowledge-related tasks demonstrate that our method, PELT, can flexibly and effectively transfer entity knowledge from related corpora into PLMs with different architectures.

preprint2022arXiv

Anisotropic Dzyaloshinskii-Moriya interaction and topological magnetism in two-dimensional magnets protected by P4-m2 crystal symmetry

As a fundamental magnetic parameter, Dzyaloshinskii-Moriya interaction (DMI), has gained a great deal of attention in the last two decades due to its critical role in formation of magnetic skyrmions. Recent discoveries of two-dimensional (2D) van der Waals (vdW) magnets has also gained a great deal of attention due to appealing physical properties, such as gate tunability, flexibility and miniaturization. Intensive studies have shown that isotropic DMI stabilizes ferromagnetic (FM) topological spin textures in 2D magnets or their corresponding heterostructures. However, the investigation of anisotropic DMI and antiferromagnetic (AFM) topological spin configurations remains elusive. Here, we propose and demonstrate that a family of 2D magnets with P4-m2 symmetry-protected anisotropic DMI. More interestingly, various topological spin configurations, including FM/AFM antiskyrmion and AFM vortex-antivortex pair, emerge in this family. These results give a general method to design anisotropic DMI and pave the way towards topological magnetism in 2D materials using crystal symmetry.

preprint2022arXiv

Binary Neural Networks as a general-propose compute paradigm for on-device computer vision

For binary neural networks (BNNs) to become the mainstream on-device computer vision algorithm, they must achieve a superior speed-vs-accuracy tradeoff than 8-bit quantization and establish a similar degree of general applicability in vision tasks. To this end, we propose a BNN framework comprising 1) a minimalistic inference scheme for hardware-friendliness, 2) an over-parameterized training scheme for high accuracy, and 3) a simple procedure to adapt to different vision tasks. The resultant framework overtakes 8-bit quantization in the speed-vs-accuracy tradeoff for classification, detection, segmentation, super-resolution and matching: our BNNs not only retain the accuracy levels of their 8-bit baselines but also showcase 1.3-2.4$\times$ faster FPS on mobile CPUs. Similar conclusions can be drawn for prototypical systolic-array-based AI accelerators, where our BNNs promise 2.8-7$\times$ fewer execution cycles than 8-bit and 2.1-2.7$\times$ fewer cycles than alternative BNN designs. These results suggest that the time for large-scale BNN adoption could be upon us.

preprint2022arXiv

CF-YOLO: Cross Fusion YOLO for Object Detection in Adverse Weather with a High-quality Real Snow Dataset

Snow is one of the toughest adverse weather conditions for object detection (OD). Currently, not only there is a lack of snowy OD datasets to train cutting-edge detectors, but also these detectors have difficulties learning latent information beneficial for detection in snow. To alleviate the two above problems, we first establish a real-world snowy OD dataset, named RSOD. Besides, we develop an unsupervised training strategy with a distinctive activation function, called $Peak \ Act$, to quantitatively evaluate the effect of snow on each object. Peak Act helps grading the images in RSOD into four-difficulty levels. To our knowledge, RSOD is the first quantitatively evaluated and graded snowy OD dataset. Then, we propose a novel Cross Fusion (CF) block to construct a lightweight OD network based on YOLOv5s (call CF-YOLO). CF is a plug-and-play feature aggregation module, which integrates the advantages of Feature Pyramid Network and Path Aggregation Network in a simpler yet more flexible form. Both RSOD and CF lead our CF-YOLO to possess an optimization ability for OD in real-world snow. That is, CF-YOLO can handle unfavorable detection problems of vagueness, distortion and covering of snow. Experiments show that our CF-YOLO achieves better detection results on RSOD, compared to SOTAs. The code and dataset are available at https://github.com/qqding77/CF-YOLO-and-RSOD.

preprint2022arXiv

Deterministic relation between optical polarization and lattice symmetry revealed in ion-doped single microcrystals

Rare-earth ions doped crystals are of great significance for micro-sensing and quantum information, whilst the ions in the crystals emit light with spontaneous partial polarization, which is, though believed to be originated from the crystal lattice structure, still lacking a deterministic explanation that can be tested with quantitative accuracy. We report the experimental evidence showing the profound physical relation between the polarization degree of light emitted by the doped ion and the lattice symmetry, by demonstrating, with unprecedented precision, that the lattice constant ratio c/a directly quantifies the macroscopic effective polar angle of the electric and magnetic dipoles, which essentially determines the linear polarization degree of the emission. Based on this discovery, we further propose a pure optical technology to identify the three-dimensional orientation of a rod-shaped single microcrystal using the polarization-resolved micro-spectroscopy. Our results, revealing the physical origin of light polarization in ion-doped crystals, open the way towards on-demand polarization control with crystallography, and provide a versatile platform for polarization-based microscale sensing in dynamical systems.

preprint2022arXiv

ELLE: Efficient Lifelong Pre-training for Emerging Data

Current pre-trained language models (PLM) are typically trained with static data, ignoring that in real-world scenarios, streaming data of various sources may continuously grow. This requires PLMs to integrate the information from all the sources in a lifelong manner. Although this goal could be achieved by exhaustive pre-training on all the existing data, such a process is known to be computationally expensive. To this end, we propose ELLE, aiming at efficient lifelong pre-training for emerging data. Specifically, ELLE consists of (1) function preserved model expansion, which flexibly expands an existing PLM's width and depth to improve the efficiency of knowledge acquisition; and (2) pre-trained domain prompts, which disentangle the versatile knowledge learned during pre-training and stimulate the proper knowledge for downstream tasks. We experiment ELLE with streaming data from 5 domains on BERT and GPT. The results show the superiority of ELLE over various lifelong learning baselines in both pre-training efficiency and downstream performances. The codes are publicly available at https://github.com/thunlp/ELLE.

preprint2022arXiv

Evidence of Magnon-Mediated Orbital Magnetism in a Quasi-2D Topological Magnon Insulator

We explore spin dynamics in Cu(1,3-bdc), a quasi-2D topological magnon insulator. The results show that the thermal evolution of Landé $g$-factor ($g$) is anisotropic: $g_\textrm{in-plane}$ reduces while $g_\textrm{out-plane}$ increases with increasing temperature $T$. Moreover, the anisotropy of the $g$-factor ($Δg$) and the anisotropy of saturation magnetization ($ΔM_\textrm{s}$) are correlated below 4 K, but they diverge above 4 K. We show that the electronic orbital moment contributes to the $g$ anisotropy at lower $T$, while the topological orbital moment induced by thermally excited spin chirality dictates the $g$ anisotropy at higher $T$. Our work suggests an interplay among topology, spin chirality, and orbital magnetism in Cu(1,3-bdc).

preprint2022arXiv

Fully Hyperbolic Neural Networks

Hyperbolic neural networks have shown great potential for modeling complex data. However, existing hyperbolic networks are not completely hyperbolic, as they encode features in a hyperbolic space yet formalize most of their operations in the tangent space (a Euclidean subspace) at the origin of the hyperbolic space. This hybrid method greatly limits the modeling ability of networks. In this paper, we propose a fully hyperbolic framework to build hyperbolic networks based on the Lorentz model by adapting the Lorentz transformations (including boost and rotation) to formalize essential operations of neural networks. Moreover, we also prove that linear transformation in tangent spaces used by existing hyperbolic networks is a relaxation of the Lorentz rotation and does not include the boost, implicitly limiting the capabilities of existing hyperbolic networks. The experimental results on four NLP tasks show that our method has better performance for building both shallow and deep networks. Our code will be released to facilitate follow-up research.

preprint2022arXiv

GoG: Relation-aware Graph-over-Graph Network for Visual Dialog

Visual dialog, which aims to hold a meaningful conversation with humans about a given image, is a challenging task that requires models to reason the complex dependencies among visual content, dialog history, and current questions. Graph neural networks are recently applied to model the implicit relations between objects in an image or dialog. However, they neglect the importance of 1) coreference relations among dialog history and dependency relations between words for the question representation; and 2) the representation of the image based on the fully represented question. Therefore, we propose a novel relation-aware graph-over-graph network (GoG) for visual dialog. Specifically, GoG consists of three sequential graphs: 1) H-Graph, which aims to capture coreference relations among dialog history; 2) History-aware Q-Graph, which aims to fully understand the question through capturing dependency relations between words based on coreference resolution on the dialog history; and 3) Question-aware I-Graph, which aims to capture the relations between objects in an image based on fully question representation. As an additional feature representation module, we add GoG to the existing visual dialogue model. Experimental results show that our model outperforms the strong baseline in both generative and discriminative settings by a significant margin.

preprint2022arXiv

Growth, Electronic Structure and Superconductivity of Ultrathin Epitaxial CoSi2 Films

We report growth, electronic structure and superconductivity of ultrathin epitaxial CoSi2 films on Si(111). At low coverages, preferred islands with 2, 5 and 6 monolayers height develop, which agrees well with the surface energy calculation. We observe clear quantum well states as a result of electronic confinement and their dispersion agrees well with density functional theory calculations, indicating weak correlation effect despite strong contributions from Co 3d electrons. Ex-situ transport measurements show that superconductivity persists down to at least 10 monolayers, with reduced Tc but largely enhanced upper critical field. Our study opens up the opportunity to study the interplay between quantum confinement, interfacial symmetry breaking and superconductivity in an epitaxial silicide film, which is technologically relevant in microelectronics.

preprint2022arXiv

Knowledge Inheritance for Pre-trained Language Models

Recent explorations of large-scale pre-trained language models (PLMs) have revealed the power of PLMs with huge amounts of parameters, setting off a wave of training ever-larger PLMs. However, it requires tremendous computational resources to train a large-scale PLM, which may be practically unaffordable. In addition, existing large-scale PLMs are mainly trained from scratch individually, ignoring that many well-trained PLMs are available. To this end, we explore the question how could existing PLMs benefit training large-scale PLMs in future. Specifically, we introduce a pre-training framework named "knowledge inheritance" (KI) and explore how could knowledge distillation serve as auxiliary supervision during pre-training to efficiently learn larger PLMs. Experimental results demonstrate the superiority of KI in training efficiency. We also conduct empirical analyses to explore the effects of teacher PLMs' pre-training settings, including model architecture, pre-training data, etc. Finally, we show that KI could be applied to domain adaptation and knowledge transfer.

preprint2022arXiv

LF-VIO: A Visual-Inertial-Odometry Framework for Large Field-of-View Cameras with Negative Plane

Visual-inertial-odometry has attracted extensive attention in the field of autonomous driving and robotics. The size of Field of View (FoV) plays an important role in Visual-Odometry (VO) and Visual-Inertial-Odometry (VIO), as a large FoV enables to perceive a wide range of surrounding scene elements and features. However, when the field of the camera reaches the negative half plane, one cannot simply use [u,v,1]^T to represent the image feature points anymore. To tackle this issue, we propose LF-VIO, a real-time VIO framework for cameras with extremely large FoV. We leverage a three-dimensional vector with unit length to represent feature points, and design a series of algorithms to overcome this challenge. To address the scarcity of panoramic visual odometry datasets with ground-truth location and pose, we present the PALVIO dataset, collected with a Panoramic Annular Lens (PAL) system with an entire FoV of 360°x(40°-120°) and an IMU sensor. With a comprehensive variety of experiments, the proposed LF-VIO is verified on both the established PALVIO benchmark and a public fisheye camera dataset with a FoV of 360°x(0°-93.5°). LF-VIO outperforms state-of-the-art visual-inertial-odometry methods. Our dataset and code are made publicly available at https://github.com/flysoaryun/LF-VIO

preprint2022arXiv

Manual-Guided Dialogue for Flexible Conversational Agents

How to build and use dialogue data efficiently, and how to deploy models in different domains at scale can be two critical issues in building a task-oriented dialogue system. In this paper, we propose a novel manual-guided dialogue scheme to alleviate these problems, where the agent learns the tasks from both dialogue and manuals. The manual is an unstructured textual document that guides the agent in interacting with users and the database during the conversation. Our proposed scheme reduces the dependence of dialogue models on fine-grained domain ontology, and makes them more flexible to adapt to various domains. We then contribute a fully-annotated multi-domain dataset MagDial to support our scheme. It introduces three dialogue modeling subtasks: instruction matching, argument filling, and response generation. Modeling these subtasks is consistent with the human agent's behavior patterns. Experiments demonstrate that the manual-guided dialogue scheme improves data efficiency and domain scalability in building dialogue systems. The dataset and benchmark will be publicly available for promoting future research.

preprint2022arXiv

MoEfication: Transformer Feed-forward Layers are Mixtures of Experts

Recent work has shown that feed-forward networks (FFNs) in pre-trained Transformers are a key component, storing various linguistic and factual knowledge. However, the computational patterns of FFNs are still unclear. In this work, we study the computational patterns of FFNs and observe that most inputs only activate a tiny ratio of neurons of FFNs. This phenomenon is similar to the sparsity of the human brain, which drives research on functional partitions of the human brain. To verify whether functional partitions also emerge in FFNs, we propose to convert a model into its MoE version with the same parameters, namely MoEfication. Specifically, MoEfication consists of two phases: (1) splitting the parameters of FFNs into multiple functional partitions as experts, and (2) building expert routers to decide which experts will be used for each input. Experimental results show that MoEfication can conditionally use 10% to 30% of FFN parameters while maintaining over 95% original performance for different models on various downstream tasks. Besides, MoEfication brings two advantages: (1) it significantly reduces the FLOPS of inference, i.e., 2x speedup with 25% of FFN parameters, and (2) it provides a fine-grained perspective to study the inner mechanism of FFNs. The source code of this paper can be obtained from https://github.com/thunlp/MoEfication.

preprint2022arXiv

OPAL: Occlusion Pattern Aware Loss for Unsupervised Light Field Disparity Estimation

Light field disparity estimation is an essential task in computer vision with various applications. Although supervised learning-based methods have achieved both higher accuracy and efficiency than traditional optimization-based methods, the dependency on ground-truth disparity for training limits the overall generalization performance not to say for real-world scenarios where the ground-truth disparity is hard to capture. In this paper, we argue that unsupervised methods can achieve comparable accuracy, but, more importantly, much higher generalization capacity and efficiency than supervised methods. Specifically, we present the Occlusion Pattern Aware Loss, named OPAL, which successfully extracts and encodes the general occlusion patterns inherent in the light field for loss calculation. OPAL enables: i) accurate and robust estimation by effectively handling occlusions without using any ground-truth information for training and ii) much efficient performance by significantly reducing the network parameters required for accurate inference. Besides, a transformer-based network and a refinement module are proposed for achieving even more accurate results. Extensive experiments demonstrate our method not only significantly improves the accuracy compared with the SOTA unsupervised methods, but also possesses strong generalization capacity, even for real-world data, compared with supervised methods. Our code will be made publicly available.

preprint2022arXiv

Packed Levitated Marker for Entity and Relation Extraction

Recent entity and relation extraction works focus on investigating how to obtain a better span representation from the pre-trained encoder. However, a major limitation of existing works is that they ignore the interrelation between spans (pairs). In this work, we propose a novel span representation approach, named Packed Levitated Markers (PL-Marker), to consider the interrelation between the spans (pairs) by strategically packing the markers in the encoder. In particular, we propose a neighborhood-oriented packing strategy, which considers the neighbor spans integrally to better model the entity boundary information. Furthermore, for those more complicated span pair classification tasks, we design a subject-oriented packing strategy, which packs each subject and all its objects to model the interrelation between the same-subject span pairs. The experimental results show that, with the enhanced marker feature, our model advances baselines on six NER benchmarks, and obtains a 4.1%-4.3% strict relation F1 improvement with higher speed over previous state-of-the-art models on ACE04 and ACE05.

preprint2022arXiv

Quantum Interference Transport in two-dimensional Semi-Dirac Semimetals

Semi-Dirac semimetals have received enthusiastic research both theoretically and experimentally in the recent years. Due to the anisotropic dispersion, its physical properties are highly direction-dependent. In this work we employ the Feynman diagrammatic perturbation theory to study the transport properties in quantum diffusive regime. The magneto-conductivity with quantum interference corrections is derived, which demonstrate the weak localization effect in the semi-Dirac semimetal. Furthermore, the origin of anomalous Hall conductivity is also clarified, where both the intrinsic and side-jump contributions vanish and only the skew-scattering gives rise to non-zero transverse conductivity. The conductance fluctuations in both mesoscopic and quantum diffusive regimes are investigated in detail. Our work provides theoretical predictions for transport experiments, which can be examined by conductivity measurements at sufficiently low temperature.

preprint2022arXiv

Rethinking the Promotion Brought by Contrastive Learning to Semi-Supervised Node Classification

Graph Contrastive Learning (GCL) has proven highly effective in promoting the performance of Semi-Supervised Node Classification (SSNC). However, existing GCL methods are generally transferred from other fields like CV or NLP, whose underlying working mechanism remains under-explored. In this work, we first deeply probe the working mechanism of GCL in SSNC, and find that the promotion brought by GCL is severely unevenly distributed: the improvement mainly comes from subgraphs with less annotated information, which is fundamentally different from contrastive learning in other fields. However, existing GCL methods generally ignore this uneven distribution of annotated information and apply GCL evenly to the whole graph. To remedy this issue and further improve GCL in SSNC, we propose the Topology InFormation gain-Aware Graph Contrastive Learning (TIFA-GCL) framework that considers the annotated information distribution across graph in GCL. Extensive experiments on six benchmark graph datasets, including the enormous OGB-Products graph, show that TIFA-GCL can bring a larger improvement than existing GCL methods in both transductive and inductive settings. Further experiments demonstrate the generalizability and interpretability of TIFA-GCL.

preprint2022arXiv

Tunable magnetically induced transparency spectra in magnon-magnon coupled Y3Fe5O12/permalloy bilayers

Hybrid magnonic systems host a variety of characteristic quantum phenomena such as the magnetically-induced transparency (MIT) and Purcell effect, which are considered useful for future coherent quantum information processing. In this work, we experimentally demonstrate a tunable MIT effect in the Y3Fe5O12(YIG)/Permalloy(Py) magnon-magnon coupled system via changing the magnetic field orientations. By probing the magneto-optic effects of Py and YIG, we identify clear features of MIT spectra induced by the mode hybridization between the uniform mode of Py and the perpendicular standing spin-wave modes of YIG. By changing the external magnetic field orientations, we observe a tunable coupling strength between the YIG's spin-wave modes and the Py's uniform mode, upon the application of an out-of-plane magnetic field. This observation is theoretically interpreted by a geometrical consideration of the Py and YIG magnetization under the oblique magnetic field even at a constant interfacial exchange coupling. Our findings show high promise for investigating tunable coherent phenomena with hybrid magnonic platforms.

preprint2021arXiv

Anisotropic Magnon Spin Transport in Ultra-thin Spinel Ferrite Thin Films -- Evidence for Anisotropy in Exchange Stiffness

We report measurements of magnon spin transport in a spinel ferrite, magnesium aluminum ferrite $\mathrm{MgAl_{0.5}Fe_{1.5}O_4}$ (MAFO), which has a substantial in-plane four-fold magnetic anisotropy. We observe spin diffusion lengths $> 0.8$ $\mathrm{μm}$ at room temperature in 6 nm films, with spin diffusion length 30% longer along the easy axes compared to the hard axes. The sign of this difference is opposite to the effects just of anisotropy in the magnetic energy for a uniform magnetic state. We suggest instead that accounting for anisotropy in exchange stiffness is necessary to explain these results.

preprint2021arXiv

Auto-FuzzyJoin: Auto-Program Fuzzy Similarity Joins Without Labeled Examples

Fuzzy similarity join is an important database operator widely used in practice. So far the research community has focused exclusively on optimizing fuzzy join \textit{scalability}. However, practitioners today also struggle to optimize fuzzy-join \textit{quality}, because they face a daunting space of parameters (e.g., distance-functions, distance-thresholds, tokenization-options, etc.), and often have to resort to a manual trial-and-error approach to program these parameters in order to optimize fuzzy-join quality. This key challenge of automatically generating high-quality fuzzy-join programs has received surprisingly little attention thus far. In this work, we study the problem of "auto-program" fuzzy-joins. Leveraging a geometric interpretation of distance-functions, we develop an unsupervised \textsc{Auto-FuzzyJoin} framework that can infer suitable fuzzy-join programs on given input tables, without requiring explicit human input such as labeled training data. Using \textsc{Auto-FuzzyJoin}, users only need to provide two input tables $L$ and $R$, and a desired precision target $τ$ (say 0.9). \textsc{Auto-FuzzyJoin} leverages the fact that one of the input is a reference table to automatically program fuzzy-joins that meet the precision target $τ$ in expectation, while maximizing fuzzy-join recall (defined as the number of correctly joined records). Experiments on both existing benchmarks and a new benchmark with 50 fuzzy-join tasks created from Wikipedia data suggest that the proposed \textsc{Auto-FuzzyJoin} significantly outperforms existing unsupervised approaches, and is surprisingly competitive even against supervised approaches (e.g., Magellan and DeepMatcher) when 50\% of ground-truth labels are used as training data.

preprint2021arXiv

Giant Topological Hall Effect in van der Waals Heterostructures of CrTe2/Bi2Te3

Discoveries of interfacial topological Hall effect (THE) provide an ideal platform for exploring physics arising from the interplay between topology and magnetism. The interfacial topological Hall effect is closely related to the Dzyaloshinskii-Moriya interaction (DMI) at interface and topological spin textures. However, it is difficult to achieve a sizable THE in heterostructures due to the stringent constraints on the constituents of THE heterostructures such as strong spin-orbit coupling (SOC). Here we report the observation of a giant THE signal of 1.39 $μΩ\cdot$cm in the van der Waals heterostructures of CrTe2/Bi2Te3 fabricated by molecular beam epitaxy, a prototype of two-dimensional (2D) ferromagnet (FM)/topological insulator (TI). This large magnitude of THE is attributed to an optimized combination of 2D ferromagnetism in CrTe2, strong SOC in Bi2Te3, and an atomically sharp interface. Our work reveals CrTe2/Bi2Te3 as a convenient platform for achieving large interfacial THE in hybrid systems, which could be utilized to develop quantum science and high-density information storage.

preprint2020arXiv

Comprehensive SNN Compression Using ADMM Optimization and Activity Regularization

As well known, the huge memory and compute costs of both artificial neural networks (ANNs) and spiking neural networks (SNNs) greatly hinder their deployment on edge devices with high efficiency. Model compression has been proposed as a promising technique to improve the running efficiency via parameter and operation reduction. Whereas, this technique is mainly practiced in ANNs rather than SNNs. It is interesting to answer how much an SNN model can be compressed without compromising its functionality, where two challenges should be addressed: i) the accuracy of SNNs is usually sensitive to model compression, which requires an accurate compression methodology; ii) the computation of SNNs is event-driven rather than static, which produces an extra compression dimension on dynamic spikes. To this end, we realize a comprehensive SNN compression through three steps. First, we formulate the connection pruning and weight quantization as a constrained optimization problem. Second, we combine spatio-temporal backpropagation (STBP) and alternating direction method of multipliers (ADMM) to solve the problem with minimum accuracy loss. Third, we further propose activity regularization to reduce the spike events for fewer active operations. These methods can be applied in either a single way for moderate compression or a joint way for aggressive compression. We define several quantitative metrics to evaluation the compression performance for SNNs. Our methodology is validated in pattern recognition tasks over MNIST, N-MNIST, CIFAR10, and CIFAR100 datasets, where extensive comparisons, analyses, and insights are provided. To our best knowledge, this is the first work that studies SNN compression in a comprehensive manner by exploiting all compressible components and achieves better results.

preprint2020arXiv

Evolving the pulmonary nodules diagnosis from classical approaches to deep learning aided decision support: three decades development course and future prospect

Lung cancer is the commonest cause of cancer deaths worldwide, and its mortality can be reduced significantly by performing early diagnosis and screening. Since the 1960s, driven by the pressing needs to accurately and effectively interpret the massive volume of chest images generated daily, computer-assisted diagnosis of pulmonary nodule has opened up new opportunities to relax the limitation from physicians' subjectivity, experiences and fatigue. And the fair access to the reliable and affordable computer-assisted diagnosis will fight the inequalities in incidence and mortality between populations. It has been witnessed that significant and remarkable advances have been achieved since the 1980s, and consistent endeavors have been exerted to deal with the grand challenges on how to accurately detect the pulmonary nodules with high sensitivity at low false-positives rate as well as on how to precisely differentiate between benign and malignant nodules. There is a lack of comprehensive examination of the techniques' development which is evolving the pulmonary nodules diagnosis from classical approaches to machine learning-assisted decision support. The main goal of this investigation is to provide a comprehensive state-of-the-art review of the computer-assisted nodules detection and benign-malignant classification techniques developed over 3 decades, which have evolved from the complicated ad hoc analysis pipeline of conventional approaches to the simplified seamlessly integrated deep learning techniques. This review also identifies challenges and highlights opportunities for future work in learning models, learning algorithms and enhancement schemes for bridging current state to future prospect and satisfying future demand.

preprint2020arXiv

Experimental parameters, combined dynamics, and nonlinearity of a Magnonic-Opto-Electronic Oscillator (MOEO)

We report the construction and characterization of a comprehensive magnonic-opto-electronic oscillator (MOEO) system based on 1550-nm photonics and yttirum iron garnet (YIG) magnonics. The system exhibits a rich and synergistic parameter space because of the ability to control individual photonic, electronic, and magnonic components. Taking advantage of the spin wave dispersion of YIG, the frequency self-generation as well as the related nonlinear processes become sensitive to the external magnetic field. Besides being known as a narrowband filter and a delay element, the YIG delayline possesses spin wave modes that can be controlled to mix with the optoelectronic modes to generate higher-order harmonic beating modes. With the high sensitivity and external tunability, the MOEO system may find usefulness in sensing applications in magnetism and spintronics beyond optoelectronics and photonics.

preprint2020arXiv

FashionBERT: Text and Image Matching with Adaptive Loss for Cross-modal Retrieval

In this paper, we address the text and image matching in cross-modal retrieval of the fashion industry. Different from the matching in the general domain, the fashion matching is required to pay much more attention to the fine-grained information in the fashion images and texts. Pioneer approaches detect the region of interests (i.e., RoIs) from images and use the RoI embeddings as image representations. In general, RoIs tend to represent the "object-level" information in the fashion images, while fashion texts are prone to describe more detailed information, e.g. styles, attributes. RoIs are thus not fine-grained enough for fashion text and image matching. To this end, we propose FashionBERT, which leverages patches as image features. With the pre-trained BERT model as the backbone network, FashionBERT learns high level representations of texts and images. Meanwhile, we propose an adaptive loss to trade off multitask learning in the FashionBERT modeling. Two tasks (i.e., text and image matching and cross-modal retrieval) are incorporated to evaluate FashionBERT. On the public dataset, experiments demonstrate FashionBERT achieves significant improvements in performances than the baseline and state-of-the-art approaches. In practice, FashionBERT is applied in a concrete cross-modal retrieval application. We provide the detailed matching performance and inference efficiency analysis.

preprint2020arXiv

HighwayGraph: Modelling Long-distance Node Relations for Improving General Graph Neural Network

Graph Neural Networks (GNNs) are efficient approaches to process graph-structured data. Modelling long-distance node relations is essential for GNN training and applications. However, conventional GNNs suffer from bad performance in modelling long-distance node relations due to limited-layer information propagation. Existing studies focus on building deep GNN architectures, which face the over-smoothing issue and cannot model node relations in particularly long distance. To address this issue, we propose to model long-distance node relations by simply relying on shallow GNN architectures with two solutions: (1) Implicitly modelling by learning to predict node pair relations (2) Explicitly modelling by adding edges between nodes that potentially have the same label. To combine our two solutions, we propose a model-agnostic training framework named HighwayGraph, which overcomes the challenge of insufficient labeled nodes by sampling node pairs from the training set and adopting the self-training method. Extensive experimental results show that our HighwayGraph achieves consistent and significant improvements over four representative GNNs on three benchmark datasets.

preprint2020arXiv

Hybrid vector beams with non-uniform orbital angular momentum density induced by designed azimuthal polarization gradient

Based on angular amplitude modulation of orthogonal base vectors in common-path interference method, we propose an interesting type of hybrid vector beams with unprecedented azimuthal polarization gradient and demonstrate in experiment. Distinct to previously reported types, the synthetic hybrid vector beams exhibit geometrically intriguing projection tracks of angular polarization state on Poincare sphere, more than just conventional circles. More noteworthily, the designed azimuthal polarization gradients are found to be able to induce azimuthally non-uniform orbital angular momentum density, while generally uniform for circle-track cases, immersing in homogenous intensity background whatever base states are. Moreover, via tailoring relevant parameters, more special polarization mapping tracks can be handily achieved. These peculiar features may open alternative routes for new optical effects and applications.

preprint2020arXiv

Implement Liquid Democracy on Ethereum: A Fast Algorithm for Realtime Self-tally Voting System

We study the liquid democracy problem, where each voter can either directly vote to a candidate or delegate his voting power to a proxy. We consider the implementation of liquid democracy on the blockchain through Ethereum smart contract and to be compatible with the realtime self-tallying property, where the contract itself can record ballots and update voting status upon receiving each voting massage. A challenge comes due to the gas fee limitation of Ethereum mainnet, that the number of instruction for processing a voting massage can not exceed a certain amount, which restrict the application scenario with respect to algorithms whose time complexity is linear to the number of voters. We propose a fast algorithm to overcome the challenge, such that i) shifts the on-chain initialization to off-chain and ii) the on-chain complexity for processing each voting massage is O(\log n), where n is the number of voters.

preprint2020arXiv

Learning to Recover from Multi-Modality Errors for Non-Autoregressive Neural Machine Translation

Non-autoregressive neural machine translation (NAT) predicts the entire target sequence simultaneously and significantly accelerates inference process. However, NAT discards the dependency information in a sentence, and thus inevitably suffers from the multi-modality problem: the target tokens may be provided by different possible translations, often causing token repetitions or missing. To alleviate this problem, we propose a novel semi-autoregressive model RecoverSAT in this work, which generates a translation as a sequence of segments. The segments are generated simultaneously while each segment is predicted token-by-token. By dynamically determining segment length and deleting repetitive segments, RecoverSAT is capable of recovering from repetitive and missing token errors. Experimental results on three widely-used benchmark datasets show that our proposed model achieves more than 4$\times$ speedup while maintaining comparable performance compared with the corresponding autoregressive model.

preprint2020arXiv

Making Robots Draw A Vivid Portrait In Two Minutes

Significant progress has been made with artistic robots. However, existing robots fail to produce high-quality portraits in a short time. In this work, we present a drawing robot, which can automatically transfer a facial picture to a vivid portrait, and then draw it on paper within two minutes averagely. At the heart of our system is a novel portrait synthesis algorithm based on deep learning. Innovatively, we employ a self-consistency loss, which makes the algorithm capable of generating continuous and smooth brush-strokes. Besides, we propose a componential sparsity constraint to reduce the number of brush-strokes over insignificant areas. We also implement a local sketch synthesis algorithm, and several pre- and post-processing techniques to deal with the background and details. The portrait produced by our algorithm successfully captures individual characteristics by using a sparse set of continuous brush-strokes. Finally, the portrait is converted to a sequence of trajectories and reproduced by a 3-degree-of-freedom robotic arm. The whole portrait drawing robotic system is named AiSketcher. Extensive experiments show that AiSketcher can produce considerably high-quality sketches for a wide range of pictures, including faces in-the-wild and universal images of arbitrary content. To our best knowledge, AiSketcher is the first portrait drawing robot that uses neural style transfer techniques. AiSketcher has attended a quite number of exhibitions and shown remarkable performance under diverse circumstances.

preprint2020arXiv

Nearest Neighbor Classifiers over Incomplete Information: From Certain Answers to Certain Predictions

Machine learning (ML) applications have been thriving recently, largely attributed to the increasing availability of data. However, inconsistency and incomplete information are ubiquitous in real-world datasets, and their impact on ML applications remains elusive. In this paper, we present a formal study of this impact by extending the notion of Certain Answers for Codd tables, which has been explored by the database research community for decades, into the field of machine learning. Specifically, we focus on classification problems and propose the notion of "Certain Predictions" (CP) -- a test data example can be certainly predicted (CP'ed) if all possible classifiers trained on top of all possible worlds induced by the incompleteness of data would yield the same prediction. We study two fundamental CP queries: (Q1) checking query that determines whether a data example can be CP'ed; and (Q2) counting query that computes the number of classifiers that support a particular prediction (i.e., label). Given that general solutions to CP queries are, not surprisingly, hard without assumption over the type of classifier, we further present a case study in the context of nearest neighbor (NN) classifiers, where efficient solutions to CP queries can be developed -- we show that it is possible to answer both queries in linear or polynomial time over exponentially many possible worlds. We demonstrate one example use case of CP in the important application of "data cleaning for machine learning (DC for ML)." We show that our proposed CPClean approach built based on CP can often significantly outperform existing techniques in terms of classification accuracy with mild manual cleaning effort.

preprint2020arXiv

Ring-Frustrated Non-Hermitian $XY$ Model

We study a non-Hermitian version of XY closed chain with odd number of lattice sites. We consider both anti-ferromagnetic coupling and also a symmetric non-collinear spin coupling. It is found that the energy spectrum is real in certain region of the parameter space. In contrast to previous non-Hermitian models, the ground state is a state with one mode occupied inside this real energy spectrum region, instead of the artificially identified vacuum state. At the same time, there appears a gapless excitation, which is made by kink like spin configurations. It is also found that this kink phase has non-trivial topological invariant.

preprint2020arXiv

SpecuSym: Speculative Symbolic Execution for Cache Timing Leak Detection

CPU cache is a limited but crucial storage component in modern processors, whereas the cache timing side-channel may inadvertently leak information through the physically measurable timing variance. Speculative execution, an essential processor optimization, and a source of such variances, can cause severe detriment on deliberate branch mispredictions. Despite static analysis could qualitatively verify the timing-leakage-free property under speculative execution, it is incapable of producing endorsements including inputs and speculated flows to diagnose leaks in depth. This work proposes a new symbolic execution based method, SpecuSym, for precisely detecting cache timing leaks introduced by speculative execution. Given a program (leakage-free in non-speculative execution), SpecuSymsystematically explores the program state space, models speculative behavior at conditional branches, and accumulates the cache side effects along with subsequent path explorations. During the dynamic execution, SpecuSymconstructs leak predicates for memory visits according to the specified cache model and conducts a constraint-solving based cache behavior analysis to inspect the new cache behaviors. We have implementedSpecuSymatop KLEE and evaluated it against 15 open-source benchmarks. Experimental results show thatSpecuSymsuccessfully detected from 2 to 61 leaks in 6 programs under 3 different cache settings and identified false positives in 2 programs reported by recent work.

preprint2019arXiv

Boosting Throughput and Efficiency of Hardware Spiking Neural Accelerators using Time Compression Supporting Multiple Spike Codes

Spiking neural networks (SNNs) are the third generation of neural networks and can explore both rate and temporal coding for energy-efficient event-driven computation. However, the decision accuracy of existing SNN designs is contingent upon processing a large number of spikes over a long period. Nevertheless, the switching power of SNN hardware accelerators is proportional to the number of spikes processed while the length of spike trains limits throughput and static power efficiency. This paper presents the first study on developing temporal compression to significantly boost throughput and reduce energy dissipation of digital hardware SNN accelerators while being applicable to multiple spike codes. The proposed compression architectures consist of low-cost input spike compression units, novel input-and-output-weighted spiking neurons, and reconfigurable time constant scaling to support large and flexible time compression ratios. Our compression architectures can be transparently applied to any given pre-designed SNNs employing either rate or temporal codes while incurring minimal modification of the neural models, learning algorithms, and hardware design. Using spiking speech and image recognition datasets, we demonstrate the feasibility of supporting large time compression ratios of up to 16x, delivering up to 15.93x, 13.88x, and 86.21x improvements in throughput, energy dissipation, the tradeoffs between hardware area, runtime, energy, and classification accuracy, respectively based on different spike codes on a Xilinx Zynq-7000 FPGA. These results are achieved while incurring little extra hardware overhead.

preprint2019arXiv

Spike-Train Level Backpropagation for Training Deep Recurrent Spiking Neural Networks

Spiking neural networks (SNNs) well support spatiotemporal learning and energy-efficient event-driven hardware neuromorphic processors. As an important class of SNNs, recurrent spiking neural networks (RSNNs) possess great computational power. However, the practical application of RSNNs is severely limited by challenges in training. Biologically-inspired unsupervised learning has limited capability in boosting the performance of RSNNs. On the other hand, existing backpropagation (BP) methods suffer from high complexity of unrolling in time, vanishing and exploding gradients, and approximate differentiation of discontinuous spiking activities when applied to RSNNs. To enable supervised training of RSNNs under a well-defined loss function, we present a novel Spike-Train level RSNNs Backpropagation (ST-RSBP) algorithm for training deep RSNNs. The proposed ST-RSBP directly computes the gradient of a rated-coded loss function defined at the output layer of the network w.r.t tunable parameters. The scalability of ST-RSBP is achieved by the proposed spike-train level computation during which temporal effects of the SNN is captured in both the forward and backward pass of BP. Our ST-RSBP algorithm can be broadly applied to RSNNs with a single recurrent layer or deep RSNNs with multiple feed-forward and recurrent layers. Based upon challenging speech and image datasets including TI46, N-TIDIGITS, Fashion-MNIST and MNIST, ST-RSBP is able to train RSNNs with an accuracy surpassing that of the current state-of-art SNN BP algorithms and conventional non-spiking deep learning models.

preprint2016arXiv

An accelerator scenario for hard X-ray free electron laser joint with high energy electron radiography

In order to study the dynamic response of the material and the physical mechanism of the fluid dynamics, an accelerator scenario which can be applied to hard X-ray free electron laser and high energy electron radiography was proposed. This accelerator is mainly composed of a 12GeV linac, an undulator branch and an eRad beamline. In order to characterize sample's dynamic behavior in situ and real-time with XFEL and eRad simultaneously, the linac should be capable of accelerating the two kinds of beam within the same operation mode. Combining with in-vacuum and tapering techniques, the undulator branch can produce more than 1E11 photons per pulse in 0.1 precent bandwidth at 42keV. Finally, the eRad amplifying beamline with 1:10 ratio was proposed as an important complementary tool for the wider view field and density identification ability.

preprint2016arXiv

Clinical Information Extraction via Convolutional Neural Network

We report an implementation of a clinical information extraction tool that leverages deep neural network to annotate event spans and their attributes from raw clinical notes and pathology reports. Our approach uses context words and their part-of-speech tags and shape information as features. Then we hire temporal (1D) convolutional neural network to learn hidden feature representations. Finally, we use Multilayer Perceptron (MLP) to predict event spans. The empirical evaluation demonstrates that our approach significantly outperforms baselines.

preprint2016arXiv

Dataset and Neural Recurrent Sequence Labeling Model for Open-Domain Factoid Question Answering

While question answering (QA) with neural network, i.e. neural QA, has achieved promising results in recent years, lacking of large scale real-word QA dataset is still a challenge for developing and evaluating neural QA system. To alleviate this problem, we propose a large scale human annotated real-world QA dataset WebQA with more than 42k questions and 556k evidences. As existing neural QA methods resolve QA either as sequence generation or classification/ranking problem, they face challenges of expensive softmax computation, unseen answers handling or separate candidate answer generation component. In this work, we cast neural QA as a sequence labeling problem and propose an end-to-end sequence labeling model, which overcomes all the above challenges. Experimental results on WebQA show that our model outperforms the baselines significantly with an F1 score of 74.69% with word-based input, and the performance drops only 3.72 F1 points with more challenging character-based input.

preprint2016arXiv

Deep Recurrent Models with Fast-Forward Connections for Neural Machine Translation

Neural machine translation (NMT) aims at solving machine translation (MT) problems using neural networks and has exhibited promising results in recent years. However, most of the existing NMT models are shallow and there is still a performance gap between a single NMT model and the best conventional MT system. In this work, we introduce a new type of linear connections, named fast-forward connections, based on deep Long Short-Term Memory (LSTM) networks, and an interleaved bi-directional architecture for stacking the LSTM layers. Fast-forward connections play an essential role in propagating the gradients and building a deep topology of depth 16. On the WMT'14 English-to-French task, we achieve BLEU=37.7 with a single attention model, which outperforms the corresponding single shallow model by 6.2 BLEU points. This is the first time that a single NMT model achieves state-of-the-art performance and outperforms the best conventional model by 0.7 BLEU points. We can still achieve BLEU=36.3 even without using an attention mechanism. After special handling of unknown words and model ensembling, we obtain the best score reported to date on this task with BLEU=40.4. Our models are also validated on the more difficult WMT'14 English-to-German task.

preprint2016arXiv

Enhancing Sentence Relation Modeling with Auxiliary Character-level Embedding

Neural network based approaches for sentence relation modeling automatically generate hidden matching features from raw sentence pairs. However, the quality of matching feature representation may not be satisfied due to complex semantic relations such as entailment or contradiction. To address this challenge, we propose a new deep neural network architecture that jointly leverage pre-trained word embedding and auxiliary character embedding to learn sentence meanings. The two kinds of word sequence representations as inputs into multi-layer bidirectional LSTM to learn enhanced sentence representation. After that, we construct matching features followed by another temporal CNN to learn high-level hidden matching feature representations. Experimental results demonstrate that our approach consistently outperforms the existing methods on standard evaluation datasets.

preprint2016arXiv

Frustration-Induced Gaplessness in the Frustrated Transverse Ising Ring

New effects in the frustrated transverse Ising ring are predicted. The system is solved based on a mapping of Pauli spin operators to the Jordan-Wigner fermions. We group the low-lying energy levels into bands after imposing appropriate parity constraint, which projects out the redundant degrees of freedom brought about by the Jordan-Wigner fermions. In the region of strong antiferromagnetic coupling, we uncover an unusual gapless phase induced by the ring frustration. We demonstrate that its ground state exhibits a strong longitudinal spin-spin correlation and possesses a considerably large entropy of entanglement. The low-lying energy levels evolve adiabatically in the gapless phase, which facilitates us to work out new behaviors of density of states, low-temperature correlation functions and specific heat. We also propose an experimental protocol for observing this peculiar gapless phase.

preprint2016arXiv

Longitudinal RF capture simulation and BPM signal estimation

In this paper, the theoretical aspects behind longitudinal RF capture are reviewed and the capture process is simulated via a program based on this theory. Four kinds of cases with different initial distribution and capture curve are considered, i.e. uniform distribution with adiabatic capture, uniform distribution with non-adiabatic capture, Gaussian distribution with adiabatic capture and Gaussian distribution with non-adiabatic capture. The simulation results are compared each other and discussed, and Gaussian distribution with adiabatic capture is demonstrated having a higher capture efficiency and leading to a shorter bunch length. In addition, the BPM induced signal is simulated with high input impendence, i.e. $1MΩ$, and low input impendence, i.e. $50Ω$, respectively. Finally, the BPM signal of Heavy Ion Medical Machine (HIMM) is estimated and compared with measured one, and a good agreement is achieved.

preprint2016arXiv

Providing Probabilistic Robustness Guarantee for Crowdsensing

Due to its flexible and pervasive sensing ability, crowdsensing has been extensively studied recently in research communities. However, the fundamental issue of how to meet the requirement of sensing robustness in crowdsensing remains largely unsolved. Specifically, from the task owner's perspective, how to minimize the total payment in crowdsensing while guaranteeing the sensing data quality is a critical issue to be resolved. We elegantly model the robustness requirement over sensing data quality as chance constraints, and investigate both hard and soft chance constraints for different crowdsensing applications. For the former, we reformulate the problem through Boole's Inequality, and explore the optimal value gap between the original problem and the reformulated problem. For the latter, we study a serial of a general payment minimization problem, and propose a binary search algorithm that achieves both feasibility and low payment. The performance gap between our solution and the optimal solution is also theoretically analyzed. Extensive simulations validate our theoretical analysis.

preprint2015arXiv

Concept for a Future Super Proton-Proton Collider

Following the discovery of the Higgs boson at LHC, new large colliders are being studied by the international high-energy community to explore Higgs physics in detail and new physics beyond the Standard Model. In China, a two-stage circular collider project CEPC-SPPC is proposed, with the first stage CEPC (Circular Electron Positron Collier, a so-called Higgs factory) focused on Higgs physics, and the second stage SPPC (Super Proton-Proton Collider) focused on new physics beyond the Standard Model. This paper discusses this second stage.

preprint2015arXiv

Determining phase-space properties of the IHEP RFQ output beam using the RMS beam widths from wire-scanners

A beam line is built after the IHEP RFQ for halo study. To determine transverse emittance and ellipse parameters of the RFQ output beam, beam size data obtained from the first two of 14 wire scanners are employed. By using the transfer matrix method and the least square method, a set of linear equations were set up and solved. The solutions were then applied as initial beam parameters in multi-particle simulations to check the method of calculation. It is shown that difference between the simulated RMS beam size and the measured one at the measurement location is less than 7%, which is acceptable in our experiments.

preprint2015arXiv

Efficient Approximation Algorithms for Computing \emph{k} Disjoint Restricted Shortest Paths

Network applications, such as multimedia streaming and video conferencing, impose growing requirements over Quality of Service (QoS), including bandwidth, delay, jitter, etc. Meanwhile, networks are expected to be load-balanced, energy-efficient, and resilient to some degree of failures. It is observed that the above requirements could be better met with multiple disjoint QoS paths than a single one. Let $G=(V,\, E)$ be a digraph with nonnegative integral cost and delay on every edge, $s,\, t\in V$ be two specified vertices, and $D\in\mathbb{Z}_{0}^{+}$ be a delay bound (or some other constraint), the \emph{$k$ Disjoint Restricted Shortest Path} ($k$\emph{RSP})\emph{ Problem} is computing $k$ disjoint paths between $s$ and $t$ with total cost minimized and total delay bounded by $D$. Few efficient algorithms have been developed because of the hardness of the problem. In this paper, we propose efficient algorithms with provable performance guarantees for the $k$RSP problem. We first present a pseudo-polynomial-time approximation algorithm with a bifactor approximation ratio of $(1,\,2)$, then improve the algorithm to polynomial time with a bifactor ratio of $(1+ε,\,2+ε)$ for any fixed $ε>0$, which is better than the current best approximation ratio $(O(1+γ),\, O(1+\frac{1}γ)\})$ for any fixed $γ>0$ \cite{orda2004efficient}. To the best of our knowledge, this is the first constant-factor algorithm that almost strictly obeys the constraint for the $k$RSP problem.

preprint2015arXiv

Low Noise, tunable Ho:fiber Soliton Oscillator for Ho:YLF Amplifier Seeding

We present a passively mode-locked, tunable soliton Ho:fiber ring oscillator, optimized for seeding of Ho:YLF amplifiers. The oscillator is independently tunable in central wavelength and spectral width from 2040 nm to 2070 nm and from 5 nm to 10 nm, respectively. At all settings the pulse energy within the soliton is around 800 pJ. The soliton oscillator was optimized to fully meets the spectral requirements for seeding Ho:YLF amplifiers. Its Kelly sidebands are located outside the amplifier gain spectrum, resulting in a train of about 1 ps long pedestal-free pulses with relative intensity noise (RIN) of only 0.13 % RMS when integrated from 1 Hz to Nyquist frequency.

preprint2015arXiv

Statistical analysis of motion contrast in optical coherence tomography angiography

Optical coherence tomography angiography (Angio-OCT), mainly based on the temporal dynamics of OCT scattering signals, has found a range of potential applications in clinical and scientific research. Based on the model of random phasor sums, temporal statistics of the complex-valued OCT signals are mathematically described. Statistical distributions of the amplitude differential and complex differential Angio-OCT signals are derived. The theories are validated through the flow phantom and live animal experiments. Using the model developed, the origin of the motion contrast in Angio-OCT is mathematically explained, and the implications in the improvement of motion contrast are further discussed, including threshold determination and its residual classification error, averaging method, and scanning protocol. The proposed mathematical model of Angio-OCT signals can aid in the optimal design of the system and associated algorithms.

preprint2014arXiv

Distributed Iterative Detection Based on Reduced Message Passing for Networked MIMO Cellular Systems

This paper considers base station cooperation (BSC) strategies for the uplink of a multi-user multi-cell high frequency reuse scenario where distributed iterative detection (DID) schemes with soft/hard interference cancellation algorithms are studied. The conventional distributed detection scheme exchanges {soft symbol estimates} with all cooperating BSs. Since a large amount of information needs to be shared via the backhaul, the exchange of hard bit information is preferred, however a performance degradation is experienced. In this paper, we consider a reduced message passing (RMP) technique in which each BS generates a detection list with the probabilities for the desired symbol that are sorted according to the calculated probability. The network then selects the best {detection candidates} from the lists and conveys the index of the constellation symbols (instead of double-precision values) among the cooperating cells. The proposed DID-RMP achieves an inter-cell-interference (ICI) suppression with low backhaul traffic overhead compared with {the conventional soft bit exchange} and outperforms the previously reported hard/soft information exchange algorithms.

preprint2014arXiv

Leveraging local identity-by-descent increases the power of case/control GWAS with related individuals

Large case/control Genome-Wide Association Studies (GWAS) often include groups of related individuals with known relationships. When testing for associations at a given locus, current methods incorporate only the familial relationships between individuals. Here, we introduce the chromosome-based Quasi Likelihood Score (cQLS) statistic that incorporates local Identity-By-Descent (IBD) to increase the power to detect associations. In studies robust to population stratification, such as those with case/control sibling pairs, simulations show that the study power can be increased by over 50%. In our example, a GWAS examining late-onset Alzheimer's disease, the $p$-values among the most strongly associated SNPs in the APOE gene tend to decrease, with the smallest $p$-value decreasing from $1.23\times10^{-8}$ to $7.70\times 10^{-9}$. Furthermore, as a part of our simulations, we reevaluate our expectations about the use of families in GWAS. We show that, although adding only half as many unique chromosomes, genotyping affected siblings is more efficient than genotyping randomly ascertained cases. We also show that genotyping cases with a family history of disease will be less beneficial when searching for SNPs with smaller effect sizes.

preprint2014arXiv

The simulation of loss of U ions due to charge changing processes in the CSRm ring

Significant beam loss caused by the charge exchange processes and ions impact induced outgassing play a crucial role in the limitation of the maximum number of accumulated heavy ions during the high intensity operation in the accelerators. With the aim to control beam loss due to charge exchange processes and to confine the generated desorption gas, the tracking of the loss positions and installing the absorber blocks with low-desorption rate material at appropriate locations in the CSRm ring will be taken. The loss simulation of U ions having lost an electron will be presented in this report and the calculation of the collimation efficiency of the CSRm ring will be continued in the future.

preprint2013arXiv

Adaptive Decision Feedback Detection with Parallel Interference Cancellation and Constellation Constraints for Multi-Antenna Systems

In this paper, a novel low-complexity adaptive decision feedback detection with parallel decision feedback and constellation constraints (P-DFCC) is proposed for multiuser MIMO systems. We propose a constrained constellation map which introduces a number of selected points served as the feedback candidates for interference cancellation. By introducing a reliability checking, a higher degree of freedom is introduced to refine the unreliable estimates. The P-DFCC is followed by an adaptive receive filter to estimate the transmitted symbol. In order to reduce the complexity of computing the filters with time-varying MIMO channels, an adaptive recursive least squares (RLS) algorithm is employed in the proposed P-DFCC scheme. An iterative detection and decoding (Turbo) scheme is considered with the proposed P-DFCC algorithm. Simulations show that the proposed technique has a complexity comparable to the conventional parallel decision feedback detector while it obtains a performance close to the maximum likelihood detector at a low to medium SNR range.

preprint2013arXiv

Adaptive Iterative Decision Feedback Detection Algorithms for Multi-User MIMO Systems

An adaptive iterative decision multi-feedback detection algorithm with constellation constraints is proposed for multiuser multi-antenna systems. An enhanced detection and interference cancellation is performed by introducing multiple constellation points as decision candidates. A complexity reduction strategy is developed to avoid redundant processing with reliable decisions along with an adaptive recursive least squares algorithm for time-varying channels. An iterative detection and decoding scheme is also considered with the proposed detection algorithm. Simulations show that the proposed technique has a complexity as low as the conventional decision feedback detector while it obtains a performance close to the maximum likelihood detector.

preprint2013arXiv

Dimer XXZ Spin Ladders: Phase diagram and a Non-Trivial Antiferromagnetic Phase

We study the dimer $XXZ$ spin model on two-leg ladders with isotropic Heisenberg interactions on the rung and anisotropic $XXZ$ interactions along the rail in an external field. Combining both analytical and numerical methods, we set up the ground state phase diagram and investigate the quantum phase transitions and the properties of rich phases, including the full polarized, singlet dimer, Luttinger liquid, triplon solid, and a non-trivial antiferromagnetic phases with gap. The analytical analyses based on solvable effective Hamiltonians are presented for clear view of the phases and transitions. Quantum Monte Carlo and exact diagonalization methods are employed on finite system to verify the exact nature of the phases and transitions. Of all the phases, we pay a special attention to the gapped antiferromagnetic phase, which is disclosed to be a non-trivial one that exhibits the time-reversal symmetry. We also discuss how our findings could be detected in experiment in the light of ultracold atoms technology advances.

preprint2013arXiv

Ground-state and finite-temperature properties of spin liquid phase in the J1-J2 honeycomb model

In this paper we analyze the groundstate and finite-temperature properties of a frustrated Heisenberg $J_1-J_2$ model on a honeycomb lattice by employing the Schwinger boson technique. The phase diagram and spin gap as functions of ${J}_{2}/{J}_{1}$ are presented, showing that the exotic spin liquid phase lies in $0.21<{J}_{2}/{J}_{1} <0.43$. The temperature and magnetic-field dependences of specific heat, magnetic susceptibility and Knight shift are also presented. We find the spin liquid state is robust with respect to external magnetic field. These results provide clear information characterizing unusual properties of the exotic spin liquid phase for further experiments.

preprint2013arXiv

Iterative Detection and Decoding for MIMO Systems with Knowledge-Aided Message Passing Algorithms

In this paper, we consider the problem of iterative detection and decoding (IDD) for multi-antenna systems using low-density parity-check (LDPC) codes. The proposed IDD system consists of a soft-input soft-output parallel interference (PIC) cancellation scheme with linear minimum mean-square error (MMSE) receive filters and two novel belief propagation (BP) decoding algorithms. The proposed BP algorithms exploit the knowledge of short cycles in the graph structure and the reweighting factors derived from the hypergraph's expansion. Simulation results show that when used to perform IDD for multi-antenna systems both proposed BP decoding algorithms can consistently outperform existing BP techniques with a small number of decoding iterations.

preprint2013arXiv

Multi-Feedback Successive Interference Cancellation for Multiuser MIMO Systems

In this paper, a low-complexity multiple feedback successive interference cancellation (MF-SIC) strategy is proposed for the uplink of multiuser multiple-input multiple-output (MU-MIMO) systems. In the proposed MF-SIC {algorithm with shadow area constraints (SAC)}, an enhanced interference cancellation is achieved by introducing {constellation points as the candidates} to combat the error propagation in decision feedback loops. We also combine the MF-SIC with multi-branch (MB) processing, which achieves a higher detection diversity order. For coded systems, a low-complexity soft-input soft-output (SISO) iterative (turbo) detector is proposed based on the MF and the MB-MF interference suppression techniques. The computational complexity of the MF-SIC is {comparable to} the conventional SIC algorithm {since very little additional complexity is required}. {Simulation} results show that the algorithms significantly outperform the conventional SIC scheme and {approach} the optimal detector.

preprint2012arXiv

Analytical Solution of Cross Polarization Dynamics

Cross polarization (CP) dynamics, which was remained unknown for five decades, has been derived analytically in the zero- and double-quantum spaces. The initial polarization in the double-quantum space is a constant of motion under strong pulse condition ($|ω_{1I}+ω_{1S}|\gg |d(t)|$), while the Hamiltonian in the zero-quantum space reduces to $d(t)σ_{z}^Δ$ under the Hartmann-Hahn match condition ($ω_{1I}=ω_{1S}$). The time dependent Hamilontian ($d(t)σ_{z}^Δ$) in the zero-quantum space can be expressed by average Hamiltonians. Since$[d(t')σ_{z}^Δ, d(t")σ_{z}^Δ]=0$, only zero order average Hamiltonian needs to be calculated, leading to an analytical solution of CP dynamics.

preprint2012arXiv

Topological edge states in spin 1 bilinear-biquadratic model

The spin 1 bilinear-biquadratic model on square lattice in the region $0<ϕ<π/4$ is studied in a fermion representation with a p-wave pairing BCS type mean-field theory. Our results show there may exist a non-trivial gapped spin liquid with time-reversal symmetry spontaneously breaking. This exotic state manifests its topological nature by forming chiral states at the edges. To show it more clear, we set up and solved a ribbon system. We got a gapless dispersion representing the edge modes beneath the bulk modes. The edge modes with nonzero longitudinal momentum ($k_{x}\neq0$) convect in opposite directions at the two edges, which leads to a two-fold degeneracy. While the zero longitudinal momentum ($k_{x}=0$) modes turn out to be Majorana fermion states. The edge spin correlation functions are found to decay in a power law with the distance increasing. We also calculated the contribution of the edge modes to the specific heat and obtained a linear law at low temperatures.

preprint2011arXiv

Retrapping Current, Self-Heating, and Hysteretic Current-Voltage Curves in Ultra-Narrow Superconducting Aluminum Nanowires

Hysteretic I-V (current-voltage) is studied in narrow Al nanowires. The nanowires have a cross section as small as 50 nm^2. We focus on the retapping current in a down-sweep of the current, at which a nanowire re-enters the superconducting state from a normal state. The retrapping current is found to be significantly smaller than the switching current at which the nanowire switches into the normal state from a superconducting state during a current up-sweep. For wires of different lengths, we analyze the heat removal due to various processes, including electronic and phonon processes. For a short wires 1.5 um in length, electronic thermal conduction is effective; for longer wires 10um in length, phonon conduction becomes important. We demonstrate that the measured retrapping current as a function of temperature can be quantitatively accounted for by the selfheating occurring in the normal portions of the nanowires to better than 20 % accuracy. For the phonon processes, the extracted thermal conduction parameters support the notion of a reduced phase-space below 3-dimensions, consistent with the phonon thermal wavelength having exceeded the lateral dimensions at temperatures below ~ 1.3K. Nevertheless, surprisingly the best fit was achieved with a functional form corresponding to 3-dimensional phonons, albeit requiring parameters far exceeding known values in the literature.

preprint2011arXiv

Single Phase Slip Limited Switching Current in 1-Dimensional Superconducting Al Nanowires

An Aluminum nanowire switches from superconducting to normal as the current is increased in an upsweep. The switching current (I_s) averaged over upsweeps approximately follows the depairing critical current (I_c) but falls below it. Fluctuations in I_s exhibit three distinct regions of behaviors and are non-monotonic in temperature: saturation well below the critical temperature T_c, an increase as T^{2/3} at intermediate temperatures, and a rapid decrease close to T_c. Heat dissipation analysis indicates that a single phase slip is able to trigger switching at low and intermediate temperatures, whereby the T^{2/3} dependence arises from the thermal activation of a phase slip, while saturation at low temperatures provides striking evidence that the phase slips by macroscopic quantum tunneling.

preprint2011arXiv

Thermodynamics of Third Order Lovelock-Born-Infeld Black Holes

We here explore black holes in the third order Lovelock gravity coupling with nonlinear Born-Infeld electromagnetic field. Considering special second and third order coefficients ($\hatα_2^2=3\hatα_3=α^2$), we analyze the thermodynamics of third order Lovelock-Born-Infeld black holes and, in 7-dimensional AdS space-time, discuss the stability of black holes in different event horizon structures. We find that the cosmological constant $Λ$ plays an important role in the distribution of stable regions of black holes.

preprint2010arXiv

Bivariate Quasi-Tower Sets and Their Associated Lagrange Interpolation Bases

As we all known, there is still a long way for us to solve arbitrary multivariate Lagrange interpolation in theory. Nevertheless, it is well accepted that theories about Lagrange interpolation on special point sets should cast important lights on the general solution. In this paper, we propose a new type of bivariate point sets, quasi-tower sets, whose geometry is more natural than some known point sets such as cartesian sets and tower sets. For bivariate Lagrange interpolation on quasi-tower sets, we construct the associated degree reducing interpolation monomial and Newton bases w.r.t. common monomial orderings theoretically. Moreover, by inputting these bases into Buchberger-Möller algorithm, we obtain the reduced Gröbner bases for vanishing ideals of quasi-tower sets much more efficiently than before.

preprint2010arXiv

Slowly rotating charged black holes in anti-de Sitter third order Lovelock gravity

In this paper, we study slowly rotating black hole solutions in Lovelock gravity (n=3). These exact slowly rotating black hole solutions are obtained in uncharged and charged cases, respectively. Up to the linear order of the rotating parameter a, the mass, Hawking temperature and entropy of the uncharged black holes get no corrections from rotation. In charged case, we compute magnetic dipole moment and gyromagnetic ratio of the black holes. It is shown that the gyromagnetic ratio keeps invariant after introducing the Gauss-Bonnet and third order Lovelock interactions.

preprint2010arXiv

Thermodynamics of Gauss-Bonnet-Born-Infeld black holes in AdS space

We construct solutions of a model which includes the Gauss-Bonnet and Born-Infeld terms for various horizon topologies $(k=0, \pm 1)$, and then the mass, temperature, entropy and heat capacity of black holes are computed. For the sake of simplicity, we perform the stability analysis of five dimensional topological black holes in AdS space.

preprint2007arXiv

Modeling Interconnect Variability Using Efficient Parametric Model Order Reduction

Assessing IC manufacturing process fluctuations and their impacts on IC interconnect performance has become unavoidable for modern DSM designs. However, the construction of parametric interconnect models is often hampered by the rapid increase in computational cost and model complexity. In this paper we present an efficient yet accurate parametric model order reduction algorithm for addressing the variability of IC interconnect performance. The efficiency of the approach lies in a novel combination of low-rank matrix approximation and multi-parameter moment matching. The complexity of the proposed parametric model order reduction is as low as that of a standard Krylov subspace method when applied to a nominal system. Under the projection-based framework, our algorithm also preserves the passivity of the resulting parametric models.

preprint2007arXiv

Specification Test Compaction for Analog Circuits and MEMS

Testing a non-digital integrated system against all of its specifications can be quite expensive due to the elaborate test application and measurement setup required. We propose to eliminate redundant tests by employing e-SVM based statistical learning. Application of the proposed methodology to an operational amplifier and a MEMS accelerometer reveal that redundant tests can be statistically identified from a complete set of specification-based tests with negligible error. Specifically, after eliminating five of eleven specification-based tests for an operational amplifier, the defect escape and yield loss is small at 0.6% and 0.9%, respectively. For the accelerometer, defect escape of 0.2% and yield loss of 0.1% occurs when the hot and colt tests are eliminated. For the accelerometer, this level of Compaction would reduce test cost by more than half.

preprint2005arXiv

Contractor renormalization group theory of the SU($N$) chains and ladders

Contractor renormalization group (CORE) method is applied to the SU($N$) chain and ladders in this paper. In our designed schemes, we show that these two classes of systems can return to their original form of Hamiltonian after CORE transformation. Successive iteration of the transformation leads to a fixed point so that the ground state energy and the energy gap to the ground state can be deduced. The result of SU($N$) chain is compared with the one by Bethe ansatz method. The transformation on spin-1/2 ladders gives a finite gap in the excited energy spectra to the ground state in an intuitive way. The application to SU(3) ladders is also discussed.

Peng Li

What is connected

Connect this record

See the researcher in context

Building this map preview

76 published item(s)

GIFT: Guided Fine-Tuning and Transfer for Enhancing Instruction-Tuned Language Models

A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and Beyond

Pulse shape discrimination based on the Tempotron: a powerful classifier on GPU

Random-coupled Neural Network

Set Prediction Guided by Semantic Concepts for Diverse Video Captioning

A Computational Framework of Cortical Microcircuits Approximates Sign-concordant Random Backpropagation

A Roadmap for Big Model

A Simple but Effective Pluggable Entity Lookup Table for Pre-trained Language Models

Anisotropic Dzyaloshinskii-Moriya interaction and topological magnetism in two-dimensional magnets protected by P4-m2 crystal symmetry

Binary Neural Networks as a general-propose compute paradigm for on-device computer vision

CF-YOLO: Cross Fusion YOLO for Object Detection in Adverse Weather with a High-quality Real Snow Dataset

Deterministic relation between optical polarization and lattice symmetry revealed in ion-doped single microcrystals

ELLE: Efficient Lifelong Pre-training for Emerging Data

Evidence of Magnon-Mediated Orbital Magnetism in a Quasi-2D Topological Magnon Insulator

Fully Hyperbolic Neural Networks

GoG: Relation-aware Graph-over-Graph Network for Visual Dialog

Growth, Electronic Structure and Superconductivity of Ultrathin Epitaxial CoSi2 Films

Knowledge Inheritance for Pre-trained Language Models

LF-VIO: A Visual-Inertial-Odometry Framework for Large Field-of-View Cameras with Negative Plane

Manual-Guided Dialogue for Flexible Conversational Agents

MoEfication: Transformer Feed-forward Layers are Mixtures of Experts

OPAL: Occlusion Pattern Aware Loss for Unsupervised Light Field Disparity Estimation

Packed Levitated Marker for Entity and Relation Extraction

Quantum Interference Transport in two-dimensional Semi-Dirac Semimetals

Rethinking the Promotion Brought by Contrastive Learning to Semi-Supervised Node Classification

Tunable magnetically induced transparency spectra in magnon-magnon coupled Y3Fe5O12/permalloy bilayers

Anisotropic Magnon Spin Transport in Ultra-thin Spinel Ferrite Thin Films -- Evidence for Anisotropy in Exchange Stiffness

Auto-FuzzyJoin: Auto-Program Fuzzy Similarity Joins Without Labeled Examples

Giant Topological Hall Effect in van der Waals Heterostructures of CrTe2/Bi2Te3

Comprehensive SNN Compression Using ADMM Optimization and Activity Regularization

Evolving the pulmonary nodules diagnosis from classical approaches to deep learning aided decision support: three decades development course and future prospect

Experimental parameters, combined dynamics, and nonlinearity of a Magnonic-Opto-Electronic Oscillator (MOEO)

FashionBERT: Text and Image Matching with Adaptive Loss for Cross-modal Retrieval

HighwayGraph: Modelling Long-distance Node Relations for Improving General Graph Neural Network

Hybrid vector beams with non-uniform orbital angular momentum density induced by designed azimuthal polarization gradient

Implement Liquid Democracy on Ethereum: A Fast Algorithm for Realtime Self-tally Voting System

Learning to Recover from Multi-Modality Errors for Non-Autoregressive Neural Machine Translation

Making Robots Draw A Vivid Portrait In Two Minutes

Nearest Neighbor Classifiers over Incomplete Information: From Certain Answers to Certain Predictions

Ring-Frustrated Non-Hermitian $XY$ Model

SpecuSym: Speculative Symbolic Execution for Cache Timing Leak Detection

Boosting Throughput and Efficiency of Hardware Spiking Neural Accelerators using Time Compression Supporting Multiple Spike Codes

Spike-Train Level Backpropagation for Training Deep Recurrent Spiking Neural Networks

An accelerator scenario for hard X-ray free electron laser joint with high energy electron radiography

Clinical Information Extraction via Convolutional Neural Network

Dataset and Neural Recurrent Sequence Labeling Model for Open-Domain Factoid Question Answering

Deep Recurrent Models with Fast-Forward Connections for Neural Machine Translation

Enhancing Sentence Relation Modeling with Auxiliary Character-level Embedding

Frustration-Induced Gaplessness in the Frustrated Transverse Ising Ring

Longitudinal RF capture simulation and BPM signal estimation

Providing Probabilistic Robustness Guarantee for Crowdsensing

Concept for a Future Super Proton-Proton Collider

Determining phase-space properties of the IHEP RFQ output beam using the RMS beam widths from wire-scanners

Efficient Approximation Algorithms for Computing \emph{k} Disjoint Restricted Shortest Paths

Low Noise, tunable Ho:fiber Soliton Oscillator for Ho:YLF Amplifier Seeding

Statistical analysis of motion contrast in optical coherence tomography angiography

Distributed Iterative Detection Based on Reduced Message Passing for Networked MIMO Cellular Systems

Leveraging local identity-by-descent increases the power of case/control GWAS with related individuals

The simulation of loss of U ions due to charge changing processes in the CSRm ring

Adaptive Decision Feedback Detection with Parallel Interference Cancellation and Constellation Constraints for Multi-Antenna Systems

Adaptive Iterative Decision Feedback Detection Algorithms for Multi-User MIMO Systems

Dimer XXZ Spin Ladders: Phase diagram and a Non-Trivial Antiferromagnetic Phase

Ground-state and finite-temperature properties of spin liquid phase in the J1-J2 honeycomb model

Iterative Detection and Decoding for MIMO Systems with Knowledge-Aided Message Passing Algorithms

Multi-Feedback Successive Interference Cancellation for Multiuser MIMO Systems

Analytical Solution of Cross Polarization Dynamics

Topological edge states in spin 1 bilinear-biquadratic model

Retrapping Current, Self-Heating, and Hysteretic Current-Voltage Curves in Ultra-Narrow Superconducting Aluminum Nanowires

Single Phase Slip Limited Switching Current in 1-Dimensional Superconducting Al Nanowires

Thermodynamics of Third Order Lovelock-Born-Infeld Black Holes

Bivariate Quasi-Tower Sets and Their Associated Lagrange Interpolation Bases

Slowly rotating charged black holes in anti-de Sitter third order Lovelock gravity

Thermodynamics of Gauss-Bonnet-Born-Infeld black holes in AdS space

Modeling Interconnect Variability Using Efficient Parametric Model Order Reduction