Source author record

Qian Li

Qian Li appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Catalog footprint

What is connected

65works

36topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

Delineating Knowledge Boundaries for Honest Large Vision-Language Models

Large Vision-Language Models (VLMs) have achieved remarkable multimodal performance yet remain prone to factual hallucinations, particularly in long-tail or specialized domains. Moreover, current models exhibit a weak capacity to refuse queries that exceed their parametric knowledge. In this paper, we propose a systematic framework to enhance the refusal capability of VLMs when facing such unknown questions. We first curate a model-specific "Visual-Idk" (Visual-I don't know) dataset, leveraging multi-sample consistency probing to distinguish between known and unknown facts. We then align the model using supervised fine-tuning followed by preference-aware optimization (e.g., DPO, ORPO) to effectively delineate its knowledge boundaries. Results on the Visual-Idk dataset show our method improves the Truthful Rate from 57.9\% to 67.3\%. Additionally, internal probing also demonstrates that the model genuinely recognizes its boundaries instead of just memorizing refusal patterns. Our framework further generalizes to out-of-distribution medical and perceptual domains, providing a robust path toward more trustworthy and prudent visual assistants.

preprint2024arXiv

Text-Video Retrieval via Variational Multi-Modal Hypergraph Networks

Text-video retrieval is a challenging task that aims to identify relevant videos given textual queries. Compared to conventional textual retrieval, the main obstacle for text-video retrieval is the semantic gap between the textual nature of queries and the visual richness of video content. Previous works primarily focus on aligning the query and the video by finely aggregating word-frame matching signals. Inspired by the human cognitive process of modularly judging the relevance between text and video, the judgment needs high-order matching signal due to the consecutive and complex nature of video contents. In this paper, we propose chunk-level text-video matching, where the query chunks are extracted to describe a specific retrieval unit, and the video chunks are segmented into distinct clips from videos. We formulate the chunk-level matching as n-ary correlations modeling between words of the query and frames of the video and introduce a multi-modal hypergraph for n-ary correlation modeling. By representing textual units and video frames as nodes and using hyperedges to depict their relationships, a multi-modal hypergraph is constructed. In this way, the query and the video can be aligned in a high-order semantic space. In addition, to enhance the model's generalization ability, the extracted features are fed into a variational inference component for computation, obtaining the variational representation under the Gaussian distribution. The incorporation of hypergraphs and variational inference allows our model to capture complex, n-ary interactions among textual and visual contents. Experimental results demonstrate that our proposed method achieves state-of-the-art performance on the text-video retrieval task.

preprint2023arXiv

Multi-spatial Multi-temporal Air Quality Forecasting with Integrated Monitoring and Reanalysis Data

Accurate air quality forecasting is crucial for public health, environmental monitoring and protection, and urban planning. However, existing methods fail to effectively utilize multi-scale information, both spatially and temporally. Spatially, there is a lack of integration between individual monitoring stations and city-wide scales. Temporally, the periodic nature of air quality variations is often overlooked or inadequately considered. To address these limitations, we present a novel Multi-spatial Multi-temporal air quality forecasting method based on Graph Convolutional Networks and Gated Recurrent Units (M2G2), bridging the gap in air quality forecasting across spatial and temporal scales. The proposed framework consists of two modules: Multi-scale Spatial GCN (MS-GCN) for spatial information fusion and Multi-scale Temporal GRU(MT-GRU) for temporal information integration. In the spatial dimension, the MS-GCN module employs a bidirectional learnable structure and a residual structure, enabling comprehensive information exchange between individual monitoring stations and the city-scale graph. Regarding the temporal dimension, the MT-GRU module adaptively combines information from different temporal scales through parallel hidden states. Leveraging meteorological indicators and four air quality indicators, we present comprehensive comparative analyses and ablation experiments, showcasing the higher accuracy of M2G2 in comparison to nine currently available advanced approaches across all aspects. The improvements of M2G2 over the second-best method on RMSE of the 24h/48h/72h are as follows: PM2.5: (7.72%, 6.67%, 10.45%); PM10: (6.43%, 5.68%, 7.73%); NO2: (5.07%, 7.76%, 16.60%); O3: (6.46%, 6.86%, 9.79%). Furthermore, we demonstrate the effectiveness of each module of M2G2 by ablation study.

preprint2022arXiv

Atomically engineered cobaltite layers for robust ferromagnetism

Emergent phenomena at heterointerfaces are directly associated with the bonding geometry of adjacent layers. Effective control of accessible parameters, such as the bond length and bonding angles, offers an elegant method to tailor competing energies of the electronic and magnetic ground states. In this study, we construct unit thick syntactic layers of cobaltites within a strongly tilted octahedral matrix via atomically precise synthesis. The octahedral tilt patterns of adjacent layers propagate into cobaltites, leading to a continuation of octahedral tilting while maintaining significant misfit tensile strain. These effects induce severe rumpling within an atomic plane of neighboring layers triggers the electronic reconstruction between the splitting orbitals. First-principles calculations reveal that the cobalt ions transits to a higher spin state level upon octahedral tilting, resulting in robust ferromagnetism in ultrathin cobaltites. This work demonstrates a design methodology for fine-tuning the lattice and spin degrees of freedom in correlated quantum heterostructures by exploiting epitaxial geometric engineering.

preprint2022arXiv

Braiding lateral morphotropic grain boundary in homogeneitic oxides

Interfaces formed by correlated oxides offer a critical avenue for discovering emergent phenomena and quantum states. However, the fabrication of oxide interfaces with variable crystallographic orientations and strain states integrated along a film plane is extremely challenge by conventional layer-by-layer stacking or self-assembling. Here, we report the creation of morphotropic grain boundaries (GBs) in laterally interconnected cobaltite homostructures. Single-crystalline substrates and suspended ultrathin freestanding membranes provide independent templates for coherent epitaxy and constraint on the growth orientation, resulting in seamless and atomically sharp GBs. Electronic states and magnetic behavior in hybrid structures are laterally modulated and isolated by GBs, enabling artificially engineered functionalities in the planar matrix. Our work offers a simple and scalable method for fabricating unprecedented innovative interfaces through controlled synthesis routes as well as provides a platform for exploring potential applications in neuromorphics, solid state batteries, and catalysis.

preprint2022arXiv

Causal Disentanglement for Semantics-Aware Intent Learning in Recommendation

Traditional recommendation models trained on observational interaction data have generated large impacts in a wide range of applications, it faces bias problems that cover users' true intent and thus deteriorate the recommendation effectiveness. Existing methods tracks this problem as eliminating bias for the robust recommendation, e.g., by re-weighting training samples or learning disentangled representation. The disentangled representation methods as the state-of-the-art eliminate bias through revealing cause-effect of the bias generation. However, how to design the semantics-aware and unbiased representation for users true intents is largely unexplored. To bridge the gap, we are the first to propose an unbiased and semantics-aware disentanglement learning called CaDSI (Causal Disentanglement for Semantics-Aware Intent Learning) from a causal perspective. Particularly, CaDSI explicitly models the causal relations underlying recommendation task, and thus produces semantics-aware representations via disentangling users true intents aware of specific item context. Moreover, the causal intervention mechanism is designed to eliminate confounding bias stemmed from context information, which further to align the semantics-aware representation with users true intent. Extensive experiments and case studies both validate the robustness and interpretability of our proposed model.

preprint2022arXiv

Coexistence of extended flat band and Kekulé order in Li-intercalated graphene

Doping graphene near the 1/4 filling to shift the extended flat band and van Hove singularity below E$_F$ has been highly desirable. Here we report the experimental observation of an extended flat band below E$_F$ in Li-intercalated graphene. Strong electron-phonon interaction is clearly identified by notable kinks in the band dispersion. Moreover, the evolution of the band structure upon Li intercalation shows that the extended flat band and the Kekulé order emerge simultaneously. Our work provides opportunities for investigating flat band related instabilities and its interplay with the Kekulé order

preprint2022arXiv

CoSCL: Cooperation of Small Continual Learners is Stronger than a Big One

Continual learning requires incremental compatibility with a sequence of tasks. However, the design of model architecture remains an open question: In general, learning all tasks with a shared set of parameters suffers from severe interference between tasks; while learning each task with a dedicated parameter subspace is limited by scalability. In this work, we theoretically analyze the generalization errors for learning plasticity and memory stability in continual learning, which can be uniformly upper-bounded by (1) discrepancy between task distributions, (2) flatness of loss landscape and (3) cover of parameter space. Then, inspired by the robust biological learning system that processes sequential experiences with multiple parallel compartments, we propose Cooperation of Small Continual Learners (CoSCL) as a general strategy for continual learning. Specifically, we present an architecture with a fixed number of narrower sub-networks to learn all incremental tasks in parallel, which can naturally reduce the two errors through improving the three components of the upper bound. To strengthen this advantage, we encourage to cooperate these sub-networks by penalizing the difference of predictions made by their feature representations. With a fixed parameter budget, CoSCL can improve a variety of representative continual learning approaches by a large margin (e.g., up to 10.64% on CIFAR-100-SC, 9.33% on CIFAR-100-RS, 11.45% on CUB-200-2011 and 6.72% on Tiny-ImageNet) and achieve the new state-of-the-art performance.

preprint2022arXiv

Delineating complex ferroelectric domain structures via second harmonic generation spectral imaging

Understanding the mechanisms and spatial correlations of crystallographic symmetry breaking in ferroelectric materials is essential to tuning their functional properties. While optical second harmonic generation (SHG) has long been utilized in ferroelectric studies, its capability for probing complex polar materials has yet to be fully realized. Here, we develop a SHG spectral imaging method implemented on a home-designed laser-scanning SHG microscope, and demonstrate its application for a model system of (K,Na)NbO3 single crystals. Supervised model fitting analysis produces comprehensive information about the polarization vector orientations and relative fractions of constituent domain variants as well as their thermal evolution across the polymorphic phase transitions. We observe an unexpected persistence of the orthorhombic phase at low temperatures, pointing to the phase competitions. Besides, we show that unsupervised matrix decomposition analysis can quickly and faithfully reveal domain configurations without a priori knowledge about specific material systems. The SHG spectral imaging method can be readily extended to other ferroelectric materials with potentials to be further enhanced.

preprint2022arXiv

Epitaxial stabilization of an orthorhombic Mg-Ti-O superconductor

The family of titanium oxide superconductors exhibits many intriguing phenomena comparable to cuprates and iron pnictides/chalcogenides, and thus provides an ideal platform to contrastively study the unconventional pairing mechanism of high-temperature superconductors. Here, we successfully deposit superconducting Mg-Ti-O films on MgAl$_2$O$_4$ substrates with three principal orientations by ablating a MgTi$_2$O$_4$ target. Particularly, it is striking to observed that a single-crystalline film of an unintended structure has been grown on the (011)-oriented substrate, with the highest zero resistance transition temperature ($T_{\mathrm{c}0}$) of 5.0 K among them. The film has a highly reduced Mg/Ti ratio and an orthorhombic Ti$_9$O$_{10}$-like structure (denoted as Mg: Ti$_9$O$_{10}$), demonstrated by further characterizations of chemical composition and structure. Such a structure is unstable in bulk but favorable to be epitaxially stabilized on the (011)-surface of MgAl$_2$O$_4$ due to a relatively small strain at the formed interface. An isotropic upper critical field ($B_{\mathrm{c}2}$) up to 13.7 T that breaks the Pauli limit is observed in the Mg: Ti$_9$O$_{10}$ film, analogous to other superconducting titanium oxides. The similarity points to a common origin for the superconductivity in the family, which will provide valuable opinions for the mechanism of unconventional superconductivity in transition metal compounds.

preprint2022arXiv

Event Extraction by Associating Event Types and Argument Roles

Event extraction (EE), which acquires structural event knowledge from texts, can be divided into two sub-tasks: event type classification and element extraction (namely identifying triggers and arguments under different role patterns). As different event types always own distinct extraction schemas (i.e., role patterns), previous work on EE usually follows an isolated learning paradigm, performing element extraction independently for different event types. It ignores meaningful associations among event types and argument roles, leading to relatively poor performance for less frequent types/roles. This paper proposes a novel neural association framework for the EE task. Given a document, it first performs type classification via constructing a document-level graph to associate sentence nodes of different types, and adopting a graph attention network to learn sentence embeddings. Then, element extraction is achieved by building a universal schema of argument roles, with a parameter inheritance mechanism to enhance role preference for extracted elements. As such, our model takes into account type and role associations during EE, enabling implicit information sharing among them. Experimental results show that our approach consistently outperforms most state-of-the-art EE methods in both sub-tasks. Particularly, for types/roles with less training data, the performance is superior to the existing methods.

preprint2022arXiv

FedMCSA: Personalized Federated Learning via Model Components Self-Attention

Federated learning (FL) facilitates multiple clients to jointly train a machine learning model without sharing their private data. However, Non-IID data of clients presents a tough challenge for FL. Existing personalized FL approaches rely heavily on the default treatment of one complete model as a basic unit and ignore the significance of different layers on Non-IID data of clients. In this work, we propose a new framework, federated model components self-attention (FedMCSA), to handle Non-IID data in FL, which employs model components self-attention mechanism to granularly promote cooperation between different clients. This mechanism facilitates collaboration between similar model components while reducing interference between model components with large differences. We conduct extensive experiments to demonstrate that FedMCSA outperforms the previous methods on four benchmark datasets. Furthermore, we empirically show the effectiveness of the model components self-attention mechanism, which is complementary to existing personalized FL and can significantly improve the performance of FL.

preprint2022arXiv

Forestry digital twin with machine learning in Landsat 7 data

Modeling forests using historical data allows for more accurately evolution analysis, thus providing an important basis for other studies. As a recognized and effective tool, remote sensing plays an important role in forestry analysis. We can use it to derive information about the forest, including tree type, coverage and canopy density. There are many forest time series modeling studies using statistic values, but few using remote sensing images. Image prediction digital twin is an implementation of digital twin, which aims to predict future images bases on historical data. In this paper, we propose an LSTM-based digital twin approach for forest modeling, using Landsat 7 remote sensing image within 20 years. The experimental results show that the prediction twin method in this paper can effectively predict the future images of study area.

preprint2022arXiv

How Does Knowledge Graph Embedding Extrapolate to Unseen Data: A Semantic Evidence View

Knowledge Graph Embedding (KGE) aims to learn representations for entities and relations. Most KGE models have gained great success, especially on extrapolation scenarios. Specifically, given an unseen triple (h, r, t), a trained model can still correctly predict t from (h, r, ?), or h from (?, r, t), such extrapolation ability is impressive. However, most existing KGE works focus on the design of delicate triple modeling function, which mainly tells us how to measure the plausibility of observed triples, but offers limited explanation of why the methods can extrapolate to unseen data, and what are the important factors to help KGE extrapolate. Therefore in this work, we attempt to study the KGE extrapolation of two problems: 1. How does KGE extrapolate to unseen data? 2. How to design the KGE model with better extrapolation ability? For the problem 1, we first discuss the impact factors for extrapolation and from relation, entity and triple level respectively, propose three Semantic Evidences (SEs), which can be observed from train set and provide important semantic information for extrapolation. Then we verify the effectiveness of SEs through extensive experiments on several typical KGE methods. For the problem 2, to make better use of the three levels of SE, we propose a novel GNN-based KGE model, called Semantic Evidence aware Graph Neural Network (SE-GNN). In SE-GNN, each level of SE is modeled explicitly by the corresponding neighbor pattern, and merged sufficiently by the multi-layer aggregation, which contributes to obtaining more extrapolative knowledge representation. Finally, through extensive experiments on FB15k-237 and WN18RR datasets, we show that SE-GNN achieves state-of-the-art performance on Knowledge Graph Completion task and performs a better extrapolation ability. Our code is available at https://github.com/renli1024/SE-GNN.

preprint2022arXiv

Learning Generalizable Light Field Networks from Few Images

We explore a new strategy for few-shot novel view synthesis based on a neural light field representation. Given a target camera pose, an implicit neural network maps each ray to its target pixel's color directly. The network is conditioned on local ray features generated by coarse volumetric rendering from an explicit 3D feature volume. This volume is built from the input images using a 3D ConvNet. Our method achieves competitive performances on synthetic and real MVS data with respect to state-of-the-art neural radiance field based competition, while offering a 100 times faster rendering.

preprint2022arXiv

Machine Learning with DBOS

We recently proposed a new cluster operating system stack, DBOS, centered on a DBMS. DBOS enables unique support for ML applications by encapsulating ML code within stored procedures, centralizing ancillary ML data, providing security built into the underlying DBMS, co-locating ML code and data, and tracking data and workflow provenance. Here we demonstrate a subset of these benefits around two ML applications. We first show that image classification and object detection models using GPUs can be served as DBOS stored procedures with performance competitive to existing systems. We then present a 1D CNN trained to detect anomalies in HTTP requests on DBOS-backed web services, achieving SOTA results. We use this model to develop an interactive anomaly detection system and evaluate it through qualitative user feedback, demonstrating its usefulness as a proof of concept for future work to develop learned real-time security services on top of DBOS.

preprint2022arXiv

New Distinguishers for Negation-Limited Weak Pseudorandom Functions

We show how to distinguish circuits with $\log k$ negations (a.k.a $k$-monotone functions) from uniformly random functions in $\exp\left(\tilde{O}\left(n^{1/3}k^{2/3}\right)\right)$ time using random samples. The previous best distinguisher, due to the learning algorithm by Blais, Cannone, Oliveira, Servedio, and Tan (RANDOM'15), requires $\exp\big(\tilde{O}(n^{1/2} k)\big)$ time. Our distinguishers are based on Fourier analysis on \emph{slices of the Boolean cube}. We show that some "middle" slices of negation-limited circuits have strong low-degree Fourier concentration and then we apply a variation of the classic Linial, Mansour, and Nisan "Low-Degree algorithm" (JACM'93) on slices. Our techniques also lead to a slightly improved weak learner for negation limited circuits under the uniform distribution.

preprint2022arXiv

Observation of SQUID-like behavior in fiber laser with intra-cavity epsilon-near-zero effect

Establishing relations between fundamental effects in far-flung areas of physics is a subject of great interest in the current research. We here report realization of a novel photonic system akin to the radio-frequency superconducting quantum interference device (RF-SQUID), in a fiber laser cavity with epsilon-near-zero (ENZ) nanolayers as intra-cavity components. Emulating the RF-SQUID scheme, the photonic counterpart of the supercurrent, represented by the optical wave, circulates in the cavity, passing through effective optical potential barriers. Different ENZ wavelengths translate into distinct spectral outputs through the variation of cavity resonances, emulating the situation with a frequency-varying tank circuit in the RF-SQUID. Due to the presence of the ENZ element, the optical potential barrier is far lower for selected frequency components, granting them advantage in the gain-resource competition. The findings reported in this work provide a deeper insight into the ultrafast ENZ photonics, revealing a new path towards the design of nanophotonic on-chip devices with various operational functions, and offer a new approach to study superconducting and quantum-mechanical systems.

preprint2022arXiv

Position-aware Structure Learning for Graph Topology-imbalance by Relieving Under-reaching and Over-squashing

Topology-imbalance is a graph-specific imbalance problem caused by the uneven topology positions of labeled nodes, which significantly damages the performance of GNNs. What topology-imbalance means and how to measure its impact on graph learning remain under-explored. In this paper, we provide a new understanding of topology-imbalance from a global view of the supervision information distribution in terms of under-reaching and over-squashing, which motivates two quantitative metrics as measurements. In light of our analysis, we propose a novel position-aware graph structure learning framework named PASTEL, which directly optimizes the information propagation path and solves the topology-imbalance issue in essence. Our key insight is to enhance the connectivity of nodes within the same class for more supervision information, thereby relieving the under-reaching and over-squashing phenomena. Specifically, we design an anchor-based position encoding mechanism, which better incorporates relative topology position and enhances the intra-class inductive bias by maximizing the label influence. We further propose a class-wise conflict measure as the edge weights, which benefits the separation of different node classes. Extensive experiments demonstrate the superior potential and adaptability of PASTEL in enhancing GNNs' power in different data annotation scenarios.

preprint2022arXiv

Reinforced Path Reasoning for Counterfactual Explainable Recommendation

Counterfactual explanations interpret the recommendation mechanism via exploring how minimal alterations on items or users affect the recommendation decisions. Existing counterfactual explainable approaches face huge search space and their explanations are either action-based (e.g., user click) or aspect-based (i.e., item description). We believe item attribute-based explanations are more intuitive and persuadable for users since they explain by fine-grained item demographic features (e.g., brand). Moreover, counterfactual explanation could enhance recommendations by filtering out negative items. In this work, we propose a novel Counterfactual Explainable Recommendation (CERec) to generate item attribute-based counterfactual explanations meanwhile to boost recommendation performance. Our CERec optimizes an explanation policy upon uniformly searching candidate counterfactuals within a reinforcement learning environment. We reduce the huge search space with an adaptive path sampler by using rich context information of a given knowledge graph. We also deploy the explanation policy to a recommendation model to enhance the recommendation. Extensive explainability and recommendation evaluations demonstrate CERec's ability to provide explanations consistent with user preferences and maintain improved recommendations. We release our code at https://github.com/Chrystalii/CERec.

preprint2022arXiv

Room-temperature printing of ultrathin Quasi-2D GaN semiconductor via liquid metal gallium surface confined nitridation reaction

Outstanding wide-bandgap semiconductor material such as gallium nitride (GaN) has been extensively utilized in power electronics, radiofrequency amplifiers, and harsh environment devices. Due to its quantum confinement effect in enabling desired deep-ultraviolet emission, excitonic impact, and electronic transport features, two-dimensional (2D) or ultrathin quasi-2D GaN semiconductors have been one of the most remarkable candidates for future growth of microelectronic devices. Here, for the first time, we reported a large area, wide bandgap, and room-temperature quasi-2D GaN synthesis and printing strategy through introducing the plasma medicated liquid metal gallium surface-confined nitridation reaction mechanism. The developed direct fabrication and compositional process is consistent with various electronics manufacturing approaches and thus opens an easy going way for cost-effective growth of the third-generation semiconductor. In particular, the fully printed field-effect transistors relying on the GaN thus made show p-type switching with an on/off ratio greater than 105, maximum field-effect hole mobility of 53 cm2/(V*s), and a small sub-threshold swing. As it was demonstrated, the present method allows to produce at room temperature the GaN with thickness spanning from 1nm to nanometers. This basic method can be further extended, generalized, and utilized for making various electronic and photoelectronic devices in the coming time.

preprint2022arXiv

Transactions Make Debugging Easy

We propose TROD, a novel transaction-oriented framework for debugging modern distributed web applications and online services. Our critical insight is that if applications store all state in databases and only access state transactionally, TROD can use lightweight always-on tracing to track the history of application state changes and data provenance, and then leverage the captured traces and transaction logs to faithfully replay or even test modified code retroactively on any past event. We demonstrate how TROD can simplify programming and debugging in production applications, list several research challenges and directions, and encourage the database and systems communities to drastically rethink the synergy between the way people develop and debug applications.

preprint2022arXiv

Universal Segmentation of 33 Anatomies

In the paper, we present an approach for learning a single model that universally segments 33 anatomical structures, including vertebrae, pelvic bones, and abdominal organs. Our model building has to address the following challenges. Firstly, while it is ideal to learn such a model from a large-scale, fully-annotated dataset, it is practically hard to curate such a dataset. Thus, we resort to learn from a union of multiple datasets, with each dataset containing the images that are partially labeled. Secondly, along the line of partial labelling, we contribute an open-source, large-scale vertebra segmentation dataset for the benefit of spine analysis community, CTSpine1K, boasting over 1,000 3D volumes and over 11K annotated vertebrae. Thirdly, in a 3D medical image segmentation task, due to the limitation of GPU memory, we always train a model using cropped patches as inputs instead a whole 3D volume, which limits the amount of contextual information to be learned. To this, we propose a cross-patch transformer module to fuse more information in adjacent patches, which enlarges the aggregated receptive field for improved segmentation performance. This is especially important for segmenting, say, the elongated spine. Based on 7 partially labeled datasets that collectively contain about 2,800 3D volumes, we successfully learn such a universal model. Finally, we evaluate the universal model on multiple open-source datasets, proving that our model has a good generalization performance and can potentially serve as a solid foundation for downstream tasks.

preprint2021arXiv

Magnetic field-tuned quantum criticality in optimally electron-doped cuprate thin films

Antiferromagnetic (AF) spin fluctuations are commonly believed to play a key role in electron pairing of cuprate superconductors. In electron-doped cuprates, it is still in paradox about the interplay among different electronic states in quantum perturbations, especially between superconducting and magnetic states. Here, we report a systematic transport study on cation-optimized La2-xCexCuO4 (x = 0.10) thin films in high magnetic fields. We find an AF quantum phase transition near 60 T, where the Hall number jumps from nH =-x to nH = 1-x, resembling the change of nH at the AF boundary (xAF = 0.14) tuned by Ce doping. In the AF region a spin dependent state manifesting anomalous positive magnetoresistance is observed, which is closely related to superconductivity. Once the AF state is suppressed by magnetic field, a polarized ferromagnetic state is predicted, reminiscent of the recently reported ferromagnetic state at the quantum endpoint of the superconducting dome by Ce doping. The magnetic field that drives phase transitions in a similar but distinct manner to doping thereby provides a unique perspective to understand the quantum criticality of electron-doped cuprates.

preprint2021arXiv

Population inversion and Dirac fermion cooling in 3D Dirac semimetal Cd$_3$As$_2$

Revealing the ultrafast dynamics of three-dimensional (3D) Dirac fermions upon photoexcitation is critical for both fundamental science and device applications. So far, how the cooling of 3D Dirac fermions differs from that of two-dimensional (2D) Dirac fermions and whether there is population inversion are fundamental questions that remain to be answered. Here we reveal the ultrafast dynamics of Dirac fermions in a model 3D Dirac semimetal Cd$_3$As$_2$ by ultrafast time- and angle-resolved photoemission spectroscopy (TrARPES) with a tunable probe photon energy from 5.3 - 6.9 eV. The energy- and momentum-resolved relaxation rate shows a linear dependence on the energy, suggesting Dirac fermion cooling through intraband relaxation. Moreover, a population inversion is reported based on the observation of accumulated photoexcited carriers in the conduction band with a lifetime of $τ_n$ = 3.0 ps. Our work provides direct experimental evidence for a long-lived population inversion in a 3D Dirac semimetal, which is in contrast to 2D graphene where the interband relaxation occurs on a much faster timescale.

preprint2021arXiv

Room-temperature ferromagnetism at an oxide/nitride interface

Heterointerfaces have led to the discovery of novel electronic and magnetic states because of their strongly entangled electronic degrees of freedom. Single-phase chromium compounds always exhibit antiferromagnetism following the prediction of Goodenough-Kanamori rules. So far, exchange coupling between chromium ions via hetero-anions has not been explored and the associated quantum states is unknown. Here we report the successful epitaxial synthesis and characterizations of chromium oxide (Cr2O3)-chromium nitride (CrN) superlattices. Room-temperature ferromagnetic spin ordering is achieved at the interfaces between these two antiferromagnets, and the magnitude of the effect decays with increasing layer thickness. First-principles calculations indicate that robust ferromagnetic spin interaction between Cr3+ ions via anion-hybridizations across the interface yields the lowest total energy. This work opens the door to fundamental understanding of the unexpected and exceptional properties of oxide-nitride interfaces and provides access to hidden phases at low-dimensional quantum heterostructures.

preprint2020arXiv

Consistency of a kind of general noncanonical warm inflation

The framework of a kind of noncanonical warm inflation is introduced, and the dynamical equations of this scenario are presented. We propose the slow roll approximations and give some redefining slow roll parameters in this scenario which remain dimensionless. Performing systemic stability analysis, we calculate the slow roll conditions to guarantee that slow roll approximations hold. The slow roll conditions suggest slow roll inflation in general noncanonical warm inflationary scenario can still exist, and in addition, the slow roll approximations are more easily to be satisfied. Then, a concrete Dirac-Born-Infeld warm inflationary model is studied.

preprint2020arXiv

Embracing Imperfect Datasets: A Review of Deep Learning Solutions for Medical Image Segmentation

The medical imaging literature has witnessed remarkable progress in high-performing segmentation models based on convolutional neural networks. Despite the new performance highs, the recent advanced segmentation models still require large, representative, and high quality annotated datasets. However, rarely do we have a perfect training dataset, particularly in the field of medical imaging, where data and annotations are both expensive to acquire. Recently, a large body of research has studied the problem of medical image segmentation with imperfect datasets, tackling two major dataset limitations: scarce annotations where only limited annotated data is available for training, and weak annotations where the training data has only sparse annotations, noisy annotations, or image-level annotations. In this article, we provide a detailed review of the solutions above, summarizing both the technical novelties and empirical results. We further compare the benefits and requirements of the surveyed methodologies and provide our recommended solutions. We hope this survey article increases the community awareness of the techniques that are available to handle imperfect medical image segmentation datasets.

preprint2020arXiv

How to Investigate the Historical Roots and Evolution of Research Fields in China? A Case Study on iMetrics Using RootCite

This paper aimed to provide an approach to investigate the historical roots and evolution of research fields in China by extending the reference publication year spectroscopy (RPYS). RootCite, an open source software accepts raw data from both the Web of Science and the China Social Science Citation Index (CSSCI), was developed using python. We took iMetrics in China as the research case. 5,141 Chinese iMetrics related publications with 73,376 non-distinct cited references (CR) collected from the CSSCI were analyzed using RootCite. The results showed that the first CR in the field can be dated back to 1882 and written in English; but the majority (64.2%) of the CR in the field were Chinese publications. 17 peaks referring to 18 seminal works (13 in English and 5 in Chinese) were located during the period from 1900 to 2017. The field shared the same roots with that in the English world but has its own characteristics, and it was then shaped by contributions from both the English world and China. The five Chinese works have played irreplaceable and positive roles in the historical evolutionary path of the field, which should not be ignored, especially for the evolution of the field. This research demonstrated how RootCite aided the task of identifying the origin and evolution of research fields in China, which could be valuable for extending RPYS for countries with other languages.

preprint2020arXiv

INFaaS: A Model-less and Managed Inference Serving System

Despite existing work in machine learning inference serving, ease-of-use and cost efficiency remain challenges at large scales. Developers must manually search through thousands of model-variants -- versions of already-trained models that differ in hardware, resource footprints, latencies, costs, and accuracies -- to meet the diverse application requirements. Since requirements, query load, and applications themselves evolve over time, these decisions need to be made dynamically for each inference query to avoid excessive costs through naive autoscaling. To avoid navigating through the large and complex trade-off space of model-variants, developers often fix a variant across queries, and replicate it when load increases. However, given the diversity across variants and hardware platforms in the cloud, a lack of understanding of the trade-off space can incur significant costs to developers. This paper introduces INFaaS, a managed and model-less system for distributed inference serving, where developers simply specify the performance and accuracy requirements for their applications without needing to specify a specific model-variant for each query. INFaaS generates model-variants, and efficiently navigates the large trade-off space of model-variants on behalf of developers to meet application-specific objectives: (a) for each query, it selects a model, hardware architecture, and model optimizations, (b) it combines VM-level horizontal autoscaling with model-level autoscaling, where multiple, different model-variants are used to serve queries within each machine. By leveraging diverse variants and sharing hardware resources across models, INFaaS achieves 1.3x higher throughput, violates latency objectives 1.6x less often, and saves up to 21.6x in cost (8.5x on average) compared to state-of-the-art inference serving systems on AWS EC2.

preprint2020arXiv

Influence of Initialization on the Performance of Metaheuristic Optimizers

All metaheuristic optimization algorithms require some initialization, and the initialization for such optimizers is usually carried out randomly. However, initialization can have some significant influence on the performance of such algorithms. This paper presents a systematic comparison of 22 different initialization methods on the convergence and accuracy of five optimizers: differential evolution (DE), particle swarm optimization (PSO), cuckoo search (CS), artificial bee colony (ABC) algorithm and genetic algorithm (GA). We have used 19 different test functions with different properties and modalities to compare the possible effects of initialization, population sizes and the numbers of iterations. Rigorous statistical ranking tests indicate that 43.37\% of the functions using the DE algorithm show significant differences for different initialization methods, while 73.68\% of the functions using both PSO and CS algorithms are significantly affected by different initialization methods. The simulations show that DE is less sensitive to initialization, while both PSO and CS are more sensitive to initialization. In addition, under the condition of the same maximum number of function evaluations (FEs), the population size can also have a strong effect. Particle swarm optimization usually requires a larger population, while the cuckoo search needs only a small population size. Differential evolution depends more heavily on the number of iterations, a relatively small population with more iterations can lead to better results. Furthermore, ABC is more sensitive to initialization, while such initialization has little effect on GA. Some probability distributions such as the beta distribution, exponential distribution and Rayleigh distribution can usually lead to better performance. The implications of this study and further research topics are also discussed in detail.

preprint2020arXiv

Learning Differential Diagnosis of Skin Conditions with Co-occurrence Supervision using Graph Convolutional Networks

Skin conditions are reported the 4th leading cause of nonfatal disease burden worldwide. However, given the colossal spectrum of skin disorders defined clinically and shortage in dermatology expertise, diagnosing skin conditions in a timely and accurate manner remains a challenging task. Using computer vision technologies, a deep learning system has proven effective assisting clinicians in image diagnostics of radiology, ophthalmology and more. In this paper, we propose a deep learning system (DLS) that may predict differential diagnosis of skin conditions using clinical images. Our DLS formulates the differential diagnostics as a multi-label classification task over 80 conditions when only incomplete image labels are available. We tackle the label incompleteness problem by combining a classification network with a Graph Convolutional Network (GCN) that characterizes label co-occurrence and effectively regularizes it towards a sparse representation. Our approach is demonstrated on 136,462 clinical images and concludes that the classification accuracy greatly benefit from the Co-occurrence supervision. Our DLS achieves 93.6% top-5 accuracy on 12,378 test images and consistently outperform the baseline classification network.

preprint2020arXiv

Leveraging Multi-level Dependency of Relational Sequences for Social Spammer Detection

Much recent research has shed light on the development of the relation-dependent but content-independent framework for social spammer detection. This is largely because the relation among users is difficult to be altered when spammers attempt to conceal their malicious intents. Our study investigates the spammer detection problem in the context of multi-relation social networks, and makes an attempt to fully exploit the sequences of heterogeneous relations for enhancing the detection accuracy. Specifically, we present the Multi-level Dependency Model (MDM). The MDM is able to exploit user's long-term dependency hidden in their relational sequences along with short-term dependency. Moreover, MDM fully considers short-term relational sequences from the perspectives of individual-level and union-level, due to the fact that the type of short-term sequences is multi-folds. Experimental results on a real-world multi-relational social network demonstrate the effectiveness of our proposed MDM on multi-relational social spammer detection.

preprint2020arXiv

Neighborhood Information-based Probabilistic Algorithm for Network Disintegration

Many real-world applications can be modelled as complex networks, and such networks include the Internet, epidemic disease networks, transport networks, power grids, protein-folding structures and others. Network integrity and robustness are important to ensure that crucial networks are protected and undesired harmful networks can be dismantled. Network structure and integrity can be controlled by a set of key nodes, and to find the optimal combination of nodes in a network to ensure network structure and integrity can be an NP-complete problem. Despite extensive studies, existing methods have many limitations and there are still many unresolved problems. This paper presents a probabilistic approach based on neighborhood information and node importance, namely, neighborhood information-based probabilistic algorithm (NIPA). We also define a new centrality-based importance measure (IM), which combines the contribution ratios of the neighbor nodes of each target node and two-hop node information. Our proposed NIPA has been tested for different network benchmarks and compared with three other methods: optimal attack strategy (OAS), high betweenness first (HBF) and high degree first (HDF). Experiments suggest that the proposed NIPA is most effective among all four methods. In general, NIPA can identify the most crucial node combination with higher effectiveness, and the set of optimal key nodes found by our proposed NIPA is much smaller than that by heuristic centrality prediction. In addition, many previously neglected weakly connected nodes are identified, which become a crucial part of the newly identified optimal nodes. Thus, revised strategies for protection are recommended to ensure the safeguard of network integrity. Further key issues and future research topics are also discussed.

preprint2020arXiv

Region-Referenced Spectral Power Dynamics of EEG Signals: A Hierarchical Modeling Approach

Functional brain imaging through electroencephalography (EEG) relies upon the analysis and interpretation of high-dimensional, spatially organized time series. We propose to represent time-localized frequency domain characterizations of EEG data as region-referenced functional data. This representation is coupled with a hierarchical modeling approach to multivariate functional observations. Within this familiar setting, we discuss how several prior models relate to structural assumptions about multivariate covariance operators. An overarching modeling framework, based on infinite factorial decompositions, is finally proposed to balance flexibility and efficiency in estimation. The motivating application stems from a study of implicit auditory learning, in which typically developing (TD) children, and children with autism spectrum disorder (ASD) were exposed to a continuous speech stream. Using the proposed model, we examine differential band power dynamics as brain function is interrogated throughout the duration of a computer-controlled experiment. Our work offers a novel look at previous findings in psychiatry, and provides further insights into the understanding of ASD. Our approach to inference is fully Bayesian and implemented in a highly optimized Rcpp package.

preprint2020arXiv

Stochastic Batch Augmentation with An Effective Distilled Dynamic Soft Label Regularizer

Data augmentation have been intensively used in training deep neural network to improve the generalization, whether in original space (e.g., image space) or representation space. Although being successful, the connection between the synthesized data and the original data is largely ignored in training, without considering the distribution information that the synthesized samples are surrounding the original sample in training. Hence, the behavior of the network is not optimized for this. However, that behavior is crucially important for generalization, even in the adversarial setting, for the safety of the deep learning system. In this work, we propose a framework called Stochastic Batch Augmentation (SBA) to address these problems. SBA stochastically decides whether to augment at iterations controlled by the batch scheduler and in which a ''distilled'' dynamic soft label regularization is introduced by incorporating the similarity in the vicinity distribution respect to raw samples. The proposed regularization provides direct supervision by the KL-Divergence between the output soft-max distributions of original and virtual data. Our experiments on CIFAR-10, CIFAR-100, and ImageNet show that SBA can improve the generalization of the neural networks and speed up the convergence of network training.

preprint2020arXiv

Triaging moderate COVID-19 and other viral pneumonias from routine blood tests

The COVID-19 is sweeping the world with deadly consequences. Its contagious nature and clinical similarity to other pneumonias make separating subjects contracted with COVID-19 and non-COVID-19 viral pneumonia a priority and a challenge. However, COVID-19 testing has been greatly limited by the availability and cost of existing methods, even in developed countries like the US. Intrigued by the wide availability of routine blood tests, we propose to leverage them for COVID-19 testing using the power of machine learning. Two proven-robust machine learning model families, random forests (RFs) and support vector machines (SVMs), are employed to tackle the challenge. Trained on blood data from 208 moderate COVID-19 subjects and 86 subjects with non-COVID-19 moderate viral pneumonia, the best result is obtained in an SVM-based classifier with an accuracy of 84%, a sensitivity of 88%, a specificity of 80%, and a precision of 92%. The results are found explainable from both machine learning and medical perspectives. A privacy-protected web portal is set up to help medical personnel in their practice and the trained models are released for developers to further build other applications. We hope our results can help the world fight this pandemic and welcome clinical verification of our approach on larger populations.

preprint2020arXiv

Triple Memory Networks: a Brain-Inspired Method for Continual Learning

Continual acquisition of novel experience without interfering previously learned knowledge, i.e. continual learning, is critical for artificial neural networks, but limited by catastrophic forgetting. A neural network adjusts its parameters when learning a new task, but then fails to conduct the old tasks well. By contrast, the brain has a powerful ability to continually learn new experience without catastrophic interference. The underlying neural mechanisms possibly attribute to the interplay of hippocampus-dependent memory system and neocortex-dependent memory system, mediated by prefrontal cortex. Specifically, the two memory systems develop specialized mechanisms to consolidate information as more specific forms and more generalized forms, respectively, and complement the two forms of information in the interplay. Inspired by such brain strategy, we propose a novel approach named triple memory networks (TMNs) for continual learning. TMNs model the interplay of hippocampus, prefrontal cortex and sensory cortex (a neocortex region) as a triple-network architecture of generative adversarial networks (GAN). The input information is encoded as specific representation of the data distributions in a generator, or generalized knowledge of solving tasks in a discriminator and a classifier, with implementing appropriate brain-inspired algorithms to alleviate catastrophic forgetting in each module. Particularly, the generator replays generated data of the learned tasks to the discriminator and the classifier, both of which are implemented with a weight consolidation regularizer to complement the lost information in generation process. TMNs achieve new state-of-the-art performance on a variety of class-incremental learning benchmarks on MNIST, SVHN, CIFAR-10 and ImageNet-50, comparing with strong baseline methods.

preprint2020arXiv

U-Net Using Stacked Dilated Convolutions for Medical Image Segmentation

This paper proposes a novel U-Net variant using stacked dilated convolutions for medical image segmentation (SDU-Net). SDU-Net adopts the architecture of vanilla U-Net with modifications in the encoder and decoder operations (an operation indicates all the processing for feature maps of the same resolution). Unlike vanilla U-Net which incorporates two standard convolutions in each encoder/decoder operation, SDU-Net uses one standard convolution followed by multiple dilated convolutions and concatenates all dilated convolution outputs as input to the next operation. Experiments showed that SDU-Net outperformed vanilla U-Net, attention U-Net (AttU-Net), and recurrent residual U-Net (R2U-Net) in all four tested segmentation tasks while using parameters around 40% of vanilla U-Net's, 17% of AttU-Net's, and 15% of R2U-Net's.

preprint2020arXiv

Weakly Supervised Context Encoder using DICOM metadata in Ultrasound Imaging

Modern deep learning algorithms geared towards clinical adaption rely on a significant amount of high fidelity labeled data. Low-resource settings pose challenges like acquiring high fidelity data and becomes the bottleneck for developing artificial intelligence applications. Ultrasound images, stored in Digital Imaging and Communication in Medicine (DICOM) format, have additional metadata data corresponding to ultrasound image parameters and medical exams. In this work, we leverage DICOM metadata from ultrasound images to help learn representations of the ultrasound image. We demonstrate that the proposed method outperforms the non-metadata based approaches across different downstream tasks.

preprint2019arXiv

Coherent transfer of spin angular momentum by evanescent spin waves within antiferromagnetic NiO

Insulating antiferromagnets are efficient and robust conductors of spin current. To realise the full potential of these materials within spintronics, the outstanding challenges are to demonstrate scalability down to nanometric lengthscales and the transmission of coherent spin currents. Here, we report the coherent transfer of spin angular momentum by excitation of evanescent spin waves of GHz frequency within antiferromagnetic NiO at room temperature. Using element-specific and phase-resolved x-ray ferromagnetic resonance, we probe the injection and transmission of ac spin current, and demonstrate that insertion of a few nanometre thick epitaxial NiO(001) layer between a ferromagnet and non-magnet can even enhance the flow of spin current. Our results pave the way towards coherent control of the phase and amplitude of spin currents at the nanoscale, and enable the realization of spin-logic devices and spin current amplifiers that operate at GHz and THz frequencies.

preprint2019arXiv

Emergent superconductivity in single crystalline $\mathrm{MgTi}_2\mathrm{O}_4$ films via structural engineering

Spinel compounds have demonstrated rich functionalities but rarely shown superconductivity. Here, we report the emergence of superconductivity in the spinel $\mathrm{MgTi}_2\mathrm{O}_4$, known to be an insulator with a complicated order. The superconducting transition is achieved by engineering a superlattice of $\mathrm{MgTi}_2\mathrm{O}_4$ and $\mathrm{SrTiO}_3$. The onset transition temperature in the $\mathrm{MgTi}_2\mathrm{O}_4$ layer can be tuned from 0 to 5 K in such geometry, concurrently with a stretched $c$-axis (from 8.51 to 8.53 Å) compared to the bulk material. Such a positive correlation without saturation suggests ample room for the further enhancement. Intriguingly, the superlattice exhibits isotropic upper critical field $H_{\mathrm{c}2}$ that breaks the Pauli limit, distinct from the highly anisotropic feature of interface superconductivity. The origin of superconductivity in the $\mathrm{MgTi}_2\mathrm{O}_4$ layer is understood in combination with the electron energy loss spectra and the first-principles electronic structure calculations, which point to the birth of superconductivity in the $\mathrm{MgTi}_2\mathrm{O}_4$ layer by preventing the Ti-Ti dimerization. Our discovery not only provides a platform to explore the interplay between the superconductivity and other exotic states, but also opens a new window to realize superconductivity in the spinel compounds as well as other titanium oxides.

preprint2019arXiv

On some model equations for pulsatile flow in viscoelastic vessels

Considered here is the derivation of partial differential equations arising in pulsatile flow in pipes with viscoelastic walls. The equations are asymptotic models describing the propagation of long-crested pulses in pipes with cylindrical symmetry. Additional effects due to viscous stresses in bio-fluids are also taken into account. The effects of viscoelasticity of the vessels on the propagation of solitary and periodic waves in a vessel of constant radius are being explored numerically.

preprint2018arXiv

A Reconfigurable Nanophotonics Platform for Sub-Millisecond, Deep Brain Neural Stimulation

Nanophotonics provides the ability to rapidly and precisely reconfigure light beams on a compact platform. Infrared nanophotonic devices are widely used in data communications to overcome traditional bandwidth limitations of electrical interconnects. Nanophotonic devices also hold promise for use in biological applications that require visible light, but this has remained technically elusive due to the challenges of reconfiguring and guiding light at these smaller dimensions. In neuroscience, for example, there is a need for implantable optical devices to optogenetically stimulate neurons across deep brain regions with the speed and precision matching state-of-the-art recording probes. Here we demonstrate the first platform for reconfigurable nanophotonic devices in the visible wavelength range and show its application in vivo in the brain. We demonstrate an implantable probe endowed with the ability to rapidly switch and route multiple optical beams using a nanoscale switching network. Each switch consists of a silicon nitride waveguide structure that can be reconfigured by electrically tuning the phase of light and is designed for robustness to fabrication variation, enabling scalable devices. By implanting our probe in mouse visual cortex, we demonstrate in vivo the ability to stimulate identified sets of neurons across layers to produce multi-neuron spike patterns and record them simultaneously with sub-millisecond temporal precision. This nanophotonic platform can be scaled up and integrated with high-density neural recording technologies, opening the door to implantable probe technologies that are able to simultaneously record and stimulate the activity of large neural populations at distant regions of the brain with sub-millisecond precision. We expect this platform will enable researchers to gain a deeper understanding into the spatio-temporal precision of the neural code.

preprint2016arXiv

A Tighter Relation between Sensitivity and Certificate Complexity

The sensitivity conjecture which claims that the sensitivity complexity is polynomially related to block sensitivity complexity, is one of the most important and challenging problem in decision tree complexity theory. Despite of a lot of efforts, the best known upper bound of block sensitivity, as well as the certificate complexity, are still exponential in terms of sensitivity: $bs(f)\leq C(f)\leq\max\{2^{s(f)-1}(s(f)-\frac{1}{3}),s(f)\}$. In this paper, we give a better upper bound of $bs(f)\leq C(f)\leq(\frac{8}{9} + o(1))s(f)2^{s(f) - 1}$. The proof is based on a deep investigation on the structure of the sensitivity graph. We also provide a tighter relationship between $C_0(f)$ and $s_0(f)$ for functions with $s_1(f)=2$.

preprint2016arXiv

An end-to-end network slicing framework for 5G wireless communication systems

Wireless industry nowadays is facing two major challenges: 1) how to support the vertical industry applications so that to expand the wireless industry market and 2) how to further enhance device capability and user experience. In this paper, we propose a technology framework to address these challenges. The proposed technology framework is based on end-to-end vertical and horizontal slicing, where vertical slicing enables vertical industry and services and horizontal slicing improves system capacity and user experience. The technology development on vertical slicing has already started in late 4G and early 5G and is mostly focused on slicing the core network. We envision this trend to continue with the development of vertical slicing in the radio access network and the air interface. Moving beyond vertical slicing, we propose to horizontally slice the computation and communication resources to form virtual computation platforms for solving the network capacity scaling problem and enhancing device capability and user experience. In this paper, we explain the concept of vertical and horizontal slicing and illustrate the slicing techniques in the air interface, the radio access network, the core network and the computation platform. This paper aims to initiate the discussion on the long-range technology roadmap and spur development on the solutions for E2E network slicing in 5G and beyond.

preprint2016arXiv

Efficient Delivery Policy to Minimize User Traffic Consumption in Guaranteed Advertising

In this work, we study the guaranteed delivery model which is widely used in online display advertising. In the guaranteed delivery scenario, ad exposures (which are also called impressions in some works) to users are guaranteed by contracts signed in advance between advertisers and publishers. A crucial problem for the advertising platform is how to fully utilize the valuable user traffic to generate as much as possible revenue. Different from previous works which usually minimize the penalty of unsatisfied contracts and some other cost (e.g. representativeness), we propose the novel consumption minimization model, in which the primary objective is to minimize the user traffic consumed to satisfy all contracts. Under this model, we develop a near optimal method to deliver ads for users. The main advantage of our method lies in that it consumes nearly as least as possible user traffic to satisfy all contracts, therefore more contracts can be accepted to produce more revenue. It also enables the publishers to estimate how much user traffic is redundant or short so that they can sell or buy this part of traffic in bulk in the exchange market. Furthermore, it is robust with regard to priori knowledge of user type distribution. Finally, the simulation shows that our method outperforms the traditional state-of-the-art methods.

preprint2016arXiv

On the Optimality of Tape Merge of Two Lists with Similar Size

The problem of merging sorted lists in the least number of pairwise comparisons has been solved completely only for a few special cases. Graham and Karp \cite{taocp} independently discovered that the tape merge algorithm is optimal in the worst case when the two lists have the same size. In the seminal papers, Stockmeyer and Yao\cite{yao}, Murphy and Paull\cite{3k3}, and Christen\cite{christen1978optimality} independently showed when the lists to be merged are of size $m$ and $n$ satisfying $m\leq n\leq\lfloor\frac{3}{2}m\rfloor+1$, the tape merge algorithm is optimal in the worst case. This paper extends this result by showing that the tape merge algorithm is optimal in the worst case whenever the size of one list is no larger than 1.52 times the size of the other. The main tool we used to prove lower bounds is Knuth's adversary methods \cite{taocp}. In addition, we show that the lower bound cannot be improved to 1.8 via Knuth's adversary methods. We also develop a new inequality about Knuth's adversary methods, which might be interesting in its own right. Moreover, we design a simple procedure to achieve constant improvement of the upper bounds for $2m-2\leq n\leq 3m $.

preprint2016arXiv

On the Sensitivity Complexity of $k$-Uniform Hypergraph Properties

In this paper we investigate the sensitivity complexity of hypergraph properties. We present a $k$-uniform hypergraph property with sensitivity complexity $O(n^{\lceil k/3\rceil})$ for any $k\geq3$, where $n$ is the number of vertices. Moreover, we can do better when $k\equiv1$ (mod 3) by presenting a $k$-uniform hypergraph property with sensitivity $O(n^{\lceil k/3\rceil-1/2})$. This result disproves a conjecture of Babai~\cite{Babai}, which conjectures that the sensitivity complexity of $k$-uniform hypergraph properties is at least $Ω(n^{k/2})$. We also investigate the sensitivity complexity of other symmetric functions and show that for many classes of transitive Boolean functions the minimum achievable sensitivity complexity can be $O(N^{1/3})$, where $N$ is the number of variables. Finally, we give a lower bound for sensitivity of $k$-uniform hypergraph properties, which implies the {\em sensitivity conjecture} of $k$-uniform hypergraph properties for any constant $k$.

preprint2016arXiv

Slow light based optical frequency shifter

We demonstrate experimentally and theoretically a controllable way of shifting the frequency of an optical pulse by using a combination of spectral hole burning, slow light effect, and linear Stark effect in a rare-earth-ion doped crystal. We claim that the solid angle of acceptance of a frequency shift structure can be close to $2π$, which means that the frequency shifter could work not only for optical pulses propagating in a specific spatial mode but also for randomly scattered light. As the frequency shift is controlled solely by an external electric field, it works also for weak coherent light fields, and can e.g. be used as a frequency shifter for quantum memory devices in quantum communication.

preprint2016arXiv

Soft Phonon Modes and Diffuse Scattering in Pb(In1/2Nb1/2)O3-Pb(Mg1/3Nb2/3)O3-PbTiO3 Relaxor Ferroelectrics

Single crystals of a ternary relaxor ferroelectric system, 0.29Pb(In1/2Nb1/2)O3-0.45Pb(Mg1/3Nb2/3)O3-0.26PbTiO3, have been studied using triple-axis based elastic and inelastic neutron scattering. Elastic diffuse scattering confirms the presence of polar nano-regions (PNRs) in this system. The PNRs emerge at the Burns temperature, TB = 630 K and then grow continuously in population and correlation size as the crystal cools down to 100 K. At 300 K, characteristic 'butterfly' and ellipsoid shaped diffuse scattering patterns are observed on the HK0 reciprocal space plane. Electrical poling along the [110] direction produces a marked asymmetry in the diffuse scattering patterns, with the parallel-to-the-field components enhanced while the perpendicular-to-the-field components suppressed. Several low energy phonon branches along the [001] and [111] directions were studied. Most significantly, the PNR-acoustic phonon coupling is confirmed for the [110] transverse acoustic (TA) phonons polarized along the [1-10] real space direction and the [100] TA phonons. This coupling appears to be anisotropic and correlated with the distribution of PNRs, and also affected by the relative length scales of the PNRs and phonon wave vectors. The well-known 'waterfall' phenomenon is observed on the [001] and [110] transverse optical (TO) branches, near the zone center. The optical phonon measurements also reveal a lowest-energy, zone center soft TO mode, whose squared phonon energy increase linearly with decreasing temperature below the TB.

preprint2015arXiv

Dirac-Point Solitons in Nonlinear Optical Lattices

The discovery of a new type of solitons occuring in periodic systems without photonic bandgaps is reported. Solitons are nonlinear self-trapped wave packets. They have been extensively studied in many branches of physics. Solitons in periodic systems, which have become the mainstream of soliton research in the past decade, are localized states supported by photonic bandgaps. In this Letter, we report the discovery of a new type of solitons located at the Dirac point beyond photonic bandgaps. The Dirac point is a conical singularity of a photonic band structure where wave motion obeys the famous Dirac equation. These new solitons are sustained by the Dirac point rather than photonic bandgaps, thus provides a sort of advance in conceptual understanding over the traditional gap solitons. Apart from their theoretical impact within soliton theory, they have many potential uses because such solitons have dramatic stability characteristics and are possible in both Kerr material and photorefractive crystals that possess self-focusing and self-defocusing nonlinearities. The new results elegantly reveal that traditional photonic bandgaps are not required when Dirac points are accessible. The findings enrich the soliton family and provide valuable information for studies of nonlinear waves in many branches of physics, including hydrodynamics, plasma physics, and Bose Einstein condensates.

preprint2015arXiv

Fast all-optical nuclear spin echo technique based on EIT

We demonstrate an all-optical Raman spin echo technique, using Electromagnetically Induced Transparency (EIT) to create the different pulses of the spin echo sequence: initialization, pi-rotation, and readout. The first pulse of the sequence induces coherence directly from a mixed state, and the technique is used to measure the nuclear spin coherence of an inhomogeneously broadened ensemble of rare-earth ions (Pr$^{3+}$). In contrast to previous experiments it does not require any preparatory hole burning pulse sequences, which greatly shortens the total duration of the sequence. The effect of the different pulses is characterized by quantum state tomography and is compared with simulations. We demonstrate two applications of the technique by using the spin echo sequence to accurately compensate a magnetic field across our sample, and to measure the coherence time at high temperatures up to 11 K, where standard preparation techniques are difficult to implement. We explore the potential of the technique and possible applications.

preprint2015arXiv

Mechanical Tuning of Thermal Transport in a Molecular Junction

Understanding and controlling heat transport in molecular junctions would provide new routes to design nanoscale coupled electronic and phononic devices. Using first principles full quantum calculations, we tune thermal conductance of a molecular junction by mechanically compressing and extending a short alkane chain connected to graphene leads. We find that the thermal conductance of the compressed junction drops by half in comparison to the extended junction, making it possible to turn on and off the heat current. The low conductance of the off state does not vary by further approaching the leads and stems from the suppression of the transmission of the in--plane transverse and longitudinal channels. Furthermore, we show that misalignment of the leads does not reduce the conductance ratio. These results also contribute to the general understanding of thermal transport in molecular junctions.

preprint2014arXiv

Correlation between centrality metrics and their application to the opinion model

In recent decades, a number of centrality metrics describing network properties of nodes have been proposed to rank the importance of nodes. In order to understand the correlations between centrality metrics and to approximate a high-complexity centrality metric by a strongly correlated low-complexity metric, we first study the correlation between centrality metrics in terms of their Pearson correlation coefficient and their similarity in ranking of nodes. In addition to considering the widely used centrality metrics, we introduce a new centrality measure, the degree mass. The m order degree mass of a node is the sum of the weighted degree of the node and its neighbors no further than m hops away. We find that the B_{n}, the closeness, and the components of x_{1} are strongly correlated with the degree, the 1st-order degree mass and the 2nd-order degree mass, respectively, in both network models and real-world networks. We then theoretically prove that the Pearson correlation coefficient between x_{1} and the 2nd-order degree mass is larger than that between x_{1} and a lower order degree mass. Finally, we investigate the effect of the inflexible antagonists selected based on different centrality metrics in helping one opinion to compete with another in the inflexible antagonists opinion model. Interestingly, we find that selecting the inflexible antagonists based on the leverage, the B_{n}, or the degree is more effective in opinion-competition than using other centrality metrics in all types of networks. This observation is supported by our previous observations, i.e., that there is a strong linear correlation between the degree and the B_{n}, as well as a high centrality similarity between the leverage and the degree.

preprint2014arXiv

Designing $π$-stacked molecular structures to control heat transport through molecular junctions

We propose and analyze a new way of using $π$ stacking to design molecular junctions that either enhance or suppress a phononic heat current, but at the same time remain conductors for an electric current. Such functionality is highly desirable in thermoelectric energy converters, as well as in other electronic components where heat dissipation should be minimized or maximized. We suggest a molecular design consisting of two masses coupled to each other with one mass coupled to each lead. By having a small coupling (spring constant) between the masses, it is possible to either reduce, or perhaps more surprisingly enhance the phonon conductance. We investigate a simple model system to identify optimal parameter regimes and then use first principle calculations to extract model parameters for a number of specific molecular realizations, confirming that our proposal can indeed be realized using standard molecular building blocks.

preprint2014arXiv

Non-consensus opinion model on directed networks

Dynamic social opinion models have been widely studied on undirected networks, and most of them are based on spin interaction models that produce a consensus. In reality, however, many networks such as Twitter and the World Wide Web are directed and are composed of both unidirectional and bidirectional links. Moreover, from choosing a coffee brand to deciding who to vote for in an election, two or more competing opinions often coexist. In response to this ubiquity of directed networks and the coexistence of two or more opinions in decision-making situations, we study a non-consensus opinion model introduced by Shao et al. \cite{shao2009dynamic} on directed networks. We define directionality $ξ$ as the percentage of unidirectional links in a network, and we use the linear correlation coefficient $ρ$ between the indegree and outdegree of a node to quantify the relation between the indegree and outdegree. We introduce two degree-preserving rewiring approaches which allow us to construct directed networks that can have a broad range of possible combinations of directionality $ξ$ and linear correlation coefficient $ρ$ and to study how $ξ$ and $ρ$ impact opinion competitions. We find that, as the directionality $ξ$ or the indegree and outdegree correlation $ρ$ increases, the majority opinion becomes more dominant and the minority opinion's ability to survive is lowered.

preprint2013arXiv

Effect of the Interconnected Network Structure on the Epidemic Threshold

Most real-world networks are not isolated. In order to function fully, they are interconnected with other networks, and this interconnection influences their dynamic processes. For example, when the spread of a disease involves two species, the dynamics of the spread within each species (the contact network) differs from that of the spread between the two species (the interconnected network). We model two generic interconnected networks using two adjacency matrices, A and B, in which A is a 2N*2N matrix that depicts the connectivity within each of two networks of size N, and B a 2N*2N matrix that depicts the interconnections between the two. Using an N-intertwined mean-field approximation, we determine that a critical susceptable-infected-susceptable (SIS) epidemic threshold in two interconnected networks is 1/λ1(A+αB), where the infection rate is βwithin each of the two individual networks and αβin the interconnected links between the two networks and λ1(A+αB) is the largest eigenvalue of the matrix A+αB. In order to determine how the epidemic threshold is dependent upon the structure of interconnected networks, we analytically derive λ1(A+αB) using perturbation approximation for small and large α, the lower and upper bound for any αas a function of the adjacency matrix of the two individual networks, and the interconnections between the two and their largest eigenvalues/eigenvectors. We verify these approximation and boundary values for λ1(A+αB) using numerical simulations, and determine how component network features affect λ1(A+αB).

preprint2013arXiv

Efficient quantum memory using a weakly absorbing sample

A light-storage experiment with a total (storage and retrieval) efficiency $η=58 \pm 5%$ is carried out by enclosing a sample, with a single pass absorption of 10%, in an impedance-matched cavity. The experiment is carried out using the Atomic Frequency Comb (AFC) technique in a praseodymium-doped crystal ($0.05%Pr^{3+}:Y_2SiO_5$) and the cavity is created by reflection coating the crystal surfaces. The AFC technique has previously by far demonstrated the highest multi-mode capacity of all quantum memory concepts tested experimentally. We claim that the present work shows that it is realistic to create efficient, on-demand, long storage time AFC memories.

preprint2013arXiv

Identifying Influential Spreaders by Weighted LeaderRank

Identifying influential spreaders is crucial for understanding and controlling spreading processes on social networks. Via assigning degree-dependent weights onto links associated with the ground node, we proposed a variant to a recent ranking algorithm named LeaderRank [L. Lv et al., PLoS ONE 6 (2011) e21202]. According to the simulations on the standard SIR model, the weighted LeaderRank performs better than LeaderRank in three aspects: (i) the ability to find out more influential spreaders, (ii) the higher tolerance to noisy data, and (iii) the higher robustness to intentional attacks.

preprint2013arXiv

Spectral Engineering of Slow Light, Cavity Line Narrowing, and Pulse Compression

More than 4 orders of magnitude of cavity-linewidth narrowing in a rare-earth-ion-doped crystal cavity, emanating from strong intracavity dispersion caused by off-resonant interaction with dopant ions, is demonstrated. The dispersion profiles are engineered using optical pumping techniques creating significant semipermanent but reprogrammable changes of the rare-earth absorption profiles. Several cavity modes are shown within the spectral transmission window. Several possible applications of this phenomenon are discussed.

preprint2013arXiv

Three orders of magnitude cavity-linewidth narrowing by slow light in a rare-earth-ion-doped crystal cavity

Three orders of magnitude cavity-linewidth narrowing in a rare-earth-ion-doped crystal cavity, induced by strong intra-cavity dispersion caused by off-resonant interaction with dopant ions is demonstrated. The strong dispersion is created by semi-permanent but rapidly reprogrammable changes of the rare earth absorption profiles using optical pumping techniques. Several cavity modes are shown within the spectral transmission window. Potential applications are discussed.

preprint2012arXiv

Non-consensus opinion models on complex networks

We focus on non-consensus opinion models in which above a certain threshold two opinions coexist in a stable relationship. We revisit and extend the non-consensus opinion (NCO) model introduced by Shao. We generalize the NCO model by adding a weight factor W to individual's own opinion when determining its future opinion (NCOW model). We find that as W increases the minority opinion holders tend to form stable clusters with a smaller initial minority fraction compared to the NCO model. We also revisit another non-consensus opinion, the inflexible contrarian opinion (ICO) model, which introduces inflexible contrarians to model a competition between two opinions in the steady state. In the ICO model, the inflexible contrarians effectively decrease the size of the largest cluster of the rival opinion. All of the above models have previously been explored in terms of a single network. However opinions propagate not only within single networks but also between networks, we study here the opinion dynamics in coupled networks. We apply the NCO rule on each individual network and the global majority rule on interdependent pairs. We find that the interdependent links effectively force the system from a second order phase transition, which is characteristic of the NCO model on a single network, to a hybrid phase transition, i.e., a mix of second-order and abrupt jump-like transitions that ultimately becomes, as we increase the percentage of interdependent agents, a pure abrupt transition. We conclude that for the NCO model on coupled networks, interactions through interdependent links could push the non-consensus opinion type model to a consensus opinion type model, which mimics the reality that increased mass communication causes people to hold opinions that are increasingly similar.

preprint2012arXiv

Proton Capture on ^{17}O and its astrophysical implications

The reaction $^{17}$O$(p,γ)^{18}$F influences hydrogen-burning nucleosynthesis in several stellar sites, such as red giants, asymptotic giant branch (AGB) stars, massive stars and classical novae. In the relevant temperature range for these environments ($T_{9}=0.01-0.4), the main contributions to the rate of this reaction are the direct capture process, two low lying narrow resonances ($E_{r}=65.1$ and 183 keV) and the low-energy tails of two broad resonances ($E_{r}=557$ and 677 keV). Previous measurements and calculations give contradictory results for the direct capture contribution which in turn increases the uncertainty of the reaction rate. In addition, very few published cross section data exist for the high energy region that might affect the interpretation of the direct capture and the contributions of the broad resonances in the lower energy range. This work aims to address these issues. The reaction cross section was measured in a wide proton energy range ($E_{c.m.}=345$ - 1700 keV) and at several angles ($θ_{lab}=0^{\circ},45^{\circ},90^{\circ},135^{\circ}$). The observed primary $γ$-transitions were used as input in an $R$-matrix code in order to obtain the contribution of the direct capture and the two broad resonances to the low-energy region. The extrapolated S-factor from the present data is in good agreement with the existing literature data in the low-energy region. A new reaction rate was calculated from the combined results of this work and literature S-factor determinations. Resonance strengths and branchings are reported for several $^{18}$F states. We were able to extrapolate the astrophysical S-factor of the reaction $^{17}$O$(p,γ)^{18}$F at low energies from cross section data taken at higher energies. No significant changes in the nucleosynthesis are expected from the newly calculated reaction rate.

preprint2011arXiv

Strategy of Competition between Two Groups based on a Contrarian Opinion Model

We introduce a contrarian opinion (CO) model in which a fraction p of contrarians within a group holds a strong opinion opposite to the opinion held by the rest of the group. At the initial stage, stable clusters of two opinions, A and B exist. Then we introduce contrarians which hold a strong B opinion into the opinion A group. Through their interactions, the contrarians are able to decrease the size of the largest A opinion cluster, and even destroy it. We see this kind of method in operation, e.g when companies send free new products to potential customers in order to convince them to adopt the product and influence others. We study the CO model, using two different strategies, on both ER and scale-free networks. In strategy I, the contrarians are positioned at random. In strategy II, the contrarians are chosen to be the highest degrees nodes. We find that for both strategies the size of the largest A cluster decreases to zero as p increases as in a phase transition. At a critical threshold value p_c the system undergoes a second-order phase transition that belongs to the same universality class of mean field percolation. We find that even for an ER type model, where the degrees of the nodes are not so distinct, strategy II is significantly more effctive in reducing the size of the largest A opinion cluster and, at very small values of p, the largest A opinion cluster is destroyed.

Qian Li

What is connected

Connect this record

See the researcher in context

Building this map preview

65 published item(s)

Delineating Knowledge Boundaries for Honest Large Vision-Language Models

Text-Video Retrieval via Variational Multi-Modal Hypergraph Networks

Multi-spatial Multi-temporal Air Quality Forecasting with Integrated Monitoring and Reanalysis Data

Atomically engineered cobaltite layers for robust ferromagnetism

Braiding lateral morphotropic grain boundary in homogeneitic oxides

Causal Disentanglement for Semantics-Aware Intent Learning in Recommendation

Coexistence of extended flat band and Kekulé order in Li-intercalated graphene

CoSCL: Cooperation of Small Continual Learners is Stronger than a Big One

Delineating complex ferroelectric domain structures via second harmonic generation spectral imaging

Epitaxial stabilization of an orthorhombic Mg-Ti-O superconductor

Event Extraction by Associating Event Types and Argument Roles

FedMCSA: Personalized Federated Learning via Model Components Self-Attention

Forestry digital twin with machine learning in Landsat 7 data

How Does Knowledge Graph Embedding Extrapolate to Unseen Data: A Semantic Evidence View

Learning Generalizable Light Field Networks from Few Images

Machine Learning with DBOS

New Distinguishers for Negation-Limited Weak Pseudorandom Functions

Observation of SQUID-like behavior in fiber laser with intra-cavity epsilon-near-zero effect

Position-aware Structure Learning for Graph Topology-imbalance by Relieving Under-reaching and Over-squashing

Reinforced Path Reasoning for Counterfactual Explainable Recommendation

Room-temperature printing of ultrathin Quasi-2D GaN semiconductor via liquid metal gallium surface confined nitridation reaction

Transactions Make Debugging Easy

Universal Segmentation of 33 Anatomies

Magnetic field-tuned quantum criticality in optimally electron-doped cuprate thin films

Population inversion and Dirac fermion cooling in 3D Dirac semimetal Cd$_3$As$_2$

Room-temperature ferromagnetism at an oxide/nitride interface

Consistency of a kind of general noncanonical warm inflation

Embracing Imperfect Datasets: A Review of Deep Learning Solutions for Medical Image Segmentation

How to Investigate the Historical Roots and Evolution of Research Fields in China? A Case Study on iMetrics Using RootCite

INFaaS: A Model-less and Managed Inference Serving System

Influence of Initialization on the Performance of Metaheuristic Optimizers

Learning Differential Diagnosis of Skin Conditions with Co-occurrence Supervision using Graph Convolutional Networks

Leveraging Multi-level Dependency of Relational Sequences for Social Spammer Detection

Neighborhood Information-based Probabilistic Algorithm for Network Disintegration

Region-Referenced Spectral Power Dynamics of EEG Signals: A Hierarchical Modeling Approach

Stochastic Batch Augmentation with An Effective Distilled Dynamic Soft Label Regularizer

Triaging moderate COVID-19 and other viral pneumonias from routine blood tests

Triple Memory Networks: a Brain-Inspired Method for Continual Learning

U-Net Using Stacked Dilated Convolutions for Medical Image Segmentation

Weakly Supervised Context Encoder using DICOM metadata in Ultrasound Imaging

Coherent transfer of spin angular momentum by evanescent spin waves within antiferromagnetic NiO

Emergent superconductivity in single crystalline $\mathrm{MgTi}_2\mathrm{O}_4$ films via structural engineering

On some model equations for pulsatile flow in viscoelastic vessels

A Reconfigurable Nanophotonics Platform for Sub-Millisecond, Deep Brain Neural Stimulation

A Tighter Relation between Sensitivity and Certificate Complexity

An end-to-end network slicing framework for 5G wireless communication systems

Efficient Delivery Policy to Minimize User Traffic Consumption in Guaranteed Advertising

On the Optimality of Tape Merge of Two Lists with Similar Size

On the Sensitivity Complexity of $k$-Uniform Hypergraph Properties

Slow light based optical frequency shifter

Soft Phonon Modes and Diffuse Scattering in Pb(In1/2Nb1/2)O3-Pb(Mg1/3Nb2/3)O3-PbTiO3 Relaxor Ferroelectrics

Dirac-Point Solitons in Nonlinear Optical Lattices

Fast all-optical nuclear spin echo technique based on EIT

Mechanical Tuning of Thermal Transport in a Molecular Junction

Correlation between centrality metrics and their application to the opinion model

Designing $π$-stacked molecular structures to control heat transport through molecular junctions

Non-consensus opinion model on directed networks

Effect of the Interconnected Network Structure on the Epidemic Threshold

Efficient quantum memory using a weakly absorbing sample

Identifying Influential Spreaders by Weighted LeaderRank

Spectral Engineering of Slow Light, Cavity Line Narrowing, and Pulse Compression

Three orders of magnitude cavity-linewidth narrowing by slow light in a rare-earth-ion-doped crystal cavity

Non-consensus opinion models on complex networks

Proton Capture on ^{17}O and its astrophysical implications

Strategy of Competition between Two Groups based on a Contrarian Opinion Model