Source author record

Jia Li

Jia Li appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Catalog footprint

What is connected

72works

32topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

Signal Reshaping for GRPO in Weak-Feedback Agentic Code Repair

Code-agent RL often receives weak feedback: rollout-time signals are reliable and executable, but capture only necessary or surface conditions for task success rather than the target semantic predicate. Using agentic compile-fix as the setting, we study signal reshaping for standard GRPO under such feedback. Our central claim is that GRPO's within-group comparison is meaningful only after three kinds of signals are reshaped: outcome rewards recover semantic ranking, process signals localize intra-trajectory credit, and rollouts from the same prompt remain execution-comparable. We operationalize these conditions with a minimal signal-reshaping construction that leaves GRPO's group-normalized advantage construction unchanged: compile-and-semantic layered rewards reshape trajectory ranking, step-level process scores outside group reward normalization reshape within-trajectory update strength, and failure-cause-aware rollout governance reshapes within-group comparability. Experiments show a clear end-to-end gain: full signal-reshaped GRPO improves strict compile-and-semantic accuracy from the base model's zero-shot $0.385$ to $0.535$. Controlled comparisons further explain the source of this gain: binary rewards remove the compile-only middle tier and degrade trajectory control; on top of layered rewards, process-score weighting further improves accuracy from $0.48$ to $0.53$ and reduces average evaluation steps from $23.50$ to $17.02$. As a boundary comparison, privileged-prompt token-level distillation mainly optimizes local distributional alignment; in long tool-use trajectories, this signal is diluted by non-critical tokens and cannot replace outcome semantics, process credit, or within-group comparability.

preprint2026arXiv

To Use AI as Dice of Possibilities with Timing Computation

The dominant noun-based modeling paradigm has fundamentally constrained AI development, precluding any adequate representation of the future as an open temporal dimension. This paper introduces a verb-based paradigm, together with precise definitions of \emph{timing computation} and \emph{possibility}, that enables AI to function as an effective instrument for realizing the grammar of our thought. Applied to longitudinal EHR data from 3,276 breast cancer patients, the framework empirically demonstrates: (1) automatic discovery of clinically significant patient trajectories, and (2) counterfactual timing deduction. Both results are purely data-driven, require no prior domain knowledge, and, to our knowledge, represent the first such demonstrations in the machine learning literature.

preprint2024arXiv

Is Bigger and Deeper Always Better? Probing LLaMA Across Scales and Layers

This paper presents an in-depth analysis of Large Language Models (LLMs), focusing on LLaMA, a prominent open-source foundational model in natural language processing. Instead of assessing LLaMA through its generative output, we design multiple-choice tasks to probe its intrinsic understanding in high-order tasks such as reasoning and computation. We examine the model horizontally, comparing different sizes, and vertically, assessing different layers. We unveil several key and uncommon findings based on the designed probing tasks: (1) Horizontally, enlarging model sizes almost could not automatically impart additional knowledge or computational prowess. Instead, it can enhance reasoning abilities, especially in math problem solving, and helps reduce hallucinations, but only beyond certain size thresholds; (2) In vertical analysis, the lower layers of LLaMA lack substantial arithmetic and factual knowledge, showcasing logical thinking, multilingual and recognitive abilities, with top layers housing most computational power and real-world knowledge.

preprint2024arXiv

Rectangular carbon nitrides C4N monolayers with a zigzag buckled structure: Quasi-one-dimensional Dirac nodal lines and topological flat edge states

Due to the flexibility of C and N atoms in forming different types of bonds, the prediction of new two-dimensional (2D) carbon nitrides is a hot topic in the field of carbon-based materials. Using first-principles calculations, we propose two C4N monolayers with a zigzag buckled (ZB) structure. The ZB C4N monolayers contain raised-C (raised-N) atoms with sp3 hybridization, different from the traditional 2D graphene-like carbon nitride materials with sp2 hybridization. Interestingly, the band structures of the ZB C4N monolayers exhibit quasi-one-dimensional (quasi-1D) Dirac nodal line that results from the corresponding quasi-1D structure of the zigzag carbon chains, which is essentially different from the more common ring-shaped nodal line. The quasi-1D Dirac nodal line exhibits the following features: (i) gapless Dirac points, (ii) varying Fermi velocity, and (iii) slightly curved band along the high-symmetry path. All these features are successfully explained by our proposed tight-binding model that includes interactions up to the third nearest-neighbor. The Fermi velocity of the 2D system can reach 105 m/s, which is promising for applications in high-speed electronic devices. The topological flat band structure determined by the Zak phase and band inversion of the corresponding 1D system is edge-dependent, which is corresponding to the Su-Schrieffer-Heeger model, providing to rich physical phenomena.

preprint2023arXiv

Rethinking Lightweight Salient Object Detection via Network Depth-Width Tradeoff

Existing salient object detection methods often adopt deeper and wider networks for better performance, resulting in heavy computational burden and slow inference speed. This inspires us to rethink saliency detection to achieve a favorable balance between efficiency and accuracy. To this end, we design a lightweight framework while maintaining satisfying competitive accuracy. Specifically, we propose a novel trilateral decoder framework by decoupling the U-shape structure into three complementary branches, which are devised to confront the dilution of semantic context, loss of spatial structure and absence of boundary detail, respectively. Along with the fusion of three branches, the coarse segmentation results are gradually refined in structure details and boundary quality. Without adding additional learnable parameters, we further propose Scale-Adaptive Pooling Module to obtain multi-scale receptive filed. In particular, on the premise of inheriting this framework, we rethink the relationship among accuracy, parameters and speed via network depth-width tradeoff. With these insightful considerations, we comprehensively design shallower and narrower models to explore the maximum potential of lightweight SOD. Our models are purposed for different application environments: 1) a tiny version CTD-S (1.7M, 125FPS) for resource constrained devices, 2) a fast version CTD-M (12.6M, 158FPS) for speed-demanding scenarios, 3) a standard version CTD-L (26.5M, 84FPS) for high-performance platforms. Extensive experiments validate the superiority of our method, which achieves better efficiency-accuracy balance across five benchmarks.

preprint2023arXiv

Self-supervised Hypergraph Representation Learning for Sociological Analysis

Modern sociology has profoundly uncovered many convincing social criteria for behavioural analysis. Unfortunately, many of them are too subjective to be measured and presented in online social networks. On the other hand, data mining techniques can better find data patterns but many of them leave behind unnatural understanding. In this paper, we propose a fundamental methodology to support the further fusion of data mining techniques and sociological behavioral criteria. Our highlights are three-fold: First, we propose an effective hypergraph awareness and a fast line graph construction framework. The hypergraph can more profoundly indicate the interactions between individuals and their environments because each edge in the hypergraph (a.k.a hyperedge) contains more than two nodes, which is perfect to describe social environments. A line graph treats each social environment as a super node with the underlying influence between different environments. In this way, we go beyond traditional pair-wise relations and explore richer patterns under various sociological criteria; Second, we propose a novel hypergraph-based neural network to learn social influence flowing from users to users, users to environments, environment to users, and environments to environments. The neural network can be learned via a task-free method, making our model very flexible to support various data mining tasks and sociological analysis; Third, we propose both qualitative and quantitive solutions to effectively evaluate the most common sociological criteria like social conformity, social equivalence, environmental evolving and social polarization. Our extensive experiments show that our framework can better support both data mining tasks for online user behaviours and sociological analysis.

preprint2022arXiv

A Domain-Specific Language for Simulation-Based Testing of IoT Edge-to-Cloud Solutions

The Internet of things (IoT) is increasingly prevalent in domains such as emergency response, smart cities and autonomous vehicles. Simulation plays a key role in the testing of IoT systems, noting that field testing of a complete IoT product may be infeasible or prohibitively expensive. In this paper, we propose a domain-specific language (DSL) for generating edge-to-cloud simulators. An edge-to-cloud simulator executes the functionality of a large array of edge devices that communicate with cloud applications. Our DSL, named IoTECS, is the result of a collaborative project with an IoT analytics company, Cheetah Networks. The industrial use case that motivates IoTECS is ensuring the scalability of cloud applications by putting them under extreme loads from IoT devices connected to the edge. We implement IoTECS using Xtext and empirically evaluate its usefulness. We further reflect on the lessons learned.

preprint2022arXiv

Automatic detection of low surface brightness galaxies from SDSS images

Low surface brightness (LSB) galaxies are galaxies with central surface brightness fainter than the night sky. Due to the faint nature of LSB galaxies and the comparable sky background, it is difficult to search LSB galaxies automatically and efficiently from large sky survey. In this study, we established the Low Surface Brightness Galaxies Auto Detect model (LSBG-AD), which is a data-driven model for end-to-end detection of LSB galaxies from Sloan Digital Sky Survey (SDSS) images. Object detection techniques based on deep learning are applied to the SDSS field images to identify LSB galaxies and estimate their coordinates at the same time. Applying LSBG-AD to 1120 SDSS images, we detected 1197 LSB galaxy candidates, of which 1081 samples are already known and 116 samples are newly found candidates. The B-band central surface brightness of the candidates searched by the model ranges from 22 mag arcsec $^ {- 2} $ to 24 mag arcsec $^ {- 2} $, quite consistent with the surface brightness distribution of the standard sample. 96.46\% of LSB galaxy candidates have an axis ratio ($b/a$) greater than 0.3, and 92.04\% of them have $fracDev\_r$\textless 0.4, which is also consistent with the standard sample. The results show that the LSBG-AD model learns the features of LSB galaxies of the training samples well, and can be used to search LSB galaxies without using photometric parameters. Next, this method will be used to develop efficient algorithms to detect LSB galaxies from massive images of the next generation observatories.

preprint2022arXiv

Coexistence of extended flat band and Kekulé order in Li-intercalated graphene

Doping graphene near the 1/4 filling to shift the extended flat band and van Hove singularity below E$_F$ has been highly desirable. Here we report the experimental observation of an extended flat band below E$_F$ in Li-intercalated graphene. Strong electron-phonon interaction is clearly identified by notable kinks in the band dispersion. Moreover, the evolution of the band structure upon Li intercalation shows that the extended flat band and the Kekulé order emerge simultaneously. Our work provides opportunities for investigating flat band related instabilities and its interplay with the Kekulé order

preprint2022arXiv

Complementary Feature Enhanced Network with Vision Transformer for Image Dehazing

Conventional CNNs-based dehazing models suffer from two essential issues: the dehazing framework (limited in interpretability) and the convolution layers (content-independent and ineffective to learn long-range dependency information). In this paper, firstly, we propose a new complementary feature enhanced framework, in which the complementary features are learned by several complementary subtasks and then together serve to boost the performance of the primary task. One of the prominent advantages of the new framework is that the purposively chosen complementary tasks can focus on learning weakly dependent complementary features, avoiding repetitive and ineffective learning of the networks. We design a new dehazing network based on such a framework. Specifically, we select the intrinsic image decomposition as the complementary tasks, where the reflectance and shading prediction subtasks are used to extract the color-wise and texture-wise complementary features. To effectively aggregate these complementary features, we propose a complementary features selection module (CFSM) to select the more useful features for image dehazing. Furthermore, we introduce a new version of vision transformer block, named Hybrid Local-Global Vision Transformer (HyLoG-ViT), and incorporate it within our dehazing networks. The HyLoG-ViT block consists of the local and the global vision transformer paths used to capture local and global dependencies. As a result, the HyLoG-ViT introduces locality in the networks and captures the global and long-range dependencies. Extensive experiments on homogeneous, non-homogeneous, and nighttime dehazing tasks reveal that the proposed dehazing network can achieve comparable or even better performance than CNNs-based dehazing models.

preprint2022arXiv

ConTinTin: Continual Learning from Task Instructions

The mainstream machine learning paradigms for NLP often work with two underlying presumptions. First, the target task is predefined and static; a system merely needs to learn to solve it exclusively. Second, the supervision of a task mainly comes from a set of labeled examples. A question arises: how to build a system that can keep learning new tasks from their instructions? This work defines a new learning paradigm ConTinTin (Continual Learning from Task Instructions), in which a system should learn a sequence of new tasks one by one, each task is explained by a piece of textual instruction. The system is required to (i) generate the expected outputs of a new task by learning from its instruction, (ii) transfer the knowledge acquired from upstream tasks to help solve downstream tasks (i.e., forward-transfer), and (iii) retain or even improve the performance on earlier tasks after learning new tasks (i.e., backward-transfer). This new problem is studied on a stream of more than 60 tasks, each equipped with an instruction. Technically, our method InstructionSpeak contains two strategies that make full use of task instructions to improve forward-transfer and backward-transfer: one is to learn from negative outputs, the other is to re-visit instructions of previous tasks. To our knowledge, this is the first time to study ConTinTin in NLP. In addition to the problem formulation and our promising approach, this work also contributes to providing rich analyses for the community to better understand this novel learning problem.

preprint2022arXiv

Cross-scale Attention Guided Multi-instance Learning for Crohn's Disease Diagnosis with Pathological Images

Multi-instance learning (MIL) is widely used in the computer-aided interpretation of pathological Whole Slide Images (WSIs) to solve the lack of pixel-wise or patch-wise annotations. Often, this approach directly applies "natural image driven" MIL algorithms which overlook the multi-scale (i.e. pyramidal) nature of WSIs. Off-the-shelf MIL algorithms are typically deployed on a single-scale of WSIs (e.g., 20x magnification), while human pathologists usually aggregate the global and local patterns in a multi-scale manner (e.g., by zooming in and out between different magnifications). In this study, we propose a novel cross-scale attention mechanism to explicitly aggregate inter-scale interactions into a single MIL network for Crohn's Disease (CD), which is a form of inflammatory bowel disease. The contribution of this paper is two-fold: (1) a cross-scale attention mechanism is proposed to aggregate features from different resolutions with multi-scale interaction; and (2) differential multi-scale attention visualizations are generated to localize explainable lesion patterns. By training ~250,000 H&E-stained Ascending Colon (AC) patches from 20 CD patient and 30 healthy control samples at different scales, our approach achieved a superior Area under the Curve (AUC) score of 0.8924 compared with baseline models. The official implementation is publicly available at https://github.com/hrlblab/CS-MIL.

preprint2022arXiv

Deep Learning Eliminates Massive Dust Storms from Images of Tianwen-1

Dust storms may remarkably degrade the imaging quality of Martian orbiters and delay the progress of mapping the global topography and geomorphology. To address this issue, this paper presents an approach that reuses the image dehazing knowledge obtained on Earth to resolve the dust-removal problem on Mars. In this approach, we collect remote-sensing images captured by Tianwen-1 and manually select hundreds of clean and dusty images. Inspired by the haze formation process on Earth, we formulate a similar visual degradation process on clean images and synthesize dusty images sharing a similar feature distribution with realistic dusty images. These realistic clean and synthetic dusty image pairs are used to train a deep model that inherently encodes dust irrelevant features and decodes them into dust-free images. Qualitative and quantitative results show that dust storms can be effectively eliminated by the proposed approach, leading to obviously improved topographical and geomorphological details of Mars.

preprint2022arXiv

ELECRec: Training Sequential Recommenders as Discriminators

Sequential recommendation is often considered as a generative task, i.e., training a sequential encoder to generate the next item of a user's interests based on her historical interacted items. Despite their prevalence, these methods usually require training with more meaningful samples to be effective, which otherwise will lead to a poorly trained model. In this work, we propose to train the sequential recommenders as discriminators rather than generators. Instead of predicting the next item, our method trains a discriminator to distinguish if a sampled item is a 'real' target item or not. A generator, as an auxiliary model, is trained jointly with the discriminator to sample plausible alternative next items and will be thrown out after training. The trained discriminator is considered as the final SR model and denoted as \modelname. Experiments conducted on four datasets demonstrate the effectiveness and efficiency of the proposed approach.

preprint2022arXiv

Generating Negative Samples for Sequential Recommendation

To make Sequential Recommendation (SR) successful, recent works focus on designing effective sequential encoders, fusing side information, and mining extra positive self-supervision signals. The strategy of sampling negative items at each time step is less explored. Due to the dynamics of users' interests and model updates during training, considering randomly sampled items from a user's non-interacted item set as negatives can be uninformative. As a result, the model will inaccurately learn user preferences toward items. Identifying informative negatives is challenging because informative negative items are tied with both dynamically changed interests and model parameters (and sampling process should also be efficient). To this end, we propose to Generate Negative Samples (items) for SR (GenNi). A negative item is sampled at each time step based on the current SR model's learned user preferences toward items. An efficient implementation is proposed to further accelerate the generation process, making it scalable to large-scale recommendation tasks. Extensive experiments on four public datasets verify the importance of providing high-quality negative samples for SR and demonstrate the effectiveness and efficiency of GenNi.

preprint2022arXiv

Hybrid Multimodal Feature Extraction, Mining and Fusion for Sentiment Analysis

In this paper, we present our solutions for the Multimodal Sentiment Analysis Challenge (MuSe) 2022, which includes MuSe-Humor, MuSe-Reaction and MuSe-Stress Sub-challenges. The MuSe 2022 focuses on humor detection, emotional reactions and multimodal emotional stress utilizing different modalities and data sets. In our work, different kinds of multimodal features are extracted, including acoustic, visual, text and biological features. These features are fused by TEMMA and GRU with self-attention mechanism frameworks. In this paper, 1) several new audio features, facial expression features and paragraph-level text embeddings are extracted for accuracy improvement. 2) we substantially improve the accuracy and reliability of multimodal sentiment prediction by mining and blending the multimodal features. 3) effective data augmentation strategies are applied in model training to alleviate the problem of sample imbalance and prevent the model from learning biased subject characters. For the MuSe-Humor sub-challenge, our model obtains the AUC score of 0.8932. For the MuSe-Reaction sub-challenge, the Pearson's Correlations Coefficient of our approach on the test set is 0.3879, which outperforms all other participants. For the MuSe-Stress sub-challenge, our approach outperforms the baseline in both arousal and valence on the test dataset, reaching a final combined result of 0.5151.

preprint2022arXiv

Improving Contrastive Learning with Model Augmentation

The sequential recommendation aims at predicting the next items in user behaviors, which can be solved by characterizing item relationships in sequences. Due to the data sparsity and noise issues in sequences, a new self-supervised learning (SSL) paradigm is proposed to improve the performance, which employs contrastive learning between positive and negative views of sequences. However, existing methods all construct views by adopting augmentation from data perspectives, while we argue that 1) optimal data augmentation methods are hard to devise, 2) data augmentation methods destroy sequential correlations, and 3) data augmentation fails to incorporate comprehensive self-supervised signals. Therefore, we investigate the possibility of model augmentation to construct view pairs. We propose three levels of model augmentation methods: neuron masking, layer dropping, and encoder complementing. This work opens up a novel direction in constructing views for contrastive SSL. Experiments verify the efficacy of model augmentation for the SSL in the sequential recommendation. Code is available\footnote{\url{https://github.com/salesforce/SRMA}}.

preprint2022arXiv

Intent Contrastive Learning for Sequential Recommendation

Users' interactions with items are driven by various intents (e.g., preparing for holiday gifts, shopping for fishing equipment, etc.).However, users' underlying intents are often unobserved/latent, making it challenging to leverage such latent intents forSequentialrecommendation(SR). To investigate the benefits of latent intents and leverage them effectively for recommendation, we proposeIntentContrastiveLearning(ICL), a general learning paradigm that leverages a latent intent variable into SR. The core idea is to learn users' intent distribution functions from unlabeled user behavior sequences and optimize SR models with contrastive self-supervised learning (SSL) by considering the learned intents to improve recommendation. Specifically, we introduce a latent variable to represent users' intents and learn the distribution function of the latent variable via clustering. We propose to leverage the learned intents into SR models via contrastive SSL, which maximizes the agreement between a view of sequence and its corresponding intent. The training is alternated between intent representation learning and the SR model optimization steps within the generalized expectation-maximization (EM) framework. Fusing user intent information into SR also improves model robustness. Experiments conducted on four real-world datasets demonstrate the superiority of the proposed learning paradigm, which improves performance, and robustness against data sparsity and noisy interaction issues.

preprint2022arXiv

Interaction expansion inchworm Monte Carlo solver for lattice and impurity models

Multi-orbital quantum impurity models with general interaction and hybridization terms appear in a wide range of applications including embedding, quantum transport, and nanoscience. However, most quantum impurity solvers are restricted to a few impurity orbitals, discretized baths, diagonal hybridizations, or density-density interactions. Here, we generalize the inchworm quantum Monte Carlo method to the interaction expansion and explore its application to typical single- and multi-orbital problems encountered in investigations of impurity and lattice models. Our implementation generically outperforms bare and bold-line quantum Monte Carlo algorithms in the interaction expansion. So far, for the systems studied here, it remains inferior to the more specialized hybridization expansion and auxiliary field algorithms. The problem of convergence to unphysical fixed points, which hampers so-called bold-line methods, is not encountered in inchworm Monte Carlo.

preprint2022arXiv

Learning Self-adaptations for IoT Networks: A Genetic Programming Approach

Internet of Things (IoT) is a pivotal technology in application domains that require connectivity and interoperability between large numbers of devices. IoT systems predominantly use a software-defined network (SDN) architecture as their core communication backbone. This architecture offers several advantages, including the flexibility to make IoT networks self-adaptive through software programmability. In general, self-adaptation solutions need to periodically monitor, reason about, and adapt a running system. The adaptation step involves generating an adaptation strategy and applying it to the running system whenever an anomaly arises. In this paper, we argue that, rather than generating individual adaptation strategies, the goal should be to adapt the logic / code of the running system in such a way that the system itself would learn how to steer clear of future anomalies, without triggering self-adaptation too frequently. We instantiate and empirically assess this idea in the context of IoT networks. Specifically, using genetic programming (GP), we propose a self-adaptation solution that continuously learns and updates the control constructs in the data-forwarding logic of SDN-based IoT networks. Our evaluation, performed using open-source synthetic and industrial data, indicates that, compared to a baseline adaptation technique that attempts to generate individual adaptations, our GP-based approach is more effective in resolving network congestion, and further, reduces the frequency of adaptation interventions over time. In addition, we compare our approach against a standard data-forwarding algorithm from the network literature, demonstrating that our approach significantly reduces packet loss.

preprint2022arXiv

MACE: An Efficient Model-Agnostic Framework for Counterfactual Explanation

Counterfactual explanation is an important Explainable AI technique to explain machine learning predictions. Despite being studied actively, existing optimization-based methods often assume that the underlying machine-learning model is differentiable and treat categorical attributes as continuous ones, which restricts their real-world applications when categorical attributes have many different values or the model is non-differentiable. To make counterfactual explanation suitable for real-world applications, we propose a novel framework of Model-Agnostic Counterfactual Explanation (MACE), which adopts a newly designed pipeline that can efficiently handle non-differentiable machine-learning models on a large number of feature values. in our MACE approach, we propose a novel RL-based method for finding good counterfactual examples and a gradient-less descent method for improving proximity. Experiments on public datasets validate the effectiveness with better validity, sparsity and proximity.

preprint2022arXiv

Permutation-based tests for discontinuities in event studies

We propose using a permutation test to detect discontinuities in an underlying economic model at a known cutoff point. Relative to the existing literature, we show that this test is well suited for event studies based on time-series data. The test statistic measures the distance between the empirical distribution functions of observed data in two local subsamples on the two sides of the cutoff. Critical values are computed via a standard permutation algorithm. Under a high-level condition that the observed data can be coupled by a collection of conditionally independent variables, we establish the asymptotic validity of the permutation test, allowing the sizes of the local subsamples to be either be fixed or grow to infinity. In the latter case, we also establish that the permutation test is consistent. We demonstrate that our high-level condition can be verified in a broad range of problems in the infill asymptotic time-series setting, which justifies using the permutation test to detect jumps in economic variables such as volatility, trading activity, and liquidity. These potential applications are illustrated in an empirical case study for selected FOMC announcements during the ongoing COVID-19 pandemic.

preprint2022arXiv

Pyramid Grafting Network for One-Stage High Resolution Saliency Detection

Recent salient object detection (SOD) methods based on deep neural network have achieved remarkable performance. However, most of existing SOD models designed for low-resolution input perform poorly on high-resolution images due to the contradiction between the sampling depth and the receptive field size. Aiming at resolving this contradiction, we propose a novel one-stage framework called Pyramid Grafting Network (PGNet), using transformer and CNN backbone to extract features from different resolution images independently and then graft the features from transformer branch to CNN branch. An attention-based Cross-Model Grafting Module (CMGM) is proposed to enable CNN branch to combine broken detailed information more holistically, guided by different source feature during decoding process. Moreover, we design an Attention Guided Loss (AGL) to explicitly supervise the attention matrix generated by CMGM to help the network better interact with the attention from different models. We contribute a new Ultra-High-Resolution Saliency Detection dataset UHRSD, containing 5,920 images at 4K-8K resolutions. To our knowledge, it is the largest dataset in both quantity and resolution for high-resolution SOD task, which can be used for training and testing in future research. Sufficient experiments on UHRSD and widely-used SOD datasets demonstrate that our method achieves superior performance compared to the state-of-the-art methods.

preprint2022arXiv

ReAct: Temporal Action Detection with Relational Queries

This work aims at advancing temporal action detection (TAD) using an encoder-decoder framework with action queries, similar to DETR, which has shown great success in object detection. However, the framework suffers from several problems if directly applied to TAD: the insufficient exploration of inter-query relation in the decoder, the inadequate classification training due to a limited number of training samples, and the unreliable classification scores at inference. To this end, we first propose a relational attention mechanism in the decoder, which guides the attention among queries based on their relations. Moreover, we propose two losses to facilitate and stabilize the training of action classification. Lastly, we propose to predict the localization quality of each action query at inference in order to distinguish high-quality queries. The proposed method, named ReAct, achieves the state-of-the-art performance on THUMOS14, with much lower computational costs than previous methods. Besides, extensive ablation studies are conducted to verify the effectiveness of each proposed component. The code is available at https://github.com/sssste/React.

preprint2022arXiv

Rethinking Graph Neural Networks for Anomaly Detection

Graph Neural Networks (GNNs) are widely applied for graph anomaly detection. As one of the key components for GNN design is to select a tailored spectral filter, we take the first step towards analyzing anomalies via the lens of the graph spectrum. Our crucial observation is the existence of anomalies will lead to the `right-shift' phenomenon, that is, the spectral energy distribution concentrates less on low frequencies and more on high frequencies. This fact motivates us to propose the Beta Wavelet Graph Neural Network (BWGNN). Indeed, BWGNN has spectral and spatial localized band-pass filters to better handle the `right-shift' phenomenon in anomalies. We demonstrate the effectiveness of BWGNN on four large-scale anomaly detection datasets. Our code and data are released at https://github.com/squareRoot3/Rethinking-Anomaly-Detection

preprint2022arXiv

RGRecSys: A Toolkit for Robustness Evaluation of Recommender Systems

Robust machine learning is an increasingly important topic that focuses on developing models resilient to various forms of imperfect data. Due to the pervasiveness of recommender systems in online technologies, researchers have carried out several robustness studies focusing on data sparsity and profile injection attacks. Instead, we propose a more holistic view of robustness for recommender systems that encompasses multiple dimensions - robustness with respect to sub-populations, transformations, distributional disparity, attack, and data sparsity. While there are several libraries that allow users to compare different recommender system models, there is no software library for comprehensive robustness evaluation of recommender system models under different scenarios. As our main contribution, we present a robustness evaluation toolkit, Robustness Gym for RecSys (RGRecSys -- https://www.github.com/salesforce/RGRecSys), that allows us to quickly and uniformly evaluate the robustness of recommender system models.

preprint2022arXiv

Styleverse: Towards Identity Stylization across Heterogeneous Domains

We propose a new challenging task namely IDentity Stylization (IDS) across heterogeneous domains. IDS focuses on stylizing the content identity, rather than completely swapping it using the reference identity. We use an effective heterogeneous-network-based framework $Styleverse$ that uses a single domain-aware generator to exploit the Metaverse of diverse heterogeneous faces, based on the proposed dataset FS13 with limited data. FS13 means 13 kinds of Face Styles considering diverse lighting conditions, art representations and life dimensions. Previous similar tasks, \eg, image style transfer can handle textural style transfer based on a reference image. This task usually ignores the high structure-aware facial area and high-fidelity preservation of the content. However, Styleverse intends to controllably create topology-aware faces in the Parallel Style Universe, where the source facial identity is adaptively styled via AdaIN guided by the domain-aware and reference-aware style embeddings from heterogeneous pretrained models. We first establish the IDS quantitative benchmark as well as the qualitative Styleverse matrix. Extensive experiments demonstrate that Styleverse achieves higher-fidelity identity stylization compared with other state-of-the-art methods.

preprint2022arXiv

Towards Efficient and Scale-Robust Ultra-High-Definition Image Demoireing

With the rapid development of mobile devices, modern widely-used mobile phones typically allow users to capture 4K resolution (i.e., ultra-high-definition) images. However, for image demoireing, a challenging task in low-level vision, existing works are generally carried out on low-resolution or synthetic images. Hence, the effectiveness of these methods on 4K resolution images is still unknown. In this paper, we explore moire pattern removal for ultra-high-definition images. To this end, we propose the first ultra-high-definition demoireing dataset (UHDM), which contains 5,000 real-world 4K resolution image pairs, and conduct a benchmark study on current state-of-the-art methods. Further, we present an efficient baseline model ESDNet for tackling 4K moire images, wherein we build a semantic-aligned scale-aware module to address the scale variation of moire patterns. Extensive experiments manifest the effectiveness of our approach, which outperforms state-of-the-art methods by a large margin while being much more lightweight. Code and dataset are available at https://xinyu-andy.github.io/uhdm-page.

preprint2022arXiv

Video Demoireing with Relation-Based Temporal Consistency

Moire patterns, appearing as color distortions, severely degrade image and video qualities when filming a screen with digital cameras. Considering the increasing demands for capturing videos, we study how to remove such undesirable moire patterns in videos, namely video demoireing. To this end, we introduce the first hand-held video demoireing dataset with a dedicated data collection pipeline to ensure spatial and temporal alignments of captured data. Further, a baseline video demoireing model with implicit feature space alignment and selective feature aggregation is developed to leverage complementary information from nearby frames to improve frame-level video demoireing. More importantly, we propose a relation-based temporal consistency loss to encourage the model to learn temporal consistency priors directly from ground-truth reference videos, which facilitates producing temporally consistent predictions and effectively maintains frame-level qualities. Extensive experiments manifest the superiority of our model. Code is available at \url{https://daipengwa.github.io/VDmoire_ProjectPage/}.

preprint2021arXiv

Ferromagnetism with in-plane magnetization, Dirac spin-gapless semiconducting property, and tunable topological states in two-dimensional rare-earth-metal dinitrides

As the bulk single-crystal MoN2/ReN2 with a layered structure was successfully synthesized in experiment, transition-metal dinitrides have attracted considerable attention in recent years. Here, we focus on rare-earth-metal (Rem) elements and propose seven stable Rem dinitride monolayers with a 1T structure, namely 1T-RemN2. These monolayers have a ferromagnetic ground state with in-plane magnetization. Without spin-orbit coupling (SOC) effect, the band structures are spin-polarized with Dirac points at the Fermi level. Remarkably, the 1T-LuN2 monolayer shows an isotropic magnetic anisotropy energy in the xy-plane with in-plane magnetization, indicating easy tunability of the magnetization direction. When rotating the magnetization vector in the xy-plane, our proposed model can accurately describe the variety of the SOC band gap and two topological states (Weyl-like semimetal and Chern insulator states) appear with tunable properties. The Weyl-like semimetal state is a critical point between the two Chern insulator states with opposite sign of the Chern numbers. The large nontrivial band gap (up to 60.3 meV) and the Weyl-like semimetal state are promising for applications in spintronic devices.

preprint2021arXiv

Mask-GVAE: Blind Denoising Graphs via Partition

We present Mask-GVAE, a variational generative model for blind denoising large discrete graphs, in which "blind denoising" means we don't require any supervision from clean graphs. We focus on recovering graph structures via deleting irrelevant edges and adding missing edges, which has many applications in real-world scenarios, for example, enhancing the quality of connections in a co-authorship network. Mask-GVAE makes use of the robustness in low eigenvectors of graph Laplacian against random noise and decomposes the input graph into several stable clusters. It then harnesses the huge computations by decoding probabilistic smoothed subgraphs in a variational manner. On a wide variety of benchmarks, Mask-GVAE outperforms competing approaches by a significant margin on PSNR and WL similarity.

preprint2021arXiv

Structural Similarity of Boundary Conditions and an Efficient Local Search Algorithm for Goal Conflict Identification

In goal-oriented requirements engineering, goal conflict identification is of fundamental importance for requirements analysis. The task aims to find the feasible situations which make the goals diverge within the domain, called boundary conditions (BCs). However, the existing approaches for goal conflict identification fail to find sufficient BCs and general BCs which cover more combinations of circumstances. From the BCs found by these existing approaches, we have observed an interesting phenomenon that there are some pairs of BCs are similar in formula structure, which occurs frequently in the experimental cases. In other words, once a BC is found, a new BC may be discovered quickly by slightly changing the former. It inspires us to develop a local search algorithm named LOGION to find BCs, in which the structural similarity is captured by the neighborhood relation of formulae. Based on structural similarity, LOGION can find a lot of BCs in a short time. Moreover, due to the large number of BCs identified, it potentially selects more general BCs from them. By taking experiments on a set of cases, we show that LOGION effectively exploits the structural similarity of BCs. We also compare our algorithm against the two state-of-the-art approaches. The experimental results show that LOGION produces one order of magnitude more BCs than the state-of-the-art approaches and confirm that LOGION finds out more general BCs thanks to a large number of BCs.

preprint2020arXiv

Adv-BERT: BERT is not robust on misspellings! Generating nature adversarial samples on BERT

There is an increasing amount of literature that claims the brittleness of deep neural networks in dealing with adversarial examples that are created maliciously. It is unclear, however, how the models will perform in realistic scenarios where \textit{natural rather than malicious} adversarial instances often exist. This work systematically explores the robustness of BERT, the state-of-the-art Transformer-style model in NLP, in dealing with noisy data, particularly mistakes in typing the keyboard, that occur inadvertently. Intensive experiments on sentiment analysis and question answering benchmarks indicate that: (i) Typos in various words of a sentence do not influence equally. The typos in informative words make severer damages; (ii) Mistype is the most damaging factor, compared with inserting, deleting, etc.; (iii) Humans and machines have different focuses on recognizing adversarial attacks.

preprint2020arXiv

Adversarial Attack on Community Detection by Hiding Individuals

It has been demonstrated that adversarial graphs, i.e., graphs with imperceptible perturbations added, can cause deep graph models to fail on node/graph classification tasks. In this paper, we extend adversarial graphs to the problem of community detection which is much more difficult. We focus on black-box attack and aim to hide targeted individuals from the detection of deep graph community detection models, which has many applications in real-world scenarios, for example, protecting personal privacy in social networks and understanding camouflage patterns in transaction networks. We propose an iterative learning framework that takes turns to update two modules: one working as the constrained graph generator and the other as the surrogate community detection model. We also find that the adversarial graphs generated by our method can be transferred to other learning based community detection models.

preprint2020arXiv

Cartoon Face Recognition: A Benchmark Dataset

Recent years have witnessed increasing attention in cartoon media, powered by the strong demands of industrial applications. As the first step to understand this media, cartoon face recognition is a crucial but less-explored task with few datasets proposed. In this work, we first present a new challenging benchmark dataset, consisting of 389,678 images of 5,013 cartoon characters annotated with identity, bounding box, pose, and other auxiliary attributes. The dataset, named iCartoonFace, is currently the largest-scale, high-quality, richannotated, and spanning multiple occurrences in the field of image recognition, including near-duplications, occlusions, and appearance changes. In addition, we provide two types of annotations for cartoon media, i.e., face recognition, and face detection, with the help of a semi-automatic labeling algorithm. To further investigate this challenging dataset, we propose a multi-task domain adaptation approach that jointly utilizes the human and cartoon domain knowledge with three discriminative regularizations. We hence perform a benchmark analysis of the proposed dataset and verify the superiority of the proposed approach in the cartoon face recognition task. We believe this public availability will attract more research attention in broad practical application scenarios.

preprint2020arXiv

Cooperative Bi-path Metric for Few-shot Learning

Given base classes with sufficient labeled samples, the target of few-shot classification is to recognize unlabeled samples of novel classes with only a few labeled samples. Most existing methods only pay attention to the relationship between labeled and unlabeled samples of novel classes, which do not make full use of information within base classes. In this paper, we make two contributions to investigate the few-shot classification problem. First, we report a simple and effective baseline trained on base classes in the way of traditional supervised learning, which can achieve comparable results to the state of the art. Second, based on the baseline, we propose a cooperative bi-path metric for classification, which leverages the correlations between base classes and novel classes to further improve the accuracy. Experiments on two widely used benchmarks show that our method is a simple and effective framework, and a new state of the art is established in the few-shot classification field.

preprint2020arXiv

Diagrammatic Monte Carlo Method for Impurity Models with General Interactions and Hybridizations

We present a diagrammatic Monte Carlo method for quantum impurity problems with general interactions and general hybridization functions. Our method uses a recursive determinant scheme to sample diagrams for the scattering amplitude. Unlike in other methods for general impurity problems, an approximation of the continuous hybridization function by a finite number of bath states is not needed, and accessing low temperature does not incur an exponential cost. We test the method for the example of molecular systems, where we systematically vary temperature, interatomic distance, and basis set size. We further apply the method to an impurity problem generated by a self-energy embedding calculation of correlated antiferromagnetic NiO. We find that the method is ideal for quantum impurity problems with a large number of orbitals but only moderate correlations.

preprint2020arXiv

Forecasting People's Needs in Hurricane Events from Social Network

Social networks can serve as a valuable communication channel for calls for help, offering assistance, and coordinating rescue activities in disaster. Social networks such as Twitter allow users to continuously update relevant information, which is especially useful during a crisis, where the rapidly changing conditions make it crucial to be able to access accurate information promptly. Social media helps those directly affected to inform others of conditions on the ground in real time and thus enables rescue workers to coordinate their efforts more effectively, better meeting the survivors' need. This paper presents a new sequence to sequence based framework for forecasting people's needs during disasters using social media and weather data. It consists of two Long Short-Term Memory (LSTM) models, one of which encodes input sequences of weather information and the other plays as a conditional decoder that decodes the encoded vector and forecasts the survivors' needs. Case studies utilizing data collected during Hurricane Sandy in 2012, Hurricane Harvey and Hurricane Irma in 2017 were analyzed and the results compared with those obtained using a statistical language model n-gram and an LSTM generative model. Our proposed sequence to sequence method forecast people's needs more successfully than either of the other models. This new approach shows great promise for enhancing disaster management activities such as evacuation planning and commodity flow management.

preprint2020arXiv

Intrinsic Relationship Reasoning for Small Object Detection

The small objects in images and videos are usually not independent individuals. Instead, they more or less present some semantic and spatial layout relationships with each other. Modeling and inferring such intrinsic relationships can thereby be beneficial for small object detection. In this paper, we propose a novel context reasoning approach for small object detection which models and infers the intrinsic semantic and spatial layout relationships between objects. Specifically, we first construct a semantic module to model the sparse semantic relationships based on the initial regional features, and a spatial layout module to model the sparse spatial layout relationships based on their position and shape information, respectively. Both of them are then fed into a context reasoning module for integrating the contextual information with respect to the objects and their relationships, which is further fused with the original regional visual features for classification and regression. Experimental results reveal that the proposed approach can effectively boost the small object detection performance.

preprint2020arXiv

Is Depth Really Necessary for Salient Object Detection?

Salient object detection (SOD) is a crucial and preliminary task for many computer vision applications, which have made progress with deep CNNs. Most of the existing methods mainly rely on the RGB information to distinguish the salient objects, which faces difficulties in some complex scenarios. To solve this, many recent RGBD-based networks are proposed by adopting the depth map as an independent input and fuse the features with RGB information. Taking the advantages of RGB and RGBD methods, we propose a novel depth-aware salient object detection framework, which has following superior designs: 1) It only takes the depth information as training data while only relies on RGB information in the testing phase. 2) It comprehensively optimizes SOD features with multi-level depth-aware regularizations. 3) The depth information also serves as error-weighted map to correct the segmentation process. With these insightful designs combined, we make the first attempt in realizing an unified depth-aware framework with only RGB information as input for inference, which not only surpasses the state-of-the-art performances on five public RGB SOD benchmarks, but also surpasses the RGBD-based methods on five benchmarks by a large margin, while adopting less information and implementation light-weighted. The code and model will be publicly available.

preprint2020arXiv

Model-guided Multi-path Knowledge Aggregation for Aerial Saliency Prediction

As an emerging vision platform, a drone can look from many abnormal viewpoints which brings many new challenges into the classic vision task of video saliency prediction. To investigate these challenges, this paper proposes a large-scale video dataset for aerial saliency prediction, which consists of ground-truth salient object regions of 1,000 aerial videos, annotated by 24 subjects. To the best of our knowledge, it is the first large-scale video dataset that focuses on visual saliency prediction on drones. Based on this dataset, we propose a Model-guided Multi-path Network (MM-Net) that serves as a baseline model for aerial video saliency prediction. Inspired by the annotation process in eye-tracking experiments, MM-Net adopts multiple information paths, each of which is initialized under the guidance of a classic saliency model. After that, the visual saliency knowledge encoded in the most representative paths is selected and aggregated to improve the capability of MM-Net in predicting spatial saliency in aerial scenarios. Finally, these spatial predictions are adaptively combined with the temporal saliency predictions via a spatiotemporal optimization algorithm. Experimental results show that MM-Net outperforms ten state-of-the-art models in predicting aerial video saliency.

preprint2020arXiv

SEKD: Self-Evolving Keypoint Detection and Description

Researchers have attempted utilizing deep neural network (DNN) to learn novel local features from images inspired by its recent successes on a variety of vision tasks. However, existing DNN-based algorithms have not achieved such remarkable progress that could be partly attributed to insufficient utilization of the interactive characters between local feature detector and descriptor. To alleviate these difficulties, we emphasize two desired properties, i.e., repeatability and reliability, to simultaneously summarize the inherent and interactive characters of local feature detector and descriptor. Guided by these properties, a self-supervised framework, namely self-evolving keypoint detection and description (SEKD), is proposed to learn an advanced local feature model from unlabeled natural images. Additionally, to have performance guarantees, novel training strategies have also been dedicatedly designed to minimize the gap between the learned feature and its properties. We benchmark the proposed method on homography estimation, relative pose estimation, and structure-from-motion tasks. Extensive experimental results demonstrate that the proposed method outperforms popular hand-crafted and DNN-based methods by remarkable margins. Ablation studies also verify the effectiveness of each critical training strategy. We will release our code along with the trained model publicly.

preprint2020arXiv

Semi-Supervised Cervical Dysplasia Classification With Learnable Graph Convolutional Network

Cervical cancer is the second most prevalent cancer affecting women today. As the early detection of cervical carcinoma relies heavily upon screening and pre-clinical testing, digital cervicography has great potential as a primary or auxiliary screening tool, especially in low-resource regions due to its low cost and easy access. Although an automated cervical dysplasia detection system has been desirable, traditional fully-supervised training of such systems requires large amounts of annotated data which are often labor-intensive to collect. To alleviate the need for much manual annotation, we propose a novel graph convolutional network (GCN) based semi-supervised classification model that can be trained with fewer annotations. In existing GCNs, graphs are constructed with fixed features and can not be updated during the learning process. This limits their ability to exploit new features learned during graph convolution. In this paper, we propose a novel and more flexible GCN model with a feature encoder that adaptively updates the adjacency matrix during learning and demonstrate that this model design leads to improved performance. Our experimental results on a cervical dysplasia classification dataset show that the proposed framework outperforms previous methods under a semi-supervised setting, especially when the labeled samples are scarce.

preprint2020arXiv

Single Image Dehazing Using Ranking Convolutional Neural Network

Single image dehazing, which aims to recover the clear image solely from an input hazy or foggy image, is a challenging ill-posed problem. Analysing existing approaches, the common key step is to estimate the haze density of each pixel. To this end, various approaches often heuristically designed haze-relevant features. Several recent works also automatically learn the features via directly exploiting Convolutional Neural Networks (CNN). However, it may be insufficient to fully capture the intrinsic attributes of hazy images. To obtain effective features for single image dehazing, this paper presents a novel Ranking Convolutional Neural Network (Ranking-CNN). In Ranking-CNN, a novel ranking layer is proposed to extend the structure of CNN so that the statistical and structural attributes of hazy images can be simultaneously captured. By training Ranking-CNN in a well-designed manner, powerful haze-relevant features can be automatically learned from massive hazy image patches. Based on these features, haze can be effectively removed by using a haze density prediction model trained through the random forest regression. Experimental results show that our approach outperforms several previous dehazing approaches on synthetic and real-world benchmark images. Comprehensive analyses are also conducted to interpret the proposed Ranking-CNN from both the theoretical and experimental aspects.

preprint2020arXiv

The First Round Result from the TianQin-1 Satellite

The TianQin-1 satellite (TQ-1), which is the first technology demonstration satellite for the TianQin project, was launched on 20 December 2019. The first round of experiment had been carried out from 21 December 2019 until 1 April 2020. The residual acceleration of the satellite is found to be about $1\times10^{-10}~{\rm m}/{\rm s}^{2}/{\rm Hz}^{1/2}$ at $0.1~{\rm Hz}\,$ and about $5\times10^{-11}~{\rm m}/{\rm s}^{2}/{\rm Hz}^{1/2}$ at $0.05~{\rm Hz}\,$, measured by an inertial sensor with a sensitivity of $5\times10^{-12}~{\rm m}/{\rm s}^{2}/{\rm Hz}^{1/2}$ at $0.1~{\rm Hz}\,$. The micro-Newton thrusters has demonstrated a thrust resolution of $0.1~μ{\rm N}$ and a thrust noise of $0.3~μ{\rm N}/{\rm Hz}^{1/2}$ at $0.1~{\rm Hz}$. The residual noise of the satellite with drag-free control is $3\times10^{-9}~{\rm m}/{\rm s}^{2}/{\rm Hz}^{1/2}$ at $0.1~{\rm Hz}\,$. The noise level of the optical readout system is about $30~{\rm pm}/{\rm Hz}^{1/2}$ at $0.1~{\rm Hz}\,$. The temperature stability at temperature monitoring position is controlled to be about $\pm3~{\rm mK}$ per orbit, and the mismatch between the center-of-mass of the satellite and that of the test mass is measured with a precision of better than $0.1~{\rm mm}$.

preprint2020arXiv

Ultrafast Video Attention Prediction with Coupled Knowledge Distillation

Large convolutional neural network models have recently demonstrated impressive performance on video attention prediction. Conventionally, these models are with intensive computation and large memory. To address these issues, we design an extremely light-weight network with ultrafast speed, named UVA-Net. The network is constructed based on depth-wise convolutions and takes low-resolution images as input. However, this straight-forward acceleration method will decrease performance dramatically. To this end, we propose a coupled knowledge distillation strategy to augment and train the network effectively. With this strategy, the model can further automatically discover and emphasize implicit useful cues contained in the data. Both spatial and temporal knowledge learned by the high-resolution complex teacher networks also can be distilled and transferred into the proposed low-resolution light-weight spatiotemporal network. Experimental results show that the performance of our model is comparable to 11 state-of-the-art models in video attention prediction, while it costs only 0.68 MB memory footprint, runs about 10,106 FPS on GPU and 404 FPS on CPU, which is 206 times faster than previous models.

preprint2019arXiv

Direct comparison of many-body methods for realistic electronic Hamiltonians

A large collaboration carefully benchmarks 20 first principles many-body electronic structure methods on a test set of 7 transition metal atoms, and their ions and monoxides. Good agreement is attained between the 3 systematically converged methods, resulting in experiment-free reference values. These reference values are used to assess the accuracy of modern emerging and scalable approaches to the many-electron problem. The most accurate methods obtain energies indistinguishable from experimental results, with the agreement mainly limited by the experimental uncertainties. Comparison between methods enables a unique perspective on calculations of many-body systems of electrons.

preprint2019arXiv

Sparse sampling approach to efficient ab initio calculations at finite temperature

Efficient ab initio calculations of correlated materials at finite temperature require compact representations of the Green's functions both in imaginary time and Matsubara frequency. In this paper, we introduce a general procedure which generates sparse sampling points in time and frequency from compact orthogonal basis representations, such as Chebyshev polynomials and intermediate representation (IR) basis functions. These sampling points accurately resolve the information contained in the Green's function, and efficient transforms between different representations are formulated with minimal loss of information. As a demonstration, we apply the sparse sampling scheme to diagrammatic $GW$ and GF2 calculations of a hydrogen chain, of noble gas atoms and of a silicon crystal.

preprint2019arXiv

Spatiotemporal Knowledge Distillation for Efficient Estimation of Aerial Video Saliency

The performance of video saliency estimation techniques has achieved significant advances along with the rapid development of Convolutional Neural Networks (CNNs). However, devices like cameras and drones may have limited computational capability and storage space so that the direct deployment of complex deep saliency models becomes infeasible. To address this problem, this paper proposes a dynamic saliency estimation approach for aerial videos via spatiotemporal knowledge distillation. In this approach, five components are involved, including two teachers, two students and the desired spatiotemporal model. The knowledge of spatial and temporal saliency is first separately transferred from the two complex and redundant teachers to their simple and compact students, and the input scenes are also degraded from high-resolution to low-resolution to remove the probable data redundancy so as to greatly speed up the feature extraction process. After that, the desired spatiotemporal model is further trained by distilling and encoding the spatial and temporal saliency knowledge of two students into a unified network. In this manner, the inter-model redundancy can be further removed for the effective estimation of dynamic saliency on aerial videos. Experimental results show that the proposed approach outperforms ten state-of-the-art models in estimating visual saliency on aerial videos, while its speed reaches up to 28,738 FPS on the GPU platform.

preprint2017arXiv

Low Power Microwave Signal Detection With a Spin-Torque Nano-Oscillator in the Active Self-Oscillating Regime

A spin-torque nano-oscillator (STNO) driven by a ramped bias current can perform spectrum analysis quickly over a wide frequency bandwidth. The STNO spectrum analyzer operates by injection locking to external microwave signals and produces an output DC voltage $V_{\rm dc}$ that temporally encodes the input spectrum. We found, via numerical analysis with a macrospin approximation, that an STNO is able to scan a $10~\rm GHz$ bandwidth in less than $100~\rm ns$ (scanning rate $R$ exceeds $100~\rm MHz/ns$). In contrast to conventional quadratic microwave detectors, the output voltage of the STNO analyzer is proportional to the amplitude of the input microwave signal $I_{\rm rf}$ with sensitivity $S = dV_{\rm dc}/dI_{\rm rf} \approx 750~\rm mV/mA$. The minimum detectable signal of the analyzer depends on the scanning rate $R$ and, at low $R \approx 1~\rm MHz/ns$, is about $1~\rm pW$.

preprint2016arXiv

A Distance for HMMs based on Aggregated Wasserstein Metric and State Registration

We propose a framework, named Aggregated Wasserstein, for computing a dissimilarity measure or distance between two Hidden Markov Models with state conditional distributions being Gaussian. For such HMMs, the marginal distribution at any time spot follows a Gaussian mixture distribution, a fact exploited to softly match, aka register, the states in two HMMs. We refer to such HMMs as Gaussian mixture model-HMM (GMM-HMM). The registration of states is inspired by the intrinsic relationship of optimal transport and the Wasserstein metric between distributions. Specifically, the components of the marginal GMMs are matched by solving an optimal transport problem where the cost between components is the Wasserstein metric for Gaussian distributions. The solution of the optimization problem is a fast approximation to the Wasserstein metric between two GMMs. The new Aggregated Wasserstein distance is a semi-metric and can be computed without generating Monte Carlo samples. It is invariant to relabeling or permutation of the states. This distance quantifies the dissimilarity of GMM-HMMs by measuring both the difference between the two marginal GMMs and the difference between the two transition matrices. Our new distance is tested on the tasks of retrieval and classification of time series. Experiments on both synthetic data and real data have demonstrated its advantages in terms of accuracy as well as efficiency in comparison with existing distances based on the Kullback-Leibler divergence.

preprint2016arXiv

AutoScaler: Scale-Attention Networks for Visual Correspondence

Finding visual correspondence between local features is key to many computer vision problems. While defining features with larger contextual scales usually implies greater discriminativeness, it could also lead to less spatial accuracy of the features. We propose AutoScaler, a scale-attention network to explicitly optimize this trade-off in visual correspondence tasks. Our network consists of a weight-sharing feature network to compute multi-scale feature maps and an attention network to combine them optimally in the scale space. This allows our network to have adaptive receptive field sizes over different scales of the input. The entire network is trained end-to-end in a siamese framework for visual correspondence tasks. Our method achieves favorable results compared to state-of-the-art methods on challenging optical flow and semantic matching benchmarks, including Sintel, KITTI and CUB-2011. We also show that our method can generalize to improve hand-crafted descriptors (e.g Daisy) on general visual correspondence tasks. Finally, our attention network can generate visually interpretable scale attention maps.

preprint2016arXiv

Hidden Markov Models on Variable Blocks with a Modal Clustering Algorithm and Applications

Motivated by high-throughput single-cell cytometry data with applications to vaccine development and immunological research, we consider statistical clustering in large-scale data that contain multiple rare clusters. We propose a new hierarchical mixture model, namely Hidden Markov Model on Variable Blocks (HMM-VB), and a new mode search algorithm called Modal Baum-Welch (MBW) for efficient clustering. Exploiting the widely accepted chain-like dependence among groups of variables in the cytometry data, we propose to treat the hierarchy of variable groups as a figurative time line and employ a HMM-type model, namely HMM-VB. We also propose to use mode-based clustering, aka modal clustering, and overcome the exponential computational complexity by MBW. In a series of experiments on simulated data HMM-VB and MBW have better performance than existing methods. We also apply our method to identify rare cell subsets in cytometry data and examine its strengths and limitations.

preprint2016arXiv

Modeling Photographic Composition via Triangles

The capacity of automatically modeling photographic composition is valuable for many real-world machine vision applications such as digital photography, image retrieval, image understanding, and image aesthetics assessment. The triangle technique is among those indispensable composition methods on which professional photographers often rely. This paper proposes a system that can identify prominent triangle arrangements in two major categories of photographs: natural or urban scenes, and portraits. For the natural or urban scene pictures, the focus is on the effect of linear perspective. For portraits, we carefully examine the positioning of human subjects in a photo. We show that line analysis is highly advantageous for modeling composition in both categories. Based on the detected triangles, new mathematical descriptors for composition are formulated and used to retrieve similar images. Leveraging the rich source of high aesthetics photos online, similar approaches can potentially be incorporated in future smart cameras to enhance a person's photo composition skills.

preprint2016arXiv

Robust Coordinated Transmission and Generation Expansion Planning Considering Ramping Requirements and Construction Periods

Two critical issues have arisen in transmission expansion planning with the rapid growth of wind power generation. First, severe power ramping events in daily operation due to the high variability of wind power generation pose great challenges to multi-year planning decision making. Second, the long construction periods of transmission lines may not be able to keep pace with the fast growing uncertainty due to the increasing integration of renewable energy generation. To address such issues, we propose a comprehensive robust planning model considering different resources, namely, transmission lines, generators, and FACTS devices. Various factors are taken into account, including flexibility requirement, construction period, and cost. We construct the hourly net load ramping uncertainty (HLRU) set to characterize the variation of hourly net load including wind power generation, and the annual net load duration curve uncertainty (LDCU) set for the uncertainty of normal annual net load duration curve. This results in a two-stage robust optimization model with two different types of uncertainty sets, which are decoupled into two different sets of subproblems to make the entire solution process tractable. Numerical simulations with real-world data show that the proposed model and solution method are effective to coordinate different flexible resources, rendering robust expansion planning strategies.

preprint2016arXiv

Spin-current probe for phase transition in an insulator

Spin fluctuation and transition have always been one of central topics of magnetism and condensed matter science. Experimentally, the spin fluctuation is found transcribed onto scattering intensity in the neutron scattering process, which is represented by dynamical magnetic susceptibility and maximized at phase transitions. Importantly, a neutron carries spin without electric charge, and it can bring spin into a sample without being disturbed by electric energy, although large facilities such as a nuclear reactor is necessary. Here we show that spin pumping, frequently used in nanoscale spintronic devices, provides a desktop micro probe for spin transition; spin current is a flux of spin without an electric charge and its transport reflects spin excitation. We demonstrate detection of antiferromagnetic transition in ultra-thin CoO films via frequency dependent spin-current transmission measurements, which provides a versatile probe for phase transition in an electric manner in minute devices.

preprint2015arXiv

Gaussian Mixture Models with Component Means Constrained in Pre-selected Subspaces

We investigate a Gaussian mixture model (GMM) with component means constrained in a pre-selected subspace. Applications to classification and clustering are explored. An EM-type estimation algorithm is derived. We prove that the subspace containing the component means of a GMM with a common covariance matrix also contains the modes of the density and the class means. This motivates us to find a subspace by applying weighted principal component analysis to the modes of a kernel density and the class means. To circumvent the difficulty of deciding the kernel bandwidth, we acquire multiple subspaces from the kernel densities based on a sequence of bandwidths. The GMM constrained by each subspace is estimated; and the model yielding the maximum likelihood is chosen. A dimension reduction property is proved in the sense of being informative for classification or clustering. Experiments on real and simulated data sets are conducted to examine several ways of determining the subspace and to compare with the reduced rank mixture discriminant analysis (MDA). Our new method with the simple technique of spanning the subspace only by class means often outperforms the reduced rank MDA when the subspace dimension is very low, making it particularly appealing for visualization.

preprint2015arXiv

Graphene/g-C2N bilayer:gap opening, enhanced visible light response and electrical field tuning band structure

Opening up a band gap of the graphene and finding a suitable substrate are two challenges for constituting the nano-electronic equipment. A new two-dimensional layered crystal g-C2N (Nat. Commun. 2015, 6, 1--7) with novel electronic and optical properties can be effectively synthesized via a wet-chemical reaction. And g-C2N can be used as a suitable substrate to open the band gap of graphene as much as 0.239 eV, which is large enough for the band gap opening at room temperature. The physics behind the band gap opening is that g-C2N substrate can produce the inhomogeneous electrostatic potential over the graphene layer. The imposition of external electrical field can tune the band gap of the hybrid of graphene/g-C2N effectively from the semiconductors to the metal. The hybrid graphene/g-C2N displays an enhanced optical activity compared with the pure g-C2N monolayer.

preprint2015arXiv

Monolayer charge-neutral graphene on platinum with extremely weak electron-phonon coupling

Epitaxial growth of graphene on transition metal substrates is an important route for obtaining large scale graphene. However, the interaction between graphene and the substrate often leads to multiple orientations, distorted graphene band structure, large doping and strong electron-phonon coupling. Here we report the growth of monolayer graphene with high crystalline quality on Pt(111) substrate by using a very low concentration of an internal carbon source with high annealing temperature. The controlled growth leads to electronically decoupled graphene: it is nearly charge neutral and has extremely weak electron-phonon coupling (coupling strength $λ$ $\approx$ 0.056) as revealed by angle-resolved photoemission spectroscopic measurements. The thermodynamics and kinetics of the carbon diffusion process is investigated by DFT calculation. Such graphene with negligible graphene-substrate interaction provides an important platform for fundamental research as well as device applications when combined with a nondestructive sample transfer technique.

preprint2015arXiv

Theory of the Dirac Half Metal and Quantum Anomalous Hall Effect in Mn Intercalated Epitaxial Graphene

The prospect of a Dirac half metal, a material which is characterized by a bandstructure with a gap in one spin channel but a Dirac cone in the other, is of both fundamental interest and a natural candidate for use in spin-polarized current applications. However, while the possibility of such a material has been reported based on model calculations[H. Ishizuka and Y. Motome, Phys. Rev. Lett. 109, 237207 (2012)], it remains unclear what material system might realize such an exotic state. Using first-principles calculations, we show that the experimentally accessible Mn intercalated epitaxial graphene on SiC(0001) transits to a Dirac half metal when the coverage is > 1/3 monolayer. This transition results from an orbital-selective breaking of quasi-2D inversion symmetry, leading to symmetry breaking in a single spin channel which is robust against randomness in the distribution of Mn intercalates. Furthermore, the inclusion of spin-orbit interaction naturally drives the system into the quantum anomalous Hall (QAH) state. Our results thus not only demonstrate the practicality of realizing the Dirac half metal beyond a toy model but also open up a new avenue to the realization of the QAH effect.

preprint2014arXiv

A random attractor for stochastic porous media equations on infinite lattices

The paper is devoted to studying the existence of a random attractor for stochastic porous media equations on infinite lattices under some conditions.

preprint2014arXiv

Manganese reduction/oxidation reaction on graphene composites as a reversible process for storing enormous energy at a fast rate

Oxygen reduction/evolution reaction (ORR/OER) is a basic process for fuel cells or metal air batteries. However, ORR/OER generally requires noble metal catalysts and suffers from low solubility (10-3 molar per liter) of O2, low kinetics rate (10-6 cm2/s) and low reversibility. We report a manganese reduction/oxidation reaction (MRR/MOR) on graphene/MnO2 composites, delivering a high capacity (4200 mAh/g), fast kinetics (0.0024 cm2/s, three orders higher than ORR/OER), high solubility (three orders than O2), and high reversibility (100%). We further use MRR/MOR to invent a rechargeable manganese ion battery (MIB), which delivers an energy density of 1200 Wh/Kg (several times of lithium ion battery), a fast charge ability (3 minutes), and a long cycle life (10,000 cycles). MRR/MOR renders a new class of energy conversion or storage systems with a very high energy density enabling electric vehicles run much more miles at one charge.

preprint2014arXiv

Random Attractor For Stochastic Lattice FitzHugh-Nagumo System Driven By $α$-stable Lévy Noises

The present paper is devoted to the existence of a random attractor for stochastic lattice FitzHugh-Nagumo system driven by $α$-stable Lévy noises under some dissipative conditions.

preprint2013arXiv

An adaptively weighted statistic for detecting differential gene expression when combining multiple transcriptomic studies

Global expression analyses using microarray technologies are becoming more common in genomic research, therefore, new statistical challenges associated with combining information from multiple studies must be addressed. In this paper we will describe our proposal for an adaptively weighted (AW) statistic to combine multiple genomic studies for detecting differentially expressed genes. We will also present our results from comparisons of our proposed AW statistic to Fisher's equally weighted (EW), Tippett's minimum p-value (minP) and Pearson's (PR) statistics. Due to the absence of a uniformly powerful test, we used a simplified Gaussian scenario to compare the four methods. Our AW statistic consistently produced the best or near-best power for a range of alternative hypotheses. AW-obtained weights also have the additional advantage of filtering discordant biomarkers and providing natural detected gene categories for further biological investigation. Here we will demonstrate the superior performance of our proposed AW statistic based on a mix of power analyses, simulations and applications using data sets for multi-tissue energy metabolism mouse, multi-lab prostate cancer and lung cancer.

preprint2013arXiv

Parallel D2-Clustering: Large-Scale Clustering of Discrete Distributions

The discrete distribution clustering algorithm, namely D2-clustering, has demonstrated its usefulness in image classification and annotation where each object is represented by a bag of weighed vectors. The high computational complexity of the algorithm, however, limits its applications to large-scale problems. We present a parallel D2-clustering algorithm with substantially improved scalability. A hierarchical structure for parallel computing is devised to achieve a balance between the individual-node computation and the integration process of the algorithm. Additionally, it is shown that even with a single CPU, the hierarchical structure results in significant speed-up. Experiments on real-world large-scale image data, Youtube video data, and protein sequence data demonstrate the efficiency and wide applicability of the parallel D2-clustering algorithm. The loss in clustering accuracy is minor in comparison with the original sequential algorithm.

preprint2013arXiv

Volatility occupation times

We propose nonparametric estimators of the occupation measure and the occupation density of the diffusion coefficient (stochastic volatility) of a discretely observed Itô semimartingale on a fixed interval when the mesh of the observation grid shrinks to zero asymptotically. In a first step we estimate the volatility locally over blocks of shrinking length, and then in a second step we use these estimates to construct a sample analogue of the volatility occupation time and a kernel-based estimator of its density. We prove the consistency of our estimators and further derive bounds for their rates of convergence. We use these results to estimate nonparametrically the quantiles associated with the volatility occupation measure.

preprint2012arXiv

Dirac Fermion in Strongly-Bound Graphene Systems

It is highly desirable to integrate graphene into existing semiconductor technology, where the combined system is thermodynamically stable yet maintain a Dirac cone at the Fermi level. Firstprinciples calculations reveal that a certain transition metal (TM) intercalated graphene/SiC(0001), such as the strongly-bound graphene/intercalated-Mn/SiC, could be such a system. Different from free-standing graphene, the hybridization between graphene and Mn/SiC leads to the formation of a dispersive Dirac cone of primarily TM d characters. The corresponding Dirac spectrum is still isotropic, and the transport behavior is nearly identical to that of free-standing graphene for a bias as large as 0.6 V, except that the Fermi velocity is half that of graphene. A simple model Hamiltonian is developed to qualitatively account for the physics of the transfer of the Dirac cone from a dispersive system (e.g., graphene) to an originally non-dispersive system (e.g., TM).

preprint2012arXiv

The transition from amorphous to crystalline in Al/Zr multilayers

The amorphous-to-crystalline transition in Al(1.0%wtSi)/Zr and Al(Pure)/Zr multilayers grown by direct-current magnetron sputtering system has been characterized over a range of Al layer thicknesses (1.0-5.0 nm) by using a series of complementary measurements including grazing incidence X-ray reflectometry, atomic force microscopy, X-ray diffraction and high-resolution transmission electron microscopy. The Al layer thickness transition exhibits the Si doped in Al could not only disfavor the crystallization of Al, but also influence the changing trends of surface roughness and diffraction peak position of phase Al<111>. An interesting feature of the presence of Si in Al layer is that Si could influence the transition process in Al(1%wtSi) layer, in which the critical thickness (1.6 nm) of Al(Pure) layer in Al(Pure)/Zr shifts to 1.8 nm of Al(1.0%wtSi) layer in Al(1.0%wtSi)/Zr multilayer. We also found that the Zr-on-Al interlayer is wider than the Al-on-Zr interlayer in both systems, and the Al layers do not have specific crystal orientation in the directions vertical to the layer from SAED patterns below the thickness (3.0 nm) of Al layers. Above the thickness (3.0 nm) of Al layers, the Al layers are highly oriented in Al<111>, so that the transformation from asymmetrical to symmetrical interlayers can be observed. Based on the analysis of all measurements, we build up a model with four steps, which could explain the Al layer thickness transition process in terms of a critical thickness for the nucleation of Al(Pure) and Al(1%wtSi) crystallites.

preprint2012arXiv

Thermally induced structural modification in the Al/Zr multilayers

The effect of increasing temperature on the structural stability and interactions of two kinds of Al/Zr (Al(1%wtSi)/Zr and Al(Pure)/Zr) multilayer mirrors are investigated. All Al/Zr multilayers annealed from 200^{\circ}C to 500^{\circ}C, were deposited on Si wafers by using direct-current magnetron sputtering technology. A detailed and consistent picture of the thermally induced changes in the microstructure is obtained using an array of complementary measurements including grazing incidence X-ray reflectance, atomic force microscope, X-ray diffraction and high-resolution transmission electron microscopy. The first significant structural changes of two systems are observed at 250^{\circ}C, characterized by asymmetric interlayers appears at interface. At 290^{\circ}C, the interface consisted of amorphous Al-Zr alloy is transformed to amorphous Al-Zr alloy and cubic ZrAl3 in both systems. By 298^{\circ}C of Al(1%wtSi)/Zr and 295^{\circ}C of Al(Pure)/Zr multilayers, the interfacial phases of Al-Zr alloy transform completely into polycrystalline mixtures of hcp-ZrAl2 and cubic-ZrAl3, which smooth the interface boundary and lower the surface roughness in the multilayers. Up to 500^{\circ}C, the multilayer structure still exists in both systems, and the differences between the asymmetric interlayers are much larger in the multilayers. Finally, we discuss the transformation from symmetric to asymmetric in the annealing process for other systems.

preprint2011arXiv

Activated dissociation of O2 on Pb(111) surfaces by Pb adatoms

We investigate the dissociation of O2 on Pb(111) surface using first-principles calculations. It is found that in a practical high-vacuum environment, the adsorption of molecular O2 takes place on clean Pb surfaces only at low temperatures such as 100 K, but the O2 easily desorbs at (elevated) room temperatures. It is further found that the Pb adatoms enhance the molecular adsorption and activate the adsorbed O2 to dissociate during subsequent room-temperature annealing. Our theory explains the observation of a two-step oxidation process on the Pb surfaces by the unique role of Pb adatoms.

preprint2010arXiv

Spontaneous edge-defect formation and defect-induced conductance suppression in graphene nanoribbons

We present a first-principles study of the migration and recombination of edge defects (carbon adatom and/or vacancy) and their influence on electrical conductance in zigzag graphene nanoribbons (ZGNRs). It is found that at room temperature, the adatom is quite mobile while the vacancy is almost immobile along the edge of ZGNRs. The recombination of an adatom-vacancy pair leads to a pentagon-heptagon ring defect structure having a lower energy than the perfect edge, implying that such an edge-defect can be formed spontaneously. This edge defect can suppresses the conductance of ZGNRs drastically, which provides some useful hints for understanding the observed semiconducting behavior of the fabricated narrow GNRs.

preprint2009arXiv

Ambient Isotopic Meshing of Implicit Algebraic Surface with Singularities

A complete method is proposed to compute a certified, or ambient isotopic, meshing for an implicit algebraic surface with singularities. By certified, we mean a meshing with correct topology and any given geometric precision. We propose a symbolic-numeric method to compute a certified meshing for the surface inside a box containing singularities and use a modified Plantinga-Vegter marching cube method to compute a certified meshing for the surface inside a box without singularities. Nontrivial examples are given to show the effectiveness of the algorithm. To our knowledge, this is the first method to compute a certified meshing for surfaces with singularities.

Jia Li

What is connected

Connect this record

See the researcher in context

Building this map preview

72 published item(s)

Signal Reshaping for GRPO in Weak-Feedback Agentic Code Repair

To Use AI as Dice of Possibilities with Timing Computation

Is Bigger and Deeper Always Better? Probing LLaMA Across Scales and Layers

Rectangular carbon nitrides C4N monolayers with a zigzag buckled structure: Quasi-one-dimensional Dirac nodal lines and topological flat edge states

Rethinking Lightweight Salient Object Detection via Network Depth-Width Tradeoff

Self-supervised Hypergraph Representation Learning for Sociological Analysis

A Domain-Specific Language for Simulation-Based Testing of IoT Edge-to-Cloud Solutions

Automatic detection of low surface brightness galaxies from SDSS images

Coexistence of extended flat band and Kekulé order in Li-intercalated graphene

Complementary Feature Enhanced Network with Vision Transformer for Image Dehazing

ConTinTin: Continual Learning from Task Instructions

Cross-scale Attention Guided Multi-instance Learning for Crohn's Disease Diagnosis with Pathological Images

Deep Learning Eliminates Massive Dust Storms from Images of Tianwen-1

ELECRec: Training Sequential Recommenders as Discriminators

Generating Negative Samples for Sequential Recommendation

Hybrid Multimodal Feature Extraction, Mining and Fusion for Sentiment Analysis

Improving Contrastive Learning with Model Augmentation

Intent Contrastive Learning for Sequential Recommendation

Interaction expansion inchworm Monte Carlo solver for lattice and impurity models

Learning Self-adaptations for IoT Networks: A Genetic Programming Approach

MACE: An Efficient Model-Agnostic Framework for Counterfactual Explanation

Permutation-based tests for discontinuities in event studies

Pyramid Grafting Network for One-Stage High Resolution Saliency Detection

ReAct: Temporal Action Detection with Relational Queries

Rethinking Graph Neural Networks for Anomaly Detection

RGRecSys: A Toolkit for Robustness Evaluation of Recommender Systems

Styleverse: Towards Identity Stylization across Heterogeneous Domains

Towards Efficient and Scale-Robust Ultra-High-Definition Image Demoireing

Video Demoireing with Relation-Based Temporal Consistency

Ferromagnetism with in-plane magnetization, Dirac spin-gapless semiconducting property, and tunable topological states in two-dimensional rare-earth-metal dinitrides

Mask-GVAE: Blind Denoising Graphs via Partition

Structural Similarity of Boundary Conditions and an Efficient Local Search Algorithm for Goal Conflict Identification

Adv-BERT: BERT is not robust on misspellings! Generating nature adversarial samples on BERT

Adversarial Attack on Community Detection by Hiding Individuals

Cartoon Face Recognition: A Benchmark Dataset

Cooperative Bi-path Metric for Few-shot Learning

Diagrammatic Monte Carlo Method for Impurity Models with General Interactions and Hybridizations

Forecasting People's Needs in Hurricane Events from Social Network

Intrinsic Relationship Reasoning for Small Object Detection

Is Depth Really Necessary for Salient Object Detection?

Model-guided Multi-path Knowledge Aggregation for Aerial Saliency Prediction

SEKD: Self-Evolving Keypoint Detection and Description

Semi-Supervised Cervical Dysplasia Classification With Learnable Graph Convolutional Network

Single Image Dehazing Using Ranking Convolutional Neural Network

The First Round Result from the TianQin-1 Satellite

Ultrafast Video Attention Prediction with Coupled Knowledge Distillation

Direct comparison of many-body methods for realistic electronic Hamiltonians

Sparse sampling approach to efficient ab initio calculations at finite temperature

Spatiotemporal Knowledge Distillation for Efficient Estimation of Aerial Video Saliency

Low Power Microwave Signal Detection With a Spin-Torque Nano-Oscillator in the Active Self-Oscillating Regime

A Distance for HMMs based on Aggregated Wasserstein Metric and State Registration

AutoScaler: Scale-Attention Networks for Visual Correspondence

Hidden Markov Models on Variable Blocks with a Modal Clustering Algorithm and Applications

Modeling Photographic Composition via Triangles

Robust Coordinated Transmission and Generation Expansion Planning Considering Ramping Requirements and Construction Periods

Spin-current probe for phase transition in an insulator

Gaussian Mixture Models with Component Means Constrained in Pre-selected Subspaces

Graphene/g-C2N bilayer:gap opening, enhanced visible light response and electrical field tuning band structure

Monolayer charge-neutral graphene on platinum with extremely weak electron-phonon coupling

Theory of the Dirac Half Metal and Quantum Anomalous Hall Effect in Mn Intercalated Epitaxial Graphene

A random attractor for stochastic porous media equations on infinite lattices

Manganese reduction/oxidation reaction on graphene composites as a reversible process for storing enormous energy at a fast rate

Random Attractor For Stochastic Lattice FitzHugh-Nagumo System Driven By $α$-stable Lévy Noises

An adaptively weighted statistic for detecting differential gene expression when combining multiple transcriptomic studies

Parallel D2-Clustering: Large-Scale Clustering of Discrete Distributions

Volatility occupation times

Dirac Fermion in Strongly-Bound Graphene Systems

The transition from amorphous to crystalline in Al/Zr multilayers

Thermally induced structural modification in the Al/Zr multilayers

Activated dissociation of O2 on Pb(111) surfaces by Pb adatoms

Spontaneous edge-defect formation and defect-induced conductance suppression in graphene nanoribbons

Ambient Isotopic Meshing of Implicit Algebraic Surface with Singularities