Researcher profile

Yijun Yu

Yijun Yu contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - Emerging
6works
0followers
6topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

6 published item(s)

preprint2022arXiv

Energy-bounded Learning for Robust Models of Code

In programming, learning code representations has a variety of applications, including code classification, code search, comment generation, bug prediction, and so on. Various representations of code in terms of tokens, syntax trees, dependency graphs, code navigation paths, or a combination of their variants have been proposed, however, existing vanilla learning techniques have a major limitation in robustness, i.e., it is easy for the models to make incorrect predictions when the inputs are altered in a subtle way. To enhance the robustness, existing approaches focus on recognizing adversarial samples rather than on the valid samples that fall outside a given distribution, which we refer to as out-of-distribution (OOD) samples. Recognizing such OOD samples is the novel problem investigated in this paper. To this end, we propose to first augment the in=distribution datasets with out-of-distribution samples such that, when trained together, they will enhance the model's robustness. We propose the use of an energy-bounded learning objective function to assign a higher score to in-distribution samples and a lower score to out-of-distribution samples in order to incorporate such out-of-distribution samples into the training process of source code models. In terms of OOD detection and adversarial samples detection, our evaluation results demonstrate a greater robustness for existing source code models to become more accurate at recognizing OOD data while being more resistant to adversarial attacks at the same time. Furthermore, the proposed energy-bounded score outperforms all existing OOD detection scores by a large margin, including the softmax confidence score, the Mahalanobis score, and ODIN.

preprint2021arXiv

On the Generalizability of Neural Program Models with respect to Semantic-Preserving Program Transformations

With the prevalence of publicly available source code repositories to train deep neural network models, neural program models can do well in source code analysis tasks such as predicting method names in given programs that cannot be easily done by traditional program analysis techniques. Although such neural program models have been tested on various existing datasets, the extent to which they generalize to unforeseen source code is largely unknown. Since it is very challenging to test neural program models on all unforeseen programs, in this paper, we propose to evaluate the generalizability of neural program models with respect to semantic-preserving transformations: a generalizable neural program model should perform equally well on programs that are of the same semantics but of different lexical appearances and syntactical structures. We compare the results of various neural program models for the method name prediction task on programs before and after automated semantic-preserving transformations. We use three Java datasets of different sizes and three state-of-the-art neural network models for code, namely code2vec, code2seq, and GGNN, to build nine such neural program models for evaluation. Our results show that even with small semantically preserving changes to the programs, these neural program models often fail to generalize their performance. Our results also suggest that neural program models based on data and control dependencies in programs generalize better than neural program models based only on abstract syntax trees. On the positive side, we observe that as the size of the training dataset grows and diversifies the generalizability of correct predictions produced by the neural program models can be improved too. Our results on the generalizability of neural program models provide insights to measure their limitations and provide a stepping stone for their improvement.

preprint2019arXiv

Magnetic-field-induced quantized anomalous Hall effect in intrinsic magnetic topological insulator MnBi$_2$Te$_4$

In a magnetic topological insulator, nontrivial band topology conspires with magnetic order to produce exotic states of matter that are best exemplified by quantum anomalous Hall (QAH) insulators and axion insulators. Up till now, such magnetic topological insulators are obtained by doping topological insulators with magnetic atoms. The random magnetic dopants, however, inevitably introduce disorders that hinder further exploration of quantum effects in the material. Here, we resolve this dilemma by probing quantum transport in MnBi$_2$Te$_4$ thin flake - a topological insulator with intrinsic magnetic order. In this layered van der Waals crystal, the ferromagnetic layers couple anti-parallel to each other, so MnBi$_2$Te$_4$ is an antiferromagnet. A magnetic field, however, aligns all the layers and induces an interlayer ferromagnetic order; we show that a quantized anomalous Hall response emerges in atomically thin MnBi$_2$Te$_4$ under a moderate magnetic field. MnBi$_2$Te$_4$ therefore becomes the first intrinsic magnetic topological insulator exhibiting quantized anomalous Hall effect. The result establishes MnBi$_2$Te$_4$ as an ideal arena for further exploring various topological phenomena.

preprint2015arXiv

A metallic mosaic phase and the origin of Mott insulating state in 1T-TaS2

Electron-electron and electron-phonon interactions are two major driving forces that stabilize various charge-ordered phases of matter. The intricate interplay between the two give rises to a peculiar charge density wave (CDW) state, which is also known as a Mott insulator, as the ground state of layered compound 1T-TaS2. The delicate balance also makes it possible to use external perturbations to create and manipulate novel phases in this material. Here, we study a mosaic CDW phase induced by voltage pulses from the tip of a scanning tunneling microscope (STM), and find that the new phase exhibit electronic structures that are entirely different from the Mott ground state of 1T-TaS2 at low temperatures. The mosaic phase consists of nanometer-sized domains characterized by well-defined phase shifts of the CDW order parameter in the topmost layer, and by altered stacking relative to the layer underneath. We discover that the nature of the new phases is dictated by the stacking order, and our results shed fresh light on the origin of the Mott phase in this layered compound.

preprint2014arXiv

Black phosphorus field-effect transistors

Two-dimensional crystals have emerged as a new class of materials with novel properties that may impact future technologies. Experimentally identifying and characterizing new functional two-dimensional materials in the vast material pool is a tremendous challenge, and at the same time potentially rewarding. In this work, we succeed in fabricating field-effect transistors based on few-layer black phosphorus crystals with thickness down to a few nanometers. Drain current modulation on the order of 10E5 is achieved in samples thinner than 7.5 nm at room temperature, with well-developed current saturation in the IV characteristics, both are important for reliable transistor performance of the device. Sample mobility is also found to be thickness dependent, with the highest value up to ~ 1000 cm2/Vs obtained at thickness ~ 10 nm. Our results demonstrate the potential of black phosphorus thin crystal as a new two-dimensional material for future applications in nano-electronic devices.

preprint2014arXiv

Gate-tunable Phase Transitions in 1T-TaS$_2$

The ability to tune material properties using gate electric field is at the heart of modern electronic technology. It is also a driving force behind recent advances in two-dimensional systems, such as gate-electric-field induced superconductivity and metal-insulator transition. Here we describe an ionic field-effect transistor (termed "iFET"), which uses gate-controlled lithium ion intercalation to modulate the material property of layered atomic crystal 1T-TaS$_2$. The extreme charge doping induced by the tunable ion intercalation alters the energetics of various charge-ordered states in 1T-TaS$_2$, and produces a series of phase transitions in thin-flake samples with reduced dimensionality. We find that the charge-density-wave states in 1T-TaS$_2$ are three-dimensional in nature, and completely collapse in the two-dimensional limit defined by their critical thicknesses. Meanwhile the ionic gating induces multiple phase transitions from Mott-insulator to metal in 1T-TaS$_2$ thin flakes at low temperatures, with 5 orders of magnitude modulation in their resistance. Superconductivity emerges in a textured charge-density-wave state induced by ionic gating. Our method of gate-controlled intercalation of 2D atomic crystals in the bulk limit opens up new possibilities in searching for novel states of matter in the extreme charge-carrier-concentration limit.