Source author record

Junjie Yang

Junjie Yang appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

cond-mat.mtrl-sci Artificial Intelligence Computation and Language cond-mat.str-el Machine Learning cond-mat.mes-hall cond-mat.supr-con Cryptography and Security Human-Computer Interaction Information Theory math.IT math.OC Neurons and Cognition Robotics

Catalog footprint

What is connected

11works

14topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

Scalable Oversight for Superhuman AI via Recursive Self-Critiquing

As AI capabilities increasingly surpass human proficiency in complex tasks, current alignment techniques, including SFT and RLHF, face fundamental challenges in ensuring reliable oversight. These methods rely on direct human assessment and become impractical when AI outputs exceed human cognitive thresholds. In response to this challenge, we explore two hypotheses: (1) \textit{Critique of critique can be easier than critique itself}, extending the widely-accepted observation that verification is easier than generation to the critique domain, as critique itself is a specialized form of generation; (2) \textit{This difficulty relationship holds recursively}, suggesting that when direct evaluation is infeasible, performing higher-order critiques (e.g., critique of critique of critique) offers a more tractable supervision pathway. We conduct Human-Human, Human-AI, and AI-AI experiments to investigate the potential of recursive self-critiquing for AI supervision. Our results highlight recursive critique as a promising approach for scalable AI oversight.

preprint2025arXiv

Active Sensing Shapes Real-World Decision-Making through Dynamic Evidence Accumulation

Human decision-making heavily relies on active sensing, a well-documented cognitive behaviour for evidence gathering to accommodate ever-changing environments. However, its operational mechanism in the real world remains non-trivial. Currently, an in-laboratory paradigm, called evidence accumulation modelling (EAM), points out that human decision-making involves transforming external evidence into internal mental beliefs. However, the gap in evidence affordance between real-world contexts and laboratory settings hinders the effective application of EAM. Here we generalize EAM to the real world and conduct analysis in real-world driving scenarios. A cognitive scheme is proposed to formalize real-world evidence affordance and capture active sensing through eye movements. Empirically, our scheme can plausibly portray the accumulation of drivers' mental beliefs, explaining how active sensing transforms evidence into mental beliefs from the perspective of information utility. Also, our results demonstrate a negative correlation between evidence affordance and attention recruited by individuals, revealing how human drivers adapt their evidence-collection patterns across various contexts. Moreover, we reveal the positive influence of evidence affordance and attention distribution on decision-making propensity. In a nutshell, our computational scheme generalizes EAM to real-world contexts and provides a comprehensive account of how active sensing underlies real-world decision-making, unveiling multifactorial, integrated characteristics in real-world decision-making.

preprint2020arXiv

Deepening Hidden Representations from Pre-trained Language Models

Transformer-based pre-trained language models have proven to be effective for learning contextualized language representation. However, current approaches only take advantage of the output of the encoder's final layer when fine-tuning the downstream tasks. We argue that only taking single layer's output restricts the power of pre-trained representation. Thus we deepen the representation learned by the model by fusing the hidden representation in terms of an explicit HIdden Representation Extractor (HIRE), which automatically absorbs the complementary representation with respect to the output from the final layer. Utilizing RoBERTa as the backbone encoder, our proposed improvement over the pre-trained models is shown effective on multiple natural language understanding tasks and help our model rival with the state-of-the-art models on the GLUE benchmark.

preprint2020arXiv

Evolution of the structural transition in Mo$_{1-x}$W$_{x}$Te$_{2}$

The composition dependence of the structural transition between the monoclinic 1T$^{\prime}$ and orthorhombic T$_{d}$ phases in the Mo$_{1-x}$W$_{x}$Te$_{2}$ Weyl semimetal was investigated by elastic neutron scattering on single crystals up to $x \approx 0.54$. First observed in MoTe$_{2}$, the transition from T$_{d}$ to 1T$^{\prime}$ is accompanied by an intermediate pseudo-orthorhombic phase, T$_{d}^{*}$. Upon doping with W, the T$_{d}^{*}$ phase vanishes by $x \approx 0.34$. Above this concentration, a phase coexistence behavior with both T$_{d}$ and 1T$^{\prime}$ is observed instead. The interlayer in-plane positioning parameter $δ$, which relates to the 1T$^{\prime}$ $β$ angle, decreases with temperature as well as with W substitution, likely due to strong anharmonicity in the interlayer interactions. The temperature width of the phase coexistence remains almost constant up to $x \approx 0.54$, in contrast to the broadening reported under pressure.

preprint2020arXiv

Second-order nonlinear optical and linear UV-VIS absorption properties of type-II multiferroic candidates RbFe(AO4)2 (A = Mo, Se, S)

Motivated by the search for type-II multiferroics, we present a comprehensive optical study of a complex oxide family of type-II multiferroic candidates: RbFe(MoO4)2, RbFe(SeO4)2, and RbFe(SO4)2. We employ rotational-anisotropy second harmonic generation spectroscopy (RA SHG), a technique sensitive to point symmetries, to address discrepancies in literature-assigned point/space groups and to identify the correct crystal structures. At room temperature we find that our RA SHG patterns rotate away from the crystal axes in RbFe(AO4)2 (A = Se, S), which identifies the lack of mirror symmetry and in-plane two-fold rotational symmetry. Also, the SHG efficiency of RbFe(SeO4)2 is two orders of magnitude stronger than RbFe(AO4)2 (A = Mo, S), which suggests broken inversion symmetry. Additionally, we present temperature-dependent linear optical characterizations near the band edge of this family of materials using ultraviolet-visible (UV-VIS) absorption spectroscopy. Included is experimental evidence of the band gap energy and band gap transition type for this family. Previously unreported sub-band gap absorption is also presented, which reveals prominent optical transitions, some with an unusual central energy temperature dependence. Furthermore, we find that by substituting the A-site in RbFe(AO4)2 (A = Mo, Se, S), the aforementioned transitions are spectrally tunable. Finally, we discuss the potential origin and impact of these tunable transitions.

preprint2020arXiv

Theoretical Convergence of Multi-Step Model-Agnostic Meta-Learning

As a popular meta-learning approach, the model-agnostic meta-learning (MAML) algorithm has been widely used due to its simplicity and effectiveness. However, the convergence of the general multi-step MAML still remains unexplored. In this paper, we develop a new theoretical framework to provide such convergence guarantee for two types of objective functions that are of interest in practice: (a) resampling case (e.g., reinforcement learning), where loss functions take the form in expectation and new data are sampled as the algorithm runs; and (b) finite-sum case (e.g., supervised learning), where loss functions take the finite-sum form with given samples. For both cases, we characterize the convergence rate and the computational complexity to attain an $ε$-accurate solution for multi-step MAML in the general nonconvex setting. In particular, our results suggest that an inner-stage stepsize needs to be chosen inversely proportional to the number $N$ of inner-stage steps in order for $N$-step MAML to have guaranteed convergence. From the technical perspective, we develop novel techniques to deal with the nested structure of the meta gradient for multi-step MAML, which can be of independent interest.

preprint2015arXiv

In-plane Charge Fluctuations in Bismuth Sulfide Superconductors

Evidence for local charge fluctuations linked to a charge disproportionation of the Bi ions in the distorted lattice of superconducting LaO$_{1-x}$F$_{x}$ BiS$_{2}$ is presented. In-plane short-range distortions of sulfur atoms up to 0.3 Å in magnitude break site symmetry and create two distinct environments around Bi. Out-of-plane motion of apical sulfur brings it closer to the La-O/F doping layer with increasing $x$ that may lead to a charge transfer conduit between the doping layers and the superconducting BiS$_{2}$ planes. The mechanism for superconductivity may arise from the interplay between charge density fluctuations and an enhanced spin-orbit coupling suggested theoretically, that induces spin polarization.

preprint2015arXiv

Spin Jam: a quantum-fluctuation-induced glassy state of a frustrated magnet

Since the discovery of spin glasses in dilute magnetic systems, their study has been largely focused on understanding randomness and defects as the driving mechanism. The same paradigm has also been applied to explain glassy states found in dense frustrated systems. Recently, however, it has been theoretically suggested that different mechanisms, such as quantum fluctuations and topological features, may induce glassy states in defect-free spin systems, far from the conventional dilute limit. Here we report experimental evidence for the existence of a glassy state, that we call a spin jam, in the vicinity of the clean limit of a frustrated magnet, which is insensitive to a low concentration of defects. We have studied the effect of impurities on SrCr9pGa12-9pO19 (SCGO(p)), a highly frustrated magnet, in which the magnetic Cr3+ (s=3/2) ions form a quasi-two-dimensional triangular system of bi-pyramids. Our experimental data shows that as the nonmagnetic Ga3+ impurity concentration is changed, there are two distinct phases of glassiness: a distinct exotic glassy state, which we call a "spin jam", for high magnetic concentration region (p>0.8) and a cluster spin glass for lower magnetic concentration, (p<0.8). This observation indicates that a spin jam is a unique vantage point from which the class of glassy states in dense frustrated magnets can be understood.

preprint2014arXiv

A Semiblind Two-Way Training Method for Discriminatory Channel Estimation in MIMO Systems

Discriminatory channel estimation (DCE) is a recently developed strategy to enlarge the performance difference between a legitimate receiver (LR) and an unauthorized receiver (UR) in a multiple-input multiple-output (MIMO) wireless system. Specifically, it makes use of properly designed training signals to degrade channel estimation at the UR which in turn limits the UR's eavesdropping capability during data transmission. In this paper, we propose a new two-way training scheme for DCE through exploiting a whitening-rotation (WR) based semiblind method. To characterize the performance of DCE, a closed-form expression of the normalized mean squared error (NMSE) of the channel estimation is derived for both the LR and the UR. Furthermore, the developed analytical results on NMSE are utilized to perform optimal power allocation between the training signal and artificial noise (AN). The advantages of our proposed DCE scheme are two folds: 1) compared to the existing DCE scheme based on the linear minimum mean square error (LMMSE) channel estimator, the proposed scheme adopts a semiblind approach and achieves better DCE performance; 2) the proposed scheme is robust against active eavesdropping with the pilot contamination attack, whereas the existing scheme fails under such an attack.

preprint2014arXiv

Color theorems, chiral domain topology and magnetic properties of FexTaS2

Common mathematical theory can have profound applications in understanding real materials. The intrinsic connection between aperiodic orders observed in the Fibonacci sequence, Penrose tiling, and quasicrystals is a well-known example. Another example is the self-similarity in fractals and dendrites. From transmission electron microscopy experiments, we found that FexTaS2 crystals with x=1/4 and 1/3 exhibit complicated antiphase and chiral domain structures related to ordering of intercalated Fe ions with 2a*2a and sqrt3a*sqrt3a superstructures, respectively. These complex domain patterns are found to be deeply related with the four color theorem, stating that four colors are sufficient to identify the countries on a planar map with proper coloring, and its variations for two-step proper coloring. Furthermore, the domain topology is closely relevant to their magnetic properties. Our discovery unveils the importance of understanding the global topology of domain configurations in functional materials.

preprint2014arXiv

Structural versus electronic distortions of symmetry-broken IrTe$_2$

We investigate atomic and electronic structures of the intriguing low temperature phase of IrTe2 using high-resolution scanning tunneling microscopy and spectroscopy. We confirm various stripe superstructures such as $\times$3, $\times$5, and $\times$8. The strong vertical and lateral distortions of the lattice for the stripe structures are observed in agreement with recent calculations. The spatial modulations of electronic density of states are clearly identified as separated from the structural distortions. These structural and spectroscopic characteristics are not consistent with the charge-density wave and soliton lattice model proposed recently. Instead, we show that the Ir (Te) dimerization together with the Ir 5d charge ordering can explain these superstructures, supporting the Ir dimerization mechanism of the phase transition.

Junjie Yang

What is connected

Connect this record

See the researcher in context

Building this map preview

11 published item(s)

Scalable Oversight for Superhuman AI via Recursive Self-Critiquing

Active Sensing Shapes Real-World Decision-Making through Dynamic Evidence Accumulation

Deepening Hidden Representations from Pre-trained Language Models

Evolution of the structural transition in Mo$_{1-x}$W$_{x}$Te$_{2}$

Second-order nonlinear optical and linear UV-VIS absorption properties of type-II multiferroic candidates RbFe(AO4)2 (A = Mo, Se, S)

Theoretical Convergence of Multi-Step Model-Agnostic Meta-Learning

In-plane Charge Fluctuations in Bismuth Sulfide Superconductors

Spin Jam: a quantum-fluctuation-induced glassy state of a frustrated magnet

A Semiblind Two-Way Training Method for Discriminatory Channel Estimation in MIMO Systems

Color theorems, chiral domain topology and magnetic properties of FexTaS2

Structural versus electronic distortions of symmetry-broken IrTe$_2$