Researcher profile

Junjie Yang

Junjie Yang contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
6works
0followers
8topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

6 published item(s)

preprint2026arXiv

Scalable Oversight for Superhuman AI via Recursive Self-Critiquing

As AI capabilities increasingly surpass human proficiency in complex tasks, current alignment techniques, including SFT and RLHF, face fundamental challenges in ensuring reliable oversight. These methods rely on direct human assessment and become impractical when AI outputs exceed human cognitive thresholds. In response to this challenge, we explore two hypotheses: (1) \textit{Critique of critique can be easier than critique itself}, extending the widely-accepted observation that verification is easier than generation to the critique domain, as critique itself is a specialized form of generation; (2) \textit{This difficulty relationship holds recursively}, suggesting that when direct evaluation is infeasible, performing higher-order critiques (e.g., critique of critique of critique) offers a more tractable supervision pathway. We conduct Human-Human, Human-AI, and AI-AI experiments to investigate the potential of recursive self-critiquing for AI supervision. Our results highlight recursive critique as a promising approach for scalable AI oversight.

preprint2025arXiv

Active Sensing Shapes Real-World Decision-Making through Dynamic Evidence Accumulation

Human decision-making heavily relies on active sensing, a well-documented cognitive behaviour for evidence gathering to accommodate ever-changing environments. However, its operational mechanism in the real world remains non-trivial. Currently, an in-laboratory paradigm, called evidence accumulation modelling (EAM), points out that human decision-making involves transforming external evidence into internal mental beliefs. However, the gap in evidence affordance between real-world contexts and laboratory settings hinders the effective application of EAM. Here we generalize EAM to the real world and conduct analysis in real-world driving scenarios. A cognitive scheme is proposed to formalize real-world evidence affordance and capture active sensing through eye movements. Empirically, our scheme can plausibly portray the accumulation of drivers' mental beliefs, explaining how active sensing transforms evidence into mental beliefs from the perspective of information utility. Also, our results demonstrate a negative correlation between evidence affordance and attention recruited by individuals, revealing how human drivers adapt their evidence-collection patterns across various contexts. Moreover, we reveal the positive influence of evidence affordance and attention distribution on decision-making propensity. In a nutshell, our computational scheme generalizes EAM to real-world contexts and provides a comprehensive account of how active sensing underlies real-world decision-making, unveiling multifactorial, integrated characteristics in real-world decision-making.

preprint2020arXiv

Deepening Hidden Representations from Pre-trained Language Models

Transformer-based pre-trained language models have proven to be effective for learning contextualized language representation. However, current approaches only take advantage of the output of the encoder's final layer when fine-tuning the downstream tasks. We argue that only taking single layer's output restricts the power of pre-trained representation. Thus we deepen the representation learned by the model by fusing the hidden representation in terms of an explicit HIdden Representation Extractor (HIRE), which automatically absorbs the complementary representation with respect to the output from the final layer. Utilizing RoBERTa as the backbone encoder, our proposed improvement over the pre-trained models is shown effective on multiple natural language understanding tasks and help our model rival with the state-of-the-art models on the GLUE benchmark.

preprint2020arXiv

Evolution of the structural transition in Mo$_{1-x}$W$_{x}$Te$_{2}$

The composition dependence of the structural transition between the monoclinic 1T$^{\prime}$ and orthorhombic T$_{d}$ phases in the Mo$_{1-x}$W$_{x}$Te$_{2}$ Weyl semimetal was investigated by elastic neutron scattering on single crystals up to $x \approx 0.54$. First observed in MoTe$_{2}$, the transition from T$_{d}$ to 1T$^{\prime}$ is accompanied by an intermediate pseudo-orthorhombic phase, T$_{d}^{*}$. Upon doping with W, the T$_{d}^{*}$ phase vanishes by $x \approx 0.34$. Above this concentration, a phase coexistence behavior with both T$_{d}$ and 1T$^{\prime}$ is observed instead. The interlayer in-plane positioning parameter $δ$, which relates to the 1T$^{\prime}$ $β$ angle, decreases with temperature as well as with W substitution, likely due to strong anharmonicity in the interlayer interactions. The temperature width of the phase coexistence remains almost constant up to $x \approx 0.54$, in contrast to the broadening reported under pressure.

preprint2020arXiv

Second-order nonlinear optical and linear UV-VIS absorption properties of type-II multiferroic candidates RbFe(AO4)2 (A = Mo, Se, S)

Motivated by the search for type-II multiferroics, we present a comprehensive optical study of a complex oxide family of type-II multiferroic candidates: RbFe(MoO4)2, RbFe(SeO4)2, and RbFe(SO4)2. We employ rotational-anisotropy second harmonic generation spectroscopy (RA SHG), a technique sensitive to point symmetries, to address discrepancies in literature-assigned point/space groups and to identify the correct crystal structures. At room temperature we find that our RA SHG patterns rotate away from the crystal axes in RbFe(AO4)2 (A = Se, S), which identifies the lack of mirror symmetry and in-plane two-fold rotational symmetry. Also, the SHG efficiency of RbFe(SeO4)2 is two orders of magnitude stronger than RbFe(AO4)2 (A = Mo, S), which suggests broken inversion symmetry. Additionally, we present temperature-dependent linear optical characterizations near the band edge of this family of materials using ultraviolet-visible (UV-VIS) absorption spectroscopy. Included is experimental evidence of the band gap energy and band gap transition type for this family. Previously unreported sub-band gap absorption is also presented, which reveals prominent optical transitions, some with an unusual central energy temperature dependence. Furthermore, we find that by substituting the A-site in RbFe(AO4)2 (A = Mo, Se, S), the aforementioned transitions are spectrally tunable. Finally, we discuss the potential origin and impact of these tunable transitions.

preprint2020arXiv

Theoretical Convergence of Multi-Step Model-Agnostic Meta-Learning

As a popular meta-learning approach, the model-agnostic meta-learning (MAML) algorithm has been widely used due to its simplicity and effectiveness. However, the convergence of the general multi-step MAML still remains unexplored. In this paper, we develop a new theoretical framework to provide such convergence guarantee for two types of objective functions that are of interest in practice: (a) resampling case (e.g., reinforcement learning), where loss functions take the form in expectation and new data are sampled as the algorithm runs; and (b) finite-sum case (e.g., supervised learning), where loss functions take the finite-sum form with given samples. For both cases, we characterize the convergence rate and the computational complexity to attain an $ε$-accurate solution for multi-step MAML in the general nonconvex setting. In particular, our results suggest that an inner-stage stepsize needs to be chosen inversely proportional to the number $N$ of inner-stage steps in order for $N$-step MAML to have guaranteed convergence. From the technical perspective, we develop novel techniques to deal with the nested structure of the meta gradient for multi-step MAML, which can be of independent interest.