Researcher profile

Jin Huang

Jin Huang contributes to research discovery and scholarly infrastructure.

Researcher | Affiliation not imported | Open to collaborate

Trust snapshot

Quick read

Trust 21 - Emerging | Verification L1 | Unclaimed author
11 works
0 followers
14 topics
4 close collaborators

Published work

11 published items

preprint | 2026 | arXiv

Your Reasoning Benchmark May Not Test Reasoning: Revealing Perception Bottleneck in Abstract Reasoning Benchmarks

Reasoning benchmarks such as the Abstraction and Reasoning Corpus (ARC) and ARC-AGI are widely used to assess progress in artificial intelligence and are often interpreted as probes of core, so-called "fluid" reasoning abilities. Despite their apparent simplicity for humans, these tasks remain challenging for frontier vision-language models (VLMs), a gap commonly attributed to deficiencies in machine reasoning. We challenge this interpretation and hypothesize that the gap arises primarily from limitations in visual perception rather than from shortcomings in inductive reasoning. To verify this hypothesis, we introduce a two-stage experimental pipeline that explicitly separates perception and reasoning. In the perception stage, each image is independently converted into a natural-language description, while in the reasoning stage a model induces and applies rules using these descriptions. This design prevents leakage of cross-image inductive signals and isolates reasoning from perception bottlenecks. Across three ARC-style datasets, Mini-ARC, ACRE, and Bongard-LOGO, we show that perception capability is the dominant factor underlying the observed performance gap by comparing the two-stage pipeline against standard end-to-end one-stage evaluation. Manual inspection of reasoning traces in the VLM outputs further reveals that approximately 80 percent of model failures stem from perception errors. Together, these results demonstrate that ARC-style benchmarks conflate perceptual and reasoning challenges and that observed performance gaps may overstate deficiencies in machine reasoning. Our findings underscore the need for evaluation protocols that disentangle perception from reasoning when assessing progress in machine intelligence.
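
The two-stage separation described in this abstract can be illustrated with a minimal sketch. It is not the authors' pipeline: `two_stage_answer`, `toy_describe`, and `toy_reason` are hypothetical names, the "images" are tuples of integers standing in for pictures, and the reasoner checks only one hand-picked rule. The point it shows is structural: every image is described on its own, and the reasoning step receives text only.

```python
from typing import Callable, List, Sequence, Tuple

Grid = Tuple[int, ...]  # toy stand-in for an image (assumption)


def two_stage_answer(
    images: Sequence[Grid],
    describe: Callable[[Grid], str],
    reason: Callable[[List[str]], str],
) -> str:
    # Stage 1 (perception): describe each image on its own, so no
    # description can borrow hints from the other images.
    descriptions = [describe(image) for image in images]
    # Stage 2 (reasoning): rule induction sees text only, never pixels.
    return reason(descriptions)


def toy_describe(grid: Grid) -> str:
    # Hypothetical perception step: list the cell values as text.
    return " ".join(str(value) for value in grid)


def toy_reason(descriptions: List[str]) -> str:
    # Hypothetical reasoner that tests a single candidate rule: reversal.
    demo_in, demo_out, query = (d.split() for d in descriptions)
    if demo_out == demo_in[::-1]:
        return " ".join(query[::-1])
    return " ".join(query)


# Demonstration input, demonstration output, then the query image.
prediction = two_stage_answer(
    [(1, 2, 3), (3, 2, 1), (4, 5, 6)], toy_describe, toy_reason
)
print(prediction)
```

In a real evaluation, `describe` and `reason` would presumably be model calls; keeping them as plain callables makes it easy to swap either stage without touching the other.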

preprint | 2022 | arXiv

Hybrid Car-Following Strategy based on Deep Deterministic Policy Gradient and Cooperative Adaptive Cruise Control

A car-following strategy based on deep deterministic policy gradient (DDPG) can break through the constraints of the differential equation model because of its ability to explore complex environments. However, the car-following performance of DDPG is usually degraded by unreasonable reward function design, insufficient training, and low sampling efficiency. To solve these problems, a hybrid car-following strategy based on DDPG and cooperative adaptive cruise control (CACC) is proposed. First, the car-following process is modeled as a Markov decision process so that CACC and DDPG can be computed simultaneously at each frame. Given the current state, two actions are obtained, one from CACC and one from DDPG. Then the action that offers the larger reward is chosen as the output of the hybrid strategy. Meanwhile, a rule is designed to ensure that the rate of change of acceleration stays below the desired value. The proposed strategy therefore not only guarantees the basic car-following performance through CACC but also makes full use of DDPG's ability to explore complex environments. Finally, simulation results show that the proposed strategy improves car-following performance compared with DDPG or CACC alone.
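
The selection step in this abstract (take the CACC or DDPG action with the larger reward, then bound the rate of change of acceleration) can be sketched as follows. This is an illustration under assumptions, not the paper's implementation: `hybrid_action`, `toy_reward`, the tie-break in favour of CACC, and the clamp used as the rate-limiting rule are all hypothetical choices.

```python
from typing import Callable, Sequence


def hybrid_action(
    state: Sequence[float],
    cacc_action: float,
    ddpg_action: float,
    reward_fn: Callable[[Sequence[float], float], float],
    prev_accel: float,
    max_jerk: float,
    dt: float,
) -> float:
    """Pick the higher-reward candidate acceleration, then limit its rate of change."""
    # Score both candidates with the (assumed) reward function; ties go to CACC.
    if reward_fn(state, cacc_action) >= reward_fn(state, ddpg_action):
        chosen = cacc_action
    else:
        chosen = ddpg_action
    # Assumed rate-limiting rule: acceleration may change by at most max_jerk * dt per step.
    max_delta = max_jerk * dt
    delta = max(-max_delta, min(max_delta, chosen - prev_accel))
    return prev_accel + delta


def toy_reward(state: Sequence[float], accel: float) -> float:
    # Hypothetical reward: penalise distance from a target acceleration held in state[0].
    return -abs(accel - state[0])


example = hybrid_action(
    (1.0,), cacc_action=0.2, ddpg_action=0.9,
    reward_fn=toy_reward, prev_accel=0.0, max_jerk=5.0, dt=0.1,
)
print(example)
```

Here the DDPG candidate scores higher, but the output is held to a change of 0.5 from the previous acceleration because of the rate limit.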

preprint | 2022 | arXiv

ODformer: Spatial-Temporal Transformers for Long Sequence Origin-Destination Matrix Forecasting Against Cross Application Scenario

Origin-Destination (OD) matrices record directional flow data between pairs of OD regions. The intricate spatiotemporal dependency in the matrices makes the OD matrix forecasting (ODMF) problem not only intractable but also non-trivial. However, most related methods are designed for very short-sequence time series forecasting in specific application scenarios, and cannot meet the varied scenarios and forecasting lengths that practical applications require. To address these issues, we propose a Transformer-like model named ODformer, with two salient characteristics: (i) a novel OD Attention mechanism, which captures special spatial dependencies between OD pairs of the same origin (destination) and, combined with a 2D-GCN that captures spatial dependencies between OD regions, greatly improves the model's ability to predict across application scenarios; and (ii) a PeriodSparse Self-attention that effectively forecasts long-sequence OD matrix series while adapting to the periodic differences between scenarios. Extensive experiments in three application backgrounds (i.e., transportation traffic, IP backbone network traffic, and crowd flow) show that our method outperforms state-of-the-art methods.

preprint | 2022 | arXiv

Series Photo Selection via Multi-view Graph Learning

Series photo selection (SPS) is an important branch of image aesthetics quality assessment, which focuses on finding the best photo in a series of nearly identical photos. While great progress has been made, most existing SPS approaches concentrate solely on extracting features from the original image, neglecting that multiple views, e.g., the saturation level, color histogram, and depth of field of the image, help to reflect subtle aesthetic changes. Taking multiple views into consideration, we leverage a graph neural network to construct the relationships between multi-view features. In addition, the views are aggregated with an adaptive-weight self-attention module to verify the significance of each view. Finally, a siamese network is proposed to select the best photo from a series of nearly identical photos. Experimental results demonstrate that our model achieves the highest success rates compared with competitive methods.

preprint | 2022 | arXiv

Smart sensors using artificial intelligence for on-detector electronics and ASICs

Cutting-edge detectors push sensing technology by further improving spatial and temporal resolution, increasing detector area and volume, and generally reducing backgrounds and noise. This has led to an explosion in the volume of data generated by next-generation experiments. Processing at the data source, near the sensor, with more powerful algorithms is therefore becoming increasingly important in order to capture the right experimental data more efficiently, reduce downstream system complexity, and enable faster and lower-power feedback loops. In this paper, we discuss the motivations and potential applications for on-detector AI. Furthermore, the unique requirements of particle physics can drive the development of novel AI hardware and design tools. We describe existing modern work for particle physics in this area. Finally, we outline a number of areas of opportunity where we can advance machine learning techniques, codesign workflows, and future microelectronics technologies, which will accelerate design, performance, and implementations for next-generation experiments.

preprint | 2022 | arXiv

Solving Simulation Systematics in and with AI/ML

Training an AI/ML system on simulated data while using that system to infer on data from real detectors introduces a systematic error that is difficult to estimate and in many analyses is simply not confronted. It is crucial to minimize and to quantitatively estimate the uncertainties in such analyses, and to do so with a precision and accuracy that match those that AI/ML techniques bring. Here we highlight the need to confront this class of systematic error, discuss conventional ways to estimate it, and describe ways to quantify and to minimize the uncertainty using methods that are themselves based on the power of AI/ML. We also describe methods to introduce a simulation into an AI/ML network to allow for training of its semantically meaningful parameters. This whitepaper is a contribution to the Computational Frontier of Snowmass21.

preprint | 2022 | arXiv

State Encoders in Reinforcement Learning for Recommendation: A Reproducibility Study

Methods for reinforcement learning for recommendation (RL4Rec) are increasingly receiving attention as they can quickly adapt to user feedback. A typical RL4Rec framework consists of (1) a state encoder to encode the state that stores the users' historical interactions, and (2) an RL method to take actions and observe rewards. Prior work compared four state encoders in an environment where user feedback is simulated based on real-world logged user data. An attention-based state encoder was found to be the optimal choice as it reached the highest performance. However, this finding is limited to the actor-critic method, four state encoders, and evaluation-simulators that do not debias logged user data. In response to these shortcomings, we reproduce and expand on the existing comparison of attention-based state encoders (1) in the publicly available debiased RL4Rec SOFA simulator with (2) a different RL method, (3) more state encoders, and (4) a different dataset. Importantly, our experimental results indicate that existing findings do not generalize to the debiased SOFA simulator generated from a different dataset and a Deep Q-Network (DQN)-based method when compared with more state encoders.

preprint | 2021 | arXiv

Costly Features Classification using Monte Carlo Tree Search

We consider the problem of costly feature classification, in which we sequentially select a subset of features to balance classification error against feature cost. In this paper, we first cast the task as a Markov decision process (MDP) and use the Advantage Actor-Critic algorithm to solve it. To further improve the agent's performance and make the policy explainable, we employ Monte Carlo Tree Search to update the policy iteratively. We also consider its performance on unbalanced datasets and its sensitivity to missing values. We evaluate our model on multiple datasets and find that it outperforms other methods.
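
To make the MDP framing in this abstract concrete, the sketch below models one classification episode as an environment. It is a common way to set up costly feature acquisition, assumed here for illustration rather than taken from the paper: the class name `CostlyFeatureEpisode`, the action encoding, the `error_penalty` parameter, and the reward values are all hypothetical. Acquiring a feature reveals it and costs its price; predicting a class ends the episode with a penalty if the prediction is wrong.

```python
from dataclasses import dataclass, field
from typing import List, Optional, Set


@dataclass
class CostlyFeatureEpisode:
    features: List[float]          # true feature values of one sample
    label: int                     # true class of the sample
    costs: List[float]             # price of acquiring each feature
    error_penalty: float = 1.0     # assumed penalty for a wrong prediction
    acquired: Set[int] = field(default_factory=set)
    done: bool = False

    def observation(self) -> List[Optional[float]]:
        # Features that have not been paid for stay hidden (None).
        return [v if i in self.acquired else None
                for i, v in enumerate(self.features)]

    def step(self, action: int) -> float:
        """Actions 0..n-1 acquire a feature; action n+k predicts class k and ends the episode."""
        if self.done:
            raise RuntimeError("episode already finished")
        n = len(self.features)
        if action < n:
            # Acquisition: reveal the feature and pay its cost as negative reward.
            self.acquired.add(action)
            return -self.costs[action]
        # Prediction: terminal action, penalised only when the class is wrong.
        self.done = True
        predicted = action - n
        return 0.0 if predicted == self.label else -self.error_penalty


episode = CostlyFeatureEpisode(features=[0.3, 1.7], label=1, costs=[0.1, 0.4])
r_acquire = episode.step(1)        # pay for feature 1
seen = episode.observation()       # feature 0 is still hidden
r_predict = episode.step(2 + 1)    # predict class 1 and stop
print(r_acquire, seen, r_predict)
```

A policy trained on such an environment, whether by actor-critic or tree search, trades the summed acquisition costs against the terminal error penalty.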

preprint | 2020 | arXiv

A statistical study of the plasma and composition distribution inside magnetic clouds: 1998-2011

A comprehensive analysis of plasma and composition characteristics inside magnetic clouds (MCs) observed by the Advanced Composition Explorer (ACE) spacecraft from 1998 February to 2011 August is presented. The results show that MCs have specific interior structures, and MCs of different speeds show differences in composition and structure. Compared with slow MCs, fast MCs have enhanced mean charge states of iron, oxygen, silicon, and magnesium, and enhanced $\mathrm{O^{7+}/O^{6+}}$, $\mathrm{C^{6+}/C^{5+}}$, $\mathrm{C^{6+}/C^{4+}}$ and $\mathrm{Fe^{\geq16+}/Fe_{total}}$ values. For ionic species in fast MCs, a higher atomic number corresponds to a greater enhancement of the mean charge state relative to slow MCs. We also find that both fast and slow MCs display a bimodal distribution in the mean iron charge state ($\mathrm{\langle Q\rangle Fe}$), which suggests that the existence of a flux rope prior to the eruption is common. Furthermore, the distributions of $\mathrm{\langle Q\rangle Fe}$, $\mathrm{Fe^{\geq16+}/Fe_{total}}$, and $\mathrm{O^{7+}/O^{6+}}$ inside fast MCs share the feature that the posterior peak is higher than the anterior one. This result agrees with the "standard model" for CME/flares, in which magnetic reconnection occurs beneath the flux rope, thereby ionizing the ions in the posterior part of the flux rope through high-energy electron collisions or direct heating in the reconnection region.

preprint | 2020 | arXiv

Comparative Analyses of Plasma Properties and Composition in Two Types of Small-Scale Interplanetary Flux-ropes

The origin of small-scale interplanetary magnetic flux-ropes (SIMFRs) and the relationship between SIMFRs and magnetic clouds (MCs) are still controversial. In this study, two populations of SIMFRs were collected, i.e., SIMFRs originating from the Sun (SIMFR-SUN) and those originating from the solar wind (SIMFR-SW). We defined SIMFR-SUN (SIMFR-SW) as the SIMFRs that include (exclude) counter-streaming suprathermal electrons and stay away from (close to) the heliospheric current sheet. After fitting with a force-free flux-rope model, 52 SIMFR-SUN and 57 SIMFR-SW events observed by the Advanced Composition Explorer (ACE) from 1998 February to 2011 August qualified. Using the approach of relating the measurements to their spatial position within the flux-ropes, a comparative survey of plasma and composition characteristics inside the two populations of SIMFRs is presented. Results show that the two populations of SIMFRs have apparent differences. Compared with SIMFR-SW, SIMFR-SUN are MC-like, featuring lower central proton density, higher Vrad, higher low-FIP element abundances, and higher and more fluctuating average ion charge-states and ion charge-state ratios, which are related to heating in the low corona. In addition, for the ion charge-state distributions inside SIMFR-SUN, the sunward side is higher than the earthward side, which might be caused by flare heating during eruption. Moreover, both SIMFR-SUN and MCs show an anti-correlation between plasma beta and the He/P trend. These characteristics indicate that SIMFR-SUN and MCs are very likely to have the same origin. This study supports the two-source origin of SIMFRs, i.e., the solar corona and the solar wind.

preprint | 2020 | arXiv

Hierarchical Neural Architecture Search for Single Image Super-Resolution

Deep neural networks have exhibited promising performance in image super-resolution (SR). Most SR models follow a hierarchical architecture that contains both the cell-level design of computational blocks and the network-level design of the positions of upsampling blocks. However, designing SR models heavily relies on human expertise and is very labor-intensive. More critically, these SR models often contain a huge number of parameters and may not meet the requirements of computation resources in real-world applications. To address the above issues, we propose a Hierarchical Neural Architecture Search (HNAS) method to automatically design promising architectures with different requirements of computation cost. To this end, we design a hierarchical SR search space and propose a hierarchical controller for architecture search. Such a hierarchical controller is able to simultaneously find promising cell-level blocks and network-level positions of upsampling layers. Moreover, to design compact architectures with promising performance, we build a joint reward by considering both the performance and computation cost to guide the search process. Extensive experiments on five benchmark datasets demonstrate the superiority of our method over existing methods.