Source author record

Jin Huang

Jin Huang appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

hep-ex Artificial Intelligence astro-ph.SR Computer Vision Machine Learning physics.space-ph astro-ph.EP Computation and Language cs.CY eess.SY Graphics hep-ph Information Retrieval nucl-ex physics.data-an physics.ins-det physics.plasm-ph Systems and Control

Catalog footprint

What is connected

14works

18topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

Your Reasoning Benchmark May Not Test Reasoning: Revealing Perception Bottleneck in Abstract Reasoning Benchmarks

Reasoning benchmarks such as the Abstraction and Reasoning Corpus (ARC) and ARC-AGI are widely used to assess progress in artificial intelligence and are often interpreted as probes of core, so-called ``fluid'' reasoning abilities. Despite their apparent simplicity for humans, these tasks remain challenging for frontier vision-language models (VLMs), a gap commonly attributed to deficiencies in machine reasoning. We challenge this interpretation and hypothesize that the gap arises primarily from limitations in visual perception rather than from shortcomings in inductive reasoning. To verify this hypothesis, we introduce a two-stage experimental pipeline that explicitly separates perception and reasoning. In the perception stage, each image is independently converted into a natural-language description, while in the reasoning stage a model induces and applies rules using these descriptions. This design prevents leakage of cross-image inductive signals and isolates reasoning from perception bottlenecks. Across three ARC-style datasets, Mini-ARC, ACRE, and Bongard-LOGO, we show that the perception capability is the dominant factor underlying the observed performance gap by comparing the two-stage pipeline with against standard end-to-end one-stage evaluation. Manual inspection of reasoning traces in the VLM outputs further reveals that approximately 80 percent of model failures stem from perception errors. Together, these results demonstrate that ARC-style benchmarks conflate perceptual and reasoning challenges and that observed performance gaps may overstate deficiencies in machine reasoning. Our findings underscore the need for evaluation protocols that disentangle perception from reasoning when assessing progress in machine intelligence.

preprint2022arXiv

Hybrid Car-Following Strategy based on Deep Deterministic Policy Gradient and Cooperative Adaptive Cruise Control

Deep deterministic policy gradient (DDPG)-based car-following strategy can break through the constraints of the differential equation model due to the ability of exploration on complex environments. However, the car-following performance of DDPG is usually degraded by unreasonable reward function design, insufficient training, and low sampling efficiency. In order to solve this kind of problem, a hybrid car-following strategy based on DDPG and cooperative adaptive cruise control (CACC) is proposed. First, the car-following process is modeled as the Markov decision process to calculate CACC and DDPG simultaneously at each frame. Given a current state, two actions are obtained from CACC and DDPG, respectively. Then, an optimal action, corresponding to the one offering a larger reward, is chosen as the output of the hybrid strategy. Meanwhile, a rule is designed to ensure that the change rate of acceleration is smaller than the desired value. Therefore, the proposed strategy not only guarantees the basic performance of car-following through CACC but also makes full use of the advantages of exploration on complex environments via DDPG. Finally, simulation results show that the car-following performance of the proposed strategy is improved compared with that of DDPG and CACC.

preprint2022arXiv

ODformer: Spatial-Temporal Transformers for Long Sequence Origin-Destination Matrix Forecasting Against Cross Application Scenario

Origin-Destination (OD) matrices record directional flow data between pairs of OD regions. The intricate spatiotemporal dependency in the matrices makes the OD matrix forecasting (ODMF) problem not only intractable but also non-trivial. However, most of the related methods are designed for very short sequence time series forecasting in specific application scenarios, which cannot meet the requirements of the variation in scenarios and forecasting length of practical applications. To address these issues, we propose a Transformer-like model named ODformer, with two salient characteristics: (i) the novel OD Attention mechanism, which captures special spatial dependencies between OD pairs of the same origin (destination), greatly improves the ability of the model to predict cross-application scenarios after combining with 2D-GCN that captures spatial dependencies between OD regions. (ii) a PeriodSparse Self-attention that effectively forecasts long sequence OD matrix series while adapting to the periodic differences in different scenarios. Generous experiments in three application backgrounds (i.e., transportation traffic, IP backbone network traffic, crowd flow) show our method outperforms the state-of-the-art methods.

preprint2022arXiv

Series Photo Selection via Multi-view Graph Learning

Series photo selection (SPS) is an important branch of the image aesthetics quality assessment, which focuses on finding the best one from a series of nearly identical photos. While a great progress has been observed, most of the existing SPS approaches concentrate solely on extracting features from the original image, neglecting that multiple views, e.g, saturation level, color histogram and depth of field of the image, will be of benefit to successfully reflecting the subtle aesthetic changes. Taken multi-view into consideration, we leverage a graph neural network to construct the relationships between multi-view features. Besides, multiple views are aggregated with an adaptive-weight self-attention module to verify the significance of each view. Finally, a siamese network is proposed to select the best one from a series of nearly identical photos. Experimental results demonstrate that our model accomplish the highest success rates compared with competitive methods.

preprint2022arXiv

Smart sensors using artificial intelligence for on-detector electronics and ASICs

Cutting edge detectors push sensing technology by further improving spatial and temporal resolution, increasing detector area and volume, and generally reducing backgrounds and noise. This has led to a explosion of more and more data being generated in next-generation experiments. Therefore, the need for near-sensor, at the data source, processing with more powerful algorithms is becoming increasingly important to more efficiently capture the right experimental data, reduce downstream system complexity, and enable faster and lower-power feedback loops. In this paper, we discuss the motivations and potential applications for on-detector AI. Furthermore, the unique requirements of particle physics can uniquely drive the development of novel AI hardware and design tools. We describe existing modern work for particle physics in this area. Finally, we outline a number of areas of opportunity where we can advance machine learning techniques, codesign workflows, and future microelectronics technologies which will accelerate design, performance, and implementations for next generation experiments.

preprint2022arXiv

Solving Simulation Systematics in and with AI/ML

Training an AI/ML system on simulated data while using that system to infer on data from real detectors introduces a systematic error which is difficult to estimate and in many analyses is simply not confronted. It is crucial to minimize and to quantitatively estimate the uncertainties in such analysis and do so with a precision and accuracy that matches those that AI/ML techniques bring. Here we highlight the need to confront this class of systematic error, discuss conventional ways to estimate it and describe ways to quantify and to minimize the uncertainty using methods which are themselves based on the power of AI/ML. We also describe methods to introduce a simulation into an AI/ML network to allow for training of its semantically meaningful parameters. This whitepaper is a contribution to the Computational Frontier of Snowmass21.

preprint2022arXiv

State Encoders in Reinforcement Learning for Recommendation: A Reproducibility Study

Methods for reinforcement learning for recommendation (RL4Rec) are increasingly receiving attention as they can quickly adapt to user feedback. A typical RL4Rec framework consists of (1) a state encoder to encode the state that stores the users' historical interactions, and (2) an RL method to take actions and observe rewards. Prior work compared four state encoders in an environment where user feedback is simulated based on real-world logged user data. An attention-based state encoder was found to be the optimal choice as it reached the highest performance. However, this finding is limited to the actor-critic method, four state encoders, and evaluation-simulators that do not debias logged user data. In response to these shortcomings, we reproduce and expand on the existing comparison of attention-based state encoders (1) in the publicly available debiased RL4Rec SOFA simulator with (2) a different RL method, (3) more state encoders, and (4) a different dataset. Importantly, our experimental results indicate that existing findings do not generalize to the debiased SOFA simulator generated from a different dataset and a Deep Q-Network (DQN)-based method when compared with more state encoders.

preprint2021arXiv

Costly Features Classification using Monte Carlo Tree Search

We consider the problem of costly feature classification, where we sequentially select the subset of features to make a balance between the classification error and the feature cost. In this paper, we first cast the task into a MDP problem and use Advantage Actor Critic algorithm to solve it. In order to further improve the agent's performance and make the policy explainable, we employ the Monte Carlo Tree Search to update the policy iteratively. During the procedure, we also consider its performance on the unbalanced dataset and its sensitivity to the missing value. We evaluate our model on multiple datasets and find it outperforms other methods.

preprint2020arXiv

A statistical study of the plasma and composition distribution inside magnetic clouds: 1998-2011

A comprehensive analysis of plasma and composition characteristics inside magnetic clouds (MCs) observed by the Advanced Composition Explorer (ACE) spacecraft from 1998 February to 2011 August is presented. The results show that MCs have specific interior structures, and MCs of different speeds show differences in composition and structure. Compared with the slow MCs, fast MCs have enhanced mean charge states of iron, oxygen, silicon, magnesium, $\mathrm{O^{7+}/O^{6+}}$, $\mathrm{C^{6+}/C^{5+}}$, $\mathrm{C^{6+}/C^{4+}}$ and $\mathrm{Fe^{\geq16+}/Fe_{total}}$ values. For ionic species in fast MCs, a higher atomic number represents a greater enhancement of mean charge state than slow MCs. We also find that both the fast and slow MCs display bimodal structure distribution in the mean iron charge state ($\mathrm{\langle Q\rangle Fe}$), which suggests that the existence of flux rope prior to the eruption is common. Furthermore, the $\mathrm{\langle Q\rangle Fe} $, $\mathrm{Fe^{\geq16+}/Fe_{total}}$, and $\mathrm{O^{7+}/O^{6+}}$ ratio distribution inside fast MCs have the feature that the posterior peak is higher than the anterior one. This result agrees with the "standard model" for CME/flares, by which magnetic reconnection occurs beneath the flux rope, thereby ionizing the ions of the posterior part of flux rope sufficiently by high-energy electron collisions or by direct heating in the reconnection region.

preprint2020arXiv

Comparative Analyses of Plasma Properties and Composition in Two Types of Small-Scale Interplanetary Flux-ropes

The origin of small-scale interplanetary magnetic flux-ropes (SIMFRs) and the relationship between SIMFRs and magnetic clouds (MCs) are still controversial. In this study, two populations of SMIFRs were collected, i.e., SIMFRs originating from the Sun (SIMFR-SUN) and those originating from the solar wind (SIMFR-SW). We defined the SIMFR-SUN (SIMFR-SW) as the SMIFRs that include (exclude) the counter-streaming suprathermal electrons and stay away from (close to) the heliospheric current sheet. After fitting with force-free flux-rope model, 52 SIMFR-SUN and 57 SIMFR-SW events observed by Advanced Composition Explorer (ACE) from 1998 February to 2011 August were qualified. Using the approach of relating the measurements to their spatial position within the flux-ropes, a comparative survey of plasma and composition characteristics inside the two populations of SIMFRs is presented. Results show that the two populations of SIMFRs have apparent differences. Compared with SIMFR-SW, SIMFR-SUN are MC-like, featuring lower central proton density, higher Vrad, higher low-FIP element abundances, higher and more fluctuate average ion charge-states and the ion chargestate ratios which are related to the heating in low corona. In addition, for the ion charge-state distributions inside SIMFR-SUN, the sunward side is higher than earthward, which might be caused by the flare heating during eruption. Moreover, both SIMFR-SUN and MCs show anti-correlation between plasma beta and He/P trend. These characteristics indicate that SIMFR-SUN and MCs are very likely to have the identical origination. This study supports the two-source origin of SIMFRs, i.e., the solar corona and the solar wind.

preprint2020arXiv

Hierarchical Neural Architecture Search for Single Image Super-Resolution

Deep neural networks have exhibited promising performance in image super-resolution (SR). Most SR models follow a hierarchical architecture that contains both the cell-level design of computational blocks and the network-level design of the positions of upsampling blocks. However, designing SR models heavily relies on human expertise and is very labor-intensive. More critically, these SR models often contain a huge number of parameters and may not meet the requirements of computation resources in real-world applications. To address the above issues, we propose a Hierarchical Neural Architecture Search (HNAS) method to automatically design promising architectures with different requirements of computation cost. To this end, we design a hierarchical SR search space and propose a hierarchical controller for architecture search. Such a hierarchical controller is able to simultaneously find promising cell-level blocks and network-level positions of upsampling layers. Moreover, to design compact architectures with promising performance, we build a joint reward by considering both the performance and computation cost to guide the search process. Extensive experiments on five benchmark datasets demonstrate the superiority of our method over existing methods.

preprint2016arXiv

Modelling Developable Ribbons Using Ruling Bending Coordinates

This paper presents a new method for modelling the dynamic behaviour of developable ribbons, two dimensional strips with much smaller width than length. Instead of approximating such surface with a general triangle mesh, we characterize it by a set of creases and bending angles across them. This representation allows the developability to be satisfied everywhere while still leaves enough degree of freedom to represent salient global deformation. We show how the potential and kinetic energies can be properly discretized in this configuration space and time integrated in a fully implicit manner. The result is a dynamic simulator with several desirable features: We can model non-trivial deformation using much fewer elements than conventional FEM method. It is stable under extreme deformation, external force or large timestep size. And we can readily handle various user constraints in Euclidean space.

preprint2016arXiv

Spin asymmetries for vector boson production in polarized p+p collisions

We study the cross section for vector boson ($W^{\pm}/Z^0/γ^*$) production in polarized nucleon-nucleon collisions for low transverse momentum of the observed vector boson. For the case where one measures the transverse momentum and azimuthal angle of the vector bosons, we present the cross sections and the associated spin asymmetries in terms of transverse momentum dependent parton distribution functions (TMDs) at tree level within the TMD factorization formalism. To assess the feasibility of experimental measurements, we estimate the spin asymmetries for $W^{\pm}/Z^0$ boson production in polarized proton-proton collisions at the Relativistic Heavy Ion Collider (RHIC) by using current knowledge of the relevant TMDs. We find that some of these asymmetries can be sizable if the suppression effect from TMD evolution is not too strong. The $W$ program at RHIC can, thus, test and constrain spin theory by providing unique information on the universality properties of TMDs, TMD evolution, and the nucleon structure. For example, the single transverse spin asymmetries could be used to probe the well-known Sivers function $f_{1T}^{\perp q}$, as well as the transversal helicity distribution $g_{1T}^{q}$ via the parity-violating nature of $W$ production.

preprint2015arXiv

Sales pipeline win propensity prediction: a regression approach

Sales pipeline analysis is fundamental to proactive management of an enterprize's sales pipeline and critical for business success. In particular, win propensity prediction, which involves quantitatively estimating the likelihood that on-going sales opportunities will be won within a specified time window, is a fundamental building block for sales management and lays the foundation for many applications such as resource optimization and sales gap analysis. With the proliferation of big data, the use of data-driven predictive models as a means to drive better sales performance is increasingly widespread, both in business-to-client (B2C) and business-to-business (B2B) markets. However, the relatively small number of B2B transactions (compared with the volume of B2C transactions), noisy data, and the fast-changing market environment pose challenges to effective predictive modeling. This paper proposes a machine learning-based unified framework for sales opportunity win propensity prediction, aimed at addressing these challenges. We demonstrate the efficacy of our proposed system using data from a top-500 enterprize in the business-to-business market.

Jin Huang

What is connected

Connect this record

See the researcher in context

Building this map preview

14 published item(s)

Your Reasoning Benchmark May Not Test Reasoning: Revealing Perception Bottleneck in Abstract Reasoning Benchmarks

Hybrid Car-Following Strategy based on Deep Deterministic Policy Gradient and Cooperative Adaptive Cruise Control

ODformer: Spatial-Temporal Transformers for Long Sequence Origin-Destination Matrix Forecasting Against Cross Application Scenario

Series Photo Selection via Multi-view Graph Learning

Smart sensors using artificial intelligence for on-detector electronics and ASICs

Solving Simulation Systematics in and with AI/ML

State Encoders in Reinforcement Learning for Recommendation: A Reproducibility Study

Costly Features Classification using Monte Carlo Tree Search

A statistical study of the plasma and composition distribution inside magnetic clouds: 1998-2011

Comparative Analyses of Plasma Properties and Composition in Two Types of Small-Scale Interplanetary Flux-ropes

Hierarchical Neural Architecture Search for Single Image Super-Resolution

Modelling Developable Ribbons Using Ruling Bending Coordinates

Spin asymmetries for vector boson production in polarized p+p collisions

Sales pipeline win propensity prediction: a regression approach