Source author record

Jifeng Hu

Jifeng Hu appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Artificial Intelligence hep-ph Machine Learning nucl-ex physics.ins-det

Catalog footprint

What is connected

3works

5topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

A Simple Unified Uncertainty-Guided Framework for Offline-to-Online Reinforcement Learning

Offline reinforcement learning (RL) provides a promising solution to learning an agent fully relying on a data-driven paradigm. However, constrained by the limited quality of the offline dataset, its performance is often sub-optimal. Therefore, it is desired to further finetune the agent via extra online interactions before deployment. Unfortunately, offline-to-online RL can be challenging due to two main challenges: constrained exploratory behavior and state-action distribution shift. In view of this, we propose a Simple Unified uNcertainty-Guided (SUNG) framework, which naturally unifies the solution to both challenges with the tool of uncertainty. Specifically, SUNG quantifies uncertainty via a VAE-based state-action visitation density estimator. To facilitate efficient exploration, SUNG presents a practical optimistic exploration strategy to select informative actions with both high value and high uncertainty. Moreover, SUNG develops an adaptive exploitation method by applying conservative offline RL objectives to high-uncertainty samples and standard online RL objectives to low-uncertainty samples to smoothly bridge offline and online stages. SUNG achieves state-of-the-art online finetuning performance when combined with different offline RL methods, across various environments and datasets in D4RL benchmark. Codes are made publicly available in https://github.com/guosyjlu/SUNG.

preprint2022arXiv

Study of exotic hadrons with machine learning

We analyzed the invariant mass spectrum of near-threshold exotic states for one-channel candidates with a deep neural network. It can extract the scattering length and effective range, which would shed light on the nature of given states, from the experimental mass spectrum. As an application, the mass spectrum of the $X(3872)$ and the $T_{cc}^+$ are studied. The obtained scattering lengths, effective ranges, and most relevant thresholds are consistent with those from fitting to the experimental data. The advantage of the neural network is that it is more stable than the fitting, especially for low-statistic data. The network, which provides another way to analyze the experimental data, can also be applied to other one-channel near-threshold exotic candidates.

preprint2016arXiv

TOF spectroscopy measurement using waveform digitizer

The photoneutron source (PNS, phase 1), an electron linear accelerator (linac)-based pulsed neutron facility that uses the time-of-flight (TOF) technique, was constructed for the acquisition of nuclear data from the thorium molten salt reactor(TMSR) at the Shanghai Institute of Applied Physics (SINAP). The neutron detector signal, with the information on the pulse arrival time, pulse shape, and pulse height, was recorded by using a waveform digitizer (WFD). By using the pulse height and pulse-shape discrimination (PSD) analysis to identify neutrons and $γ$-rays, the neutron TOF spectrum was obtained by employing a simple electronic design, and a new WFD-based DAQ system was developed and tested in this commissioning experiment. The developed DAQ system is characterized by a very high efficiency with respect to millisecond neutron TOF spectroscopy