Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
20works
0followers
15topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

20 published item(s)

preprint2026arXiv

FrequencyCT: Frequency domain pseudo-label generation for self-supervised low-dose CT denoising

Despite extensive research on computed tomography (CT) denoising, few studies exploit projection-domain data characteristics to mitigate noise correlation. To address this, this work proposes FrequencyCT, the first zero-shot self-supervised method for pseudo-label generation in the frequency domain for low-dose CT denoising. Leveraging the characteristic of the frequency domain that largely isolates noise from clean signals, a regional low-frequency anchoring technique is proposed. Phase-preserving amplitude modulation and mask perturbation in the high-frequency region generate pseudo-label data for self-supervision. The fluctuating noise variance in the projection domain prompts truncation of the generated samples to stabilize the network's optimization gradient. Evaluation results on multiple public and real-world datasets confirm the clinical application potential of this research, which will have a revolutionary impact on the field of denoising. The code can be obtained from https://github.com/yqx7150/FrequencyCT.

preprint2026arXiv

M$^3$Searcher: Modular Multimodal Information Seeking Agency with Retrieval-Oriented Reasoning

Recent advances in DeepResearch-style agents have demonstrated strong capabilities in autonomous information acquisition and synthesize from real-world web environments. However, existing approaches remain fundamentally limited to text modality. Extending autonomous information-seeking agents to multimodal settings introduces critical challenges: the specialization-generalization trade-off that emerges when training models for multimodal tool-use at scale, and the severe scarcity of training data capturing complex, multi-step multimodal search trajectories. To address these challenges, we propose M$^3$Searcher, a modular multimodal information-seeking agent that explicitly decouples information acquisition from answer derivation. M$^3$Searcher is optimized with a retrieval-oriented multi-objective reward that jointly encourages factual accuracy, reasoning soundness, and retrieval fidelity. In addition, we develop MMSearchVQA, a multimodal multi-hop dataset to support retrieval centric RL training. Experimental results demonstrate that M$^3$Searcher outperforms existing approaches, exhibits strong transfer adaptability and effective reasoning in complex multimodal tasks.

preprint2026arXiv

PilotRL: Training Language Model Agents via Global Planning-Guided Progressive Reinforcement Learning

Large Language Models (LLMs) have shown remarkable advancements in tackling agent-oriented tasks. Despite their potential, existing work faces challenges when deploying LLMs in agent-based environments. The widely adopted agent paradigm ReAct centers on integrating single-step reasoning with immediate action execution, which limits its effectiveness in complex tasks requiring long-term strategic planning. Furthermore, the coordination between the planner and executor during problem-solving is also a critical factor to consider in agent design. Additionally, current approaches predominantly rely on supervised fine-tuning, which often leads models to memorize established task completion trajectories, thereby restricting their generalization ability when confronted with novel problem contexts. To address these challenges, we introduce an adaptive global plan-based agent paradigm AdaPlan, aiming to synergize high-level explicit guidance with execution to support effective long-horizon decision-making. Based on the proposed paradigm, we further put forward PilotRL, a global planning-guided training framework for LLM agents driven by progressive reinforcement learning. We first develop the model's ability to follow explicit guidance from global plans when addressing agent tasks. Subsequently, based on this foundation, we focus on optimizing the quality of generated plans. Finally, we conduct joint optimization of the model's planning and execution coordination. Experiments indicate that PilotRL could achieve state-of-the-art performances, with LLaMA3.1-8B-Instruct + PilotRL surpassing closed-sourced GPT-4o by 3.60%, while showing a more substantial gain of 55.78% comparing to GPT-4o-mini at a comparable parameter scale.

preprint2025arXiv

Fracture interactive geodesic active contours for bone segmentation

For bone segmentation, the classical geodesic active contour model is usually limited by its indiscriminate feature extraction, and then struggles to handle the phenomena of edge obstruction, edge leakage and bone fracture. Thus, we propose a fracture interactive geodesic active contour algorithm tailored for bone segmentation, which can better capture bone features and perform robustly to the presence of bone fractures and soft tissues. Inspired by orthopedic knowledge, we construct a novel edge-detector function that combines the intensity and gradient norm, which guides the contour towards bone edges without being obstructed by other soft tissues and therefore reduces mis-segmentation. Furthermore, distance information, where fracture prompts can be embedded, is introduced into the contour evolution as an adaptive step size to stabilize the evolution and help the contour stop at bone edges and fractures. This embedding provides a way to interact with bone fractures and improves the accuracy in the fracture regions. Experiments in pelvic and ankle segmentation demonstrate the effectiveness on addressing the aforementioned problems and show an accurate, stable and consistent performance, indicating a broader application in other bone anatomies. Our algorithm also provides insights into combining the domain knowledge and deep neural networks.

preprint2023arXiv

A Bi-Step Grounding Paradigm for Large Language Models in Recommendation Systems

As the focus on Large Language Models (LLMs) in the field of recommendation intensifies, the optimization of LLMs for recommendation purposes (referred to as LLM4Rec) assumes a crucial role in augmenting their effectiveness in providing recommendations. However, existing approaches for LLM4Rec often assess performance using restricted sets of candidates, which may not accurately reflect the models' overall ranking capabilities. In this paper, our objective is to investigate the comprehensive ranking capacity of LLMs and propose a two-step grounding framework known as BIGRec (Bi-step Grounding Paradigm for Recommendation). It initially grounds LLMs to the recommendation space by fine-tuning them to generate meaningful tokens for items and subsequently identifies appropriate actual items that correspond to the generated tokens. By conducting extensive experiments on two datasets, we substantiate the superior performance, capacity for handling few-shot scenarios, and versatility across multiple domains exhibited by BIGRec. Furthermore, we observe that the marginal benefits derived from increasing the quantity of training samples are modest for BIGRec, implying that LLMs possess the limited capability to assimilate statistical information, such as popularity and collaborative filtering, due to their robust semantic priors. These findings also underline the efficacy of integrating diverse statistical information into the LLM4Rec framework, thereby pointing towards a potential avenue for future research. Our code and data are available at https://github.com/SAI990323/Grounding4Rec.

preprint2022arXiv

A Survey on Deep Hashing Methods

Nearest neighbor search aims to obtain the samples in the database with the smallest distances from them to the queries, which is a basic task in a range of fields, including computer vision and data mining. Hashing is one of the most widely used methods for its computational and storage efficiency. With the development of deep learning, deep hashing methods show more advantages than traditional methods. In this survey, we detailedly investigate current deep hashing algorithms including deep supervised hashing and deep unsupervised hashing. Specifically, we categorize deep supervised hashing methods into pairwise methods, ranking-based methods, pointwise methods as well as quantization according to how measuring the similarities of the learned hash codes. Moreover, deep unsupervised hashing is categorized into similarity reconstruction-based methods, pseudo-label-based methods and prediction-free self-supervised learning-based methods based on their semantic learning manners. We also introduce three related important topics including semi-supervised deep hashing, domain adaption deep hashing and multi-modal deep hashing. Meanwhile, we present some commonly used public datasets and the scheme to measure the performance of deep hashing algorithms. Finally, we discuss some potential research directions in conclusion.

preprint2022arXiv

A Survey on Dropout Methods and Experimental Verification in Recommendation

Overfitting is a common problem in machine learning, which means the model too closely fits the training data while performing poorly in the test data. Among various methods of coping with overfitting, dropout is one of the representative ways. From randomly dropping neurons to dropping neural structures, dropout has achieved great success in improving model performances. Although various dropout methods have been designed and widely applied in past years, their effectiveness, application scenarios, and contributions have not been comprehensively summarized and empirically compared by far. It is the right time to make a comprehensive survey. In this paper, we systematically review previous dropout methods and classify them into three major categories according to the stage where dropout operation is performed. Specifically, more than seventy dropout methods published in top AI conferences or journals (e.g., TKDE, KDD, TheWebConf, SIGIR) are involved. The designed taxonomy is easy to understand and capable of including new dropout methods. Then, we further discuss their application scenarios, connections, and contributions. To verify the effectiveness of distinct dropout methods, extensive experiments are conducted on recommendation scenarios with abundant heterogeneous information. Finally, we propose some open problems and potential research directions about dropout that worth to be further explored.

preprint2022arXiv

Analysis of the distribution, rotation and scale characteristics of solar wind switchbacks: comparison between the first and second encounters of Parker Solar Probe

The S-shaped magnetic structure in the solar wind formed by the twisting of magnetic field lines is called a switchback, whose main characteristics are the reversal of the magnetic field and the significant increase in the solar wind radial velocity. We identify 242 switchbacks during the first two encounters of Parker Solar Probe (PSP). Statistics methods are applied to analyze the distribution and the rotation angle and direction of the magnetic field rotation of the switchbacks. The diameter of switchbacks is estimated with a minimum variance analysis (MVA) method based on the assumption of a cylindrical magnetic tube. We also make a comparison between switchbacks from inside and the boundary of coronal holes. The main conclusions are as follows: (1) the rotation angles of switchbacks observed during the first encounter seem larger than those of the switchbacks observed during the second encounter in general; (2) the tangential component of the velocity inside the switchbacks tends to be more positive (westward) than in the ambient solar wind; (3) switchbacks are more likely to rotate clockwise than anticlockwise, and the number of switchbacks with clockwise rotation is 1.48 and 2.65 times of those with anticlockwise rotation during the first and second encounters, respectively; (4) the diameter of switchbacks is about 10^5 km on average and across five orders of magnitude (10^3 -- 10^7 km).

preprint2022arXiv

Explicit caching HYB: a new high-performance SpMV framework on GPGPU

Sparse Matrix-Vector Multiplication (SpMV) is a critical operation for the iterative solver of Finite Element Methods on computer simulation. Since the SpMV operation is a memory-bound algorithm, the efficiency of data movements heavily influenced the performance of the SpMV on GPU. In recent years, many research is conducted in accelerating the performance of SpMV on the graphic processing units (GPU). The performance optimization methods used in existing studies focus on the following areas: improve the load balancing between GPU processors, and reduce the execution divergence between GPU threads. Although some studies have made preliminary optimization on the input vector fetching, the effect of explicitly caching the input vector on GPU base SpMV has not been studied in depth yet. In this study, we are trying to minimize the data movements cost for GPU-based SpMV using a new framework named "explicit caching Hybrid (EHYB)". The EHYB framework achieved significant performance improvement by using the following methods: 1. Improve the speed of data movements by partitioning and explicitly caching the input vector to the shared memory of the CUDA kernel. 2. Reduce the volume of data movements by storing the major part of the column index with a compact format. We tested our implementation with sparse matrices derived from FEM applications in different areas. The experiment results show that our implementation can overperform the state-of-the-arts implementation with significant speedup, and leads to higher FLOPs than the theoryperformance up-boundary of the existing GPU-based SpMV implementations.

preprint2022arXiv

Non-Markovian quantum thermometry

The rapidly developing quantum technologies and thermodynamics have put forward a requirement to precisely control and measure the temperature of microscopic matter at the quantum level. Many quantum thermometry schemes have been proposed. However, precisely measuring low temperature is still challenging because the obtained sensing errors generally tend to diverge with decreasing temperature. Using a continuous-variable system as a thermometer, we propose non-Markovian quantum thermometry to measure the temperature of a quantum reservoir. A mechanism to make the sensing error $δT$ scale with the temperature $T$ as the Landau bound $δT\simeq T$ in the full-temperature regime is discovered. Our analysis reveals that it is the quantum criticality of the total thermometer-reservoir system that causes this enhanced sensitivity. Efficiently avoiding the error-divergence problem, our result gives an efficient way to precisely measure the low temperature of quantum systems.

preprint2022arXiv

Recommendation Unlearning

Recommender systems provide essential web services by learning users' personal preferences from collected data. However, in many cases, systems also need to forget some training data. From the perspective of privacy, several privacy regulations have recently been proposed, requiring systems to eliminate any impact of the data whose owner requests to forget. From the perspective of utility, if a system's utility is damaged by some bad data, the system needs to forget these data to regain utility. From the perspective of usability, users can delete noise and incorrect entries so that a system can provide more useful recommendations. While unlearning is very important, it has not been well-considered in existing recommender systems. Although there are some researches have studied the problem of machine unlearning in the domains of image and text data, existing methods can not been directly applied to recommendation as they are unable to consider the collaborative information. In this paper, we propose RecEraser, a general and efficient machine unlearning framework tailored to recommendation task. The main idea of RecEraser is to partition the training set into multiple shards and train a constituent model for each shard. Specifically, to keep the collaborative information of the data, we first design three novel data partition algorithms to divide training data into balanced groups based on their similarity. Then, considering that different shard models do not uniformly contribute to the final prediction, we further propose an adaptive aggregation method to improve the global model utility. Experimental results on three public benchmarks show that RecEraser can not only achieve efficient unlearning, but also outperform the state-of-the-art unlearning methods in terms of model utility. The source code can be found at https://github.com/chenchongthu/Recommendation-Unlearning

preprint2022arXiv

Technical Report (v1.0)--Pseudo-random Cartesian Sampling for Dynamic MRI

For an effective application of compressed sensing (CS), which exploits the underlying compressibility of an image, one of the requirements is that the undersampling artifact be incoherent (noise-like) in the sparsifying transform domain. For cardiovascular MRI (CMR), several pseudo-random sampling methods have been proposed that yield a high level of incoherence. In this technical report, we present a collection of five pseudo-random Cartesian sampling methods that can be applied to 2D cine and flow, 3D volumetric cine, and 4D flow imaging. Four out of the five presented methods yield fast computation for on-the-fly generation of the sampling mask, without the need to create and store pre-computed look-up tables. In addition, the sampling distribution is parameterized, providing control over the sampling density. For each sampling method in the report, (i) we briefly describe the methodology, (ii) list default values of the pertinent parameters, and (iii) provide a publicly available MATLAB implementation.

preprint2022arXiv

Temperature uncertainty relation in non-equilibrium thermodynamics

Temperature uncertainty of a quantum system in canonical ensemble is inversely determined by its energy fluctuation, which is known as the temperature-energy uncertainty relation. No such uncertainty relation was discovered for a non-equilibrium open quantum system. In this article, we derive a universal temperature uncertainty relation for general non-equilibrium processes. We find that it is the fluctuation of heat, which is defined as the change in bath energy, determines the temperature uncertainty in non-equilibrium case. Specifically, the heat is divided into trajectory heat and backaction heat, which are associated with the system's trajectory of evolution and the backaction of partial measurement on system, respectively. Based on this decomposition, we reveal that both correlations between system and bath's process function and state function are the resources for enhancing temperature precision. Our findings are conductive to design ultrahigh sensitive quantum thermometer.

preprint2022arXiv

Towards Representation Alignment and Uniformity in Collaborative Filtering

Collaborative filtering (CF) plays a critical role in the development of recommender systems. Most CF methods utilize an encoder to embed users and items into the same representation space, and the Bayesian personalized ranking (BPR) loss is usually adopted as the objective function to learn informative encoders. Existing studies mainly focus on designing more powerful encoders (e.g., graph neural network) to learn better representations. However, few efforts have been devoted to investigating the desired properties of representations in CF, which is important to understand the rationale of existing CF methods and design new learning objectives. In this paper, we measure the representation quality in CF from the perspective of alignment and uniformity on the hypersphere. We first theoretically reveal the connection between the BPR loss and these two properties. Then, we empirically analyze the learning dynamics of typical CF methods in terms of quantified alignment and uniformity, which shows that better alignment or uniformity both contribute to higher recommendation performance. Based on the analyses results, a learning objective that directly optimizes these two properties is proposed, named DirectAU. We conduct extensive experiments on three public datasets, and the proposed learning framework with a simple matrix factorization model leads to significant performance improvements compared to state-of-the-art CF methods. Our implementations are publicly available at https://github.com/THUwangcy/DirectAU.

preprint2021arXiv

Extracting Quantitative Dielectric Properties from Pump-Probe Spectroscopy

Optical pump-probe spectroscopy is a powerful tool for the study of non-equilibrium electronic dynamics and finds wide applications across a range of fields, from physics and chemistry to material science and biology. However, a shortcoming of conventional pump-probe spectroscopy is that photoinduced changes in transmission, reflection and scattering can simultaneously contribute to the measured differential spectra, leading to ambiguities in assigning the origin of spectral signatures and ruling out quantitative interpretation of the spectra. Ideally, these methods would measure the underlying dielectric function (or the complex refractive index) which would then directly provide quantitative information on the transient excited state dynamics free of these ambiguities. Here we present and test a model independent route to transform differential transmission or reflection spectra, measured via conventional optical pump-probe spectroscopy, to changes in the quantitative transient dielectric function. We benchmark this method against changes in the real refractive index measured using time-resolved Frequency Domain Interferometry in prototypical inorganic and organic semiconductor films. Our methodology can be applied to existing and future pump-probe data sets, allowing for an unambiguous and quantitative characterisation of the transient photoexcited spectra of materials. This in turn will accelerate the adoption of pump-probe spectroscopy as a facile and robust materials characterisation and screening tool.

preprint2021arXiv

Quantum Control in Open and Periodically Driven Systems

Quantum technology resorts to efficient utilization of quantum resources to realize technique innovation. The systems are controlled such that their states follow the desired manners to realize different quantum protocols. However, the decoherence caused by the system-environment interactions causes the states deviating from the desired manners. How to protect quantum resources under the coexistence of active control and passive decoherence is of significance. Recent studies have revealed that the decoherence is determined by the feature of the system-environment energy spectrum: Accompanying the formation of bound states in the energy spectrum, the decoherence can be suppressed. It supplies a guideline to control decoherence. Such idea can be generalized to systems under periodic driving. By virtue of manipulating Floquet bound states in the quasienergy spectrum, coherent control via periodic driving dubbed as Floquet engineering has become a versatile tool not only in controlling decoherence, but also in artificially synthesizing exotic topological phases. We will review the progress on quantum control in open and periodically driven systems. Special attention will be paid to the distinguished role played by the bound states and their controllability via periodic driving in suppressing decoherence and generating novel topological phases.

preprint2020arXiv

Automatic Extraction and Sign Determination of Respiratory Signal in Real-time Cardiac Magnetic Resonance imaging

In real-time (RT) cardiac cine imaging, a stack of 2D slices is collected sequentially under free-breathing conditions. A complete heartbeat from each slice is then used for cardiac function quantification. The inter-slice respiratory mismatch can compromise accurate quantification of cardiac function. Methods based on principal components analysis (PCA) have been proposed to extract the respiratory signal from RT cardiac cine, but these methods cannot resolve the inter-slice sign ambiguity of the respiratory signal. In this work, we propose a fully automatic sign correction procedure based on the similarity of neighboring slices and correlation to the center-of-mass curve. The proposed method is evaluated in eleven volunteers, with ten slices per volunteer. The motion in a manually selected region-of-interest (ROI) is used as a reference. The results show that the extracted respiratory signal has a high, positive correlation with the reference in all cases. The qualitative assessment of images also shows that the proposed approach can accurately identify heartbeats, one from each slice, belonging to the same respiratory phase. This approach can improve cardiac function quantification for RT cine without manual intervention.

preprint2020arXiv

Characteristics and Importance of "ICME-in-Sheath" Phenomenon and Upper Limit for Geomagnetic Storm Activity

As an important source for large geomagnetic storms, an "ICME-in-sheath" is a completely shocked interplanetary coronal mass ejection (ICME) stuck in the sheath between a shock and host ejecta. Typical characteristics are identified from coordinated multi-sets of observations: (1) it is usually short in duration and lasts a few hours at 1 AU; (2) its solar wind parameters, in particular the magnetic field, seem to keep enhanced for a large range of distances; and (3) common ICME signatures are often lost. The host ejecta could be a single ICME or a complex ejecta, being fast enough to drive a shock. These results clarify previous misinterpretations of this phenomenon as a normal part of a sheath region. The "ICME-in-sheath" phenomenon, together with a preconditioning effect, produced an extreme set of the magnetic field, speed and density near 1 AU in the 2012 July 23 case, all around their upper limits at the same time. This is probably the most extreme solar wind driving at 1 AU and enables us to estimate the plausible upper limit for geomagnetic storm activity. With an appropriate modification in the southward field, we suggest that a geomagnetic storm with a minimum $D_{\rm st}$ of about $-2000$ nT could occur in principle. The magnetopause would be compressed to about 3.3 Earth radii from the Earth's center, well inside the geosynchronous orbit.

preprint2020arXiv

Deep Robust Clustering by Contrastive Learning

Recently, many unsupervised deep learning methods have been proposed to learn clustering with unlabelled data. By introducing data augmentation, most of the latest methods look into deep clustering from the perspective that the original image and its transformation should share similar semantic clustering assignment. However, the representation features could be quite different even they are assigned to the same cluster since softmax function is only sensitive to the maximum value. This may result in high intra-class diversities in the representation feature space, which will lead to unstable local optimal and thus harm the clustering performance. To address this drawback, we proposed Deep Robust Clustering (DRC). Different from existing methods, DRC looks into deep clustering from two perspectives of both semantic clustering assignment and representation feature, which can increase inter-class diversities and decrease intra-class diversities simultaneously. Furthermore, we summarized a general framework that can turn any maximizing mutual information into minimizing contrastive loss by investigating the internal relationship between mutual information and contrastive learning. And we successfully applied it in DRC to learn invariant features and robust clusters. Extensive experiments on six widely-adopted deep clustering benchmarks demonstrate the superiority of DRC in both stability and accuracy. e.g., attaining 71.6% mean accuracy on CIFAR-10, which is 7.1% higher than state-of-the-art results.

preprint2020arXiv

OCMR (v1.0)--Open-Access Multi-Coil k-Space Dataset for Cardiovascular Magnetic Resonance Imaging

Cardiovascular MRI (CMR) is a non-invasive imaging modality that provides excellent soft-tissue contrast without the use of ionizing radiation. Physiological motions and limited speed of MRI data acquisition necessitate development of accelerated methods, which typically rely on undersampling. Recovering diagnostic quality CMR images from highly undersampled data has been an active area of research. Recently, several data acquisition and processing methods have been proposed to accelerate CMR. The availability of data to objectively evaluate and compare different reconstruction methods could expedite innovation and promote clinical translation of these methods. In this work, we introduce an open-access dataset, called OCMR, that provides multi-coil k-space data from 53 fully sampled and 212 prospectively undersampled cardiac cine series.