Source author record

Bin Huang

Bin Huang appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Artificial Intelligence math.AP Molecular Networks Cell Behavior Computer Vision Distributed, Parallel, and Cluster Computing Machine Learning Quantitative Methods Biological Physics cond-mat.mes-hall cond-mat.mtrl-sci hep-ex Information Retrieval physics.comp-ph physics.soc-ph q-fin.ST Social and Information Networks

Catalog footprint

What is connected

20works

17topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

Asymmetric Generative Recommendation via Multi-Expert Projection and Multi-Faceted Hierarchical Quantization

Generative Recommendation (GenRec) models reformulate recommendation as a sequence generation task, representing items as discrete Semantic IDs used symmetrically as both inputs and prediction targets. We identify a critical dual-stage information bottleneck in this design: (1) the Input Bottleneck, where lossy quantization degrades fine-grained semantics, while popularity bias skews the learned representations toward frequent items, and (2) the Output Bottleneck, where imprecise discrete targets limit supervision quality. To address these issues, we propose AsymRec, an asymmetric continuous-discrete framework that decouples input and output representations. Specifically, Multi-expert Semantic Projection (MSP) maps continuous embeddings into the Transformer's hidden space via expert-specialized projections, preserving semantic richness and improving generalization to infrequent items. Multi-faceted Hierarchical Quantization (MHQ) constructs high-capacity, structured discrete targets through multi-view and multi-level quantization with semantic regularization, preventing dimensional collapse while retaining fine-grained distinctions. Extensive experiments demonstrate that AsymRec consistently outperforms state-of-the-art generative recommenders by an average of 15.8 %. The code will be released.

preprint2026arXiv

MindWatcher: Toward Smarter Multimodal Tool-Integrated Reasoning

Traditional workflow-based agents exhibit limited intelligence when addressing real-world problems requiring tool invocation. Tool-integrated reasoning (TIR) agents capable of autonomous reasoning and tool invocation are rapidly emerging as a powerful approach for complex decision-making tasks involving multi-step interactions with external environments. In this work, we introduce MindWatcher, a TIR agent integrating interleaved thinking and multimodal chain-of-thought (CoT) reasoning. MindWatcher can autonomously decide whether and how to invoke diverse tools and coordinate their use, without relying on human prompts or workflows. The interleaved thinking paradigm enables the model to switch between thinking and tool calling at any intermediate stage, while its multimodal CoT capability allows manipulation of images during reasoning to yield more precise search results. We implement automated data auditing and evaluation pipelines, complemented by manually curated high-quality datasets for training, and we construct a benchmark, called MindWatcher-Evaluate Bench (MWE-Bench), to evaluate its performance. MindWatcher is equipped with a comprehensive suite of auxiliary reasoning tools, enabling it to address broad-domain multimodal problems. A large-scale, high-quality local image retrieval database, covering eight categories including cars, animals, and plants, endows model with robust object recognition despite its small size. Finally, we design a more efficient training infrastructure for MindWatcher, enhancing training speed and hardware utilization. Experiments not only demonstrate that MindWatcher matches or exceeds the performance of larger or more recent models through superior tool invocation, but also uncover critical insights for agent training, such as the genetic inheritance phenomenon in agentic RL.

preprint2026arXiv

Safactory: A Scalable Agentic Infrastructure for Training Trustworthy Autonomous Intelligence

As large models evolve from conversational assistants into autonomous agents, challenges increasingly arise from long-horizon decision making, tool use, and real environment interaction. Existing agenticinfrastructure remain fragmented across evaluation, data management, and agent evolution, making it difficult to discover risks systematically and improve models in a continuous closed loop. In this report, we present \textbf{Safactory}, a scalable agent factory for trustworthy autonomous intelligence. Safactory integrates three tightly coupled platforms: a \textbf{Parallel Simulation Platform} for trajectory generation, a \textbf{Trustworthy Data Platform} for trajectory storage and experience extraction, and an \textbf{Autonomous Evolution Platform} for asynchronous reinforcement learning and on-policy distillation. As far as we know, Safactory is the first framework to propose a unified evolutionary pipeline for next-generation trustworthy autonomous intelligence.

preprint2023arXiv

Fast-BEV: Towards Real-time On-vehicle Bird's-Eye View Perception

Recently, the pure camera-based Bird's-Eye-View (BEV) perception removes expensive Lidar sensors, making it a feasible solution for economical autonomous driving. However, most existing BEV solutions either suffer from modest performance or require considerable resources to execute on-vehicle inference. This paper proposes a simple yet effective framework, termed Fast-BEV, which is capable of performing real-time BEV perception on the on-vehicle chips. Towards this goal, we first empirically find that the BEV representation can be sufficiently powerful without expensive view transformation or depth representation. Starting from M2BEV baseline, we further introduce (1) a strong data augmentation strategy for both image and BEV space to avoid over-fitting (2) a multi-frame feature fusion mechanism to leverage the temporal information (3) an optimized deployment-friendly view transformation to speed up the inference. Through experiments, we show Fast-BEV model family achieves considerable accuracy and efficiency on edge. In particular, our M1 model (R18@256x704) can run over 50FPS on the Tesla T4 platform, with 47.0% NDS on the nuScenes validation set. Our largest model (R101@900x1600) establishes a new state-of-the-art 53.5% NDS on the nuScenes validation set. The code is released at: https://github.com/Sense-GVT/Fast-BEV.

preprint2023arXiv

Financial Time-Series Forecasting: Towards Synergizing Performance And Interpretability Within a Hybrid Machine Learning Approach

In the realm of cryptocurrency, the prediction of Bitcoin prices has garnered substantial attention due to its potential impact on financial markets and investment strategies. This paper propose a comparative study on hybrid machine learning algorithms and leverage on enhancing model interpretability. Specifically, linear regression(OLS, LASSO), long-short term memory(LSTM), decision tree regressors are introduced. Through the grounded experiments, we observe linear regressor achieves the best performance among candidate models. For the interpretability, we carry out a systematic overview on the preprocessing techniques of time-series statistics, including decomposition, auto-correlational function, exponential triple forecasting, which aim to excavate latent relations and complex patterns appeared in the financial time-series forecasting. We believe this work may derive more attention and inspire more researches in the realm of time-series analysis and its realistic applications.

preprint2023arXiv

GraphTheta: A Distributed Graph Neural Network Learning System With Flexible Training Strategy

Graph neural networks (GNNs) have been demonstrated as a powerful tool for analyzing non-Euclidean graph data. However, the lack of efficient distributed graph learning systems severely hinders applications of GNNs, especially when graphs are big and GNNs are relatively deep. Herein, we present GraphTheta, the first distributed and scalable graph learning system built upon vertex-centric distributed graph processing with neural network operators implemented as user-defined functions. This system supports multiple training strategies and enables efficient and scalable big-graph learning on distributed (virtual) machines with low memory. To facilitate graph convolutions, GraphTheta puts forward a new graph learning abstraction named NN-TGAR to bridge the gap between graph processing and graph deep learning. A distributed graph engine is proposed to conduct the stochastic gradient descent optimization with a hybrid-parallel execution, and a new cluster-batched training strategy is supported. We evaluate GraphTheta using several datasets with network sizes ranging from small-, modest- to large-scale. Experimental results show that GraphTheta can scale well to 1,024 workers for training an in-house developed GNN on an industry-scale Alipay dataset of 1.4 billion nodes and 4.1 billion attributed edges, with a cluster of CPU virtual machines (dockers) of small memory each (5$\sim$12GB). Moreover, GraphTheta can outperform DistDGL by up to $2.02\times$, with better scalability, and GraphLearn by up to $30.56\times$. As for model accuracy, GraphTheta is capable of learning as good GNNs as existing frameworks. To the best of our knowledge, this work presents the largest edge-attributed GNN learning task in the literature.

preprint2023arXiv

Parallel Reasoning Network for Human-Object Interaction Detection

Human-Object Interaction (HOI) detection aims to learn how human interacts with surrounding objects. Previous HOI detection frameworks simultaneously detect human, objects and their corresponding interactions by using a predictor. Using only one shared predictor cannot differentiate the attentive field of instance-level prediction and relation-level prediction. To solve this problem, we propose a new transformer-based method named Parallel Reasoning Network(PR-Net), which constructs two independent predictors for instance-level localization and relation-level understanding. The former predictor concentrates on instance-level localization by perceiving instances' extremity regions. The latter broadens the scope of relation region to reach a better relation-level semantic understanding. Extensive experiments and analysis on HICO-DET benchmark exhibit that our PR-Net effectively alleviated this problem. Our PR-Net has achieved competitive results on HICO-DET and V-COCO benchmarks.

preprint2022arXiv

Global Strong and Weak Solutions to the Initial-boundary-value Problem of 2D Compressible MHD System with Large Initial Data and Vacuum

In this paper, we study the barotropic compressible magnetohydrodynamic equations with the shear viscosity being a positive constant and the bulk one being proportional to a power of the density in a general two-dimensional bounded simply connected domain. For initial density allowed to vanish, we prove that the initial-boundary-value problem of 2D compressible MHD system admits the global strong and weak solutions without any restrictions on the size of initial data provided the shear viscosity is a positive constant and bulk one is $λ=ρ^β$ with $β>4/3$. As we known, this is the first result concerning the global existence of strong solutions to the compressible MHD system in general two-dimensional bounded domains with large initial data and vacuum.

preprint2021arXiv

Global Well-posedness of Classical Solutions to the Compressible Navier-Stokes-Poisson Equations with Slip Boundary Conditions in 3D Bounded Domains

We consider the initial-boundary-value problem of the isentropic compressible Navier-Stokes-Poisson equations subject to large and non-flat doping profile in 3D bounded domain with slip boundary condition and vacuum. The global well-posedness of classical solution is established with small initial energy but possibly large oscillations and vacuum. The steady state (except velocity) and the doping profile are allowed to be of large variation.

preprint2021arXiv

On Compressible Navier-Stokes Equations Subject to Large Potential Forces with Slip Boundary Conditions in 3D Bounded Domains

We deal with the barotropic compressible Navier-Stokes equations subject to large external potential forces with slip boundary condition in a 3D simply connected bounded domain, whose smooth boundary has a finite number of 2D connected components. The global existence of strong or classical solutions to the initial boundary value problem of this system is established provided the initial energy is suitably small. Moreover, the density has large oscillations and contains vacuum states. Finally, we show that the global strong or classical solutions decay exponentially in time to the equilibrium in some Sobolev's spaces, but the oscillation of the density will grow unboundedly in the long run with an exponential rate when the initial density contains vacuum states.

preprint2020arXiv

Computational prediction of RNA tertiary structures using machine learning methods

RNAs play crucial and versatile roles in biological processes. Computational prediction approaches can help to understand RNA structures and their stabilizing factors, thus providing information on their functions, and facilitating the design of new RNAs. Machine learning (ML) techniques have made tremendous progress in many fields in the past few years. Although their usage in protein-related fields has a long history, the use of ML methods in predicting RNA tertiary structures is new and rare. Here, we review the recent advances of using ML methods on RNA structure predictions and discuss the advantages and limitation, the difficulties and potentials of these approaches when applied in the field.

preprint2020arXiv

Decoding the mechanisms underlying cell-fate decision-making during stem cell differentiation by Random Circuit Perturbation

Stem cells can precisely and robustly undergo cellular differentiation and lineage commitment, referred to as stemness. However, how the gene network underlying stemness regulation reliably specifies cell fates is not well understood. To address this question, we applied a recently developed computational method, Random Circuit Perturbation (RACIPE), to a nine-component gene regulatory network (GRN) governing stemness, from which we identified fifteen robust gene states. Among them, four out of the five most probable gene states exhibit gene expression patterns observed in single mouse embryonic cells at 32-cell and 64-cell stages. These gene states can be robustly predicted by the stemness GRN but not by randomized versions of the stemness GRN. Strikingly, we found a hierarchical structure of the GRN with the Oct4/Cdx2 motif functioning as the first decision-making module followed by Gata6/Nanog. We propose that stem cell populations, instead of being viewed as all having a specific cellular state, can be regarded as a heterogeneous mixture including cells in various states. Upon perturbations by external signals, stem cells lose the capacity to access certain cellular states, thereby becoming differentiated. The findings demonstrate that the functions of the stemness GRN is mainly determined by its well-evolved network topology rather than by detailed kinetic parameters.

preprint2018arXiv

Global Strong Solutions to Magnetohydrodynamics with Density-Dependent Viscosity and Degenerate Heat-Conductivity

We deal with the equations of a planar magnetohydrodynamic compressible flow with the viscosity depending on the specific volume of the gas and the heat conductivity proportional to a positive power of the temperature. Under the same conditions on the initial data as those of the constant viscosity and heat conductivity case ([Kazhikhov (1987)], we obtain the global existence and uniqueness of strong solutions which means no shock wave, vacuum, or mass or heat concentration will be developed in finite time, although the motion of the flow has large oscillations and the interaction between the hydrodynamic and magnetodynamic effects is complex. Our result can be regarded as a natural generalization of the Kazhikhov's theory for the constant viscosity and heat conductivity case to that of nonlinear viscosity and degenerate heat-conductivity.

preprint2016arXiv

Modeling delayed processes in biological systems

Delayed processes are ubiquitous in biological systems and are often characterized by delay differential equations (DDEs) and their extension to include stochastic effects. DDEs do not explicitly incorporate intermediate states associated with a delayed process but instead use an estimated average delay time. In an effort to examine the validity of this approach, we study systems with significant delays by explicitly incorporating intermediate steps. We show by that such explicit models often yield significantly different equilibrium distributions and transition times as compared to DDEs with deterministic delay values. Additionally, different explicit models with qualitatively different dynamics can give rise to the same DDEs revealing important ambiguities. We also show that DDE-based predictions of oscillatory behavior may fail for the corresponding explicit model.

preprint2016arXiv

Role of remote interfacial phonon (RIP) scattering in heat transport across graphene/SiO2 interfaces

Heat transfer across interfaces of graphene and polar dielectrics (e.g. SiO2) could be mediated by direct phonon coupling, as well as electronic coupling with remote interfacial phonons (RIPs). To understand the relative contribution of each component, we develop a new pump-probe technique, called voltage-modulated thermoreflectance (VMTR), to accurately measure the change of interfacial thermal conductance under an electrostatic field. We employed VMTR on top gates of graphene field-effect transistors and find that the thermal conductance of SiO2/graphene/SiO2 interfaces increases by up to ΔG=0.8 MW m-2 K-1 under electrostatic fields of <0.2 V nm-1 . We propose two possible explanations for the observed ΔG. First, since the applied electrostatic field induces charge carriers in graphene, our VMTR measurements could originate from heat transfer between the charge carriers in graphene and RIPs in SiO2. Second, the increase in heat conduction could be caused by better conformity of graphene interfaces un-der electrostatic pressure exerted by the induced charge carriers. Regardless of the origins of the observed ΔG, our VMTR measurements establish an upper limit for heat transfer from unbiased graphene to SiO2 substrates via RIP scattering; i.e., only <2 % of the interfacial heat transport is facilitated by RIP scattering even at a carrier concentration of 4x10^12 cm-2.

preprint2015arXiv

Implications of the hybrid epithelial/mesenchymal phenotype in metastasis

Understanding cell-fate decisions during tumorigenesis and metastasis is a major challenge in modern cancer biology. One canonical cell-fate decision that cancer cells undergo is Epithelial-to-Mesenchymal Transition (EMT) and its reverse Mesenchymal-to-Epithelial Transition (MET). While transitioning between these two phenotypes - epithelial and mesenchymal - cells can also attain a hybrid epithelial/mesenchymal (i.e. partial or intermediate EMT) phenotype. Cells in this phenotype have mixed epithelial (e.g. adhesion) and mesenchymal (e.g. migration) properties, thereby allowing them to move collectively as clusters of Circulating Tumor Cells (CTCs). If these clusters enter the circulation, they can be more apoptosis-resistant and more capable of initiating metastatic lesions than cancer cells moving individually with wholly mesenchymal phenotypes, having undergo a complete EMT. Here, we review the operating principles of the core regulatory network for EMT/MET that acts as a three-way switch giving rise to three distinct phenotypes - epithelial, mesenchymal and hybrid epithelial/mesenchymal. We further characterize this hybrid E/M phenotype in terms of its capabilities in terms of collective cell migration, tumor-initiation, cell-cell communication, and drug resistance. We elucidate how the highly interconnected coupling between these modules coordinates cell-fate decisions among a population of cancer cells in the dynamic tumor, hence facilitating tumor-stoma interactions, formation of CTC clusters, and consequently cancer metastasis. Finally, we discuss the multiple advantages that the hybrid epithelial/mesenchymal phenotype have as compared to a complete EMT phenotype and argue that these collectively migrating cells are the primary 'bad actors' of metastasis.

preprint2015arXiv

Improved topological conformity enhances heat conduction across metal contacts on transferred graphene

Thermal conductance of metal contacts on transferred graphene (trG) could be significantly reduced from the intrinsic value of similar contacts on as-grown graphene (grG), due to additional resistance by increased roughness, residues, oxides and voids. In this paper, we compare the thermal conductance (G) of Al/trG/Cu interfaces with that of Al/grG/Cu interfaces to understand heat transfer across metal contacts on transferred graphene. Our samples are polycrystalline graphene grown on Cu foils by chemical vapor deposition (CVD) and CVD-grown graphene transferred to evaporated Cu thin films. We find that for the Al/grG/Cu interfaces of as-grown CVD graphene, G=31 MW m^{-2} K^{-1} at room temperature, two orders of magnitude lower than that of Al/Cu interfaces. For most as-transferred graphene on Cu films, G=20 MW m^{-2} K^{-1}, 35% lower than that of as-grown CVD graphene. We carefully rule out the contributions of residues, native oxides and interfaces roughness, and attribute the difference in the thermal conductance of as-grown and as-transferred CVD graphene to different degrees of conformity of graphene to the Cu substrates. We find that a contact area of 50% only reduces the thermal conductance by 35%, suggesting that a small amount of heat transfer occurs across voids at graphene interfaces. We successfully improve the conformity of the as-transferred graphene to the substrates by annealing the samples at 300°C, and thus enhance the thermal conductance of the transferred graphene to the intrinsic value. From the temperature dependence measurements of G of Al/trG/Cu and Al/grG/Cu interfaces, we also confirm that phonons are the dominant heat carries across the metal/graphene/metal interfaces despite a substantial carrier concentration of 3x10^{12} cm^{-2} induced in the graphene.

preprint2015arXiv

Tuning and Optimizing the Finite Element Analysis with Elements of Large Nodal DOF on a Linux Cluster

The finite element analysis of high frequency vibrations of quartz crystal plates is a necessary process required in the design of quartz crystal resonators of precision types for applications in filters and sensors. The anisotropic materials and extremely high frequency in radiofrequency range of resonators determine that vibration frequency spectra are complicated with strong couplings of large number of different vibration modes representing deformations which do not appear in usual structural problems. For instance, the higher-order thickness-shear vibrations usually representing the sharp deformation of thin plates in the thickness direction, expecting the analysis is to be done with refined meshing schemes along the relatively small thickness and consequently the large plane area. To be able to represent the precise vibration mode shapes, a very large number of elements are needed in the finite element analysis with either the three-dimensional theory or the higher-order plate theory, although considerable reduction of numbers of degree-of-freedom (DOF) are expected for the two-dimensional analysis without scarifying the accuracy. In this paper, we reviewed the software architecture for the analysis and demonstrated the evaluation and tuning of parameters for the improvement of the analysis with problems of elements with a large number of DOF in each node, or a problem with unusually large bandwidth of the banded stiffness and mass matrices in comparison with conventional finite element formulation. Such a problem can be used as an example for the optimization and tuning of problems from multi-physics analysis which are increasingly important in applications with excessive large number of DOF and bandwidth in engineering.

preprint2014arXiv

Identifying effective multiple spreaders by coloring complex networks

How to identify influential nodes in social networks is of theoretical significance, which relates to how to prevent epidemic spreading or cascading failure, how to accelerate information diffusion, and so on. In this Letter, we make an attempt to find \emph{effective multiple spreaders} in complex networks by generalizing the idea of the coloring problem in graph theory to complex networks. In our method, each node in a network is colored by one kind of color and nodes with the same color are sorted into an independent set. Then, for a given centrality index, the nodes with the highest centrality in an independent set are chosen as multiple spreaders. Comparing this approach with the traditional method, in which nodes with the highest centrality from the \emph{entire} network perspective are chosen, we find that our method is more effective in accelerating the spreading process and maximizing the spreading coverage than the traditional method, no matter in network models or in real social networks. Meanwhile, the low computational complexity of the coloring algorithm guarantees the potential applications of our method.

preprint2008arXiv

A simulation study on the measurement of D0-D0bar mixing parameter y at BES-III

We established a method on measuring the $\dzdzb$ mixing parameter $y$ for BESIII experiment at the BEPCII $e^+e^-$ collider. In this method, the doubly tagged $ψ(3770) \to D^0 \overline{D^0}$ events, with one $D$ decays to CP-eigenstates and the other $D$ decays semileptonically, are used to reconstruct the signals. Since this analysis requires good $e/π$ separation, a likelihood approach, which combines the $dE/dx$, time of flight and the electromagnetic shower detectors information, is used for particle identification. We estimate the sensitivity of the measurement of $y$ to be 0.007 based on a $20fb^{-1}$ fully simulated MC sample.

Bin Huang

What is connected

Connect this record

See the researcher in context

Building this map preview

20 published item(s)

Asymmetric Generative Recommendation via Multi-Expert Projection and Multi-Faceted Hierarchical Quantization

MindWatcher: Toward Smarter Multimodal Tool-Integrated Reasoning

Safactory: A Scalable Agentic Infrastructure for Training Trustworthy Autonomous Intelligence

Fast-BEV: Towards Real-time On-vehicle Bird's-Eye View Perception

Financial Time-Series Forecasting: Towards Synergizing Performance And Interpretability Within a Hybrid Machine Learning Approach

GraphTheta: A Distributed Graph Neural Network Learning System With Flexible Training Strategy

Parallel Reasoning Network for Human-Object Interaction Detection

Global Strong and Weak Solutions to the Initial-boundary-value Problem of 2D Compressible MHD System with Large Initial Data and Vacuum

Global Well-posedness of Classical Solutions to the Compressible Navier-Stokes-Poisson Equations with Slip Boundary Conditions in 3D Bounded Domains

On Compressible Navier-Stokes Equations Subject to Large Potential Forces with Slip Boundary Conditions in 3D Bounded Domains

Computational prediction of RNA tertiary structures using machine learning methods

Decoding the mechanisms underlying cell-fate decision-making during stem cell differentiation by Random Circuit Perturbation

Global Strong Solutions to Magnetohydrodynamics with Density-Dependent Viscosity and Degenerate Heat-Conductivity

Modeling delayed processes in biological systems

Role of remote interfacial phonon (RIP) scattering in heat transport across graphene/SiO2 interfaces

Implications of the hybrid epithelial/mesenchymal phenotype in metastasis

Improved topological conformity enhances heat conduction across metal contacts on transferred graphene

Tuning and Optimizing the Finite Element Analysis with Elements of Large Nodal DOF on a Linux Cluster

Identifying effective multiple spreaders by coloring complex networks

A simulation study on the measurement of D0-D0bar mixing parameter y at BES-III