Source author record

Feng Huang

Feng Huang appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.


Catalog footprint

What is connected

18 works

25 topics

4 close collaborators


preprint 2026 arXiv

Global Context Compression with Interleaved Vision-Text Transformation

Recent achievements of vision-language models in end-to-end OCR point to a new avenue for low-loss compression of textual information. This motivates earlier works that render the Transformer's input into images for prefilling, which effectively reduces the number of tokens through visual encoding, thereby alleviating the quadratically increased Attention computations. However, this partial compression fails to save computational or memory costs at token-by-token inference. In this paper, we investigate global context compression, which saves tokens at both prefilling and inference stages. Consequently, we propose VIST2, a novel Transformer that interleaves input text chunks alongside their visual encoding, while depending exclusively on visual tokens in the pre-context to predict the next text token distribution. Around this idea, we render text chunks into sketch images and train VIST2 in multiple stages, starting from curriculum-scheduled pretraining for optical language modeling, followed by modal-interleaved instruction tuning. We conduct extensive experiments using VIST2 families scaled from 0.6B to 8B to explore the training recipe and hyperparameters. With a 4$\times$ compression ratio, the resulting models demonstrate significant superiority over baselines on long writing tasks, achieving, on average, a 3$\times$ speedup in first-token generation, 77% reduction in memory usage, and 74% reduction in FLOPS. Our codes and datasets will be public to support further studies.

preprint 2024 arXiv

A Multi-Modal Contrastive Diffusion Model for Therapeutic Peptide Generation

Therapeutic peptides represent a unique class of pharmaceutical agents crucial for the treatment of human diseases. Recently, deep generative models have exhibited remarkable potential for generating therapeutic peptides, but they only utilize sequence or structure information alone, which hinders the performance in generation. In this study, we propose a Multi-Modal Contrastive Diffusion model (MMCD), fusing both sequence and structure modalities in a diffusion framework to co-generate novel peptide sequences and structures. Specifically, MMCD constructs the sequence-modal and structure-modal diffusion models, respectively, and devises a multi-modal contrastive learning strategy with inter-contrastive and intra-contrastive in each diffusion timestep, aiming to capture the consistency between two modalities and boost model performance. The inter-contrastive aligns sequences and structures of peptides by maximizing the agreement of their embeddings, while the intra-contrastive differentiates therapeutic and non-therapeutic peptides by maximizing the disagreement of their sequence/structure embeddings simultaneously. The extensive experiments demonstrate that MMCD performs better than other state-of-the-art deep generative methods in generating therapeutic peptides across various metrics, including antimicrobial/anticancer score, diversity, and peptide-docking.

preprint 2022 arXiv

Dark matter admixed neutron star properties in the light of X-ray pulse profile observations

The distribution of the dark matter (DM) in DM-admixed-neutron stars (DANSs) is supposed to be either a dense dark core or an extended dark halo, which is subject to the DM fraction of DANS ($f_χ$) and the DM properties, such as the mass ($m_χ$) and the strength of the self-interaction ($y$). In this paper, we perform an in-depth analysis of the formation criterion for dark core/dark halo and point out that the relative distribution of these two components is essentially determined by the ratio of the central enthalpy of the DM component to that of the baryonic matter component inside DANSs. For the critical case where the radii of DM and baryonic matter are the same, we further derive an analytical formula to describe the dependence of $f^{\rm crit}_χ$ on $m_χ$ and $y$ for given DANS mass. The relative distribution of the two components in DANSs can lead to different observational effects. We here focus on the modification of the pulsar pulse profile due to the extra light-bending effect in the case of a dark-halo existence and conduct the first investigation of the dark-halo effects on the pulse profile. We find that the peak flux deviation is strongly dependent on the ratio of the halo mass to the radius of the DM component. Lastly, we perform Bayesian parameter estimation on the DM particle properties based on the recent X-ray observations of PSR J0030+0451 and PSR J0740+6620 by the Neutron Star Interior Composition Explorer.

preprint 2022 arXiv

First passage of a diffusing particle under stochastic resetting in bounded domains with spherical symmetry

We investigate the first passage properties of a Brownian particle diffusing freely inside a $d$-dimensional sphere with absorbing spherical surface subject to stochastic resetting. We derive the mean time to absorption (MTA) as functions of resetting rate $γ$ and initial distance $r$ of the particle to the center of the sphere. We find that when $r>r_c$ there exists a nonzero optimal resetting rate $γ_{\rm opt}$ at which the MTA is a minimum, where $r_c=\sqrt {d/\left( {d + 4} \right)} R$ and $R$ is the radius of sphere. As $r$ increases, $γ_{\rm opt}$ exhibits a continuous transition from zero to nonzero at $r=r_c$. Furthermore, we consider that the particle lies in between two two-dimensional or three-dimensional concentric spheres, and obtain the domain in which resetting expedites the MTA, which is $(R_1, r_{c_1}) \cup (r_{c_2},R_2)$, with $R_1$ and $R_2$ being the radius of inner and outer spheres, respectively. Interestingly, when $R_1/R_2$ is less than a critical value, $γ_{\rm opt}$ exhibits a discontinuous transition at $r=r_{c_1}$; otherwise, such a transition is continuous. However, at $r=r_{c_2}$, $γ_{\rm opt}$ always shows a continuous transition.
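The critical distance quoted in this abstract, $r_c=\sqrt{d/(d+4)}\,R$, is easy to evaluate directly. The following is a minimal sketch for the single-sphere case only; the function names `critical_radius` and `resetting_helps` are hypothetical and chosen for illustration, and the check simply restates the abstract's criterion that a nonzero optimal resetting rate exists when $r>r_c$.

```python
import math


def critical_radius(d: int, R: float = 1.0) -> float:
    """Return r_c = sqrt(d / (d + 4)) * R for a d-dimensional sphere of radius R."""
    if d < 1 or R <= 0:
        raise ValueError("d must be a positive integer and R must be positive")
    return math.sqrt(d / (d + 4)) * R


def resetting_helps(r: float, d: int, R: float = 1.0) -> bool:
    """True when the initial distance r exceeds r_c (criterion stated in the abstract)."""
    if not 0 <= r <= R:
        raise ValueError("r must lie inside the sphere")
    return r > critical_radius(d, R)


if __name__ == "__main__":
    for d in (1, 2, 3):
        print(d, round(critical_radius(d), 4))
```

As $d$ grows, $r_c$ approaches $R$, so the region of starting points where resetting shortens the mean time to absorption shrinks toward the surface.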

preprint 2022 arXiv

Robust optimal policies for team Markov games

In stochastic dynamic environments, team Markov games have emerged as a versatile paradigm for studying sequential decision-making problems of fully cooperative multi-agent systems. However, the optimality of the derived policies is usually sensitive to model parameters, which are typically unknown and required to be estimated from noisy data in practice. To mitigate the sensitivity of optimal policies to these uncertain parameters, we propose a robust model of team Markov games in this paper, where agents utilize robust optimization approaches to update strategies. This model extends team Markov games to the scenario of incomplete information and meanwhile provides an alternative solution concept of robust team optimality. To seek such a solution, we develop a robust iterative learning algorithm of team policies and prove its convergence. This algorithm, compared with robust dynamic programming, not only possesses a faster convergence rate, but also allows for using approximation calculations to alleviate the curse of dimensionality. Moreover, some numerical simulations are presented to demonstrate the effectiveness of the algorithm by generalizing the game model of sequential social dilemmas to uncertain scenarios.

preprint 2022 arXiv

Simulation of the FDA Nozzle Benchmark: A Lattice Boltzmann Study

Background and objective: Contrary to flows in small intracranial vessels, many blood flow configurations such as those found in aortic vessels and aneurysms involve larger Reynolds numbers and, therefore, transitional or turbulent conditions. Dealing with such systems requires both robust and efficient numerical methods. Methods: We assess here the performance of a lattice Boltzmann solver with full Hermite expansion of the equilibrium and central Hermite moments collision operator at higher Reynolds numbers, especially for under-resolved simulations. To that end, the Food and Drug Administration's benchmark nozzle is considered at three different Reynolds numbers covering all regimes: 1) laminar at a Reynolds number of 500, 2) transitional at a Reynolds number of 3500, and 3) low-level turbulence at a Reynolds number of 6500. Results: The lattice Boltzmann results are compared with previously published inter-laboratory experimental data obtained by particle image velocimetry. Our results show good agreement with the experimental measurements throughout the nozzle, demonstrating the good performance of the solver even in under-resolved simulations. Conclusion: In this manner, fast but sufficiently accurate numerical predictions can be achieved for flow configurations of practical interest regarding medical applications.

preprint 2021 arXiv

First passage in discrete-time absorbing Markov chains under stochastic resetting

First passage of stochastic processes under resetting has recently been an active research topic in the field of statistical physics. However, most previous studies mainly focused on systems with continuous time and space. In this paper, we study the effect of stochastic resetting on first passage properties of discrete-time absorbing Markov chains, described by a transition matrix $\mathbf{Q}$ between transient states and a transition matrix $\mathbf{R}$ from transient states to absorbing states. Using a renewal approach, we exactly derive the unconditional mean first passage time (MFPT) to either of the absorbing states, the splitting probability, and the conditional MFPT to each absorbing state. All the quantities can be expressed in terms of a deformed fundamental matrix $\mathbf{Z}_γ=\left[\mathbf{I}-(1-γ) \mathbf{Q} \right]^{-1}$ and $\mathbf{R}$, where $\mathbf{I}$ is the identity matrix, and $γ$ is the resetting probability at each time step. We further show a sufficient condition under which the unconditional MFPT can be optimized by stochastic resetting. Finally, we apply our results to two concrete examples: symmetric random walks on one-dimensional lattices with absorbing boundaries and the voter model on complete graphs.
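The deformed fundamental matrix in this abstract can be explored numerically. The following is a minimal sketch, not the paper's code. It assumes one specific resetting convention that the abstract does not spell out: at each time step the walker returns to its starting state with probability $γ$, and otherwise moves according to $\mathbf{Q}$ and $\mathbf{R}$. Under that assumption, writing $u = \mathbf{Z}_γ\mathbf{1}$, a one-step renewal argument gives an unconditional MFPT from the start state $s$ of $u_s/(1-γu_s)$. The helper names (`solve`, `walk_Q`, `mfpt_with_resetting`) are hypothetical.

```python
def solve(A, b):
    """Solve A x = b by Gauss-Jordan elimination with partial pivoting."""
    n = len(A)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for c in range(n):
        p = max(range(c, n), key=lambda r: abs(M[r][c]))
        M[c], M[p] = M[p], M[c]
        for r in range(n):
            if r != c:
                f = M[r][c] / M[c][c]
                M[r] = [a - f * pv for a, pv in zip(M[r], M[c])]
    return [M[i][n] / M[i][i] for i in range(n)]


def walk_Q(N):
    """Q for a symmetric walk on sites 0..N, absorbing at 0 and N (transient sites 1..N-1)."""
    n = N - 1
    return [[0.5 if abs(i - j) == 1 else 0.0 for j in range(n)] for i in range(n)]


def mfpt_with_resetting(Q, start, gamma):
    """Unconditional MFPT from index `start`, resetting to `start` with probability gamma.

    Assumed convention (not taken from the paper): reset with probability gamma,
    otherwise take one step of the original chain.
    """
    n = len(Q)
    # A = I - (1 - gamma) Q, so solving A u = 1 gives u = Z_gamma @ 1.
    A = [[(1.0 if i == j else 0.0) - (1.0 - gamma) * Q[i][j] for j in range(n)]
         for i in range(n)]
    u = solve(A, [1.0] * n)
    return u[start] / (1.0 - gamma * u[start])


if __name__ == "__main__":
    Q = walk_Q(4)  # transient sites 1, 2, 3 stored at indices 0, 1, 2
    print(mfpt_with_resetting(Q, 1, 0.0))  # no resetting, start at the middle site
    print(mfpt_with_resetting(Q, 1, 0.5))  # resetting to the middle site
```

With $γ=0$ the expression reduces to the row sum of the ordinary fundamental matrix $(\mathbf{I}-\mathbf{Q})^{-1}$, which for a symmetric walk on sites $0..N$ started at site $i$ is $i(N-i)$. Starting from the middle of the lattice, resetting only delays absorption in this small example.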

preprint 2020 arXiv

A Guaranteed Convergence Analysis for the Projected Fast Iterative Soft-Thresholding Algorithm in Parallel MRI

The boom of non-uniform sampling and compressed sensing techniques dramatically alleviates the lengthy data acquisition problem of magnetic resonance imaging. Sparse reconstruction, thanks to its fast computation and promising performance, has attracted considerable research effort and has been adopted in commercial scanners. To perform sparse reconstruction, choosing a proper algorithm is essential in providing satisfying results and saving time in tuning parameters. The pFISTA, a simple and efficient algorithm for sparse reconstruction, has been successfully extended to parallel imaging. However, its convergence criterion is still an open question, and the existing convergence criterion of single-coil pFISTA cannot be applied to the parallel imaging pFISTA, which leaves users uncertain about how to choose its only parameter, the step size. In this work, we provide the guaranteed convergence analysis of the parallel imaging version pFISTA to solve the two well-known parallel imaging reconstruction models, SENSE and SPIRiT. Along with the convergence analysis, we provide recommended step size values for SENSE and SPIRiT reconstructions to obtain fast and promising reconstructions. Experiments on in vivo brain images demonstrate the validity of the convergence criterion. Besides, experimental results show that compared to using backtracking and power iteration to determine the step size, our recommended step size achieves more than five times acceleration in reconstruction time in most tested cases.
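To make the role of the step size concrete, here is a minimal sketch of the generic fast iterative soft-thresholding algorithm (FISTA) on a tiny dense problem. It is not the projected variant (pFISTA) analyzed in the paper and it does not involve SENSE or SPIRiT; the matrix `A`, the data `y`, the weight `lam` and the function names are all illustrative assumptions. For this generic form, a step size no larger than the reciprocal of the largest eigenvalue of $A^{T}A$ is the standard sufficient condition for convergence.

```python
import math


def soft_threshold(x, t):
    """Shrink x toward zero by t; this is the proximal map of t * |x|."""
    if x > t:
        return x - t
    if x < -t:
        return x + t
    return 0.0


def fista(A, y, lam, step, iters=200):
    """Minimize 0.5 * ||A x - y||^2 + lam * ||x||_1 with generic FISTA (plain lists)."""
    m, n = len(A), len(A[0])
    x = [0.0] * n   # current iterate
    z = x[:]        # extrapolated point used for the gradient step
    t = 1.0         # momentum parameter
    for _ in range(iters):
        # Gradient of the data term at z: A^T (A z - y).
        r = [sum(A[i][j] * z[j] for j in range(n)) - y[i] for i in range(m)]
        g = [sum(A[i][j] * r[i] for i in range(m)) for j in range(n)]
        # Gradient step followed by soft thresholding.
        x_new = [soft_threshold(z[j] - step * g[j], step * lam) for j in range(n)]
        # Momentum update.
        t_new = (1.0 + math.sqrt(1.0 + 4.0 * t * t)) / 2.0
        z = [x_new[j] + (t - 1.0) / t_new * (x_new[j] - x[j]) for j in range(n)]
        x, t = x_new, t_new
    return x


if __name__ == "__main__":
    A = [[1.0, 0.0], [0.0, 1.0]]  # identity, so the largest eigenvalue of A^T A is 1
    print(fista(A, [3.0, 0.5], lam=1.0, step=1.0))
```

With an identity `A` the minimizer is just the soft-thresholded data, which makes the example easy to check by hand.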

preprint 2020 arXiv

Event Arguments Extraction via Dilate Gated Convolutional Neural Network with Enhanced Local Features

Event extraction plays an important role in information extraction to understand the world. Event extraction can be split into two subtasks: event trigger extraction and event arguments extraction. However, the F-score of event arguments extraction is much lower than that of event trigger extraction; in the most recent work, event trigger extraction achieves 80.7%, while event arguments extraction achieves only 58%. In pipelined structures, the difficulty of event arguments extraction lies in its lack of classification features and its much higher computational cost. In this work, we propose a novel event extraction approach based on a multi-layer Dilate Gated Convolutional Neural Network (EE-DGCNN), which has fewer parameters. In addition, enhanced local information is incorporated into word features to assign event argument roles for triggers predicted by the first subtask. The numerical experiments demonstrate significant performance improvement over state-of-the-art event extraction approaches on real-world datasets. Further analysis of the extraction procedure is presented, and experiments are conducted to analyze the factors related to the performance improvement.

preprint 2019 arXiv

pISTA-SENSE-ResNet for Parallel MRI Reconstruction

Magnetic resonance imaging has been widely applied in clinical diagnosis; however, it is limited by its long data acquisition time. Although imaging can be accelerated by sparse sampling and parallel imaging, achieving promising reconstruction images with a fast reconstruction speed remains a challenge. Recently, deep learning approaches have attracted a lot of attention for their encouraging reconstruction results, but they lack proper interpretability. In this letter, to enable high-quality image reconstruction for parallel magnetic resonance imaging, we design the network structure from the perspective of sparse iterative reconstruction and enhance it with the residual structure. The experimental results on a public knee dataset show that, compared with the optimization-based method and the latest deep learning parallel imaging methods, the proposed network has less error in reconstruction and is more stable under different acceleration factors.

preprint 2016 arXiv

Discontinuous phase transition in an annealed multi-state majority-vote model

In this paper, we generalize the original majority-vote (MV) model with noise from two states to arbitrary $q$ states, where $q$ is an integer no less than two. The main emphasis is paid to the comparison on the nature of phase transitions between the two-state MV (MV2) model and the three-state MV (MV3) model. By extensive Monte Carlo simulation and mean-field analysis, we find that the MV3 model undergoes a discontinuous order-disorder phase transition, in contrast to a continuous phase transition in the MV2 model. A central feature of such a discontinuous transition is a strong hysteresis behavior as noise intensity goes forward and backward. Within the hysteresis region, the disordered phase and ordered phase are coexisting.

preprint 2015 arXiv

A straightforward method to assess motion blur for different types of displays

A simulation method based on the liquid crystal response and the human visual system is suitable to characterize motion blur for LCDs but not other display types. We propose a more straightforward and widely applicable method to quantify motion blur based on the width of the moving object. We thus compare various types of displays objectively. A perceptual experiment was conducted to validate the proposed method. We test varying motion velocities for nine commercial displays. We compare the three motion blur evaluation methods (simulation, human perception, and our method) using z-scores. Our comparisons indicate that our method accurately characterizes motion blur for various display types.

preprint 2014 arXiv

Complex activated transition in a system of two coupled bistable oscillators

We study the fluctuation-activated transition process in a system of two coupled bistable oscillators, in which each oscillator is driven by one constant force and an independent Gaussian white noise. The transition pathway has been identified and the transition rate has been computed as the coupling strength $μ$ and the mismatch $σ$ in the force constants are varied. For identical oscillators ($σ=0$), the transition undergoes a change from a two-step process with two candidate pathways to a one-step process with also two candidate pathways to a one-step process with a single pathway as $μ$ is increased. For nonidentical oscillators ($σ\neq0$), a novel transition emerges that is a mixture of a two-step pathway and a one-step pathway. Interestingly, we find that the total transition rate depends nonmonotonically on $μ$: a maximal rate appears in an intermediate magnitude of $μ$. Moreover, in the presence of weak coupling the rate also exhibits an unexpected maximum as a function of $σ$. The results are in an excellent agreement with our numerical simulations by forward flux sampling.

preprint 2013 arXiv

Explosive synchronization transitions in complex neural network

It has been recently reported that explosive synchronization transitions can take place in networks of phase oscillators [Gómez-Gardeñes et al., Phys. Rev. Lett. 106, 128701 (2011)] and chaotic oscillators [Leyva et al., Phys. Rev. Lett. 108, 168702 (2012)]. Here, we investigate the effect of a microscopic correlation between the dynamics and the interacting topology of coupled FitzHugh-Nagumo oscillators on phase synchronization transition in Barabási-Albert (BA) scale-free networks and Erdős-Rényi (ER) random networks. We show that, if the width of distribution of natural frequencies of the oscillations is larger than a threshold value, a strong hysteresis loop arises in the synchronization diagram of BA networks due to the positive correlation between node degrees and natural frequencies of the oscillations, indicating the evidence of an explosive transition towards synchronization of relaxation oscillators system. In contrast to the results in BA networks, in more homogeneous ER networks the synchronization transition is always of continuous type regardless of the width of the frequency distribution. Moreover, we consider the effect of degree-mixing patterns on the nature of the synchronization transition, and find that the degree assortativity is unfavorable for the occurrence of such an explosive transition.

preprint 2013 arXiv

How does degree heterogeneity affect nucleation of Ising model on complex networks?

We investigate the nucleation of Ising model on complex networks and focus on the role played by the heterogeneity of degree distribution on nucleation rate. Using Monte Carlo simulation combined with forward flux sampling, we find that for a weak external field the nucleation rate decreases monotonically as degree heterogeneity increases. Interestingly, for a relatively strong external field the nucleation rate exhibits a nonmonotonic dependence on degree heterogeneity, in which there exists a maximal nucleation rate at an intermediate level of degree heterogeneity. Furthermore, we develop a heterogeneous mean-field theory for evaluating the free-energy barrier of nucleation. The theoretical estimations are qualitatively consistent with the simulation results. Our study suggests that degree heterogeneity plays a nontrivial role in the dynamics of phase transition in networked Ising systems.

preprint 2012 arXiv

Too massive neutron stars: The role of dark matter?

The maximum mass of a neutron star is generally determined by the equation of state of the star material. In this study, we take into account dark matter particles, assumed to behave like fermions with a free parameter to account for the interaction strength among the particles, as a possible constituent of neutron stars. We find dark matter inside the star would soften the equation of state more strongly than that of hyperons, and reduce largely the maximum mass of the star. However, the neutron star maximum mass is sensitive to the particle mass of dark matter, and a very high neutron star mass larger than 2 times solar mass could be achieved when the particle mass is small enough. Such dark-matter-admixed neutron stars could explain the recent measurement of the Shapiro delay in the radio pulsar PSR J1614-2230, which yielded a neutron star mass of 2 times solar mass that may be hardly reached when hyperons are considered only, as in the case of the microscopic Brueckner theory. Furthermore, in this particular case, we point out that the dark matter around a neutron star should also contribute to the mass measurement due to its pure gravitational effect. However, our numerical calculation illustrates that such contribution could be safely ignored because of the usual diluted dark matter environment assumed. We conclude that a very high mass measurement of about 2 times solar mass requires a really stiff equation of state in neutron stars, and find a strong upper limit (<= 0.64 GeV) for the particle mass of non-self-annihilating dark matter based on the present model.

preprint 2010 arXiv

Dark matter annihilation and non-thermal Sunyaev-Zel'dovich effect: II. dwarf spheroidal galaxy

We calculate the CMB temperature distortion due to the energetic electrons and positrons produced by dark matter annihilation (Sunyaev-Zel'dovich effect), in dwarf spheroidal galaxies (dSphs). In the calculation we have included two important effects which were previously ignored. First we show that the electron-positron pairs with energy less than GeV, which were neglected in previous calculation, could contribute a significant fraction of the total signal. Secondly we also consider the full effects of diffusion loss, which could significantly reduce the density of electron-positron pairs at the center of cuspy halos. For neutralinos, we confirm that detecting such kind of SZ effect is beyond the capability of the current or even the next generation experiments. In the case of light dark matter (LDM) the signal is much larger, but even in this case it is only marginally detectable with the next generation of experiments such as ALMA. We conclude that similar to the case of galaxy clusters, in the dwarf galaxies the $SZ_{\rm DM}$ effect is not a strong probe of DM annihilations.

preprint 2009 arXiv

Dark matter annihilation and non-thermal Sunyaev-Zel'dovich effect: I. galaxy cluster

In this work we calculate the Sunyaev-Zel'dovich (SZ) effect due to the $e^+e^-$ from dark matter (DM) annihilation in galaxy clusters. Two candidates of DM particle, (1) the weakly-interacting massive particle (WIMP) and (2) the light dark matter (LDM) are investigated. For each case, we also consider several DM profiles with and without central cusp. We generally find smaller signals than previously reported. Moreover, the diffusion of electrons and positrons in the galaxy clusters, which was generally thought to be negligible, is considered and found to have significant effect on the central electron/positron distribution for DM profile with large spatial gradient. We find that the SZ effect from WIMP is almost always non-observable, even for the highly cuspy DM profile, and even using the next generation SZ interferometer such as ALMA. Although the signal of the LDM is much larger than that of the WIMP, the final SZ effect is still very small due to the smoothing effect of diffusion. Only for the configuration with large central cusp and extremely small diffusion effect, the LDM induced SZ effect might have a slight chance of being detected.
