Source author record

Hongxuan Zhang

Hongxuan Zhang appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Artificial Intelligence astro-ph.GA Computation and Language Distributed, Parallel, and Cluster Computing Information Retrieval Machine Learning Networking and Internet Architecture

Catalog footprint

What is connected

3works

7topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

RAG-R1: Incentivizing the Search and Reasoning Capabilities of LLMs through Multi-query Parallelism

Large Language Models (LLMs), despite their remarkable capabilities, are prone to generating hallucinated or outdated content due to their static internal knowledge. While Retrieval-Augmented Generation (RAG) integrated with Reinforcement Learning (RL) offers a solution, these methods are fundamentally constrained by a single-query mode, leading to prohibitive latency and inherent brittleness. To overcome these limitations, we introduce RAG-R1, a novel two-stage training framework centered around multi-query parallelism. Our framework enables LLMs to adaptively leverage internal and external knowledge during the reasoning process while transitioning from the single-query mode to multi-query parallelism. This architectural shift bolsters reasoning robustness while significantly reducing inference latency. Extensive experiments on seven question-answering benchmarks confirm the superiority of our method, which outperforms the strongest baseline by up to 13.7% and decreases inference time by 11.1%.

preprint2021arXiv

Dynamic DNN Decomposition for Lossless Synergistic Inference

Deep neural networks (DNNs) sustain high performance in today's data processing applications. DNN inference is resource-intensive thus is difficult to fit into a mobile device. An alternative is to offload the DNN inference to a cloud server. However, such an approach requires heavy raw data transmission between the mobile device and the cloud server, which is not suitable for mission-critical and privacy-sensitive applications such as autopilot. To solve this problem, recent advances unleash DNN services using the edge computing paradigm. The existing approaches split a DNN into two parts and deploy the two partitions to computation nodes at two edge computing tiers. Nonetheless, these methods overlook collaborative device-edge-cloud computation resources. Besides, previous algorithms demand the whole DNN re-partitioning to adapt to computation resource changes and network dynamics. Moreover, for resource-demanding convolutional layers, prior works do not give a parallel processing strategy without loss of accuracy at the edge side. To tackle these issues, we propose D3, a dynamic DNN decomposition system for synergistic inference without precision loss. The proposed system introduces a heuristic algorithm named horizontal partition algorithm to split a DNN into three parts. The algorithm can partially adjust the partitions at run time according to processing time and network conditions. At the edge side, a vertical separation module separates feature maps into tiles that can be independently run on different edge nodes in parallel. Extensive quantitative evaluation of five popular DNNs illustrates that D3 outperforms the state-of-the-art counterparts up to 3.4 times in end-to-end DNN inference time and reduces backbone network communication overhead up to 3.68 times.

preprint2021arXiv

Spatially Resolved Properties of Supernova Host Galaxies in SDSS-IV MaNGA

We crossmatch galaxies from Mapping Nearby Galaxies at Apache Point Observatory with the Open Supernova Catalog, obtaining a total of 132 SNe within MaNGA bundle. These 132 SNe can be classified into 67 Type Ia and 65 Type CC. We study the global and local properties of supernova host galaxies statistically. Type Ia SNe are distributed in both star-forming galaxies and quiescent galaxies, while Type CC SNe are all distributed along the star-forming main sequence. As the stellar mass increases, the Type Ia/CC number ratio increases. We find: (1) there is no obvious difference in the interaction possibilities and environments between Type Ia SN hosts and a control sample of galaxies with similar stellar mass and SFR distributions, except that Type Ia SNe tend to appear in galaxies which are more bulge-dominated than their controls. For Type CC SNe, there is no difference between their hosts and the control galaxies in galaxy morphology, interaction possibilities as well as environments; (2) the SN locations have smaller velocity dispersion, lower metallicity, and younger stellar population than galaxy centers. This is a natural result of radius gradients of all these parameters. The SN location and the its symmetrical position relative to the galaxy center, as well as regions with similar effective radii have very similar [Mg/Fe], gas-phase metallicity, gas velocity dispersion and stellar population age.

Hongxuan Zhang

What is connected

Connect this record

See the researcher in context

Building this map preview

3 published item(s)

RAG-R1: Incentivizing the Search and Reasoning Capabilities of LLMs through Multi-query Parallelism

Dynamic DNN Decomposition for Lossless Synergistic Inference

Spatially Resolved Properties of Supernova Host Galaxies in SDSS-IV MaNGA