Source author record

Jincheng Yu

Jincheng Yu appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

astro-ph.GA astro-ph.SR Computer Vision Robotics Artificial Intelligence astro-ph.IM Multiagent Systems

Catalog footprint

What is connected

11works

7topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

MHLA: Restoring Expressivity of Linear Attention via Token-Level Multi-Head

While the Transformer architecture dominates many fields, its quadratic self-attention complexity hinders its use in large-scale applications. Linear attention offers an efficient alternative, but its direct application often degrades performance, with existing fixes typically re-introducing computational overhead through extra modules (e.g., depthwise separable convolution) that defeat the original purpose. In this work, we identify a key failure mode in these methods: global context collapse, where the model loses representational diversity. To address this, we propose Multi-Head Linear Attention (MHLA), which preserves this diversity by computing attention within divided heads along the token dimension. We prove that MHLA maintains linear complexity while recovering much of the expressive power of softmax attention, and verify its effectiveness across multiple domains, achieving a 3.6\% improvement on ImageNet classification, a 6.3\% gain on NLP, a 12.6\% improvement on image generation, and a 41\% enhancement on video generation under the same time complexity.

preprint2026arXiv

SANA-WM: Efficient Minute-Scale World Modeling with Hybrid Linear Diffusion Transformer

We introduce SANA-WM, an efficient 2.6B-parameter open-source world model natively trained for one-minute generation, synthesizing high-fidelity, 720p, minute-scale videos with precise camera control. SANA-WM achieves visual quality comparable to large-scale industrial baselines such as LingBot-World and HY-WorldPlay, while significantly improving efficiency. Four core designs drive our architecture: (1) Hybrid Linear Attention combines frame-wise Gated DeltaNet (GDN) with softmax attention for memory-efficient long-context modeling. (2) Dual-Branch Camera Control ensures precise 6-DoF trajectory adherence. (3) Two-Stage Generation Pipeline applies a long-video refiner to stage-1 outputs, improving quality and consistency across sequences. (4) Robust Annotation Pipeline extracts accurate metric-scale 6-DoF camera poses from public videos to yield high-quality, spatiotemporally consistent action labels. Driven by these designs, SANA-WMdemonstrates remarkable efficiency across data, training compute, and inference hardware: it uses only $\sim$213K public video clips with metric-scale pose supervision, completes training in 15 days on 64 H100s, and generates each 60s clip on a single GPU; its distilled variant can be deployed on a single RTX 5090 with NVFP4 quantization to denoise a 60s 720p clip in 34s. On our one-minute world-model benchmark, SANA-WM demonstrates stronger action-following accuracy than prior open-source baselines and achieves comparable visual quality at $36\times$ higher throughput for scalable world modeling.

preprint2023arXiv

PixArt-$α$: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

The most advanced text-to-image (T2I) models require significant training costs (e.g., millions of GPU hours), seriously hindering the fundamental innovation for the AIGC community while increasing CO2 emissions. This paper introduces PIXART-$α$, a Transformer-based T2I diffusion model whose image generation quality is competitive with state-of-the-art image generators (e.g., Imagen, SDXL, and even Midjourney), reaching near-commercial application standards. Additionally, it supports high-resolution image synthesis up to 1024px resolution with low training cost, as shown in Figure 1 and 2. To achieve this goal, three core designs are proposed: (1) Training strategy decomposition: We devise three distinct training steps that separately optimize pixel dependency, text-image alignment, and image aesthetic quality; (2) Efficient T2I Transformer: We incorporate cross-attention modules into Diffusion Transformer (DiT) to inject text conditions and streamline the computation-intensive class-condition branch; (3) High-informative data: We emphasize the significance of concept density in text-image pairs and leverage a large Vision-Language model to auto-label dense pseudo-captions to assist text-image alignment learning. As a result, PIXART-$α$'s training speed markedly surpasses existing large-scale T2I models, e.g., PIXART-$α$ only takes 10.8% of Stable Diffusion v1.5's training time (675 vs. 6,250 A100 GPU days), saving nearly \$300,000 (\$26,000 vs. \$320,000) and reducing 90% CO2 emissions. Moreover, compared with a larger SOTA model, RAPHAEL, our training cost is merely 1%. Extensive experiments demonstrate that PIXART-$α$ excels in image quality, artistry, and semantic control. We hope PIXART-$α$ will provide new insights to the AIGC community and startups to accelerate building their own high-quality yet low-cost generative models from scratch.

preprint2022arXiv

Explore-Bench: Data Sets, Metrics and Evaluations for Frontier-based and Deep-reinforcement-learning-based Autonomous Exploration

Autonomous exploration and mapping of unknown terrains employing single or multiple robots is an essential task in mobile robotics and has therefore been widely investigated. Nevertheless, given the lack of unified data sets, metrics, and platforms to evaluate the exploration approaches, we develop an autonomous robot exploration benchmark entitled Explore-Bench. The benchmark involves various exploration scenarios and presents two types of quantitative metrics to evaluate exploration efficiency and multi-robot cooperation. Explore-Bench is extremely useful as, recently, deep reinforcement learning (DRL) has been widely used for robot exploration tasks and achieved promising results. However, training DRL-based approaches requires large data sets, and additionally, current benchmarks rely on realistic simulators with a slow simulation speed, which is not appropriate for training exploration strategies. Hence, to support efficient DRL training and comprehensive evaluation, the suggested Explore-Bench designs a 3-level platform with a unified data flow and $12 \times$ speed-up that includes a grid-based simulator for fast evaluation and efficient training, a realistic Gazebo simulator, and a remotely accessible robot testbed for high-accuracy tests in physical environments. The practicality of the proposed benchmark is highlighted with the application of one DRL-based and three frontier-based exploration approaches. Furthermore, we analyze the performance differences and provide some insights about the selection and design of exploration methods. Our benchmark is available at https://github.com/efc-robot/Explore-Bench.

preprint2022arXiv

Multi-UAV Coverage Planning with Limited Endurance in Disaster Environment

For scenes such as floods and earthquakes, the disaster area is large, and rescue time is tight. Multi-UAV exploration is more efficient than a single UAV. Existing UAV exploration work is modeled as a Coverage Path Planning (CPP) task to achieve full coverage of the area in the presence of obstacles. However, the endurance capability of UAV is limited, and the rescue time is urgent. Thus, even using multiple UAVs cannot achieve complete disaster area coverage in time. Therefore, in this paper we propose a multi-Agent Endurance-limited CPP (MAEl-CPP) problem based on a priori heatmap of the disaster area, which requires the exploration of more valuable areas under limited energy. Furthermore, we propose a path planning algorithm for the MAEl-CPP problem, by ranking the possible disaster areas according to their importance through satellite or remote aerial images and completing path planning according to the importance level. Experimental results show that our proposed algorithm is at least twice as effective as the existing method in terms of search efficiency.

preprint2020arXiv

LAMOST Medium-Resolution Spectroscopic Survey (LAMOST-MRS): Scientific goals and survey plan

Since September 2018, LAMOST starts a new 5-year medium-resolution spectroscopic survey (MRS) using bright/gray nights. We present the scientific goals of LAMOST-MRS and propose a near optimistic strategy of the survey. A complete footprint is also provided. Not only the regular medium-resolution survey, but also a time-domain spectroscopic survey is being conducted since 2018 and will be end in 2023. According to the detailed survey plan, we expect that LAMOST-MRS can observe about 2 million stellar spectra with ~7500 and limiting magnitude of around G=15 mag. Moreover, it will also provide about 200 thousand stars with averagely 60-epoch observations and limiting magnitude of G~14 mag. These high quality spectra will give around 20 elemental abundances, rotational velocities, emission line profiles as well as precise radial velocity with uncertainty less than 1 km/s. With these data, we expect that LAMOST can effectively leverage sciences on stellar physics, e.g. exotic binary stars, detailed observation of many types of variable stars etc., planet host stars, emission nebulae, open clusters, young pre-main-sequence stars etc.

preprint2020arXiv

Multiple populations in low mass globular clusters: Palomar 13

Since the discovery of chemically peculiar stars in globular clusters in the last century, the study of multiple populations has become increasingly important, given that chemical inhomogeneity is found in almost all globular clusters. Despite various proposed theories attempting to explain this phenomenon, fitting all the observational evidence in globular clusters with one single theory remains notoriously difficult and currently unsuccessful. In order to improve existing models and motivate new ones, we are observing globular clusters at critical conditions, e.g., metal-rich end, metal-poor end, and low mass end. In this paper, we present our first attempt to investigate multiple populations in low mass globular clusters. We obtained low-resolution spectra around 4000 A of 30 members of the globular cluster Palomar 13 using OSIRIS/Multi-object spectrograph mounted at the Gran Telescopio Canarias. The membership of red giant branch stars is confirmed by the latest proper motions from Gaia DR2 and literature velocities. After comparing the measured CN and CH spectral indices with those of the stellar models, we found a clear sign of nitrogen variation among the red giant branch stars. Palomar 13 may be the lowest mass globular cluster showing multiple populations.

preprint2020arXiv

On the Chemical and Kinematic Consistency Between N-rich Metal-poor Field Stars and Enriched Populations in Globular Clusters

Interesting chemically peculiar field stars may reflect their stellar evolution history and their possible origin in a different environment from where they are found now, which is one of the most important research fields in Galactic archaeology. To explore this further, we have used the CN-CH bands around 4000 A to identify N-rich metal-poor field stars in LAMOST DR3. Here we expand our N-rich metal-poor field star sample to ~100 stars in LAMOST DR5, where 53 of them are newly found in this work. We investigate light elements of the common stars between our sample and APOGEE DR14. While Mg, Al, and Si abundances generally agree with the hypothesis that N-rich metal-poor field stars come from enriched populations in globular clusters, it is still inconclusive for C, N, and O. After integrating the orbits of our N-rich field stars and a control sample of normal metal-poor field stars, we find that N-rich field stars have different orbital parameter distributions compared to the control sample, specifically, apocentric distances, maximum vertical amplitude (Zmax), orbital energy, and z direction angular momentum (Lz). The orbital parameters of N-rich field stars indicate that most of them are inner-halo stars. The kinematics of N-rich field stars support their possible GC origin. The spatial and velocity distributions of our bona fide N-rich field star sample are important observational evidence to constrain simulations of the origin of these interesting objects.

preprint2012arXiv

A Test for Radial Mixing Using Local Star Samples

We use samples of local main-sequence stars to show that the radial gradient of [Fe/H] in the thin disk of the Milky Way decreases with mean effective stellar temperature. Many of these stars are visiting the solar neighborhood from the inner and outer Galaxy. We use the angular momentum of each star about the Galactic center to determine the guiding center radius and to eliminate the effects of epicyclic motion, which would otherwise blur the estimated gradients. We interpret the effective temperature as a proxy for mean age, and conclude that the decreasing gradient is consistent with the predictions of radial mixing due to transient spiral patterns. We find some evidence that the trend of decreasing gradient with increasing mean age breaks to a constant gradient for samples of stars whose main-sequence life-times exceed the likely age of the thin disk.

preprint2012arXiv

The selection of LEGUE disk targets for LAMOST's pilot survey

We describe the target selection algorithm for the low latitude disk portion of the LAMOST Pilot Survey, which aims to test systems in preparation for the LAMOST spectroscopic survey. We use the PPMXL (Roeser et al. 2010) astrometric catalog, which provides positions, proper motions, B/R/I magnitudes (mostly) from USNO-B (Monet et al. 2003) and J/H/Ks from The Two Micron All Sky Survey (2MASS, see Skrutskie et al. 2006) as well. We chose 8 plates along the Galactic plane, in the region $0^\circ<α<67^\circ$ and $42^\circ<δ<59^\circ$, that cover 22 known open clusters with a range of ages. Adjacent plates may have small overlapping. Each plate covers an area $2.5^\circ$ in radius,with central star (for Shack-Hartmann guider) brighter than $\sim8^{\rm th}$ magnitude. For each plate, we create an input catalog in the magnitude range $11.3<Imag<16.3$ and $Bmag$ available from PPMXL. The stars are selected to satisfy the requirements of the fiber positioning system and have a uniform distribution in the $I$ vs. $B-I$ color-magnitude diagram. Our final input catalog consists of 12,000 objects on each of 8 plates that are observable during the winter observing season in Xinglong Station of the National Astronomical Observatory of China.

preprint2011arXiv

Rapid dynamical mass segregation and properties of fractal star clusters

We investigate the evolution of young star clusters using N-body simulations. We confirm that subvirial and fractal-structured clusters will dynamically mass segregate on a short timescale (within 0.5 Myr). We adopt a modified minimum-spanning-tree (MST) method to measure the degree of mass segregation, demonstrating that the stars escaping from a cluster's potential are important for the temporal dependence of mass segregation in the cluster. The form of the initial velocity distribution will also affect the degree of mass segregation. If it depends on radius, the outer parts of the cluster would expand without undergoing collapse. In velocity space, we find 'inverse mass segregation,' which indicates that massive stars have higher velocity dispersions than their lower-mass counterparts.

Jincheng Yu

What is connected

Connect this record

See the researcher in context

Building this map preview

11 published item(s)

MHLA: Restoring Expressivity of Linear Attention via Token-Level Multi-Head

SANA-WM: Efficient Minute-Scale World Modeling with Hybrid Linear Diffusion Transformer

PixArt-$α$: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

Explore-Bench: Data Sets, Metrics and Evaluations for Frontier-based and Deep-reinforcement-learning-based Autonomous Exploration

Multi-UAV Coverage Planning with Limited Endurance in Disaster Environment

LAMOST Medium-Resolution Spectroscopic Survey (LAMOST-MRS): Scientific goals and survey plan

Multiple populations in low mass globular clusters: Palomar 13

On the Chemical and Kinematic Consistency Between N-rich Metal-poor Field Stars and Enriched Populations in Globular Clusters

A Test for Radial Mixing Using Local Star Samples

The selection of LEGUE disk targets for LAMOST's pilot survey

Rapid dynamical mass segregation and properties of fractal star clusters