Source author record

Hao Tian

Hao Tian appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Catalog footprint

What is connected

34works

22topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

Enhanced multi-parameter metrology in dissipative Rydberg atom time crystals

The pursuit of unprecedented sensitivity in quantum enhanced metrology has spurred interest in non-equilibrium quantum phases of matter and their symmetry breaking. In particular, criticality-enhanced metrology through time-translation symmetry breaking in many-body systems, a distinct paradigm compared to spatial symmetry breaking, is a field still in its infancy. Here, we have investigated the enhanced sensing at the boundary of a continuous time-crystal (CTC) phase in a driven Rydberg atomic gas. By mapping the full phase diagram, we identify the parameter-dependent phase boundary where the time-translation symmetry is broken. This allows us to use a single setup for measuring multiple parameters, in particular frequency and amplitude of a microwave field. By increasing the microwave field amplitude, we first observe a phase transition from a thermal phase to a CTC phase, followed by a second transition into a distinct CTC state, characterized by a different oscillation frequency. Furthermore, we reveal the precise relationship between the CTC phase boundary and the scanning rate, displaying enhanced precision beyond the Standard Quantum Limit. This work not only provides a promising paradigm rooted in the critical properties of time crystals, but also advances a method for multi-parameter sensing in non-equilibrium quantum phases.

preprint2026arXiv

MiMo-V2-Flash Technical Report

We present MiMo-V2-Flash, a Mixture-of-Experts (MoE) model with 309B total parameters and 15B active parameters, designed for fast, strong reasoning and agentic capabilities. MiMo-V2-Flash adopts a hybrid attention architecture that interleaves Sliding Window Attention (SWA) with global attention, with a 128-token sliding window under a 5:1 hybrid ratio. The model is pre-trained on 27 trillion tokens with Multi-Token Prediction (MTP), employing a native 32k context length and subsequently extended to 256k. To efficiently scale post-training compute, MiMo-V2-Flash introduces a novel Multi-Teacher On-Policy Distillation (MOPD) paradigm. In this framework, domain-specialized teachers (e.g., trained via large-scale reinforcement learning) provide dense and token-level reward, enabling the student model to perfectly master teacher expertise. MiMo-V2-Flash rivals top-tier open-weight models such as DeepSeek-V3.2 and Kimi-K2, despite using only 1/2 and 1/3 of their total parameters, respectively. During inference, by repurposing MTP as a draft model for speculative decoding, MiMo-V2-Flash achieves up to 3.6 acceptance length and 2.6x decoding speedup with three MTP layers. We open-source both the model weights and the three-layer MTP weights to foster open research and community collaboration.

preprint2026arXiv

Video2GUI: Synthesizing Large-Scale Interaction Trajectories for Generalized GUI Agent Pretraining

Recent advances in multimodal large language models have driven growing interest in graphical user interface (GUI) agents, yet their generalization remains constrained by the scarcity of large-scale training data spanning diverse real-world applications. Existing datasets rely heavily on costly manual annotations and are typically confined to narrow domains. To address this challenge, we propose Video2GUI, a fully automated framework that extracts grounded GUI interaction trajectories directly from unlabeled Internet videos. Video2GUI employs a coarse-to-fine filtering strategy to identify high-quality GUI tutorial videos and convert them into structured agent trajectories. Applying this pipeline to 500 million video metadata entries, we construct WildGUI, a large-scale dataset containing 12 million interaction trajectories spanning over 1,500 applications and websites. Pre-training Qwen2.5-VL and Mimo-VL on WildGUI yields consistent improvements of 5-20% across multiple GUI grounding and action benchmarks, matching or surpassing state-of-the-art performance. We will release both the WildGUI dataset and the Video2GUI pipeline to support future research of GUI agents.

preprint2025arXiv

Introduction to the Chinese Space Station Survey Telescope (CSST)

The Chinese Space Station Survey Telescope (CSST) is an upcoming Stage-IV sky survey telescope, distinguished by its large field of view (FoV), high image quality, and multi-band observation capabilities. It can simultaneously conduct precise measurements of the Universe by performing multi-color photometric imaging and slitless spectroscopic surveys. The CSST is equipped with five scientific instruments, i.e. Multi-band Imaging and Slitless Spectroscopy Survey Camera (SC), Multi-Channel Imager (MCI), Integral Field Spectrograph (IFS), Cool Planet Imaging Coronagraph (CPI-C), and THz Spectrometer (TS). Using these instruments, CSST is expected to make significant contributions and discoveries across various astronomical fields, including cosmology, galaxies and active galactic nuclei (AGN), the Milky Way and nearby galaxies, stars, exoplanets, Solar System objects, astrometry, and transients and variable sources. This review aims to provide a comprehensive overview of the CSST instruments, observational capabilities, data products, and scientific potential.

preprint2025arXiv

MiMo-Audio: Audio Language Models are Few-Shot Learners

Existing audio language models typically rely on task-specific fine-tuning to accomplish particular audio tasks. In contrast, humans are able to generalize to new audio tasks with only a few examples or simple instructions. GPT-3 has shown that scaling next-token prediction pretraining enables strong generalization capabilities in text, and we believe this paradigm is equally applicable to the audio domain. By scaling MiMo-Audio's pretraining data to over one hundred million of hours, we observe the emergence of few-shot learning capabilities across a diverse set of audio tasks. We develop a systematic evaluation of these capabilities and find that MiMo-Audio-7B-Base achieves SOTA performance on both speech intelligence and audio understanding benchmarks among open-source models. Beyond standard metrics, MiMo-Audio-7B-Base generalizes to tasks absent from its training data, such as voice conversion, style transfer, and speech editing. MiMo-Audio-7B-Base also demonstrates powerful speech continuation capabilities, capable of generating highly realistic talk shows, recitations, livestreaming and debates. At the post-training stage, we curate a diverse instruction-tuning corpus and introduce thinking mechanisms into both audio understanding and generation. MiMo-Audio-7B-Instruct achieves open-source SOTA on audio understanding benchmarks (MMSU, MMAU, MMAR, MMAU-Pro), spoken dialogue benchmarks (Big Bench Audio, MultiChallenge Audio) and instruct-TTS evaluations, approaching or surpassing closed-source models. Model checkpoints and full evaluation suite are available at https://github.com/XiaomiMiMo/MiMo-Audio.

preprint2024arXiv

COMBHelper: A Neural Approach to Reduce Search Space for Graph Combinatorial Problems

Combinatorial Optimization (CO) problems over graphs appear routinely in many applications such as in optimizing traffic, viral marketing in social networks, and matching for job allocation. Due to their combinatorial nature, these problems are often NP-hard. Existing approximation algorithms and heuristics rely on the search space to find the solutions and become time-consuming when this space is large. In this paper, we design a neural method called COMBHelper to reduce this space and thus improve the efficiency of the traditional CO algorithms based on node selection. Specifically, it employs a Graph Neural Network (GNN) to identify promising nodes for the solution set. This pruned search space is then fed to the traditional CO algorithms. COMBHelper also uses a Knowledge Distillation (KD) module and a problem-specific boosting module to bring further efficiency and efficacy. Our extensive experiments show that the traditional CO algorithms with COMBHelper are at least 2 times faster than their original versions.

preprint2024arXiv

Efficient Asynchronous Federated Learning with Sparsification and Quantization

While data is distributed in multiple edge devices, Federated Learning (FL) is attracting more and more attention to collaboratively train a machine learning model without transferring raw data. FL generally exploits a parameter server and a large number of edge devices during the whole process of the model training, while several devices are selected in each round. However, straggler devices may slow down the training process or even make the system crash during training. Meanwhile, other idle edge devices remain unused. As the bandwidth between the devices and the server is relatively low, the communication of intermediate data becomes a bottleneck. In this paper, we propose Time-Efficient Asynchronous federated learning with Sparsification and Quantization, i.e., TEASQ-Fed. TEASQ-Fed can fully exploit edge devices to asynchronously participate in the training process by actively applying for tasks. We utilize control parameters to choose an appropriate number of parallel edge devices, which simultaneously execute the training tasks. In addition, we introduce a caching mechanism and weighted averaging with respect to model staleness to further improve the accuracy. Furthermore, we propose a sparsification and quantitation approach to compress the intermediate data to accelerate the training. The experimental results reveal that TEASQ-Fed improves the accuracy (up to 16.67% higher) while accelerating the convergence of model training (up to twice faster).

preprint2023arXiv

ERNIE 3.0 Tiny: Frustratingly Simple Method to Improve Task-Agnostic Distillation Generalization

Task-agnostic knowledge distillation attempts to address the problem of deploying large pretrained language model in resource-constrained scenarios by compressing a large pretrained model called teacher into a smaller one called student such that the student can be directly finetuned on downstream tasks and retains comparable performance. However, we empirically find that there is a generalization gap between the student and the teacher in existing methods. In this work, we show that we can leverage multi-task learning in task-agnostic distillation to advance the generalization of the resulted student. In particular, we propose Multi-task Infused Task-agnostic Knowledge Distillation (MITKD). We first enhance the teacher by multi-task training it on multiple downstream tasks and then perform distillation to produce the student. Experimental results demonstrate that our method yields a student with much better generalization, significantly outperforms existing baselines, and establishes a new state-of-the-art result on in-domain, out-domain, and low-resource datasets in the setting of task-agnostic distillation. Moreover, our method even exceeds an 8x larger BERT$_{\text{Base}}$ on SQuAD and four GLUE tasks. In addition, by combining ERNIE 3.0, our method achieves state-of-the-art results on 10 Chinese datasets.

preprint2022arXiv

ERNIE-Search: Bridging Cross-Encoder with Dual-Encoder via Self On-the-fly Distillation for Dense Passage Retrieval

Neural retrievers based on pre-trained language models (PLMs), such as dual-encoders, have achieved promising performance on the task of open-domain question answering (QA). Their effectiveness can further reach new state-of-the-arts by incorporating cross-architecture knowledge distillation. However, most of the existing studies just directly apply conventional distillation methods. They fail to consider the particular situation where the teacher and student have different structures. In this paper, we propose a novel distillation method that significantly advances cross-architecture distillation for dual-encoders. Our method 1) introduces a self on-the-fly distillation method that can effectively distill late interaction (i.e., ColBERT) to vanilla dual-encoder, and 2) incorporates a cascade distillation process to further improve the performance with a cross-encoder teacher. Extensive experiments are conducted to validate that our proposed solution outperforms strong baselines and establish a new state-of-the-art on open-domain QA benchmarks.

preprint2022arXiv

ERNIE-SPARSE: Learning Hierarchical Efficient Transformer Through Regularized Self-Attention

Sparse Transformer has recently attracted a lot of attention since the ability for reducing the quadratic dependency on the sequence length. We argue that two factors, information bottleneck sensitivity and inconsistency between different attention topologies, could affect the performance of the Sparse Transformer. This paper proposes a well-designed model named ERNIE-Sparse. It consists of two distinctive parts: (i) Hierarchical Sparse Transformer (HST) to sequentially unify local and global information. (ii) Self-Attention Regularization (SAR) method, a novel regularization designed to minimize the distance for transformers with different attention topologies. To evaluate the effectiveness of ERNIE-Sparse, we perform extensive evaluations. Firstly, we perform experiments on a multi-modal long sequence modeling task benchmark, Long Range Arena (LRA). Experimental results demonstrate that ERNIE-Sparse significantly outperforms a variety of strong baseline methods including the dense attention and other efficient sparse attention methods and achieves improvements by 2.77% (57.78% vs. 55.01%). Secondly, to further show the effectiveness of our method, we pretrain ERNIE-Sparse and verified it on 3 text classification and 2 QA downstream tasks, achieve improvements on classification benchmark by 0.83% (92.46% vs. 91.63%), on QA benchmark by 3.24% (74.67% vs. 71.43%). Experimental results continue to demonstrate its superior performance.

preprint2022arXiv

Indicative Image Retrieval: Turning Blackbox Learning into Grey

Deep learning became the game changer for image retrieval soon after it was introduced. It promotes the feature extraction (by representation learning) as the core of image retrieval, with the relevance/matching evaluation being degenerated into simple similarity metrics. In many applications, we need the matching evidence to be indicated rather than just have the ranked list (e.g., the locations of the target proteins/cells/lesions in medical images). It is like the matched words need to be highlighted in search engines. However, this is not easy to implement without explicit relevance/matching modeling. The deep representation learning models are not feasible because of their blackbox nature. In this paper, we revisit the importance of relevance/matching modeling in deep learning era with an indicative retrieval setting. The study shows that it is possible to skip the representation learning and model the matching evidence directly. By removing the dependency on the pre-trained models, it has avoided a lot of related issues (e.g., the domain gap between classification and retrieval, the detail-diffusion caused by convolution, and so on). More importantly, the study demonstrates that the matching can be explicitly modeled and backtracked later for generating the matching evidence indications. It can improve the explainability of deep inference. Our method obtains a best performance in literature on both Oxford-5k and Paris-6k, and sets a new record of 97.77% on Oxford-5k (97.81% on Paris-6k) without extracting any deep features.

preprint2022arXiv

LAMOST medium-resolution spectroscopic survey of binarity and exotic star (LAMOST-MRS-B): Observation strategy and target selection

LAMOST-MRS-B is one of the sub-surveys of LAMOST medium-resolution (R~7500) spectroscopic survey. It aims at studying the statistical properties (e.g., binary fraction, orbital period distribution, mass ratio distribution) of binary stars and exotic stars. We intend to observe about 30000 stars (10 mag <= G <= 14.5 mag) with at least 10 visits in five years. We first planned to observe 25 plates around the galactic plane in 2018. Then the plates were reduced to 12 in 2019 because of the limitation of observation. At the same time, two new plates located at the high galactic latitude were added to explore binary properties influenced by the different environments. In this survey project, we set the identified exotic and low-metallicity stars with the highest observation priorities. For the rest of the selected stars, we gave higher priority to the relatively brighter stars in order to obtain high-quality spectra as many as possible. Spectra of 49129 stars have been obtained in LAMOST-MRS-B field and released in DR8, of which 28828 and 3375 stars have been visited more than twice and ten times with SNR >= 10, respectively. Most of the sources are B-, A-, and F-type stars with 0.6 < [Fe/H] < 0.4 dex. We also obtain 347 identified variable and exotic stars and about 250 stars with [Fe/H] < 1 dex. We measure radial velocities (RVs) by using 892233 spectra of the stars. The uncertainties of RV achieve about 1 km/s and 10 km/s1 for 95% of late- and early-type stars, respectively. The datasets presented in this paper are available at http://www.doi.org/10.57760/sciencedb.j00113.00035.

preprint2022arXiv

LAST: Latent Space Assisted Adaptive Sampling for Protein Trajectories

Molecular dynamics (MD) simulation is widely used to study protein conformations and dynamics. However, conventional simulation suffers from being trapped in some local energy minima that are hard to escape. Thus, most computational time is spent sampling in the already visited regions. This leads to an inefficient sampling process and further hinders the exploration of protein movements in affordable simulation time. The advancement of deep learning provides new opportunities for protein sampling. Variational autoencoders are a class of deep learning models to learn a low-dimensional representation (referred to as the latent space) that can capture the key features of the input data. Based on this characteristic, we proposed a new adaptive sampling method, latent space assisted adaptive sampling for protein trajectories (LAST), to accelerate the exploration of protein conformational space. This method comprises cycles of (i) variational autoencoders training, (ii) seed structure selection on the latent space and (iii) conformational sampling through additional MD simulations. The proposed approach is validated through the sampling of four structures of two protein systems: two metastable states of E. Coli adenosine kinase (ADK) and two native states of Vivid (VVD). In all four conformations, seed structures were shown to lie on the boundary of conformation distributions. Moreover, large conformational changes were observed in a shorter simulation time when compared with conventional MD (cMD) simulations in both systems. In metastable ADK simulations, LAST explored two transition paths toward two stable states while cMD became trapped in an energy basin. In VVD light state simulations, LAST was three times faster than cMD simulation with a similar conformational space.

preprint2022arXiv

Searching Extra-tidal Features around the Globular Cluster Whiting 1

Whiting 1 is a faint and young globular cluster in the halo of the Milky Way, and was suggested to have originated in the Sagittarius spherical dwarf galaxy (Sgr dSph). In this paper, we use the deep DESI Legacy Imaging Surveys to explore tentative spatial connection between Whiting 1 and the Sgr dSph. We redetermine the fundamental parameters of Whiting 1 and use the best-fitting isochrone (age $τ$=6.5 Gyr, metalicity Z=0.005 and $\rm d_{\odot}$=26.9 kpc) to construct a theoretical matched filter for the extra-tidal features searching. Without any smooth technique to the matched filter density map, we detect a round-shape feature with possible leading and trailing tails on either side of the cluster. This raw image is not totally new compared to old discoveries, but confirms that no more large-scale features can be detected under a depth of r<=22.5 mag. In our results, the whole feature stretches 0.1-0.2 degree along the orbit of Whiting 1, which gives a much larger area than the cluster core. The tails on both sides of the cluster align along the orbital direction of the Sgr dSph as well as the cluster itself, which implies that these debris are probably stripped remnants of Whiting 1 by the Milky Way.

preprint2022arXiv

Snowmass 2021 White Paper: The Windchime Project

The absence of clear signals from particle dark matter in direct detection experiments motivates new approaches in disparate regions of viable parameter space. In this Snowmass white paper, we outline the Windchime project, a program to build a large array of quantum-enhanced mechanical sensors. The ultimate aim is to build a detector capable of searching for Planck mass-scale dark matter purely through its gravitational coupling to ordinary matter. In the shorter term, we aim to search for a number of other physics targets, especially some ultralight dark matter candidates. Here, we discuss the basic design, open R&D challenges and opportunities, current experimental efforts, and both short- and long-term physics targets of the Windchime project.

preprint2021arXiv

ERNIE-ViLG: Unified Generative Pre-training for Bidirectional Vision-Language Generation

Conventional methods for the image-text generation tasks mainly tackle the naturally bidirectional generation tasks separately, focusing on designing task-specific frameworks to improve the quality and fidelity of the generated samples. Recently, Vision-Language Pre-training models have greatly improved the performance of the image-to-text generation tasks, but large-scale pre-training models for text-to-image synthesis task are still under-developed. In this paper, we propose ERNIE-ViLG, a unified generative pre-training framework for bidirectional image-text generation with transformer model. Based on the image quantization models, we formulate both image generation and text generation as autoregressive generative tasks conditioned on the text/image input. The bidirectional image-text generative modeling eases the semantic alignments across vision and language. For the text-to-image generation process, we further propose an end-to-end training method to jointly learn the visual sequence generator and the image reconstructor. To explore the landscape of large-scale pre-training for bidirectional text-image generation, we train a 10-billion parameter ERNIE-ViLG model on a large-scale dataset of 145 million (Chinese) image-text pairs which achieves state-of-the-art performance for both text-to-image and image-to-text tasks, obtaining an FID of 7.9 on MS-COCO for text-to-image synthesis and best results on COCO-CN and AIC-ICC for image captioning.

preprint2021arXiv

LAMOST Time-Domain Survey: First Results of four $K$2 plates

From Oct. 2019 to Apr. 2020, LAMOST performs a time-domain spectroscopic survey of four $K$2 plates with both low- and med-resolution observations. The low-resolution spectroscopic survey gains 282 exposures ($\approx$46.6 hours) over 25 nights, yielding a total of about 767,000 spectra, and the med-resolution survey takes 177 exposures ($\approx$49.1 hours) over 27 nights, collecting about 478,000 spectra. More than 70%/50% of low-resolution/med-resolution spectra have signal-to-noise ratio higher than 10. We determine stellar parameters (e.g., $T_{\rm eff}$, log$g$, [Fe/H]) and radial velocity (RV) with different methods, including LASP, DD-Payne, and SLAM. In general, these parameter estimations from different methods show good agreement, and the stellar parameter values are consistent with those of APOGEE. We use the $Gaia$ DR2 RV data to calculate a median RV zero point (RVZP) for each spectrograph exposure by exposure, and the RVZP-corrected RVs agree well with the APOGEE data. The stellar evolutionary and spectroscopic masses are estimated based on the stellar parameters, multi-band magnitudes, distances and extinction values. Finally, we construct a binary catalog including about 2700 candidates by analyzing their light curves, fitting the RV data, calculating the binarity parameters from med-resolution spectra, and cross-matching the spatially resolved binary catalog from $Gaia$ EDR3. The LAMOST TD survey is expected to get breakthrough in various scientific topics, such as binary system, stellar activity, and stellar pulsation, etc.

preprint2021arXiv

The Spectroscopic Binaries from LAMOST Medium-Resolution Survey (MRS). I. Searching for Double-lined Spectroscopic Binaries (SB2s) with Convolutional Neural Network

We developed a convolutional neural network (CNN) model to distinguish the double-lined spectroscopic binaries (SB2s) from others based on single exposure medium-resolution spectra ($R\sim 7,500$). The training set consists of a large set of mock spectra of single stars and binaries synthesized based on the MIST stellar evolutionary model and ATLAS9 atmospheric model. Our model reaches a novel theoretic false positive rate by adding a proper penalty on the negative sample (e.g., 0.12\% and 0.16\% for the blue/red arm when the penalty parameter $Λ=16$). Tests show that the performance is as expected and favors FGK-type Main-sequence binaries with high mass ratio ($q \geq 0.7$) and large radial velocity separation ($Δv \geq 50\,\mathrm{km\,s^{-1}}$). Although the real false positive rate can not be estimated reliably, validating on eclipsing binaries identified from Kepler light curves indicates that our model predicts low binary probabilities at eclipsing phases (0, 0.5, and 1.0) as expected. The color-magnitude diagram also helps illustrate its feasibility and capability of identifying FGK MS binaries from spectra. We conclude that this model is reasonably reliable and can provide an automatic approach to identify SB2s with period $\lesssim 10$ days. This work yields a catalog of binary probabilities for over 5 million spectra of 1 million sources from the LAMOST medium-resolution survey (MRS), and a catalog of 2198 SB2 candidates whose physical properties will be analyzed in our following-up paper. Data products are made publicly available at the journal as well as our Github website.

preprint2021arXiv

The Stellar "Snake" I: Whole Structure and Properties

To complement our previous discovery of the young snake-like structure in the solar neighborhood and reveal the structure's full extent, we build two samples of stars within the Snake and its surrounding territory from {\tt Gaia EDR3}. With the friends-of-friends algorithm, we identify 2694 and 9615 Snake member candidates from the two samples. Thirteen open clusters are embedded in these member candidates. By combining the spectroscopic data from multiple surveys, we investigate the comprehensive properties of the candidates and find that they \thj{are very likely to} belong to one sizable structure, since most of the components are well bridged in their spatial distributions, and follow a single stellar population with an age of $30-40$\,Myr and solar metallicity. This sizable structure is best explained as hierarchically primordial, and probably formed from a filamentary giant molecular cloud with unique formation history in localized regions. To analyze the dynamics of the Snake, we divide the structure into five groups according to their tangential velocities; we find that the groups are expanding at a coherent rate ($κ_X\sim3.0\,\times10^{-2}\,\rm km\,s^{-1}\,pc^{-1}$) along the length of the structure ($X$-direction). \thj{The corresponding expansion age ($τ\sim33$\,Myr) is highly consistent with the age of the Snake}. With over ten thousand member stars, the Snake is an ideal laboratory to study nearby coeval stellar formation, stellar physics, and environmental evolution over a large spatial extent.

preprint2020arXiv

Deciphering the Protein Motion of S1 Subunit in SARS-CoV-2 Spike Glycoprotein Through Integrated Computational Methods

The novel severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is a major worldwide public health emergency that has infected over $1.5$ million people. The partially open state of S1 subunit in spike glycoprotein is considered vital for its infection with host cell and is represented as a key target for neutralizing antibodies. However, the mechanism elucidating the transition from the closed state to the partially open state still remains unclear. Here, we applied a combination of Markov state model, transition path theory and random forest to analyze the S1 motion. Our results explored a promising complete conformational movement of receptor-binding domain, from buried, partially open, to detached states. We also numerically confirmed the transition probability between those states. Based on the asymmetry in both the dynamics behavior and backbone C$α$ importance, we further suggested a relation between chains in the trimer spike protein, which may help in the vaccine design and antibody neutralization.

preprint2020arXiv

Differential rotation of the halo traced by the K-giant stars

We use K-giant stars selected from the LAMOST DR5 to study the variation of the rotational velocity of the galactic halo at different space positions. Modelling the rotational velocity distribution with both the halo and disk components, we find that the rotational velocity of the halo population decreases almost linearly with increasing vertical distance to the galactic disk plane, $Z$, at fixed galactocentric radius, $R$. The samples are separated into two parts with $6<R<12$ kpc and $12<R<20$ kpc. We derive that the decreasing rates along $Z$ for the two subsamples are $-3.07\pm0.63$ and $-1.89\pm0.37$ km s$^{-1}$ kpc$^{-1}$, respectively. Compared with the TNG simulations, we suggest that this trend is probably caused by the interaction between the disk and halo. The results from the simulations show that only the oblate halo can provide a decreasing rotational velocity with an increasing $Z$. This indicates that the Galactic halo is oblate with galactocentric radius $R<20$ kpc. On the other hand, the flaring of the disk component (mainly the thick disk) is clearly traced by this study, with $R$ between 12 and 20 kpc, the disk can vertically extend to $6\sim10$ kpc above the disk plane. What is more interesting is that, we find the Gaia-Enceladus-Sausage (GES) component has a significant contribution only in the halo with $R<12$ kpc, i.e. a fraction of 23$-$47\%. While in the outer subsample, the contribution is too low to be well constrained.

preprint2020arXiv

ERNIE-GEN: An Enhanced Multi-Flow Pre-training and Fine-tuning Framework for Natural Language Generation

Current pre-training works in natural language generation pay little attention to the problem of exposure bias on downstream tasks. To address this issue, we propose an enhanced multi-flow sequence to sequence pre-training and fine-tuning framework named ERNIE-GEN, which bridges the discrepancy between training and inference with an infilling generation mechanism and a noise-aware generation method. To make generation closer to human writing patterns, this framework introduces a span-by-span generation flow that trains the model to predict semantically-complete spans consecutively rather than predicting word by word. Unlike existing pre-training methods, ERNIE-GEN incorporates multi-granularity target sampling to construct pre-training data, which enhances the correlation between encoder and decoder. Experimental results demonstrate that ERNIE-GEN achieves state-of-the-art results with a much smaller amount of pre-training data and parameters on a range of language generation tasks, including abstractive summarization (Gigaword and CNN/DailyMail), question generation (SQuAD), dialogue generation (Persona-Chat) and generative question answering (CoQA).

preprint2020arXiv

ivis Dimensionality Reduction Framework for Biomacromolecular Simulations

Molecular dynamics (MD) simulations have been widely applied to study macromolecules including proteins. However, high-dimensionality of the datasets produced by simulations makes it difficult for thorough analysis, and further hinders a deeper understanding of biomacromolecules. To gain more insights into the protein structure-function relations, appropriate dimensionality reduction methods are needed to project simulations onto low-dimensional spaces. Linear dimensionality reduction methods, such as principal component analysis (PCA) and time-structure based independent component analysis (t-ICA), could not preserve sufficient structural information. Though better than linear methods, nonlinear methods, such as t-distributed stochastic neighbor embedding (t-SNE), still suffer from the limitations in avoiding system noise and keeping inter-cluster relations. ivis is a novel deep learning-based dimensionality reduction method originally developed for single-cell datasets. Here we applied this framework for the study of light, oxygen and voltage (LOV) domain of diatom Phaeodactylum tricornutum aureochrome 1a (PtAu1a). Compared with other methods, ivis is shown to be superior in constructing Markov state model (MSM), preserving information of both local and global distances and maintaining similarity between high dimension and low dimension with the least information loss. Moreover, ivis framework is capable of providing new prospective for deciphering residue-level protein allostery through the feature weights in the neural network. Overall, ivis is a promising member in the analysis toolbox for proteins.

preprint2020arXiv

LAMOST Medium-Resolution Spectroscopic Survey (LAMOST-MRS): Scientific goals and survey plan

Since September 2018, LAMOST starts a new 5-year medium-resolution spectroscopic survey (MRS) using bright/gray nights. We present the scientific goals of LAMOST-MRS and propose a near optimistic strategy of the survey. A complete footprint is also provided. Not only the regular medium-resolution survey, but also a time-domain spectroscopic survey is being conducted since 2018 and will be end in 2023. According to the detailed survey plan, we expect that LAMOST-MRS can observe about 2 million stellar spectra with ~7500 and limiting magnitude of around G=15 mag. Moreover, it will also provide about 200 thousand stars with averagely 60-epoch observations and limiting magnitude of G~14 mag. These high quality spectra will give around 20 elemental abundances, rotational velocities, emission line profiles as well as precise radial velocity with uncertainty less than 1 km/s. With these data, we expect that LAMOST can effectively leverage sciences on stellar physics, e.g. exotic binary stars, detailed observation of many types of variable stars etc., planet host stars, emission nebulae, open clusters, young pre-main-sequence stars etc.

preprint2020arXiv

MBCAL: Sample Efficient and Variance Reduced Reinforcement Learning for Recommender Systems

In recommender systems such as news feed stream, it is essential to optimize the long-term utilities in the continuous user-system interaction processes. Previous works have proved the capability of reinforcement learning in this problem. However, there are many practical challenges to implement deep reinforcement learning in online systems, including low sample efficiency, uncontrollable risks, and excessive variances. To address these issues, we propose a novel reinforcement learning method, namely model-based counterfactual advantage learning (MBCAL). The proposed method takes advantage of the characteristics of recommender systems and draws ideas from the model-based reinforcement learning method for higher sample efficiency. It has two components: an environment model that predicts the instant user behavior one-by-one in an auto-regressive form, and a future advantage model that predicts the future utility. To alleviate the impact of excessive variance when learning the future advantage model, we employ counterfactual comparisons derived from the environment model. In consequence, the proposed method possesses high sample efficiency and significantly lower variance; Also, it is able to use existing user logs to avoid the risks of starting from scratch. In contrast to its capability, its implementation cost is relatively low, which fits well with practical systems. Theoretical analysis and elaborate experiments are presented. Results show that the proposed method transcends the other supervised learning and RL-based methods in both sample efficiency and asymptotic performances.

preprint2020arXiv

Monolithic piezoelectric control of soliton microcombs

High-speed laser frequency actuation is critical in all applications employing lasers and frequency combs, and is prerequisite for phase locking, frequency stabilization and stability transfer among multiple optical carriers. Soliton microcombs have emerged as chip-scale, broadband and low-power-consumption frequency comb sources.Yet, integrated microcombs relying on thermal heaters for on-chip actuation all exhibit only kilohertz actuation bandwidth. Consequently, high-speed actuation and locking of microcombs have been attained only with off-chip bulk modulators. Here, we present high-speed microcomb actuation using integrated components. By monolithically integrating piezoelectric AlN actuators on ultralow-loss Si3N4 photonic circuits, we demonstrate voltage-controlled soliton tuning, modulation and stabilization. The integrated AlN actuators feature bi-directional tuning with high linearity and low hysteresis, operate with 300 nW power and exhibit flat actuation response up to megahertz frequency, significantly exceeding bulk piezo tuning bandwidth. We use this novel capability to demonstrate a microcomb engine for parallel FMCW LiDAR, via synchronously tuning the laser and microresonator. By applying a triangular sweep at the modulation rate matching the frequency spacing of HBAR modes, we exploit the resonant build-up of bulk acoustic energy to significantly lower the required driving to a CMOS voltage of only 7 Volts. Our approach endows soliton microcombs with integrated, ultralow-power-consumption, and fast actuation, significantly expanding the repertoire of technological applications.

preprint2020arXiv

SKEP: Sentiment Knowledge Enhanced Pre-training for Sentiment Analysis

Recently, sentiment analysis has seen remarkable advance with the help of pre-training approaches. However, sentiment knowledge, such as sentiment words and aspect-sentiment pairs, is ignored in the process of pre-training, despite the fact that they are widely used in traditional sentiment analysis approaches. In this paper, we introduce Sentiment Knowledge Enhanced Pre-training (SKEP) in order to learn a unified sentiment representation for multiple sentiment analysis tasks. With the help of automatically-mined knowledge, SKEP conducts sentiment masking and constructs three sentiment knowledge prediction objectives, so as to embed sentiment information at the word, polarity and aspect level into pre-trained sentiment representation. In particular, the prediction of aspect-sentiment pairs is converted into multi-label classification, aiming to capture the dependency between words in a pair. Experiments on three kinds of sentiment tasks show that SKEP significantly outperforms strong pre-training baseline, and achieves new state-of-the-art results on most of the test datasets. We release our code at https://github.com/baidu/Senta.

preprint2020arXiv

The Zwicky Transient Facility Catalog of Periodic Variable Stars

The number of known periodic variables has grown rapidly in recent years. Thanks to its large field of view and faint limiting magnitude, the Zwicky Transient Facility (ZTF) offers a unique opportunity to detect variable stars in the northern sky. Here, we exploit ZTF Data Release 2 (DR2) to search for and classify variables down to r ~ 20.6 mag. We classify 781,602 periodic variables into 11 main types using an improved classification method. Comparison with previously published catalogs shows that 621,702 objects (79.5%) are newly discovered or newly classified, including ~700 Cepheids, ~5000 RR Lyrae stars, ~15,000 Delta Scuti variables, ~350,000 eclipsing binaries, ~100,000 long-period variables, and about 150,000 rotational variables. The typical misclassification rate and period accuracy are on the order of 2% and 99%, respectively. 74% of our variables are located at Galactic latitudes, $|b|<10^\circ$. This large sample of Cepheids, RR Lyrae, Delta Scuti stars, and contact (EW-type) eclipsing binaries is helpful to investigate the Galaxy's disk structure and evolution with an improved completeness, areal coverage, and age resolution. Specifically, the northern warp and the disk's edge at distances of 15--20 kpc are significantly better covered than previously. Among rotational variables, RS Canum Venaticorum and BY Draconis-type variables can be separated easily. Our knowledge of stellar chromospheric activity would benefit greatly from a statistical analysis of these types of variables.

preprint2019arXiv

Hybrid Integrated Photonics Using Bulk Acoustic Resonators

Microwave frequency acousto-optic modulation is realized by exciting high overtone bulk acoustic wave resonances (HBAR resonances) in the photonic stack. These confined mechanical stress waves transmit exhibit vertically transmitting, high quality factor (Q) acoustic Fabry Perot resonances that extend into the Gigahertz domain, and offer stress-optical interaction with the optical modes of the microresonator. Although HBAR are ubiquitously used in modern communication, and often exploited in superconducting circuits, this is the first time they have been incorporated on a photonic circuit based chip. The electro-acousto-optical interaction observed within the optical modes exhibits high actuation linearity, low actuation power and negligible crosstalk. Using the electro-acousto-optic interaction, fast optical resonance tuning is achieved with sub-nanosecond transduction time. By removing the silicon backreflection, broadband acoustic modulation at 4.1 and 8.7 GHz is realized with a 3 dB bandwidth of 250 MHz each. The novel hybrid HBAR nanophotonic platform demonstrated here, allowing on chip integration of micron-scale acoustic and photonic resonators, can find immediate applications in tunable microwave photonics, high bandwidth soliton microcomb stabilization, compact opto-electronic oscillators, and in microwave to optical conversion schemes. Moreover the hybrid platform allows implementation of momentum biasing, which allows realization of on chip non-reciprocal devices such as isolators or circulators and topological photonic bandstructures.

preprint2019arXiv

Tracing Kinematic and Chemical Properties of Sagittarius Stream by K-Giants, M-Giants, and BHB stars

We characterize the kinematic and chemical properties of $\sim$3,000 Sagittarius (Sgr) stream stars, including K-giants, M-giants, and BHBs, select from SEGUE-2, LAMOST, and SDSS separately in Integrals-of-Motion space. The orbit of Sgr stream is quite clear from the velocity vector in $X$-$Z$ plane. Stars traced by K-giants and M-giants present the apogalacticon of trailing steam is $\sim$ 100 kpc. The metallicity distributions of Sgr K-, M-giants, and BHBs present that the M-giants are on average the most metal-rich population, followed by K-giants and BHBs. All of the K-, M-giants, and BHBs indicate that the trailing arm is on average more metal-rich than leading arm, and the K-giants show that the Sgr debris is the most metal-poor part. The $α$-abundance of Sgr stars exhibits a similar trend with the Galactic halo stars at lower metallicity ([Fe/H] $<\sim$ $-$1.0 dex), and then evolve down to lower [$α$/Fe] than disk stars at higher metallicity, which is close to the evolution pattern of $α$-element of Milky Way dwarf galaxies. We find $V_Y$ and metallicity of K-giants have gradients along the direction of line-of-sight from the Galactic center in $X$-$Z$ plane, and the K-giants show that $V_Y$ increases with metallicity at [Fe/H] $>\sim-$1.5 dex. After dividing the Sgr stream into bright and faint stream according to their locations in equatorial coordinate, the K-giants and BHBs show that the bright and faint stream present different $V_Y$ and metallicities, the bright stream is on average higher in $V_Y$ and metallicity than the faint stream.

preprint2017arXiv

Diophantine equations involving Euler's totient function

In this paper, we consider the equations involving Euler's totient function $ϕ$ and Lucas type sequences. In particular, we prove that the equation $ϕ(x^m-y^m)=x^n-y^n$ has no solutions in positive integers $x, y, m, n$ except for the trivial solutions $(x, y, m , n)=(a+1, a, 1, 1)$, where $a$ is a positive integer, and the equation $ϕ((x^m-y^m)/(x-y))=(x^n-y^n)/(x-y)$ has no solutions in positive integers $x, y, m, n$ except for the trivial solutions $(x, y, m , n)=(a, b, 1, 1)$, where $a, b$ are integers with $a>b\ge 1$.

preprint2016arXiv

Selecting M-giants with infra-red photometry: Distances, metallicities and the Sagittarius stream

Using a spectroscopically confirmed sample of M-giants, M-dwarfs and quasars from the LAMOST survey, we assess how well WISE $\&$ 2MASS color-cuts can be used to select M-giant stars. The WISE bands are very efficient at separating M-giants from M-dwarfs and we present a simple classification that can produce a clean and relatively complete sample of M-giants. We derive a new photometric relation to estimate the metallicity for M-giants, calibrated using data from the APOGEE survey. We find a strong correlation between the $(W1-W2)$ color and $\rm [M/H]$, where almost all of the scatter is due to photometric uncertainties. We show that previous photometric distance relations, which are mostly based on stellar models, may be biased and devise a new empirical distance relation, investigating trends with metallicity and star formation history. Given these relations, we investigate the properties of M-giants in the Sagittarius stream. The offset in the orbital plane between the leading and trailing tails is reproduced and, by identifying distant M-giants in the direction of the Galactic anti-center, we confirm that the previously detected debris in the outer halo is the apocenter of the trailing tail. We also find tentative evidence supporting an existing overdensity near the leading tail in the Northern Galactic hemisphere, possibly an extension to the trailing tail (so-called Branch C). We have measured the metallicity distribution along the stream, finding a clear metallicity offset between the leading and trailing tails, in agreement with models for the stream formation. We include an online table of M-giants to facilitate further studies.

preprint2015arXiv

Broadband high reflectivity in subwavelength-grating slab waveguides

We computationally study a subwavelength dielectric grating structure, show that slab waveguide modes can be used to obtain broadband high reflectivity, and analyze how slab waveguide modes influence reflection. A structure showing interference between Fabry-Perot modes, slab waveguide modes, and waveguide array modes is designed with ultra-broadband high reflectivity. Owing to the coupling of guided modes, the region with reflectivity R > 0.99 has an ultra-high bandwidth ( Δf/f > 30%). The incident-angle region with R > 0.99 extends over a range greater than 40°. Moreover, an asymmetric waveguide structure is studied using a semiconductor substrate.

preprint2015arXiv

Normal incidence narrowband transmission filtering in zero-contrast gratings

We report narrowband transmission filtering based on zero-contrast grating (ZCG) reflectors at normal incidence. Computational results show that the filtering is realized through symmetry-protected modes coupling. The guided modes introduced by the slab layer make the filter frequencies flexible to modify. The rectangular structure of the filter allows simple fabrication and integration into optical systems. The quality factor of the filters could exceed 106. Owing to the low refraction index dispersion of the semiconductor and their scale-invariant operations, these filters can be applied in a broad infrared range from near infrared to terahertz wavelengths.

Hao Tian

What is connected

Connect this record

See the researcher in context

Building this map preview

34 published item(s)

Enhanced multi-parameter metrology in dissipative Rydberg atom time crystals

MiMo-V2-Flash Technical Report

Video2GUI: Synthesizing Large-Scale Interaction Trajectories for Generalized GUI Agent Pretraining

Introduction to the Chinese Space Station Survey Telescope (CSST)

MiMo-Audio: Audio Language Models are Few-Shot Learners

COMBHelper: A Neural Approach to Reduce Search Space for Graph Combinatorial Problems

Efficient Asynchronous Federated Learning with Sparsification and Quantization

ERNIE 3.0 Tiny: Frustratingly Simple Method to Improve Task-Agnostic Distillation Generalization

ERNIE-Search: Bridging Cross-Encoder with Dual-Encoder via Self On-the-fly Distillation for Dense Passage Retrieval

ERNIE-SPARSE: Learning Hierarchical Efficient Transformer Through Regularized Self-Attention

Indicative Image Retrieval: Turning Blackbox Learning into Grey

LAMOST medium-resolution spectroscopic survey of binarity and exotic star (LAMOST-MRS-B): Observation strategy and target selection

LAST: Latent Space Assisted Adaptive Sampling for Protein Trajectories

Searching Extra-tidal Features around the Globular Cluster Whiting 1

Snowmass 2021 White Paper: The Windchime Project

ERNIE-ViLG: Unified Generative Pre-training for Bidirectional Vision-Language Generation

LAMOST Time-Domain Survey: First Results of four $K$2 plates

The Spectroscopic Binaries from LAMOST Medium-Resolution Survey (MRS). I. Searching for Double-lined Spectroscopic Binaries (SB2s) with Convolutional Neural Network

The Stellar "Snake" I: Whole Structure and Properties

Deciphering the Protein Motion of S1 Subunit in SARS-CoV-2 Spike Glycoprotein Through Integrated Computational Methods

Differential rotation of the halo traced by the K-giant stars

ERNIE-GEN: An Enhanced Multi-Flow Pre-training and Fine-tuning Framework for Natural Language Generation

ivis Dimensionality Reduction Framework for Biomacromolecular Simulations

LAMOST Medium-Resolution Spectroscopic Survey (LAMOST-MRS): Scientific goals and survey plan

MBCAL: Sample Efficient and Variance Reduced Reinforcement Learning for Recommender Systems

Monolithic piezoelectric control of soliton microcombs

SKEP: Sentiment Knowledge Enhanced Pre-training for Sentiment Analysis

The Zwicky Transient Facility Catalog of Periodic Variable Stars

Hybrid Integrated Photonics Using Bulk Acoustic Resonators

Tracing Kinematic and Chemical Properties of Sagittarius Stream by K-Giants, M-Giants, and BHB stars

Diophantine equations involving Euler's totient function

Selecting M-giants with infra-red photometry: Distances, metallicities and the Sagittarius stream

Broadband high reflectivity in subwavelength-grating slab waveguides

Normal incidence narrowband transmission filtering in zero-contrast gratings