Source author record

Xiujun Li

Xiujun Li appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher · Unclaimed source record

Machine Learning, Computation and Language, Computer Vision, Artificial Intelligence, Information Retrieval, Nuclear Experiment (nucl-ex), Robotics, Systems and Control

Catalog footprint

What is connected

7 works

8 topics

4 close collaborators

preprint · 2022 · arXiv

Charm and beauty isolation from heavy flavor decay electrons in p+p and Pb+Pb collisions at $\sqrt{s_{\mathrm{NN}}}$ = 5.02 TeV at LHC

We present an analysis of heavy flavor hadron decay electrons, with the charm and beauty contributions decomposed via a data-driven method, in p+p and Pb+Pb collisions at $\sqrt{s_{\mathrm{NN}}}$ = 5.02 TeV at the LHC. The transverse momentum $p_{\mathrm{T}}$ spectra, nuclear modification factor $R_{\mathrm{AA}}$ and azimuthal anisotropic flow $v_2$ distributions of electrons from charm and beauty decays are obtained. We find that the electron $R_{\mathrm{AA}}$ from charm ($R_{\mathrm{AA}}^{\mathrm{c\rightarrow e}}$) and beauty ($R_{\mathrm{AA}}^{\mathrm{b\rightarrow e}}$) decays are suppressed at $p_{\mathrm{T}}$ $>$ 2.0 and $p_{\mathrm{T}}$ $>$ 3.0 GeV/$c$ in Pb+Pb collisions, respectively, which indicates that charm and beauty quarks interact with and lose energy in the hot, dense medium. Weaker suppression of the electron $R_{\mathrm{AA}}$ from beauty decays than from charm decays is observed at 2.0 $<$ $p_{\mathrm{T}}$ $<$ 8.0 GeV/$c$, which is consistent with the mass-dependent partonic energy loss scenario. A non-zero electron $v_2$ from beauty decays ($v_{2}^{\mathrm{b\rightarrow e}}$) is observed and is in good agreement with the ALICE measurement. In the low-$p_{\mathrm{T}}$ region from 1.0 to 3.0 GeV/$c$, a discrepancy between RHIC and LHC results is observed at the 68% confidence level, which suggests different degrees of thermalization of the beauty quark at different medium temperatures. At 3.0 GeV/$c$ $<$ $p_{\mathrm{T}}$ $<$ 7.0 GeV/$c$, $v_{2}^{\mathrm{b\rightarrow e}}$ deviates from a number-of-constituent-quark (NCQ) scaling hypothesis, which suggests that the beauty quark is unlikely to be thermalized in heavy-ion collisions at LHC energy.
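For readers outside heavy-ion physics, the two observables named in this abstract are conventionally defined as shown below. These are the standard textbook forms, not formulas quoted from the paper; the symbols $\langle T_{\mathrm{AA}} \rangle$ (average nuclear overlap function), $\sigma_{pp}$ (p+p cross section), $\varphi$ (particle azimuthal angle) and $\Psi_2$ (second-order event-plane angle) are introduced here for illustration only.

```latex
R_{\mathrm{AA}}(p_{\mathrm{T}})
  = \frac{\mathrm{d}N_{\mathrm{AA}}/\mathrm{d}p_{\mathrm{T}}}
         {\langle T_{\mathrm{AA}} \rangle \, \mathrm{d}\sigma_{pp}/\mathrm{d}p_{\mathrm{T}}},
\qquad
v_2 = \left\langle \cos\!\left[ 2\,(\varphi - \Psi_2) \right] \right\rangle
```

Under these definitions, $R_{\mathrm{AA}} = 1$ corresponds to no medium modification relative to scaled p+p collisions, and $R_{\mathrm{AA}} < 1$ is what the abstract calls suppression.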

preprint · 2021 · arXiv

Self-supervised Pre-training with Hard Examples Improves Visual Representations

Self-supervised pre-training (SSP) employs random image transformations to generate training data for visual representation learning. In this paper, we first present a modeling framework that unifies existing SSP methods as learning to predict pseudo-labels. Then, we propose new data augmentation methods that generate training examples whose pseudo-labels are harder to predict than those generated via random image transformations. Specifically, we use adversarial training and CutMix to create hard examples (HEXA) to be used as augmented views for MoCo-v2 and DeepCluster-v2, leading to two variants, HEXA_{MoCo} and HEXA_{DCluster}, respectively. In our experiments, we pre-train models on ImageNet and evaluate them on multiple public benchmarks. Our evaluation shows that the two new algorithm variants outperform their original counterparts and achieve new state-of-the-art results on a wide range of tasks where limited task supervision is available for fine-tuning. These results verify that hard examples are instrumental in improving the generalization of the pre-trained models.
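To make the CutMix step mentioned in this abstract concrete, here is a minimal sketch of the generic CutMix image-mixing operation in plain Python. It is not the HEXA implementation: how HEXA combines the mixed image with pseudo-labels is not described in the abstract. The function name `cutmix`, the list-of-rows image layout and the `alpha` default are assumptions made for illustration.

```python
import math
import random


def cutmix(image_a, image_b, alpha=1.0, rng=None):
    """Paste a random rectangle of image_b into a copy of image_a.

    Images are lists of rows with equal shape (hypothetical layout).
    Returns the mixed image and the fraction of pixels kept from image_a.
    """
    rng = rng or random.Random()
    height, width = len(image_a), len(image_a[0])

    # Sample the target mixing ratio, then size the box so its area
    # is roughly (1 - lam) of the image.
    lam = rng.betavariate(alpha, alpha)
    cut_ratio = math.sqrt(1.0 - lam)
    cut_h, cut_w = int(height * cut_ratio), int(width * cut_ratio)

    # Random box centre; the box is clipped at the image borders.
    cy, cx = rng.randrange(height), rng.randrange(width)
    top, bottom = max(cy - cut_h // 2, 0), min(cy + cut_h // 2, height)
    left, right = max(cx - cut_w // 2, 0), min(cx + cut_w // 2, width)

    mixed = [row[:] for row in image_a]  # leave the input untouched
    for y in range(top, bottom):
        mixed[y][left:right] = image_b[y][left:right]

    # Recompute the ratio from the clipped box, since clipping changes the area.
    kept = 1.0 - (bottom - top) * (right - left) / (height * width)
    return mixed, kept
```

In supervised CutMix the returned ratio weights the two images' labels; in a self-supervised setting it would presumably weight the pseudo-labels instead.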

preprint · 2020 · arXiv

Few-shot Natural Language Generation for Task-Oriented Dialog

As a crucial component in task-oriented dialog systems, the Natural Language Generation (NLG) module converts a dialog act represented in a semantic form into a response in natural language. The success of traditional template-based or statistical models typically relies on heavily annotated data, which is infeasible for new domains. Therefore, it is pivotal for an NLG system to generalize well with limited labelled data in real applications. To this end, we present FewShotWoz, the first NLG benchmark to simulate the few-shot learning setting in task-oriented dialog systems. Further, we develop the SC-GPT model. It is pre-trained on a large annotated NLG corpus to acquire the controllable generation ability, and fine-tuned with only a few domain-specific labels to adapt to new domains. Experiments on FewShotWoz and the large Multi-Domain-WOZ datasets show that the proposed SC-GPT significantly outperforms existing methods, as measured by various automatic metrics and human evaluations.

preprint · 2020 · arXiv

Multi-View Learning for Vision-and-Language Navigation

Learning to navigate in a visual environment following natural language instructions is a challenging task because natural language instructions are highly variable, ambiguous, and under-specified. In this paper, we present a novel training paradigm, Learn from EveryOne (LEO), which leverages multiple instructions (as different views) for the same trajectory to resolve language ambiguity and improve generalization. By sharing parameters across instructions, our approach learns more effectively from limited training data and generalizes better in unseen environments. On the recent Room-to-Room (R2R) benchmark dataset, LEO achieves a 16% absolute improvement over a greedy agent as the base agent (25.3% $\rightarrow$ 41.4%) in Success Rate weighted by Path Length (SPL). Further, LEO is complementary to most existing models for vision-and-language navigation, allowing for easy integration with existing techniques. This leads to LEO+, which sets a new state of the art, pushing the R2R benchmark to 62% (a 9% absolute improvement).
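The SPL metric quoted above is commonly computed as the mean, over episodes, of the success indicator multiplied by the ratio of the shortest-path length to the longer of the agent's path and the shortest path. The sketch below assumes that standard definition; the function name `spl` and the tuple layout are hypothetical, and the abstract itself does not spell out the formula. On the reported numbers, the absolute gain is 41.4 − 25.3 = 16.1 points.

```python
def spl(episodes):
    """Success weighted by Path Length.

    episodes: iterable of (success, shortest_path_length, agent_path_length).
    A failed episode contributes 0; a successful one contributes
    shortest / max(agent, shortest), which is 1 only for an optimal path.
    """
    episodes = list(episodes)
    total = 0.0
    for success, shortest, taken in episodes:
        if success:
            total += shortest / max(taken, shortest)
    return total / len(episodes)


# One optimal success, one success with a path twice as long, one failure.
score = spl([(True, 10.0, 10.0), (True, 10.0, 20.0), (False, 10.0, 5.0)])
```

Because inefficient paths are penalized, SPL is never higher than the plain success rate on the same episodes.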

preprint · 2020 · arXiv

Oscar: Object-Semantics Aligned Pre-training for Vision-Language Tasks

Large-scale pre-training methods for learning cross-modal representations on image-text pairs are becoming popular for vision-language tasks. Existing methods simply concatenate image region features and text features as input to the model to be pre-trained and use self-attention to learn image-text semantic alignments in a brute-force manner. In this paper, we propose a new learning method, Oscar (Object-Semantics Aligned Pre-training), which uses object tags detected in images as anchor points to significantly ease the learning of alignments. Our method is motivated by the observation that the salient objects in an image can be accurately detected and are often mentioned in the paired text. We pre-train an Oscar model on a public corpus of 6.5 million text-image pairs and fine-tune it on downstream tasks, setting new state-of-the-art results on six well-established vision-language understanding and generation tasks.

preprint · 2020 · arXiv

Towards Learning a Generic Agent for Vision-and-Language Navigation via Pre-training

Learning to navigate in a visual environment following natural-language instructions is a challenging task, because the multimodal inputs to the agent are highly variable, and the training data on a new task is often limited. In this paper, we present the first pre-training and fine-tuning paradigm for vision-and-language navigation (VLN) tasks. By training on a large number of image-text-action triplets in a self-supervised manner, the pre-trained model provides generic representations of visual environments and language instructions. It can be easily used as a drop-in for existing VLN frameworks, leading to the proposed agent called Prevalent. It learns more effectively in new tasks and generalizes better in a previously unseen environment. The performance is validated on three VLN tasks. On the Room-to-Room benchmark, our model improves the state of the art from 47% to 51% on success rate weighted by path length. Further, the learned representation is transferable to other VLN tasks. On two recent tasks, vision-and-dialog navigation and "Help, Anna!", the proposed Prevalent leads to significant improvement over existing methods, achieving a new state of the art.

preprint · 2015 · arXiv

Recurrent Reinforcement Learning: A Hybrid Approach

Successful applications of reinforcement learning in real-world problems often require dealing with partially observable states. It is in general very challenging to construct and infer hidden states, as they often depend on the agent's entire interaction history and may require substantial domain knowledge. In this work, we investigate a deep-learning approach to learning the representation of states in partially observable tasks, with minimal prior knowledge of the domain. In particular, we propose a new family of hybrid models that combine the strengths of both supervised learning (SL) and reinforcement learning (RL), trained in a joint fashion. The SL component can be a recurrent neural network (RNN) or its long short-term memory (LSTM) version, which has the desired property of being able to capture long-term dependency on history, thus providing an effective way of learning the representation of hidden states. The RL component is a deep Q-network (DQN) that learns to optimize the control for maximizing long-term rewards. Extensive experiments on a direct mailing campaign problem demonstrate the effectiveness and advantages of the proposed approach, which performs best among a set of previous state-of-the-art methods.
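As a rough illustration of what the DQN component optimizes, the sketch below computes the standard one-step Q-learning target that a deep Q-network is usually regressed toward. It assumes the textbook formulation; the function name `td_target`, the `gamma` default and the `done` flag are hypothetical, and the abstract does not specify how the paper's hybrid model feeds the recurrent state into this step.

```python
def td_target(reward, next_q_values, gamma=0.99, done=False):
    """One-step Q-learning target: r + gamma * max_a' Q(s', a').

    next_q_values: estimated Q-values for every action in the next state.
    For a terminal transition (done=True) there is no future reward to add.
    """
    if done:
        return reward
    return reward + gamma * max(next_q_values)


# Reward 1.0 now, best next-state value 2.0, discount 0.9.
target = td_target(1.0, [0.5, 2.0], gamma=0.9)
```

In a hybrid model of the kind described, the next-state Q-values would presumably be computed from the hidden state produced by the RNN or LSTM rather than from the raw observation.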
