Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
44works
0followers
23topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

44 published item(s)

preprint2026arXiv

S^2tory: Story Spine Distillation for Movie Script Summarization

Movie scripts pose a fundamental challenge for automatic summarization due to their non-linear, cross-cut narrative structure, which makes surface-level saliency methods ineffective at preserving core story progression. To address this, we introduce S^2tory (Story Spine Distillation), a narratology-grounded framework that leverages character development trajectories to identify plot nuclei, the essential events that drive the narrative forward, while filtering out peripheral satellite events that merely enrich atmosphere or emotion. Our Narrative Expert Agent (NEAgent) performs theory-constrained reasoning, whose distilled knowledge conditions a small model to identify plot nuclei. Another model then uses these plot nuclei to generate the summary. Experiments on the MovieSum dataset demonstrate state-of-the-art semantic fidelity at approximately 3.5x compression, and zero-shot evaluation on BookSum confirms strong out-of-domain generalization. Human evaluation further validates that narratological theory provides an indispensable foundation for modeling complex, non-linear narratives.

preprint2023arXiv

DAP: Domain-aware Prompt Learning for Vision-and-Language Navigation

Following language instructions to navigate in unseen environments is a challenging task for autonomous embodied agents. With strong representation capabilities, pretrained vision-and-language models are widely used in VLN. However, most of them are trained on web-crawled general-purpose datasets, which incurs a considerable domain gap when used for VLN tasks. To address the problem, we propose a novel and model-agnostic domain-aware prompt learning (DAP) framework. For equipping the pretrained models with specific object-level and scene-level cross-modal alignment in VLN tasks, DAP applies a low-cost prompt tuning paradigm to learn soft visual prompts for extracting in-domain image semantics. Specifically, we first generate a set of in-domain image-text pairs with the help of the CLIP model. Then we introduce soft visual prompts in the input space of the visual encoder in a pretrained model. DAP injects in-domain visual knowledge into the visual encoder of the pretrained model in an efficient way. Experimental results on both R2R and REVERIE show the superiority of DAP compared to existing state-of-the-art methods.

preprint2022arXiv

Capturing, Reconstructing, and Simulating: the UrbanScene3D Dataset

We present UrbanScene3D, a large-scale data platform for research of urban scene perception and reconstruction. UrbanScene3D contains over 128k high-resolution images covering 16 scenes including large-scale real urban regions and synthetic cities with 136 km^2 area in total. The dataset also contains high-precision LiDAR scans and hundreds of image sets with different observation patterns, which provide a comprehensive benchmark to design and evaluate aerial path planning and 3D reconstruction algorithms. In addition, the dataset, which is built on Unreal Engine and Airsim simulator together with the manually annotated unique instance label for each building in the dataset, enables the generation of all kinds of data, e.g., 2D depth maps, 2D/3D bounding boxes, and 3D point cloud/mesh segmentations, etc. The simulator with physical engine and lighting system not only produce variety of data but also enable users to simulate cars or drones in the proposed urban environment for future research.

preprint2022arXiv

CLSEG: Contrastive Learning of Story Ending Generation

Story Ending Generation (SEG) is a challenging task in natural language generation. Recently, methods based on Pre-trained Language Models (PLM) have achieved great prosperity, which can produce fluent and coherent story endings. However, the pre-training objective of PLM-based methods is unable to model the consistency between story context and ending. The goal of this paper is to adopt contrastive learning to generate endings more consistent with story context, while there are two main challenges in contrastive learning of SEG. First is the negative sampling of wrong endings inconsistent with story contexts. The second challenge is the adaptation of contrastive learning for SEG. To address these two issues, we propose a novel Contrastive Learning framework for Story Ending Generation (CLSEG), which has two steps: multi-aspect sampling and story-specific contrastive learning. Particularly, for the first issue, we utilize novel multi-aspect sampling mechanisms to obtain wrong endings considering the consistency of order, causality, and sentiment. To solve the second issue, we well-design a story-specific contrastive training strategy that is adapted for SEG. Experiments show that CLSEG outperforms baselines and can produce story endings with stronger consistency and rationality.

preprint2022arXiv

CogIntAc: Modeling the Relationships between Intention, Emotion and Action in Interactive Process from Cognitive Perspective

Intention, emotion and action are important psychological factors in human activities, which play an important role in the interaction between individuals. How to model the interaction process between individuals by analyzing the relationship of their intentions, emotions, and actions at the cognitive level is challenging. In this paper, we propose a novel cognitive framework of individual interaction. The core of the framework is that individuals achieve interaction through external action driven by their inner intention. Based on this idea, the interactions between individuals can be constructed by establishing relationships between the intention, emotion and action. Furthermore, we conduct analysis on the interaction between individuals and give a reasonable explanation for the predicting results. To verify the effectiveness of the framework, we reconstruct a dataset and propose three tasks as well as the corresponding baseline models, including action abduction, emotion prediction and action generation. The novel framework shows an interesting perspective on mimicking the mental state of human beings in cognitive science.

preprint2022arXiv

COMMA: Modeling Relationship among Motivations, Emotions and Actions in Language-based Human Activities

Motivations, emotions, and actions are inter-related essential factors in human activities. While motivations and emotions have long been considered at the core of exploring how people take actions in human activities, there has been relatively little research supporting analyzing the relationship between human mental states and actions. We present the first study that investigates the viability of modeling motivations, emotions, and actions in language-based human activities, named COMMA (Cognitive Framework of Human Activities). Guided by COMMA, we define three natural language processing tasks (emotion understanding, motivation understanding and conditioned action generation), and build a challenging dataset Hail through automatically extracting samples from Story Commonsense. Experimental results on NLP applications prove the effectiveness of modeling the relationship. Furthermore, our models inspired by COMMA can better reveal the essential relationship among motivations, emotions and actions than existing methods.

preprint2022arXiv

Decomposing Magnetic Fields in Three Dimensions over the Central Molecular Zone

Measuring magnetic fields in the interstellar medium and obtaining their distribution along line-of-sight is very challenging with the traditional techniques. The Velocity Gradient Technique (VGT), which utilizes anisotropy of magnetohydrodynamic (MHD) turbulence, provides an attractive solution. Targeting the central molecular zone (CMZ), we test this approach by applying the VGT to $\rm ^{12}CO$ and $\rm ^{13}CO$ (J = 1-0) data cubes. We first used the SCOUSEPY algorithm to decompose the CO line emissions into separate velocity components, and then we constructed pseudo-Stokes parameters via the VGT to map the plane-of-the-sky magnetic fields in three-dimension. We present the decomposed magnetic field maps and investigate their significance. While the line-of-sight integrated magnetic field orientation is shown to be consistent with the polarized dust emission from the Planck survey at 353 GHz, individual velocity components may exhibit different magnetic fields. We present a scheme of magnetic field configuration in the CMZ based on the decomposed magnetic fields. In particular, we observe a nearly vertical magnetic field orientation in the dense clump near the Sgr B2 and a change in the outflow regions around the Sgr A*. Two high-velocity structures associated with an expanding ring in the CMZ show distinct swirling magnetic field structures. These results demonstrate the potential power of the VGT to decompose velocity or density-dependent magnetic structures.

preprint2022arXiv

Dir-MUSIC Algorithm for DOA Estimation of Partial Discharge Based on Signal Strength represented by Antenna Gain Array Manifold

Inspection robots are widely used in the field of smart grid monitoring in substations, and partial discharge (PD) is an important sign of the insulation state of equipments. PD direction of arrival (DOA) algorithms using conventional beamforming and time difference of arrival (TDOA) require large-scale antenna arrays and high computational complexity, which make them difficult to implement on inspection robots. To address this problem, a novel directional multiple signal classification (Dir-MUSIC) algorithm for PD direction finding based on signal strength is proposed, and a miniaturized directional spiral antenna circular array is designed in this paper. First, the Dir-MUSIC algorithm is derived based on the array manifold characteristics. This method uses strength intensity information rather than the TDOA information, which could reduce the computational difficulty and the requirement of array size. Second, the effects of signal-to-noise ratio (SNR) and array manifold error on the performance of the algorithm are discussed through simulations in detail. Then according to the positioning requirements, the antenna array and its arrangement are developed, optimized, and simulation results suggested that the algorithm has reliable direction-finding performance in the form of 6 elements. Finally, the effectiveness of the algorithm is tested by using the designed spiral circular array in real scenarios. The experimental results show that the PD direction-finding error is 3.39°, which can meet the need for Partial discharge DOA estimation using inspection robots in substations.

preprint2022arXiv

Do You Know My Emotion? Emotion-Aware Strategy Recognition towards a Persuasive Dialogue System

Persuasive strategy recognition task requires the system to recognize the adopted strategy of the persuader according to the conversation. However, previous methods mainly focus on the contextual information, little is known about incorporating the psychological feedback, i.e. emotion of the persuadee, to predict the strategy. In this paper, we propose a Cross-channel Feedback memOry Network (CFO-Net) to leverage the emotional feedback to iteratively measure the potential benefits of strategies and incorporate them into the contextual-aware dialogue information. Specifically, CFO-Net designs a feedback memory module, including strategy pool and feedback pool, to obtain emotion-aware strategy representation. The strategy pool aims to store historical strategies and the feedback pool is to obtain updated strategy weight based on feedback emotional information. Furthermore, a cross-channel fusion predictor is developed to make a mutual interaction between the emotion-aware strategy representation and the contextual-aware dialogue information for strategy recognition. Experimental results on \textsc{PersuasionForGood} confirm that the proposed model CFO-Net is effective to improve the performance on M-F1 from 61.74 to 65.41.

preprint2022arXiv

Dynamic Cardiac MRI Reconstruction Using Combined Tensor Nuclear Norm and Casorati Matrix Nuclear Norm Regularizations

Low-rank tensor models have been applied in accelerating dynamic magnetic resonance imaging (dMRI). Recently, a new tensor nuclear norm based on t-SVD has been proposed and applied to tensor completion. Inspired by the different properties of the tensor nuclear norm (TNN) and the Casorati matrix nuclear norm (MNN), we introduce a combined TNN and Casorati MNN regularizations framework to reconstruct dMRI, which we term as TMNN. The proposed method simultaneously exploits the spatial structure and the temporal correlation of the dynamic MR data. The optimization problem can be efficiently solved by the alternating direction method of multipliers (ADMM). In order to further improve the computational efficiency, we develop a fast algorithm under the Cartesian sampling scenario. Numerical experiments based on cardiac cine MRI and perfusion MRI data demonstrate the performance improvement over the traditional Casorati nuclear norm regularization method.

preprint2022arXiv

FedHiSyn: A Hierarchical Synchronous Federated Learning Framework for Resource and Data Heterogeneity

Federated Learning (FL) enables training a global model without sharing the decentralized raw data stored on multiple devices to protect data privacy. Due to the diverse capacity of the devices, FL frameworks struggle to tackle the problems of straggler effects and outdated models. In addition, the data heterogeneity incurs severe accuracy degradation of the global model in the FL training process. To address aforementioned issues, we propose a hierarchical synchronous FL framework, i.e., FedHiSyn. FedHiSyn first clusters all available devices into a small number of categories based on their computing capacity. After a certain interval of local training, the models trained in different categories are simultaneously uploaded to a central server. Within a single category, the devices communicate the local updated model weights to each other based on a ring topology. As the efficiency of training in the ring topology prefers devices with homogeneous resources, the classification based on the computing capacity mitigates the impact of straggler effects. Besides, the combination of the synchronous update of multiple categories and the device communication within a single category help address the data heterogeneity issue while achieving high accuracy. We evaluate the proposed framework based on MNIST, EMNIST, CIFAR10 and CIFAR100 datasets and diverse heterogeneous settings of devices. Experimental results show that FedHiSyn outperforms six baseline methods, e.g., FedAvg, SCAFFOLD, and FedAT, in terms of training accuracy and efficiency.

preprint2022arXiv

Latency-Aware Collaborative Perception

Collaborative perception has recently shown great potential to improve perception capabilities over single-agent perception. Existing collaborative perception methods usually consider an ideal communication environment. However, in practice, the communication system inevitably suffers from latency issues, causing potential performance degradation and high risks in safety-critical applications, such as autonomous driving. To mitigate the effect caused by the inevitable latency, from a machine learning perspective, we present the first latency-aware collaborative perception system, which actively adapts asynchronous perceptual features from multiple agents to the same time stamp, promoting the robustness and effectiveness of collaboration. To achieve such a feature-level synchronization, we propose a novel latency compensation module, called SyncNet, which leverages feature-attention symbiotic estimation and time modulation techniques. Experiments results show that the proposed latency aware collaborative perception system with SyncNet can outperforms the state-of-the-art collaborative perception method by 15.6% in the communication latency scenario and keep collaborative perception being superior to single agent perception under severe latency.

preprint2022arXiv

Learning to Generalize to More: Continuous Semantic Augmentation for Neural Machine Translation

The principal task in supervised neural machine translation (NMT) is to learn to generate target sentences conditioned on the source inputs from a set of parallel sentence pairs, and thus produce a model capable of generalizing to unseen instances. However, it is commonly observed that the generalization performance of the model is highly influenced by the amount of parallel data used in training. Although data augmentation is widely used to enrich the training data, conventional methods with discrete manipulations fail to generate diverse and faithful training samples. In this paper, we present a novel data augmentation paradigm termed Continuous Semantic Augmentation (CsaNMT), which augments each training instance with an adjacency semantic region that could cover adequate variants of literal expression under the same meaning. We conduct extensive experiments on both rich-resource and low-resource settings involving various language pairs, including WMT14 English-{German,French}, NIST Chinese-English and multiple low-resource IWSLT translation tasks. The provided empirical evidences show that CsaNMT sets a new level of performance among existing augmentation techniques, improving on the state-of-the-art by a large margin. The core codes are contained in Appendix E.

preprint2022arXiv

Magnetic Field of Molecular Gas Measured with the Velocity Gradient Technique I. Orion A

Magnetic fields play an important role in the evolution of molecular clouds and star formation. Using the Velocity Gradient Technique (VGT) model, we measured the magnetic field in Orion A using the 12CO, 13CO, and C18O (1-0) emission lines at a scale of 0.07 pc. The measured B-field shows an east-west orientation that is perpendicular to the integral shaped filament of Orion A at large scale. The VGT magnetic fields obtained from 13CO and C18O are in agreement with the B-field that is measured from the Planck 353 GHz dust polarization at a scale of 0.55 pc. Removal of density effects by using a Velocity Decomposition Algorithm can significantly improve the accuracy of the VGT in tracing magnetic fields with the 12CO (1-0) line. The magnetic field strength of seven sub-clouds, OMC-1, OMC-2, OMC-3, OMC-4, OMC-5, L 1641-N, and NGC 1999 has also been estimated with the Davis-Chandrasekhar-Fermi (DCF) and MM2 technique, and these are found to be in agreement with previous results obtained from dust polarization at far-infrared and sub-millimeter wavelengths. At smaller scales, the VGT proves a good method to measure magnetic fields.

preprint2022arXiv

Modeling Intention, Emotion and External World in Dialogue Systems

Intention, emotion and action are important elements in human activities. Modeling the interaction process between individuals by analyzing the relationships between these elements is a challenging task. However, previous work mainly focused on modeling intention and emotion independently, and neglected of exploring the mutual relationships between intention and emotion. In this paper, we propose a RelAtion Interaction Network (RAIN), consisting of Intention Relation Module and Emotion Relation Module, to jointly model mutual relationships and explicitly integrate historical intention information. The experiments on the dataset show that our model can take full advantage of the intention, emotion and action between individuals and achieve a remarkable improvement over BERT-style baselines. Qualitative analysis verifies the importance of the mutual interaction between the intention and emotion.

preprint2022arXiv

MuKEA: Multimodal Knowledge Extraction and Accumulation for Knowledge-based Visual Question Answering

Knowledge-based visual question answering requires the ability of associating external knowledge for open-ended cross-modal scene understanding. One limitation of existing solutions is that they capture relevant knowledge from text-only knowledge bases, which merely contain facts expressed by first-order predicates or language descriptions while lacking complex but indispensable multimodal knowledge for visual understanding. How to construct vision-relevant and explainable multimodal knowledge for the VQA scenario has been less studied. In this paper, we propose MuKEA to represent multimodal knowledge by an explicit triplet to correlate visual objects and fact answers with implicit relations. To bridge the heterogeneous gap, we propose three objective losses to learn the triplet representations from complementary views: embedding structure, topological relation and semantic space. By adopting a pre-training and fine-tuning learning strategy, both basic and domain-specific multimodal knowledge are progressively accumulated for answer prediction. We outperform the state-of-the-art by 3.35% and 6.08% respectively on two challenging knowledge-required datasets: OK-VQA and KRVQA. Experimental results prove the complementary benefits of the multimodal knowledge with existing knowledge bases and the advantages of our end-to-end framework over the existing pipeline methods. The code is available at https://github.com/AndersonStra/MuKEA.

preprint2022arXiv

Multi-level Contrastive Learning Framework for Sequential Recommendation

Sequential recommendation (SR) aims to predict the subsequent behaviors of users by understanding their successive historical behaviors. Recently, some methods for SR are devoted to alleviating the data sparsity problem (i.e., limited supervised signals for training), which take account of contrastive learning to incorporate self-supervised signals into SR. Despite their achievements, it is far from enough to learn informative user/item embeddings due to the inadequacy modeling of complex collaborative information and co-action information, such as user-item relation, user-user relation, and item-item relation. In this paper, we study the problem of SR and propose a novel multi-level contrastive learning framework for sequential recommendation, named MCLSR. Different from the previous contrastive learning-based methods for SR, MCLSR learns the representations of users and items through a cross-view contrastive learning paradigm from four specific views at two different levels (i.e., interest- and feature-level). Specifically, the interest-level contrastive mechanism jointly learns the collaborative information with the sequential transition patterns, and the feature-level contrastive mechanism re-observes the relation between users and items via capturing the co-action information (i.e., co-occurrence). Extensive experiments on four real-world datasets show that the proposed MCLSR outperforms the state-of-the-art methods consistently.

preprint2022arXiv

Multi-scale Magnetic Fields in the Central Molecular Zone: Inference from the Gradient Technique

The central molecular zone (CMZ) plays an essential role in regulating the nuclear ecosystem of our Galaxy. To get an insight into the magnetic fields of the CMZ, we employ the Gradient Technique (GT), which is rooted in the anisotropy of magnetohydrodynamic turbulence. Our analysis is based on the data of multiple wavelengths, including molecular emission lines, radio 1.4 GHz continuum image, and Herschel 70 $μ$m image, as well as ionized [Ne II] and Paschen-alpha emissions. The results are compared with the observations of Planck 353 GHz and High-resolution Airborne Wideband Camera Plus (HWAC+) 53 $μ$m polarized dust emissions. We map the orientation of the magnetic field at multiple wavelengths across the central molecular zone, including close-ups of the Radio Arc and Sagittarius A West regions, on multi scales from $\sim$ 0.1 pc to 10 pc. The magnetic fields towards the central molecular zone traced by GT are globally compatible with the polarization measurements, accounting for the contribution from the galactic foreground and background. This correspondence suggests that the magnetic field and turbulence are dynamically crucial in the galactic center. We find that the magnetic fields associated with the Arched filaments and the thermal components of the Radio Arc are in good agreement with the HAWC+ polarization. Our measurement towards the non-thermal Radio Arc reveals the poloidal magnetic field components in the galactic center. For Sagittarius A West region, we find a great agreement between the GT measurement using [Ne II] emission and HWAC+ 53 $μ$m observation. We use GT to predict the magnetic fields associated with ionized Paschen-alpha gas down to scales of 0.1 pc.

preprint2022arXiv

Neural Message Passing for Visual Relationship Detection

Visual relationship detection aims to detect the interactions between objects in an image; however, this task suffers from combinatorial explosion due to the variety of objects and interactions. Since the interactions associated with the same object are dependent, we explore the dependency of interactions to reduce the search space. We explicitly model objects and interactions by an interaction graph and then propose a message-passing-style algorithm to propagate the contextual information. We thus call the proposed method neural message passing (NMP). We further integrate language priors and spatial cues to rule out unrealistic interactions and capture spatial interactions. Experimental results on two benchmark datasets demonstrate the superiority of our proposed method. Our code is available at https://github.com/PhyllisH/NMP.

preprint2022arXiv

Probing Three-Dimensional Magnetic Fields: I -- Polarized Dust Emission

Polarized dust emission is widely used to trace the plane-of-the-sky (POS) component of interstellar magnetic fields in two dimensions. Its potential to access three-dimensional magnetic fields, including the inclination angle of the magnetic fields relative to the line-of-sight (LOS), is crucial for a variety of astrophysical problems. Based on the statistical features of observed polarization fraction and POS Alfvén Mach number $\overline{M_{\rm A}}_{,\bot}$ distribution, we present a new method for estimating the inclination angle. The magnetic field fluctuations raised by anisotropic magnetohydrodynamic (MHD) turbulence are taken into account in our method. By using synthetic dust emission generated from 3D compressible MHD turbulence simulations, we show that the fluctuations are preferentially perpendicular to the mean magnetic field. We find the inclination angle is the major agent for depolarization, while fluctuations of magnetic field strength and density have an insignificant contribution. We propose and demonstrate that the mean inclination angle over a region of interest can be calculated from the polarization fraction in a strongly magnetized reference position, where $\overline{M_{\rm A}}_{,\bot}^2\ll1$. We test and show that the new method can trace the 3D magnetic fields in sub-Alfvénic, trans-Alfvénic, and moderately super-Alfvénic conditions ($0.4\lesssim M_{\rm A}\lesssim1.2$). We numerically quantify that the difference between the estimated inclination angle and actual inclination angle ranges from 0 to $20^\circ$ with a median value of $\le10^\circ$.

preprint2022arXiv

Superdiffusion of Cosmic Rays in Compressible Magnetized Turbulence

Owing to the complexity of turbulent magnetic fields, modeling the diffusion of cosmic rays is challenging. Based on the current understanding of anisotropic magnetohydrodynamic (MHD) turbulence, we use test particles to examine the cosmic rays' superdiffusion in the direction perpendicular to the mean magnetic field. By changing Alfven Mach number $M_A$ and sonic Mach number $M_S$ of compressible MHD simulations, our study covers a wide range of astrophysical conditions including subsonic warm gas phase and supersonic cold molecular gas. We show that freely streaming cosmic rays' perpendicular displacement increases as 3/2 to the power of the time traveled along local magnetic field lines. This power-law index changes to 3/4 if the parallel propagation is diffusive. We find that the cosmic rays' parallel mean free path decreases in a power-law relation of $M_A^{-2}$ in supersonic turbulence. We investigate the energy fraction of slow, fast, and Alfvenic modes and confirm the dominance of Alfvenic modes in the perpendicular superdiffusion. In particular, the energy fraction of fast mode, which is the main agent for pitch-angle scattering, increases with $M_A$ but is insensitive to $M_S \ge 2$. Accordingly, our results suggest that the suppressed diffusion in supersonic molecular clouds arises primarily due to the variations of $M_A$ instead of $M_S$.

preprint2022arXiv

Testing the parametric form of the conditional variance in regressions based on distance covariance

In this paper, we propose a new test for checking the parametric form of the conditional variance based on distance covariance in nonlinear and nonparametric regression models. Inherit from the nice properties of distance covariance, our test is very easy to implement in practice and less effected by the dimensionality of covariates. The asymptotic properties of the test statistic are investigated under the null and alternative hypotheses. We show that the proposed test is consistent against any alternative and can detect local alternatives converging to the null hypothesis at the parametric rate 1/root(n) in both the nonlinear and nonparametric settings. As the limiting null distribution of the test statistic is intractable, we propose a residual bootstrap to approximate the limiting null distribution. Simulation studies are presented to assess the finite sample performance of the proposed test. We also apply the proposed test to a real data set for illustration.

preprint2022arXiv

The Velocity Statistics of Turbulent Clouds in the Presence of Gravity, Magnetic fields, Radiation, and Outflow Feedback

The interaction of turbulence, magnetic fields, self-gravity, and stellar feedback within molecular clouds is crucial for understanding star formation. We study the effects of self-gravity and outflow feedback on the properties of the turbulent velocity via the structure function over length scales from $\sim$ 0.01 pc to 2 pc. We analyze a series of three-dimensional, magnetohydrodynamical (MHD) simulations of star cluster formation. We find outflow feedback can change the scaling of velocity fluctuations but still roughly being in between Kolmogorov and Burgers turbulence. We observe that self-gravity and protostellar outflows increase the velocity fluctuations over all length scales. Outflows can amplify the velocity fluctuations by up to a factor of $\sim$7 on scales $\sim$ 0.01 - 0.2 pc and drive turbulence up to a scale of $\sim$ 1 pc. The amplified velocity fluctuations provide more support against gravity and enhance fragmentation on small scales. The self-gravity's effect is more significant on smaller dense clumps and it increases the fraction of the compressive velocity component up to a scale of $\sim$ 0.2 pc. However, outflow feedback drives both solenoidal and compressive modes, but it induces a higher fraction of solenoidal modes relative to compressive modes. Thus, with outflows, the dense core ends up with a slightly higher fraction of solenoidal modes. We find that the compressible fraction is fairly constant with about 1/3 on scales $\sim$ 0.1 - 0.2 pc. The combined effect of enhanced velocity dispersion and reduced compressive fraction contributes to a reduction in the star formation rate.

preprint2021arXiv

A Synergy of the Velocity Gradients Technique and the Probability Density Functions for Identifying Gravitational Collapse in Self-Absorbing Media

The Velocity Gradients Technique (VGT) and the Probability Density Functions (PDFs) of mass density are tools to study turbulence, magnetic fields, and self-gravity in molecular clouds. However, self-absorption can significantly make the observed intensity different from the column density structures. In this work, we study the effects of self-absorption on the VGT and the intensity PDFs utilizing three synthetic emission lines of CO isotopologs $^{12}$CO (1-0), $^{13}$CO (1-0), and C$^{18}$O (1-0). We confirm that the performance of VGT is insensitive to the radiative transfer effect. We numerically show the possibility of constructing 3D magnetic fields tomography through VGT. We find that the intensity PDFs change their shape from the pure log-normal to a distribution that exhibits a power-law tail depending on the optical depth for supersonic turbulence. We conclude the change of CO isotopologs' intensity PDFs can be independent of self-gravity, which makes the intensity PDFs less reliable in identifying gravitational collapsing regions. We compute the intensity PDFs for a star-forming region NGC 1333 and find the change of intensity PDFs in observation agrees with our numerical results. The synergy of VGT and the column density PDFs confirms that the self-gravitating gas occupies a large volume in NGC 1333.

preprint2021arXiv

End-to-End Human Object Interaction Detection with HOI Transformer

We propose HOI Transformer to tackle human object interaction (HOI) detection in an end-to-end manner. Current approaches either decouple HOI task into separated stages of object detection and interaction classification or introduce surrogate interaction problem. In contrast, our method, named HOI Transformer, streamlines the HOI pipeline by eliminating the need for many hand-designed components. HOI Transformer reasons about the relations of objects and humans from global image context and directly predicts HOI instances in parallel. A quintuple matching loss is introduced to force HOI predictions in a unified way. Our method is conceptually much simpler and demonstrates improved accuracy. Without bells and whistles, HOI Transformer achieves $26.61\% $ $ AP $ on HICO-DET and $52.9\%$ $AP_{role}$ on V-COCO, surpassing previous methods with the advantage of being much simpler. We hope our approach will serve as a simple and effective alternative for HOI tasks. Code is available at https://github.com/bbepoch/HoiTransformer .

preprint2021arXiv

IIE-NLP-Eyas at SemEval-2021 Task 4: Enhancing PLM for ReCAM with Special Tokens, Re-Ranking, Siamese Encoders and Back Translation

This paper introduces our systems for all three subtasks of SemEval-2021 Task 4: Reading Comprehension of Abstract Meaning. To help our model better represent and understand abstract concepts in natural language, we well-design many simple and effective approaches adapted to the backbone model (RoBERTa). Specifically, we formalize the subtasks into the multiple-choice question answering format and add special tokens to abstract concepts, then, the final prediction of question answering is considered as the result of subtasks. Additionally, we employ many finetuning tricks to improve the performance. Experimental results show that our approaches achieve significant performance compared with the baseline systems. Our approaches achieve eighth rank on subtask-1 and tenth rank on subtask-2.

preprint2021arXiv

Shuffle algebra realization of quantum affine superalgebra $U_{v}(\hat{\mathfrak{D}}(2,1;θ))$

We give shuffle algebra realization of positive part of quantum affine superalgebra $U_{v}(\widehat{\mathfrak{D}}(2,1;θ))$ associated to any simple root systems. We also determine the shuffle algebra associated to $\widehat{\mathfrak{sl}}(2|1)$ with odd root system when $v$ is a primitive root of unity of even order, generalizing results in \cite{FJMMT03}.

preprint2021arXiv

Velocity Gradients: Magnetic Field Tomography towards the Supernova Remnant W44

As a novel approach for tracing interstellar magnetic fields, the Velocity Gradient Technique (VGT) has been proven to be effective for probing magnetic fields in the diffuse interstellar medium (ISM). In this work, we verify the VGT in a broader context by applying the technique to a molecular cloud interacting with the supernovae remnant (SNR) W44. We probe the magnetic fields with the VGT using CO, $\rm HCO^+$, and H I emission lines and make a comparison with the Planck 353 GHZ dust polarization. We show that the VGT gives an accurate measurement that coheres with the Planck polarization especially in intense molecular gas emission regions. We further study the foreground's contribution to the polarization that results in a misalignment between the VGT and the Planck measurements in low-intensity molecular gas areas. We advance the VGT to achieve magnetic field tomography by decomposing the W44 into various velocity components. We show that W44's velocity component at $v\sim45$ km s$^{-1}$ exhibits the largest coverage and gives the best agreement with Planck polarization in terms of magnetic field orientation.

preprint2020arXiv

Anomalous thermal transport in metallic transition-metal nitrides originated from strong electron-phonon interactions

Metallic transition-metal nitrides (TMNs) are promising conductive ceramics for many applications, whose thermal transport is of great importance in device design. It is found metallic TiN and HfN hold anomalous thermal transport behaviors compared to common metals and nonmetallic TMNs. They have extremely large intrinsic phonon thermal conductivity mainly due to the large acoustic-optic phonon frequency gaps. The phonon thermal conductivity is reduced by two orders of magnitude as the phonon-isotope and phonon-electron scatterings are considered, which also induce the nontrivial temperature-independent behavior of phonon thermal conductivity. Nesting Fermi surfaces exist in both TiN and HfN, which cause the strong electron-phonon coupling strengths and heavily harm the transport of phonons and electrons. The phonon component takes an abnormally large ratio in total thermal conductivity, as 29% for TiN and 26% for HfN at 300 K. The results for thin films are also presented and it is shown that the phonon thermal conductivity can be efficiently limited by size. Our findings provide a deep understanding on the thermal transport in metallic TMNs and expand the scope of heat conduction theory in metal.

preprint2020arXiv

Asymptotic Optimality of the Binomial-Exhaustive Policy for Polling Systems with Large Switchover Times

We study an optimal-control problem of polling systems with large switchover times, when a holding cost is incurred on the queues. In particular, we consider a stochastic network with a single server that switches between several buffers (queues) according to a pre-specified order, assuming that the switchover times between the queues are large relative to the processing times of individual jobs. Due to its complexity, computing an optimal control for such a system is prohibitive, and so we instead search for an asymptotically optimal control. To this end, we first solve an optimal control problem for a deterministic relaxation (namely, for a fluid model), that is represented as a hybrid dynamical system. We then "translate" the solution to that fluid problem to a binomial-exhaustive policy for the underlying stochastic system, and prove that this policy is asymptotically optimal in a large-switchover-time scaling regime, provided a certain uniform integrability (UI) condition holds. Finally, we demonstrate that the aforementioned UI condition holds in the following cases: (i) the holding cost has (at most) linear growth, and all service times have finite second moments; (ii) the holding cost grows at most at a polynomial rate (of any degree), and the service-time distributions possess finite moment generating functions.

preprint2020arXiv

Collaborative Motion Prediction via Neural Motion Message Passing

Motion prediction is essential and challenging for autonomous vehicles and social robots. One challenge of motion prediction is to model the interaction among traffic actors, which could cooperate with each other to avoid collisions or form groups. To address this challenge, we propose neural motion message passing (NMMP) to explicitly model the interaction and learn representations for directed interactions between actors. Based on the proposed NMMP, we design the motion prediction systems for two settings: the pedestrian setting and the joint pedestrian and vehicle setting. Both systems share a common pattern: we use an individual branch to model the behavior of a single actor and an interactive branch to model the interaction between actors, while with different wrappers to handle the varied input formats and characteristics. The experimental results show that both systems outperform the previous state-of-the-art methods on several existing benchmarks. Besides, we provide interpretability for interaction learning.

preprint2020arXiv

DAM: Deliberation, Abandon and Memory Networks for Generating Detailed and Non-repetitive Responses in Visual Dialogue

Visual Dialogue task requires an agent to be engaged in a conversation with human about an image. The ability of generating detailed and non-repetitive responses is crucial for the agent to achieve human-like conversation. In this paper, we propose a novel generative decoding architecture to generate high-quality responses, which moves away from decoding the whole encoded semantics towards the design that advocates both transparency and flexibility. In this architecture, word generation is decomposed into a series of attention-based information selection steps, performed by the novel recurrent Deliberation, Abandon and Memory (DAM) module. Each DAM module performs an adaptive combination of the response-level semantics captured from the encoder and the word-level semantics specifically selected for generating each word. Therefore, the responses contain more detailed and non-repetitive descriptions while maintaining the semantic accuracy. Furthermore, DAM is flexible to cooperate with existing visual dialogue encoders and adaptive to the encoder structures by constraining the information selection mode in DAM. We apply DAM to three typical encoders and verify the performance on the VisDial v1.0 dataset. Experimental results show that the proposed models achieve new state-of-the-art performance with high-quality responses. The code is available at https://github.com/JXZe/DAM.

preprint2020arXiv

Existence and Approximations of Moments for Polling Systems under the Binomial-Exhaustive Policy

We establish sufficient conditions for the existence of moments of the steady-state queue in polling systems operating under the binomial-exhaustive policy (BEP). We assume that the server switches between the different buffers according to a pre-specified table, and that switchover times are incurred whenever the server moves from one buffer to the next. We further assume that customers arrive according to independent Poisson processes, and that the service and switchover times are independent random variables with general distributions. We then propose a simple scheme to approximate the moments, which is shown to be asymptotically exact as the switchover times grow without bound, and whose computation complexity does not grow with the order of the moment. Finally, we demonstrate that the proposed asymptotic approximation for the moments is related to the fluid limit under a large-switchover-time scaling; thus, similar approximations can be easily derived for other server-switching policies, by simply identifying the fluid limits under those controls. Numerical examples demonstrate the effectiveness of our approximations for the moments under BEP and under other policies, and their increased accuracy as the switchover times increase.

preprint2020arXiv

IIE-NLP-NUT at SemEval-2020 Task 4: Guiding PLM with Prompt Template Reconstruction Strategy for ComVE

This paper introduces our systems for the first two subtasks of SemEval Task4: Commonsense Validation and Explanation. To clarify the intention for judgment and inject contrastive information for selection, we propose the input reconstruction strategy with prompt templates. Specifically, we formalize the subtasks into the multiple-choice question answering format and construct the input with the prompt templates, then, the final prediction of question answering is considered as the result of subtasks. Experimental results show that our approaches achieve significant performance compared with the baseline systems. Our approaches secure the third rank on both official test sets of the first two subtasks with an accuracy of 96.4 and an accuracy of 94.3 respectively.

preprint2020arXiv

Magnetic Field Morphology in Interstellar Clouds with the Velocity Gradient Technique

Magnetic fields, while ubiquitous in many astrophysical environments, are challenging to measure observationally. Based on the properties of anisotropy of eddies in magnetized turbulence, the Velocity Gradient Technique is a method synergistic to dust polarimetry that is capable of tracing plane-of-the-sky magnetic field, measuring the magnetization of interstellar media and estimating the fraction of gravitational collapsing gas in molecular clouds using spectral line observations. In this paper, we apply this technique to five low-mass star-forming molecular clouds in the Gould Belt and compare the results to the magnetic-field orientation obtained from polarized dust emission. We find the estimates of magnetic field orientations and magnetization for both methods are statistically similar. We estimate the fraction of collapsing gas in the selected clouds. By means of the Velocity Gradient Technique, we also present the plane-of-the-sky magnetic field orientation and magnetization of the Smith cloud, for which dust polarimetry data are unavailable.

preprint2020arXiv

Modeling of Galactic Foreground Polarization with Velocity Gradients

The detection of primordial B-mode polarization is still challenging due to the relatively low amplitude compared to the galactic foregrounds. To remove the contribution from the foreground, a comprehensive picture of the galactic magnetic field is indispensable. The Velocity Gradient Technique (VGT) is promising in tracing magnetic fields based on the modern understanding of the magneto-hydrodynamic turbulence. In this work, we apply VGT to a HI region containing an intermediate velocity cloud and a local velocity cloud, which are distinguishable in position-position-velocity space. We show that VGT gives an excellent agreement with the Planck polarization and stellar polarization. We confirm the advantages of VGT in constructing the 3D galactic magnetic field.

preprint2020arXiv

Multiscale Collaborative Deep Models for Neural Machine Translation

Recent evidence reveals that Neural Machine Translation (NMT) models with deeper neural networks can be more effective but are difficult to train. In this paper, we present a MultiScale Collaborative (MSC) framework to ease the training of NMT models that are substantially deeper than those used previously. We explicitly boost the gradient back-propagation from top to bottom levels by introducing a block-scale collaboration mechanism into deep NMT models. Then, instead of forcing the whole encoder stack directly learns a desired representation, we let each encoder block learns a fine-grained representation and enhance it by encoding spatial dependencies using a context-scale collaboration. We provide empirical evidence showing that the MSC nets are easy to optimize and can obtain improvements of translation quality from considerably increased depth. On IWSLT translation tasks with three translation directions, our extremely deep models (with 72-layer encoders) surpass strong baselines by +2.2~+3.1 BLEU points. In addition, our deep MSC achieves a BLEU score of 30.56 on WMT14 English-German task that significantly outperforms state-of-the-art deep NMT models.

preprint2020arXiv

Study Turbulence and Probe Magnetic Field Using Gradients Technique: Application to HI-to-H2 Transition Regions

The atomic-to-molecular (HI-to-H$_2$) transition in photodissociation regions (PDRs) has been investigated over the last several decades through analytic and numerical modeling. However, classical PDR models typically assume uniform density gas, ignoring the turbulent nature of the interstellar medium. Recently, Bialy et al. (2017b, 2019) have presented a theoretical framework for studying the HI-to-H$_2$ in a realistic turbulent medium with a non-homogeneous density structure. Here we extend these turbulent-chemical models to explore the possibility of tracing the magnetic field direction in turbulent PDRs using the Gradients Technique. We utilize both subsonic and supersonic magnetohydrodynamic numerical simulations for chemical HI/H$_2$ balance calculations. We confirm that the density fluctuations induced by turbulence can disperse the distribution of H$_2$ and HI fraction. We find that the energy spectrum of moment maps gets shallower when the sonic Mach number MS increases. We explore the ability in magnetic field tracing of gradients of higher-order velocity centroids and compare their performance with that of traditional velocity centroid gradients (VCGs) and with intensity gradients (IGs). We find that the velocity gradients of the second-order centroids (VC$_2$Gs) are more accurate than VCGs and IGs in probing the magnetic field orientation.

preprint2020arXiv

Velocity Gradient in the Presence of Self-Gravity: Identifying Gravity-induced Inflow and Determining Collapsing Stage

Understanding how star formation is regulated requires studying the energy balance between turbulence, magnetic fields, stellar feedback, and gravity within molecular clouds. However, identifying the transition region where the gravity takes over remains elusive. Recent studies of the Velocity Gradient Technique (VGT), which is an advanced tool for magnetic field studies, reveal that the gradients of spectroscopic observables change their directions by 90 degrees with respect to the magnetic fields in the regions of gravitational collapse. In this study, we perform 3D MHD numerical simulations. We observe that star formation successfully proceeds in strongly magnetized and fully ionized media. We confirm that the self-gravity induces the change of gradients' orientation and high gradients' amplitude. We explore two ways of identifying collapsing self-gravitating regions through the double-peak feature in the histogram of gradients' orientation and the curvature of gradients. We show that velocity gradients' morphology and amplitude can be synthetically used to trace the convergent inflows. By comparing with the column density Probability Density Functions (N-PDFs) method, we show that VGT is a powerful new tool for studying the gas dynamics and tracing magnetic field in star-forming regions. By analogy with VGT, we extend the Intensity Gradient Technique (IGT) to locate the gravitational collapsing region and shocks. We demonstrate the synergy of VGT and IGT can determine the collapsing stages in a star-forming region.

preprint2019arXiv

A Benchmarking of DCM Based Architectures for Position, Velocity and Torque Controlled Humanoid Robots

This paper contributes towards the benchmarking of control architectures for bipedal robot locomotion. It considers architectures that are based on the Divergent Component of Motion (DCM) and composed of three main layers: trajectory optimization, simplified model control, and whole-body QP control layer. While the first two layers use simplified robot models, the whole-body QP control layer uses a complete robot model to produce either desired positions, velocities, or torques inputs at the joint-level. This paper then compares two implementations of the simplified model control layer, which are tested with position, velocity, and torque control modes for the whole-body QP control layer. In particular, both an instantaneous and a Receding Horizon controller are presented for the simplified model control layer. We show also that one of the proposed architectures allows the humanoid robot iCub to achieve a forward walking velocity of 0.3372 meters per second, which is the highest walking velocity achieved by the iCub robot.

preprint2019arXiv

Analyzing Client Behavior in a Syringe Exchange Program

Multiple syringe exchange programs serve the Chicago metropolitan area, providing support for drug users to help prevent infectious diseases. Using data from one program over a ten-year period, we study the behavior of its clients, focusing on the temporal process governing their visits to service locations and on their demographics. We construct a phase-type distribution with an affine relationship between model parameters and features of an individual client. The phase-type distribution governs inter-arrival times between reoccurring visits of each client and is informed by characteristics of a client including age, gender, ethnicity, and more. The inter-arrival time model is a sub-model in a simulation that we construct for the larger system, which allows us to provide a personalized prediction regarding the client's time-to-return to a service location so that better intervention decisions can be made with the help of simulation.

preprint2019arXiv

Influence of Boundaries and Thermostatting on Nonequilibrium Molecular Dynamics Simulations of Heat Conduction in Solids

Nonequilibrium molecular dynamics (NEMD) has been extensively used to study thermal transport at various length scales in many materials. In this method, two local thermostats at different temperatures are used to generate a nonequilibrium steady state with a constant heat flux. Conventionally, the thermal conductivity of a finite system is calculated as the ratio between the heat flux and the temperature gradient extracted from the linear part of the temperature profile away from the local thermostats. Here we show that, with a proper choice of the thermostat, the nonlinear part of the temperature profile should actually not be excluded in thermal transport calculations. We compare NEMD results against those from the atomistic Green's function method in the ballistic regime, and those from the homogeneous nonequilibrium molecular dynamics method in the ballistic-to-diffusive regime. These comparisons suggest that in all the transport regimes, one should directly calculate the thermal conductance from the temperature difference between the heat source and sink and, if needed, convert it to the thermal conductivity by multiplying it with the system length. Furthermore, we find that the Langevin thermostat outperforms the Nosé-Hoover (chain) thermostat in NEMD simulations because of its stochastic and local nature. We show that this is particularly important for studying asymmetric carbon-based nanostructures, for which the Nosé-Hoover thermostat can produce artifacts leading to unphysical thermal rectification. Our findings are important to obtain correct results from molecular dynamics simulations of nanoscale heat transport as the accuracy of the interatomic potentials is rapidly improving.

preprint2019arXiv

Intensity Gradients Technique: Synergy with Velocity Gradients and Polarization Studies

Magnetic fields are ubiquitous in the interstellar medium but are notoriously difficult to study through observation. Making use of the advances in our understanding of MHD turbulence and turbulent reconnection, the Velocity Gradient Technique (VGT) was suggested and successfully applied to study magnetic fields utilizing spectroscopic data. Applying the tools developed for VGT to intensity statistics, we introduce the Intensity Gradients Technique (IGT) as a complementary tool that can be used synergistically with VGT. In this paper, we apply IGT to a diffuse HI region selected from the GALFA-HI survey and compare the intensity gradient maps with those obtained using velocity gradients as well as Planck polarization measurements. We demonstrate the possibility of using IGT and VGT for both studying the magnetic field and identifying shocks in the diffuse interstellar medium. We also explore the ability of IGT in locating self-gravitating regions and calculating Alfvenic Mach number, both alone and in combination with VGT and polarimetry. We compare IGT with the Histogram of Relative Orientation (HRO), which utilizes intensity gradients to characterize the relative orientation of column density structures and local magnetic fields.

preprint2019arXiv

Predictions of Cosmic Microwave Background Foregrounds Dust Polarization Using Velocity Gradients

The observations of fluctuations in the cosmic microwave background provide information about primordial inhomogeneities in the universe. However, the B-mode polarization of the inflationary gravitational wave is contaminated by the Galactic foreground polarized radiation arising from dust aligned by interstellar magnetic fields. To trace magnetic fields we use the Velocity Gradient Technique, which employs a modern understanding of the nature of magneto-hydrodynamics turbulent motions. In this paper, we combine the VGT with the Principal Component Analysis (PCA) to improve the accuracy of magnetic field tracing. We apply the VGT-PCA to the high-resolution neutral hydrogen data from the GALFA-HI survey to predict the polarization of dust. We report that the predicted directions of dust polarization provide good correspondence with those reported by Planck 353GHz, the alignment measure between the two AM=0.79. We show that our results statistically agree with the Planck polarization in terms of magnetic field tracing. We find that the variation of dust emission efficiency across the sky is small. Using our maps of predicted polarization we calculate the ratio of the E and B modes and show that BB/EE=0.53, which is similar to the result from Plank polarization.