Source author record

Bin He

Bin He appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Computation and Language hep-ph cond-mat.mtrl-sci Computer Vision physics.optics Robotics Artificial Intelligence cond-mat.supr-con hep-ex physics.chem-ph

Catalog footprint

What is connected

16works

10topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2022arXiv

Enhance Accuracy: Sensitivity and Uncertainty Theory in LiDAR Odometry and Mapping

Currently, the improvement of LiDAR poses estimation accuracy is an urgent need for mobile robots. Research indicates that diverse LiDAR points have different influences on the accuracy of pose estimation. This study aimed to select a good point set to enhance accuracy. Accordingly, the sensitivity and uncertainty of LiDAR point residuals were formulated as a fundamental basis for derivation and analysis. High-sensitivity and low -uncertainty point residual terms are preferred to achieve higher pose estimation accuracy. The proposed selection method has been theoretically proven to be capable of achieving a global statistical optimum. It was tested on artificial data and compared with the KITTI benchmark. It was also implemented in LiDAR odometry (LO) and LiDAR inertial odometry (LIO), both indoors and outdoors. The experiments revealed that utilizing selected LiDAR point residuals simultaneously enhances optimization accuracy, decreases residual terms, and guarantees real-time performance.

preprint2022arXiv

Exploring Effective Information Utilization in Multi-Turn Topic-Driven Conversations

Conversations are always related to certain topics. However, it is challenging to fuse dialogue history and topic information from various sources at the same time in current dialogue generation models because of the input length limit of pre-trained language models (PLMs). In order to expand the information that PLMs can utilize, we encode topic and dialogue history information using certain prompts with multiple channels of Fusion-in-Decoder (FiD) and explore the influence of three different channel settings. In this paper, our experiments focus on a specific Chinese dataset named NaturalConv, where the conversation revolves around a piece of recent news. We thoroughly compared different dialogue models and different FiD channel settings. Empirical results show that by combining our proposed whole passage channel with additional history channel, our methods can achieve competitive performance on NaturalConv, making it possible to encode various information from excessively long texts.

preprint2022arXiv

Knowledge-enriched Attention Network with Group-wise Semantic for Visual Storytelling

As a technically challenging topic, visual storytelling aims at generating an imaginary and coherent story with narrative multi-sentences from a group of relevant images. Existing methods often generate direct and rigid descriptions of apparent image-based contents, because they are not capable of exploring implicit information beyond images. Hence, these schemes could not capture consistent dependencies from holistic representation, impairing the generation of reasonable and fluent story. To address these problems, a novel knowledge-enriched attention network with group-wise semantic model is proposed. Three main novel components are designed and supported by substantial experiments to reveal practical advantages. First, a knowledge-enriched attention network is designed to extract implicit concepts from external knowledge system, and these concepts are followed by a cascade cross-modal attention mechanism to characterize imaginative and concrete representations. Second, a group-wise semantic module with second-order pooling is developed to explore the globally consistent guidance. Third, a unified one-stage story generation model with encoder-decoder structure is proposed to simultaneously train and infer the knowledge-enriched attention network, group-wise semantic module and multi-modal story generation decoder in an end-to-end fashion. Substantial experiments on the popular Visual Storytelling dataset with both objective and subjective evaluation metrics demonstrate the superior performance of the proposed scheme as compared with other state-of-the-art methods.

preprint2022arXiv

KQA Pro: A Dataset with Explicit Compositional Programs for Complex Question Answering over Knowledge Base

Complex question answering over knowledge base (Complex KBQA) is challenging because it requires various compositional reasoning capabilities, such as multi-hop inference, attribute comparison, set operation. Existing benchmarks have some shortcomings that limit the development of Complex KBQA: 1) they only provide QA pairs without explicit reasoning processes; 2) questions are poor in diversity or scale. To this end, we introduce KQA Pro, a dataset for Complex KBQA including ~120K diverse natural language questions. We introduce a compositional and interpretable programming language KoPL to represent the reasoning process of complex questions. For each question, we provide the corresponding KoPL program and SPARQL query, so that KQA Pro serves for both KBQA and semantic parsing tasks. Experimental results show that SOTA KBQA methods cannot achieve promising results on KQA Pro as on current datasets, which suggests that KQA Pro is challenging and Complex KBQA requires further research efforts. We also treat KQA Pro as a diagnostic dataset for testing multiple reasoning skills, conduct a thorough evaluation of existing models and discuss further directions for Complex KBQA. Our codes and datasets can be obtained from https://github.com/shijx12/KQAPro_Baselines.

preprint2022arXiv

Weakly Supervised Disentangled Representation for Goal-conditioned Reinforcement Learning

Goal-conditioned reinforcement learning is a crucial yet challenging algorithm which enables agents to achieve multiple user-specified goals when learning a set of skills in a dynamic environment. However, it typically requires millions of the environmental interactions explored by agents, which is sample-inefficient. In the paper, we propose a skill learning framework DR-GRL that aims to improve the sample efficiency and policy generalization by combining the Disentangled Representation learning and Goal-conditioned visual Reinforcement Learning. In a weakly supervised manner, we propose a Spatial Transform AutoEncoder (STAE) to learn an interpretable and controllable representation in which different parts correspond to different object attributes (shape, color, position). Due to the high controllability of the representations, STAE can simply recombine and recode the representations to generate unseen goals for agents to practice themselves. The manifold structure of the learned representation maintains consistency with the physical position, which is beneficial for reward calculation. We empirically demonstrate that DR-GRL significantly outperforms the previous methods in sample efficiency and policy generalization. In addition, DR-GRL is also easy to expand to the real robot.

preprint2021arXiv

Anomalous thermoelectric effects and quantum oscillations in the kagome metal CsV$_3$Sb$_5$

The kagome metal compounds $A$V$_3$Sb$_5$ ($A$ = K, Rb, and Cs) feature a wealth of phenomena including nontrivial band topology, charge density wave (CDW), and superconductivity. One intriguing property is the time-reversal symmetry breaking in the CDW state without local moments, which leads to anomalous transport responses. Here, we report the investigation of magneto-thermoelectric effects on high-quality CsV$_3$Sb$_5$ single crystals. A large anomalous Nernst effect is observed at temperatures below 30 K. Multiple Fermi surfaces with small effective masses are revealed by quantum oscillations in Nernst and Seebeck signals under high magnetic field. Furthermore, we find an unknown frequency, and attribute it to the magnetic breakdown across two smaller Fermi surfaces. A gap around 20 meV can be resolved from the breakdown threshold field, which we propose to be introduced by the CDW. These results shed new light on the CDW-related phenomena, particularly in $A$V$_3$Sb$_5$ compounds.

preprint2021arXiv

DISCOS: Bridging the Gap between Discourse Knowledge and Commonsense Knowledge

Commonsense knowledge is crucial for artificial intelligence systems to understand natural language. Previous commonsense knowledge acquisition approaches typically rely on human annotations (for example, ATOMIC) or text generation models (for example, COMET.) Human annotation could provide high-quality commonsense knowledge, yet its high cost often results in relatively small scale and low coverage. On the other hand, generation models have the potential to automatically generate more knowledge. Nonetheless, machine learning models often fit the training data well and thus struggle to generate high-quality novel knowledge. To address the limitations of previous approaches, in this paper, we propose an alternative commonsense knowledge acquisition framework DISCOS (from DIScourse to COmmonSense), which automatically populates expensive complex commonsense knowledge to more affordable linguistic knowledge resources. Experiments demonstrate that we can successfully convert discourse knowledge about eventualities from ASER, a large-scale discourse knowledge graph, into if-then commonsense knowledge defined in ATOMIC without any additional annotation effort. Further study suggests that DISCOS significantly outperforms previous supervised approaches in terms of novelty and diversity with comparable quality. In total, we can acquire 3.4M ATOMIC-like inferential commonsense knowledge by populating ATOMIC on the core part of ASER. Codes and data are available at https://github.com/HKUST-KnowComp/DISCOS-commonsense.

preprint2021arXiv

Giant anomalous Nernst signal in the antiferromagnet YbMnBi2

Searching for a high anomalous Nernst effect (ANE) is crucial for thermoelectric energy conversion applications because the associated unique transverse geometry facilitates module fabrication. Topological ferromagnets with large Berry curvatures show high ANEs; however, they face drawbacks such as strong magnetic disturbances and low mobility due to high magnetization. Herein, we demonstrate that YbMnBi2, a canted antiferromagnet, has a large ANE conductivity of ~10 Am-1K-1 that surpasses the common high values (i.e. 3-5 Am-1K-1) observed so far in ferromagnets. The canted spin structure of Mn guarantees a nonzero Berry curvature but generates only a weak magnetization three orders of magnitude lower than that of general ferromagnets. The heavy Bi with a large spin-orbit coupling enables a high ANE and low thermal conductivity, whereas its highly dispersive px/y orbitals ensure low resistivity. The high anomalous transverse thermoelectric performance and extremely small magnetization makes YbMnBi2 an excellent candidate for transverse thermoelectrics.

preprint2016arXiv

Building a comprehensive syntactic and semantic corpus of Chinese clinical texts

Objective: To build a comprehensive corpus covering syntactic and semantic annotations of Chinese clinical texts with corresponding annotation guidelines and methods as well as to develop tools trained on the annotated corpus, which supplies baselines for research on Chinese texts in the clinical domain. Materials and methods: An iterative annotation method was proposed to train annotators and to develop annotation guidelines. Then, by using annotation quality assurance measures, a comprehensive corpus was built, containing annotations of part-of-speech (POS) tags, syntactic tags, entities, assertions, and relations. Inter-annotator agreement (IAA) was calculated to evaluate the annotation quality and a Chinese clinical text processing and information extraction system (CCTPIES) was developed based on our annotated corpus. Results: The syntactic corpus consists of 138 Chinese clinical documents with 47,424 tokens and 2553 full parsing trees, while the semantic corpus includes 992 documents that annotated 39,511 entities with their assertions and 7695 relations. IAA evaluation shows that this comprehensive corpus is of good quality, and the system modules are effective. Discussion: The annotated corpus makes a considerable contribution to natural language processing (NLP) research into Chinese texts in the clinical domain. However, this corpus has a number of limitations. Some additional types of clinical text should be introduced to improve corpus coverage and active learning methods should be utilized to promote annotation efficiency. Conclusions: In this study, several annotation guidelines and an annotation method for Chinese clinical texts were proposed, and a comprehensive corpus with its NLP modules were constructed, providing a foundation for further study of applying NLP techniques to Chinese texts in the clinical domain.

preprint2014arXiv

Polarization-dependent exciton dynamics in tetracene single crystals

We conduct polarization-dependent ultrafast spectroscopy to study the dynamics of singlet fission in tetracene single crystals. The spectrotemporal species for singlet and triplet excitons in transient absorption spectra are found to be strongly dependent on probe polarization. By carefully analyzing the polarization dependence, the signals contributed by different transitions related to singlet excitons have been disentangled, which is further applied to construct the correlation between dynamics of singlet and triplet excitons. The anisotropy of exciton dynamics provides an alternative approach to tackle the long-standing challenge in understanding the mechanism of singlet fission in organic semiconductors.

preprint2012arXiv

125 GeV Higgs, Type III Seesaw and Gauge-Higgs Unification

Recently, both the ATLAS and CMS experiments have observed an excess of events that could be the first evidence for a 125 GeV Higgs boson. This is a few GeV below the (absolute) vacuum stability bound on the Higgs mass in the Standard Model (SM), assuming a Planck mass ultraviolet (UV) cutoff. In this paper, we study some implications of a 125 GeV Higgs boson for new physics in terms of the vacuum stability bound. We first consider the seesaw extension of the SM and find that in type III seesaw, the vacuum stability bound on the Higgs mass can be as low as 125 GeV for the seesaw scale around a TeV. Next we dicuss some alternative new physics models which provide an effective ultraviolet cutoff lower than the Planck mass. An effective cutoff Λ\simeq 10^{11} GeV leads to a vacuum stability bound on the Higgs mass of 125 GeV. In a gauge-Higgs unification scenario with five-dimensional flat spacetime, the so-called gauge-Higgs condition allows us to predict a Higgs mass of 125 GeV, with the compactification scale of the extra-dimension being identified as the cutoff scale Λ\simeq 10^{11} GeV. Identifying the compactification scale with the unification scale of the SM SU(2) gauge coupling and the top quark Yukawa coupling yields a Higgs mass of 121\pm 2 GeV.

preprint2012arXiv

Impact of LHC Searches on NLSP Top Squark and Gluino Mass

We explore the implications of 7 TeV LHC searches for a scenario in which one of the stops is the next-to lightest supersymmetric particle (NLSP). The NLSP stop (\tilde{t}_1) is assumed to decay exclusively into neutralino and charm quark. We consider processes where the stops are pair produced together with a hard QCD jet. We also consider stop quarks from gluino decays, \tilde{g}\to t\tilde{t}_1^\ast+\bar{t}\tilde{t}_1. We show that the monojet ATLAS and CMS searches corresponding to 1 fb^{-1} of integrated luminosity are sensitive to stop masses of up to 160 GeV, with the 20% neutralino-stop coannihilation region essentially ruled out for M_{\tilde{t}_1}\lesssim 140 GeV. The region M_{\tilde{t}_1}\lesssim 130 GeV is excluded with even relatively larger mass difference, M_{\tilde{t}_1}-M_{\tildeχ_1^0}\sim 40 GeV, by the multi-jets search. The b-jet and same-sign dilepton searches are sensitive to a heavier gluino because they only pick up gluino pair production events followed by top quarks decaying into b-jets and same-sign dileptons, respectively. We find that the LHC data places a lower limit on the gluino mass in this scenario of about 600 GeV (700 GeV) from b-jets (same-sign dileptons) searches.

preprint2012arXiv

Inverse Seesaw in NMSSM and 126 GeV Higgs Boson

We consider extensions of the next-to-minimal supersymmetric model (NMSSM) in which the observed neutrino masses are generated through a TeV scale inverse seesaw mechanism. The new particles associated with this mechanism can have sizable couplings to the Higgs field which can yield a large contribution to the mass of the lightest CP-even Higgs boson. With this new contribution, a 126 GeV Higgs is possible along with order of 200 GeV masses for the stop quarks for a broad range of \tanβ. The Higgs production and decay in the diphoton channel can be enhanced due to this new contribution. It is also possible to solve the little hierarchy problem in this model without invoking a maximal value for the NMSSM trilinear coupling and without severe restrictions on the value of \tanβ.

preprint2012arXiv

Ultrafast pump-probe spectroscopic signatures of superconducting and pseudogap phases in YBa2Cu3O7-δ films

Femtosecond pump-probe spectroscopy is applied to identify transient optical signatures of phase transitions in optimally doped YBa2Cu3O7-δ films. To elucidate the dynamics of superconducting and pseudogap phases, the slow thermal component is removed from the time-domain traces of photo-induced reflectivity in a high-flux regime with low frequency pulse rate. The rescaled data exhibit distinct signatures of the phase separation with abrupt changes at the onsets of TSC and TPG in excellent agreement with transport data. Compared to the superconducting phase, the response of the pseudogap phase is characterized by the strongly reduced reflectivity change accompanied by a faster recovery time.

preprint2010arXiv

Higgs Boson Mass Bounds in Seesaw Extended Standard Model with Non-Minimal Gravitational Coupling

In the presence of non-minimal gravitational coupling $ ξH^\dagger H {\cal R}$ between the standard model (SM) Higgs doublet $H$ and the curvature scalar ${\cal R}$, the effective ultraviolet cutoff scale is given by $Λ_{\rm cut}\approx m_P/ξ$, where $m_P$ is the reduced Planck mass, and $ξ\gtrsim 1$ is a dimensionless coupling constant. In type I and type III seesaw extended SM, which can naturally explain the observed solar and atmospheric neutrino oscillations, we investigate the implications of this non-minimal gravitational coupling for the SM Higgs boson mass bounds based on vacuum stability and perturbativity arguments. A lower bound on the Higgs boson mass close to 120 GeV is realized with type III seesaw and $ξ\sim 10-10^3$.

preprint2010arXiv

New Fermions at the LHC and Mass of the Higgs Boson

Unification at M_{GUT}\sim 3\times 10^{16} GeV of the three Standard Model (SM) gauge couplings can be achieved by postulating the existence of a pair of vectorlike fermions carrying SM charges and masses of order 300 GeV -- 1 TeV. The presence of these fermions significantly modifies the vacuum stability and perturbativity bounds on the mass of the SM Higgs boson. The new vacuum stability bound in this extended SM is estimated to be 117 GeV, to be compared with the SM prediction of about 128 GeV. An upper bound of 190 GeV is obtained based on perturbativity arguments. The impact on these predictions of type I seesaw physics is also discussed. The discovery of a relatively `light' Higgs boson with mass \sim 117 GeV could signal the presence of new vectorlike fermions within reach of the LHC.

Bin He

What is connected

Connect this record

See the researcher in context

Building this map preview

16 published item(s)

Enhance Accuracy: Sensitivity and Uncertainty Theory in LiDAR Odometry and Mapping

Exploring Effective Information Utilization in Multi-Turn Topic-Driven Conversations

Knowledge-enriched Attention Network with Group-wise Semantic for Visual Storytelling

KQA Pro: A Dataset with Explicit Compositional Programs for Complex Question Answering over Knowledge Base

Weakly Supervised Disentangled Representation for Goal-conditioned Reinforcement Learning

Anomalous thermoelectric effects and quantum oscillations in the kagome metal CsV$_3$Sb$_5$

DISCOS: Bridging the Gap between Discourse Knowledge and Commonsense Knowledge

Giant anomalous Nernst signal in the antiferromagnet YbMnBi2

Building a comprehensive syntactic and semantic corpus of Chinese clinical texts

Polarization-dependent exciton dynamics in tetracene single crystals

125 GeV Higgs, Type III Seesaw and Gauge-Higgs Unification

Impact of LHC Searches on NLSP Top Squark and Gluino Mass

Inverse Seesaw in NMSSM and 126 GeV Higgs Boson

Ultrafast pump-probe spectroscopic signatures of superconducting and pseudogap phases in YBa2Cu3O7-δ films

Higgs Boson Mass Bounds in Seesaw Extended Standard Model with Non-Minimal Gravitational Coupling

New Fermions at the LHC and Mass of the Higgs Boson