Source author record

Xindi Wang

Xindi Wang appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Computation and Language Information Retrieval Machine Learning Networking and Internet Architecture Neurons and Cognition

Catalog footprint

What is connected

5works

5topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2022arXiv

KenMeSH: Knowledge-enhanced End-to-end Biomedical Text Labelling

Currently, Medical Subject Headings (MeSH) are manually assigned to every biomedical article published and subsequently recorded in the PubMed database to facilitate retrieving relevant information. With the rapid growth of the PubMed database, large-scale biomedical document indexing becomes increasingly important. MeSH indexing is a challenging task for machine learning, as it needs to assign multiple labels to each article from an extremely large hierachically organized collection. To address this challenge, we propose KenMeSH, an end-to-end model that combines new text features and a dynamic \textbf{K}nowledge-\textbf{en}hanced mask attention that integrates document features with MeSH label hierarchy and journal correlation features to index MeSH terms. Experimental results show the proposed method achieves state-of-the-art performance on a number of measures.

preprint2022arXiv

MeSHup: A Corpus for Full Text Biomedical Document Indexing

Medical Subject Heading (MeSH) indexing refers to the problem of assigning a given biomedical document with the most relevant labels from an extremely large set of MeSH terms. Currently, the vast number of biomedical articles in the PubMed database are manually annotated by human curators, which is time consuming and costly; therefore, a computational system that can assist the indexing is highly valuable. When developing supervised MeSH indexing systems, the availability of a large-scale annotated text corpus is desirable. A publicly available, large corpus that permits robust evaluation and comparison of various systems is important to the research community. We release a large scale annotated MeSH indexing corpus, MeSHup, which contains 1,342,667 full text articles in English, together with the associated MeSH labels and metadata, authors, and publication venues that are collected from the MEDLINE database. We train an end-to-end model that combines features from documents and their associated labels on our corpus and report the new baseline.

preprint2021arXiv

Technical Report for A Joint User Scheduling and Trajectory Planning Data Collection Strategy for the UAV-assisted WSN

Unmanned aerial vehicles (UAVs) are usually dispatched as mobile sinks to assist data collection in large-scale wireless sensor networks (WSNs). However, when considering the limitations of UAV's mobility and communication capabilities in a large-scale WSN, some sensor nodes may run out of storage space as they fail to offload their data to the UAV for an extended period of time. To minimize the data loss caused by the above issue, a joint user scheduling and trajectory planning data collection strategy is proposed in this letter, which is formulated as a non-convex optimization problem. The problem is further divided into two sub-problems and solved sequentially. Simulation results show that the proposed strategy is more effective in minimizing data loss rate than other strategies.

preprint2019arXiv

End-to-end Named Entity Recognition and Relation Extraction using Pre-trained Language Models

Named entity recognition (NER) and relation extraction (RE) are two important tasks in information extraction and retrieval (IE \& IR). Recent work has demonstrated that it is beneficial to learn these tasks jointly, which avoids the propagation of error inherent in pipeline-based systems and improves performance. However, state-of-the-art joint models typically rely on external natural language processing (NLP) tools, such as dependency parsers, limiting their usefulness to domains (e.g. news) where those tools perform well. The few neural, end-to-end models that have been proposed are trained almost completely from scratch. In this paper, we propose a neural, end-to-end model for jointly extracting entities and their relations which does not rely on external NLP tools and which integrates a large, pre-trained language model. Because the bulk of our model's parameters are pre-trained and we eschew recurrence for self-attention, our model is fast to train. On 5 datasets across 3 domains, our model matches or exceeds state-of-the-art performance, sometimes by a large margin.

preprint2016arXiv

Differentially Categorized Structural Connectome Hubs are Involved in Differential Microstructural Basis and Functional Implications and Contribute to Individual Identification

Human brain structural networks contain sets of centrally embedded hub regions that enable efficient information communication. However, it remains largely unknown about categories of structural brain hubs and their microstructural, functional and cognitive characteristics as well as contributions to individual identification. Here, we employed three multi-modal imaging data sets with structural MRI, diffusion MRI and resting-state functional MRI to construct individual structural brain networks, identify brain hubs based on eight commonly used graph-nodal metrics, and perform comprehensive validation analysis. We found three categories of structural hubs in the brain networks, namely, aggregated, distributed and connector hubs. Spatially, these distinct categories of hubs were primarily located in the default-mode system and additionally in the visual and limbic systems for aggregated hubs, in the frontoparietal system for distributed hubs, and in the sensorimotor and ventral attention systems for connector hubs. Importantly, these three categories of hubs exhibited various distinct characteristics, with the highest level of microstructural organization in the aggregated hubs, the largest wiring cost and topological vulnerability in the distributed hubs, and the highest functional associations and cognitive flexibility in the connector hubs, although they behaved better regarding these characteristics compared to non-hubs. Finally, all three categories of hub indices displayed high across-session spatial similarities and acted as a structural fingerprint with high predictive rates (100%, 100% and 84.2%) for individual identification. Collectively, our findings highlighted three categories of brain hubs with differential microstructural, functional and cognitive associations, which may shed light on the topological mechanisms of the human connectome.