Source author record

Xi Zhou

Xi Zhou appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Computer Vision Computation and Language cond-mat.supr-con Artificial Intelligence physics.med-ph Software Engineering Sound

Catalog footprint

What is connected

9works

7topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

AFFormer: Adaptive Feature Fusion Transformer for V2X Cooperative Perception under Channel Impairments

Accurate 3D object detection is essential for ensuring the safety of autonomous vehicles. Cooperative perception, which leverages vehicle-to-everything (V2X) communication to share perceptual data, enhances detection but is vulnerable to channel impairments, such as noise, fading, and interference. To strengthen the reliability of intelligent transportation systems, this work improves the robustness of V2X cooperative perception under communication conditions that reflect common channel impairments. This paper proposes an Adaptive Feature Fusion Transformer (AFFormer), a Transformer-based framework that mitigates the adverse effects of corrupted features by modeling temporal, inter-agent, and spatial correlations. AFFormer introduces three key modules: Multi-Agent and Temporal Aggregation for context-aware fusion across agents and over time, Dual Spatial Attention for efficient modeling of spatial dependencies, and Uncertainty-Guided Fusion for entropy-driven refinement of fused features. A teacher-student knowledge distillation strategy further enhances robustness by aligning fused features with reliable early-collaboration supervision. AFFormer is validated on the V2XSet and DAIR-V2X datasets, where it consistently outperforms existing methods under both ideal and impaired communication conditions, demonstrating improved robustness to communication-induced feature degradation while maintaining a competitive efficiency-accuracy trade-off.

preprint2022arXiv

AI-based Medical e-Diagnosis for Fast and Automatic Ventricular Volume Measurement in the Patients with Normal Pressure Hydrocephalus

Based on CT and MRI images acquired from normal pressure hydrocephalus (NPH) patients, using machine learning methods, we aim to establish a multi-modal and high-performance automatic ventricle segmentation method to achieve efficient and accurate automatic measurement of the ventricular volume. First, we extract the brain CT and MRI images of 143 definite NPH patients. Second, we manually label the ventricular volume (VV) and intracranial volume (ICV). Then, we use machine learning method to extract features and establish automatic ventricle segmentation model. Finally, we verify the reliability of the model and achieved automatic measurement of VV and ICV. In CT images, the Dice similarity coefficient (DSC), Intraclass Correlation Coefficient (ICC), Pearson correlation, and Bland-Altman analysis of the automatic and manual segmentation result of the VV were 0.95, 0.99, 0.99, and 4.2$\pm$2.6 respectively. The results of ICV were 0.96, 0.99, 0.99, and 6.0$\pm$3.8 respectively. The whole process takes 3.4$\pm$0.3 seconds. In MRI images, the DSC, ICC, Pearson correlation, and Bland-Altman analysis of the automatic and manual segmentation result of the VV were 0.94, 0.99, 0.99, and 2.0$\pm$0.6 respectively. The results of ICV were 0.93, 0.99, 0.99, and 7.9$\pm$3.8 respectively. The whole process took 1.9$\pm$0.1 seconds. We have established a multi-modal and high-performance automatic ventricle segmentation method to achieve efficient and accurate automatic measurement of the ventricular volume of NPH patients. This can help clinicians quickly and accurately understand the situation of NPH patient's ventricles.

preprint2020arXiv

Accurate Temporal Action Proposal Generation with Relation-Aware Pyramid Network

Accurate temporal action proposals play an important role in detecting actions from untrimmed videos. The existing approaches have difficulties in capturing global contextual information and simultaneously localizing actions with different durations. To this end, we propose a Relation-aware pyramid Network (RapNet) to generate highly accurate temporal action proposals. In RapNet, a novel relation-aware module is introduced to exploit bi-directional long-range relations between local features for context distilling. This embedded module enhances the RapNet in terms of its multi-granularity temporal proposal generation ability, given predefined anchor boxes. We further introduce a two-stage adjustment scheme to refine the proposal boundaries and measure their confidence in containing an action with snippet-level actionness. Extensive experiments on the challenging ActivityNet and THUMOS14 benchmarks demonstrate our RapNet generates superior accurate proposals over the existing state-of-the-art methods.

preprint2020arXiv

DCMN+: Dual Co-Matching Network for Multi-choice Reading Comprehension

Multi-choice reading comprehension is a challenging task to select an answer from a set of candidate options when given passage and question. Previous approaches usually only calculate question-aware passage representation and ignore passage-aware question representation when modeling the relationship between passage and question, which obviously cannot take the best of information between passage and question. In this work, we propose dual co-matching network (DCMN) which models the relationship among passage, question and answer options bidirectionally. Besides, inspired by how human solve multi-choice questions, we integrate two reading strategies into our model: (i) passage sentence selection that finds the most salient supporting sentences to answer the question, (ii) answer option interaction that encodes the comparison information between answer options. DCMN integrated with the two strategies (DCMN+) obtains state-of-the-art results on five multi-choice reading comprehension datasets which are from different domains: RACE, SemEval-2018 Task 11, ROCStories, COIN, MCTest.

preprint2020arXiv

Receptive Multi-granularity Representation for Person Re-Identification

A key for person re-identification is achieving consistent local details for discriminative representation across variable environments. Current stripe-based feature learning approaches have delivered impressive accuracy, but do not make a proper trade-off between diversity, locality, and robustness, which easily suffers from part semantic inconsistency for the conflict between rigid partition and misalignment. This paper proposes a receptive multi-granularity learning approach to facilitate stripe-based feature learning. This approach performs local partition on the intermediate representations to operate receptive region ranges, rather than current approaches on input images or output features, thus can enhance the representation of locality while remaining proper local association. Toward this end, the local partitions are adaptively pooled by using significance-balanced activations for uniform stripes. Random shifting augmentation is further introduced for a higher variance of person appearing regions within bounding boxes to ease misalignment. By two-branch network architecture, different scales of discriminative identity representation can be learned. In this way, our model can provide a more comprehensive and efficient feature representation without larger model storage costs. Extensive experiments on intra-dataset and cross-dataset evaluations demonstrate the effectiveness of the proposed approach. Especially, our approach achieves a state-of-the-art accuracy of 96.2%@Rank-1 or 90.0%@mAP on the challenging Market-1501 benchmark.

preprint2020arXiv

Semantics-aware BERT for Language Understanding

The latest work on language representations carefully integrates contextualized features into language model training, which enables a series of success especially in various machine reading comprehension and natural language inference tasks. However, the existing language representation models including ELMo, GPT and BERT only exploit plain context-sensitive features such as character or word embeddings. They rarely consider incorporating structured semantic information which can provide rich semantics for language representation. To promote natural language understanding, we propose to incorporate explicit contextual semantics from pre-trained semantic role labeling, and introduce an improved language representation model, Semantics-aware BERT (SemBERT), which is capable of explicitly absorbing contextual semantics over a BERT backbone. SemBERT keeps the convenient usability of its BERT precursor in a light fine-tuning way without substantial task-specific modifications. Compared with BERT, semantics-aware BERT is as simple in concept but more powerful. It obtains new state-of-the-art or substantially improves results on ten reading comprehension and language inference tasks.

preprint2015arXiv

Superconductivity and Disorder Effect in TlNi$_2$Se$_{2-x}$S$_x$ crystals

After our first discovery of superconductivity (SC) with $T_C$=3.7 K in TlNi$_2$Se$_2$, we grew successfully a series of TlNi$_2$Se$_{2-x}$S$_x$ (0.0 $\leq$ x $\leq$2.0) single crystals. The measurements of resistivity, susceptibility and specific heat were carried out. We found that SC with $T_C$=2.3 K also emerges in TlNi$_2$S$_2$ crystal, which appears to involve heavy electrons with an effective mass $m^*$=13$\sim$25 $m_b$, as inferred from the normal state electronic specific heat and the upper critical field, $H_{C2}(T)$. It was found that the $T_C$ and superconducting volume fraction in TlNi$_2$Se$_{2-x}$S$_x$ crystals changes with the disorder degree induced by the partial substitution of S for Se, which is characterized by the residual resistivity ratio (\textit{RRR}). The effect of the disorder on SC may provide some information for understanding the mechanism of SC in this new Ni-chalcogenide system.

preprint2013arXiv

Multiband Superconductivity of Heavy Electrons in TlNi2Se2 single crystal

Superconductivity has been first observed in TlNi$_2$Se$_2$ at T$_C$=3.7 K and appears to involve heavy electrons with an effective mass $m^*$=14$\sim$20 $m_b$, as inferred from the normal state electronic specific heat and the upper critical field, H_${C2}$(T). Although the zero-field electronic specific heat data, $C_{es}(T)$, in low temperatures (T < 1/4 T$_C$) can be fitted with a gap BCS model, indicating that TlNi$_2$Se$_2$ is a fully gapped superconductor, the two-gap BCS model presents the best fit to all the $C_{es}(T)$ data below $T_C$. It is also found that the electronic specific heat coefficient in the mixed state, $γ_N(H)$, exhibits a \textit{H}$^{1/2}$ behavior, which was also observed in some \textit{s}-wave superconductors, although once considered as a common feature of the \textit{d}-wave superconductors. Anyway, these results indicate that TlNi$_2$Se$_2$, as a non-magnetic analogue of TlFe$_x$Se$_2$ superconductor, is a multiband superconductor of heavy electron system.

preprint2012arXiv

MIDI-LAB, a Powerful Visual Basic Program for Creating MIDI Music

Creating MIDI music can be a practical challenge. In the past, working with it was difficult and frustrating to all but the most accomplished and determined. Now, however, we are offering a powerful Visual Basic program called MIDI-LAB, that is easy to learn, and instantly rewarding to even the newest users. MIDI-LAB has been developed to give users the ability to quickly create music with a limitless variety of tunes, tempos, speeds, volumes, instruments, rhythms and major scales. This program has a simple, intuitive, and user-friendly interface, which provides a straightforward way to enter musical data with Numbered Musical Notation (NMN) and immediately create MIDI music. The key feature of this program is the digitalization of music input. It vastly simplifies creating, editing, and saving MIDI music. MIDI-LAB can be used virtually anywhere to write music for entertainment, teaching, computer games, and mobile phone ringtones.

Xi Zhou

What is connected

Connect this record

See the researcher in context

Building this map preview

9 published item(s)

AFFormer: Adaptive Feature Fusion Transformer for V2X Cooperative Perception under Channel Impairments

AI-based Medical e-Diagnosis for Fast and Automatic Ventricular Volume Measurement in the Patients with Normal Pressure Hydrocephalus

Accurate Temporal Action Proposal Generation with Relation-Aware Pyramid Network

DCMN+: Dual Co-Matching Network for Multi-choice Reading Comprehension

Receptive Multi-granularity Representation for Person Re-Identification

Semantics-aware BERT for Language Understanding

Superconductivity and Disorder Effect in TlNi$_2$Se$_{2-x}$S$_x$ crystals

Multiband Superconductivity of Heavy Electrons in TlNi2Se2 single crystal

MIDI-LAB, a Powerful Visual Basic Program for Creating MIDI Music