Source author record

Zhiyong Wang

Zhiyong Wang appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Catalog footprint

What is connected

30works

20topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

CCD-Level and Load-Aware Thread Orchestration for In-Memory Vector ANNS on Multi-Core CPUs

Vector approximate nearest neighbor search (ANNS) underpins search engines, recommendation systems, and advertising services. Recent advances in ANNS indexes make CPU a cost-effective choice for serving million-scale, in-memory vector search, yet per-core throughput remains constrained by memory access latency of vector reading and the compute intensity of distance evaluations in production deployments. With the growing scale of the business and advances in hardware, modern CCD-based multi-core CPUs have been widely deployed for high throughput in our services. However, we find that simply increasing core counts does not yield optimal performance scaling. To improve the efficiency of more cores from the CCD-based architecture, we analyze the distributions of real-world requests in our production environments. We observe high access locality in vector search in our online services and low cache utilization, resulting from overlooking the multi-chiplet nature of CCD based CPUs. Hence, we propose a workload- and hardware-aware thread orchestration framework at CCD-level that (i) provides a uniform interface for both inter-query parallel HNSW search and intra-query parallel IVF search, (ii) achieves cache-friendly and workload-adaptive mapping of task dispatching, and (iii) employs CCD-aware task stealing to address load imbalance. Applied to real production workloads from search, recommendation, and advertising services of Xiaohongshu (RedNote), our approach delivers up to 3.7x higher throughput and 30-90% reductions in P50 and P999 latency. In detail, compared with the original framework, the cache-miss ratio decreases by 6-30%, and the total CPU stall is reduced by 20-80%.

preprint2026arXiv

Detect All-Type Deepfake Audio: Wavelet Prompt Tuning for Enhanced Auditory Perception

The rapid advancement of audio generation technologies has escalated the risks of malicious deepfake audio across speech, sound, singing voice, and music, threatening multimedia security and trust. While existing countermeasures (CMs) perform well in single-type audio deepfake detection (ADD), their performance declines in cross-type scenarios. This paper is dedicated to studying the all-type ADD task. We are the first to comprehensively establish an all-type ADD benchmark to evaluate current CMs, incorporating cross-type deepfake detection across speech, sound, singing voice, and music. Then, we introduce the prompt tuning self-supervised learning (PT-SSL) training paradigm, which optimizes SSL front-end by learning specialized prompt tokens for ADD, requiring 458x fewer trainable parameters than fine-tuning (FT). Considering the auditory perception of different audio types, we propose the wavelet prompt tuning (WPT)-SSL method to capture type-invariant auditory deepfake information from the frequency domain without requiring additional training parameters, thereby enhancing performance over FT in the all-type ADD task. To achieve an universally CM, we utilize all types of deepfake audio for co-training. Experimental results demonstrate that WPT-XLSR-AASIST achieved the best performance, with an average EER of 3.58% across all evaluation sets.

preprint2026arXiv

FERA: Uncertainty-Aware Federated Reasoning for Large Language Models

Large language models (LLMs) exhibit strong reasoning capabilities when guided by high-quality demonstrations, yet such data is often distributed across organizations that cannot centralize it due to regulatory, proprietary, or institutional constraints. We study federated reasoning, where a server improves multi-step reasoning by coordinating with heterogeneous clients holding private demonstrations, without centralized training or raw data sharing. The key challenge is that client reliability is query-dependent, while the server cannot inspect client data to determine which contributions are trustworthy. To address this, we propose Uncertainty-Aware Federated Reasoning (FERA), a training-free framework based on iterative server-client co-refinement. Across communication rounds, clients generate reasoning traces with lightweight uncertainty estimates, and the server synthesizes them into improved reasoning that is redistributed as context for the next round, progressively improving both server outputs and client-side reasoning. Within each round, Uncertainty-Aware Self-Critique Aggregation (UA-SCA) resolves conflicts among heterogeneous client traces through query-dependent trust weighting and structured cross-client verification. Rather than simply discarding low-quality traces, UA-SCA revises flawed reasoning steps to recover useful information. We provide theoretical guarantees showing that the proposed iterative protocol converges and that uncertainty-aware weighting accelerates convergence. Experiments on multiple reasoning benchmarks show that FERA consistently outperforms both federated training and training-free baselines, achieving progressively higher accuracy across rounds while maintaining communication and computational efficiency.

preprint2026arXiv

McCast: Memory-Guided Latent Drift Correction for Long-Horizon Precipitation Nowcasting

Existing precipitation nowcasting methods typically adopt an autoregressive formulation, where future states are predicted from previous outputs. However, such an approach accumulates errors over long rollouts, causing forecasts to drift away from physically plausible evolution trajectories. Although various studies have attempted to alleviate this problem by improving step-wise prediction accuracy, they largely neglect the global temporal evolution of meteorological systems and lack mechanisms to actively correct drift during rollouts. To address this issue, we propose McCast, a memory-guided latent drift correction method for precipitation nowcasting. Rather than treating memory as an unordered dictionary of latent states for passive conditioning, McCast leverages temporally organized memory to actively correct autoregressive latent evolution. Specifically, McCast introduces a Drift-Corrective Memory Bank (DCBank) that explicitly estimates the temporally consistent drift corrections to calibrate the divergent trajectory. DCBank performs drift correction in two stages: a Corrective Latent Extractor first predicts an initial correction from the current prediction and a reference latent state, and a Correction-Aware Memory Retrieval module then refines the initial correction using temporally organized historical memory. By explicitly correcting latent evolution, instead of improving step-wise prediction accuracy only, McCast produces more temporally coherent and reliable long-horizon forecasts. Experiments on two widely used benchmarks, SEVIR and MeteoNet, show that McCast achieves state-of-the-art performance, particularly in challenging long-horizon forecasting scenarios.

preprint2026arXiv

Stable Attention Response for Reliable Precipitation Nowcasting

Precipitation nowcasting remains challenging due to the highly localized, rapidly evolving, and heterogeneous nature of atmospheric dynamics. Although recent methods increasingly adopt attention-based architectures in both unimodal and multimodal settings, they mainly emphasize stronger representation learning and prediction capacity, while paying less attention to the stability of attention responses across samples. In this work, we show that cross-sample instability of attention-response energy is an important and previously underexplored source of forecasting unreliability. Empirically, inaccurate forecasts are associated with larger attention-response energy variance across heads and layers. Theoretically, we show that cross-sample variability can propagate through self-attention, and enlarge a lower bound on prediction error. Based on this insight, we propose HARECast, a Head-wise Attention Response Energy-regulated framework for precipitation nowcasting. HARECast explicitly models head-wise attention-response energy and stabilizes it through a group-wise regularization objective that reduces cross-sample fluctuations. The proposed formulation is generic and applicable to both unimodal and multimodal nowcasting architectures. We instantiate HARECast in a standard forecasting pipeline with reconstruction branches and a diffusion-based predictor, and evaluate it on commonly used benchmarks--SEVIR and MeteoNet. Experimental results demonstrate that HARECast achieves state-of-the-art performance.

preprint2026arXiv

The RoboSense Challenge: Sense Anything, Navigate Anywhere, Adapt Across Platforms

Autonomous systems are increasingly deployed in open and dynamic environments -- from city streets to aerial and indoor spaces -- where perception models must remain reliable under sensor noise, environmental variation, and platform shifts. However, even state-of-the-art methods often degrade under unseen conditions, highlighting the need for robust and generalizable robot sensing. The RoboSense 2025 Challenge is designed to advance robustness and adaptability in robot perception across diverse sensing scenarios. It unifies five complementary research tracks spanning language-grounded decision making, socially compliant navigation, sensor configuration generalization, cross-view and cross-modal correspondence, and cross-platform 3D perception. Together, these tasks form a comprehensive benchmark for evaluating real-world sensing reliability under domain shifts, sensor failures, and platform discrepancies. RoboSense 2025 provides standardized datasets, baseline models, and unified evaluation protocols, enabling large-scale and reproducible comparison of robust perception methods. The challenge attracted 143 teams from 85 institutions across 16 countries, reflecting broad community engagement. By consolidating insights from 23 winning solutions, this report highlights emerging methodological trends, shared design principles, and open challenges across all tracks, marking a step toward building robots that can sense reliably, act robustly, and adapt across platforms in real-world environments.

preprint2025arXiv

GARDO: Reinforcing Diffusion Models without Reward Hacking

Fine-tuning diffusion models via online reinforcement learning (RL) has shown great potential for enhancing text-to-image alignment. However, since precisely specifying a ground-truth objective for visual tasks remains challenging, the models are often optimized using a proxy reward that only partially captures the true goal. This mismatch often leads to reward hacking, where proxy scores increase while real image quality deteriorates and generation diversity collapses. While common solutions add regularization against the reference policy to prevent reward hacking, they compromise sample efficiency and impede the exploration of novel, high-reward regions, as the reference policy is usually sub-optimal. To address the competing demands of sample efficiency, effective exploration, and mitigation of reward hacking, we propose Gated and Adaptive Regularization with Diversity-aware Optimization (GARDO), a versatile framework compatible with various RL algorithms. Our key insight is that regularization need not be applied universally; instead, it is highly effective to selectively penalize a subset of samples that exhibit high uncertainty. To address the exploration challenge, GARDO introduces an adaptive regularization mechanism wherein the reference model is periodically updated to match the capabilities of the online policy, ensuring a relevant regularization target. To address the mode collapse issue in RL, GARDO amplifies the rewards for high-quality samples that also exhibit high diversity, encouraging mode coverage without destabilizing the optimization process. Extensive experiments across diverse proxy rewards and hold-out unseen metrics consistently show that GARDO mitigates reward hacking and enhances generation diversity without sacrificing sample efficiency or exploration, highlighting its effectiveness and robustness.

preprint2024arXiv

Geometric topics related to Besov type spaces on the Grushin setting

The Grushin spaces, as one of the most important models in the Carnot-Carathéodory space, are a class of locally compact and geodesic metric spaces which admit a dilation. Function spaces on Grushin spaces and some related geometric problems are always the research hotspots in this field. Firstly, we investigate two classes of Besov type spaces based on the Grushin semigroup and the fractional Grushin semigroup, respectively, and prove some important properties of these two Besov type spaces. Moreover, we also reveal the relationship between them. Secondly, we establish the isoperimetric inequality for the fractional perimeter, which is defined by the Grushin-Laplace operator on Grushin spaces. Finally, we combine the semigroup theory with a nonlocal calculus for the Grushin-Laplace operator to obtain the Sobolev type inequality. As a corollary, we also obtain the embedding theorem for Besov type spaces.

preprint2023arXiv

Robust Knowledge Adaptation for Federated Unsupervised Person ReID

Person Re-identification (ReID) has been extensively studied in recent years due to the increasing demand in public security. However, collecting and dealing with sensitive personal data raises privacy concerns. Therefore, federated learning has been explored for Person ReID, which aims to share minimal sensitive data between different parties (clients). However, existing federated learning based person ReID methods generally rely on laborious and time-consuming data annotations and it is difficult to guarantee cross-domain consistency. Thus, in this work, a federated unsupervised cluster-contrastive (FedUCC) learning method is proposed for Person ReID. FedUCC introduces a three-stage modelling strategy following a coarse-to-fine manner. In detail, generic knowledge, specialized knowledge and patch knowledge are discovered using a deep neural network. This enables the sharing of mutual knowledge among clients while retaining local domain-specific knowledge based on the kinds of network layers and their parameters. Comprehensive experiments on 8 public benchmark datasets demonstrate the state-of-the-art performance of our proposed method.

preprint2023arXiv

XAI for In-hospital Mortality Prediction via Multimodal ICU Data

Predicting in-hospital mortality for intensive care unit (ICU) patients is key to final clinical outcomes. AI has shown advantaged accuracy but suffers from the lack of explainability. To address this issue, this paper proposes an eXplainable Multimodal Mortality Predictor (X-MMP) approaching an efficient, explainable AI solution for predicting in-hospital mortality via multimodal ICU data. We employ multimodal learning in our framework, which can receive heterogeneous inputs from clinical data and make decisions. Furthermore, we introduce an explainable method, namely Layer-Wise Propagation to Transformer, as a proper extension of the LRP method to Transformers, producing explanations over multimodal inputs and revealing the salient features attributed to prediction. Moreover, the contribution of each modality to clinical outcomes can be visualized, assisting clinicians in understanding the reasoning behind decision-making. We construct a multimodal dataset based on MIMIC-III and MIMIC-III Waveform Database Matched Subset. Comprehensive experiments on benchmark datasets demonstrate that our proposed framework can achieve reasonable interpretation with competitive prediction accuracy. In particular, our framework can be easily transferred to other clinical tasks, which facilitates the discovery of crucial factors in healthcare research.

preprint2022arXiv

1Cademy at Semeval-2022 Task 1: Investigating the Effectiveness of Multilingual, Multitask, and Language-Agnostic Tricks for the Reverse Dictionary Task

This paper describes our system for the SemEval2022 task of matching dictionary glosses to word embeddings. We focus on the Reverse Dictionary Track of the competition, which maps multilingual glosses to reconstructed vector representations. More specifically, models convert the input of sentences to three types of embeddings: SGNS, Char, and Electra. We propose several experiments for applying neural network cells, general multilingual and multitask structures, and language-agnostic tricks to the task. We also provide comparisons over different types of word embeddings and ablation studies to suggest helpful strategies. Our initial transformer-based model achieves relatively low performance. However, trials on different retokenization methodologies indicate improved performance. Our proposed Elmobased monolingual model achieves the highest outcome, and its multitask, and multilingual varieties show competitive results as well.

preprint2022arXiv

Deep Laparoscopic Stereo Matching with Transformers

The self-attention mechanism, successfully employed with the transformer structure is shown promise in many computer vision tasks including image recognition, and object detection. Despite the surge, the use of the transformer for the problem of stereo matching remains relatively unexplored. In this paper, we comprehensively investigate the use of the transformer for the problem of stereo matching, especially for laparoscopic videos, and propose a new hybrid deep stereo matching framework (HybridStereoNet) that combines the best of the CNN and the transformer in a unified design. To be specific, we investigate several ways to introduce transformers to volumetric stereo matching pipelines by analyzing the loss landscape of the designs and in-domain/cross-domain accuracy. Our analysis suggests that employing transformers for feature representation learning, while using CNNs for cost aggregation will lead to faster convergence, higher accuracy and better generalization than other options. Our extensive experiments on Sceneflow, SCARED2019 and dVPN datasets demonstrate the superior performance of our HybridStereoNet.

preprint2022arXiv

Federated Visualization: A Privacy-preserving Strategy for Aggregated Visual Query

We present a novel privacy preservation strategy for decentralized visualization. The key idea is to imitate the flowchart of the federated learning framework, and reformulate the visualization process within a federated infrastructure. The federation of visualization is fulfilled by leveraging a shared global module that composes the encrypted externalizations of transformed visual features of data pieces in local modules. We design two implementations of federated visualization: a prediction-based scheme, and a query-based scheme. We demonstrate the effectiveness of our approach with a set of visual forms, and verify its robustness with evaluations. We report the value of federated visualization in real scenarios with an expert review.

preprint2022arXiv

LiDARCap: Long-range Marker-less 3D Human Motion Capture with LiDAR Point Clouds

Existing motion capture datasets are largely short-range and cannot yet fit the need of long-range applications. We propose LiDARHuman26M, a new human motion capture dataset captured by LiDAR at a much longer range to overcome this limitation. Our dataset also includes the ground truth human motions acquired by the IMU system and the synchronous RGB images. We further present a strong baseline method, LiDARCap, for LiDAR point cloud human motion capture. Specifically, we first utilize PointNet++ to encode features of points and then employ the inverse kinematics solver and SMPL optimizer to regress the pose through aggregating the temporally encoded features hierarchically. Quantitative and qualitative experiments show that our method outperforms the techniques based only on RGB images. Ablation experiments demonstrate that our dataset is challenging and worthy of further research. Finally, the experiments on the KITTI Dataset and the Waymo Open Dataset show that our method can be generalized to different LiDAR sensor settings.

preprint2022arXiv

OTExtSum: Extractive Text Summarisation with Optimal Transport

Extractive text summarisation aims to select salient sentences from a document to form a short yet informative summary. While learning-based methods have achieved promising results, they have several limitations, such as dependence on expensive training and lack of interpretability. Therefore, in this paper, we propose a novel non-learning-based method by for the first time formulating text summarisation as an Optimal Transport (OT) problem, namely Optimal Transport Extractive Summariser (OTExtSum). Optimal sentence extraction is conceptualised as obtaining an optimal summary that minimises the transportation cost to a given document regarding their semantic distributions. Such a cost is defined by the Wasserstein distance and used to measure the summary's semantic coverage of the original document. Comprehensive experiments on four challenging and widely used datasets - MultiNews, PubMed, BillSum, and CNN/DM demonstrate that our proposed method outperforms the state-of-the-art non-learning-based methods and several recent learning-based methods in terms of the ROUGE metric.

preprint2022arXiv

Skin Lesion Recognition with Class-Hierarchy Regularized Hyperbolic Embeddings

In practice, many medical datasets have an underlying taxonomy defined over the disease label space. However, existing classification algorithms for medical diagnoses often assume semantically independent labels. In this study, we aim to leverage class hierarchy with deep learning algorithms for more accurate and reliable skin lesion recognition. We propose a hyperbolic network to learn image embeddings and class prototypes jointly. The hyperbola provably provides a space for modeling hierarchical relations better than Euclidean geometry. Meanwhile, we restrict the distribution of hyperbolic prototypes with a distance matrix that is encoded from the class hierarchy. Accordingly, the learned prototypes preserve the semantic class relations in the embedding space and we can predict the label of an image by assigning its feature to the nearest hyperbolic class prototype. We use an in-house skin lesion dataset which consists of around 230k dermoscopic images on 65 skin diseases to verify our method. Extensive experiments provide evidence that our model can achieve higher accuracy with less severe classification errors than models without considering class relations.

preprint2022arXiv

Towards Efficient Visual Simplification of Computational Graphs in Deep Neural Networks

A computational graph in a deep neural network (DNN) denotes a specific data flow diagram (DFD) composed of many tensors and operators. Existing toolkits for visualizing computational graphs are not applicable when the structure is highly complicated and large-scale (e.g., BERT [1]). To address this problem, we propose leveraging a suite of visual simplification techniques, including a cycle-removing method, a module-based edge-pruning algorithm, and an isomorphic subgraph stacking strategy. We design and implement an interactive visualization system that is suitable for computational graphs with up to 10 thousand elements. Experimental results and usage scenarios demonstrate that our tool reduces 60% elements on average and hence enhances the performance for recognizing and diagnosing DNN models. Our contributions are integrated into an open-source DNN visualization toolkit, namely, MindInsight [2].

preprint2020arXiv

Disentangling and Unifying Graph Convolutions for Skeleton-Based Action Recognition

Spatial-temporal graphs have been widely used by skeleton-based action recognition algorithms to model human action dynamics. To capture robust movement patterns from these graphs, long-range and multi-scale context aggregation and spatial-temporal dependency modeling are critical aspects of a powerful feature extractor. However, existing methods have limitations in achieving (1) unbiased long-range joint relationship modeling under multi-scale operators and (2) unobstructed cross-spacetime information flow for capturing complex spatial-temporal dependencies. In this work, we present (1) a simple method to disentangle multi-scale graph convolutions and (2) a unified spatial-temporal graph convolutional operator named G3D. The proposed multi-scale aggregation scheme disentangles the importance of nodes in different neighborhoods for effective long-range modeling. The proposed G3D module leverages dense cross-spacetime edges as skip connections for direct information propagation across the spatial-temporal graph. By coupling these proposals, we develop a powerful feature extractor named MS-G3D based on which our model outperforms previous state-of-the-art methods on three large-scale datasets: NTU RGB+D 60, NTU RGB+D 120, and Kinetics Skeleton 400.

preprint2016arXiv

Knowledge-based machine learning methods for macromolecular 3D structure prediction

Predicting the 3D structure of a macromolecule, such as a protein or an RNA molecule, is ranked top among the most difficult and attractive problems in bioinformatics and computational biology. Its importance comes from the relationship between the 3D structure and the function of a given protein or RNA. 3D structures also help to find the ligands of the protein, which are usually small molecules, a key step in drug design. Unfortunately, there is no shortcut to accurately obtain the 3D structure of a macromolecule. Many physical measurements of macromolecular 3D structures cannot scale up, due to their large labor costs and the requirements for lab conditions. In recent years, computational methods have made huge progress due to advance in computation speed and machine learning methods. These methods only need the sequence information to predict 3D structures by employing various mathematical models and machine learning methods. The success of computational methods is highly dependent on a large database of the proteins and RNA with known structures. However, the performance of computational methods are always expected to be improved. There are several reasons for this. First, we are facing, and will continue to face sparseness of data.Secondly, the 3D structure space is too large for our computational capability. The two obstacles can be removed by knowledge-based methods, which combine knowledge learned from the known structures and biologists' knowledge of the folding process of protein or RNA. In the dissertation, I will present my results in building a knowledge-based method by using machine learning methods to tackle this problem. My methods include the knowledge constraints on intermediate states, which can highly reduce the solution space of a protein or RNA, in turn increasing the efficiency of the structure folding method and improving its accuracy.

preprint2015arXiv

Life Span of Solutions for a Semilinear Heat Equation with Initial Data Non-Rarefied at $\infty$

We study the Cauchy problem for a semilinear heat equation with initial data non-rarefied at $\infty$. Our interest lies in the discussion of the effect of the non-rarefied factors on the life span of solutions, and some sharp estimates on the life span is established.

preprint2015arXiv

Protein Contact Prediction by Integrating Joint Evolutionary Coupling Analysis and Supervised Learning

Protein contacts contain important information for protein structure and functional study, but contact prediction from sequence remains very challenging. Both evolutionary coupling (EC) analysis and supervised machine learning methods are developed to predict contacts, making use of different types of information, respectively. This paper presents a group graphical lasso (GGL) method for contact prediction that integrates joint multi-family EC analysis and supervised learning. Different from existing single-family EC analysis that uses residue co-evolution information in only the target protein family, our joint EC analysis uses residue co-evolution in both the target family and its related families, which may have divergent sequences but similar folds. To implement joint EC analysis, we model a set of related protein families using Gaussian graphical models (GGM) and then co-estimate their precision matrices by maximum-likelihood, subject to the constraint that the precision matrices shall share similar residue co-evolution patterns. To further improve the accuracy of the estimated precision matrices, we employ a supervised learning method to predict contact probability from a variety of evolutionary and non-evolutionary information and then incorporate the predicted probability as prior into our GGL framework. Experiments show that our method can predict contacts much more accurately than existing methods, and that our method performs better on both conserved and family-specific contacts.

preprint2014arXiv

MRFalign: Protein Homology Detection through Alignment of Markov Random Fields

Sequence-based protein homology detection has been extensively studied and so far the most sensitive method is based upon comparison of protein sequence profiles, which are derived from multiple sequence alignment (MSA) of sequence homologs in a protein family. A sequence profile is usually represented as a position-specific scoring matrix (PSSM) or an HMM (Hidden Markov Model) and accordingly PSSM-PSSM or HMM-HMM comparison is used for homolog detection. This paper presents a new homology detection method MRFalign, consisting of three key components: 1) a Markov Random Fields (MRF) representation of a protein family; 2) a scoring function measuring similarity of two MRFs; and 3) an efficient ADMM (Alternating Direction Method of Multipliers) algorithm aligning two MRFs. Compared to HMM that can only model very short-range residue correlation, MRFs can model long-range residue interaction pattern and thus, encode information for the global 3D structure of a protein family. Consequently, MRF-MRF comparison for remote homology detection shall be much more sensitive than HMM-HMM or PSSM-PSSM comparison. Experiments confirm that MRFalign outperforms several popular HMM or PSSM-based methods in terms of both alignment accuracy and remote homology detection and that MRFalign works particularly well for mainly beta proteins. For example, tested on the benchmark SCOP40 (8353 proteins) for homology detection, PSSM-PSSM and HMM-HMM succeed on 48% and 52% of proteins, respectively, at superfamily level, and on 15% and 27% of proteins, respectively, at fold level. In contrast, MRFalign succeeds on 57.3% and 42.5% of proteins at superfamily and fold level, respectively. This study implies that long-range residue interaction patterns are very helpful for sequence-based homology detection. The software is available for download at http://raptorx.uchicago.edu/download/.

preprint2014arXiv

Proximity-induced ferromagnetism in graphene revealed by anomalous Hall effect

We demonstrate the anomalous Hall effect (AHE) in single-layer graphene exchange-coupled to an atomically flat yttrium iron garnet (YIG) ferromagnetic thin film. The anomalous Hall conductance has magnitude of ~0.09(2e2/h) at low temperatures and is measurable up to ~ 300 K. Our observations indicate not only proximity-induced ferromagnetism in graphene/YIG with large exchange interaction, but also enhanced spin-orbit coupling which is believed to be inherently weak in ideal graphene. The proximity-induced ferromagnetic order in graphene can lead to novel transport phenomena such as the quantized AHE which are potentially useful for spintronics.

preprint2013arXiv

77Se NMR Investigation of Fe-doped Bi2Se3

Bismuth selenide is both a thermoelectric material and topological insulator. Defects and dopants create conduction in thermoelectric applications. However, such defects may degrade the performance as a topological insulator (TI). Magnetic impurities such as iron open a band gap at the Dirac point on the surface. Since magnetically-doped TIs are important in technological applications, a good understanding of their properties is needed. In this article, 77Se nuclear magnetic resonance (NMR) spectroscopy has been used to investigate Fe-doped Bi2Se3. Spin-lattice relaxation measurements indicate that the Fe dopants provide a spin diffusion relaxation mechanism at low temperatures for the 77Se. Above 320 K, the predominant 77Se relaxation mechanism resulting from interaction with the conduction carriers is thermally induced with an activation energy of 21.5 kJ/mol (5.1 kcal/mol, 222 meV) and likely arises from inter-band excitations. Magic-angle spinning produces negligible narrowing of the 77Se resonance at 7 T, suggesting a statistical distribution of material defects and is also consistent with a dipolar interaction with the neighboring quadrupolar nucleus.

preprint2013arXiv

Monobit Digital Receivers for QPSK: Design, Analysis and Performance

Future communication system requires large bandwidth to achieve high data rate up to multigigabit/ sec, which makes analog-to-digital (ADC) become a key bottleneck for the implementation of digital receivers due to its high complexity and large power consumption. Therefore, monobit receivers for BPSK have been proposed to address this problem. In this work, QPSK modulation is considered for higher data rate. First, the optimal receiver based on monobit ADC with Nyquist sampling is derived, and its corresponding performance in the form of deflection ratio is calculated. Then a suboptimal but more practical monobit receiver is obtained, along with iterative demodulation and small sample removal. The effect of the imbalances between the In-phase (I) and Quadrature-phase (Q) branches, including the amplitude and phase imbalances, is carefully investigated too. To combat the performance loss caused by IQ imbalances, monobit receivers based on double training sequences are proposed. Numerical simulations show that the low-complexity suboptimal receiver suffers only 3dB signal to noise ratio (SNR) loss in AWGN channels and 1dB SNR loss in multipath static channels compared with the matched filter based monobit receiver with full channel state information (CSI). The impact of the phase difference between the transmitter and receiver is presented. It is observed that the performance degradation caused by the amplitude imbalance is negligible. Receivers based on double training sequences can efficiently compensate the performance loss in AWGN channel. Thanks to the diversity offered by the multipath, the effect of imbalances on monobit receivers in fading channels is slight. I

preprint2013arXiv

Predicting protein contact map using evolutionary and physical constraints by integer programming (extended version)

Motivation. Protein contact map describes the pairwise spatial and functional relationship of residues in a protein and contains key information for protein 3D structure prediction. Although studied extensively, it remains very challenging to predict contact map using only sequence information. Most existing methods predict the contact map matrix element-by-element, ignoring correlation among contacts and physical feasibility of the whole contact map. A couple of recent methods predict contact map based upon residue co-evolution, taking into consideration contact correlation and enforcing a sparsity restraint, but these methods require a very large number of sequence homologs for the protein under consideration and the resultant contact map may be still physically unfavorable. Results. This paper presents a novel method PhyCMAP for contact map prediction, integrating both evolutionary and physical restraints by machine learning and integer linear programming (ILP). The evolutionary restraints include sequence profile, residue co-evolution and context-specific statistical potential. The physical restraints specify more concrete relationship among contacts than the sparsity restraint. As such, our method greatly reduces the solution space of the contact map matrix and thus, significantly improves prediction accuracy. Experimental results confirm that PhyCMAP outperforms currently popular methods no matter how many sequence homologs are available for the protein under consideration. PhyCMAP can predict contacts within minutes after PSIBLAST search for sequence homologs is done, much faster than the two recent methods PSICOV and EvFold. See http://raptorx.uchicago.edu for the web server.

preprint2012arXiv

Field-effect mobility enhanced by tuning the Fermi level into the band gap of Bi2Se3

By eliminating normal fabrication processes, we preserve the bulk insulating state of calcium-doped Bi2Se3 single crystals in suspended nanodevices, as indicated by the activated temperature dependence of the resistivity at low temperatures. We perform low-energy electron beam irradiation (<16 keV) and electrostatic gating to control the carrier density and therefore the Fermi level position in the nanodevices. In slightly p-doped Bi2-xCaxSe3 devices, continuous tuning of the Fermi level from the bulk valence band to the band-gap reveals dramatic enhancement (> a factor of 10) in the field-effect mobility, which suggests suppressed backscattering expected for the Dirac fermion surface states in the gap of topological insulators.

preprint2012arXiv

Joint Viterbi Decoding and Decision Feedback Equalization for Monobit Digital Receivers

In ultra-wideband (UWB) communication systems with impulse radio (IR) modulation, the bandwidth is usually 1GHz or more. To process the received signal digitally, high sampling rate analog-digital-converters (ADC) are required. Due to the high complexity and large power consumption, monobit ADC is appropriate. The optimal monobit receiver has been derived. But it is not efficient to combat intersymbol interference (ISI). Decision feedback equalization (DFE) is an effect way dealing with ISI. In this paper, we proposed a algorithm that combines Viterbi decoding and DFE together for monobit receivers. In this way, we suppress the impact of ISI effectively, thus improving the bit error rate (BER) performance. By state expansion, we achieve better performance. The simulation results show that the algorithm has about 1dB SNR gain compared to separate demodulation and decoding method and 1dB loss compared to the BER performance in the channel without ISI. Compare to the full resolution detection in fading channel without ISI, it has 3dB SNR loss after state expansion.

preprint2010arXiv

Suspension and Measurement of Graphene and Bi2Se3 Atomic Membranes

Coupling high quality, suspended atomic membranes to specialized electrodes enables investigation of many novel phenomena, such as spin or Cooper pair transport in these two dimensional systems. However, many electrode materials are not stable in acids that are used to dissolve underlying substrates. Here we present a versatile and powerful multi-level lithographical technique to suspend atomic membranes, which can be applied to the vast majority of substrate, membrane and electrode materials. Using this technique, we fabricated suspended graphene devices with Al electrodes and mobility of 5500 cm^2/Vs. We also demonstrate, for the first time, fabrication and measurement of a free-standing thin Bi2Se3 membrane, which has low contact resistance to electrodes and a mobility of >~500 cm^2/Vs.

preprint2010arXiv

Tuning carrier type and density in Bi2Se3 by Ca-doping

The carrier type and density in Bi2Se3 single crystals are systematically tuned by introducing a calcium (Ca) dopant. A carrier density of ~1x1017 cm-3 which corresponds to ~25 meV in the Fermi energy is obtained in both n- and p-type materials. Electrical transport properties show that the insulating behavior is achieved in low carrier density crystals. In addition, both the band gap and reduced effective mass of carriers are determined.

Institution

Affiliation not imported yet

This author record came from a source that does not expose affiliation metadata. Once the author claims the profile or we enrich the record from another provider, this section will link to the concrete institution.

Source provenance

Where this author record came from

arxivconfidence 95%

external id: arxiv:2512.24138:author:5:zhiyong-wang

Imported May 21, 2026Synced May 21, 2026

arxivconfidence 95%

external id: arxiv:2605.10082:author:3:zhiyong-wang

Imported May 20, 2026Synced May 21, 2026

arxivconfidence 95%

external id: arxiv:2605.13197:author:7:zhiyong-wang

Imported May 20, 2026Synced May 21, 2026

arxivconfidence 95%

external id: arxiv:2605.13181:author:8:zhiyong-wang

Imported May 20, 2026Synced May 21, 2026

arxivconfidence 95%

external id: arxiv:2605.10090:author:7:zhiyong-wang

Imported May 20, 2026Synced May 20, 2026

5 works

Jing Shi

Researcher

Jing Shi contributes to research discovery and scholarly infrastructure.

Open to collaborate

3 works

Jinbo Xu

Researcher

Jinbo Xu contributes to research discovery and scholarly infrastructure.

Open to collaborate

2 works

Cheng Wang

Researcher

Cheng Wang contributes to research discovery and scholarly infrastructure.

Open to collaborate

2 works

Huarui Yin

Researcher

Huarui Yin contributes to research discovery and scholarly infrastructure.

Open to collaborate

Zhiyong Wang

What is connected

Connect this record

See the researcher in context

Building this map preview

30 published item(s)

CCD-Level and Load-Aware Thread Orchestration for In-Memory Vector ANNS on Multi-Core CPUs

Detect All-Type Deepfake Audio: Wavelet Prompt Tuning for Enhanced Auditory Perception

FERA: Uncertainty-Aware Federated Reasoning for Large Language Models

McCast: Memory-Guided Latent Drift Correction for Long-Horizon Precipitation Nowcasting

Stable Attention Response for Reliable Precipitation Nowcasting

The RoboSense Challenge: Sense Anything, Navigate Anywhere, Adapt Across Platforms

GARDO: Reinforcing Diffusion Models without Reward Hacking

Geometric topics related to Besov type spaces on the Grushin setting

Robust Knowledge Adaptation for Federated Unsupervised Person ReID

XAI for In-hospital Mortality Prediction via Multimodal ICU Data

1Cademy at Semeval-2022 Task 1: Investigating the Effectiveness of Multilingual, Multitask, and Language-Agnostic Tricks for the Reverse Dictionary Task

Deep Laparoscopic Stereo Matching with Transformers

Federated Visualization: A Privacy-preserving Strategy for Aggregated Visual Query

LiDARCap: Long-range Marker-less 3D Human Motion Capture with LiDAR Point Clouds

OTExtSum: Extractive Text Summarisation with Optimal Transport

Skin Lesion Recognition with Class-Hierarchy Regularized Hyperbolic Embeddings

Towards Efficient Visual Simplification of Computational Graphs in Deep Neural Networks

Disentangling and Unifying Graph Convolutions for Skeleton-Based Action Recognition

Knowledge-based machine learning methods for macromolecular 3D structure prediction

Life Span of Solutions for a Semilinear Heat Equation with Initial Data Non-Rarefied at $\infty$

Protein Contact Prediction by Integrating Joint Evolutionary Coupling Analysis and Supervised Learning

MRFalign: Protein Homology Detection through Alignment of Markov Random Fields

Proximity-induced ferromagnetism in graphene revealed by anomalous Hall effect

77Se NMR Investigation of Fe-doped Bi2Se3

Monobit Digital Receivers for QPSK: Design, Analysis and Performance

Predicting protein contact map using evolutionary and physical constraints by integer programming (extended version)

Field-effect mobility enhanced by tuning the Fermi level into the band gap of Bi2Se3

Joint Viterbi Decoding and Decision Feedback Equalization for Monobit Digital Receivers

Suspension and Measurement of Graphene and Bi2Se3 Atomic Membranes

Tuning carrier type and density in Bi2Se3 by Ca-doping