Researcher profile

Xin Hu

Xin Hu contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
13works
0followers
11topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

13 published item(s)

preprint2026arXiv

Dynamic Graph Neural Networks for Physiological Based Pharmacokinetic Modeling: A Novel Data Driven Approach to Drug Concentration Prediction

Physiologically Based Pharmacokinetic (PBPK) modeling is a key tool in drug development for predicting drug concentration dynamics across organs. Traditional PBPK approaches rely on ordinary differential equations with simplifying assumptions that limit their ability to capture nonlinear and system-level physiological interactions. In this work, we investigate data-driven PBPK modeling using deep learning. We implement two baseline architectures -- a multilayer perceptron (MLP) and a long short-term memory (LSTM) network -- and propose a Dynamic Graph Neural Network (Dynamic GNN) that explicitly models inter-organ interactions through recurrent message passing on a physiological graph. Experiments on a multi-organ pharmacokinetic dataset show that the Dynamic GNN achieves the lowest mean absolute percentage error (MAPE) of 15.7% among all models, demonstrating improved relative accuracy despite slightly higher absolute error compared to the MLP baseline. The model attains an R2 of 0.9342 with more stable error behavior and better captures inter-organ pharmacokinetic relationships. These results highlight the importance of structure-aware modeling for PBPK applications and demonstrate that the proposed Dynamic GNN offers a scalable, equation-free alternative for data-driven pharmacokinetic prediction.

preprint2025arXiv

PediaMind-R1: A Temperament-Aware Language Model for Personalized Early Childhood Care Reasoning via Cognitive Modeling and Preference Alignment

This paper presents PediaMind-R1, a domain-specialized large language model designed to achieve active personalization in intelligent parenting scenarios. Unlike conventional systems that provide generic suggestions, PediaMind-R1 draws on insights from developmental psychology. It introduces temperament theory from the Thomas-Chess framework and builds a temperament knowledge graph for infants and toddlers (0-3 years). Our two-stage training pipeline first uses supervised fine-tuning to teach structured chain-of-thought reasoning, and then applies a GRPO-based alignment stage to reinforce logical consistency, domain expertise, and empathetic caregiving strategies. We further design an evaluation framework comprising temperament-sensitive multiple-choice tests and human assessments. The results demonstrate that PediaMind-R1 can accurately interpret early childhood temperament profiles and proactively engage in individualized reasoning. This work highlights the value of integrating vertical-domain modeling with psychological theory. It offers a novel approach to developing user-centered LLMs that advance the practice of active personalization in sensitive caregiving contexts.

preprint2025arXiv

Quantum oscillations of valley current driven by microwave irradiation in transition-metal dichalcogenide/ferromagnet hybrids

We theoretically study spin and valley transport in a transition-metal dichalcogenide(TMDC)/ferromagnet heterostructure under a perpendicular magnetic field. We find that microwave-driven spin pumping induces a valley-selective spin excitation, a direct consequence of the valley-asymmetric Landau levels in the TMDC conduction band. This process generates a pure valley current which, as our central finding, exhibits pronounced quantum oscillations as a function of chemical potential. These oscillations provide a definitive experimental signature of the quantized valley states and establish another pathway to interface spintronics and valleytronics.

preprint2022arXiv

Contrastive Learning of Subject-Invariant EEG Representations for Cross-Subject Emotion Recognition

EEG signals have been reported to be informative and reliable for emotion recognition in recent years. However, the inter-subject variability of emotion-related EEG signals still poses a great challenge for the practical applications of EEG-based emotion recognition. Inspired by recent neuroscience studies on inter-subject correlation, we proposed a Contrastive Learning method for Inter-Subject Alignment (CLISA) to tackle the cross-subject emotion recognition problem. Contrastive learning was employed to minimize the inter-subject differences by maximizing the similarity in EEG signal representations across subjects when they received the same emotional stimuli in contrast to different ones. Specifically, a convolutional neural network was applied to learn inter-subject aligned spatiotemporal representations from EEG time series in contrastive learning. The aligned representations were subsequently used to extract differential entropy features for emotion classification. CLISA achieved state-of-the-art cross-subject emotion recognition performance on our THU-EP dataset with 80 subjects and the publicly available SEED dataset with 15 subjects. It could generalize to unseen subjects or unseen emotional stimuli in testing. Furthermore, the spatiotemporal representations learned by CLISA could provide insights into the neural mechanisms of human emotion processing.

preprint2022arXiv

Electrical manipulation of plasmon-phonon polaritons in heterostructures of graphene on biaxial crystals

Phonon polaritons in natural anisotropic crystals hold great promise for infrared nano-optics. However, the direct electrical control of these polaritons is difficult, preventing the development of active polaritonic devices. Here we propose the heterostructures of graphene on a biaxial crystal (α-phase molybdenum trioxide) slab and theoretically study the hybridized plasmon-phonon polaritons with dependence on the Fermi level of graphene from three aspects: dispersion relationships, iso-frequency contours, and the quantum spin Hall effects. We demonstrate the distinct wavelength tunability of the plasmon-phonon polaritons modes and the optical topologic transitions from open (hyperbolic) to closed (bow-tie-like) iso-frequency contours as the increase of the Fermi level of graphene. Furthermore, we observe the tunable quantum spin Hall effects of the plasmon-phonon polaritons, manifesting propagation direction switching by the Fermi level tuning of the graphene. Our findings open opportunities for novel electrically tunable polaritonic devices and programmable quantum optical networks.

preprint2022arXiv

GPTR: Gestalt-Perception Transformer for Diagram Object Detection

Diagram object detection is the key basis of practical applications such as textbook question answering. Because the diagram mainly consists of simple lines and color blocks, its visual features are sparser than those of natural images. In addition, diagrams usually express diverse knowledge, in which there are many low-frequency object categories in diagrams. These lead to the fact that traditional data-driven detection model is not suitable for diagrams. In this work, we propose a gestalt-perception transformer model for diagram object detection, which is based on an encoder-decoder architecture. Gestalt perception contains a series of laws to explain human perception, that the human visual system tends to perceive patches in an image that are similar, close or connected without abrupt directional changes as a perceptual whole object. Inspired by these thoughts, we build a gestalt-perception graph in transformer encoder, which is composed of diagram patches as nodes and the relationships between patches as edges. This graph aims to group these patches into objects via laws of similarity, proximity, and smoothness implied in these edges, so that the meaningful objects can be effectively detected. The experimental results demonstrate that the proposed GPTR achieves the best results in the diagram object detection task. Our model also obtains comparable results over the competitors in natural image object detection.

preprint2022arXiv

Hetero-interface of electrolyte/2D materials

Electrochemical gating has been demonstrated as a powerful tool to tune the physical properties of two-dimensional (2D) materials, leading to lots of fascinating quantum phenomena. However, the reported liquid-nature electrolytes (e.g, ionic liquid and ion-gel) cover the top surface of 2D materials, introduce the strain at the hetero-interface, and present sensitivity to humidity, which strongly limits the further exploration of the hetero-interface between electrolyte and 2D materials, and their wide applications for electronics and optoelectronics. Herein, by introducing a lithium-ion solid-state electrolyte, the character of the electric double layer (EDL) at hetero-interface and its effect on the optical property of transition metal chalcogenides (TMDs) have been revealed by Kelvin probe force microscopy (KPFM) and (time-resolved) photoluminescence measurements. The work function of TMDs can be strongly tailored by electrochemical gating, up to 0.7eV for WSe2 and 0.3 eV for MoS2, respectively. Besides, from the gate-dependent surface potential of TMDs with different thicknesses, the potential drop across the EDL has been quantitatively revealed. Furthermore, from the gate-dependent PL emission at room temperature, monolayer WS2 exhibits only neutral exciton emission in the whole range of gate voltage applied, which also exhibits exciton-exciton annihilation. Our results demonstrate that lithium-ion substrate is a promising alternative to explore the physics of 2D materials and the hetero-interface of electrolyte/2D materials by easily integrating both scanning probe and optical techniques.

preprint2021arXiv

A Feature Fusion-Net Using Deep Spatial Context Encoder and Nonstationary Joint Statistical Model for High Resolution SAR Image Classification

Convolutional neural networks (CNNs) have been applied to learn spatial features for high-resolution (HR) synthetic aperture radar (SAR) image classification. However, there has been little work on integrating the unique statistical distributions of SAR images which can reveal physical properties of terrain objects, into CNNs in a supervised feature learning framework. To address this problem, a novel end-to-end supervised classification method is proposed for HR SAR images by considering both spatial context and statistical features. First, to extract more effective spatial features from SAR images, a new deep spatial context encoder network (DSCEN) is proposed, which is a lightweight structure and can be effectively trained with a small number of samples. Meanwhile, to enhance the diversity of statistics, the nonstationary joint statistical model (NS-JSM) is adopted to form the global statistical features. Specifically, SAR images are transformed into the Gabor wavelet domain and the produced multi-subbands magnitudes and phases are modeled by the log-normal and uniform distribution. The covariance matrix is further utilized to capture the inter-scale and intra-scale nonstationary correlation between the statistical subbands and make the joint statistical features more compact and distinguishable. Considering complementary advantages, a feature fusion network (Fusion-Net) base on group compression and smooth normalization is constructed to embed the statistical features into the spatial features and optimize the fusion feature representation. As a result, our model can learn the discriminative features and improve the final classification performance. Experiments on four HR SAR images validate the superiority of the proposed method over other related algorithms.

preprint2020arXiv

A General Architecture for Behavior Modeling of Nonlinear Power Amplifier using Deep Convolutional Neural Network

Nonlinearity of power amplifier is one of the major limitations to the achievable capacity in wireless transmission systems. Nonlinear impairments are determined by the nonlinear distortions of the power amplifier and modulator imperfections. The Volterra model, several compact Volterra models and neural network models to establish a nonlinear model of power amplifier have all been demonstrated. However, the computational cost of these models increases and their implementation demands more signal processing resources as the signal bandwidth gets wider or the number of carrier aggregation. A completely different approach uses deep convolutional neural network to learn from the training data to figure out the nonlinear distortion. In this work, a low complexity, general architecture based on the deep real-valued convolutional neural network (DRVCNN) is proposed to build the nonlinear behavior of the power amplifier. With each of the multiple inputs equivalent to an input vector, the DRVCNN tensor weights are constructed from training data thanks to the current and historical envelope-dependent terms, I, and Q, which are components of the input. The effectiveness of the general framework in modeling single-carrier and multi-carrier power amplifiers is verified.

preprint2020arXiv

Convolutional Neural Network for Behavioral Modeling and Predistortion of Wideband Power Amplifiers

In this paper, we propose a novel behavior model for wideband PAs using a real-valued time-delay convolutional neural network (RVTDCNN). The input data of the model are sorted and arranged as the graph composed of the in-phase and quadrature (I/Q) components and envelope-dependent terms of current and past signals. We design a pre-designed filter using the convolutional layer to extract the basis functions required for the PA forward or reverse modeling. The generated rich basis functions are modeled using a simple fully connected layer. Because of the weight sharing characteristics of the convolutional structure, the strong memory effect does not lead to a obvious increase in the complexity of the model. Meanwhile, the extraction effect of the pre-designed filter also reduces the training complexity of the model. The experimental results show that the performance of the RVTDCNN model is almost the same as the NN models and the multilayer NN models.

preprint2020arXiv

Deep Reinforcement Learning (DRL): Another Perspective for Unsupervised Wireless Localization

Location is key to spatialize internet-of-things (IoT) data. However, it is challenging to use low-cost IoT devices for robust unsupervised localization (i.e., localization without training data that have known location labels). Thus, this paper proposes a deep reinforcement learning (DRL) based unsupervised wireless-localization method. The main contributions are as follows. (1) This paper proposes an approach to model a continuous wireless-localization process as a Markov decision process (MDP) and process it within a DRL framework. (2) To alleviate the challenge of obtaining rewards when using unlabeled data (e.g., daily-life crowdsourced data), this paper presents a reward-setting mechanism, which extracts robust landmark data from unlabeled wireless received signal strengths (RSS). (3) To ease requirements for model re-training when using DRL for localization, this paper uses RSS measurements together with agent location to construct DRL inputs. The proposed method was tested by using field testing data from multiple Bluetooth 5 smart ear tags in a pasture. Meanwhile, the experimental verification process reflected the advantages and challenges for using DRL in wireless localization.

preprint2020arXiv

Inertial Sensing Meets Artificial Intelligence: Opportunity or Challenge?

The inertial navigation system (INS) has been widely used to provide self-contained and continuous motion estimation in intelligent transportation systems. Recently, the emergence of chip-level inertial sensors has expanded the relevant applications from positioning, navigation, and mobile mapping to location-based services, unmanned systems, and transportation big data. Meanwhile, benefit from the emergence of big data and the improvement of algorithms and computing power, artificial intelligence (AI) has become a consensus tool that has been successfully applied in various fields. This article reviews the research on using AI technology to enhance inertial sensing from various aspects, including sensor design and selection, calibration and error modeling, navigation and motion-sensing algorithms, multi-sensor information fusion, system evaluation, and practical application. Based on the over 30 representative articles selected from the nearly 300 related publications, this article summarizes the state of the art, advantages, and challenges on each aspect. Finally, it summarizes nine advantages and nine challenges of AI-enhanced inertial sensing and then points out future research directions.

preprint2019arXiv

Mining Maximal Dynamic Spatial Co-Location Patterns

A spatial co-location pattern represents a subset of spatial features whose instances are prevalently located together in a geographic space. Although many algorithms of mining spatial co-location pattern have been proposed, there are still some problems: 1) they miss some meaningful patterns (e.g., {Ganoderma_lucidumnew, maple_treedead} and {water_hyacinthnew(increase), algaedead(decrease)}), and get the wrong conclusion that the instances of two or more features increase/decrease (i.e., new/dead) in the same/approximate proportion, which has no effect on prevalent patterns. 2) Since the number of prevalent spatial co-location patterns is very large, the efficiency of existing methods is very low to mine prevalent spatial co-location patterns. Therefore, first, we propose the concept of dynamic spatial co-location pattern that can reflect the dynamic relationships among spatial features. Second, we mine small number of prevalent maximal dynamic spatial co-location patterns which can derive all prevalent dynamic spatial co-location patterns, which can improve the efficiency of obtaining all prevalent dynamic spatial co-location patterns. Third, we propose an algorithm for mining prevalent maximal dynamic spatial co-location patterns and two pruning strategies. Finally, the effectiveness and efficiency of the method proposed as well as the pruning strategies are verified by extensive experiments over real/synthetic datasets.