Researcher profile

Xin Zhong

Xin Zhong contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
16works
0followers
10topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

16 published item(s)

preprint2026arXiv

Invariant Features in Language Models: Geometric Characterization and Model Attribution

Language models exhibit strong robustness to paraphrasing, suggesting that semantic information may be encoded through stable internal representations, yet the structure and origin of such invariance remain unclear. We propose a local geometric framework in which semantically equivalent inputs occupy structured regions in latent space, with paraphrastic variation along nuisance directions and semantic identity preserved in invariant subspaces. Building on this view, we make three contributions: (1) a geometric characterization of invariant latent features, (2) a contrastive subspace discovery method that separates semantic-changing from semantic-preserving variation, and (3) an application of invariant representations to zero-shot model attribution. Across models and layers, empirical results support these contributions. Invariant structure emerges in specific depth regions, semantic displacement lies largely outside the nuisance subspace, and representation-level interventions indicate a causal role of invariant components in model outputs. Invariant representations also capture model-specific geometric patterns, enabling accurate attribution. These findings suggest that semantic invariance can be viewed as a local geometric property of latent representations, offering a principled perspective on how language models organize meaning.

preprint2026arXiv

The Impact of Ionic Anharmonicity on Superconductivity in Metal-Stuffed B-C Clathrates

Metal-stuffed B$-$C compounds with sodalite clathrate structure have captured increasing attention due to their predicted exceptional superconductivity above liquid nitrogen temperature at ambient pressure. However, by neglecting the quantum lattice anharmonicity, the existing studies may result in an incomplete understanding of such a lightweight system. Here, using state-of-the-art ab initio methods incorporating quantum effects and machine learning potentials, we revisit the properties of a series of $XY$$\text{B}_{6}\text{C}_{6}$ clathrates where $X$ and $Y$ are metals. Our findings show that ionic quantum and anharmonic effects can harden the $E_g$ and $E_u$ vibrational modes, enabling the dynamical stability of 15 materials previously considered unstable in the harmonic approximation, including materials with previously unreported ($XY$)$^{1+}$ state, which is demonstrated here to be crucial to reach high critical temperatures. Further calculations based on the anisotropic Migdal-Eliashberg equation demonstrate that the $T_\text{c}$ values for KRb$\text{B}_{6}\text{C}_{6}$ and Rb$\text{B}_{3}\text{C}_{3}$ among these stabilized compounds are 102 and 115 K at 0 and 15 GPa, respectively, both being higher than $T_\text{c}$ of 92 K of KPb$\text{B}_{6}\text{C}_{6}$ at the anharmonic level. These record-high $T_\text{c}$ values, surpassing liquid nitrogen temperatures, emphasize the importance of anharmonic effects in stabilizing B-C clathrates with large electron-phonon coupling strength and advancing the search for high-$T_\text{c}$ superconductivity at (near) ambient pressure.

preprint2023arXiv

Padding Module: Learning the Padding in Deep Neural Networks

During the last decades, many studies have been dedicated to improving the performance of neural networks, for example, the network architectures, initialization, and activation. However, investigating the importance and effects of learnable padding methods in deep learning remains relatively open. To mitigate the gap, this paper proposes a novel trainable Padding Module that can be placed in a deep learning model. The Padding Module can optimize itself without requiring or influencing the model's entire loss function. To train itself, the Padding Module constructs a ground truth and a predictor from the inputs by leveraging the underlying structure in the input data for supervision. As a result, the Padding Module can learn automatically to pad pixels to the border of its input images or feature maps. The padding contents are realistic extensions to its input data and simultaneously facilitate the deep learning model's downstream task. Experiments have shown that the proposed Padding Module outperforms the state-of-the-art competitors and the baseline methods. For example, the Padding Module has 1.23% and 0.44% more classification accuracy than the zero padding when tested on the VGG16 and ResNet50.

preprint2022arXiv

An Improved Normed-Deformable Convolution for Crowd Counting

In recent years, crowd counting has become an important issue in computer vision. In most methods, the density maps are generated by convolving with a Gaussian kernel from the ground-truth dot maps which are marked around the center of human heads. Due to the fixed geometric structures in CNNs and indistinct head-scale information, the head features are obtained incompletely. Deformable convolution is proposed to exploit the scale-adaptive capabilities for CNN features in the heads. By learning the coordinate offsets of the sampling points, it is tractable to improve the ability to adjust the receptive field. However, the heads are not uniformly covered by the sampling points in the deformable convolution, resulting in loss of head information. To handle the non-uniformed sampling, an improved Normed-Deformable Convolution (\textit{i.e.,}NDConv) implemented by Normed-Deformable loss (\textit{i.e.,}NDloss) is proposed in this paper. The offsets of the sampling points which are constrained by NDloss tend to be more even. Then, the features in the heads are obtained more completely, leading to better performance. Especially, the proposed NDConv is a light-weight module which shares similar computation burden with Deformable Convolution. In the extensive experiments, our method outperforms state-of-the-art methods on ShanghaiTech A, ShanghaiTech B, UCF\_QNRF, and UCF\_CC\_50 dataset, achieving 61.4, 7.8, 91.2, and 167.2 MAE, respectively. The code is available at https://github.com/bingshuangzhuzi/NDConv

preprint2022arXiv

FOD-A: A Dataset for Foreign Object Debris in Airports

Foreign Object Debris (FOD) detection has attracted increased attention in the area of machine learning and computer vision. However, a robust and publicly available image dataset for FOD has not been initialized. To this end, this paper introduces an image dataset of FOD, named FOD in Airports (FOD-A). FOD-A object categories have been selected based on guidance from prior documentation and related research by the Federal Aviation Administration (FAA). In addition to the primary annotations of bounding boxes for object detection, FOD-A provides labeled environmental conditions. As such, each annotation instance is further categorized into three light level categories (bright, dim, and dark) and two weather categories (dry and wet). Currently, FOD-A has released 31 object categories and over 30,000 annotation instances. This paper presents the creation methodology, discusses the publicly available dataset extension process, and demonstrates the practicality of FOD-A with widely used machine learning models for object detection.

preprint2022arXiv

Global strong solution for 3D compressible heat-conducting magnetohydrodynamic equations revisited

We revisit the 3D Cauchy problem of compressible heat-conducting magnetohydrodynamic equations with vacuum as far field density. By delicate energy method, we derive global existence and uniqueness of strong solutions provided that $(\|ρ_0\|_{L^\infty}+1)\big[\|ρ_0\|_{L^3}+ \|ρ_0\|_{L^\infty}+1)^2\big(\|\sqrt{ρ_0}u_0\|_{L^2}^2 +\|b_0\|_{L^2}^2\big)\big]\big[\|\nabla u_0\|_{L^2}^2+(\|ρ_0\|_{L^\infty}+1)\big(\|\sqrt{ρ_0}E_0\|_{L^2}^2+\|\nabla b_0\|_{L^2}^2\big)\big]$ is properly small. In particular, the smallness condition is independent of any norms of the initial data. This work improves our previous results [18, 19].

preprint2022arXiv

Global well-posedness for three-dimensional compressible viscous micropolar and heat-conducting fluids with vacuum at infinity and large oscillations

We investigate global well-posedness to the Cauchy problem of three-dimensional compressible viscous and heat-conducting micropolar fluid equations with zero density at infinity. By delicate energy estimates, we establish global existence and uniqueness of strong solutions under some smallness condition depending only on the parameters appeared in the system and the initial mass. In particular, the initial mass can be arbitrarily large. This improves our previous work [23]. Moreover, we also generalize the result [13] to the case that vacuum is allowed at infinity.

preprint2022arXiv

Local well-posedness to the 2D Cauchy problem of full compressible magnetohydrodynamic equations with vacuum at infinity

This paper concerns the Cauchy problem of two-dimensional (2D) full compressible magnetohydrodynamic (MHD) equations in the whole plane $\mathbb{R}^2$ with zero density at infinity. By spatial weighted energy method, we derive the local existence and uniqueness of strong solutions provided that the initial density and the initial magnetic field decay not too slowly at infinity. Note that the initial temperature does not need to decay slowly at infinity. In particular, vacuum states at both the interior domain and the far field are allowed.

preprint2021arXiv

DeepGOMIMO: Deep Learning-Aided Generalized Optical MIMO with CSI-Free Blind Detection

Generalized optical multiple-input multiple-output (GOMIMO) techniques have been recently shown to be promising for high-speed optical wireless communication (OWC) systems. In this paper, we propose a novel deep learning-aided GOMIMO (DeepGOMIMO) framework for GOMIMO systems, where channel state information (CSI)-free blind detection can be enabled by employing a specially designed deep neural network (DNN)-based MIMO detector. The CSI-free blind DNN detector mainly consists of two modules: one is the pre-processing module which is designed to address both the path loss and channel crosstalk issues caused by MIMO transmission, and the other is the feed-forward DNN module which is used for joint detection of spatial and constellation information by learning the statistics of both the input signal and the additive noise. Our simulation results clearly verify that, in a typical indoor 4 $\times$ 4 MIMO-OWC system using both generalized optical spatial modulation (GOSM) and generalized optical spatial multiplexing (GOSMP) with unipolar non-zero 4-ary pulse amplitude modulation (4-PAM) modulation, the proposed CSI-free blind DNN detector achieves near the same bit error rate (BER) performance as the optimal joint maximum-likelihood (ML) detector, but with much reduced computational complexity. Moreover, since the CSI-free blind DNN detector does not require instantaneous channel estimation to obtain accurate CSI, it enjoys the unique advantages of improved achievable data rate and reduced communication time delay in comparison to the CSI-based zero-forcing DNN (ZF-DNN) detector.

preprint2021arXiv

Global well-posedness and exponential decay to the Cauchy problem of nonhomogeneous Navier-Stokes equations with density-dependent viscosity and vacuum in $\mathbb{R}^2$

We study global well-posedness of strong solutions for the nonhomogeneous Navier-Stokes equations with density-dependent viscosity and initial density allowing vanish in $\mathbb{R}^2$. Applying a logarithmic interpolation inequality and delicate energy estimates, we show the global existence of a unique strong solution provided that $\|\nablaμ(ρ_0)\|_{L^q}$ is suitably small, which improves the previous result of Huang and Wang [SIAM J. Math. Anal. 46, 1771--1788 (2014)] to the whole space case. Moreover, we also derive exponential decay rates of the solution. In particular, there is no need to require additional initial compatibility condition despite the presence of vacuum.

preprint2021arXiv

Global well-posedness to the 2D Cauchy problem of nonhomogeneous heat conducting Navier-Stokes and magnetohydrodynamic equations with vacuum at infinity

We revisit the 2D Cauchy problem of nonhomogeneous heat conducting magnetohydrodynamic (MHD) equations in $\mathbb{R}^2$. For the initial density allowing vacuum at infinity, we derive the global existence and uniqueness of strong solutions provided that the initial density and the initial magnetic decay not too slowly at infinity. In particular, the initial data can be arbitrarily large. This improves our previous work where the initial density has non-vacuum states at infinity. The result could also be viewed as an extension of the study in L{ü}-Xu-Zhong for the inhomogeneous case to the full inhomogeneous situation. The method is based on delicate spatial weighted estimates and the structural characteristic of the system under consideration. As a byproduct, we get the global existence of strong solutions to the 2D Cauchy problem for nonhomogeneous heat conducting Navier-Stokes equations with vacuum at infinity.

preprint2020arXiv

An Automated and Robust Image Watermarking Scheme Based on Deep Neural Networks

Digital image watermarking is the process of embedding and extracting a watermark covertly on a cover-image. To dynamically adapt image watermarking algorithms, deep learning-based image watermarking schemes have attracted increased attention during recent years. However, existing deep learning-based watermarking methods neither fully apply the fitting ability to learn and automate the embedding and extracting algorithms, nor achieve the properties of robustness and blindness simultaneously. In this paper, a robust and blind image watermarking scheme based on deep learning neural networks is proposed. To minimize the requirement of domain knowledge, the fitting ability of deep neural networks is exploited to learn and generalize an automated image watermarking algorithm. A deep learning architecture is specially designed for image watermarking tasks, which will be trained in an unsupervised manner to avoid human intervention and annotation. To facilitate flexible applications, the robustness of the proposed scheme is achieved without requiring any prior knowledge or adversarial examples of possible attacks. A challenging case of watermark extraction from phone camera-captured images demonstrates the robustness and practicality of the proposal. The experiments, evaluation, and application cases confirm the superiority of the proposed scheme.

preprint2020arXiv

Self-formed 2D/3D Heterostructure on the Edge of 2D Ruddlesden-Popper Hybrid Perovskites Responsible for Intriguing Optoelectronic Properties and Higher Cell Efficiency

The observation of low energy edge photoluminescence and its beneficial effect on the solar cell efficiency of Ruddlesden-Popper perovskites has unleashed an intensive research effort to reveal its origin. This effort, however, has been met with more challenges as the underlying material structure has still not been identified; new modellings and observations also do not seem to converge. Using 2D (BA)2(MA)2Pb3Br10 as an example, we show that 3D MAPbBr3 is formed due to the loss of BA on the edge. This self-formed MAPbBr3 can explain the reported edge emission under various conditions, while the reported intriguing optoelectronic properties such as fast exciton trapping from the interior 2D perovskite, rapid exciton dissociation and long carrier lifetime can be understood via the self-formed 2D/3D lateral perovskite heterostructure. The 3D perovskite is identified by submicron infrared spectroscopy, the emergence of XRD signature from freezer-milled nanometer-sized 2D perovskite and its photoluminescence response to external hydrostatic pressure. The revelation of this edge emission mystery and the identification of a self-formed 2D/3D heterostructure provide a new approach to engineering 2D perovskites for high-performance optoelectronic devices.

preprint2019arXiv

A Robust Image Watermarking System Based on Deep Neural Networks

Digital image watermarking is the process of embedding and extracting watermark covertly on a carrier image. Incorporating deep learning networks with image watermarking has attracted increasing attention during recent years. However, existing deep learning-based watermarking systems cannot achieve robustness, blindness, and automated embedding and extraction simultaneously. In this paper, a fully automated image watermarking system based on deep neural networks is proposed to generalize the image watermarking processes. An unsupervised deep learning structure and a novel loss computation are proposed to achieve high capacity and high robustness without any prior knowledge of possible attacks. Furthermore, a challenging application of watermark extraction from camera-captured images is provided to validate the practicality as well as the robustness of the proposed system. Experimental results show the superiority performance of the proposed system as comparing against several currently available techniques.

preprint2019arXiv

Automatic Image Pixel Clustering based on Mussels Wandering Optimiz

Image segmentation as a clustering problem is to identify pixel groups on an image without any preliminary labels available. It remains a challenge in machine vision because of the variations in size and shape of image segments. Furthermore, determining the segment number in an image is NP-hard without prior knowledge of the image content. This paper presents an automatic color image pixel clustering scheme based on mussels wandering optimization. By applying an activation variable to determine the number of clusters along with the cluster centers optimization, an image is segmented with minimal prior knowledge and human intervention. By revising the within- and between-class sum of squares ratio for random natural image contents, we provide a novel fitness function for image pixel clustering tasks. Comprehensive empirical studies of the proposed scheme against other state-of-the-art competitors on synthetic data and the ASD dataset have demonstrated the promising performance of the proposed scheme.

preprint2019arXiv

Global existence of weak solutions to the compressible quantum Navier-Stokes equations with degenerate viscosity

We study the compressible quantum Navier-Stokes (QNS) equations with degenerate viscosity in the three dimensional periodic domains. On the one hand, we consider QNS with additional damping terms. Motivated by the recent works [Li-Xin, arXiv:1504.06826] and [Antonelli-Spirito, Arch. Ration. Mech. Anal., 203(2012), 499--527], we construct a suitable approximate system which has smooth solutions satisfying the energy inequality and the BD entropy estimate. Using this system, we obtain the global existence of weak solutions to the compressible QNS equations with damping terms for large initial data. Moreover, we obtain some new a priori estimates, which can avoid using the assumption that the gradient of the velocity is a well-defined function, which is indeed used directly in [Vasseur-Yu, SIAM J. Math. Anal., 48 (2016), 1489--1511; Invent. Math., 206 (2016), 935--974]. On the other hand, in the absence of damping terms, we also prove the global existence of weak solutions to the compressible QNS equations without the lower bound assumption on the dispersive coefficient, which improves the previous result due to [Antonelli-Spirito, Arch. Ration. Mech. Anal., 203(2012), 499--527].