Researcher profile

Xing Fan

Xing Fan contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
10works
0followers
10topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

10 published item(s)

preprint2026arXiv

Beyond Perfect APIs: A Comprehensive Evaluation of LLM Agents Under Real-World API Complexity

We introduce WildAGTEval, a benchmark designed to evaluate large language model (LLM) agents' function-calling capabilities under realistic API complexity. Unlike prior work that assumes an idealized API system and disregards real-world factors such as noisy API outputs, WildAGTEval accounts for two dimensions of real-world complexity: 1. API specification, which includes detailed documentation and usage constraints, and 2. API execution, which captures runtime challenges. Consequently, WildAGTEval offers (i) an API system encompassing 60 distinct complexity scenarios that can be composed into approximately 32K test configurations, and (ii) user-agent interactions for evaluating LLM agents on these scenarios. Using WildAGTEval, we systematically assess several advanced LLMs and observe that most scenarios are challenging, with irrelevant information complexity posing the greatest difficulty and reducing the performance of strong LLMs by 27.3%. Furthermore, our qualitative analysis reveals that LLMs occasionally distort user intent merely to claim task completion, critically affecting user satisfaction.

preprint2022arXiv

Robust Regularized Low-Rank Matrix Models for Regression and Classification

While matrix variate regression models have been studied in many existing works, classical statistical and computational methods for the analysis of the regression coefficient estimation are highly affected by high dimensional and noisy matrix-valued predictors. To address these issues, this paper proposes a framework of matrix variate regression models based on a rank constraint, vector regularization (e.g., sparsity), and a general loss function with three special cases considered: ordinary matrix regression, robust matrix regression, and matrix logistic regression. We also propose an alternating projected gradient descent algorithm. Based on analyzing our objective functions on manifolds with bounded curvature, we show that the algorithm is guaranteed to converge, all accumulation points of the iterates have estimation errors in the order of $O(1/\sqrt{n})$ asymptotically and substantially attaining the minimax rate. Our theoretical analysis can be applied to general optimization problems on manifolds with bounded curvature and can be considered an important technical contribution to this work. We validate the proposed method through simulation studies and real image data examples.

preprint2021arXiv

1550 nm compatible ultrafast photoconductive material based on a GaAs/ErAs/GaAs heterostructure

The sub-bandgap absorption and ultrafast relaxation in a GaAs/ErAs/GaAs heterostructure are reported. The infrared absorption and 1550 nm-excited ultrafast photo-response are studied by Fourier transform infrared (FTIR) spectrometry and time-domain pump-probe technique. The two absorption peaks located at 2.0 um (0.62 eV) and 2.7 um (0.45 eV) are originated from the ErAs/GaAs interfacial Schottky states and sub-bandgap transition within GaAs, respectively. The photo-induced carrier lifetime, excited using 1550 nm light, is measured to be as low as 190 fs for the GaAs/ErAs/GaAs heterostructure, making it a promising material for 1550-nm-technology-compatible, high critical-breakdown-field THz devices. The relaxation mechanism is proposed and the functionality of ErAs is revealed.

preprint2021arXiv

Driven One-Particle Quantum Cyclotron

A quantum cyclotron is one trapped electron or positron that occupies only its lowest cyclotron and spin states. A master equation is solved for a driven quantum cyclotron with a QND (quantum nondemolition) coupling to a detection oscillator in thermal equilibrium - the first quantum calculation for this coupled and open system. The predicted rate of cyclotron and spin quantum jumps as a function of drive frequency, for a small coupling between the detection motion and its thermal reservoir, differs sharply from what has been predicted and used for past measurements. The calculation suggests a ten times more precise electron magnetic moment measurement is possible, as needed to investigate current differences between the most precise prediction of the standard model of particle physics, and the most accurate measurement of a property of an elementary particle.

preprint2020arXiv

Circumventing Detector Backaction on a Quantum Cyclotron

Detector backaction can be completely evaded when the state of a one-electron quantum cyclotron is detected, but it nonetheless significantly broadens the quantum-jump resonance lineshapes from which the cyclotron frequency can be deduced. This limits the accuracy with which the electron magnetic moment can be determined to test the standard model's most precise prediction. A steady state solution to a master equation, the first quantum calculation for the open quantum cyclotron system, illustrates a method to circumvent the detection backaction upon the measured frequency.

preprint2020arXiv

Cross-Spectrum Dual-Subspace Pairing for RGB-infrared Cross-Modality Person Re-Identification

Due to its potential wide applications in video surveillance and other computer vision tasks like tracking, person re-identification (ReID) has become popular and been widely investigated. However, conventional person re-identification can only handle RGB color images, which will fail at dark conditions. Thus RGB-infrared ReID (also known as Infrared-Visible ReID or Visible-Thermal ReID) is proposed. Apart from appearance discrepancy in traditional ReID caused by illumination, pose variations and viewpoint changes, modality discrepancy produced by cameras of the different spectrum also exists, which makes RGB-infrared ReID more difficult. To address this problem, we focus on extracting the shared cross-spectrum features of different modalities. In this paper, a novel multi-spectrum image generation method is proposed and the generated samples are utilized to help the network to find discriminative information for re-identifying the same person across modalities. Another challenge of RGB-infrared ReID is that the intra-person (images from the same person) discrepancy is often larger than the inter-person (images from different persons) discrepancy, so a dual-subspace pairing strategy is proposed to alleviate this problem. Combining those two parts together, we also design a one-stream neural network combining the aforementioned methods to extract compact representations of person images, called Cross-spectrum Dual-subspace Pairing (CDP) model. Furthermore, during the training process, we also propose a Dynamic Hard Spectrum Mining method to automatically mine more hard samples from hard spectrum based on the current model state to further boost the performance. Extensive experimental results on two public datasets, SYSU-MM01 with RGB + near-infrared images and RegDB with RGB + far-infrared images, have demonstrated the efficiency and generality of our proposed method.

preprint2020arXiv

Knowledge Distillation from Internal Representations

Knowledge distillation is typically conducted by training a small model (the student) to mimic a large and cumbersome model (the teacher). The idea is to compress the knowledge from the teacher by using its output probabilities as soft-labels to optimize the student. However, when the teacher is considerably large, there is no guarantee that the internal knowledge of the teacher will be transferred into the student; even if the student closely matches the soft-labels, its internal representations may be considerably different. This internal mismatch can undermine the generalization capabilities originally intended to be transferred from the teacher to the student. In this paper, we propose to distill the internal representations of a large model such as BERT into a simplified version of it. We formulate two ways to distill such representations and various algorithms to conduct the distillation. We experiment with datasets from the GLUE benchmark and consistently show that adding knowledge distillation from internal representations is a more powerful method than only using soft-label distillation.

preprint2020arXiv

Modern cities emerge as 'super-cells' where enclosed industrial systems are hotspots of goods and services

Prevailing hypotheses recognize cities as 'super-organisms' which both provides organizing principles for cities and fills the scalar gap in the hierarchical living system between ecosystems and the entire planet. However, most analogies between the traits of organisms and cities are inappropriate making the super-organism model impractical as a means to acquire new knowledge. Using a cluster analysis of 15 traits of cities and other living systems, we found that modern cities are more similar to eukaryotic cells than to multicellular organisms. Enclosed industrial systems, such as factories and greenhouses, dominate modern cities and are analogous to organelles as hotspots that provide high-flux goods and services. Therefore, we propose a 'super-cell city model' as more appropriate than the super-organism model. In addition to the theoretical significance, our model also recognizes enclosed industrial systems as functional components that improve the vitality and sustainability of cities.

preprint2020arXiv

Pre-Training for Query Rewriting in A Spoken Language Understanding System

Query rewriting (QR) is an increasingly important technique to reduce customer friction caused by errors in a spoken language understanding pipeline, where the errors originate from various sources such as speech recognition errors, language understanding errors or entity resolution errors. In this work, we first propose a neural-retrieval based approach for query rewriting. Then, inspired by the wide success of pre-trained contextual language embeddings, and also as a way to compensate for insufficient QR training data, we propose a language-modeling (LM) based approach to pre-train query embeddings on historical user conversation data with a voice assistant. In addition, we propose to use the NLU hypotheses generated by the language understanding system to augment the pre-training. Our experiments show pre-training provides rich prior information and help the QR task achieve strong performance. We also show joint pre-training with NLU hypotheses has further benefit. Finally, after pre-training, we find a small set of rewrite pairs is enough to fine-tune the QR model to outperform a strong baseline by full training on all QR training data.

preprint2020arXiv

STNReID : Deep Convolutional Networks with Pairwise Spatial Transformer Networks for Partial Person Re-identification

Partial person re-identification (ReID) is a challenging task because only partial information of person images is available for matching target persons. Few studies, especially on deep learning, have focused on matching partial person images with holistic person images. This study presents a novel deep partial ReID framework based on pairwise spatial transformer networks (STNReID), which can be trained on existing holistic person datasets. STNReID includes a spatial transformer network (STN) module and a ReID module. The STN module samples an affined image (a semantically corresponding patch) from the holistic image to match the partial image. The ReID module extracts the features of the holistic, partial, and affined images. Competition (or confrontation) is observed between the STN module and the ReID module, and two-stage training is applied to acquire a strong STNReID for partial ReID. Experimental results show that our STNReID obtains 66.7% and 54.6% rank-1 accuracies on partial ReID and partial iLIDS datasets, respectively. These values are at par with those obtained with state-of-the-art methods.