Source author record

Tong Chen

Tong Chen appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Catalog footprint

What is connected

37works

22topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

Efficient Content-based Recommendation Model Training via Noise-aware Coreset Selection

Content-based recommendation systems (CRSs) utilize content features to predict user-item interactions, serving as essential tools for helping users navigate information-rich web services. However, ensuring the effectiveness of CRSs requires large-scale and even continuous model training to accommodate diverse user preferences, resulting in significant computational costs and resource demands. A promising approach to this challenge is coreset selection, which identifies a small but representative subset of data samples that preserves model quality while reducing training overhead. Yet, the selected coreset is vulnerable to the pervasive noise in user-item interactions, particularly when it is minimally sized. To this end, we propose Noise-aware Coreset Selection (NaCS), a specialized framework for CRSs. NaCS constructs coresets through submodular optimization based on training gradients, while simultaneously correcting noisy labels using a progressively trained model. Meanwhile, we refine the selected coreset by filtering out low-confidence samples through uncertainty quantification, thereby avoid training with unreliable interactions. Through extensive experiments, we show that NaCS produces higher-quality coresets for CRSs while achieving better efficiency than existing coreset selection techniques. Notably, NaCS recovers 93-95\% of full-dataset training performance using merely 1\% of the training data. The source code is available at \href{https://github.com/chenxing1999/nacs}{https://github.com/chenxing1999/nacs}.

preprint2026arXiv

FINER-SQL: Boosting Small Language Models for Text-to-SQL

Large language models have driven major advances in Text-to-SQL generation. However, they suffer from high computational cost, long latency, and data privacy concerns, which make them impractical for many real-world applications. A natural alternative is to use small language models (SLMs), which enable efficient and private on-premise deployment. Yet, SLMs often struggle with weak reasoning and poor instruction following. Conventional reinforcement learning methods based on sparse binary rewards (0/1) provide little learning signal when the generated SQLs are incorrect, leading to unstable or collapsed training. To overcome these issues, we propose FINER-SQL, a scalable and reusable reinforcement learning framework that enhances SLMs through fine-grained execution feedback. Built on group relative policy optimization, FINER-SQL replaces sparse supervision with dense and interpretable rewards that offer continuous feedback even for incorrect SQLs. It introduces two key reward functions: a memory reward, which aligns reasoning with verified traces for semantic stability, and an atomic reward, which measures operation-level overlap to grant partial credit for structurally correct but incomplete SQLs. This approach transforms discrete correctness into continuous learning, enabling stable, critic-free optimization. Experiments on the BIRD and Spider benchmarks show that FINER-SQL achieves up to 67.73\% and 85\% execution accuracy with a 3B model -- matching much larger LLMs while reducing inference latency to 5.57~s/sample. These results highlight a cost-efficient and privacy-preserving path toward high-performance Text-to-SQL generation. Our code is available at https://github.com/thanhdath/finer-sql.

preprint2026arXiv

GRAFT: Graph-Tokenized LLMs for Tool Planning

Large language models (LLMs) are increasingly used to complete complex tasks by selecting and coordinating external tools across multiple steps. This requires aligning tool choices with subtask intent while satisfying directional execution dependencies among tools. To do this, existing methods model these dependencies as tool graphs and incorporate the graphs with LLMs through retrieval, serialization, or prompt-level injection. However, these external graph-use strategies all follow a matching paradigm, which often fails to align tool choices with the underlying subtask structure, producing semantically plausible plans that violate graph constraints. This issue is further exacerbated by error accumulation, where an early incorrect tool selection shifts the plan into an invalid graph state and causes subsequent predictions to drift away from the valid execution path. To address these challenges, we propose GRAFT, a graph-tokenized language model framework for dependency-aware tool planning. GRAFT internalizes the tool graph by mapping each tool node to a dedicated special token and learning directed tool dependencies within the representation space. It further introduces on-policy tool context distillation, training the model on its own sampled trajectories while distilling stepwise planning signals. Experiments show that GRAFT achieves state-of-the-art performance in exact sequence matching and dependency legality, supporting more reliable LLM tool planning in complex workflows.

preprint2026arXiv

Gyral-Sulcal-Net: An Integrated Network Representation of Brain Folding Patterns

Our brain functions as a complex communication network, and studying it from a network perspective offers valuable insights into its organizational principles and links to cognitive functions and brain disorders. However, most current network studies typically use brain regions as nodes, often overlooking the intricate folding patterns of finer-scale anatomical landmarks within these regions. In this study, we introduce a novel approach to integrate the brain's two primary folding patterns - gyri and sulci - into a unified network termed the Gyral-Sulcal-Net (GS-Net), in which three different types of finer-scale landmarks have been successfully identified. We evaluated the proposed GS-Net across multiple datasets, comprising over 1,600 brains, spanning different age groups (from 34 gestational weeks to elderly adults) and cohorts (healthy brains and those with pathological conditions). The experimental results demonstrate that the GS-Net can effectively integrate and represent diverse cortical folding patterns from a network perspective. More importantly, this approach offers a promising way for integrating different folding patterns into a unified anatomical brain network, alongside structural and functional networks, providing a comprehensive framework for studying brain networks.

preprint2026arXiv

ProMax: Exploring the Potential of LLM-derived Profiles with Distribution Shaping for Recommender Systems

The remarkable text understanding and generation capabilities of large language models (LLMs) have revitalized the field of general recommendation based on implicit user feedback. Rather than deploying LLMs directly as recommendation models, a more flexible paradigm leverages their ability to interpret users' historical interactions and semantic contexts to extract structured profiles that characterize user preferences. These profiles can be further transformed into actionable high-dimensional representations, serving as powerful signals to augment and strengthen recommendation models. However, the mechanism by which such profiles enhance recommendation performance within the feature space remains insufficiently understood. Moreover, existing studies predominantly rely on nonlinear alignment and fusion strategies to incorporate these profiles, which often lead to semantic loss and fail to fully exploit their potential. To address these limitations, we revisit profiles from a retrieval perspective and propose a simple yet effective recommendation framework built upon distribution shaping (ProMax) in this paper. We begin by employing dense retrieval to uncover the collaborative relationships between user and item profiles within the feature space. Based on this insight, we introduce a dual distribution-reshaping process, in which the profile distribution acts as a guiding signal to steer the recommendation model toward learning user preferences for unseen items beyond the scope of observed interactions. We apply ProMax to four classic recommendation methods on three public datasets. The results indicate that ProMax substantially improves base model performance and outperforms existing LLM-based recommendation approaches.

preprint2026arXiv

Reasoning Pattern Alignment Merging for Adaptive Reasoning

Recent large reasoning models (LRMs) have made substantial progress in complex reasoning tasks, yet they often generate lengthy reasoning paths for every query, incurring unnecessary computation and latency. Existing speed-up approaches typically rely on retraining the model or designing sophisticated prompting, which are either prohibitively expensive or highly sensitive to the input and prompt formulation. In this work, we study model merging as a lightweight alternative for efficient reasoning: by combining a long chain-of-thought (Long-CoT) reasoning model with a Short-CoT instruction model, we obtain an adaptive reasoner without training from scratch or requiring large-scale additional data. Building on this idea, we propose Reasoning Pattern Alignment Merging (RPAM), a layer-wise model merging framework based on feature alignment to facilitate query-adaptive reasoning. RPAM first constructs a small pattern-labeled calibration set that assigns each query an appropriate reasoning pattern. It then optimizes layer-wise merging coefficients by aligning the merged model's intermediate representations with those of the selected model, while a contrastive objective explicitly pushes them away from the non-selected model. Experiments on seven widely used reasoning benchmarks show that RPAM substantially reduces inference cost while maintaining strong performance. Upon article acceptance, we will provide open-source code to reproduce experiments for RPAM.

preprint2024arXiv

A Data-driven dE/dx Simulation with Normalizing Flow

In high-energy physics, precise measurements rely on highly reliable detector simulations. Traditionally, these simulations involve incorporating experiment data to model detector responses and fine-tuning them. However, due to the complexity of the experiment data, tuning the simulation can be challenging. One crucial aspect for charged particle identification is the measurement of energy deposition per unit length (referred to as dE/dx). This paper proposes a data-driven dE/dx simulation method using the Normalizing Flow technique, which can learn the dE/dx distribution directly from experiment data. By employing this method, not only can the need for manual tuning of the dE/dx simulation be eliminated, but also high-precision simulation can be achieved.

preprint2023arXiv

Evidence for gapless quantum spin liquid in a honeycomb lattice

One main theme in current condensed matter physics is the search of quantum spin liquid (QSL), an exotic magnetic state with strongly-fluctuating and highly-entangled spins down to zero temperature without static order. However, there is no consensus on the existence of a QSL ground state in any real material so far. The disorders and competing exchange interactions may prevent the formation of an ideal QSL state on frustrated spin lattices. Here we report systematic heat transport measurements on a honeycomb-lattice compound BaCo2(AsO4)2, which manifests magnetic order in zero field. In a narrow field range after the magnetic order is nearly suppressed by an in-plane field, in both perpendicular and parallel to the zigzag direction, a finite residual linear term of thermal conductivity is clearly observed, which is attributed to the mobile fractionalized spinon excitations. This provides smoking-gun evidence for a gapless QSL state in BaCo2(AsO4)2. We discuss the underlying physics to form this exotic gapless QSL state in Co2+ honeycomb lattice.

preprint2022arXiv

Are Graph Augmentations Necessary? Simple Graph Contrastive Learning for Recommendation

Contrastive learning (CL) recently has spurred a fruitful line of research in the field of recommendation, since its ability to extract self-supervised signals from the raw data is well-aligned with recommender systems' needs for tackling the data sparsity issue. A typical pipeline of CL-based recommendation models is first augmenting the user-item bipartite graph with structure perturbations, and then maximizing the node representation consistency between different graph augmentations. Although this paradigm turns out to be effective, what underlies the performance gains is still a mystery. In this paper, we first experimentally disclose that, in CL-based recommendation models, CL operates by learning more evenly distributed user/item representations that can implicitly mitigate the popularity bias. Meanwhile, we reveal that the graph augmentations, which were considered necessary, just play a trivial role. Based on this finding, we propose a simple CL method which discards the graph augmentations and instead adds uniform noises to the embedding space for creating contrastive views. A comprehensive experimental study on three benchmark datasets demonstrates that, though it appears strikingly simple, the proposed method can smoothly adjust the uniformity of learned representations and has distinct advantages over its graph augmentation-based counterparts in terms of recommendation accuracy and training efficiency. The code is released at https://github.com/Coder-Yu/QRec.

preprint2022arXiv

Choosing good subsamples for regression modelling

A common problem in health research is that we have a large database with many variables measured on a large number of individuals. We are interested in measuring additional variables on a subsample; these measurements may be newly available, or expensive, or simply not considered when the data were first collected. The intended use for the new measurements is to fit a regression model generalisable to the whole cohort (and to its source population). This is a two-phase sampling problem; it differs from some other two-phase sampling problems in the richness of the phase I data and in the goal of regression modelling. In particular, an important special case is measurement-error models, where a variable strongly correlated with the phase II measurements is available at phase I. We will explain how influence functions have been useful as a unifying concept for extending classical results to this setting, and describe the steps from designing for a simple weighted estimator at known parameter values through adaptive multiwave designs and the use of prior information. We will conclude with some comments on the information gap between design-based and model-based estimators in this setting.

preprint2022arXiv

Decentralized Collaborative Learning Framework for Next POI Recommendation

Next Point-of-Interest (POI) recommendation has become an indispensable functionality in Location-based Social Networks (LBSNs) due to its effectiveness in helping people decide the next POI to visit. However, accurate recommendation requires a vast amount of historical check-in data, thus threatening user privacy as the location-sensitive data needs to be handled by cloud servers. Although there have been several on-device frameworks for privacy-preserving POI recommendations, they are still resource-intensive when it comes to storage and computation, and show limited robustness to the high sparsity of user-POI interactions. On this basis, we propose a novel decentralized collaborative learning framework for POI recommendation (DCLR), which allows users to train their personalized models locally in a collaborative manner. DCLR significantly reduces the local models' dependence on the cloud for training, and can be used to expand arbitrary centralized recommendation models. To counteract the sparsity of on-device user data when learning each local model, we design two self-supervision signals to pretrain the POI representations on the server with geographical and categorical correlations of POIs. To facilitate collaborative learning, we innovatively propose to incorporate knowledge from either geographically or semantically similar users into each local model with attentive aggregation and mutual information maximization. The collaborative learning process makes use of communications between devices while requiring only minor engagement from the central server for identifying user groups, and is compatible with common privacy preservation mechanisms like differential privacy. We evaluate DCLR with two real-world datasets, where the results show that DCLR outperforms state-of-the-art on-device frameworks and yields competitive results compared with centralized counterparts.

preprint2022arXiv

DeHIN: A Decentralized Framework for Embedding Large-scale Heterogeneous Information Networks

Modeling heterogeneity by extraction and exploitation of high-order information from heterogeneous information networks (HINs) has been attracting immense research attention in recent times. Such heterogeneous network embedding (HNE) methods effectively harness the heterogeneity of small-scale HINs. However, in the real world, the size of HINs grow exponentially with the continuous introduction of new nodes and different types of links, making it a billion-scale network. Learning node embeddings on such HINs creates a performance bottleneck for existing HNE methods that are commonly centralized, i.e., complete data and the model are both on a single machine. To address large-scale HNE tasks with strong efficiency and effectiveness guarantee, we present \textit{Decentralized Embedding Framework for Heterogeneous Information Network} (DeHIN) in this paper. In DeHIN, we generate a distributed parallel pipeline that utilizes hypergraphs in order to infuse parallelization into the HNE task. DeHIN presents a context preserving partition mechanism that innovatively formulates a large HIN as a hypergraph, whose hyperedges connect semantically similar nodes. Our framework then adopts a decentralized strategy to efficiently partition HINs by adopting a tree-like pipeline. Then, each resulting subnetwork is assigned to a distributed worker, which employs the deep information maximization theorem to locally learn node embeddings from the partition it receives. We further devise a novel embedding alignment scheme to precisely project independently learned node embeddings from all subnetworks onto a common vector space, thus allowing for downstream tasks like link prediction and node classification.

preprint2022arXiv

Discovering Collaborative Signals for Next POI Recommendation with Iterative Seq2Graph Augmentation

Being an indispensable component in location-based social networks, next point-of-interest (POI) recommendation recommends users unexplored POIs based on their recent visiting histories. However, existing work mainly models check-in data as isolated POI sequences, neglecting the crucial collaborative signals from cross-sequence check-in information. Furthermore, the sparse POI-POI transitions restrict the ability of a model to learn effective sequential patterns for recommendation. In this paper, we propose Sequence-to-Graph (Seq2Graph) augmentation for each POI sequence, allowing collaborative signals to be propagated from correlated POIs belonging to other sequences. We then devise a novel Sequence-to-Graph POI Recommender (SGRec), which jointly learns POI embeddings and infers a user's temporal preferences from the graph-augmented POI sequence. To overcome the sparsity of POI-level interactions, we further infuse category-awareness into SGRec with a multi-task learning scheme that captures the denser category-wise transitions. As such, SGRec makes full use of the collaborative signals for learning expressive POI representations, and also comprehensively uncovers multi-level sequential patterns for user preference modelling. Extensive experiments on two real-world datasets demonstrate the superiority of SGRec against state-of-the-art methods in next POI recommendation.

preprint2022arXiv

Geometrical frustration versus Kitaev interactions in BaCo$_2$(AsO$_4$)$_2$

Recently, Co-based honeycomb magnets have been proposed as promising candidate materials to host the Kitaev spin liquid state. One of the front-runners is BaCo$_2$(AsO$_4$)$_2$ (BCAO), where it was suggested that the exchange processes between Co$^{2+}$ ions via the surrounding edge-sharing oxygen octahedra could give rise to bond-dependent Kitaev interactions. In this work, we present and analyze comprehensive inelastic neutron scattering studies of BCAO with fields in the honeycomb plane. Combining the constraints from the magnon excitations in the high-field polarized state and the inelastic spin structure factor measured in zero magnetic field, we examine two leading theoretical models: the Kitaev-type \JKG model and the \XXZ model. We show that the existing experimental data can be consistently accounted for by the \XXZ model but not by the \JKG model, and we discuss the implications of these results for the realization of a spin liquid phase in BCAO and more generally for the realization of the Kitaev model in cobaltates.

preprint2022arXiv

ReFRS: Resource-efficient Federated Recommender System for Dynamic and Diversified User Preferences

Owing to its nature of scalability and privacy by design, federated learning (FL) has received increasing interest in decentralized deep learning. FL has also facilitated recent research on upscaling and privatizing personalized recommendation services, using on-device data to learn recommender models locally. These models are then aggregated globally to obtain a more performant model, while maintaining data privacy. Typically, federated recommender systems (FRSs) do not consider the lack of resources and data availability at the end-devices. In addition, they assume that the interaction data between users and items is i.i.d. and stationary across end-devices, and that all local recommender models can be directly averaged without considering the user's behavioral diversity. However, in real scenarios, recommendations have to be made on end-devices with sparse interaction data and limited resources. Furthermore, users' preferences are heterogeneous and they frequently visit new items. This makes their personal preferences highly skewed, and the straightforwardly aggregated model is thus ill-posed for such non-i.i.d. data. In this paper, we propose Resource Efficient Federated Recommender System (ReFRS) to enable decentralized recommendation with dynamic and diversified user preferences. On the device side, ReFRS consists of a lightweight self-supervised local model built upon the variational autoencoder for learning a user's temporal preference from a sequence of interacted items. On the server side, ReFRS utilizes a semantic sampler to adaptively perform model aggregation within each identified user cluster. The clustering module operates in an asynchronous and dynamic manner to support efficient global model update and cope with shifting user interests. As a result, ReFRS achieves superior performance in terms of both accuracy and scalability, as demonstrated by comparative experiments.

preprint2022arXiv

Reinforcement Learning-enhanced Shared-account Cross-domain Sequential Recommendation

Shared-account Cross-domain Sequential Recommendation (SCSR) is an emerging yet challenging task that simultaneously considers the shared-account and cross-domain characteristics in the sequential recommendation. Existing works on SCSR are mainly based on Recurrent Neural Network (RNN) and Graph Neural Network (GNN) but they ignore the fact that although multiple users share a single account, it is mainly occupied by one user at a time. This observation motivates us to learn a more accurate user-specific account representation by attentively focusing on its recent behaviors. Furthermore, though existing works endow lower weights to irrelevant interactions, they may still dilute the domain information and impede the cross-domain recommendation. To address the above issues, we propose a reinforcement learning-based solution, namely RL-ISN, which consists of a basic cross-domain recommender and a reinforcement learning-based domain filter. Specifically, to model the account representation in the shared-account scenario, the basic recommender first clusters users' mixed behaviors as latent users, and then leverages an attention model over them to conduct user identification. To reduce the impact of irrelevant domain information, we formulate the domain filter as a hierarchical reinforcement learning task, where a high-level task is utilized to decide whether to revise the whole transferred sequence or not, and if it does, a low-level task is further performed to determine whether to remove each interaction within it or not. To evaluate the performance of our solution, we conduct extensive experiments on two real-world datasets, and the experimental results demonstrate the superiority of our RL-ISN method compared with the state-of-the-art recommendation methods.

preprint2022arXiv

Spatial-Temporal Meta-path Guided Explainable Crime Prediction

Exposure to crime and violence can harm individuals' quality of life and the economic growth of communities. In light of the rapid development in machine learning, there is a rise in the need to explore automated solutions to prevent crimes. With the increasing availability of both fine-grained urban and public service data, there is a recent surge in fusing such cross-domain information to facilitate crime prediction. By capturing the information about social structure, environment, and crime trends, existing machine learning predictive models have explored the dynamic crime patterns from different views. However, these approaches mostly convert such multi-source knowledge into implicit and latent representations (e.g., learned embeddings of districts), making it still a challenge to investigate the impacts of explicit factors for the occurrences of crimes behind the scenes. In this paper, we present a Spatial-Temporal Metapath guided Explainable Crime prediction (STMEC) framework to capture dynamic patterns of crime behaviours and explicitly characterize how the environmental and social factors mutually interact to produce the forecasts. Extensive experiments show the superiority of STMEC compared with other advanced spatiotemporal models, especially in predicting felonies (e.g., robberies and assaults with dangerous weapons).

preprint2022arXiv

Time Interval-enhanced Graph Neural Network for Shared-account Cross-domain Sequential Recommendation

Shared-account Cross-domain Sequential Recommendation (SCSR) task aims to recommend the next item via leveraging the mixed user behaviors in multiple domains. It is gaining immense research attention as more and more users tend to sign up on different platforms and share accounts with others to access domain-specific services. Existing works on SCSR mainly rely on mining sequential patterns via Recurrent Neural Network (RNN)-based models, which suffer from the following limitations: 1) RNN-based methods overwhelmingly target discovering sequential dependencies in single-user behaviors. They are not expressive enough to capture the relationships among multiple entities in SCSR. 2) All existing methods bridge two domains via knowledge transfer in the latent space, and ignore the explicit cross-domain graph structure. 3) None existing studies consider the time interval information among items, which is essential in the sequential recommendation for characterizing different items and learning discriminative representations for them. In this work, we propose a new graph-based solution, namely TiDA-GCN, to address the above challenges. Specifically, we first link users and items in each domain as a graph. Then, we devise a domain-aware graph convolution network to learn userspecific node representations. To fully account for users' domainspecific preferences on items, two effective attention mechanisms are further developed to selectively guide the message passing process. Moreover, to further enhance item- and account-level representation learning, we incorporate the time interval into the message passing, and design an account-aware self-attention module for learning items' interactive characteristics. Experiments demonstrate the superiority of our proposed method from various aspects.

preprint2021arXiv

A Sublevel Moment-SOS Hierarchy for Polynomial Optimization

We introduce a sublevel Moment-SOS hierarchy where each SDP relaxation can be viewed as an intermediate (or interpolation) between the d-th and (d+1)-th order SDP relaxations of the Moment-SOS hierarchy (dense or sparse version). With the flexible choice of determining the size (level) and number (depth) of subsets in the SDP relaxation, one is able to obtain different improvements compared to the d-th order relaxation, based on the machine memory capacity. In particular, we provide numerical experiments for d=1 and various types of problems both in combinatorial optimization (Max-Cut, Mixed Integer Programming) and deep learning (robustness certification, Lipschitz constant of neural networks), where the standard Lasserre's relaxation (or its sparse variant) is computationally intractable. In our numerical results, the lower bounds from the sublevel relaxations improve the bound from Shor's relaxation (first order Lasserre's relaxation) and are significantly closer to the optimal value or to the best-known lower/upper bounds.

preprint2021arXiv

FENet: A Frequency Extraction Network for Obstructive Sleep Apnea Detection

Obstructive Sleep Apnea (OSA) is a highly prevalent but inconspicuous disease that seriously jeopardizes the health of human beings. Polysomnography (PSG), the gold standard of detecting OSA, requires multiple specialized sensors for signal collection, hence patients have to physically visit hospitals and bear the costly treatment for a single detection. Recently, many single-sensor alternatives have been proposed to improve the cost efficiency and convenience. Among these methods, solutions based on RR-interval (i.e., the interval between two consecutive pulses) signals reach a satisfactory balance among comfort, portability and detection accuracy. In this paper, we advance RR-interval based OSA detection by considering its real-world practicality from energy perspectives. As photoplethysmogram (PPG) pulse sensors are commonly equipped on smart wrist-worn wearable devices (e.g., smart watches and wristbands), the energy efficiency of the detection model is crucial to fully support an overnight observation on patients. This creates challenges as the PPG sensors are unable to keep collecting continuous signals due to the limited battery capacity on smart wrist-worn devices. Therefore, we propose a novel Frequency Extraction Network (FENet), which can extract features from different frequency bands of the input RR-interval signals and generate continuous detection results with downsampled, discontinuous RR-interval signals. With the help of the one-to-multiple structure, FENet requires only one-third of the operation time of the PPG sensor, thus sharply cutting down the energy consumption and enabling overnight diagnosis. Experimental results on real OSA datasets reveal the state-of-the-art performance of FENet.

preprint2021arXiv

Graph Embedding for Recommendation against Attribute Inference Attacks

In recent years, recommender systems play a pivotal role in helping users identify the most suitable items that satisfy personal preferences. As user-item interactions can be naturally modelled as graph-structured data, variants of graph convolutional networks (GCNs) have become a well-established building block in the latest recommenders. Due to the wide utilization of sensitive user profile data, existing recommendation paradigms are likely to expose users to the threat of privacy breach, and GCN-based recommenders are no exception. Apart from the leakage of raw user data, the fragility of current recommenders under inference attacks offers malicious attackers a backdoor to estimate users' private attributes via their behavioral footprints and the recommendation results. However, little attention has been paid to developing recommender systems that can defend such attribute inference attacks, and existing works achieve attack resistance by either sacrificing considerable recommendation accuracy or only covering specific attack models or protected information. In our paper, we propose GERAI, a novel differentially private graph convolutional network to address such limitations. Specifically, in GERAI, we bind the information perturbation mechanism in differential privacy with the recommendation capability of graph convolutional networks. Furthermore, based on local differential privacy and functional mechanism, we innovatively devise a dual-stage encryption paradigm to simultaneously enforce privacy guarantee on users' sensitive features and the model optimization process. Extensive experiments show the superiority of GERAI in terms of its resistance to attribute inference attacks and recommendation effectiveness.

preprint2021arXiv

Knowledge Graph Completion with Text-aided Regularization

Knowledge Graph Completion is a task of expanding the knowledge graph/base through estimating possible entities, or proper nouns, that can be connected using a set of predefined relations, or verb/predicates describing interconnections of two things. Generally, we describe this problem as adding new edges to a current network of vertices and edges. Traditional approaches mainly focus on using the existing graphical information that is intrinsic of the graph and train the corresponding embeddings to describe the information; however, we think that the corpus that are related to the entities should also contain information that can positively influence the embeddings to better make predictions. In our project, we try numerous ways of using extracted or raw textual information to help existing KG embedding frameworks reach better prediction results, in the means of adding a similarity function to the regularization part in the loss function. Results have shown that we have made decent improvements over baseline KG embedding methods.

preprint2021arXiv

Non-Thermal Emergence of an Orbital-Selective Mott Phase in FeTe$_{1-x}$Se$_x$

Electronic correlation is of fundamental importance to high temperature superconductivity. Iron-based superconductors are believed to possess moderate correlation strength, which combined with their multi-orbital nature makes them a fascinating platform for the emergence of exotic phenomena. A particularly striking form is the emergence of an orbital selective Mott phase, where the localization of a subset of orbitals leads to a drastically reconstructed Fermi surface. Here, we report spectroscopic evidence of the reorganization of the Fermi surface from FeSe to FeTe as Se is substituted by Te. We uncover a particularly transparent way to visualize the localization of the $d_{xy}$ electron orbital through the suppression of its hybridization with the more coherent $d$ electron orbitals, which leads to a redistribution of the orbital-dependent spectral weight near the Fermi level. These noteworthy features of the Fermi surface are accompanied by a divergent behavior of a band renormalization in the $d_{xy}$ orbital. All of our observations are further supported by our theoretical calculations to be salient spectroscopic signatures of such a non-thermal evolution from a strongly correlated metallic phase towards an orbital-selective Mott phase in FeTe$_{1-x}$Se$_x$ as Se concentration is reduced.

preprint2021arXiv

Optimal sampling for design-based estimators of regression models

Two-phase designs measure variables of interest on a subcohort where the outcome and covariates are readily available or cheap to collect on all individuals in the cohort. Given limited resource availability, it is of interest to find an optimal design that includes more informative individuals in the final sample. We explore the optimal designs and efficiencies for analysis by design-based estimators. Generalized raking is an efficient design-based estimator that improves on the inverse-probability weighted (IPW) estimator by adjusting weights based on the auxiliary information. We derive a closed-form solution of the optimal design for estimating regression coefficients from generalized raking estimators. We compare it with the optimal design for analysis via the IPW estimator and other two-phase designs in measurement-error settings. We consider general two-phase designs where the outcome variable and variables of interest can be continuous or discrete. Our results show that the optimal designs for analysis by the two design-based estimators can be very different. The optimal design for IPW estimation is optimal for analysis via the IPW estimator and typically gives near-optimal efficiency for generalized raking, though we show there is potential improvement in some settings.

preprint2021arXiv

Passenger Mobility Prediction via Representation Learning for Dynamic Directed and Weighted Graph

In recent years, ride-hailing services have been increasingly prevalent as they provide huge convenience for passengers. As a fundamental problem, the timely prediction of passenger demands in different regions is vital for effective traffic flow control and route planning. As both spatial and temporal patterns are indispensable passenger demand prediction, relevant research has evolved from pure time series to graph-structured data for modeling historical passenger demand data, where a snapshot graph is constructed for each time slot by connecting region nodes via different relational edges (e.g., origin-destination relationship, geographical distance, etc.). Consequently, the spatiotemporal passenger demand records naturally carry dynamic patterns in the constructed graphs, where the edges also encode important information about the directions and volume (i.e., weights) of passenger demands between two connected regions. However, existing graph-based solutions fail to simultaneously consider those three crucial aspects of dynamic, directed, and weighted (DDW) graphs, leading to limited expressiveness when learning graph representations for passenger demand prediction. Therefore, we propose a novel spatiotemporal graph attention network, namely Gallat (Graph prediction with all attention) as a solution. In Gallat, by comprehensively incorporating those three intrinsic properties of DDW graphs, we build three attention layers to fully capture the spatiotemporal dependencies among different regions across all historical time slots. Moreover, the model employs a subtask to conduct pretraining so that it can obtain accurate results more quickly. We evaluate the proposed model on real-world datasets, and our experimental results demonstrate that Gallat outperforms the state-of-the-art approaches.

preprint2021arXiv

Spin-excitation anisotropy in the nematic state of detwinned FeSe

The origin of the electronic nematicity in FeSe is one of the most important unresolved puzzles in the study of iron-based superconductors. In both spin- and orbital-nematic models, the intrinsic magnetic excitations at $\mathbf{Q}_1=(1, 0)$ and $\mathbf{Q}_2=(0, 1)$ of twin-free FeSe are expected to provide decisive criteria for clarifying this issue. Although a spin-fluctuation anisotropy below 10 meV between $\mathbf{Q}_1$ and $\mathbf{Q}_2$ has been observed by inelastic neutron scattering around $T_c\sim 9$ K ($<<T_s\sim 90$ K), it remains unclear whether such an anisotropy also persists at higher energies and associates with the nematic transition $T_{\rm s}$. Here we use resonant inelastic x-ray scattering (RIXS) to probe the high-energy magnetic excitations of uniaxial-strain detwinned FeSe and {\BFA}. A prominent anisotropy between the magnetic excitations along the $H$ and $K$ directions is found to persist to $\sim200$ meV in FeSe, which is even more pronounced than the anisotropy of spin waves in {\BFA}. This anisotropy decreases gradually with increasing temperature and finally vanishes at a temperature around the nematic transition temperature $T_{\rm s}$. Our results reveal an unprecedented strong spin-excitation anisotropy with a large energy scale well above the $d_{xz}/d_{yz}$ orbital splitting, suggesting that the nematic phase transition is primarily spin-driven. Moreover, the measured high-energy spin excitations are dispersive and underdamped, which can be understood from a local-moment perspective. Our findings provide the much-needed understanding of the mechanism for the nematicity of FeSe and points to a unified description of the correlation physics across seemingly distinct classes of Fe-based superconductors.

preprint2021arXiv

Van Hove Singularity Arising from Mexican-Hat-Shaped Inverted Bands in the Topological Insulator Sn-doped Bi$_{1.1}$Sb$_{0.9}$Te$_{2}$S

The optical properties of Sn-doped Bi$_{1.1}$Sb$_{0.9}$Te$_{2}$S, the most bulk-insulating topological insulator thus far, have been examined at different temperatures over a broad frequency range. No Drude response is detected in the low-frequency range down to 30~cm$^{-1}$, corroborating the excellent bulk-insulating property of this material. Intriguingly, we observe a sharp peak at about 2\,200~cm$^{-1}$ in the optical conductivity at 5~K. Further quantitative analyses of the line shape and temperature dependence of this sharp peak, in combination with first-principles calculations, suggest that it corresponds to a van Hove singularity arising from Mexican-hat-shaped inverted bands. Such a van Hove singularity is a pivotal ingredient of various strongly correlated phases.

preprint2020arXiv

Anisotropic effect of a magnetic field on the neutron spin resonance in FeSe

We use inelastic neutron scattering to study the effect of a magnetic field on the neutron spin resonance (Er = 3.6 meV) of superconducting FeSe (Tc = 9 K). While a field aligned along the in-plane direction broadens and suppresses the resonance, a c-axis aligned field does so much more efficiently, consistent with the anisotropic field-induced suppression of the superfluid density from the heat capacity measurements. These results suggest that the resonance in FeSe is associated with the superconducting electrons arising from orbital selective quasi-particle excitations between the hole and electron Fermi surfaces.

preprint2020arXiv

Extreme Image Coding via Multiscale Autoencoders With Generative Adversarial Optimization

We propose a MultiScale AutoEncoder(MSAE) based extreme image compression framework to offer visually pleasing reconstruction at a very low bitrate. Our method leverages the "priors" at different resolution scale to improve the compression efficiency, and also employs the generative adversarial network(GAN) with multiscale discriminators to perform the end-to-end trainable rate-distortion optimization. We compare the perceptual quality of our reconstructions with traditional compression algorithms using High-Efficiency Video Coding(HEVC) based Intra Profile and JPEG2000 on the public Cityscapes and ADE20K datasets, demonstrating the significant subjective quality improvement.

preprint2020arXiv

GCN-Based User Representation Learning for Unifying Robust Recommendation and Fraudster Detection

In recent years, recommender system has become an indispensable function in all e-commerce platforms. The review rating data for a recommender system typically comes from open platforms, which may attract a group of malicious users to deliberately insert fake feedback in an attempt to bias the recommender system to their favour. The presence of such attacks may violate modeling assumptions that high-quality data is always available and these data truly reflect users' interests and preferences. Therefore, it is of great practical significance to construct a robust recommender system that is able to generate stable recommendations even in the presence of shilling attacks. In this paper, we propose GraphRfi - a GCN-based user representation learning framework to perform robust recommendation and fraudster detection in a unified way. In its end-to-end learning process, the probability of a user being identified as a fraudster in the fraudster detection component automatically determines the contribution of this user's rating data in the recommendation component; while the prediction error outputted in the recommendation component acts as an important feature in the fraudster detection component. Thus, these two components can mutually enhance each other. Extensive experiments have been conducted and the experimental results show the superiority of our GraphRfi in the two tasks - robust rating prediction and fraudster detection. Furthermore, the proposed GraphRfi is validated to be more robust to the various types of shilling attacks over the state-of-the-art recommender systems.

preprint2020arXiv

Multi-level Graph Convolutional Networks for Cross-platform Anchor Link Prediction

Cross-platform account matching plays a significant role in social network analytics, and is beneficial for a wide range of applications. However, existing methods either heavily rely on high-quality user generated content (including user profiles) or suffer from data insufficiency problem if only focusing on network topology, which brings researchers into an insoluble dilemma of model selection. In this paper, to address this problem, we propose a novel framework that considers multi-level graph convolutions on both local network structure and hypergraph structure in a unified manner. The proposed method overcomes data insufficiency problem of existing work and does not necessarily rely on user demographic information. Moreover, to adapt the proposed method to be capable of handling large-scale social networks, we propose a two-phase space reconciliation mechanism to align the embedding spaces in both network partitioning based parallel training and account matching across different social networks. Extensive experiments have been conducted on two large-scale real-life social networks. The experimental results demonstrate that the proposed method outperforms the state-of-the-art models with a big margin.

preprint2020arXiv

Pressure Engineering of the Dirac Fermions in Quasi-One-Dimensional Tl$_2$Mo$_6$Se$_6$

Topological band dispersions other than the standard Dirac or Weyl fermions have garnered the increasing interest in materials science. Among them, the cubic Dirac fermions were recently proposed in the family of quasi-one-dimensional conductors A$_2$Mo$_6$X$_6$ (A= Na, K, In, Tl; X= S, Se, Te), where the band crossing is characterized by a linear dispersion in one $k$-space direction but the cubic dispersion in the plane perpendicular to it. It is not yet clear, however, how the external perturbations can alter these nontrivial carriers and ultimately induce a new distinct quantum phase. Here we study the evolution of Dirac fermions, in particular the cubic Dirac crossing, under external pressure in the representative quasi-one-dimensional Tl$_2$Mo$_6$Se$_6$ via the first-principles calculations. Specifically, it is found that the topological properties, including the bulk Dirac crossings and the topological surface states, change progressively under pressure up to 50 GPa where it undergoes a structural transition from the hexagonal phase to body-centered tetragonal phase. Above 50 GPa, the system is more likely to be topologically trivial. Further, we also investigate its phonon spectra, which reveals a gradual depletion of the negative phonon modes with pressure, consistent with the more three-dimensional Fermi surface in the high-pressure phase. Our work may provide a useful guideline for further experimental search and the band engineering of the topologically nontrivial fermions in this intriguing state of matter.

preprint2020arXiv

Try This Instead: Personalized and Interpretable Substitute Recommendation

As a fundamental yet significant process in personalized recommendation, candidate generation and suggestion effectively help users spot the most suitable items for them. Consequently, identifying substitutable items that are interchangeable opens up new opportunities to refine the quality of generated candidates. When a user is browsing a specific type of product (e.g., a laptop) to buy, the accurate recommendation of substitutes (e.g., better equipped laptops) can offer the user more suitable options to choose from, thus substantially increasing the chance of a successful purchase. However, existing methods merely treat this problem as mining pairwise item relationships without the consideration of users' personal preferences. Moreover, the substitutable relationships are implicitly identified through the learned latent representations of items, leading to uninterpretable recommendation results. In this paper, we propose attribute-aware collaborative filtering (A2CF) to perform substitute recommendation by addressing issues from both personalization and interpretability perspectives. Instead of directly modelling user-item interactions, we extract explicit and polarized item attributes from user reviews with sentiment analysis, whereafter the representations of attributes, users, and items are simultaneously learned. Then, by treating attributes as the bridge between users and items, we can thoroughly model the user-item preferences (i.e., personalization) and item-item relationships (i.e., substitution) for recommendation. In addition, A2CF is capable of generating intuitive interpretations by analyzing which attributes a user currently cares the most and comparing the recommended substitutes with her/his currently browsed items at an attribute level. The recommendation effectiveness and interpretation quality of A2CF are demonstrated via extensive experiments on three real datasets.

preprint2016arXiv

A single scale retinex based method for palm vein extraction

Palm vein recognition is a novel biometric identification technology. But how to gain a better vein extraction result from the raw palm image is still a challenging problem, especially when the raw data collection has the problem of asymmetric illumination. This paper proposes a method based on single scale Retinex algorithm to extract palm vein image when strong shadow presents due to asymmetric illumination and uneven geometry of the palm. We test our method on a multispectral palm image. The experimental result shows that the proposed method is robust to the influence of illumination angle and shadow. Compared to the traditional extraction methods, the proposed method can obtain palm vein lines with better visualization performance (the contrast ratio increases by 18.4%, entropy increases by 1.07%, and definition increases by 18.8%).

preprint2012arXiv

Chip-based Brillouin lasers as spectral purifiers for photonic systems

High coherence lasers are essential in a wide range of applications, however, such performance is normally associated with large laser cavities, because increasing energy storage reduces quantum phase noise and also renders the laser frequency less sensitive to cavity vibration. This basic scaling rule is at odds with an emerging set of optical systems that place focus on compact (optimally integrable) sources of high coherence light. These include phase-coherent optical communication using quadrature-amplitude-modulation, and also record-low phase noise microwave sources based upon optical comb techniques. In this work, the first, chip-based Brillouin laser is demonstrated. It features high-efficiency and single-line operation with the smallest recorded Schawlow-Townes frequency noise for any chip-based laser. Because the frequency offset between the laser's emission and the input pump is relatively small, the device provides a new function: spectral purification of compact, low coherence sources such as semiconductor lasers.

preprint2012arXiv

Low-Pump-Power, Low-Phase-Noise, and Microwave to Millimeter-wave Repetition Rate Operation in Microcombs

Microresonator-based frequency combs (microcombs or Kerr-combs) can potentially miniaturize the numerous applications of conventional frequency combs. A priority is the realization of broad-band (ideally octave spanning) spectra at detectable repetition rates for comb self referencing. However, access to these rates involves pumping larger mode volumes and hence higher threshold powers. Moreover, threshold power sets both the scale for power per comb tooth and also the optical pump. Along these lines, it is shown that a class of resonators having surface-loss-limited Q factors can operate over a wide range of repetition rates with minimal variation in threshold power. A new, surface-loss-limited resonator illustrates the idea. Comb generation on mode spacings ranging from 2.6 GHz to 220 GHz with overall low threshold power (as low as 1 mW) is demonstrated. A record number of comb lines for a microcomb (around 1900) is also observed with pump power of 200 mW. The ability to engineer a wide range of repetition rates with these devices is also used to investigate a recently observed mechanism in microcombs associated with dispersion of subcomb offset frequencies. We observe high-coherence, phase-locking in cases where these offset frequencies are small enough so as to be tuned into coincidence. In these cases, a record-low microcomb phase noise is reported at a level comparable to an open-loop, high-performance microwave oscillator.

preprint2011arXiv

Ultra-high-Q wedge-resonator on a silicon chip

Ultra-high-Q optical resonators are being studied across a wide range of research subjects including quantum information, nonlinear optics, cavity optomechanics, and telecommunications. Here, we demonstrate a new, resonator on-a-chip with a record Q factor of 875 million, surpassing even microtoroids. Significantly, these devices avoid a highly specialized processing step that has made it difficult to integrate microtoroids with other photonic devices and to also precisely control their size. Thus, these devices not only set a new benchmark for Q factor on a chip, but also provide, for the first time, full compatibility of this important device class with conventional semiconductor processing. This feature will greatly expand the possible kinds of system on a chip functions enabled by ultra-high-Q devices.

Tong Chen

What is connected

Connect this record

See the researcher in context

Building this map preview

37 published item(s)

Efficient Content-based Recommendation Model Training via Noise-aware Coreset Selection

FINER-SQL: Boosting Small Language Models for Text-to-SQL

GRAFT: Graph-Tokenized LLMs for Tool Planning

Gyral-Sulcal-Net: An Integrated Network Representation of Brain Folding Patterns

ProMax: Exploring the Potential of LLM-derived Profiles with Distribution Shaping for Recommender Systems

Reasoning Pattern Alignment Merging for Adaptive Reasoning

A Data-driven dE/dx Simulation with Normalizing Flow

Evidence for gapless quantum spin liquid in a honeycomb lattice

Are Graph Augmentations Necessary? Simple Graph Contrastive Learning for Recommendation

Choosing good subsamples for regression modelling

Decentralized Collaborative Learning Framework for Next POI Recommendation

DeHIN: A Decentralized Framework for Embedding Large-scale Heterogeneous Information Networks

Discovering Collaborative Signals for Next POI Recommendation with Iterative Seq2Graph Augmentation

Geometrical frustration versus Kitaev interactions in BaCo$_2$(AsO$_4$)$_2$

ReFRS: Resource-efficient Federated Recommender System for Dynamic and Diversified User Preferences

Reinforcement Learning-enhanced Shared-account Cross-domain Sequential Recommendation

Spatial-Temporal Meta-path Guided Explainable Crime Prediction

Time Interval-enhanced Graph Neural Network for Shared-account Cross-domain Sequential Recommendation

A Sublevel Moment-SOS Hierarchy for Polynomial Optimization

FENet: A Frequency Extraction Network for Obstructive Sleep Apnea Detection

Graph Embedding for Recommendation against Attribute Inference Attacks

Knowledge Graph Completion with Text-aided Regularization

Non-Thermal Emergence of an Orbital-Selective Mott Phase in FeTe$_{1-x}$Se$_x$

Optimal sampling for design-based estimators of regression models

Passenger Mobility Prediction via Representation Learning for Dynamic Directed and Weighted Graph

Spin-excitation anisotropy in the nematic state of detwinned FeSe

Van Hove Singularity Arising from Mexican-Hat-Shaped Inverted Bands in the Topological Insulator Sn-doped Bi$_{1.1}$Sb$_{0.9}$Te$_{2}$S

Anisotropic effect of a magnetic field on the neutron spin resonance in FeSe

Extreme Image Coding via Multiscale Autoencoders With Generative Adversarial Optimization

GCN-Based User Representation Learning for Unifying Robust Recommendation and Fraudster Detection

Multi-level Graph Convolutional Networks for Cross-platform Anchor Link Prediction

Pressure Engineering of the Dirac Fermions in Quasi-One-Dimensional Tl$_2$Mo$_6$Se$_6$

Try This Instead: Personalized and Interpretable Substitute Recommendation

A single scale retinex based method for palm vein extraction

Chip-based Brillouin lasers as spectral purifiers for photonic systems

Low-Pump-Power, Low-Phase-Noise, and Microwave to Millimeter-wave Repetition Rate Operation in Microcombs

Ultra-high-Q wedge-resonator on a silicon chip