Researcher profile

Yi Cheng

Yi Cheng contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
7works
0followers
8topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

7 published item(s)

preprint2026arXiv

From Ising to Potts: Physics-inspired Potts machines of coupled oscillators for low-energy sampling and combinatorial optimization

The $q$-state Potts model is a fundamental model in statistical physics that generalizes the Ising model and plays a key role in the study of phase transitions, critical phenomena, complex systems, and combinatorial optimization. Sampling low-energy configurations of the $q$-state Potts model is essential to these studies, but it remains challenging. While physics-inspired dynamical sampling has been extensively explored for the Ising case ($q=2$) in the form of Ising machines, its generalization to general $q$-state Potts models remains largely unexplored. To fill this gap, we propose a class of physics-inspired dynamical samplers that directly target general $q$-state Potts models, which we refer to as the oscillator Potts machine (OPM). We show, through theoretical analysis and numerical experiments, that the OPM exhibits a systematic low-energy bias with respect to the underlying Potts energy landscape. Furthermore, we demonstrate, via phase perturbation analysis, that the OPM, as overdamped Langevin dynamics, can be realized with a network of self-sustaining oscillators, demonstrating that the OPM is naturally realizable in hardware using standard technology such as CMOS. We design a small-scale ring-oscillator circuit that implements a three-state OPM and validate its operation through transistor-level simulation. Leveraging the low-energy bias of the OPM for Potts models, we then apply it to large-scale max-$K$-cut problems by mapping these instances to $q$-state Potts Hamiltonians and compare its performance against established algorithms. Our results position the OPM as a promising, physically grounded dynamical system framework for multi-state sampling and combinatorial optimization.

preprint2022arXiv

CSRS: Code Search with Relevance Matching and Semantic Matching

Developers often search and reuse existing code snippets in the process of software development. Code search aims to retrieve relevant code snippets from a codebase according to natural language queries entered by the developer. Up to now, researchers have already proposed information retrieval (IR) based methods and deep learning (DL) based methods. The IR-based methods focus on keyword matching, that is to rank codes by relevance between queries and code snippets, while DL-based methods focus on capturing the semantic correlations. However, the existing methods do not consider capturing two matching signals simultaneously. Therefore, in this paper, we propose CSRS, a code search model with relevance matching and semantic matching. CSRS comprises (1) an embedding module containing convolution kernels of different sizes which can extract n-gram embeddings of queries and codes, (2) a relevance matching module that measures lexical matching signals, and (3) a co-attention based semantic matching module to capture the semantic correlation. We train and evaluate CSRS on a dataset with 18.22M and 10k code snippets. The experimental results demonstrate that CSRS achieves an MRR of 0.614, which outperforms two state-of-the-art models DeepCS and CARLCS-CNN by 33.77% and 18.53% respectively. In addition, we also conducted several experiments to prove the effectiveness of each component of CSRS.

preprint2022arXiv

MedDG: An Entity-Centric Medical Consultation Dataset for Entity-Aware Medical Dialogue Generation

Developing conversational agents to interact with patients and provide primary clinical advice has attracted increasing attention due to its huge application potential, especially in the time of COVID-19 Pandemic. However, the training of end-to-end neural-based medical dialogue system is restricted by an insufficient quantity of medical dialogue corpus. In this work, we make the first attempt to build and release a large-scale high-quality Medical Dialogue dataset related to 12 types of common Gastrointestinal diseases named MedDG, with more than 17K conversations collected from the online health consultation community. Five different categories of entities, including diseases, symptoms, attributes, tests, and medicines, are annotated in each conversation of MedDG as additional labels. To push forward the future research on building expert-sensitive medical dialogue system, we proposes two kinds of medical dialogue tasks based on MedDG dataset. One is the next entity prediction and the other is the doctor response generation. To acquire a clear comprehension on these two medical dialogue tasks, we implement several state-of-the-art benchmarks, as well as design two dialogue models with a further consideration on the predicted entities. Experimental results show that the pre-train language models and other baselines struggle on both tasks with poor performance in our dataset, and the response quality can be enhanced with the help of auxiliary entity information. From human evaluation, the simple retrieval model outperforms several state-of-the-art generative models, indicating that there still remains a large room for improvement on generating medically meaningful responses.

preprint2022arXiv

Research on Parallel SVM Algorithm Based on Cascade SVM

Cascade SVM (CSVM) can group datasets and train subsets in parallel, which greatly reduces the training time and memory consumption. However, the model accuracy obtained by using this method has some errors compared with direct training. In order to reduce the error, we analyze the causes of error in grouping training, and summarize the grouping without error under ideal conditions. A Balanced Cascade SVM (BCSVM) algorithm is proposed, which balances the sample proportion in the subset after grouping to ensure that the sample proportion in the subset is the same as the original dataset. At the same time, it proves that the accuracy of the model obtained by BCSVM algorithm is higher than that of CSVM. Finally, two common datasets are used for experimental verification, and the results show that the accuracy error obtained by using BCSVM algorithm is reduced from 1% of CSVM to 0.1%, which is reduced by an order of magnitude.

preprint2022arXiv

Team VI-I2R Technical Report on EPIC-KITCHENS-100 Unsupervised Domain Adaptation Challenge for Action Recognition 2021

In this report, we present the technical details of our approach to the EPIC-KITCHENS-100 Unsupervised Domain Adaptation (UDA) Challenge for Action Recognition. The EPIC-KITCHENS-100 dataset consists of daily kitchen activities focusing on the interaction between human hands and their surrounding objects. It is very challenging to accurately recognize these fine-grained activities, due to the presence of distracting objects and visually similar action classes, especially in the unlabelled target domain. Based on an existing method for video domain adaptation, i.e., TA3N, we propose to learn hand-centric features by leveraging the hand bounding box information for UDA on fine-grained action recognition. This helps reduce the distraction from background as well as facilitate the learning of domain-invariant features. To achieve high quality hand localization, we adopt an uncertainty-aware domain adaptation network, i.e., MEAA, to train a domain-adaptive hand detector, which only uses very limited hand bounding box annotations in the source domain but can generalize well to the unlabelled target domain. Our submission achieved the 1st place in terms of top-1 action recognition accuracy, using only RGB and optical flow modalities as input.

preprint2020arXiv

Reductions of the (4 + 1)-dimensional Fokas equation and their solutions

An integrable extension of the Kadomtsev-Petviashvili (KP) and Davey-Stewartson (DS) equations is investigated in this paper.We will refer to this integrable extension as the (4+1)-dimensional Fokas equation. The determinant expressions of soliton, breather, rational, and semi-rational solutions of the (4 + 1)-dimensional Fokas equation are constructed based on the Hirota's bilinear method and the KP hierarchy reduction method. The complex dynamics of these new exact solutions are shown in both three-dimensional plots and two-dimensional contour plots. Interestingly, the patterns of obtained high-order lumps are similar to those of rogue waves in the (1 + 1)-dimensions by choosing different values of the free parameters of the model. Furthermore, three kinds of new semi-rational solutions are presented and the classification of lump fission and fusion processes is also discussed. Additionally, we give a new way to obtain rational and semi-rational solutions of (3 + 1)-dimensional KP equation by reducing the solutions of the (4 + 1)-dimensional Fokas equation. All these results show that the (4 + 1)-dimensional Fokas equation is a meaningful multidimensional extension of the KP and DS equations. The obtained results might be useful in diverse fields such as hydrodynamics, non-linear optics and photonics, ion-acoustic waves in plasmas, matter waves in Bose-Einstein condensates, and sound waves in ferromagnetic media.

preprint2019arXiv

Two-dimensional rogue waves on zero background of the Davey-Stewartson II equation

A prototypical example of a rogue wave structure in a two-dimensional model is presented in the context of the Davey-Stewartson~II (DS~II) equation arising in water waves. The analytical methodology involves a Taylor expansion of an eigenfunctionof the model's Lax pair which is used to form a hierarchy of infinitely many new eigenfunctions. These are used for the construction of two-dimensional (2D) rogue waves (RWs) of the DS~II equation by the even-fold Darboux transformation (DT). The obtained 2D RWs, which are localized in both space and time, can be viewed as a 2D analogue of the Peregrine soliton and are thus natural candidates to describe oceanic RW phenomena,as well as ones in 2D fluid systems and water tanks.