Source author record

Yifan Gao

Yifan Gao appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Computation and Language Artificial Intelligence Computer Vision cond-mat.mtrl-sci Machine Learning cond-mat.mes-hall cond-mat.str-el cond-mat.supr-con eess.IV math.PR Other Computer Science physics.app-ph

Catalog footprint

What is connected

9works

12topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

DecomPose: Disentangling Cross-Category Optimization Contention for Category-Level 6D Object Pose Estimation

Category-level 6D object pose estimation is typically formulated as a multi-category joint learning problem with fully shared model parameters. However, pronounced geometric heterogeneity across categories entangles incompatible optimization signals in shared modules, resulting in gradient conflicts and negative transfer during training. To address this challenge, we first introduce gradient-based diagnostics to quantify module-level cross-category contention. Building on results of diagnostics, we propose DecomPose, a difficulty-aware decomposition framework that mitigates optimization contention via: (1) difficulty-aware gradient decoupling, which groups categories using a data-driven difficulty proxy and routes each instance to a group-specific correspondence branch to isolate incompatible updates; and (2) stability-driven asymmetric branching, which assigns higher-capacity branches to structurally simple categories as stable optimization anchors while constraining complex categories with lightweight branches to suppress noisy updates and alleviate negative transfer. Extensive experiments on REAL275, CAMERA25, and HouseCat6D demonstrate that DecomPose effectively reduces cross-category optimization contention and delivers superior pose estimation performance across multiple benchmarks.

preprint2026arXiv

Electric field switching of altermagnetic spin-splitting in multiferroic skyrmions

Magnetic skyrmions are localized magnetic structures that retain their shape and stability over time, thanks to their topological nature. Recent theoretical and experimental progress has laid the groundwork for understanding magnetic skyrmions characterized by negligible net magnetization and ultrafast dynamics. Notably, skyrmions emerging in materials with altermagnetism, a novel magnetic phase featuring lifted Kramers degeneracy-have remained unreported until now. In this study, we demonstrate that BiFeO3, a multiferroic renowned for its strong coupling between ferroelectricity and magnetism, can transit from a spin cycloid to a Neel-type skyrmion under antidamping spin-orbit torque at room temperature. Strikingly, the altermagnetic spin splitting within BiFeO3 skyrmion can be reversed through the application of an electric field, revealed via the Circular photogalvanic effect. This quasiparticle, which possesses a neutral topological charge, holds substantial promise for diverse applications-most notably, enabling the development of unconventional computing systems with low power consumption and magnetoelectric controllability.

preprint2022arXiv

Improving Task Generalization via Unified Schema Prompt

Task generalization has been a long standing challenge in Natural Language Processing (NLP). Recent research attempts to improve the task generalization ability of pre-trained language models by mapping NLP tasks into human-readable prompted forms. However, these approaches require laborious and inflexible manual collection of prompts, and different prompts on the same downstream task may receive unstable performance. We propose Unified Schema Prompt, a flexible and extensible prompting method, which automatically customizes the learnable prompts for each task according to the task input schema. It models the shared knowledge between tasks, while keeping the characteristics of different task schema, and thus enhances task generalization ability. The schema prompt takes the explicit data structure of each task to formulate prompts so that little human effort is involved. To test the task generalization ability of schema prompt at scale, we conduct schema prompt-based multitask pre-training on a wide variety of general NLP tasks. The framework achieves strong zero-shot and few-shot generalization performance on 16 unseen downstream tasks from 8 task types (e.g., QA, NLI, etc). Furthermore, comprehensive analyses demonstrate the effectiveness of each component in the schema prompt, its flexibility in task compositionality, and its ability to improve performance under a full-data fine-tuning setting.

preprint2022arXiv

Moiré Engineering of Nonsymmorphic Symmetries and Hourglass Superconductors

Moiré heterostructures hold the promise to provide platforms to tailor strongly correlated and topological states of matter. Here, we theoretically propose the emergence of an effective, rectangular moiré lattice in twisted bilayers of SnS with nonsymmorphic symmetry. Based on first-principles calculations, we demonstrate that strong intrinsic spin-orbit interactions render this tunable platform a moiré semimetal that hosts 2D hourglass fermions protected by time-reversal symmetry $\mathcal{T}$ and the nonsymmorphic screw rotation symmetry $\widetilde{\mathcal{C}}_{2y}$. We show that topological Fermi arcs connecting pairs of Weyl nodal points in the hourglass dispersion are preserved for weak electron-electron interactions, particularly in regions of superconducting order that emerge in the phase diagram of interaction strength and filling. Our work established moiré engineering as an inroad into the realm of correlated topological semimetals and may motivate further topology related researches in moiré heterostructures.

preprint2022arXiv

Mutual Attention-based Hybrid Dimensional Network for Multimodal Imaging Computer-aided Diagnosis

Recent works on Multimodal 3D Computer-aided diagnosis have demonstrated that obtaining a competitive automatic diagnosis model when a 3D convolution neural network (CNN) brings more parameters and medical images are scarce remains nontrivial and challenging. Considering both consistencies of regions of interest in multimodal images and diagnostic accuracy, we propose a novel mutual attention-based hybrid dimensional network for MultiModal 3D medical image classification (MMNet). The hybrid dimensional network integrates 2D CNN with 3D convolution modules to generate deeper and more informative feature maps, and reduce the training complexity of 3D fusion. Besides, the pre-trained model of ImageNet can be used in 2D CNN, which improves the performance of the model. The stereoscopic attention is focused on building rich contextual interdependencies of the region in 3D medical images. To improve the regional correlation of pathological tissues in multimodal medical images, we further design a mutual attention framework in the network to build the region-wise consistency in similar stereoscopic regions of different image modalities, providing an implicit manner to instruct the network to focus on pathological tissues. MMNet outperforms many previous solutions and achieves results competitive to the state-of-the-art on three multimodal imaging datasets, i.e., Parotid Gland Tumor (PGT) dataset, the MRNet dataset, and the PROSTATEx dataset, and its advantages are validated by extensive experiments.

preprint2022arXiv

Retrieval-Augmented Multilingual Keyphrase Generation with Retriever-Generator Iterative Training

Keyphrase generation is the task of automatically predicting keyphrases given a piece of long text. Despite its recent flourishing, keyphrase generation on non-English languages haven't been vastly investigated. In this paper, we call attention to a new setting named multilingual keyphrase generation and we contribute two new datasets, EcommerceMKP and AcademicMKP, covering six languages. Technically, we propose a retrieval-augmented method for multilingual keyphrase generation to mitigate the data shortage problem in non-English languages. The retrieval-augmented model leverages keyphrase annotations in English datasets to facilitate generating keyphrases in low-resource languages. Given a non-English passage, a cross-lingual dense passage retrieval module finds relevant English passages. Then the associated English keyphrases serve as external knowledge for keyphrase generation in the current language. Moreover, we develop a retriever-generator iterative training algorithm to mine pseudo parallel passage pairs to strengthen the cross-lingual passage retriever. Comprehensive experiments and ablations show that the proposed approach outperforms all baselines.

preprint2022arXiv

Sharp asymptotics for arm probabilities in critical planar percolation

In this work, we consider critical planar site percolation on the triangular lattice and derive sharp estimates on the asymptotics of the probability of half-plane $j$-arm events for $j \geq 1$ and planar (polychromatic) $j$-arm events for $j>1$. These estimates greatly improve previous results and in particular answer (a large part of) a question of Schramm (ICM Proc., 2006).

preprint2020arXiv

Explicit Memory Tracker with Coarse-to-Fine Reasoning for Conversational Machine Reading

The goal of conversational machine reading is to answer user questions given a knowledge base text which may require asking clarification questions. Existing approaches are limited in their decision making due to struggles in extracting question-related rules and reasoning about them. In this paper, we present a new framework of conversational machine reading that comprises a novel Explicit Memory Tracker (EMT) to track whether conditions listed in the rule text have already been satisfied to make a decision. Moreover, our framework generates clarification questions by adopting a coarse-to-fine reasoning strategy, utilizing sentence-level entailment scores to weight token-level distributions. On the ShARC benchmark (blind, held-out) testset, EMT achieves new state-of-the-art results of 74.6% micro-averaged decision accuracy and 49.5 BLEU4. We also show that EMT is more interpretable by visualizing the entailment-oriented reasoning process as the conversation flows. Code and models are released at https://github.com/Yifan-Gao/explicit_memory_tracker.

preprint2020arXiv

Non-linearity identification for construction workers' personality-safety behaviour predictive relationship using neural network and linear regression modelling

The prediction of workers' safety behaviour can help identify vulnerable workers who intend to undertake unsafe behaviours and be useful in the design of management practices to minimise the occurrence of accidents. The latest literature has evidenced that there is within-population diversity that leads people's intended safety behaviours in the workplace, which are found to vary among individuals as a function of their personality traits. In this study, an innovative forecasting model, which employs neural network algorithms, is developed to numerically simulate the predictive relationship between construction workers' personality traits and their intended safety behaviour. The data-driven nature of neural network enabled a reliable estimate of the relationship, which allowed this research to find that a nonlinear effect exists in the relationship. This research has practical implications. The neural network developed is shown to have highly satisfactory prediction accuracy and is thereby potentially useful for assisting project decision-makers to assess how prone workers are to carry out unsafe behaviours in the workplace.

Yifan Gao

What is connected

Connect this record

See the researcher in context

Building this map preview

9 published item(s)

DecomPose: Disentangling Cross-Category Optimization Contention for Category-Level 6D Object Pose Estimation

Electric field switching of altermagnetic spin-splitting in multiferroic skyrmions

Improving Task Generalization via Unified Schema Prompt

Moiré Engineering of Nonsymmorphic Symmetries and Hourglass Superconductors

Mutual Attention-based Hybrid Dimensional Network for Multimodal Imaging Computer-aided Diagnosis

Retrieval-Augmented Multilingual Keyphrase Generation with Retriever-Generator Iterative Training

Sharp asymptotics for arm probabilities in critical planar percolation

Explicit Memory Tracker with Coarse-to-Fine Reasoning for Conversational Machine Reading

Non-linearity identification for construction workers' personality-safety behaviour predictive relationship using neural network and linear regression modelling