Researcher profile

Ying Qin

Ying Qin contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
18works
0followers
13topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

18 published item(s)

preprint2026arXiv

Key Coverage Matters: Semi-Structured Extraction of OCR Clinical Reports

Clinical reports are often fragmented across healthcare institutions because privacy regulations and data silos limit direct information sharing. When patients seek care at a different hospital, they often carry paper or scanned reports from prior visits. This hinders EHR integration and longitudinal review, and downstream applications that depend on more complete patient records, such as patient management, follow-up care, real-world studies, and clinical-trial matching. Although OCR can digitize such reports, reliable extraction remains challenging because clinical documents are heterogeneous, OCR text is noisy, and many healthcare settings require low-cost on-premise deployment. We formulate this problem as canonical key-conditioned extractive question answering over OCR-derived clinical reports. Because the key fields are neither fixed nor known in advance, the key space is open. We maintain a canonical key inventory through iterative key mining, normalization, clustering, and lightweight human verification, and introduce key coverage as a metric to quantify inventory completeness. Using a 0.2B BERT-based model, experiments on real-world reports from more than 20 hospitals show performance improves monotonically with key coverage. The model achieves F1 scores of 0.839 and 0.893 under exact match and boundary-tolerant matching, respectively, once the Top-90 canonical keys are covered. These results show that key coverage is a dominant factor for end-to-end performance. At Top-90 coverage, our model outperforms a fine-tuned Qwen3-0.6B baseline under exact match. Although our annotated corpus is Chinese, the method relies on the language-agnostic key-value organization of semi-structured clinical reports and can be adapted to other settings given an appropriate canonical key inventory and alias mapping.

preprint2024arXiv

Sparsely Shared LoRA on Whisper for Child Speech Recognition

Whisper is a powerful automatic speech recognition (ASR) model. Nevertheless, its zero-shot performance on low-resource speech requires further improvement. Child speech, as a representative type of low-resource speech, is leveraged for adaptation. Recently, parameter-efficient fine-tuning (PEFT) in NLP was shown to be comparable and even better than full fine-tuning, while only needing to tune a small set of trainable parameters. However, current PEFT methods have not been well examined for their effectiveness on Whisper. In this paper, only parameter composition types of PEFT approaches such as LoRA and Bitfit are investigated as they do not bring extra inference costs. Different popular PEFT methods are examined. Particularly, we compare LoRA and AdaLoRA and figure out the learnable rank coefficient is a good design. Inspired by the sparse rank distribution allocated by AdaLoRA, a novel PEFT approach Sparsely Shared LoRA (S2-LoRA) is proposed. The two low-rank decomposed matrices are globally shared. Each weight matrix only has to maintain its specific rank coefficients that are constrained to be sparse. Experiments on low-resource Chinese child speech show that with much fewer trainable parameters, S2-LoRA can achieve comparable in-domain adaptation performance to AdaLoRA and exhibit better generalization ability on out-of-domain data. In addition, the rank distribution automatically learned by S2-LoRA is found to have similar patterns to AdaLoRA's allocation.

preprint2023arXiv

iEmoTTS: Toward Robust Cross-Speaker Emotion Transfer and Control for Speech Synthesis based on Disentanglement between Prosody and Timbre

The capability of generating speech with specific type of emotion is desired for many applications of human-computer interaction. Cross-speaker emotion transfer is a common approach to generating emotional speech when speech with emotion labels from target speakers is not available for model training. This paper presents a novel cross-speaker emotion transfer system, named iEmoTTS. The system is composed of an emotion encoder, a prosody predictor, and a timbre encoder. The emotion encoder extracts the identity of emotion type as well as the respective emotion intensity from the mel-spectrogram of input speech. The emotion intensity is measured by the posterior probability that the input utterance carries that emotion. The prosody predictor is used to provide prosodic features for emotion transfer. The timber encoder provides timbre-related information for the system. Unlike many other studies which focus on disentangling speaker and style factors of speech, the iEmoTTS is designed to achieve cross-speaker emotion transfer via disentanglement between prosody and timbre. Prosody is considered as the main carrier of emotion-related speech characteristics and timbre accounts for the essential characteristics for speaker identification. Zero-shot emotion transfer, meaning that speech of target speakers are not seen in model training, is also realized with iEmoTTS. Extensive experiments of subjective evaluation have been carried out. The results demonstrate the effectiveness of iEmoTTS as compared with other recently proposed systems of cross-speaker emotion transfer. It is shown that iEmoTTS can produce speech with designated emotion type and controllable emotion intensity. With appropriate information bottleneck capacity, iEmoTTS is able to effectively transfer emotion information to a new speaker. Audio samples are publicly available https://patrick-g-zhang.github.io/iemotts/

preprint2023arXiv

Potential Subpopulations and Assembling Tendency of the Merging Black Holes

The origins of coalescing binary black holes (BBHs) detected by the advanced LIGO/Virgo are still under debate, and clues may be present in the joint mass-spin distribution of these merger events. Here we construct phenomenological models containing two sub-populations to investigate the BBH population detected in gravitational wave observations. We find that our models can explain the GWTC-3 data rather well, and several constraints to our model are required by the data: first, the maximum mass for the component with a stellar-origin, $m_{\rm max}$, is $39.1^{+2.4}_{-2.7}M_{\odot}$ at 90\% credibility; second, about $15\%$ of the mergers happen in dynamical environments, in which $7-16\%$ of events are hierarchical mergers, and these BHs have an average spin magnitude significantly larger than the first-generation mergers, with ${\rm d}μ_{\rm a} > 0.4 $ at $99\%$ credibility; third, the dynamical component BHs tend to pair with each other with larger total mass and higher mass ratio. An independent analysis focusing on spins is also carried out, and we find that the spin amplitude of component BHs can be divided into two groups according to a division mass $m_{\rm d} = 46.1^{+5.6}_{-5.1}M_{\odot}$. These constraints can be naturally explained by current formation channels, and our results suggest that some of the observed events were likely from AGN disks.

preprint2022arXiv

A Channel to Form Fast-spinning Black Hole--Neutron Star Binary Mergers as Multi-messenger Sources

After the successful detection of a gravitational-wave (GW) signal and its associated electromagnetic (EM) counterparts from GW170817, neutron star--black hole (NSBH) mergers have been highly expected to be the next type of multi-messenger source. However, despite the detection of several of NSBH merger candidates during the GW third observation run, no confirmed EM counterparts from these sources have been identified. The most plausible explanation is that these NSBH merger candidates were plunging events mainly because the primary BHs had near-zero projected aligned-spins based on GW observations. In view that NSs can be easily tidally disrupted by BHs with high projected aligned-spins, we study an evolution channel to form NSBH binaries with fast-spinning BHs, the properties of BH mass and spin, and their associated tidal disruption probability. We find that if the NSs are born firstly, the companion helium stars would be tidally spun up efficiently, and would thus finally form fast-spinning BHs. If BHs do not receive significant natal kicks at birth, these NSBH binaries that can merge within the Hubble time would have BHs with the projected aligned-spins $χ_{z}\gtrsim0.8$ and, hence, can certainly allow tidal disruption to happen. Even if significant BH kicks are considered for a small fraction of NSBH binaries, the projected aligned-spins of BHs are $χ_z\gtrsim0.2$. These systems can still be disrupted events unless the NSs are very massive. Thus, NS-first-born NSBH mergers would be promising multi-messenger sources. We discuss various potential EM counterparts associated with these systems and their detectability in the upcoming fourth observation run.

preprint2022arXiv

Hypercritical Accretion for Black Hole High Spin in Cygnus X-1

Recent observations of AdLIGO and Virgo have shown that the spin measurements in binary black hole (BH) systems are typically small, which is consistent with the predictions by the classical isolated binary evolution channel. In this standard formation channel, the progenitor of the first-born BH is assumed to have efficient angular momentum transport. The BH spins in high-mass X-ray binaries (HMXBs), however, have been found consistently to be extremely high. In order to explain the high BH spins, the inefficient angular momentum transport inside the BH progenitor is required. This requirement, however, is incompatible with the current understanding of conventional efficient angular momentum transport mechanism. We find that this tension can be highly alleviated as long as the hypercritical accretion is allowed. We show that, for a case study of Cygnus X-1, the hypercritical accretion cannot only be a good solution for the inconsistent assumption upon the angular momentum transport within massive stars, but match its other properties reported recently.

preprint2022arXiv

Long-duration Gamma-ray Burst and Associated Kilonova Emission from Fast-spinning Black Hole--Neutron Star Mergers

Here we collect three unique bursts, GRBs\,060614, 211211A and 211227A, all characterized by a long-duration main emission (ME) phase and a rebrightening extended emission (EE) phase, to study their observed properties and the potential origin as neutron star-black hole (NSBH) mergers. NS-first-born (BH-first-born) NSBH mergers tend to contain fast-spinning (non-spinning) BHs that more easily (hardly) allow tidal disruption to happen with (without) forming electromagnetic signals. We find that NS-first-born NSBH mergers can well interpret the origins of these three GRBs, supported by that: (1) Their X-ray MEs and EEs show unambiguous fall-back accretion signatures, decreasing as $\propto{t}^{-5/3}$, which might account for their long duration. The EEs can result from the fall-back accretion of $r$-process heating materials, predicted to occur after NSBH mergers. (2) The beaming-corrected local event rate density for this type of merger-origin long-duration GRBs is $\mathcal{R}_0\sim2.4^{+2.3}_{-1.3}\,{\rm{Gpc}}^{-3}\,{\rm{yr}}^{-1}$, consistent with that of NS-first-born NSBH mergers. (3) Our detailed analysis on the EE, afterglow and kilonova of the recently high-impact event GRB\,211211A reveals it could be a merger between a $\sim1.23^{+0.06}_{-0.07}\,M_\odot$ NS and a $\sim8.21^{+0.77}_{-0.75}\,M_\odot$ BH with an aligned-spin of $χ_{\rm{BH}}\sim0.62^{+0.06}_{-0.07}$, supporting an NS-first-born NSBH formation channel. Long-duration burst with rebrightening fall-back accretion signature after ME, and bright kilonova might be commonly observed features for on-axis NSBHs. We estimate the multimessenger detection rate between gravitational waves, GRBs and kilonovae from NSBH mergers in O4 (O5) is $\sim0.1\,{\rm{yr}}^{-1}$ ($\sim1\,{\rm{yr}}^{-1}$).

preprint2022arXiv

Population Properties of Gravitational-Wave Neutron Star--Black Hole Mergers

Over the course of the third observing run of LIGO-Virgo-KAGRA Collaboration, several gravitational-wave (GW) neutron star--black hole (NSBH) candidates have been announced. By assuming these candidates are real signals and of astrophysical origins, we analyze the population properties of the mass and spin distributions for GW NSBH mergers. We find that the primary BH mass distribution of NSBH systems, whose shape is consistent with that inferred from the GW binary BH (BBH) primaries, can be well described as a power-law with an index of $α= 4.8^{+4.5}_{-2.8}$ plus a high-mass Gaussian component peaking at $\sim33^{+14}_{-9}\,M_\odot$. The NS mass spectrum could be shaped as a near flat distribution between $\sim1.0-2.1\,M_\odot$. The constrained NS maximum mass agrees with that inferred from NSs in our Galaxy. If GW190814 and GW200210 are NSBH mergers, the posterior results of the NS maximum mass would be always larger than $\sim2.5\,M_\odot$ and significantly deviate from that inferred in the Galactic NSs. The effective inspiral spin and effective precession spin of GW NSBH mergers are measured to potentially have near-zero distributions. The negligible spins for GW NSBH mergers imply that most events in the universe should be plunging events, which supports the standard isolated formation channel of NSBH binaries. More NSBH mergers to be discovered in the fourth observing run would help to more precisely model the population properties of cosmological NSBH mergers.

preprint2022arXiv

The formation of the stripped envelope type II b Supernova progenitors: Rotation, Metallicity and Overshooting

Type IIb supernovae are believed to originate from core-collapse progenitors having kept only a very thin hydrogen envelope. We aim to explore how some physical factors, such as rotation, metallicity, overshooting, and the initial orbital period in binaries, significantly affect the Roche lobe overflow and the formation of type IIb supernovae. It is found that binaries are the main channel that capable of producing type typeIIb supernovae progenitors in the mass range for initial masses below 20 $M_{\odot}$. The formation of type IIb supernova progenitors is extremely sensitive to the initial orbital period. A less massive hydrogen indicates smaller radius and a higher effective temperatures, and vice versa. Binary systems with initial periods between 300 and 720 days produce type IIb progenitors that are a red supergiant. Those with an initial period between 50 and 300 days produce yellow supergiant progenitors and those with initial periods shorter than 50 days, blue supergiant progenitors. Both rapid rotation and larger overshooting can enlarge the carbon-oxygen core mass and lead to higher core temperature and lower central density at the pre-collapse phase. They are also beneficial to surface nitrogen enrichment but restrict the efficiency of the first dredge-up. SN IIb progenitors with low metallicity have smaller hydrogen envelope masses and radii than the high metallicity counterparts. Ultra-stripped binary models have systematically higher core mass fraction $\rm ^{12}C$ left, which has important influence on the compactness of type IIb progenitors.

preprint2021arXiv

Close binary evolution based on Gaia DR2: the origin of late WC-type Wolf-Rayet stars with low luminosity

The observed late-type WC Wolf-Rayet stars (WC7-9) with low luminosity below $\rm \log L/L_{\odot} < 5.4$ in the HR diagram cannot be reproduced satisfactorily by the evolutionary track of single stars. The mass transfer due to Roche lobe overflow drastically modifies the internal structure and surface compositions of two components. Therefore, binaries provide a very promising evolutionary channel to produce these WC stars.

preprint2021arXiv

Constraining the circumburst medium of gamma-ray bursts with X-ray afterglows

Long gamma-ray bursts (GRBs) are considered to be originated from core collapse of massive stars. It is believed that the afterglow property is determined by the density of the material in the surrounding interstellar medium. Therefore, the circumburst density can be used to distinguish between an interstellar wind $n(R) \propto R^{\rm -k}$, and a constant density medium (ISM), $n(R) = const$. Previous studies with different afterglow samples, show that the circumburst medium of GRBs is neither simply supported by an interstellar wind, nor completely favored by an ISM. In this work, our new sample is consisted of 39 GRBs with smoothly onset bump-like features in early X-ray afterglows, in which 20 GRBs have the redshift measurements. By using a smooth broken power law function to fit the bumps of X-ray light curves, we derive the full width at half-maximum (FWHM) as the feature width ($ω$), as well as the rise and decay time scales of the bumps ($T_{\rm r}$ and $T_{\rm d}$). The correlations between the timescales of X-ray bumps are similar to those found previously in the optical afterglows. Based on the fireball forward shock (FS) model of the thin shell case, we obtain the distribution of the electron spectral index $p$, and further constrain the medium density distribution index $k$. The new inferred $k$ is found to be concentrated at 1.0, with a range from 0.2 to 1.8. This finding is consistent with previous studies. The conclusion of our detailed investigation for X-ray afterglows suggests that the ambient medium of the selected GRBs is not homogeneous, i.e., neither ISM nor the typical interstellar wind.

preprint2021arXiv

On the Angular Momentum Transport Efficiency within the Star Constrained from Gravitational-wave Observations

The LIGO Scientific Collaboration and Virgo Collaboration (LVC) have recently reported in GWTC-2.1 eight additional candidate events with a probability of astrophysical origin greater than 0.5 in the LVC deeper search on O3a running. In GWTC-2.1, the majority of the effective inspiral spins ($χ_{\rm eff}$) show magnitudes consistent with zero, while two (GW190403$_{-}$051519 and GW190805$_{-}$211137) of the eight new events have $χ_{\rm eff}$ $> 0$ (at 90% credibility). We note that GW190403$_{-}$051519 was reported with $χ_{\rm eff}$ = $0.70^{+0.15}_{-0.27}$ and mass ratio $q$ = $0.25^{+0.54}_{-0.11}$, respectively. Assuming a uniform prior probability between 0 and 1 for each black hole&#39;s dimensionless spin magnitude, GW190403$_{-}$051519 was reported with the dimensionless spin of the more massive black hole, $χ_1$ = $0.92^{+0.07}_{-0.22}$. This is the fastest first-born black hole ever measured in all current gravitational-wave events. If GW190403$_{-}$051519 is formed through isolated binary evolution channel, this extremely high spin challenges, at least in that case, the existence of efficient angular momentum transport mechanism between the stellar core and the radiative envelope of massive stars, as for instance predicted by the Tayler-Spruit dynamo (Spruit 2002) or its revised version by Fuller et al. 2019.

preprint2021arXiv

Probing the progenitors of spinning binary black-hole mergers with long gamma-ray bursts

Long-duration gamma-ray bursts are thought to be associated with the core-collapse of massive, rapidly spinning stars and the formation of black holes. However, efficient angular momentum transport in stellar interiors, currently supported by asteroseismic and gravitational-wave constraints, leads to predominantly slowly-spinning stellar cores. Here, we report on binary stellar evolution and population synthesis calculations, showing that tidal interactions in close binaries not only can explain the observed sub-population of spinning, merging binary black holes but also lead to long gamma-ray bursts at the time of black-hole formation. Given our model calibration against the distribution of isotropic-equivalent energies of luminous long gamma-ray bursts, we find that ~10% of the GWTC-2 reported binary black holes had a luminous long gamma-ray burst associated with their formation, with GW190517 and GW190719 having a probability of ~85% and ~60%, respectively, being among them. Moreover, given an assumption about their average beaming fraction, our model predicts the rate density of long gamma-ray bursts, as a function of redshift, originating from this channel. For a constant beaming fraction $f_\mathrm{B}\sim 0.05$ our model predicts a rate density comparable to the observed one, throughout the redshift range, while, at redshift $z \in [0,2.5]$, a tentative comparison with the metallicity distribution of observed LGRB host galaxies implies that between 20% to 85% of the observed long gamma-ray bursts may originate from progenitors of merging binary black holes. The proposed link between a potentially significant fraction of observed, luminous long gamma-ray bursts and the progenitors of spinning binary black-hole mergers allows us to probe the latter well outside the horizon of current-generation gravitational wave observatories, and out to cosmological distances.

preprint2021arXiv

Raman spectrum of Janus transition metal dichalcogenide monolayers WSSe and MoSSe

Janus transition metal dichalcogenides (TMDs) lose the horizontal mirror symmetry of ordinary TMDs, leading to the emergence of additional features, such as native piezoelectricity, Rashba effect, and enhanced catalytic activity. While Raman spectroscopy is an essential nondestructive, phase- and composition-sensitive tool to monitor the synthesis of materials, a comprehensive study of the Raman spectrum of Janus monolayers is still missing. Here, we discuss the Raman spectra of WSSe and MoSSe measured at room and cryogenic temperatures, near and off resonance. By combining polarization-resolved Raman data with calculations of the phonon dispersion and using symmetry considerations, we identify the four first-order Raman modes and higher-order two-phonon modes. Moreover, we observe defect-activated phonon processes, which provide a route toward a quantitative assessment of the defect concentration and, thus, the crystal quality of the materials synthesized. Our work establishes a solid background for future research on material synthesis, study, and application of Janus TMD monolayers.

preprint2020arXiv

Confinement of long-lived interlayer excitons in WS$_2$/WSe$_2$ heterostructures

Interlayer excitons in layered materials constitute a novel platform to study many-body phenomena arising from long-range interactions between quantum particles. The ability to localise individual interlayer excitons in potential energy traps is a key step towards simulating Hubbard physics in artificial lattices. Here, we demonstrate spatial localisation of long-lived interlayer excitons in a strongly confining trap array using a WS$_{2}$/WSe$_{2}$ heterostructure on a nanopatterned substrate. We detect long-lived interlayer excitons with lifetime approaching 0.2 ms and show that their confinement results in a reduced lifetime in the microsecond range and stronger emission rate with sustained optical selection rules. The combination of a permanent dipole moment, spatial confinement and long lifetime places interlayer excitons in a regime that satisfies one of the requirements for observing long-range dynamics in an optically resolvable trap lattice.

preprint2020arXiv

The origin of spin in binary black holes: Predicting the distributions of the main observables of Advanced LIGO

We study the formation of coalescing binary black holes via the evolution of isolated field binaries that go through the common envelope phase in order to obtain the combined distributions of the main observables of Advanced LIGO. We used a hybrid technique that combines the parametric binary population synthesis code COMPAS with detailed binary evolution simulations performed with the MESA code. We then convolved our binary evolution calculations with the redshift- and metallicity-dependent star-formation rate and the selection effects of gravitational-wave detectors to obtain predictions of observable properties. By assuming efficient angular momentum transport, we are able to present a model that is capable of simultaneously predicting the three main gravitational-wave observables: the effective inspiral spin parameter $χ_{eff}$, the chirp mass $M_{chirp}$, and the cosmological redshift of merger $z_{merger}$. We find an excellent agreement between our model and the ten events from the first two advanced detector observing runs. We make predictions for the third observing run O3 and for Advanced LIGO design sensitivity. We expect around 80% of events with $χ_{eff} < 0.1$, while the remaining 20% of events with $χ_{eff} \ge 0.1$ are split into ~10% with $M_{chirp} < 15$ M$_\odot$ and ~10% with $M_{chirp} \ge 15$ M$_\odot$. In conclusion, the favorable comparison of the existing LIGO/Virgo observations with our model predictions gives support to the idea that the majority, if not all of the observed mergers, originate from the evolution of isolated binaries. The first-born black hole has negligible spin because it lost its envelope after it expanded to become a giant star, while the spin of the second-born black hole is determined by the tidal spin up of its naked helium star progenitor by the first-born black hole companion after the binary finished the common-envelope phase.

preprint2019arXiv

Near-unity light absorption in a monolayer WS2 van der Waals heterostructure cavity

Excitons in monolayer transition-metal-dichalcogenides (TMDs) dominate their optical response and exhibit strong light-matter interactions with lifetime-limited emission. While various approaches have been applied to enhance light-exciton interactions in TMDs, the achieved strength have been far below unity, and a complete picture of its underlying physical mechanisms and fundamental limits has not been provided. Here, we introduce a TMD-based van der Waals heterostructure cavity that provides near-unity excitonic absorption, and emission of excitonic complexes that are observed at ultra-low excitation powers. Our results are in full agreement with a quantum theoretical framework introduced to describe the light-exciton-cavity interaction. We find that the subtle interplay between the radiative, non-radiative and dephasing decay rates plays a crucial role, and unveil a universal absorption law for excitons in 2D systems. This enhanced light-exciton interaction provides a platform for studying excitonic phase-transitions and quantum nonlinearities and enables new possibilities for 2D semiconductor-based optoelectronic devices.

preprint2019arXiv

Resolving spin, valley, and moiré quasi-angular momentum of interlayer excitons in WSe2/WS2 heterostructures

Moiré superlattices provide a powerful way to engineer properties of electrons and excitons in two-dimensional van der Waals heterostructures. The moiré effect can be especially strong for interlayer excitons, where electrons and holes reside in different layers and can be addressed separately. In particular, it was recently proposed that the moiré superlattice potential not only localizes interlayer exciton states at different superlattice positions, but also hosts an emerging moiré quasi-angular momentum (QAM) that periodically switches the optical selection rules for interlayer excitons at different moiré sites. Here we report the observation of multiple interlayer exciton states coexisting in a WSe2/WS2 moiré superlattice and unambiguously determine their spin, valley, and moiré QAM through novel resonant optical pump-probe spectroscopy and photoluminescence excitation spectroscopy. We demonstrate that interlayer excitons localized at different moiré sites can exhibit opposite optical selection rules due to the spatially-varying moiré QAM. Our observation reveals new opportunities to engineer interlayer exciton states and valley physics with moiré superlattices for optoelectronic and valleytronic applications.