Researcher profile

Dong Yao

Dong Yao contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 17 - UnverifiedVerification L1Unclaimed author
4works
0followers
7topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

4 published item(s)

preprint2022arXiv

CCL4Rec: Contrast over Contrastive Learning for Micro-video Recommendation

Micro-video recommender systems suffer from the ubiquitous noises in users' behaviors, which might render the learned user representation indiscriminating, and lead to trivial recommendations (e.g., popular items) or even weird ones that are far beyond users' interests. Contrastive learning is an emergent technique for learning discriminating representations with random data augmentations. However, due to neglecting the noises in user behaviors and treating all augmented samples equally, the existing contrastive learning framework is insufficient for learning discriminating user representations in recommendation. To bridge this research gap, we propose the Contrast over Contrastive Learning framework for training recommender models, named CCL4Rec, which models the nuances of different augmented views by further contrasting augmented positives/negatives with adaptive pulling/pushing strengths, i.e., the contrast over (vanilla) contrastive learning. To accommodate these contrasts, we devise the hardness-aware augmentations that track the importance of behaviors being replaced in the query user and the relatedness of substitutes, and thus determining the quality of augmented positives/negatives. The hardness-aware augmentation also permits controllable contrastive learning, leading to performance gains and robust training. In this way, CCL4Rec captures the nuances of historical behaviors for a given user, which explicitly shields off the learned user representation from the effects of noisy behaviors. We conduct extensive experiments on two micro-video recommendation benchmarks, which demonstrate that CCL4Rec with far less model parameters could achieve comparable performance to existing state-of-the-art method, and improve the training/inference speed by several orders of magnitude.

preprint2022arXiv

Contrastive Learning with Positive-Negative Frame Mask for Music Representation

Self-supervised learning, especially contrastive learning, has made an outstanding contribution to the development of many deep learning research fields. Recently, researchers in the acoustic signal processing field noticed its success and leveraged contrastive learning for better music representation. Typically, existing approaches maximize the similarity between two distorted audio segments sampled from the same music. In other words, they ensure a semantic agreement at the music level. However, those coarse-grained methods neglect some inessential or noisy elements at the frame level, which may be detrimental to the model to learn the effective representation of music. Towards this end, this paper proposes a novel Positive-nEgative frame mask for Music Representation based on the contrastive learning framework, abbreviated as PEMR. Concretely, PEMR incorporates a Positive-Negative Mask Generation module, which leverages transformer blocks to generate frame masks on the Log-Mel spectrogram. We can generate self-augmented negative and positive samples by masking important components or inessential components, respectively. We devise a novel contrastive learning objective to accommodate both self-augmented positives/negatives sampled from the same music. We conduct experiments on four public datasets. The experimental results of two music-related downstream tasks, music classification, and cover song identification, demonstrate the generalization ability and transferability of music representation learned by PEMR.

preprint2022arXiv

Mean Field Behavior during the Big Bang Regime for Coalescing Random Walks

In this paper we consider coalescing random walks on a general connected graph $G=(V,E)$. We set up a unified framework to study the leading order of the decay rate of $P_t$, the expectation of the fraction of occupied sites at time $t$, particularly for the `Big Bang' regime where $t\ll t_{\text{coal}}:=\mathbb{E}[\inf\{s:\text{There is only one particle at time }s\}]$. Our results show that $P_t$ satisfies certain mean field behavior, if the graphs satisfy certain transience-like conditions. We apply this framework to two families of graphs: (1) graphs given by the configuration model with a degree distribution supported in $[3,\bar d]$ for some $\bar d\geq 3$, and (2) finite and infinite vertex-transitive graphs. In the first case, we show that for $1 \ll t \ll |V|$, $P_t$ decays in the order of $t^{-1}$, and $(tP_t)^{-1}$ is approximately the probability that two particles starting from the root of the corresponding unimodular Galton-Watson tree never collide after one of them leaves the root, which is also roughly $|V|/(2t_{\text{meet}})$, where $t_{\text{meet}}$ is the mean meeting time of two walkers. By taking the local weak limit, for the unimodular Galton-Watson tree we prove the convergence of $tP_t$ as $t\to\infty$. For the second family of graphs, if we take a sequence of finite graphs $G_n=(V_n, E_n)$, such that $t_{\text{meet}}=O(|V_n|)$ and the inverse of the spectral gap $t_{\text{rel}}$ is $o(|V_n|)$, then for $t_{\text{rel}}\ll t\ll t_{\text{coal}}$, $(tP_t)^{-1}$ is approximately the probability that two random walks never meet before time $t$, and also $|V|/(2t_{\text{meet}})$. In addition, we define a natural uniform transience condition, and show that it implies the above for all $1\ll t\ll t_{\text{coal}}$. Such estimates of $tP_t$ are also obtained for all infinite transient transitive unimodular graphs, in particular, all transient transitive amenable graphs.

preprint2022arXiv

Theoretical and experimental study on Noise Equivalent Power of X-ray semiconductor ultra-fast response material based on the rad-optic effect

Semiconductor material based on the rad-optic effect enables ultra-fast detection of X-rays and plays an important role in fusion diagnostics. Obtaining the accurate noise equivalent power (NEP) of the semiconductor ultrafast response material is the key to detecting X-rays. In this paper, the refractive index change mechanism of the semiconductor under X-ray irradiation was analyzed, and the quantitative relationship between the diffraction efficiency and the X-ray photon energy was established through the LT-AlGaAs diffraction imaging experiments. The impulse responses of LT-AlGaAs under 1 KeV-10 KeV X-ray radiation were calculated, revealing the variation of NEP density with radiated photon energy. In the case of bombarding the Al target to generate 1.5 KeV X-rays, the imaging experiments of LT-AlGaAs were performed. The diffraction image of LT-AlGaAs has a linear relationship with the radiation intensity, and the NEP density of LT-AlGaAs reaches 4.80*105W/cm2. This study has reference significance for the development of ultra-fast X-ray imaging systems based on the rad-optic effect.