Researcher profile

Yongjun Chen

Yongjun Chen contributes to research discovery and scholarly infrastructure.

Researcher; affiliation not imported; open to collaborate

Trust snapshot

Quick read

Trust 21 - Emerging
Verification L1
Unclaimed author
8 works
0 followers
5 topics
4 close collaborators

Published work

8 published item(s)

Preprint, 2022, arXiv

ELECRec: Training Sequential Recommenders as Discriminators

Sequential recommendation is often considered a generative task, i.e., training a sequential encoder to generate the next item of a user's interests based on her historical interacted items. Despite their prevalence, these methods usually require training with more meaningful samples to be effective; otherwise, the model will be poorly trained. In this work, we propose to train sequential recommenders as discriminators rather than generators. Instead of predicting the next item, our method trains a discriminator to distinguish whether a sampled item is a 'real' target item or not. A generator, as an auxiliary model, is trained jointly with the discriminator to sample plausible alternative next items and is discarded after training. The trained discriminator is taken as the final SR model and denoted ELECRec. Experiments conducted on four datasets demonstrate the effectiveness and efficiency of the proposed approach.
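The discriminator idea in this abstract can be illustrated with a minimal sketch. It assumes the discriminator emits one logit per sampled item and is trained with binary cross-entropy against a "real target or not" label; the function names and the choice of loss are illustrative assumptions, not taken from the paper.

```python
import math

def bce_with_logits(logit, label):
    # Numerically stable binary cross-entropy on a raw logit.
    return max(logit, 0.0) - logit * label + math.log1p(math.exp(-abs(logit)))

def discriminator_loss(logits, sampled_items, target_items):
    # Hypothetical helper: a sampled item is labeled 1 when it equals the
    # true next item at that position, and 0 otherwise.
    labels = [1.0 if s == t else 0.0 for s, t in zip(sampled_items, target_items)]
    losses = [bce_with_logits(x, y) for x, y in zip(logits, labels)]
    return sum(losses) / len(losses)
```

With uninformative logits of zero the loss equals log 2, and it falls as the discriminator grows confident in the correct labels.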

Preprint, 2022, arXiv

Generating Negative Samples for Sequential Recommendation

To make Sequential Recommendation (SR) successful, recent works focus on designing effective sequential encoders, fusing side information, and mining extra positive self-supervision signals. The strategy of sampling negative items at each time step is less explored. Due to the dynamics of users' interests and model updates during training, considering randomly sampled items from a user's non-interacted item set as negatives can be uninformative. As a result, the model will inaccurately learn user preferences toward items. Identifying informative negatives is challenging because informative negative items are tied with both dynamically changed interests and model parameters (and sampling process should also be efficient). To this end, we propose to Generate Negative Samples (items) for SR (GenNi). A negative item is sampled at each time step based on the current SR model's learned user preferences toward items. An efficient implementation is proposed to further accelerate the generation process, making it scalable to large-scale recommendation tasks. Extensive experiments on four public datasets verify the importance of providing high-quality negative samples for SR and demonstrate the effectiveness and efficiency of GenNi.
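As a rough illustration of sampling a negative from the model's current preferences, the sketch below draws one item with probability proportional to the softmax of the model's scores, excluding the positive item. The scoring input, the softmax weighting, and the function name are assumptions made for illustration; the paper's actual sampler and its efficient implementation may differ.

```python
import math
import random

def sample_negative(scores, positive_item, rng):
    # Candidates are all items except the observed positive (assumption:
    # a fuller version would also exclude the user's other interacted items).
    candidates = [i for i in range(len(scores)) if i != positive_item]
    # Subtract the maximum score before exponentiating for numerical stability.
    top = max(scores[i] for i in candidates)
    weights = [math.exp(scores[i] - top) for i in candidates]
    return rng.choices(candidates, weights=weights, k=1)[0]
```

Items the model currently scores highly are drawn more often, so the negatives track the model as it updates during training.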

Preprint, 2022, arXiv

Improving Contrastive Learning with Model Augmentation

Sequential recommendation aims at predicting the next items in user behaviors, which can be solved by characterizing item relationships in sequences. Due to the data sparsity and noise issues in sequences, a new self-supervised learning (SSL) paradigm is proposed to improve the performance, which employs contrastive learning between positive and negative views of sequences. However, existing methods all construct views by adopting augmentation from data perspectives, while we argue that 1) optimal data augmentation methods are hard to devise, 2) data augmentation methods destroy sequential correlations, and 3) data augmentation fails to incorporate comprehensive self-supervised signals. Therefore, we investigate the possibility of model augmentation to construct view pairs. We propose three levels of model augmentation methods: neuron masking, layer dropping, and encoder complementing. This work opens up a novel direction in constructing views for contrastive SSL. Experiments verify the efficacy of model augmentation for SSL in sequential recommendation. Code is available at https://github.com/salesforce/SRMA.

Preprint, 2022, arXiv

Intent Contrastive Learning for Sequential Recommendation

Users' interactions with items are driven by various intents (e.g., preparing for holiday gifts, shopping for fishing equipment, etc.). However, users' underlying intents are often unobserved/latent, making it challenging to leverage such latent intents for sequential recommendation (SR). To investigate the benefits of latent intents and leverage them effectively for recommendation, we propose Intent Contrastive Learning (ICL), a general learning paradigm that leverages a latent intent variable into SR. The core idea is to learn users' intent distribution functions from unlabeled user behavior sequences and optimize SR models with contrastive self-supervised learning (SSL) by considering the learned intents to improve recommendation. Specifically, we introduce a latent variable to represent users' intents and learn the distribution function of the latent variable via clustering. We propose to leverage the learned intents into SR models via contrastive SSL, which maximizes the agreement between a view of sequence and its corresponding intent. The training is alternated between intent representation learning and the SR model optimization steps within the generalized expectation-maximization (EM) framework. Fusing user intent information into SR also improves model robustness. Experiments conducted on four real-world datasets demonstrate the superiority of the proposed learning paradigm, which improves performance and robustness against data sparsity and noisy interaction issues.

Preprint, 2022, arXiv

Radio properties of the OH megamaser galaxy IIZw 096

Based on two epochs of EVN archive data from OH line observations of IIZw 096, we confirm that the high-resolution OH emission in this source mainly comes from two spots (OH1 and OH2) of comp D1 of this merging system. We found no significant variations in the OH line emission. The OH 1665 MHz line emission is detected at about the $6\sigma$ level in the OH1 region by combining the two epochs of EVN observations. We found that comp D1 shows the brightest CO and HCO+ line emission, as well as multi-band radio continuum emission. The environment around D1 shows no clear velocity structure associated with circular motions, making it different from most other OHMs in the literature, which might have been caused by an effect during the merger stage. Meanwhile, we found that the CO emission shows three velocity structures around D1, including the central broad FWHM region, the double peak region where the CO line profile shows two separated peaks, and the region of the high-velocity clouds where the CO line peaks at a high velocity ($\sim$11000 km s$^{-1}$). H I absorption also shows high-velocity clouds around the D1 region, which might be due to inflows caused by the merging of two or more galaxy components. Based on the high-resolution K-band VLA and L-band VLBA observations of the radio continuum emission, we derived a brightness temperature in the range $10^{5}$ K to $10^{6}$ K, which is consistent with other starburst-dominated OHM sources in the literature. The multi-band VLA observations show that the radio continuum emission of comp D might also have contributions from free-free emission, besides synchrotron emission. As a consequence, these results support a starburst origin for the OHMs, without the presence of an AGN.

Preprint, 2022, arXiv

The Photon Ring in M87*

We report measurements of the gravitationally lensed secondary image -- the first in an infinite series of so-called "photon rings" -- around the supermassive black hole M87* via simultaneous modeling and imaging of the 2017 Event Horizon Telescope (EHT) observations. The inferred ring size remains constant across the seven days of the 2017 EHT observing campaign and is consistent with theoretical expectations, providing clear evidence that such measurements probe spacetime and a striking confirmation of the models underlying the first set of EHT results. The residual diffuse emission evolves on timescales comparable to one week. We are able to detect with high significance a southwestern extension consistent with that expected from the base of a jet that is rapidly rotating in the clockwise direction. This result adds further support to the identification of the jet in M87* with a black hole spin-driven outflow, launched via the Blandford-Znajek process. We present three revised estimates for the mass of M87* based on identifying the modeled thin ring component with the bright ringlike features seen in simulated images, one of which is only weakly sensitive to the astrophysics of the emission region. All three estimates agree with each other and previously reported values. Our strongest mass constraint combines information from both the ring and the diffuse emission region, which together imply a mass-to-distance ratio of $4.20^{+0.12}_{-0.06}~\mu{\rm as}$ and a corresponding black hole mass of $(7.13\pm0.39)\times10^9 M_\odot$, where the error on the latter is now dominated by the systematic uncertainty arising from the uncertain distance to M87*.
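The step from the mass-to-distance ratio to a mass can be checked with a short calculation, using the angular gravitational radius θ_g = GM/(c²D). The distance of 16.8 Mpc used below is an assumption (the abstract does not state the adopted value), and the function name is illustrative.

```python
import math

G = 6.67430e-11                  # gravitational constant, m^3 kg^-1 s^-2
C = 2.99792458e8                 # speed of light, m/s
M_SUN = 1.98847e30               # solar mass, kg
PARSEC = 3.0856775814913673e16   # metres per parsec
MICROARCSEC = math.radians(1.0 / 3600.0) * 1e-6  # radians per microarcsecond

def black_hole_mass_msun(theta_g_uas, distance_mpc):
    # Invert theta_g = G M / (c^2 D) for M and express it in solar masses.
    theta_rad = theta_g_uas * MICROARCSEC
    distance_m = distance_mpc * 1e6 * PARSEC
    mass_kg = theta_rad * C**2 * distance_m / G
    return mass_kg / M_SUN

# Assumed distance of 16.8 Mpc; not given in the abstract.
mass = black_hole_mass_msun(4.20, 16.8)
```

Under that assumed distance the result is roughly 7 billion solar masses, within the quoted uncertainty of the abstract's value. Because the mass scales linearly with distance, the distance uncertainty carries straight through to the mass.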

Preprint, 2021, arXiv

Structural and spectral properties of Galactic plane variable radio sources

In the time domain, the radio sky, in particular along the Galactic plane, may vary significantly because of various energetic activities associated with stars and with stellar and supermassive black holes. Using multi-epoch Very Large Array surveys of the Galactic plane at 5.0 GHz, Becker et al. (2010) presented a catalogue of 39 variable radio sources in the flux density range 1-70 mJy. To probe their radio structures and spectra, we observed 17 sources with the very-long-baseline interferometric (VLBI) imaging technique and collected additional multi-frequency data from the literature. We detected all of the sources at 5 GHz with the Westerbork Synthesis Radio Telescope, but only G23.6644-0.0372 with the European VLBI Network (EVN). Together with its decadal variability and multi-frequency radio spectrum, we interpret it as an extragalactic peaked-spectrum source with a size of <~10 pc. The remaining sources were resolved out by the long baselines of the EVN because of either strong scatter broadening at Galactic latitudes <1 deg or intrinsically very extended structures on centi-arcsec scales. According to their spectral and structural properties, we find that the sample has a diverse nature. We notice two young H II regions and spot a radio star and a candidate planetary nebula. The rest of the sources are very likely associated with radio active galactic nuclei (AGN). Two of them also display faint arcsec-scale jet activity. The sample study indicates that AGN are commonplace even among variable radio sources in the Galactic plane.

Preprint, 2020, arXiv

The radio properties of the OH megamaser galaxy IRAS 02524+2046

We present results from VLBI observations of continuum and OH line emission in IRAS 02524+2046, as well as the arcsecond-scale radio properties of this galaxy from VLA archive data. We found no significant detection of radio continuum emission in the VLBI observations. The arcsecond-scale radio images of this source show no clear extended emission. The total radio flux densities at L and C band are around 2.9 mJy and 1.0 mJy respectively, which indicates a steep radio spectral index between the two bands. The steep spectral index, low brightness temperature and high $q$-ratio (the ratio of the FIR to the radio flux density), which are three critical indicators in the classification of radio activity in the nuclei of galaxies, are all consistent with the classification of this source as a starburst galaxy from its optical spectrum. The high-resolution line profile shows that both the 1665 and 1667 MHz OH maser lines have been detected, with three and two clear components respectively. The channel maps show that the maser emission is distributed over a region of $\sim$ 210 pc $\times$ 90 pc. The maser components detected in different regions show a similar double spectral feature, which might be evidence that this galaxy is at a major-merger stage, as seen from its optical morphology.
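The "steep spectral index" statement can be made concrete with a two-point estimate, taking the convention S ∝ ν^α. The abstract gives only the band names, so the frequencies below (1.4 GHz for L band and 4.9 GHz for C band) are assumed typical VLA values, and the function name is illustrative.

```python
import math

def spectral_index(s1_mjy, nu1_ghz, s2_mjy, nu2_ghz):
    # Two-point spectral index alpha, defined by S proportional to nu**alpha.
    return math.log(s1_mjy / s2_mjy) / math.log(nu1_ghz / nu2_ghz)

# Flux densities from the abstract; frequencies are assumed, not stated there.
alpha = spectral_index(2.9, 1.4, 1.0, 4.9)
```

Under these assumed frequencies the index is clearly negative and steeper than the commonly used threshold of -0.5, consistent with the abstract's description.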