Researcher profile

Yun Zheng

Yun Zheng contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
16works
0followers
6topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

16 published item(s)

preprint2022arXiv

CHANG-ES XXV: HI Imaging of Nearby Edge-on Galaxies -- Data Release 4

We present the HI distribution of galaxies from the Continuum Halos in Nearby Galaxies - an EVLA Survey (CHANG-ES). Though the observational mode was not optimized for detecting HI, we successfully produce HI cubes for 19 galaxies. The moment-0 maps from this work are available on CHANG-ES data release website, i.e., https://www.queensu.ca/changes. Our sample is dominated by star-forming, HI-rich galaxies at distances from 6.27 to 34.1 Mpc. HI interferometric images on two of these galaxies (NGC 5792 and UGC 10288) are presented here for the first time, while 12 of our remaining sample galaxies now have better HI spatial resolutions and/or sensitivities of intensity maps than those in existing publications. We characterize the average scale heights of the HI distributions for a subset of most inclined galaxies (inclination > 80 deg), and compare them to the radio continuum intensity scale heights, which have been derived in a similar way. The two types of scale heights are well correlated, with similar dependence on disk radial extension and star formation rate surface density but different dependence on mass surface density. This result indicates that the vertical distribution of the two components may be governed by similar fundamental physics but with subtle differences.

preprint2022arXiv

Construct the emission line galaxy-host halo connection through auto and cross correlations

We investigate the [O\,II] emission line galaxy (ELG)-host halo connection via auto and cross correlations, and propose a concise and effective method to populate ELGs in dark matter halos without assuming a parameterized halo occupation distribution (HOD) model. Using the observational data from VIMOS Public Extragalactic Redshift Survey (VIPERS), we measure the auto and cross correlation functions between ELGs selected by [O\,II] luminosity and normal galaxies selected by stellar mass. Combining the stellar-halo mass relation (SHMR) derived for the normal galaxies and the fraction of ELGs observed in the normal galaxy population, we demonstrate that we can establish an accurate ELG-halo connection. With the ELG-halo connection, we can accurately reproduce the auto and cross correlation functions of ELGs and normal galaxies both in real-space and in redshift-space, once the satellite fraction is properly reduced. Our method provides a novel strategy to generate ELG mock catalogs for ongoing and upcoming galaxy redshift surveys. We also provide a simple description for the HOD of ELGs.

preprint2022arXiv

Disentangled Representation Learning for Text-Video Retrieval

Cross-modality interaction is a critical component in Text-Video Retrieval (TVR), yet there has been little examination of how different influencing factors for computing interaction affect performance. This paper first studies the interaction paradigm in depth, where we find that its computation can be split into two terms, the interaction contents at different granularity and the matching function to distinguish pairs with the same semantics. We also observe that the single-vector representation and implicit intensive function substantially hinder the optimization. Based on these findings, we propose a disentangled framework to capture a sequential and hierarchical representation. Firstly, considering the natural sequential structure in both text and video inputs, a Weighted Token-wise Interaction (WTI) module is performed to decouple the content and adaptively exploit the pair-wise correlations. This interaction can form a better disentangled manifold for sequential inputs. Secondly, we introduce a Channel DeCorrelation Regularization (CDCR) to minimize the redundancy between the components of the compared vectors, which facilitate learning a hierarchical representation. We demonstrate the effectiveness of the disentangled representation on various benchmarks, e.g., surpassing CLIP4Clip largely by +2.9%, +3.1%, +7.9%, +2.3%, +2.8% and +6.5% R@1 on the MSR-VTT, MSVD, VATEX, LSMDC, AcitivityNet, and DiDeMo, respectively.

preprint2022arXiv

eDIG-CHANGES I: Extended Hα Emission from the Extraplanar Diffuse Ionized Gas (eDIG) around CHANG-ES Galaxies

The extraplanar diffuse ionized gas (eDIG) represents the cool/warm ionized gas reservoir around galaxies. We present a spatial analysis of H$α$ images of 22 nearby edge-on spiral galaxies from the CHANG-ES sample (the eDIG-CHANGES project), taken with the APO 3.5m telescope, in order to study their eDIG. We conduct an exponential fit to the vertical intensity profiles of the sample galaxies, of which 16 can be decomposed into a thin disk plus an extended thick disk component. The median value of the scale height (h) of the extended component is $1.13\pm 0.14$ kpc. We find a tight sublinear correlation between h and the SFR. Moreover, the offset of individual galaxies from the best-fit SFR-h relation shows significant anti-correlation with SFR_SD. This indicates that galaxies with more intense star formation tend to have disproportionately extended eDIG. Combined with data from the literature, we find that the correlations between the eDIG properties and the galaxies' properties extend to broader ranges. We further compare the vertical extension of the eDIG to multi-wavelength measurements of other CGM phases. We find the eDIG to be slightly more extended than the neutral gas (HI 21-cm line), indicating the existence of some extended ionizing sources. Most galaxies have an X-ray scale height smaller than the h, suggesting that the majority of the X-ray emission detected in shallow observations are actually from the thick disk. The h is comparable to the L-band radio continuum scale height, both slightly larger than that at higher frequencies (C-band), where the cooling is stronger and the thermal contribution may be larger. The comparable H$α$ and L-band scale height indicates that the thermal and non-thermal electrons have similar spatial distributions. This further indicates that the thermal gas, the cosmics rays, and the magnetic field may be close to energy equipartition.

preprint2022arXiv

Gas dynamics and star formation in NGC 6822

We present H I gas kinematics and star formation activities of NGC 6822, a dwarf galaxy located in the Local Group at a distance of ~ 490 kpc. We perform profile decomposition of line-of-sight velocity profiles of the H I data cube (42.4" x 12.0" spatial, corresponding to ~ 100 pc; 1.6 km s$^{-1}$ spectral) taken with the Australia Telescope Compact Array (ATCA). For this, we use a new tool, the so-called BAYGAUD which is based on Bayesian analysis techniques, allowing us to decompose a line-of-sight velocity profile into an optimal number of Gaussian components in a quantitative manner. We classify the decomposed H I gas components of NGC 6822 into cool-bulk, warm-bulk, cool-non-bulk and warm-non-bulk motions with respect to their centroid velocities and velocity dispersions. We correlate their gas surface densities with corresponding star formation rate densities derived using both the GALEX far-ultraviolet and WISE 22 $μ$m data to examine the resolved Kennicutt-Schmidt (K-S) law for NGC 6822. Of the decomposed H I gas components, the cool-bulk component is likely to better follow the linear extension of the K-S law for molecular hydrogen (H$_2$) at low gas surface densities where H I is not saturated.

preprint2022arXiv

HI Vertical Structure of Nearby Edge-on Galaxies from CHANG-ES

We study the vertical distribution of the highly inclined galaxies from the Continuum Halos in Nearby Galaxies - an EVLA Survey (CHANG-ES). We explore the feasibility of photometrically deriving the HI disk scale-heights from the moment-0 images of the relatively edge-on galaxies with inclination >80 deg, by quantifying the systematic broadening effects and thus deriving correction equations for direct measurements. The corrected HI disk scale-heights of the relatively edge-on galaxies from the CHANG-ES sample show trends consistent with the quasi-equilibrium model of the vertical structure of gas disks. The procedure provide a convenient way to derive the scale-heights and can easily be applied to statistical samples in the future.

preprint2022arXiv

Photometric Objects around Cosmic Webs (PAC) Delineated in a Spectroscopic Survey. I. Methods

We provide a method for estimating the projected density distribution $\bar{n}_2w_p(r_p)$ of photometric objects around spectroscopic objects in a redshift survey. This quantity describes the distribution of Photometric sources with certain physical properties (e.g. luminosity, mass, color etc) Around Cosmic webs (PAC) traced by the spectroscopic objects. The method can make full use of current and future deep and wide photometric surveys to explore the formation of galaxies up to medium redshift ($z_s < 2$), with the aid of cosmological redshift surveys that sample only a fairly limited species of objects (e.g. Emission Line Galaxies). As an example, we apply the PAC method to the CMASS spectroscopic and HSC-SSP PDR2 photometric samples to explore the distribution of galaxies for a wide range of stellar mass from $10^{9.0}{\rm M_\odot}$ to $10^{12.0}{\rm M_\odot}$ around massive ones at $z_s\approx 0.6$. Using the abundance matching method, we model $\bar{n}_2w_p(r_p)$ in N-body simulation using MCMC sampling, and accurately measure the stellar-halo mass relation (SHMR) and stellar mass function (SMF) for the whole mass range. We can also measure the conditional stellar mass function (CSMF) of satellites for central galaxies of different mass. The PAC method has many potential applications for studying the evolution of galaxies.

preprint2022arXiv

RCL: Recurrent Continuous Localization for Temporal Action Detection

Temporal representation is the cornerstone of modern action detection techniques. State-of-the-art methods mostly rely on a dense anchoring scheme, where anchors are sampled uniformly over the temporal domain with a discretized grid, and then regress the accurate boundaries. In this paper, we revisit this foundational stage and introduce Recurrent Continuous Localization (RCL), which learns a fully continuous anchoring representation. Specifically, the proposed representation builds upon an explicit model conditioned with video embeddings and temporal coordinates, which ensure the capability of detecting segments with arbitrary length. To optimize the continuous representation, we develop an effective scale-invariant sampling strategy and recurrently refine the prediction in subsequent iterations. Our continuous anchoring scheme is fully differentiable, allowing to be seamlessly integrated into existing detectors, e.g., BMN and G-TAD. Extensive experiments on two benchmarks demonstrate that our continuous representation steadily surpasses other discretized counterparts by ~2% mAP. As a result, RCL achieves 52.92% mAP@0.5 on THUMOS14 and 37.65% mAP on ActivtiyNet v1.3, outperforming all existing single-model detectors.

preprint2022arXiv

Strong Conformity and Assembly Bias: Towards a Physical Understanding of the Galaxy-Halo Connection in SDSS Clusters

Understanding the physical connection between cluster galaxies and massive haloes is key to mitigating systematic uncertainties in next-generation cluster cosmology. We develop a novel method to infer the level of conformity between the stellar mass of the brightest central galaxies~(BCGs) $M_*^{BCG}$ and the satellite richness $λ$, defined as their correlation coefficient $ρ_{cc}$ at fixed halo mass, using the abundance and weak lensing of SDSS clusters as functions of $M_*^{BCG}$ and $λ$. We detect a halo mass-dependent conformity as $ρ_{cc}{=}0.60{+}0.08\ln(M_h/3{\times}10^{14}M_{\odot}/h)$. The strong conformity successfully resolves the &#34;halo mass equality&#34; conundrum discovered in Zu et al. 2021 -- when split by $M_*^{BCG}$ at fixed $λ$, the low and high-$M_*^{BCG}$ clusters have the same average halo mass despite having a $0.34$ dex discrepancy in average $M_*^{BCG}$. On top of the best-fitting conformity model, we develop a cluster assembly bias~(AB) prescription calibrated against the CosmicGrowth simulation, and build a conformity+AB model for the cluster weak lensing measurements. Our model predicts that with a ${\sim}20\%$ lower halo concentration $c$, the low-$M_*^{BCG}$ clusters are ${\sim}10\%$ more biased than the high-$M_*^{BCG}$ systems, in excellent agreement with the observations. We also show that the observed conformity and assembly bias are unlikely due to projection effects. Finally, we build a toy model to argue that while the early-time BCG-halo co-evolution drives the $M_*^{BCG}$-$c$ correlation, the late-time dry merger-induced BCG growth naturally produces the $M_*^{BCG}$-$λ$ conformity despite the well-known anti-correlation between $λ$ and $c$. Our method paves the path towards simultaneously constraining cosmology and cluster formation with future cluster surveys.

preprint2021arXiv

Fashion Focus: Multi-modal Retrieval System for Video Commodity Localization in E-commerce

Nowadays, live-stream and short video shopping in E-commerce have grown exponentially. However, the sellers are required to manually match images of the selling products to the timestamp of exhibition in the untrimmed video, resulting in a complicated process. To solve the problem, we present an innovative demonstration of multi-modal retrieval system called &#34;Fashion Focus&#34;, which enables to exactly localize the product images in the online video as the focuses. Different modality contributes to the community localization, including visual content, linguistic features and interaction context are jointly investigated via presented multi-modal learning. Our system employs two procedures for analysis, including video content structuring and multi-modal retrieval, to automatically achieve accurate video-to-shop matching. Fashion Focus presents a unified framework that can orientate the consumers towards relevant product exhibitions during watching videos and help the sellers to effectively deliver the products over search and recommendation.

preprint2021arXiv

Large Scale Long-tailed Product Recognition System at Alibaba

A practical large scale product recognition system suffers from the phenomenon of long-tailed imbalanced training data under the E-commercial circumstance at Alibaba. Besides product images at Alibaba, plenty of image related side information (e.g. title, tags) reveal rich semantic information about images. Prior works mainly focus on addressing the long tail problem in visual perspective only, but lack of consideration of leveraging the side information. In this paper, we present a novel side information based large scale visual recognition co-training~(SICoT) system to deal with the long tail problem by leveraging the image related side information. In the proposed co-training system, we firstly introduce a bilinear word attention module aiming to construct a semantic embedding over the noisy side information. A visual feature and semantic embedding co-training scheme is then designed to transfer knowledge from classes with abundant training data (head classes) to classes with few training data (tail classes) in an end-to-end fashion. Extensive experiments on four challenging large scale datasets, whose numbers of classes range from one thousand to one million, demonstrate the scalable effectiveness of the proposed SICoT system in alleviating the long tail problem. In the visual search platform Pailitao\footnote{http://www.pailitao.com} at Alibaba, we settle a practical large scale product recognition application driven by the proposed SICoT system, and achieve a significant gain of unique visitor~(UV) conversion rate.

preprint2021arXiv

Large-Scale Visual Search with Binary Distributed Graph at Alibaba

Graph-based approximate nearest neighbor search has attracted more and more attentions due to its online search advantages. Numbers of methods studying the enhancement of speed and recall have been put forward. However, few of them focus on the efficiency and scale of offline graph-construction. For a deployed visual search system with several billions of online images in total, building a billion-scale offline graph in hours is essential, which is almost unachievable by most existing methods. In this paper, we propose a novel algorithm called Binary Distributed Graph to solve this problem. Specifically, we combine binary codes with graph structure to speedup online and offline procedures, and achieve comparable performance with the ones in real-value based scenarios by recalling more binary candidates. Furthermore, the graph-construction is optimized to completely distributed implementation, which significantly accelerates the offline process and gets rid of the limitation of memory and disk within a single machine. Experimental comparisons on Alibaba Commodity Data Set (more than three billion images) show that the proposed method outperforms the state-of-the-art with respect to the online/offline trade-off.

preprint2021arXiv

Virtual ID Discovery from E-commerce Media at Alibaba: Exploiting Richness of User Click Behavior for Visual Search Relevance

Visual search plays an essential role for E-commerce. To meet the search demands of users and promote shopping experience at Alibaba, visual search relevance of real-shot images is becoming the bottleneck. Traditional visual search paradigm is usually based upon supervised learning with labeled data. However, large-scale categorical labels are required with expensive human annotations, which limits its applicability and also usually fails in distinguishing the real-shot images. In this paper, we propose to discover Virtual ID from user click behavior to improve visual search relevance at Alibaba. As a totally click-data driven approach, we collect various types of click data for training deep networks without any human annotations at all. In particular, Virtual ID are learned as classification supervision with co-click embedding, which explores image relationship from user co-click behaviors to guide category prediction and feature learning. Concretely, we deploy Virtual ID Category Network by integrating first-clicks and switch-clicks as regularizer. Incorporating triplets and list constraints, Virtual ID Feature Network is trained in a joint classification and ranking manner. Benefiting from exploration of user click data, our networks are more effective to encode richer supervision and better distinguish real-shot images in terms of category and feature. To validate our method for visual search relevance, we conduct an extensive set of offline and online experiments on the collected real-shot images. We consistently achieve better experimental results across all components, compared with alternative and state-of-the-art methods.

preprint2021arXiv

Visual Search at Alibaba

This paper introduces the large scale visual search algorithm and system infrastructure at Alibaba. The following challenges are discussed under the E-commercial circumstance at Alibaba (a) how to handle heterogeneous image data and bridge the gap between real-shot images from user query and the online images. (b) how to deal with large scale indexing for massive updating data. (c) how to train deep models for effective feature representation without huge human annotations. (d) how to improve the user engagement by considering the quality of the content. We take advantage of large image collection of Alibaba and state-of-the-art deep learning techniques to perform visual search at scale. We present solutions and implementation details to overcome those problems and also share our learnings from building such a large scale commercial visual search engine. Specifically, model and search-based fusion approach is introduced to effectively predict categories. Also, we propose a deep CNN model for joint detection and feature learning by mining user click behavior. The binary index engine is designed to scale up indexing without compromising recall and precision. Finally, we apply all the stages into an end-to-end system architecture, which can simultaneously achieve highly efficient and scalable performance adapting to real-shot images. Extensive experiments demonstrate the advancement of each module in our system. We hope visual search at Alibaba becomes more widely incorporated into today&#39;s commercial applications.

preprint2020arXiv

Weakly Supervised Learning with Side Information for Noisy Labeled Images

In many real-world datasets, like WebVision, the performance of DNN based classifier is often limited by the noisy labeled data. To tackle this problem, some image related side information, such as captions and tags, often reveal underlying relationships across images. In this paper, we present an efficient weakly supervised learning by using a Side Information Network (SINet), which aims to effectively carry out a large scale classification with severely noisy labels. The proposed SINet consists of a visual prototype module and a noise weighting module. The visual prototype module is designed to generate a compact representation for each category by introducing the side information. The noise weighting module aims to estimate the correctness of each noisy image and produce a confidence score for image ranking during the training procedure. The propsed SINet can largely alleviate the negative impact of noisy image labels, and is beneficial to train a high performance CNN based classifier. Besides, we released a fine-grained product dataset called AliProducts, which contains more than 2.5 million noisy web images crawled from the internet by using queries generated from 50,000 fine-grained semantic classes. Extensive experiments on several popular benchmarks (i.e. Webvision, ImageNet and Clothing-1M) and our proposed AliProducts achieve state-of-the-art performance. The SINet has won the first place in the classification task on WebVision Challenge 2019, and outperformed other competitors by a large margin.

preprint2019arXiv

The $ΔI$=2 bands in $^{109}$In: possible antimagnetic rotation

The high-spin structure of $^{109}$In was investigated with the $^{100}$Mo($^{14}$N, 5$n$)$^{109}$In fusion-evaporation reaction at CIAE, Beijing. Eleven new $γ$-rays of $^{109}$In were identified, by which the bandheads of the $ΔI$=2 rotational bands were confirmed. The configurations were assigned with the help of the systematic discussion. Furthermore, the rotational bands are compared with the tilted-axis cranking calculations based on a relativistic mean-field approach. The rotational bands involving the $1p1h$ excitation to the $π$$d_{5/2}$ and $π$$g_{7/2}$ orbitals are suggested as candidates for antimagnetic rotation based on the theoretical results.