Researcher profile

Kan Chen

Kan Chen contributes to research discovery and scholarly infrastructure.

Researcher · Affiliation not imported · Open to collaborate

Trust snapshot

Quick read

Trust 21 - Emerging
Verification L1
Unclaimed author
12 works
0 followers
11 topics
4 close collaborators


Published work

12 published items

preprint · 2022 · arXiv

$Σ_cΣ_c$ interactions in chiral effective field theory

We study the interactions of the $Σ_cΣ_c$ system in the framework of chiral effective theory. We consider the contact, one-pion and two-pion exchange interactions and bridge the low energy constants of the $Σ_cΣ_c$ system to those of the $Σ_c^{(*)}\bar{D}^{(*)}$ systems through the quark-level ansatz for the contact interaction. We explore the influence of intermediate channels in the two-pion exchange diagrams of the $Σ_cΣ_c$ system. We obtain a deep bound state $[Σ_cΣ_c]_{J=0}^{I=0}$ and a shallow bound state $[Σ_cΣ_c]_{J=1}^{I=1}$. As a byproduct, we further investigate the interactions of the $Λ_cΛ_c$ and $Λ_cΣ_c$ systems.

preprint · 2022 · arXiv

Covariate-Balancing-Aware Interpretable Deep Learning models for Treatment Effect Estimation

Estimating treatment effects is of great importance for many biomedical applications with observational data. Particularly, interpretability of the treatment effects is preferable for many biomedical researchers. In this paper, we first provide a theoretical analysis and derive an upper bound for the bias of average treatment effect (ATE) estimation under the strong ignorability assumption. Derived by leveraging appealing properties of the Weighted Energy Distance, our upper bound is tighter than what has been reported in the literature. Motivated by the theoretical analysis, we propose a novel objective function for estimating the ATE that uses the energy distance balancing score and hence does not require correct specification of the propensity score model. We also leverage recently developed neural additive models to improve interpretability of deep learning models used for potential outcome prediction. We further enhance our proposed model with an energy distance balancing score weighted regularization. The superiority of our proposed model over current state-of-the-art methods is demonstrated in semi-synthetic experiments using two benchmark datasets, namely, IHDP and ACIC.
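As a rough illustration of the balancing idea in this abstract, the sketch below computes a generic weighted two-sample energy distance between treated and control covariates. It is an assumption-laden sketch, not the paper's objective: the function name `weighted_energy_distance`, the weight arguments, and the V-statistic form used here are all hypothetical choices made for illustration.

```python
import numpy as np


def weighted_energy_distance(x, y, wx=None, wy=None):
    """Weighted energy distance between samples x and y (V-statistic form).

    Computes 2*E|X-Y| - E|X-X'| - E|Y-Y'| with expectations taken under
    the normalized weights wx and wy (uniform weights if omitted).
    """
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    # Treat 1-D input as n samples of a single covariate.
    if x.ndim == 1:
        x = x[:, None]
    if y.ndim == 1:
        y = y[:, None]

    wx = np.full(len(x), 1.0 / len(x)) if wx is None else np.asarray(wx, dtype=float)
    wy = np.full(len(y), 1.0 / len(y)) if wy is None else np.asarray(wy, dtype=float)
    wx = wx / wx.sum()
    wy = wy / wy.sum()

    def pairwise(a, b):
        # Euclidean distance between every row of a and every row of b.
        return np.linalg.norm(a[:, None, :] - b[None, :, :], axis=-1)

    cross = wx @ pairwise(x, y) @ wy
    within_x = wx @ pairwise(x, x) @ wx
    within_y = wy @ pairwise(y, y) @ wy
    return float(2.0 * cross - within_x - within_y)
```

A value near zero suggests the two weighted covariate distributions are well balanced; larger values indicate remaining imbalance. Balancing weights would be chosen to make this quantity small, without fitting a propensity score model.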

preprint · 2022 · arXiv

Cross-Domain Adaptive Teacher for Object Detection

We address the task of domain adaptation in object detection, where there is a domain gap between a domain with annotations (source) and a domain of interest without annotations (target). As an effective semi-supervised learning method, the teacher-student framework (a student model is supervised by the pseudo labels from a teacher model) has also yielded a large accuracy gain in cross-domain object detection. However, it suffers from the domain shift and generates many low-quality pseudo labels (e.g., false positives), which leads to sub-optimal performance. To mitigate this problem, we propose a teacher-student framework named Adaptive Teacher (AT) which leverages domain adversarial learning and weak-strong data augmentation to address the domain gap. Specifically, we employ feature-level adversarial training in the student model, allowing features derived from the source and target domains to share similar distributions. This process ensures the student model produces domain-invariant features. Furthermore, we apply weak-strong augmentation and mutual learning between the teacher model (taking data from the target domain) and the student model (taking data from both domains). This enables the teacher model to learn the knowledge from the student model without being biased to the source domain. We show that AT demonstrates superiority over existing approaches and even Oracle (fully-supervised) models by a large margin. For example, we achieve 50.9% (49.3%) mAP on Foggy Cityscape (Clipart1K), which is 9.2% (5.2%) and 8.2% (11.0%) higher than previous state-of-the-art and Oracle, respectively.

preprint · 2022 · arXiv

Manifestly exotic pentaquarks with a single heavy quark

Inspired by the observed $X(2900)$, we systematically study the mass spectra of the ground pentaquark states with the $qqqq\bar{Q}$ ($Q=c,b$; $q=n,s$; $n=u,d$) configuration in the framework of the Chromomagnetic Interaction model. We present a detailed analysis of their stabilities and decay behaviors. Our results indicate that there may exist narrow states or even stable states. We hope that the present study may inspire experimentalists' interest in searching for this type of exotic pentaquark state.

preprint · 2022 · arXiv

Testing Biased Randomization Assumptions and Quantifying Imperfect Matching and Residual Confounding in Matched Observational Studies

One central goal of design of observational studies is to embed non-experimental data into an approximate randomized controlled trial using statistical matching. Despite empirical researchers' best intention and effort to create high-quality matched samples, residual imbalance due to observed covariates not being well matched often persists. Although statistical tests have been developed to test the randomization assumption and its implications, few provide a means to quantify the level of residual confounding due to observed covariates not being well matched in matched samples. In this article, we develop two generic classes of exact statistical tests for a biased randomization assumption. One important by-product of our testing framework is a quantity called residual sensitivity value (RSV), which provides a means to quantify the level of residual confounding due to imperfect matching of observed covariates in a matched sample. We advocate taking into account RSV in the downstream primary analysis. The proposed methodology is illustrated by re-examining a famous observational study concerning the effect of right heart catheterization (RHC) in the initial care of critically ill patients. Code implementing the method can be found in the supplementary materials.

preprint · 2021 · arXiv

Heavy flavor molecular states with strangeness

We proposed a unified framework to describe the interactions of the observed $T_{cc}$, $P_c$, and $P_{cs}$ within a quark level interaction in our previous work. In this work, we generalize our framework to the loosely bound hadronic molecules composed of heavy flavor di-hadrons with strangeness. We predict the possible $D^{(*)}D^{(*)}_s$ molecular states in the SU(3) limit with the masses of the $P_c$ states as the inputs. We also investigate the baryon-meson and baryon-baryon systems and consider the SU(3) breaking effect in their flavor wave functions. We generalize our isospin criterion of the formation of heavy flavor di-hadron molecules to the $U/V$ spin case. For a specific heavy flavor meson-meson, baryon-meson, or baryon-baryon system, the interactions for the states with the same flavor and spin matrix elements can be related by a generalized flavor-spin symmetry.

preprint · 2021 · arXiv

Systematics of the heavy flavor hadronic molecules

With a quark level interaction, we give a unified description of the loosely bound molecular systems composed of the heavy flavor hadrons $(\bar{D},\bar{D}^*)$, $(Λ_c, Σ_c, Σ_c^*)$, and $(Ξ_c, Ξ_c^\prime,Ξ_c^*)$. Using the $P_c$ states as inputs to fix the interaction strength of light quark-quark pairs, we reproduce the observed $P_{cs}$ and $T_{cc}^+$ states and predict another narrow $T_{cc}^{\prime+}$ state with quantum numbers $[D^*D^*]_{J=1}^{I=0}$. If we require a satisfactory description of the $T_{cc}^+$ and $P_c$ states simultaneously, our framework prefers the assignments of the $P_{c}(4440)$ and $P_{c}(4457)$ as the $[Σ_c\bar{D}^*]_{J=1/2}^{I=1/2}$ and $[Σ_c\bar{D}^*]_{J=3/2}^{I=1/2}$ states, respectively. We propose the isospin criterion to explain naturally why the experimentally observed $T_{cc}$, $P_c$, and $P_{cs}$ molecular candidates prefer the lowest isospin numbers. We also predict the loosely bound states for the bottom di-hadrons.

preprint · 2021 · arXiv

Unbiased Teacher for Semi-Supervised Object Detection

Semi-supervised learning, i.e., training networks with both labeled and unlabeled data, has made significant progress recently. However, existing works have primarily focused on image classification tasks and neglected object detection, which requires more annotation effort. In this work, we revisit Semi-Supervised Object Detection (SS-OD) and identify the pseudo-labeling bias issue in SS-OD. To address this, we introduce Unbiased Teacher, a simple yet effective approach that jointly trains a student and a gradually progressing teacher in a mutually-beneficial manner. Together with a class-balance loss to downweight overly confident pseudo-labels, Unbiased Teacher consistently improved state-of-the-art methods by significant margins on COCO-standard, COCO-additional, and VOC datasets. Specifically, Unbiased Teacher achieves a 6.8 absolute mAP improvement over the state-of-the-art method when using 1% of labeled data on MS-COCO, and achieves around 10 mAP improvement over the supervised baseline when using only 0.5%, 1%, or 2% of labeled data on MS-COCO.

preprint · 2020 · arXiv

CPARR: Category-based Proposal Analysis for Referring Relationships

The task of referring relationships is to localize subject and object entities in an image satisfying a relationship query, which is given in the form of <subject, predicate, object>. This requires simultaneous localization of the subject and object entities in a specified relationship. We introduce a simple yet effective proposal-based method for referring relationships. Different from the existing methods such as SSAS, our method can generate a high-resolution result while reducing its complexity and ambiguity. Our method is composed of two modules: a category-based proposal generation module to select the proposals related to the entities and a predicate analysis module to score the compatibility of pairs of selected proposals. We show state-of-the-art performance on the referring relationship task on two public datasets: Visual Relationship Detection and Visual Genome.

preprint · 2020 · arXiv

FBNetV2: Differentiable Neural Architecture Search for Spatial and Channel Dimensions

Differentiable Neural Architecture Search (DNAS) has demonstrated great success in designing state-of-the-art, efficient neural networks. However, DARTS-based DNAS's search space is small when compared to other search methods', since all candidate network layers must be explicitly instantiated in memory. To address this bottleneck, we propose a memory and computationally efficient DNAS variant: DMaskingNAS. This algorithm expands the search space by up to $10^{14}\times$ over conventional DNAS, supporting searches over spatial and channel dimensions that are otherwise prohibitively expensive: input resolution and number of filters. We propose a masking mechanism for feature map reuse, so that memory and computational costs stay nearly constant as the search space expands. Furthermore, we employ effective shape propagation to maximize per-FLOP or per-parameter accuracy. The searched FBNetV2s yield state-of-the-art performance when compared with all previous architectures. With up to 421$\times$ less search cost, DMaskingNAS finds models with 0.9% higher accuracy and 15% fewer FLOPs than MobileNetV3-Small, and with similar accuracy but 20% fewer FLOPs than EfficientNet-B0. Furthermore, our FBNetV2 outperforms MobileNetV3 by 2.6% in accuracy, with equivalent model size. FBNetV2 models are open-sourced at https://github.com/facebookresearch/mobile-vision.

preprint · 2020 · arXiv

How efficient is the streaming instability in viscous protoplanetary disks?

The streaming instability is a popular candidate for planetesimal formation by concentrating dust particles to trigger gravitational collapse. However, its robustness against physical conditions expected in protoplanetary disks is unclear. In particular, particle stirring by turbulence may impede the instability. To quantify this effect, we develop the linear theory of the streaming instability with external turbulence modelled by gas viscosity and particle diffusion. We find the streaming instability is sensitive to turbulence, with growth rates becoming negligible for alpha-viscosity parameters $α\gtrsim \mathrm{St}^{1.5}$, where $\mathrm{St}$ is the particle Stokes number. We explore the effect of non-linear drag laws, which may be applicable to porous dust particles, and find growth rates are modestly reduced. We also find that gas compressibility increases growth rates by reducing the effect of diffusion. We then apply linear theory to global models of viscous protoplanetary disks. For minimum-mass Solar nebula disk models, we find the streaming instability only grows within disk lifetimes beyond $\sim 10$s of AU, even for cm-sized particles and weak turbulence ($α\sim 10^{-4}$). Our results suggest it is rather difficult to trigger the streaming instability in non-laminar protoplanetary disks, especially for small particles.
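The scaling quoted in this abstract, $α\gtrsim \mathrm{St}^{1.5}$, can be turned into a quick order-of-magnitude check. The sketch below is only an illustration of that inequality: the function names are hypothetical, the relation is treated as a sharp threshold with unit prefactor (an assumption, since the abstract gives only an approximate scaling), and it says nothing about the disk-lifetime constraint also discussed there.

```python
def critical_stokes_number(alpha: float) -> float:
    """Stokes number at which alpha equals St**1.5, i.e. St = alpha**(2/3).

    Assumes a unit prefactor in the approximate scaling alpha >~ St^1.5.
    """
    if alpha <= 0.0:
        raise ValueError("alpha must be positive")
    return alpha ** (2.0 / 3.0)


def turbulence_suppresses_growth(alpha: float, stokes: float) -> bool:
    """True if alpha >= St**1.5, the regime where growth is expected to be negligible."""
    if alpha <= 0.0 or stokes <= 0.0:
        raise ValueError("alpha and stokes must be positive")
    return alpha >= stokes ** 1.5
```

For the weak-turbulence value $α\sim 10^{-4}$ mentioned in the abstract, `critical_stokes_number(1e-4)` is roughly $2\times 10^{-3}$, so under these assumptions particles with smaller Stokes numbers fall in the suppressed regime.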

preprint · 2020 · arXiv

Video Object Grounding using Semantic Roles in Language Description

We explore the task of Video Object Grounding (VOG), which grounds objects in videos referred to in natural language descriptions. Previous methods apply image-grounding-based algorithms to address VOG, fail to explore the object relation information, and suffer from limited generalization. Here, we investigate the role of object relations in VOG and propose a novel framework, VOGNet, to encode multi-modal object relations via self-attention with relative position encoding. To evaluate VOGNet, we propose novel contrasting sampling methods to generate more challenging grounding input samples, and construct a new dataset called ActivityNet-SRL (ASRL) based on existing caption and grounding datasets. Experiments on ASRL validate the need for encoding object relations in VOG, and our VOGNet outperforms competitive baselines by a significant margin.