Source author record

Kan Chen

Kan Chen appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Computer Vision hep-ph hep-ex Computation and Language Machine Learning Artificial Intelligence cond-mat hep-lat nucl-th astro-ph.EP chao-dyn cond-mat.stat-mech Methodology Neural and Evolutionary Computing nlin.CD nucl-ex physics.comp-ph physics.geo-ph

Catalog footprint

What is connected

20works

18topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2022arXiv

$Σ_cΣ_c$ interactions in chiral effective field theory

We study the interactions of the $Σ_cΣ_c$ system in the framework of chiral effective theory. We consider the contact, one-pion and two-pion exchange interactions and bridge the low energy constants of the $Σ_cΣ_c$ system to those of the $Σ_c^{(*)}\bar{D}^{(*)}$ systems through the quark-level ansatz for the contact interaction. We explore the influence of intermediate channels in the two-pion exchange diagrams of the $Σ_cΣ_c$ system. We obtain a deep bound state $[Σ_cΣ_c]_{J=0}^{I=0}$ and a shallow bound state $[Σ_cΣ_c]_{J=1}^{I=1}$. As a byproduct, we further investigate the interactions of the $Λ_cΛ_c$ and $Λ_cΣ_c$ systems.

preprint2022arXiv

Covariate-Balancing-Aware Interpretable Deep Learning models for Treatment Effect Estimation

Estimating treatment effects is of great importance for many biomedical applications with observational data. Particularly, interpretability of the treatment effects is preferable for many biomedical researchers. In this paper, we first provide a theoretical analysis and derive an upper bound for the bias of average treatment effect (ATE) estimation under the strong ignorability assumption. Derived by leveraging appealing properties of the Weighted Energy Distance, our upper bound is tighter than what has been reported in the literature. Motivated by the theoretical analysis, we propose a novel objective function for estimating the ATE that uses the energy distance balancing score and hence does not require correct specification of the propensity score model. We also leverage recently developed neural additive models to improve interpretability of deep learning models used for potential outcome prediction. We further enhance our proposed model with an energy distance balancing score weighted regularization. The superiority of our proposed model over current state-of-the-art methods is demonstrated in semi-synthetic experiments using two benchmark datasets, namely, IHDP and ACIC.

preprint2022arXiv

Cross-Domain Adaptive Teacher for Object Detection

We address the task of domain adaptation in object detection, where there is a domain gap between a domain with annotations (source) and a domain of interest without annotations (target). As an effective semi-supervised learning method, the teacher-student framework (a student model is supervised by the pseudo labels from a teacher model) has also yielded a large accuracy gain in cross-domain object detection. However, it suffers from the domain shift and generates many low-quality pseudo labels (\textit{e.g.,} false positives), which leads to sub-optimal performance. To mitigate this problem, we propose a teacher-student framework named Adaptive Teacher (AT) which leverages domain adversarial learning and weak-strong data augmentation to address the domain gap. Specifically, we employ feature-level adversarial training in the student model, allowing features derived from the source and target domains to share similar distributions. This process ensures the student model produces domain-invariant features. Furthermore, we apply weak-strong augmentation and mutual learning between the teacher model (taking data from the target domain) and the student model (taking data from both domains). This enables the teacher model to learn the knowledge from the student model without being biased to the source domain. We show that AT demonstrates superiority over existing approaches and even Oracle (fully-supervised) models by a large margin. For example, we achieve 50.9% (49.3%) mAP on Foggy Cityscape (Clipart1K), which is 9.2% (5.2%) and 8.2% (11.0%) higher than previous state-of-the-art and Oracle, respectively.

preprint2022arXiv

Manifestly exotic pentaquarks with a single heavy quark

Inspired by the observed $X(2900)$, we study systematically the mass spectra of the ground pentaquark states with the $qqqq\bar{Q}$ ($Q=c,b$; $q=n,s$; $n=u,d$) configuration in the framework of the Chromomagnetic Interaction model. We present a detailed analysis of their stabilities and decay behaviors. Our results indicate that there may exist narrow states or even stable states. We hope that the present study may inspire experimentalist's interest in searching for such a type of the exotic pentaquark state.

preprint2022arXiv

Testing Biased Randomization Assumptions and Quantifying Imperfect Matching and Residual Confounding in Matched Observational Studies

One central goal of design of observational studies is to embed non-experimental data into an approximate randomized controlled trial using statistical matching. Despite empirical researchers' best intention and effort to create high-quality matched samples, residual imbalance due to observed covariates not being well matched often persists. Although statistical tests have been developed to test the randomization assumption and its implications, few provide a means to quantify the level of residual confounding due to observed covariates not being well matched in matched samples. In this article, we develop two generic classes of exact statistical tests for a biased randomization assumption. One important by-product of our testing framework is a quantity called residual sensitivity value (RSV), which provides a means to quantify the level of residual confounding due to imperfect matching of observed covariates in a matched sample. We advocate taking into account RSV in the downstream primary analysis. The proposed methodology is illustrated by re-examining a famous observational study concerning the effect of right heart catheterization (RHC) in the initial care of critically ill patients. Code implementing the method can be found in the supplementary materials.

preprint2021arXiv

Heavy flavor molecular states with strangeness

We proposed a unified framework to describe the interactions of the observed $T_{cc}$, $P_c$, and $P_{cs}$ within a quark level interaction in our previous work. In this work, we generalize our framework to the loosely bound hadronic molecules composed of heavy flavor di-hadrons with strangeness. We predict the possible $D^{(*)}D^{(*)}_s$ molecular states in the SU(3) limit with the masses of the $P_c$ states as the inputs. We also investigate the baryon-meson and baryon-baryon systems and consider the SU(3) breaking effect in their flavor wave functions. We generalize our isospin criterion of the formation of heavy flavor di-hadron molecules to the $U/V$ spin case. For a specific heavy flavor meson-meson, baryon-meson, or baryon-baryon system, the interactions for the states with the same flavor and spin matrix elements can be related by a generalized flavor-spin symmetry.

preprint2021arXiv

Systematics of the heavy flavor hadronic molecules

With a quark level interaction, we give a unified description of the loosely bound molecular systems composed of the heavy flavor hadrons $(\bar{D},\bar{D}^*)$, $(Λ_c, Σ_c, Σ_c^*)$, and $(Ξ_c, Ξ_c^\prime,Ξ_c^*)$. Using the $P_c$ states as inputs to fix the interaction strength of light quark-quark pairs, we reproduce the observed $P_{cs}$ and $T_{cc}^+$ states and predict another narrow $T_{cc}^{\prime+}$ state with quantum numbers $[D^*D^*]_{J=1}^{I=0}$. If we require a satisfactory description of the $T_{cc}^+$ and $P_c$ states simultaneously, our framework prefers the assignments of the $P_{c}(4440)$ and $P_{c}(4457)$ as the $[Σ_c\bar{D}^*]_{J=1/2}^{I=1/2}$ and $[Σ_c\bar{D}^*]_{J=3/2}^{I=1/2}$ states, respectively. We propose the isospin criterion to explain naturally why the experimentally observed $T_{cc}$, $P_c$, and $P_{cs}$ molecular candidates prefer the lowest isospin numbers. We also predict the loosely bound states for the bottom di-hadrons.

preprint2021arXiv

Unbiased Teacher for Semi-Supervised Object Detection

Semi-supervised learning, i.e., training networks with both labeled and unlabeled data, has made significant progress recently. However, existing works have primarily focused on image classification tasks and neglected object detection which requires more annotation effort. In this work, we revisit the Semi-Supervised Object Detection (SS-OD) and identify the pseudo-labeling bias issue in SS-OD. To address this, we introduce Unbiased Teacher, a simple yet effective approach that jointly trains a student and a gradually progressing teacher in a mutually-beneficial manner. Together with a class-balance loss to downweight overly confident pseudo-labels, Unbiased Teacher consistently improved state-of-the-art methods by significant margins on COCO-standard, COCO-additional, and VOC datasets. Specifically, Unbiased Teacher achieves 6.8 absolute mAP improvements against state-of-the-art method when using 1% of labeled data on MS-COCO, achieves around 10 mAP improvements against the supervised baseline when using only 0.5, 1, 2% of labeled data on MS-COCO.

preprint2020arXiv

CPARR: Category-based Proposal Analysis for Referring Relationships

The task of referring relationships is to localize subject and object entities in an image satisfying a relationship query, which is given in the form of \texttt{<subject, predicate, object>}. This requires simultaneous localization of the subject and object entities in a specified relationship. We introduce a simple yet effective proposal-based method for referring relationships. Different from the existing methods such as SSAS, our method can generate a high-resolution result while reducing its complexity and ambiguity. Our method is composed of two modules: a category-based proposal generation module to select the proposals related to the entities and a predicate analysis module to score the compatibility of pairs of selected proposals. We show state-of-the-art performance on the referring relationship task on two public datasets: Visual Relationship Detection and Visual Genome.

preprint2020arXiv

FBNetV2: Differentiable Neural Architecture Search for Spatial and Channel Dimensions

Differentiable Neural Architecture Search (DNAS) has demonstrated great success in designing state-of-the-art, efficient neural networks. However, DARTS-based DNAS's search space is small when compared to other search methods', since all candidate network layers must be explicitly instantiated in memory. To address this bottleneck, we propose a memory and computationally efficient DNAS variant: DMaskingNAS. This algorithm expands the search space by up to $10^{14}\times$ over conventional DNAS, supporting searches over spatial and channel dimensions that are otherwise prohibitively expensive: input resolution and number of filters. We propose a masking mechanism for feature map reuse, so that memory and computational costs stay nearly constant as the search space expands. Furthermore, we employ effective shape propagation to maximize per-FLOP or per-parameter accuracy. The searched FBNetV2s yield state-of-the-art performance when compared with all previous architectures. With up to 421$\times$ less search cost, DMaskingNAS finds models with 0.9% higher accuracy, 15% fewer FLOPs than MobileNetV3-Small; and with similar accuracy but 20% fewer FLOPs than Efficient-B0. Furthermore, our FBNetV2 outperforms MobileNetV3 by 2.6% in accuracy, with equivalent model size. FBNetV2 models are open-sourced at https://github.com/facebookresearch/mobile-vision.

preprint2020arXiv

How efficient is the streaming instability in viscous protoplanetary disks?

The streaming instability is a popular candidate for planetesimal formation by concentrating dust particles to trigger gravitational collapse. However, its robustness against physical conditions expected in protoplanetary disks is unclear. In particular, particle stirring by turbulence may impede the instability. To quantify this effect, we develop the linear theory of the streaming instability with external turbulence modelled by gas viscosity and particle diffusion. We find the streaming instability is sensitive to turbulence, with growth rates becoming negligible for alpha-viscosity parameters $α\gtrsim \mathrm{St} ^{1.5}$, where $\mathrm{St}$ is the particle Stokes number. We explore the effect of non-linear drag laws, which may be applicable to porous dust particles, and find growth rates are modestly reduced. We also find that gas compressibility increase growth rates by reducing the effect of diffusion. We then apply linear theory to global models of viscous protoplanetary disks. For minimum-mass Solar nebula disk models, we find the streaming instability only grows within disk lifetimes beyond $\sim 10$s of AU, even for cm-sized particles and weak turbulence ($α\sim 10^{-4}$). Our results suggest it is rather difficult to trigger the streaming instability in non-laminar protoplanetary disks, especially for small particles.

preprint2020arXiv

Video Object Grounding using Semantic Roles in Language Description

We explore the task of Video Object Grounding (VOG), which grounds objects in videos referred to in natural language descriptions. Previous methods apply image grounding based algorithms to address VOG, fail to explore the object relation information and suffer from limited generalization. Here, we investigate the role of object relations in VOG and propose a novel framework VOGNet to encode multi-modal object relations via self-attention with relative position encoding. To evaluate VOGNet, we propose novel contrasting sampling methods to generate more challenging grounding input samples, and construct a new dataset called ActivityNet-SRL (ASRL) based on existing caption and grounding datasets. Experiments on ASRL validate the need of encoding object relations in VOG, and our VOGNet outperforms competitive baselines by a significant margin.

preprint2016arXiv

$X(4140)$, $X(4270)$, $X(4500)$ and $X(4700)$ and their $cs\bar{c}\bar{s}$ tetraquark partners

In the simple color-magnetic interaction model, we investigate possible ground $cs\bar{c}\bar{s}$ tetraquark states in the diquark-antidiquark basis. We use several methods to estimate the mass spectrum and discuss possible assignment for the $X$ states observed in the $J/ψϕ$ channel. We find that assigning the Belle $X(4350)$ as a $0^{++}$ tetraquark is consistent with the tetraquark interpretation for the $X(4140)$ and $X(4270)$ while the interpretation of the $X(4500)$ and $X(4700)$ needs orbital or radial excitation. There probably exist several tetraquarks around 4.3 GeV that decay into $J/ψϕ$ or $η_cϕ$.

preprint2016arXiv

ABC-CNN: An Attention Based Convolutional Neural Network for Visual Question Answering

We propose a novel attention based deep learning architecture for visual question answering task (VQA). Given an image and an image related natural language question, VQA generates the natural language answer for the question. Generating the correct answers requires the model's attention to focus on the regions corresponding to the question, because different questions inquire about the attributes of different image regions. We introduce an attention based configurable convolutional neural network (ABC-CNN) to learn such question-guided attention. ABC-CNN determines an attention map for an image-question pair by convolving the image feature map with configurable convolutional kernels derived from the question's semantics. We evaluate the ABC-CNN architecture on three benchmark VQA datasets: Toronto COCO-QA, DAQUAR, and VQA dataset. ABC-CNN model achieves significant improvements over state-of-the-art methods on these datasets. The question-guided attention generated by ABC-CNN is also shown to reflect the regions that are highly relevant to the questions.

preprint2016arXiv

Knowledge Graph Representation with Jointly Structural and Textual Encoding

The objective of knowledge graph embedding is to encode both entities and relations of knowledge graphs into continuous low-dimensional vector spaces. Previously, most works focused on symbolic representation of knowledge graph with structure information, which can not handle new entities or entities with few facts well. In this paper, we propose a novel deep architecture to utilize both structural and textual information of entities. Specifically, we introduce three neural models to encode the valuable information from text description of entity, among which an attentive model can select related information as needed. Then, a gating mechanism is applied to integrate representations of structure and text into a unified architecture. Experiments show that our models outperform baseline by margin on link prediction and triplet classification tasks. Source codes of this paper will be available on Github.

preprint2016arXiv

Learning Word Embeddings from Intrinsic and Extrinsic Views

While word embeddings are currently predominant for natural language processing, most of existing models learn them solely from their contexts. However, these context-based word embeddings are limited since not all words' meaning can be learned based on only context. Moreover, it is also difficult to learn the representation of the rare words due to data sparsity problem. In this work, we address these issues by learning the representations of words by integrating their intrinsic (descriptive) and extrinsic (contextual) information. To prove the effectiveness of our model, we evaluate it on four tasks, including word similarity, reverse dictionaries,Wiki link prediction, and document classification. Experiment results show that our model is powerful in both word and document modeling.

preprint2015arXiv

Light axial vector mesons

Inspired by the abundant experimental observation of axial-vector states, we study whether the observed axial-vector states can be categorized into the conventional axial-vector meson family. In this paper we carry out an analysis based on the mass spectra and two-body Okubo-Zweig-Iizuka-allowed decays. Besides testing the possible axial-vector meson assignments, we also predict abundant information for their decays and the properties of some missing axial-vector mesons, which are valuable for further experimental exploration of the observed and predicted axial-vector mesons.

preprint2005arXiv

Growing Directed Networks: Organization and Dynamics

We study the organization and dynamics of growing directed networks. These networks are built by adding nodes successively in such a way that each new node has $K$ directed links to the existing ones. The organization of a growing directed network is analyzed in terms of the number of ``descendants'' of each node in the network. We show that the distribution $P(S)$ of the size, $S$, of the descendant cluster is described generically by a power-law, $P(S) \sim S^{-η}$, where the exponent $η$ depends on the value of $K$ as well as the strength of preferential attachment. We determine that, in the case of growing random directed networks without any preferential attachment, $η$ is given by $1+1/K$. We also show that the Boolean dynamics of these networks is stable for any value of $K$. However, with a small fraction of reversal in the direction of the links, the dynamics of growing directed networks appears to operate on ``the edge of chaos'' with a power-law distribution of the cycle lengths. We suggest that the growing directed network may serve as another paradigm for the emergence of the scale-free features in network organization and dynamics.

preprint2003arXiv

Theory of Phase Transition in the Evolutionary Minority Game

We discover the mechanism for the transition from self-segregation (into opposing groups) to clustering (towards cautious behaviors) in the evolutionary minority game (EMG). The mechanism is illustrated with a statistical mechanics analysis of a simplified EMG involving three groups of agents: two groups of opposing agents and one group of cautious agents. Two key factors affect the population distribution of the agents. One is the market impact (the self-interaction), which has been identified previously. The other is the market inefficiency due to the short-time imbalance in the number of agents using opposite strategies. Large market impact favors "extreme" players who choose fixed strategies, while large market inefficiency favors cautious players. The phase transition depends on the number of agents ($N$), the reward-to-fine ratio ($R$), as well as the wealth reduction threshold ($d$) for switching strategy. When the rate for switching strategy is large, there is strong clustering of cautious agents. On the other hand, when $N$ is small, the market impact becomes large, and the extreme behavior is favored.

preprint1998arXiv

Dynamics of Dry Friction: A Numerical Investigation

We perform extended numerical simulation of the dynamics of dry friction, based on a model derived from the phenomenological description proposed by T. Baumberger et al.. In the case of small deviation from the steady sliding motion, the model is shown to be equivalent to the state- and rate-dependent friction law which was first introduced by Rice and Ruina on the basis of experiments on rocks. We obtain the dynamical phase diagram that agrees well with the experimental results on the paper-on-paper systems. In particular, the bifurcation between stick-slip and steady sliding are shown to change from a direct (supercritical) Hopf type to an inverted (subcritical) one as the driving velocity increases, in agreement with the experiments.

Kan Chen

What is connected

Connect this record

See the researcher in context

Building this map preview

20 published item(s)

$Σ_cΣ_c$ interactions in chiral effective field theory

Covariate-Balancing-Aware Interpretable Deep Learning models for Treatment Effect Estimation

Cross-Domain Adaptive Teacher for Object Detection

Manifestly exotic pentaquarks with a single heavy quark

Testing Biased Randomization Assumptions and Quantifying Imperfect Matching and Residual Confounding in Matched Observational Studies

Heavy flavor molecular states with strangeness

Systematics of the heavy flavor hadronic molecules

Unbiased Teacher for Semi-Supervised Object Detection

CPARR: Category-based Proposal Analysis for Referring Relationships

FBNetV2: Differentiable Neural Architecture Search for Spatial and Channel Dimensions

How efficient is the streaming instability in viscous protoplanetary disks?

Video Object Grounding using Semantic Roles in Language Description

$X(4140)$, $X(4270)$, $X(4500)$ and $X(4700)$ and their $cs\bar{c}\bar{s}$ tetraquark partners

ABC-CNN: An Attention Based Convolutional Neural Network for Visual Question Answering

Knowledge Graph Representation with Jointly Structural and Textual Encoding

Learning Word Embeddings from Intrinsic and Extrinsic Views

Light axial vector mesons

Growing Directed Networks: Organization and Dynamics

Theory of Phase Transition in the Evolutionary Minority Game

Dynamics of Dry Friction: A Numerical Investigation