Source author record

Bo Zhang

Bo Zhang appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Catalog footprint

What is connected

199works

57topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

AEGIS: A Holistic Benchmark for Evaluating Forensic Analysis of AI-Generated Academic Images

We introduce AEGIS, A holistic benchmark for Evaluating forensic analysis of AI-Generated academic ImageS. Compared to existing benchmarks, AEGIS features three key advances: (1) Domain-Specific Complexity: covering seven academic categories with 39 fine-grained subtypes, exposing intrinsic forensic difficulty, where even GPT-5.1 reaches 48.80% overall performance and expert models achieve only limited localization accuracy (IoU 30.09%); (2) Diverse Forgery Simulations: modeling four prevalent academic forgery strategies across 25 generative models, with 11 yielding average forensic accuracy below 50%, showing that forensics lag behind generative advances; and (3) Multi-Dimensional Forensic Evaluation: jointly assessing detection, reasoning, and localization, revealing complementary strengths between model families, with multimodal large language models (MLLMs) at 84.74% accuracy in textual artifact recognition and expert detectors peaking at 79.54% accuracy in binary authenticity detection. By evaluating 25 leading MLLMs, nine expert models, and one unified multimodal understanding and generation model, AEGIS serves as a diagnostic testbed exposing fundamental limitations in academic image forensics.

preprint2025arXiv

GeoBench: Rethinking Multimodal Geometric Problem-Solving via Hierarchical Evaluation

Geometric problem solving constitutes a critical branch of mathematical reasoning, requiring precise analysis of shapes and spatial relationships. Current evaluations of geometric reasoning in vision-language models (VLMs) face limitations, including the risk of test data contamination from textbook-based benchmarks, overemphasis on final answers over reasoning processes, and insufficient diagnostic granularity. To address these issues, we present GeoBench, a hierarchical benchmark featuring four reasoning levels in geometric problem-solving: Visual Perception, Goal-Oriented Planning, Rigorous Theorem Application, and Self-Reflective Backtracking. Through six formally verified tasks generated via TrustGeoGen, we systematically assess capabilities ranging from attribute extraction to logical error correction. Experiments reveal that while reasoning models like OpenAI-o3 outperform general MLLMs, performance declines significantly with increasing task complexity. Key findings demonstrate that sub-goal decomposition and irrelevant premise filtering critically influence final problem-solving accuracy, whereas Chain-of-Thought prompting unexpectedly degrades performance in some tasks. These findings establish GeoBench as a comprehensive benchmark while offering actionable guidelines for developing geometric problem-solving systems.

preprint2025arXiv

SCP: Accelerating Discovery with a Global Web of Autonomous Scientific Agents

We introduce SCP: the Science Context Protocol, an open-source standard designed to accelerate discovery by enabling a global network of autonomous scientific agents. SCP is built on two foundational pillars: (1) Unified Resource Integration: At its core, SCP provides a universal specification for describing and invoking scientific resources, spanning software tools, models, datasets, and physical instruments. This protocol-level standardization enables AI agents and applications to discover, call, and compose capabilities seamlessly across disparate platforms and institutional boundaries. (2) Orchestrated Experiment Lifecycle Management: SCP complements the protocol with a secure service architecture, which comprises a centralized SCP Hub and federated SCP Servers. This architecture manages the complete experiment lifecycle (registration, planning, execution, monitoring, and archival), enforces fine-grained authentication and authorization, and orchestrates traceable, end-to-end workflows that bridge computational and physical laboratories. Based on SCP, we have constructed a scientific discovery platform that offers researchers and agents a large-scale ecosystem of more than 1,600 tool resources. Across diverse use cases, SCP facilitates secure, large-scale collaboration between heterogeneous AI systems and human researchers while significantly reducing integration overhead and enhancing reproducibility. By standardizing scientific context and tool orchestration at the protocol level, SCP establishes essential infrastructure for scalable, multi-institution, agent-driven science.

preprint2024arXiv

An Event-Oriented Diffusion-Refinement Method for Sparse Events Completion

Event cameras or dynamic vision sensors (DVS) record asynchronous response to brightness changes instead of conventional intensity frames, and feature ultra-high sensitivity at low bandwidth. The new mechanism demonstrates great advantages in challenging scenarios with fast motion and large dynamic range. However, the recorded events might be highly sparse due to either limited hardware bandwidth or extreme photon starvation in harsh environments. To unlock the full potential of event cameras, we propose an inventive event sequence completion approach conforming to the unique characteristics of event data in both the processing stage and the output form. Specifically, we treat event streams as 3D event clouds in the spatiotemporal domain, develop a diffusion-based generative model to generate dense clouds in a coarse-to-fine manner, and recover exact timestamps to maintain the temporal resolution of raw data successfully. To validate the effectiveness of our method comprehensively, we perform extensive experiments on three widely used public datasets with different spatial resolutions, and additionally collect a novel event dataset covering diverse scenarios with highly dynamic motions and under harsh illumination. Besides generating high-quality dense events, our method can benefit downstream applications such as object classification and intensity frame reconstruction.

preprint2024arXiv

Outer-space branch-and-bound algorithm for generalized linear multiplicative programs

This paper introduces a new global optimization algorithm for solving the generalized linear multiplicative problem (GLMP). The algorithm starts by introducing $\bar{p}$ new variables and applying a logarithmic transformation to convert the problem into an equivalent problem (EP). By using the strong duality of linear program, a new convex relaxation subproblem is formulated to obtain the lower bounds for the optimal value of EP. This relaxation subproblem, combined with a simplicial branching process, forms the foundation of a simplicial branch-and-bound algorithm that can globally solve the problem. The paper also includes an analysis of the theoretical convergence and computational complexity of the algorithm. Additionally, numerical experiments are conducted to demonstrate the effectiveness of the proposed algorithm in various test instances.

preprint2023arXiv

AI of Brain and Cognitive Sciences: From the Perspective of First Principles

Nowadays, we have witnessed the great success of AI in various applications, including image classification, game playing, protein structure analysis, language translation, and content generation. Despite these powerful applications, there are still many tasks in our daily life that are rather simple to humans but pose great challenges to AI. These include image and language understanding, few-shot learning, abstract concepts, and low-energy cost computing. Thus, learning from the brain is still a promising way that can shed light on the development of next-generation AI. The brain is arguably the only known intelligent machine in the universe, which is the product of evolution for animals surviving in the natural environment. At the behavior level, psychology and cognitive sciences have demonstrated that human and animal brains can execute very intelligent high-level cognitive functions. At the structure level, cognitive and computational neurosciences have unveiled that the brain has extremely complicated but elegant network forms to support its functions. Over years, people are gathering knowledge about the structure and functions of the brain, and this process is accelerating recently along with the initiation of giant brain projects worldwide. Here, we argue that the general principles of brain functions are the most valuable things to inspire the development of AI. These general principles are the standard rules of the brain extracting, representing, manipulating, and retrieving information, and here we call them the first principles of the brain. This paper collects six such first principles. They are attractor network, criticality, random network, sparse coding, relational memory, and perceptual learning. On each topic, we review its biological background, fundamental property, potential application to AI, and future development.

preprint2023arXiv

Danlu Tongdu tablets treat lumbar spinal stenosis through reducing reactive oxygen species and apoptosis by regulating CDK2/CDK4/CDKN1A expression

Lumbar spinal stenosis (LSS) is caused by the compression of the nerve root or cauda equina nerve by stenosis of the lumbar spinal canal or intervertebral foramen, and is manifested as chronic low back and leg pain. Danlu Tongdu (DLTD) tablets can relieve chronic pain caused by LSS, but the molecular mechanism remains largely unknown. In this study, the potential molecular mechanism of DLTD tablets in the treatment of LSS was firstly predicted by network pharmacology method. Results showed that DLTD functions in regulating anti-oxidative, apoptosis, and inflammation signaling pathways. Furthermore, the flow cytometry results showed that DLTD tablets efficiently reduced ROS content and inhibited rat neural stem cell apoptosis induced by hydrogen peroxide. DLTD also inhibited the mitochondrial membrane potential damage induced by hydrogen peroxide. Elisa analysis showed that DLTD induced cell cycle related protein, CDK2 and CDK4 and reduced CDKN1A protein expression level. Taken together, our study provided new insights of DLTD in treating LSS through reducing ROS content, decreasing apoptosis by inhibiting CDKN1A and promoting CDK2 and CDK4 expression levels.

preprint2023arXiv

DarkVision: A Benchmark for Low-light Image/Video Perception

Imaging and perception in photon-limited scenarios is necessary for various applications, e.g., night surveillance or photography, high-speed photography, and autonomous driving. In these cases, cameras suffer from low signal-to-noise ratio, which degrades the image quality severely and poses challenges for downstream high-level vision tasks like object detection and recognition. Data-driven methods have achieved enormous success in both image restoration and high-level vision tasks. However, the lack of high-quality benchmark dataset with task-specific accurate annotations for photon-limited images/videos delays the research progress heavily. In this paper, we contribute the first multi-illuminance, multi-camera, and low-light dataset, named DarkVision, serving for both image enhancement and object detection. We provide bright and dark pairs with pixel-wise registration, in which the bright counterpart provides reliable reference for restoration and annotation. The dataset consists of bright-dark pairs of 900 static scenes with objects from 15 categories, and 32 dynamic scenes with 4-category objects. For each scene, images/videos were captured at 5 illuminance levels using three cameras of different grades, and average photons can be reliably estimated from the calibration data for quantitative studies. The static-scene images and dynamic videos respectively contain around 7,344 and 320,667 instances in total. With DarkVision, we established baselines for image/video enhancement and object detection by representative algorithms. To demonstrate an exemplary application of DarkVision, we propose two simple yet effective approaches for improving performance in video enhancement and object detection respectively. We believe DarkVision would advance the state-of-the-arts in both imaging and related computer vision tasks in low-light environment.

preprint2023arXiv

Generalizing the intention-to-treat effect of an active control against placebo from historical placebo-controlled trials to an active-controlled trial: A case study of the efficacy of daily oral TDF/FTC in the HPTN 084 study

In many clinical settings, an active-controlled trial design (e.g., a non-inferiority or superiority design) is often used to compare an experimental medicine to an active control (e.g., an FDA-approved, standard therapy). One prominent example is a recent phase 3 efficacy trial, HIV Prevention Trials Network Study 084 (HPTN 084), comparing long-acting cabotegravir, a new HIV pre-exposure prophylaxis (PrEP) agent, to the FDA-approved daily oral tenofovir disoproxil fumarate plus emtricitabine (TDF/FTC) in a population of heterosexual women in 7 African countries. One key complication of interpreting study results in an active-controlled trial like HPTN 084 is that the placebo arm is not present and the efficacy of the active control (and hence the experimental drug) compared to the placebo can only be inferred by leveraging other data sources. \bz{In this article, we study statistical inference for the intention-to-treat (ITT) effect of the active control using relevant historical placebo-controlled trials data under the potential outcomes (PO) framework}. We highlight the role of adherence and unmeasured confounding, discuss in detail identification assumptions and two modes of inference (point versus partial identification), propose estimators under identification assumptions permitting point identification, and lay out sensitivity analyses needed to relax identification assumptions. We applied our framework to estimating the intention-to-treat effect of daily oral TDF/FTC versus placebo in HPTN 084 using data from an earlier Phase 3, placebo-controlled trial of daily oral TDF/FTC (Partners PrEP).

preprint2023arXiv

MobileVLM : A Fast, Strong and Open Vision Language Assistant for Mobile Devices

We present MobileVLM, a competent multimodal vision language model (MMVLM) targeted to run on mobile devices. It is an amalgamation of a myriad of architectural designs and techniques that are mobile-oriented, which comprises a set of language models at the scale of 1.4B and 2.7B parameters, trained from scratch, a multimodal vision model that is pre-trained in the CLIP fashion, cross-modality interaction via an efficient projector. We evaluate MobileVLM on several typical VLM benchmarks. Our models demonstrate on par performance compared with a few much larger models. More importantly, we measure the inference speed on both a Qualcomm Snapdragon 888 CPU and an NVIDIA Jeston Orin GPU, and we obtain state-of-the-art performance of 21.5 tokens and 65.3 tokens per second, respectively. Our code will be made available at: https://github.com/Meituan-AutoML/MobileVLM.

preprint2023arXiv

Towards simultaneous coherent radiation in the visible and microwave bands with doped molecular crystals

Coherent sources exploiting the stimulated emission of non-equilibrium quantum systems, i.e. gain media, have proven indispensable for advancing fundamental research and engineering. The operating electromagnetic bands of such coherent sources have been continuously enriched for increasing demands.Nevertheless, for a single bench top coherent source, simultaneous generation of radiation in multiple bands, especially when the bands are widely separated, present formidable challenges with a single gain medium. Here, we propose a mechanism of simultaneously realizing the stimulated emission of radiation in the visible and microwave bands, i.e. lasing and masing actions, at ambient conditions by utilizing photoexcited singlet and triplet states of the pentacene molecules that are doped in p-terphenyl. The possibility is validated by the observed amplified spontaneous emission (ASE) at 645 nm with a narrow linewidth around 1 nm from the pentacene-doped p-terphenyl crystal used for masing at 1.45 GHz and consolidated by a 20 fold lower threshold of ASE compared to the reported masing threshold. The overall threshold of the pentacene-based multiband coherent source can be optimized by appropriate alignment of the pump-light polarization with the pentacene's transition dipole moment. Our work not only shows a great promise on immediate realization of multiband coherent sources but also establishes an intriguing solid-state platform for fundamental research of quantum optics in multiple frequency domains.

preprint2023arXiv

YOLOv6 v3.0: A Full-Scale Reloading

The YOLO community has been in high spirits since our first two releases! By the advent of Chinese New Year 2023, which sees the Year of the Rabbit, we refurnish YOLOv6 with numerous novel enhancements on the network architecture and the training scheme. This release is identified as YOLOv6 v3.0. For a glimpse of performance, our YOLOv6-N hits 37.5% AP on the COCO dataset at a throughput of 1187 FPS tested with an NVIDIA Tesla T4 GPU. YOLOv6-S strikes 45.0% AP at 484 FPS, outperforming other mainstream detectors at the same scale (YOLOv5-S, YOLOv8-S, YOLOX-S and PPYOLOE-S). Whereas, YOLOv6-M/L also achieve better accuracy performance (50.0%/52.8% respectively) than other detectors at a similar inference speed. Additionally, with an extended backbone and neck design, our YOLOv6-L6 achieves the state-of-the-art accuracy in real-time. Extensive experiments are carefully conducted to validate the effectiveness of each improving component. Our code is made available at https://github.com/meituan/YOLOv6.

preprint2022arXiv

A new model for preferential attachment scheme with time-varying parameters

We propose an extension of the preferential attachment scheme by allowing the connecting probability to depend on time t. We estimate the parameters involved in the model by minimizing the expected squared difference between the number of vertices of degree one and its conditional expectation. The asymptotic properties of the estimators are also investigated when the parameters are time-varying by establishing the central limit theorem (CLT) of the number of vertices of degree one. We propose a new statistic to test whether the parameters have change points. We also offer some methods to estimate the number of change points and detect the locations of change points. Simulations are conducted to illustrate the performances of the above results.

preprint2022arXiv

A new preferential model with homophily for recommender systems

"Rich-get-richer" and "homophily" are two important phenomena in evolving social networks. "Rich-get-richer" means people with higher followings are more likely to attract new fans, and "homophily" means people prefer to bond with others of the same social group or who have some other attribute in common. To formalize the phenomena simultaneously in the context of an evolving social network, we consider a K-groups preferential attachment (KPA) network model, which is helpful for the social networks recommender system. The main contribution of this paper is to propose a new evolving social network model with the mechanisms of rich-get-richer and homophily. We show that the KPA model exhibits a power-law degree distribution for each group and prove the central limit theorem (CLT) for the maximum likelihood estimation (MLE) of the parameters in the KPA model. We illustrate our results through simulated data and explore the usage of this model with real data examples.

preprint2022arXiv

A physical perturbation based study on the prediction of free-fall disks with chaotic modes in the water

We report a phenomenon that physical perturbations sometimes can benefit the certainty of a free-fall motion with chaotic modes, albeit, as commonly believed, they can ruin it. We statistically compare those factors that may lead to uncertainty, by which we find that the growth of the standard deviation of the landing locations is directly determined by the physical perturbations. A significant yardstick is defined in the meantime. This temporal criterion is of big relevance to the replicability of such problems experimentally, although they are inherently chaotic. Our hypothesis is verified by experiments from other literature. This outcome also provides a practical strategy to evaluate the credible prediction time by estimating the disturbances from physical parameters as a priori.

preprint2022arXiv

A Semiparametric Approach to Model-based Sensitivity Analysis in Observational Studies

When drawing causal inference from observational data, there is always concern about unmeasured confounding. One way to tackle this is to conduct a sensitivity analysis. One widely-used sensitivity analysis framework hypothesizes the existence of a scalar unmeasured confounder U and asks how the causal conclusion would change were U measured and included in the primary analysis. Works along this line often make various parametric assumptions on U, for the sake of mathematical and computational simplicity. In this article, we further this line of research by developing a valid sensitivity analysis that leaves the distribution of U unrestricted. Our semiparametric estimator has three desirable features compared to many existing methods in the literature. First, our method allows for a larger and more flexible family of models, and mitigates observable implications (Franks et al., 2019). Second, our methods work seamlessly with any primary analysis that models the outcome regression parametrically. Third, our method is easy to use and interpret. We construct both pointwise confidence intervals and confidence bands that are uniformly valid over a given sensitivity parameter space, thus formally accounting for unknown sensitivity parameters. We apply our proposed method on an influential yet controversial study of the causal relationship between war experiences and political activeness using observational data from Uganda.

preprint2022arXiv

A VLBA Trigonometric Parallax for RR Aql and the Mira PL Relation

We report VLBA observations of 22 GHz H$_{2}$O and 43 GHz SiO masers toward the Mira variable RR Aql. By fitting the SiO maser emission to a circular ring, we estimate the absolute stellar position of RR Aql and find agreement with Gaia astrometry to within the joint uncertainty of $\approx1$ mas. Using the maser astrometry we measure a stellar parallax of 2.44 $\pm$ 0.07 mas, corresponding to a distance of 410$^{+12}_{-11}$ pc. The maser parallax deviates significantly from the Gaia EDR3 parallax of 1.95 $\pm$ 0.11 mas, indicating a $3.8σ$ tension between radio and optical measurements. This tension is most likely caused by optical photo-center variations limiting the Gaia astrometric accuracy for this Mira variable. Combining infrared magnitudes with parallaxes for RR Aql and other Miras, we fit a period-luminosity relation using a Bayesian approach with MCMC sampling and a strong prior for the slope of -3.60 $\pm$ 0.30 from the LMC. We find a $K$-band zero-point (defined at logP(days) = 2.30) of -6.79 $\pm$ 0.15 mag using VLBI parallaxes and -7.08 $\pm$ 0.29 mag using Gaia parallaxes. The Gaia zero-point is statistically consistent with the more accurate VLBI value.

preprint2022arXiv

Adaptable Text Matching via Meta-Weight Regulator

Neural text matching models have been used in a range of applications such as question answering and natural language inference, and have yielded a good performance. However, these neural models are of a limited adaptability, resulting in a decline in performance when encountering test examples from a different dataset or even a different task. The adaptability is particularly important in the few-shot setting: in many cases, there is only a limited amount of labeled data available for a target dataset or task, while we may have access to a richly labeled source dataset or task. However, adapting a model trained on the abundant source data to a few-shot target dataset or task is challenging. To tackle this challenge, we propose a Meta-Weight Regulator (MWR), which is a meta-learning approach that learns to assign weights to the source examples based on their relevance to the target loss. Specifically, MWR first trains the model on the uniformly weighted source examples, and measures the efficacy of the model on the target examples via a loss function. By iteratively performing a (meta) gradient descent, high-order gradients are propagated to the source examples. These gradients are then used to update the weights of source examples, in a way that is relevant to the target performance. As MWR is model-agnostic, it can be applied to any backbone neural model. Extensive experiments are conducted with various backbone text matching models, on four widely used datasets and two tasks. The results demonstrate that our proposed approach significantly outperforms a number of existing adaptation methods and effectively improves the cross-dataset and cross-task adaptability of the neural text matching models in the few-shot setting.

preprint2022arXiv

Adversarial Texture for Fooling Person Detectors in the Physical World

Nowadays, cameras equipped with AI systems can capture and analyze images to detect people automatically. However, the AI system can make mistakes when receiving deliberately designed patterns in the real world, i.e., physical adversarial examples. Prior works have shown that it is possible to print adversarial patches on clothes to evade DNN-based person detectors. However, these adversarial examples could have catastrophic drops in the attack success rate when the viewing angle (i.e., the camera's angle towards the object) changes. To perform a multi-angle attack, we propose Adversarial Texture (AdvTexture). AdvTexture can cover clothes with arbitrary shapes so that people wearing such clothes can hide from person detectors from different viewing angles. We propose a generative method, named Toroidal-Cropping-based Expandable Generative Attack (TC-EGA), to craft AdvTexture with repetitive structures. We printed several pieces of cloth with AdvTexure and then made T-shirts, skirts, and dresses in the physical world. Experiments showed that these clothes could fool person detectors in the physical world.

preprint2022arXiv

Aspect-specific Context Modeling for Aspect-based Sentiment Analysis

Aspect-based sentiment analysis (ABSA) aims at predicting sentiment polarity (SC) or extracting opinion span (OE) expressed towards a given aspect. Previous work in ABSA mostly relies on rather complicated aspect-specific feature induction. Recently, pretrained language models (PLMs), e.g., BERT, have been used as context modeling layers to simplify the feature induction structures and achieve state-of-the-art performance. However, such PLM-based context modeling can be not that aspect-specific. Therefore, a key question is left under-explored: how the aspect-specific context can be better modeled through PLMs? To answer the question, we attempt to enhance aspect-specific context modeling with PLM in a non-intrusive manner. We propose three aspect-specific input transformations, namely aspect companion, aspect prompt, and aspect marker. Informed by these transformations, non-intrusive aspect-specific PLMs can be achieved to promote the PLM to pay more attention to the aspect-specific context in a sentence. Additionally, we craft an adversarial benchmark for ABSA (advABSA) to see how aspect-specific modeling can impact model robustness. Extensive experimental results on standard and adversarial benchmarks for SC and OE demonstrate the effectiveness and robustness of the proposed method, yielding new state-of-the-art performance on OE and competitive performance on SC.

preprint2022arXiv

Asymptotic Inference for Infinitely Imbalanced Logistic Regression

In this paper we extend the work of Owen (2007) by deriving a second order expansion for the slope parameter in logistic regression, when the size of the majority class is unbounded and the minority class is finite. More precisely, we demonstrate that the second order term converges to a normal distribution and explicitly compute its variance, which surprisingly once again depends only on the mean of the minority class points and not their arrangement under mild regularity assumptions. In the case that the majority class is normally distributed, we illustrate that the variance of the the limiting slope depends exponentially on the z-score of the average of the minority class's points with respect to the majority class's distribution. We confirm our results by Monte Carlo simulations.

preprint2022arXiv

Blind Source Separation over Space

We propose a new estimation method for the blind source separation model of Bachoc et al. (2020). The new estimation is based on an eigenanalysis of a positive definite matrix defined in terms of multiple normalized spatial local covariance matrices, and, therefore, can handle moderately high-dimensional random fields. The consistency of the estimated mixing matrix is established with explicit error rates even when the eigen-gap decays to zero slowly. The proposed method is illustrated via both simulation and a real data example.

preprint2022arXiv

Bringing Old Films Back to Life

We present a learning-based framework, recurrent transformer network (RTN), to restore heavily degraded old films. Instead of performing frame-wise restoration, our method is based on the hidden knowledge learned from adjacent frames that contain abundant information about the occlusion, which is beneficial to restore challenging artifacts of each frame while ensuring temporal coherency. Moreover, contrasting the representation of the current frame and the hidden knowledge makes it possible to infer the scratch position in an unsupervised manner, and such defect localization generalizes well to real-world degradations. To better resolve mixed degradation and compensate for the flow estimation error during frame alignment, we propose to leverage more expressive transformer blocks for spatial restoration. Experiments on both synthetic dataset and real-world old films demonstrate the significant superiority of the proposed RTN over existing solutions. In addition, the same framework can effectively propagate the color from keyframes to the whole video, ultimately yielding compelling restored films. The implementation and model will be released at https://github.com/raywzy/Bringing-Old-Films-Back-to-Life.

preprint2022arXiv

Contrastive Cross-domain Recommendation in Matching

Cross-domain recommendation (CDR) aims to provide better recommendation results in the target domain with the help of the source domain, which is widely used and explored in real-world systems. However, CDR in the matching (i.e., candidate generation) module struggles with the data sparsity and popularity bias issues in both representation learning and knowledge transfer. In this work, we propose a novel Contrastive Cross-Domain Recommendation (CCDR) framework for CDR in matching. Specifically, we build a huge diversified preference network to capture multiple information reflecting user diverse interests, and design an intra-domain contrastive learning (intra-CL) and three inter-domain contrastive learning (inter-CL) tasks for better representation learning and knowledge transfer. The intra-CL enables more effective and balanced training inside the target domain via a graph augmentation, while the inter-CL builds different types of cross-domain interactions from user, taxonomy, and neighbor aspects. In experiments, CCDR achieves significant improvements on both offline and online evaluations in a real-world system. Currently, we have deployed our CCDR on WeChat Top Stories, affecting plenty of users. The source code is in https://github.com/lqfarmer/CCDR.

preprint2022arXiv

Diagnosing Circumburst Environment with Multiband Gamma-Ray Burst Radio Afterglows

It has been widely recognized that gamma-ray burst (GRB) afterglows arise from interactions between GRB outflow and circumburst medium, while their evolution follows the behaviors of relativistic shock waves. Assuming the distribution of circumburst medium follows a general power-law form, that is, $n = A_{\ast} R^{-k}$, where $R$ denotes the distance from the burst, it is obvious that the value of density-distribution index $k$ can affect the behaviors of the afterglow. In this paper, we analyze the temporal and spectral behaviors of GRB radio afterglows with arbitrary $k$-values. In the radio band, a standard GRB afterglow produced by forward shock exhibits a late-time flux peak, and the relative peak fluxes as well as peak times at different frequencies show dependencies on $k$. Thus with multi-band radio peak observations, one can determine the density profile of circumburst medium by comparing the relations between peak flux/time and frequency at each observing band. Also, the effects of trans-relativistic shock waves, as well as jets in afterglows are discussed. By analyzing 31 long and 1 short GRBs with multi-band data of radio afterglows, we find that nearly half of them can be explained with uniform interstellar medium ($k=0$), $\sim 1/5$ can be constrained to exhibiting stellar wind environment ($k=2$), while less than $\sim 1/3$ samples show $0< k< 2$.

preprint2022arXiv

Diagnosis of ultrafast ultraintense laser pulse characteristics by machine-learning-assisted electron spin

Rapid development of ultrafast ultraintense laser technologies continues to create opportunities for studying strong-field physics under extreme conditions. However, accurate determination of the spatial and temporal characteristics of a laser pulse is still a great challenge, especially when laser powers higher than hundreds of terawatts are involved. In this paper, by utilizing the radiative spin-flip effect, we find that the spin depolarization of an electron beam can be employed to diagnose characteristics of ultrafast ultraintense lasers with peak intensities around $10^{20}$-$10^{22}$~W/cm$^2$. With three shots, our machine-learning-assisted model can predict, simultaneously, the pulse duration, peak intensity, and focal radius of a focused Gaussian ultrafast ultraintense laser (in principle, the profile can be arbitrary) with relative errors of $0.1\%$-$10\%$. The underlying physics and an alternative diagnosis method (without the assistance of machine learning) are revealed by the asymptotic approximation of the final spin degree of polarization. Our proposed scheme exhibits robustness and detection accuracy with respect to fluctuations in the electron beam parameters. Accurate measurements of the ultrafast ultraintense laser parameters will lead to much higher precision in, for example, laser nuclear physics investigations and laboratory astrophysics studies. Robust machine learning techniques may also find applications in more general strong-field physics scenarios.

preprint2022arXiv

Disentangled Inference for GANs with Latently Invertible Autoencoder

Generative Adversarial Networks (GANs) play an increasingly important role in machine learning. However, there is one fundamental issue hindering their practical applications: the absence of capability for encoding real-world samples. The conventional way of addressing this issue is to learn an encoder for GAN via Variational Auto-Encoder (VAE). In this paper, we show that the entanglement of the latent space for the VAE/GAN framework poses the main challenge for encoder learning. To address the entanglement issue and enable inference in GAN we propose a novel algorithm named Latently Invertible Autoencoder (LIA). The framework of LIA is that an invertible network and its inverse mapping are symmetrically embedded in the latent space of VAE. The decoder of LIA is first trained as a standard GAN with the invertible network and then the partial encoder is learned from a disentangled autoencoder by detaching the invertible network from LIA, thus avoiding the entanglement problem caused by the random latent space. Experiments conducted on the FFHQ face dataset and three LSUN datasets validate the effectiveness of LIA/GAN.

preprint2022arXiv

Enhanced quantum sensing with room-temperature solid-state masers

Quantum sensing with solid-state systems finds broad applications in diverse areas ranging from material and biomedical sciences to fundamental physics. Several solid-state spin sensors have been developed, facilitating the ultra-sensitive detection of physical quantities such as magnetic and electric fields and temperature. Exploiting collective behaviour of non-interacting spins holds the promise of pushing the detection limit to even lower levels, while to date, those levels are scarcely reached due to the broadened linewidth and inefficient readout of solid-state spin ensembles. Here, we experimentally demonstrate that such drawbacks can be overcome by newly reborn maser technology at room temperature in the solid state. Owing to maser action, we observe a 4-fold reduction in the inhomogeneously broadened linewidth of a molecular spin ensemble, which is narrower than the same measured from single spins at cryogenic temperatures. The maser-based readout applied to magnetometry showcases a signal-to-noise ratio (SNR) of 30 dB for single shots. This technique would be a significant addition to the toolbox for boosting the sensitivity of solid-state ensemble spin sensors.

preprint2022arXiv

Estimating the Optimal Covariance with Imperfect Mean in Diffusion Probabilistic Models

Diffusion probabilistic models (DPMs) are a class of powerful deep generative models (DGMs). Despite their success, the iterative generation process over the full timesteps is much less efficient than other DGMs such as GANs. Thus, the generation performance on a subset of timesteps is crucial, which is greatly influenced by the covariance design in DPMs. In this work, we consider diagonal and full covariances to improve the expressive power of DPMs. We derive the optimal result for such covariances, and then correct it when the mean of DPMs is imperfect. Both the optimal and the corrected ones can be decomposed into terms of conditional expectations over functions of noise. Building upon it, we propose to estimate the optimal covariance and its correction given imperfect mean by learning these conditional expectations. Our method can be applied to DPMs with both discrete and continuous timesteps. We consider the diagonal covariance in our implementation for computational efficiency. For an efficient practical implementation, we adopt a parameter sharing scheme and a two-stage training process. Empirically, our method outperforms a wide variety of covariance design on likelihood results, and improves the sample quality especially on a small number of timesteps.

preprint2022arXiv

Factor Modelling for Clustering High-dimensional Time Series

We propose a new unsupervised learning method for clustering a large number of time series based on a latent factor structure. Each cluster is characterized by its own cluster-specific factors in addition to some common factors which impact on all the time series concerned. Our setting also offers the flexibility that some time series may not belong to any clusters. The consistency with explicit convergence rates is established for the estimation of the common factors, the cluster-specific factors, the latent clusters. Numerical illustration with both simulated data as well as a real data example is also reported. As a spin-off, the proposed new approach also advances significantly the statistical inference for the factor model of Lam and Yao (2012).

preprint2022arXiv

Fast Density Estimation for Density-based Clustering Methods

Density-based clustering algorithms are widely used for discovering clusters in pattern recognition and machine learning since they can deal with non-hyperspherical clusters and are robustness to handle outliers. However, the runtime of density-based algorithms are heavily dominated by finding fixed-radius near neighbors and calculating the density, which is time-consuming. Meanwhile, the traditional acceleration methods using indexing technique such as KD tree is not effective in processing high-dimensional data. In this paper, we propose a fast region query algorithm named fast principal component analysis pruning (called FPCAP) with the help of the fast principal component analysis technique in conjunction with geometric information provided by principal attributes of the data, which can process high-dimensional data and be easily applied to density-based methods to prune unnecessary distance calculations when finding neighbors and estimating densities. As an application in density-based clustering methods, FPCAP method was combined with the Density Based Spatial Clustering of Applications with Noise (DBSCAN) algorithm. And then, an improved DBSCAN (called IDBSCAN) is obtained, which preserves the advantage of DBSCAN and meanwhile, greatly reduces the computation of redundant distances. Experiments on seven benchmark datasets demonstrate that the proposed algorithm improves the computational efficiency significantly.

preprint2022arXiv

Fast Lossless Neural Compression with Integer-Only Discrete Flows

By applying entropy codecs with learned data distributions, neural compressors have significantly outperformed traditional codecs in terms of compression ratio. However, the high inference latency of neural networks hinders the deployment of neural compressors in practical applications. In this work, we propose Integer-only Discrete Flows (IODF), an efficient neural compressor with integer-only arithmetic. Our work is built upon integer discrete flows, which consists of invertible transformations between discrete random variables. We propose efficient invertible transformations with integer-only arithmetic based on 8-bit quantization. Our invertible transformation is equipped with learnable binary gates to remove redundant filters during inference. We deploy IODF with TensorRT on GPUs, achieving 10x inference speedup compared to the fastest existing neural compressors, while retaining the high compression rates on ImageNet32 and ImageNet64.

preprint2022arXiv

Guiding self-assembly of active colloids by temporal modulation of activity

Self-organization phenomena in ensembles of self-propelled particles open pathways to the synthesis of new dynamic states not accessible by traditional equilibrium processes. The challenge is to develop a set of principles that facilitate the control and manipulation of emergent active states. Here, we report that dielectric rolling colloids energized by a pulsating electric field self-organize into alternating square lattices with a lattice constant controlled by the parameters of the field. We combine experiments and simulations to examine spatiotemporal properties of the emergent collective patterns, and investigate the underlying dynamics of the self-organization.We reveal the resistance of the dynamic lattices to compression/expansion stresses leading to a hysteretic behavior of the lattice constant. The general mechanism of pattern synthesis and control in active ensembles via temporal modulation of activity can be applied to other active colloidal systems.

preprint2022arXiv

Human-centric Image Cropping with Partition-aware and Content-preserving Features

Image cropping aims to find visually appealing crops in an image, which is an important yet challenging task. In this paper, we consider a specific and practical application: human-centric image cropping, which focuses on the depiction of a person. To this end, we propose a human-centric image cropping method with two novel feature designs for the candidate crop: partition-aware feature and content-preserving feature. For partition-aware feature, we divide the whole image into nine partitions based on the human bounding box and treat different partitions in a candidate crop differently conditioned on the human information. For content-preserving feature, we predict a heatmap indicating the important content to be included in a good crop, and extract the geometric relation between the heatmap and a candidate crop. Extensive experiments demonstrate that our method can perform favorably against state-of-the-art image cropping methods on human-centric image cropping task. Code is available at https://github.com/bcmi/Human-Centric-Image-Cropping.

preprint2022arXiv

Hyperuniform Active Chiral Fluids with Tunable Internal Structure

Large density fluctuations observed in active systems and hyperuniformity are two seemingly incompatible phenomena. However, the formation of hyperuniform states has been recently predicted in non-equilibrium fluids formed by chiral particles performing circular motion with the same handedness. Here we report evidence of hyperuniformity realized in a chiral active fluid comprised of pear-shaped Quincke rollers of arbitrary handedness. We show that hyperuniformity and large density fluctuations, triggered by dynamic clustering, coexist in this system at different length scales. The system loses its hyperuniformity as the curvature of particles' motion increases transforming them into localized spinners. Our results experimentally demonstrate a novel hyperuniform active fluid and provide new insights into an interplay between chirality, activity and hyperuniformity.

preprint2022arXiv

Joint Distribution Alignment via Adversarial Learning for Domain Adaptive Object Detection

Unsupervised domain adaptive object detection aims to adapt a well-trained detector from its original source domain with rich labeled data to a new target domain with unlabeled data. Recently, mainstream approaches perform this task through adversarial learning, yet still suffer from two limitations. First, they mainly align marginal distribution by unsupervised cross-domain feature matching, and ignore each feature's categorical and positional information that can be exploited for conditional alignment; Second, they treat all classes as equally important for transferring cross-domain knowledge and ignore that different classes usually have different transferability. In this paper, we propose a joint adaptive detection framework (JADF) to address the above challenges. First, an end-to-end joint adversarial adaptation framework for object detection is proposed, which aligns both marginal and conditional distributions between domains without introducing any extra hyperparameter. Next, to consider the transferability of each object class, a metric for class-wise transferability assessment is proposed, which is incorporated into the JADF objective for domain adaptation. Further, an extended study from unsupervised domain adaptation (UDA) to unsupervised few-shot domain adaptation (UFDA) is conducted, where only a few unlabeled training images are available in unlabeled target domain. Extensive experiments validate that JADF is effective in both the UDA and UFDA settings, achieving significant performance gains over existing state-of-the-art cross-domain detection methods.

preprint2022arXiv

LAMOST medium-resolution spectroscopic survey of binarity and exotic star (LAMOST-MRS-B): Observation strategy and target selection

LAMOST-MRS-B is one of the sub-surveys of LAMOST medium-resolution (R~7500) spectroscopic survey. It aims at studying the statistical properties (e.g., binary fraction, orbital period distribution, mass ratio distribution) of binary stars and exotic stars. We intend to observe about 30000 stars (10 mag <= G <= 14.5 mag) with at least 10 visits in five years. We first planned to observe 25 plates around the galactic plane in 2018. Then the plates were reduced to 12 in 2019 because of the limitation of observation. At the same time, two new plates located at the high galactic latitude were added to explore binary properties influenced by the different environments. In this survey project, we set the identified exotic and low-metallicity stars with the highest observation priorities. For the rest of the selected stars, we gave higher priority to the relatively brighter stars in order to obtain high-quality spectra as many as possible. Spectra of 49129 stars have been obtained in LAMOST-MRS-B field and released in DR8, of which 28828 and 3375 stars have been visited more than twice and ten times with SNR >= 10, respectively. Most of the sources are B-, A-, and F-type stars with 0.6 < [Fe/H] < 0.4 dex. We also obtain 347 identified variable and exotic stars and about 250 stars with [Fe/H] < 1 dex. We measure radial velocities (RVs) by using 892233 spectra of the stars. The uncertainties of RV achieve about 1 km/s and 10 km/s1 for 95% of late- and early-type stars, respectively. The datasets presented in this paper are available at http://www.doi.org/10.57760/sciencedb.j00113.00035.

preprint2022arXiv

Learning Cross-Image Object Semantic Relation in Transformer for Few-Shot Fine-Grained Image Classification

Few-shot fine-grained learning aims to classify a query image into one of a set of support categories with fine-grained differences. Although learning different objects' local differences via Deep Neural Networks has achieved success, how to exploit the query-support cross-image object semantic relations in Transformer-based architecture remains under-explored in the few-shot fine-grained scenario. In this work, we propose a Transformer-based double-helix model, namely HelixFormer, to achieve the cross-image object semantic relation mining in a bidirectional and symmetrical manner. The HelixFormer consists of two steps: 1) Relation Mining Process (RMP) across different branches, and 2) Representation Enhancement Process (REP) within each individual branch. By the designed RMP, each branch can extract fine-grained object-level Cross-image Semantic Relation Maps (CSRMs) using information from the other branch, ensuring better cross-image interaction in semantically related local object regions. Further, with the aid of CSRMs, the developed REP can strengthen the extracted features for those discovered semantically-related local regions in each branch, boosting the model's ability to distinguish subtle feature differences of fine-grained objects. Extensive experiments conducted on five public fine-grained benchmarks demonstrate that HelixFormer can effectively enhance the cross-image object semantic relation matching for recognizing fine-grained objects, achieving much better performance over most state-of-the-art methods under 1-shot and 5-shot scenarios. Our code is available at: https://github.com/JiakangYuan/HelixFormer

preprint2022arXiv

Li-rich Giants in LAMOST Survey. III. The statistical analysis of Li-rich giants

The puzzle of Li-rich giant is still unsolved, contradicting the prediction of the standard stellar models. Although the exact evolutionary stages play a key role in the knowledge of Li-rich giants, a limited number of Li-rich giants have been taken with high-quality asteroseismic parameters to clearly distinguish the stellar evolutionary stages. Based on the LAMOST Data Release 7 (DR7), we applied a data-driven neural network method to derive the parameters for giant stars, which contain the largest number of Li-rich giants. The red giant stars are classified into three stages of Red Giant Branch (RGB), Primary Red Clump (PRC), and Secondary Red Clump (SRC) relying on the estimated asteroseismic parameters. In the statistical analysis of the properties (i.e. stellar mass, carbon, nitrogen, Li-rich distribution, and frequency) of Li-rich giants, we found that: (1) Most of the Li-rich RGB stars are suggested to be the descendants of Li-rich pre-RGB stars and/or the result of engulfment of planet or substellar companions; (2) The massive Li-rich SRC stars could be the natural consequence of Li depletion from the high-mass Li-rich RGB stars. (3) Internal mixing processes near the helium flash can account for the phenomenon of Li-rich on PRC that dominated the Li-rich giants. Based on the comparison of [C/N] distributions between Li-rich and normal PRC stars, the Li-enriched processes probably depend on the stellar mass.

preprint2022arXiv

Machine learning for percolation utilizing auxiliary Ising variables

Machine learning for phase transition has received intensive research interest in recent years. However, its application in percolation still remains challenging. We propose an auxiliary Ising mapping method for machine learning study of the standard percolation as well as a variety of statistical mechanical systems in correlated percolation representations. We demonstrate that unsupervised machine learning is able to accurately locate the percolation threshold, independent of the spatial dimension of system or the type of phase transition, which can be first order or continuous. Moreover, we show that, by neural network machine learning, auxiliary Ising configurations for different universalities can be classified with high confidence level. Our results indicate that the auxiliary Ising mapping method, despite of it simplicity, can advance the application of machine learning in statistical and condensed-matter physics.

preprint2022arXiv

Mining Error Templates for Grammatical Error Correction

Some grammatical error correction (GEC) systems incorporate hand-crafted rules and achieve positive results. However, manually defining rules is time-consuming and laborious. In view of this, we propose a method to mine error templates for GEC automatically. An error template is a regular expression aiming at identifying text errors. We use the web crawler to acquire such error templates from the Internet. For each template, we further select the corresponding corrective action by using the language model perplexity as a criterion. We have accumulated 1,119 error templates for Chinese GEC based on this method. Experimental results on the newly proposed CTC-2021 Chinese GEC benchmark show that combing our error templates can effectively improve the performance of a strong GEC system, especially on two error types with very little training data. Our error templates are available at \url{https://github.com/HillZhang1999/gec_error_template}.

preprint2022arXiv

MuCGEC: a Multi-Reference Multi-Source Evaluation Dataset for Chinese Grammatical Error Correction

This paper presents MuCGEC, a multi-reference multi-source evaluation dataset for Chinese Grammatical Error Correction (CGEC), consisting of 7,063 sentences collected from three Chinese-as-a-Second-Language (CSL) learner sources. Each sentence is corrected by three annotators, and their corrections are carefully reviewed by a senior annotator, resulting in 2.3 references per sentence. We conduct experiments with two mainstream CGEC models, i.e., the sequence-to-sequence model and the sequence-to-edit model, both enhanced with large pretrained language models, achieving competitive benchmark performance on previous and our datasets. We also discuss CGEC evaluation methodologies, including the effect of multiple references and using a char-based metric. Our annotation guidelines, data, and code are available at \url{https://github.com/HillZhang1999/MuCGEC}.

preprint2022arXiv

Multi-granularity Item-based Contrastive Recommendation

Contrastive learning (CL) has shown its power in recommendation. However, most CL-based recommendation models build their CL tasks merely focusing on the user's aspects, ignoring the rich diverse information in items. In this work, we propose a novel Multi-granularity item-based contrastive learning (MicRec) framework for the matching stage (i.e., candidate generation) in recommendation, which systematically introduces multi-aspect item-related information to representation learning with CL. Specifically, we build three item-based CL tasks as a set of plug-and-play auxiliary objectives to capture item correlations in feature, semantic and session levels. The feature-level item CL aims to learn the fine-grained feature-level item correlations via items and their augmentations. The semantic-level item CL focuses on the coarse-grained semantic correlations between semantically related items. The session-level item CL highlights the global behavioral correlations of items from users' sequential behaviors in all sessions. In experiments, we conduct both offline and online evaluations on real-world datasets, verifying the effectiveness and universality of three proposed CL tasks. Currently, MicRec has been deployed on a real-world recommender system, affecting millions of users. The source code will be released in the future.

preprint2022arXiv

Neutron spectroscopy evidence for a possible magnetic-field-induced gapless quantum-spin-liquid phase in a Kitaev material $α$-RuCl$_3$

As one of the most promising Kitaev quantum-spin-liquid (QSL) candidates, $α$-RuCl$_3$ has received a great amount of attention. However, its ground state exhibits a long-range zigzag magnetic order, which defies the QSL phase. Nevertheless, the magnetic order is fragile and can be completely suppressed by applying an external magnetic field. Here, we explore the evolution of magnetic excitations of $α$-RuCl$_3$ under an in-plane magnetic field, by carrying out inelastic neutron scattering measurements on high-quality single crystals. Under zero field, there exist spin-wave excitations near the $M$ point and a continuum near the $\mitΓ$ point, which are believed to be associated with the zigzag magnetic order and fractional excitations of the Kitaev QSL state, respectively. By increasing the magnetic field, the spin-wave excitations gradually give way to the continuous excitations. On the verge of the critical field $μ_0H_{\rm c}=7.5$ T, the former vanish and only the latter is left, indicating the emergence of a pure QSL state. By further increasing the field strength, the excitations near the $\mitΓ$ point become more intense. By following the gap evolution of the excitations near the $\mitΓ$ point, we are able to establish a phase diagram composed of three interesting phases, including a gapped zigzag order phase at low fields, possibly-gapless QSL phase near $μ_0H_{\rm c}$, and gapped partially polarized phase at high fields. These results demonstrate that an in-plane magnetic field can drive $α$-RuCl$_3$ into a long-sought QSL state near the critical field.

preprint2022arXiv

On the HI Content of MaNGA Major Merger Pairs

The role of HI content in galaxy interactions is still under debate. To study the HI content of galaxy pairs at different merging stages, we compile a sample of 66 major-merger galaxy pairs and 433 control galaxies from the SDSS-IV MaNGA IFU survey. In this study, we adopt kinematic asymmetry as a new effective indicator to describe the merging stage of galaxy pairs. With archival data from the HI-MaNGA survey and new observations from the Five-hundred-meter Aperture Spherical Radio Telescope (FAST), we investigate the differences in HI gas fraction ($f_{\text{HI}}$), star formation rate (SFR), and HI star formation efficiency ($\rm SFE_{\text{HI}}$) between the pair and control samples. Our results suggest that the HI gas fraction of major-merger pairs on average is marginally decreased by $\sim 15\%$ relative to isolated galaxies, implying mild HI depletion during galaxy interactions. Compared to isolated galaxies, pre-passage paired galaxies have similar $f_{\text{HI}}$, SFR and $\rm SFE_{\text{HI}}$, while pairs during pericentric passage have weakly decreased $f_{\text{HI}}$ ($-0.10\pm0.05$ dex), significantly enhanced SFR ($0.42\pm0.11$ dex) and $\rm SFE_{\text{HI}}$ ($0.48\pm0.12$ dex). When approaching the apocenter, paired galaxies show marginally decreased $f_{\text{HI}}$ ($-0.05\pm0.04$ dex), comparable SFR ($0.04\pm0.06$ dex) and $\rm SFE_{\text{HI}}$ ($0.08\pm0.08$ dex). We propose the marginally detected HI depletion may originate from the gas consumption in fuelling the enhanced $\rm H_2$ reservoir of galaxy pairs. In addition, new FAST observations also reveal an HI absorber ($N_{\text{HI}}\sim 4.7 \times 10^{21} \text{ cm}^{-2}$), which may suggest gas infalling and the triggering of AGN activity.

preprint2022arXiv

OPA: Object Placement Assessment Dataset

Image composition aims to generate realistic composite image by inserting an object from one image into another background image, where the placement (e.g., location, size, occlusion) of inserted object may be unreasonable, which would significantly degrade the quality of the composite image. Although some works attempted to learn object placement to create realistic composite images, they did not focus on assessing the plausibility of object placement. In this paper, we focus on object placement assessment task, which verifies whether a composite image is plausible in terms of the object placement. To accomplish this task, we construct the first Object Placement Assessment (OPA) dataset consisting of composite images and their rationality labels. We also propose a simple yet effective baseline for this task. Dataset is available at https://github.com/bcmi/Object-Placement-Assessment-Dataset-OPA.

preprint2022arXiv

Pretraining is All You Need for Image-to-Image Translation

We propose to use pretraining to boost general image-to-image translation. Prior image-to-image translation methods usually need dedicated architectural design and train individual translation models from scratch, struggling for high-quality generation of complex scenes, especially when paired training data are not abundant. In this paper, we regard each image-to-image translation problem as a downstream task and introduce a simple and generic framework that adapts a pretrained diffusion model to accommodate various kinds of image-to-image translation. We also propose adversarial training to enhance the texture synthesis in the diffusion model training, in conjunction with normalized guidance sampling to improve the generation quality. We present extensive empirical comparison across various tasks on challenging benchmarks such as ADE20K, COCO-Stuff, and DIODE, showing the proposed pretraining-based image-to-image translation (PITI) is capable of synthesizing images of unprecedented realism and faithfulness.

preprint2022arXiv

Radio properties of the OH megamaser galaxy IIZw 096

Based on the two epochs EVN archive data from OH line observations of IIZw 096, we confirm that the high-resolution OH emission in this source mainly comes from two spots (OH1 and OH2) of comp D1 of this merging system. We found no significant variations in the OH line emission. The OH 1665 MHz line emission is detected at about 6 $σ$ level in the OH1 region by combining two epoch EVN observations. We found that the comp D1 shows the brightest CO, HCO+ line emission, as well as multi-band radio continuum emission. The environment around D1 shows no clear velocity structure associated with circular motions, making it different from most other OHMs in the literature, which might have been caused by an effect during the merger stage. Meanwhile, we found that the CO emission shows three velocity structures around D1, including the central broad FWHM region, the double peak region where the CO line profile shows two separated peaks, and the region of the high-velocity clouds where the CO line peaks at a high velocity ($\sim$ 11000 \kms). \HI in absorption also show high-velocity clouds around the D1 region, which might be due to inflows caused by the merging of two or more galaxy components. Based on the high-resolution K-band VLA and L-band VLBA observations of the radio continuum emission, we derived the brightness temperature in the range $10^{5}$ K to $10^{6}$ K, which is consistent with other starburst dominant OHM sources in the literature. The multi-band VLA observations show that the radio continuum emission of comp D might also have contributions from free-free emission, besides synchrotron emission. As a concenquence, these results support a starburst origin for the OHMs, without the presence of an AGN.

preprint2022arXiv

Real-Time Neural Character Rendering with Pose-Guided Multiplane Images

We propose pose-guided multiplane image (MPI) synthesis which can render an animatable character in real scenes with photorealistic quality. We use a portable camera rig to capture the multi-view images along with the driving signal for the moving subject. Our method generalizes the image-to-image translation paradigm, which translates the human pose to a 3D scene representation -- MPIs that can be rendered in free viewpoints, using the multi-views captures as supervision. To fully cultivate the potential of MPI, we propose depth-adaptive MPI which can be learned using variable exposure images while being robust to inaccurate camera registration. Our method demonstrates advantageous novel-view synthesis quality over the state-of-the-art approaches for characters with challenging motions. Moreover, the proposed method is generalizable to novel combinations of training poses and can be explicitly controlled. Our method achieves such expressive and animatable character rendering all in real time, serving as a promising solution for practical applications.

preprint2022arXiv

Robust PCA for High Dimensional Data based on Characteristic Transformation

In this paper, we propose a novel robust Principal Component Analysis (PCA) for high-dimensional data in the presence of various heterogeneities, especially the heavy-tailedness and outliers. A transformation motivated by the characteristic function is constructed to improve the robustness of the classical PCA. Besides the typical outliers, the proposed method has the unique advantage of dealing with heavy-tail-distributed data, whose covariances could be nonexistent (positively infinite, for instance). The proposed approach is also a case of kernel principal component analysis (KPCA) method and adopts the robust and non-linear properties via a bounded and non-linear kernel function. The merits of the new method are illustrated by some statistical properties including the upper bound of the excess error and the behaviors of the large eigenvalues under a spiked covariance model. In addition, we show the advantages of our method over the classical PCA by a variety of simulations. At last, we apply the new robust PCA to classify mice with different genotypes in a biological study based on their protein expression data and find that our method is more accurately on identifying abnormal mice comparing to the classical PCA.

preprint2022arXiv

Robust quantum control for the manipulation of solid-state spins

Robust and high-fidelity control of electron spins in solids is the cornerstone for facilitating applications of solid-state spins in quantum information processing and quantum sensing. However, precise control of spin systems is always challenging due to the presence of a variety of noises originating from the environment and control fields. Herein, noise-resilient quantum gates, designed with robust optimal control (ROC) algorithms, are demonstrated experimentally with nitrogen-vacancy (NV) centers in diamond to realize tailored robustness against detunings and Rabi errors simultaneously. In the presence of both 10% off-resonant detuning and deviation of a Rabi frequency, we achieve an average single-qubit gate fidelity of up to 99.97%. Our experiments also show that, ROCbased multipulse quantum sensing sequences can suppress spurious responses resulting from finite widths and imperfections of microwave pulses, which provides an efficient strategy for enhancing the performance of existing multipulse quantum sensing sequences.

preprint2022arXiv

Some Reflections on Drawing Causal Inference using Textual Data: Parallels Between Human Subjects and Organized Texts

We examine the role of textual data as study units when conducting causal inference by drawing parallels between human subjects and organized texts. %in human population research. We elaborate on key causal concepts and principles, and expose some ambiguity and sometimes fallacies. To facilitate better framing a causal query, we discuss two strategies: (i) shifting from immutable traits to perceptions of them, and (ii) shifting from some abstract concept/property to its constituent parts, i.e., adopting a constructivist perspective of an abstract concept. We hope this article would raise the awareness of the importance of articulating and clarifying fundamental concepts before delving into developing methodologies when drawing causal inference using textual data.

preprint2022arXiv

Spatial Transformation for Image Composition via Correspondence Learning

When using cut-and-paste to acquire a composite image, the geometry inconsistency between foreground and background may severely harm its fidelity. To address the geometry inconsistency in composite images, several existing works learned to warp the foreground object for geometric correction. However, the absence of annotated dataset results in unsatisfactory performance and unreliable evaluation. In this work, we contribute a Spatial TRAnsformation for virtual Try-on (STRAT) dataset covering three typical application scenarios. Moreover, previous works simply concatenate foreground and background as input without considering their mutual correspondence. Instead, we propose a novel correspondence learning network (CorrelNet) to model the correspondence between foreground and background using cross-attention maps, based on which we can predict the target coordinate that each source coordinate of foreground should be mapped to on the background. Then, the warping parameters of foreground object can be derived from pairs of source and target coordinates. Additionally, we learn a filtering mask to eliminate noisy pairs of coordinates to estimate more accurate warping parameters. Extensive experiments on our STRAT dataset demonstrate that our proposed CorrelNet performs more favorably against previous methods.

preprint2022arXiv

Statistical matching and subclassification with a continuous dose: characterization, algorithm, and application to a health outcomes study

Subclassification and matching are often used in empirical studies to adjust for observed covariates; however, they are largely restricted to relatively simple study designs with a binary treatment and less developed for designs with a continuous exposure. Matching with exposure doses is particularly useful in instrumental variable designs and in understanding the dose-response relationships. In this article, we propose two criteria for optimal subclassification based on subclass homogeneity in the context of having a continuous exposure dose, and propose an efficient polynomial-time algorithm that is guaranteed to find an optimal subclassification with respect to one criterion and serves as a 2-approximation algorithm for the other criterion. We discuss how to incorporate dose and use appropriate penalties to control the number of subclasses in the design. Via extensive simulations, we systematically compare our proposed design to optimal non-bipartite pair matching, and demonstrate that combining our proposed subclassification scheme with regression adjustment helps reduce model dependence for parametric causal inference with a continuous dose. We apply the new design and associated randomization-based inferential procedure to study the effect of transesophageal echocardiography (TEE) monitoring during coronary artery bypass graft (CABG) surgery on patients' post-surgery clinical outcomes using Medicare and Medicaid claims data, and find evidence that TEE monitoring lowers patients' all-cause $30$-day mortality rate.

preprint2022arXiv

StyleSwin: Transformer-based GAN for High-resolution Image Generation

Despite the tantalizing success in a broad of vision tasks, transformers have not yet demonstrated on-par ability as ConvNets in high-resolution image generative modeling. In this paper, we seek to explore using pure transformers to build a generative adversarial network for high-resolution image synthesis. To this end, we believe that local attention is crucial to strike the balance between computational efficiency and modeling capacity. Hence, the proposed generator adopts Swin transformer in a style-based architecture. To achieve a larger receptive field, we propose double attention which simultaneously leverages the context of the local and the shifted windows, leading to improved generation quality. Moreover, we show that offering the knowledge of the absolute position that has been lost in window-based transformers greatly benefits the generation quality. The proposed StyleSwin is scalable to high resolutions, with both the coarse geometry and fine structures benefit from the strong expressivity of transformers. However, blocking artifacts occur during high-resolution synthesis because performing the local attention in a block-wise manner may break the spatial coherency. To solve this, we empirically investigate various solutions, among which we find that employing a wavelet discriminator to examine the spectral discrepancy effectively suppresses the artifacts. Extensive experiments show the superiority over prior transformer-based GANs, especially on high resolutions, e.g., 1024x1024. The StyleSwin, without complex training strategies, excels over StyleGAN on CelebA-HQ 1024, and achieves on-par performance on FFHQ-1024, proving the promise of using transformers for high-resolution image generation. The code and models will be available at https://github.com/microsoft/StyleSwin.

preprint2022arXiv

Testing Biased Randomization Assumptions and Quantifying Imperfect Matching and Residual Confounding in Matched Observational Studies

One central goal of design of observational studies is to embed non-experimental data into an approximate randomized controlled trial using statistical matching. Despite empirical researchers' best intention and effort to create high-quality matched samples, residual imbalance due to observed covariates not being well matched often persists. Although statistical tests have been developed to test the randomization assumption and its implications, few provide a means to quantify the level of residual confounding due to observed covariates not being well matched in matched samples. In this article, we develop two generic classes of exact statistical tests for a biased randomization assumption. One important by-product of our testing framework is a quantity called residual sensitivity value (RSV), which provides a means to quantify the level of residual confounding due to imperfect matching of observed covariates in a matched sample. We advocate taking into account RSV in the downstream primary analysis. The proposed methodology is illustrated by re-examining a famous observational study concerning the effect of right heart catheterization (RHC) in the initial care of critically ill patients. Code implementing the method can be found in the supplementary materials.

preprint2022arXiv

The Eclipsing Binaries from the LAMOST Medium-resolution Survey.III. A High-precision Empirical Stellar Mass Library

High-precision stellar mass and radius measured directly from binaries can effectively calibrate the stellar models. However, such a database containing full spectral types and large range of metallicity is still not fully established. A continuous effort of data collecting and analysis are requested to complete the database. In this work, we provide a catalog containing 184 binaries with independent atmospheric parameters and accurate masses and radii as the benchmark of stellar mass and radius. The catalog contains 56 new detached binaries from LAMOST Medium-resolution spectroscopic (MRS) survey and 128 detached eclipsing binaries compiled from previous studies. We obtain the orbital solutions of the new detached binaries with uncertainties of masses and radii smaller than 5%. These new samples densify the distribution of metallicity of the high-precision stellar mass library and add 9 hot stars with Teff>8000 K. Comparisons show that these samples well agree with the PARSEC isochrones in Teff-logg-mass-radius-luminosity space. We compare mass and radius estimates from isochrone and SED fitting, respectively, with those from the binary orbital solution. We find that the precision of the stellar-model dependent mass estimates is >10% and the precision of the radius estimates based on atmospheric parameters is >15%. These give a general view of the uncertainty of the usual approaches to estimate stellar mass and radius.

preprint2022arXiv

The Properties and Evolutions of Starspots on Three Detached Eclipsing Binaries in the LAMOST-Kepler survey

The spotted detached eclipsing binary (DEB) offers insights into starspots on the binary. Three spotted DEBs, KIC 8097825, KIC 6859813, and KIC 5527172, which were observed by the Kepler photometry and LAMOST spectroscopy, are studied in this work. The physical parameters of binaries are determined by binary modeling. The sizes, lifetimes, and single/double-dip ratio (SDR) of starspots are derived by starspot analysis. KIC 8097825 has large starspots. KIC 6859813 has a spot rotation period shorter than its orbital period but the system should be synchronized inferred from timescale estimation. The difference may be the result of the surface differential rotation. The KIC 5527172 has a long spot lifetime and an M dwarf component with an inflation radius. The primaries of these binaries and the secondary of KIC 8097825 have spots. Adding spotted DEBs of literature, we compare the starspots on binaries with those on the single stars. The spot sizes of starspots on 65% binaries are smaller than the median of those on single stars. The lifetimes of starspots on binaries are consistent with those on single stars when the rotation periods are larger than 3 days. SDRs for half of the binaries are consistent with those of single star systems, while another half are smaller. The relative lifetime positively correlates with the RMS and SDR but negatively correlates with the rotation period. These relations are similar to those of spots on the single star systems. Binaries with luminosity ratios close to the unit tend to have more double dips.

preprint2022arXiv

The Role of Placebo Samples in Observational Studies

In an observational study, it is common to leverage known null effect to detect bias. One such strategy is to set aside a placebo sample -- a subset of data immune from the hypothesized cause-and-effect relationship. Existence of an effect in the placebo sample raises concern of unmeasured confounding bias while absence of it corroborates the causal conclusion. This paper establishes a formal framework for using a placebo sample to detect and remove bias. We state identification assumption, and develop estimation and inference methods based on outcome regression, inverse probability weighting, and doubly-robust approaches. Simulation studies and an empirical application illustrate the finite-sample performance of the proposed methods.

preprint2022arXiv

TransLog: A Unified Transformer-based Framework for Log Anomaly Detection

Log anomaly detection is a key component in the field of artificial intelligence for IT operations (AIOps). Considering log data of variant domains, retraining the whole network for unknown domains is inefficient in real industrial scenarios especially for low-resource domains. However, previous deep models merely focused on extracting the semantics of log sequence in the same domain, leading to poor generalization on multi-domain logs. Therefore, we propose a unified Transformer-based framework for log anomaly detection (\ourmethod{}), which is comprised of the pretraining and adapter-based tuning stage. Our model is first pretrained on the source domain to obtain shared semantic knowledge of log data. Then, we transfer the pretrained model to the target domain via the adapter-based tuning. The proposed method is evaluated on three public datasets including one source domain and two target domains. The experimental results demonstrate that our simple yet efficient approach, with fewer trainable parameters and lower training costs in the target domain, achieves state-of-the-art performance on three benchmarks.

preprint2022arXiv

Uniqueness in inverse diffraction grating problems with infinitely many plane waves at a fixed frequency

This paper is concerned with the inverse diffraction problems by a periodic curve with Dirichlet boundary condition in two dimensions. It is proved that the periodic curve can be uniquely determined by the near-field measurement data corresponding to infinitely many incident plane waves with distinct directions at a fixed frequency. Our proof is based on Schiffer's idea which consists of two ingredients: i) the total fields for incident plane waves with distinct directions are linearly independent, and ii) there exist only finitely many linearly independent Dirichlet eigenfunctions in a bounded domain or in a closed waveguide under additional assumptions on the waveguide boundary. Based on the Rayleigh expansion, we prove that the phased near-field data can be uniquely determined by the phaseless near-field data in a bounded domain, with the exception of a finite set of incident angles. Such a phase retrieval result leads to a new uniqueness result for the inverse grating diffraction problem with phaseless near-field data at a fixed frequency. Since the incident direction determines the quasi-periodicity of the boundary value problem, our inverse issues are different from the existing results of [Htttlich & Kirsch, Inverse Problems 13 (1997): 351-361] where fixed-direction plane waves at multiple frequencies were considered.

preprint2022arXiv

Water Maser Survey towards off-plane O-rich AGBs around the orbital plane of the Sagittarius Stellar Stream

A 22 GHz water maser survey was conducted towards 178 O-rich AGB stars with the aim of identifying maser emission associated with the Sagittarius stellar stream. In this survey, maser emissions were detected in 21 targets, of which 20 were new detections. We studied the Galactic distributions of H2O and SiO maser-traced AGBs towards the Sgr orbital plane, and found an elongated structure towards the (l, b)~(340, 40) direction. In order to verify its association with the Sagittarius tidal stream, we further studied the 3D motions of these sources, but found, kinematically, these maser-traced AGBs are still Galactic disc sources rather than Stream debris. In addition, we found a remarkable outward motion, ~50 km/s away from the Galactic center of these maser-traced AGBs, but with no systermatic lag of rotational speed which were reported in 2000 for solar neighborhood Miras.

preprint2022arXiv

YOLOv6: A Single-Stage Object Detection Framework for Industrial Applications

For years, the YOLO series has been the de facto industry-level standard for efficient object detection. The YOLO community has prospered overwhelmingly to enrich its use in a multitude of hardware platforms and abundant scenarios. In this technical report, we strive to push its limits to the next level, stepping forward with an unwavering mindset for industry application. Considering the diverse requirements for speed and accuracy in the real environment, we extensively examine the up-to-date object detection advancements either from industry or academia. Specifically, we heavily assimilate ideas from recent network design, training strategies, testing techniques, quantization, and optimization methods. On top of this, we integrate our thoughts and practice to build a suite of deployment-ready networks at various scales to accommodate diversified use cases. With the generous permission of YOLO authors, we name it YOLOv6. We also express our warm welcome to users and contributors for further enhancement. For a glimpse of performance, our YOLOv6-N hits 35.9% AP on the COCO dataset at a throughput of 1234 FPS on an NVIDIA Tesla T4 GPU. YOLOv6-S strikes 43.5% AP at 495 FPS, outperforming other mainstream detectors at the same scale~(YOLOv5-S, YOLOX-S, and PPYOLOE-S). Our quantized version of YOLOv6-S even brings a new state-of-the-art 43.3% AP at 869 FPS. Furthermore, YOLOv6-M/L also achieves better accuracy performance (i.e., 49.5%/52.3%) than other detectors with a similar inference speed. We carefully conducted experiments to validate the effectiveness of each component. Our code is made available at https://github.com/meituan/YOLOv6.

preprint2021arXiv

AutoKWS: Keyword Spotting with Differentiable Architecture Search

Smart audio devices are gated by an always-on lightweight keyword spotting program to reduce power consumption. It is however challenging to design models that have both high accuracy and low latency for accurate and fast responsiveness. Many efforts have been made to develop end-to-end neural networks, in which depthwise separable convolutions, temporal convolutions, and LSTMs are adopted as building units. Nonetheless, these networks designed with human expertise may not achieve an optimal trade-off in an expansive search space. In this paper, we propose to leverage recent advances in differentiable neural architecture search to discover more efficient networks. Our searched model attains 97.2% top-1 accuracy on Google Speech Command Dataset v1 with only nearly 100K parameters.

preprint2021arXiv

Convergence of the uniaxial PML method for time-domain electromagnetic scattering problems

In this paper, we propose and study the uniaxial perfectly matched layer (PML) method for three-dimensional time-domain electromagnetic scattering problems, which has a great advantage over the spherical one in dealing with problems involving anisotropic scatterers. The truncated uniaxial PML problem is proved to be well-posed and stable, based on the Laplace transform technique and the energy method. Moreover, the $L^2$-norm and $L^{\infty}$-norm error estimates in time are given between the solutions of the original scattering problem and the truncated PML problem, leading to the exponential convergence of the time-domain uniaxial PML method in terms of the thickness and absorbing parameters of the PML layer. The proof depends on the error analysis between the EtM operators for the original scattering problem and the truncated PML problem, which is different from our previous work (SIAM J. Numer. Anal. 58(3) (2020), 1918-1940).

preprint2021arXiv

DARTS-: Robustly Stepping out of Performance Collapse Without Indicators

Despite the fast development of differentiable architecture search (DARTS), it suffers from long-standing performance instability, which extremely limits its application. Existing robustifying methods draw clues from the resulting deteriorated behavior instead of finding out its causing factor. Various indicators such as Hessian eigenvalues are proposed as a signal to stop searching before the performance collapses. However, these indicator-based methods tend to easily reject good architectures if the thresholds are inappropriately set, let alone the searching is intrinsically noisy. In this paper, we undertake a more subtle and direct approach to resolve the collapse. We first demonstrate that skip connections have a clear advantage over other candidate operations, where it can easily recover from a disadvantageous state and become dominant. We conjecture that this privilege is causing degenerated performance. Therefore, we propose to factor out this benefit with an auxiliary skip connection, ensuring a fairer competition for all operations. We call this approach DARTS-. Extensive experiments on various datasets verify that it can substantially improve robustness. Our code is available at https://github.com/Meituan-AutoML/DARTS- .

preprint2021arXiv

Data completion algorithms and their applications in inverse acoustic scattering with limited-aperture backscattering data

We introduce two data completion algorithms for the limited-aperture problems in inverse acoustic scattering. Both completion algorithms are independent of the topological and physical properties of the unknown scatterers. The main idea is to relate the limited-aperture data to the full-aperture data via the prolate matrix. The data completion algorithms are simple and fast since only the approximate inversion of the prolate matrix is involved. We then combine the data completion algorithms with imaging methods such as factorization method and direct sampling method for the object reconstructions. A variety of numerical examples are presented to illustrate the effectiveness and robustness of the proposed algorithms.

preprint2021arXiv

Deep Sketch-guided Cartoon Video Inbetweening

We propose a novel framework to produce cartoon videos by fetching the color information from two input keyframes while following the animated motion guided by a user sketch. The key idea of the proposed approach is to estimate the dense cross-domain correspondence between the sketch and cartoon video frames, and employ a blending module with occlusion estimation to synthesize the middle frame guided by the sketch. After that, the input frames and the synthetic frame equipped with established correspondence are fed into an arbitrary-time frame interpolation pipeline to generate and refine additional inbetween frames. Finally, a module to preserve temporal consistency is employed. Compared to common frame interpolation methods, our approach can address frames with relatively large motion and also has the flexibility to enable users to control the generated video sequences by editing the sketch guidance. By explicitly considering the correspondence between frames and the sketch, we can achieve higher quality results than other image synthesis methods. Our results show that our system generalizes well to different movie frames, achieving better results than existing solutions.

preprint2021arXiv

Efficient Compressed Sensing Based Image Coding by Using Gray Transformation

In recent years, compressed sensing (CS) based image coding has become a hot topic in image processing field. However, since the bit depth required for encoding each CS sample is too large, the compression performance of this paradigm is unattractive. To address this issue, a novel CS-based image coding system by using gray transformation is proposed. In the proposed system, we use a gray transformation to preprocess the original image firstly and then use CS to sample the transformed image. Since gray transformation makes the probability distribution of CS samples centralized, the bit depth required for encoding each CS sample is reduced significantly. Consequently, the proposed system can considerably improve the compression performance of CS-based image coding. Simulation results show that the proposed system outperforms the traditional one without using gray transformation in terms of compression performance.

preprint2021arXiv

LAMOST Time-Domain Survey: First Results of four $K$2 plates

From Oct. 2019 to Apr. 2020, LAMOST performs a time-domain spectroscopic survey of four $K$2 plates with both low- and med-resolution observations. The low-resolution spectroscopic survey gains 282 exposures ($\approx$46.6 hours) over 25 nights, yielding a total of about 767,000 spectra, and the med-resolution survey takes 177 exposures ($\approx$49.1 hours) over 27 nights, collecting about 478,000 spectra. More than 70%/50% of low-resolution/med-resolution spectra have signal-to-noise ratio higher than 10. We determine stellar parameters (e.g., $T_{\rm eff}$, log$g$, [Fe/H]) and radial velocity (RV) with different methods, including LASP, DD-Payne, and SLAM. In general, these parameter estimations from different methods show good agreement, and the stellar parameter values are consistent with those of APOGEE. We use the $Gaia$ DR2 RV data to calculate a median RV zero point (RVZP) for each spectrograph exposure by exposure, and the RVZP-corrected RVs agree well with the APOGEE data. The stellar evolutionary and spectroscopic masses are estimated based on the stellar parameters, multi-band magnitudes, distances and extinction values. Finally, we construct a binary catalog including about 2700 candidates by analyzing their light curves, fitting the RV data, calculating the binarity parameters from med-resolution spectra, and cross-matching the spatially resolved binary catalog from $Gaia$ EDR3. The LAMOST TD survey is expected to get breakthrough in various scientific topics, such as binary system, stellar activity, and stellar pulsation, etc.

preprint2021arXiv

LTD064402+245919: A Subgiant with a 1-3 M$_{\odot}$ Undetected Companion Identified from LAMOST-TD Data

Single-line spectroscopic binaries recently contribute to the stellar-mass black hole discovery, independently of the X-ray transient method. We report the identification of a single-line binary system LTD064402+245919, with an orbital period of 14.50 days. The observed component is a subgiant with a mass of 2.77$\pm$0.68M$_{\odot}$, radius 15.5$\pm$2.5R$_{\odot}$, effective temperature $T_{\rm eff}$ 4500$\pm$200K, and surface gravity log\emph{g} 2.5$\pm$0.25dex. The discovery makes use of the LAMOST time-domain (LAMOST-TD) and ZTF survey. Our general-purpose software pipeline applies the Lomb-Scargle periodogram to determine the orbital period and uses machine-learning to classify the variable type from the folded light curves. We apply a combined model to estimate the orbital parameters from both the light and radial velocity curves, taking constraints on the primary star mass, mass function, and detection limit of secondary luminosity into consideration. We obtain a radial velocity semi-amplitude of 44.6$\pm$1.5 km s$^{-1}$, mass ratio of 0.73$\pm$0.07, and an undetected component mass of 2.02$\pm$0.49M$_{\odot}$ when the type of the undetected component is not set. We conclude that the inclination is not well constrained, and that the secondary mass is larger than 1M$_{\odot}$ when the undetected component is modelled as a compact object. According to our investigations using an MCMC simulation, increasing the spectra SNR by a factor of 3 would enable the secondary light to be distinguished (if present). The algorithm and software in this work are able to serve as general-purpose tools for the identification of compact objects quiescent in X-rays.

preprint2021arXiv

Polar state memory in active fluids

Spontaneous emergence of correlated states such as flocks and vortices are prime examples of remarkable collective dynamics and self-organization observed in active matter. The formation of globally correlated polar states in geometrically confined systems proceeds through the emergence of a macroscopic steadily rotating vortex that spontaneously selects a clockwise or counterclockwise global chiral state. Here, we reveal that a global vortex formed by colloidal rollers exhibits state memory. The information remains stored even when the energy injection is ceased and the activity is terminated. We show that a subsequent formation of the collective states upon re-energizing the system is not random. We combine experiments and simulations to elucidate how a combination of hydrodynamic and electrostatic interactions leads to hidden asymmetries in the local particle positional order encoding the chiral state of the system. The stored information can be accessed and exploited to systematically command subsequent polar states of active liquid through temporal control of the activity. With the chirality of the emergent collective states controlled on-demand, active liquids offer new possibilities for flow manipulation, transport, and mixing at the microscale.

preprint2021arXiv

Prototypical Pseudo Label Denoising and Target Structure Learning for Domain Adaptive Semantic Segmentation

Self-training is a competitive approach in domain adaptive segmentation, which trains the network with the pseudo labels on the target domain. However inevitably, the pseudo labels are noisy and the target features are dispersed due to the discrepancy between source and target domains. In this paper, we rely on representative prototypes, the feature centroids of classes, to address the two issues for unsupervised domain adaptation. In particular, we take one step further and exploit the feature distances from prototypes that provide richer information than mere prototypes. Specifically, we use it to estimate the likelihood of pseudo labels to facilitate online correction in the course of training. Meanwhile, we align the prototypical assignments based on relative feature distances for two different views of the same target, producing a more compact target feature space. Moreover, we find that distilling the already learned knowledge to a self-supervised pretrained model further boosts the performance. Our method shows tremendous performance advantage over state-of-the-art methods. We will make the code publicly available.

preprint2021arXiv

Robust Dynamical Decoupling for the Manipulation of a Spin Network via a Single Spin

High-fidelity control of quantum systems is crucial for quantum information processing, but is often limited by perturbations from the environment and imperfections in the applied control fields. Here, we investigate the combination of dynamical decoupling (DD) and robust optimal control (ROC) to address this problem. In this combination, ROC is employed to find robust shaped pulses, wherein the directional derivatives of the controlled dynamics with respect to control errors are reduced to a desired order. Then, we incorporate ROC pulses into DD sequences, achieving a remarkable improvement of robustness against multiple error channels. We demonstrate this method in the example of manipulating nuclear spin bath via an electron spin in the NV center system. Simulation results indicate that ROC based DD sequences outperform the state-of-the-art robust DD sequences. Our work has implications for robust quantum control on near-term noisy quantum devices.

preprint2021arXiv

Simultaneous recovery of a locally rough interface and the embedded obstacle with its surrounding medium

Consider the scattering of time-harmonic point sources by an infinite locally rough interface with bounded obstacles embedded in the lower half-space. The model problem is first reduced to an equivalent integral equation formulation defined in a bounded domain, where the well-posedness is obtained in $L^p$ by the classical Fredholm theory. Then a global uniqueness theorem is proved for the inverse problem of recovering the locally rough interface, the embedded obstacles and the wave number in the lower-half space by means of near-field measurements above the interface.

preprint2021arXiv

The Spectroscopic Binaries from LAMOST Medium-Resolution Survey (MRS). I. Searching for Double-lined Spectroscopic Binaries (SB2s) with Convolutional Neural Network

We developed a convolutional neural network (CNN) model to distinguish the double-lined spectroscopic binaries (SB2s) from others based on single exposure medium-resolution spectra ($R\sim 7,500$). The training set consists of a large set of mock spectra of single stars and binaries synthesized based on the MIST stellar evolutionary model and ATLAS9 atmospheric model. Our model reaches a novel theoretic false positive rate by adding a proper penalty on the negative sample (e.g., 0.12\% and 0.16\% for the blue/red arm when the penalty parameter $Λ=16$). Tests show that the performance is as expected and favors FGK-type Main-sequence binaries with high mass ratio ($q \geq 0.7$) and large radial velocity separation ($Δv \geq 50\,\mathrm{km\,s^{-1}}$). Although the real false positive rate can not be estimated reliably, validating on eclipsing binaries identified from Kepler light curves indicates that our model predicts low binary probabilities at eclipsing phases (0, 0.5, and 1.0) as expected. The color-magnitude diagram also helps illustrate its feasibility and capability of identifying FGK MS binaries from spectra. We conclude that this model is reasonably reliable and can provide an automatic approach to identify SB2s with period $\lesssim 10$ days. This work yields a catalog of binary probabilities for over 5 million spectra of 1 million sources from the LAMOST medium-resolution survey (MRS), and a catalog of 2198 SB2 candidates whose physical properties will be analyzed in our following-up paper. Data products are made publicly available at the journal as well as our Github website.

preprint2021arXiv

Time domain analysis for electromagnetic scattering by an elastic obstacle in a two-layered medium

In this paper, we consider the scattering of a time-dependent electromagnetic wave by an elastic body immersed in the lower half-space of a two-layered background medium which is separated by an unbounded rough surface. By proposing two exact transparent boundary conditions (TBCs) on the artificial planes, we reformulate the unbounded scattering problem into an equivalent initial-boundary value problem in a strip domain with the well-posedness and stability proved using the Laplace transform, variational method and energy method. A perfectly matched layer (PML) is then introduced to truncate the interaction problem with two finite layers containing the elastic body, leading to a PML problem in a finite strip domain. We further verify the existence, uniqueness and stability estimate of solution for the PML problem. Finally, we establish the exponential convergence in terms of the thickness and parameters of the PML layers via an error estimate on the electric-to-magnetic (EtM) capacity operators between the original problem and the PML problem.

preprint2021arXiv

User-Level Privacy-Preserving Federated Learning: Analysis and Performance Optimization

Federated learning (FL), as a type of collaborative machine learning framework, is capable of preserving private data from mobile terminals (MTs) while training the data into useful models. Nevertheless, from a viewpoint of information theory, it is still possible for a curious server to infer private information from the shared models uploaded by MTs. To address this problem, we first make use of the concept of local differential privacy (LDP), and propose a user-level differential privacy (UDP) algorithm by adding artificial noise to the shared models before uploading them to servers. According to our analysis, the UDP framework can realize $(ε_{i}, δ_{i})$-LDP for the $i$-th MT with adjustable privacy protection levels by varying the variances of the artificial noise processes. We then derive a theoretical convergence upper-bound for the UDP algorithm. It reveals that there exists an optimal number of communication rounds to achieve the best learning performance. More importantly, we propose a communication rounds discounting (CRD) method. Compared with the heuristic search method, the proposed CRD method can achieve a much better trade-off between the computational complexity of searching and the convergence performance. Extensive experiments indicate that our UDP algorithm using the proposed CRD method can effectively improve both the training efficiency and model quality for the given privacy protection levels.

preprint2020arXiv

A calibrated sensitivity analysis for matched observational studies with application to the effect of second-hand smoke exposure on blood lead levels in U.S. children

Matched observational studies are commonly used to study treatment effects in non-randomized data. After matching for observed confounders, there could remain bias from unobserved confounders. A standard way to address this problem is to do a sensitivity analysis. A sensitivity analysis asks how sensitive the result is to a hypothesized unmeasured confounder U. One method, known as simultaneous sensitivity analysis, has two sensitivity parameters: one relating U to treatment assignment and the other to response. This method assumes that in each matched set, U is distributed to make the bias worst. This approach has two concerning features. First, this worst case distribution of U in each matched set does not correspond to a realistic distribution of U in the population. Second, sensitivity parameters are in absolute scales which are hard to compare to observed covariates. We address these concerns by introducing a method that endows U with a probability distribution in the population and calibrates the unmeasured confounder to the observed covariates. We compare our method to simultaneous sensitivity analysis in simulations and in a study of the effect of second-hand smoke exposure on blood lead levels in U.S. children.

preprint2020arXiv

A Catalog of Short Period Spectroscopic and Eclipsing Binaries Identified from the LAMOST & PTF Surveys

Binaries play key roles in determining stellar parameters and exploring stellar evolution models. We build a catalog of 88 eclipsing binaries with spectroscopic information, taking advantage of observations from both the Large Sky Area Multi-Object fiber Spectroscopic Telescope (LAMOST) and the Palomar Transient Factory (PTF) surveys. A software pipeline is constructed to identify binary candidates by examining their light curves. The orbital periods of binaries are derived from the Lomb-Scargle method. The key distinguishing features of eclipsing binaries are recognized by a new filter \textit{Flat Test}. We classify the eclipsing binaries by applying Fourier analysis on the light curves. Among all the binary stars, 13 binaries are identified as eclipsing binaries for the first time. The catalog contains information: position, primary eclipsing magnitude and time, eclipsing depth, the number of photometry and radial velocity observations, largest radial velocity difference, binary type, the effective temperature of observable star $T_{\rm eff}$, and surface gravity of observable star log \emph{g}. The false-positive probability is calculated by using both a Monte Carlo simulation and real data from the SDSS Stripe 82 Standard Catalog. The binaries in the catalog are mostly with a period of less than one day. The period distribution shows a 0.22-day cut-off which is consistent with the low probability of an eclipsing binary rotating with such a period.

preprint2020arXiv

A linear sampling method for inverse acoustic scattering by a locally rough interface

This paper is concerned with the inverse problem of time-harmonic acoustic scattering by an unbounded, locally rough interface which is assumed to be a local perturbation of a plane. The purpose of this paper is to recover the local perturbation of the interface from the near-field measurement given on a straight line segment with a finite distance above the interface and generated by point sources. Precisely, we propose a novel version of the linear sampling method to recover the location and shape of the local perturbation of the interface numerically. Our method is based on a modified near-field operator equation associated with a special rough surface, constructed by reformulating the forward scattering problem into an equivalent integral equation formulation in a bounded domain, leading to a fast imaging algorithm. Numerical experiments are presented to illustrate the effectiveness of the imaging method.

preprint2020arXiv

A Wasserstein Minimum Velocity Approach to Learning Unnormalized Models

Score matching provides an effective approach to learning flexible unnormalized models, but its scalability is limited by the need to evaluate a second-order derivative. In this paper, we present a scalable approximation to a general family of learning objectives including score matching, by observing a new connection between these objectives and Wasserstein gradient flows. We present applications with promise in learning neural density estimators on manifolds, and training implicit variational and Wasserstein auto-encoders with a manifold-valued prior.

preprint2020arXiv

Automatic, Dynamic, and Nearly Optimal Learning Rate Specification by Local Quadratic Approximation

In deep learning tasks, the learning rate determines the update step size in each iteration, which plays a critical role in gradient-based optimization. However, the determination of the appropriate learning rate in practice typically replies on subjective judgement. In this work, we propose a novel optimization method based on local quadratic approximation (LQA). In each update step, given the gradient direction, we locally approximate the loss function by a standard quadratic function of the learning rate. Then, we propose an approximation step to obtain a nearly optimal learning rate in a computationally efficient way. The proposed LQA method has three important features. First, the learning rate is automatically determined in each update step. Second, it is dynamically adjusted according to the current loss function value and the parameter estimates. Third, with the gradient direction fixed, the proposed method leads to nearly the greatest reduction in terms of the loss function. Extensive experiments have been conducted to prove the strengths of the proposed LQA method.

preprint2020arXiv

Bringing Old Photos Back to Life

We propose to restore old photos that suffer from severe degradation through a deep learning approach. Unlike conventional restoration tasks that can be solved through supervised learning, the degradation in real photos is complex and the domain gap between synthetic images and real old photos makes the network fail to generalize. Therefore, we propose a novel triplet domain translation network by leveraging real photos along with massive synthetic image pairs. Specifically, we train two variational autoencoders (VAEs) to respectively transform old photos and clean photos into two latent spaces. And the translation between these two latent spaces is learned with synthetic paired data. This translation generalizes well to real photos because the domain gap is closed in the compact latent space. Besides, to address multiple degradations mixed in one old photo, we design a global branch with a partial nonlocal block targeting to the structured defects, such as scratches and dust spots, and a local branch targeting to the unstructured defects, such as noises and blurriness. Two branches are fused in the latent space, leading to improved capability to restore old photos from multiple defects. The proposed method outperforms state-of-the-art methods in terms of visual quality for old photos restoration.

preprint2020arXiv

Convergence analysis of the PML method for time-domain electromagnetic scattering problems

In this paper, a perfectly matched layer (PML) method is proposed to solve the time-domain electromagnetic scattering problems in 3D effectively. The PML problem is defined in a spherical layer and derived by using the Laplace transform and real coordinate stretching in the frequency domain. The well-posedness and the stability estimate of the PML problem are first proved based on the Laplace transform and the energy method. The exponential convergence of the PML method is then established in terms of the thickness of the layer and the PML absorbing parameter. As far as we know, this is the first convergence result for the time-domain PML method for the three-dimensional Maxwell equations. Our proof is mainly based on the stability estimates of solutions of the truncated PML problem and the exponential decay estimates of the stretched dyadic Green's function for the Maxwell equations in the free space.

preprint2020arXiv

Cross-domain Correspondence Learning for Exemplar-based Image Translation

We present a general framework for exemplar-based image translation, which synthesizes a photo-realistic image from the input in a distinct domain (e.g., semantic segmentation mask, or edge map, or pose keypoints), given an exemplar image. The output has the style (e.g., color, texture) in consistency with the semantically corresponding objects in the exemplar. We propose to jointly learn the crossdomain correspondence and the image translation, where both tasks facilitate each other and thus can be learned with weak supervision. The images from distinct domains are first aligned to an intermediate domain where dense correspondence is established. Then, the network synthesizes images based on the appearance of semantically corresponding patches in the exemplar. We demonstrate the effectiveness of our approach in several image translation tasks. Our method is superior to state-of-the-art methods in terms of image quality significantly, with the image style faithful to the exemplar with semantic consistency. Moreover, we show the utility of our method for several applications

preprint2020arXiv

Deep residual detection of radio frequency interference for FAST

Radio frequency interference (RFI) detection and excision are key steps in the data-processing pipeline of the Five-hundred-meter Aperture Spherical radio Telescope (FAST). Because of its high sensitivity and large data rate, FAST requires more accurate and efficient RFI flagging methods than its counterparts. In the last decades, approaches based upon artificial intelligence (AI), such as codes using convolutional neural networks (CNNs), have been proposed to identify RFI more reliably and efficiently. However, RFI flagging of FAST data with such methods has often proved to be erroneous, with further manual inspections required. In addition, network construction as well as preparation of training data sets for effective RFI flagging has imposed significant additional workloads. Therefore, rapid deployment and adjustment of AI approaches for different observations is impractical to implement with existing algorithms. To overcome such problems, we propose a model called RFI-Net. With the input of raw data without any processing, RFI-Net can detect RFI automatically, producing corresponding masks without any alteration of the original data. Experiments with RFI-Net using simulated astronomical data show that our model has outperformed existing methods in terms of both precision and recall. Besides, compared with other models, our method can obtain the same relative accuracy with fewer training data, thus reducing the effort and time required to prepare the training data set. Further, the training process of RFI-Net can be accelerated, with overfittings being minimized, compared with other CNN codes. The performance of RFI-Net has also been evaluated with observing data obtained by FAST and the Bleien Observatory. Our results demonstrate the ability of RFI-Net to accurately identify RFI with fine-grained, high-precision masks that required no further modification.

preprint2020arXiv

Differential rotation of the halo traced by the K-giant stars

We use K-giant stars selected from the LAMOST DR5 to study the variation of the rotational velocity of the galactic halo at different space positions. Modelling the rotational velocity distribution with both the halo and disk components, we find that the rotational velocity of the halo population decreases almost linearly with increasing vertical distance to the galactic disk plane, $Z$, at fixed galactocentric radius, $R$. The samples are separated into two parts with $6<R<12$ kpc and $12<R<20$ kpc. We derive that the decreasing rates along $Z$ for the two subsamples are $-3.07\pm0.63$ and $-1.89\pm0.37$ km s$^{-1}$ kpc$^{-1}$, respectively. Compared with the TNG simulations, we suggest that this trend is probably caused by the interaction between the disk and halo. The results from the simulations show that only the oblate halo can provide a decreasing rotational velocity with an increasing $Z$. This indicates that the Galactic halo is oblate with galactocentric radius $R<20$ kpc. On the other hand, the flaring of the disk component (mainly the thick disk) is clearly traced by this study, with $R$ between 12 and 20 kpc, the disk can vertically extend to $6\sim10$ kpc above the disk plane. What is more interesting is that, we find the Gaia-Enceladus-Sausage (GES) component has a significant contribution only in the halo with $R<12$ kpc, i.e. a fraction of 23$-$47\%. While in the outer subsample, the contribution is too low to be well constrained.

preprint2020arXiv

Fair DARTS: Eliminating Unfair Advantages in Differentiable Architecture Search

Differentiable Architecture Search (DARTS) is now a widely disseminated weight-sharing neural architecture search method. However, it suffers from well-known performance collapse due to an inevitable aggregation of skip connections. In this paper, we first disclose that its root cause lies in an unfair advantage in exclusive competition. Through experiments, we show that if either of two conditions is broken, the collapse disappears. Thereby, we present a novel approach called Fair DARTS where the exclusive competition is relaxed to be collaborative. Specifically, we let each operation's architectural weight be independent of others. Yet there is still an important issue of discretization discrepancy. We then propose a zero-one loss to push architectural weights towards zero or one, which approximates an expected multi-hot solution. Our experiments are performed on two mainstream search spaces, and we derive new state-of-the-art results on CIFAR-10 and ImageNet. Our code is available on https://github.com/xiaomi-automl/fairdarts .

preprint2020arXiv

Fast, Accurate and Lightweight Super-Resolution with Neural Architecture Search

Deep convolutional neural networks demonstrate impressive results in the super-resolution domain. A series of studies concentrate on improving peak signal noise ratio (PSNR) by using much deeper layers, which are not friendly to constrained resources. Pursuing a trade-off between the restoration capacity and the simplicity of models is still non-trivial. Recent contributions are struggling to manually maximize this balance, while our work achieves the same goal automatically with neural architecture search. Specifically, we handle super-resolution with a multi-objective approach. We also propose an elastic search tactic at both micro and macro level, based on a hybrid controller that profits from evolutionary computation and reinforcement learning. Quantitative experiments help us to draw a conclusion that our generated models dominate most of the state-of-the-art methods with respect to the individual FLOPS.

preprint2020arXiv

Galaxy Optical Variability of Virgo Cluster: New Tracer for Environmental Influences on Galaxies

We investigate the relationship between the optical variability of galaxies and their distances from the centre of the Virgo Cluster using Palomar Transient Factory data. We define the ratio between the standard deviation of the galaxy brightness and the mean value of the standard deviation as a measure of a galaxy's optical variability. A sample of 814 Virgo galaxies with 230263 observations shows a monotonically decreasing trend of optical variability with increasing clustercentric distance. The variability level inside the cluster is 3.2$σ$ higher than the level outside. We fit the variability with a linear function and find that the data reject a distance-independent model. We examine 217 background galaxies for comparison and find no significant trend in galaxy variability. We assess the relation with Monte Carlo simulation by rebuilding the brightness of each galaxy. The simulation shows a monotonically decreasing relation for member galaxy variability and a distance-independent relation for background galaxies. Our result is consistent with the theory that the cold gas flowing inwards the cluster centre fuels AGN activity. This work is a new implementation of the method using optical variability to investigate the relation between galaxies evolution and their environment.

preprint2020arXiv

HCGrid: A Convolution-based Gridding Framework for RadioAstronomy in Hybrid Computing Environments

Gridding operation, which is to map non-uniform data samples onto a uniformly distributedgrid, is one of the key steps in radio astronomical data reduction process. One of the mainbottlenecks of gridding is the poor computing performance, and a typical solution for suchperformance issue is the implementation of multi-core CPU platforms. Although such amethod could usually achieve good results, in many cases, the performance of gridding is stillrestricted to an extent due to the limitations of CPU, since the main workload of gridding isa combination of a large number of single instruction, multi-data-stream operations, which ismore suitable for GPU, rather than CPU implementations. To meet the challenge of massivedata gridding for the modern large single-dish radio telescopes, e.g., the Five-hundred-meterAperture Spherical radio Telescope (FAST), inspired by existing multi-core CPU griddingalgorithms such as Cygrid, here we present an easy-to-install, high-performance, and open-source convolutional gridding framework, HCGrid,in CPU-GPU heterogeneous platforms. Itoptimises data search by employing multi-threading on CPU, and accelerates the convolutionprocess by utilising massive parallelisation of GPU. In order to make HCGrid a more adaptivesolution, we also propose the strategies of thread organisation and coarsening, as well as optimalparameter settings under various GPU architectures. A thorough analysis of computing timeand performance gain with several GPU parallel optimisation strategies show that it can leadto excellent performance in hybrid computing environments.

preprint2020arXiv

Integrated and Spectrally Selective Thermal Emitters Enabled by Layered Metamaterials

Nanophotonic engineering of light-matter interaction at subwavelength scale allows thermal radiation that is fundamentally different from that of traditional thermal emitters and provides exciting opportunities for various thermal-photonic applications. We propose a new kind of integrated and electrically controlled thermal emitter that exploits layered metamaterials with lithography-free and dielectric/metallic nanolayers. We demonstrate both theoretically and experimentally that the proposed concept can create a strong photonic bandgap in the visible regime and allow small impedance mismatch at the infrared wavelengths, which gives rise to optical features of significantly enhanced emissivity at the broad infrared wavelengths of 1.4-14 um as well as effectively suppressed emissivity in the visible region. The electrically driven metamaterial devices are optically and thermally stable at temperature up to ~800 K with electro-optical conversion efficiency reaching ~30%. We believe that the proposed high efficiency thermal emitters will pave the way towards integrated infrared light source platforms for various thermal-photonic applications and particularly provide a novel alternative for cost-effective, compact, low glare, and energy-efficient infrared heating.

preprint2020arXiv

LAMOST Medium-Resolution Spectroscopic Survey (LAMOST-MRS): Scientific goals and survey plan

Since September 2018, LAMOST starts a new 5-year medium-resolution spectroscopic survey (MRS) using bright/gray nights. We present the scientific goals of LAMOST-MRS and propose a near optimistic strategy of the survey. A complete footprint is also provided. Not only the regular medium-resolution survey, but also a time-domain spectroscopic survey is being conducted since 2018 and will be end in 2023. According to the detailed survey plan, we expect that LAMOST-MRS can observe about 2 million stellar spectra with ~7500 and limiting magnitude of around G=15 mag. Moreover, it will also provide about 200 thousand stars with averagely 60-epoch observations and limiting magnitude of G~14 mag. These high quality spectra will give around 20 elemental abundances, rotational velocities, emission line profiles as well as precise radial velocity with uncertainty less than 1 km/s. With these data, we expect that LAMOST can effectively leverage sciences on stellar physics, e.g. exotic binary stars, detailed observation of many types of variable stars etc., planet host stars, emission nebulae, open clusters, young pre-main-sequence stars etc.

preprint2020arXiv

Latent Variables on Spheres for Autoencoders in High Dimensions

Variational Auto-Encoder (VAE) has been widely applied as a fundamental generative model in machine learning. For complex samples like imagery objects or scenes, however, VAE suffers from the dimensional dilemma between reconstruction precision that needs high-dimensional latent codes and probabilistic inference that favors a low-dimensional latent space. By virtue of high-dimensional geometry, we propose a very simple algorithm, called Spherical Auto-Encoder (SAE), completely different from existing VAEs to address the issue. SAE is in essence the vanilla autoencoder with spherical normalization on the latent space. We analyze the unique characteristics of random variables on spheres in high dimensions and argue that random variables on spheres are agnostic to various prior distributions and data modes when the dimension is sufficiently high. Therefore, SAE can harness a high-dimensional latent space to improve the inference precision of latent codes while maintain the property of stochastic sampling from priors. The experiments on sampling and inference validate our theoretical analysis and the superiority of SAE.

preprint2020arXiv

Learning Implicit Generative Models by Teaching Explicit Ones

Implicit generative models are difficult to train as no explicit density functions are defined. Generative adversarial nets (GANs) present a minimax framework to train such models, which however can suffer from mode collapse due to the nature of the JS-divergence. This paper presents a learning by teaching (LBT) approach to learning implicit models, which intrinsically avoids the mode collapse problem by optimizing a KL-divergence rather than the JS-divergence in GANs. In LBT, an auxiliary density estimator is introduced to fit the implicit model's distribution while the implicit model teaches the density estimator to match the data distribution. LBT is formulated as a bilevel optimization problem, whose optimal generator matches the true data distribution. LBT can be naturally integrated with GANs to derive a hybrid LBT-GAN that enjoys complimentary benefits. Finally, we present a stochastic gradient ascent algorithm with unrolling to solve the challenging learning problems. Experimental results demonstrate the effectiveness of our method.

preprint2020arXiv

Modeling and Detailed Numerical Simulation of the Primary Breakup of a Gasoline Surrogate Jet under Non-Evaporative Operating Conditions

In the present study, detailed numerical simulations are performed to investigate the primary breakup of a gasoline surrogate jet under non-evaporative "Spray G" operating conditions. The Spray G injector and operating conditions, developed by the Engine Combustion Network (ECN), represent the early phase of spray-guided gasoline injection. To focus the computational resources on resolving the primary breakup, simplifications have been made on the injector geometry. The effect of the internal flow on the primary breakup is modeled by specifying a nonzero injection angle at the inlet. The nonzero injection angle results in an increase of the jet penetration speed and also a deflection of the liquid jet. A parametric study on the injection angle is performed, and the numerical results are compared to the experimental data to identify the injection angle that best represents the Spray G conditions. The nonzero injection angle introduces an azimuthally non-uniform velocity in the liquid jet, which in turn influences the instability development on the jet surfaces and also the deformation and breakup of the jet head. The asymmetric primary breakup dynamics eventually lead to an azimuthal variation of droplet size distributions. The number of droplets varies significantly with the azimuthal angle, but interestingly, the probability density functions (PDF) of droplet size for different azimuthal angles collapse to a self-similar profile. Analysis has also been conducted to estimate the percentage and statistics of the tiny droplets that are under resolved in the present simulation. The PDF of the azimuthal angle is also presented, which is also shown to exhibit a self-similar form that varies little over time. Finally, a model is developed to predict the droplet number as a function of droplet diameter, azimuthal angle where a droplet is located, and time.

preprint2020arXiv

MoGA: Searching Beyond MobileNetV3

The evolution of MobileNets has laid a solid foundation for neural network applications on mobile end. With the latest MobileNetV3, neural architecture search again claimed its supremacy in network design. Unfortunately, till today all mobile methods mainly focus on CPU latencies instead of GPU, the latter, however, is much preferred in practice for it has faster speed, lower overhead and less interference. Bearing the target hardware in mind, we propose the first Mobile GPU-Aware (MoGA) neural architecture search in order to be precisely tailored for real-world applications. Further, the ultimate objective to devise a mobile network lies in achieving better performance by maximizing the utilization of bounded resources. Urging higher capability while restraining time consumption is not reconcilable. We alleviate the tension by weighted evolution techniques. Moreover, we encourage increasing the number of parameters for higher representational power. With 200x fewer GPU days than MnasNet, we obtain a series of models that outperform MobileNetV3 under the similar latency constraints, i.e., MoGA-A achieves 75.9% top-1 accuracy on ImageNet, MoGA-B meets 75.5% which costs only 0.5 ms more on mobile GPU. MoGA-C best attests GPU-awareness by reaching 75.3% and being slower on CPU but faster on GPU.The models and test code is made available here https://github.com/xiaomi-automl/MoGA.

preprint2020arXiv

Near-field imaging of a locally rough interface and buried obstacles with the linear sampling method

Consider the problem of inverse scattering of time-harmonic point sources from an infinite, penetrable rough interface with bounded obstacles buried in the lower half-space, where the interface is assumed to be a local perturbation of a planar surface. A novel version of the sampling method is proposed to simultaneously reconstruct the local perturbation of the rough interface and buried obstacles by constructing a modified near-field equation associated with a special rough surface, yielding a fast imaging algorithm. Numerical examples are presented to illustrate the effectiveness of the inversion algorithm.

preprint2020arXiv

Neural Architecture Search on Acoustic Scene Classification

Convolutional neural networks are widely adopted in Acoustic Scene Classification (ASC) tasks, but they generally carry a heavy computational burden. In this work, we propose a lightweight yet high-performing baseline network inspired by MobileNetV2, which replaces square convolutional kernels with unidirectional ones to extract features alternately in temporal and frequency dimensions. Furthermore, we explore a dynamic architecture space built on the basis of the proposed baseline with the recent Neural Architecture Search (NAS) paradigm, which first trains a supernet that incorporates all candidate networks and then applies a well-known evolutionary algorithm NSGA-II to discover more efficient networks with higher accuracy and lower computational cost. Experimental results demonstrate that our searched network is competent in ASC tasks, which achieves 90.3% F1-score on the DCASE2018 task 5 evaluation set, marking a new state-of-the-art performance while saving 25% of FLOPs compared to our baseline network.

preprint2020arXiv

Old Photo Restoration via Deep Latent Space Translation

We propose to restore old photos that suffer from severe degradation through a deep learning approach. Unlike conventional restoration tasks that can be solved through supervised learning, the degradation in real photos is complex and the domain gap between synthetic images and real old photos makes the network fail to generalize. Therefore, we propose a novel triplet domain translation network by leveraging real photos along with massive synthetic image pairs. Specifically, we train two variational autoencoders (VAEs) to respectively transform old photos and clean photos into two latent spaces. And the translation between these two latent spaces is learned with synthetic paired data. This translation generalizes well to real photos because the domain gap is closed in the compact latent space. Besides, to address multiple degradations mixed in one old photo, we design a global branch with apartial nonlocal block targeting to the structured defects, such as scratches and dust spots, and a local branch targeting to the unstructured defects, such as noises and blurriness. Two branches are fused in the latent space, leading to improved capability to restore old photos from multiple defects. Furthermore, we apply another face refinement network to recover fine details of faces in the old photos, thus ultimately generating photos with enhanced perceptual quality. With comprehensive experiments, the proposed pipeline demonstrates superior performance over state-of-the-art methods as well as existing commercial tools in terms of visual quality for old photos restoration.

preprint2020arXiv

Perceptual Image Super-Resolution with Progressive Adversarial Network

Single Image Super-Resolution (SISR) aims to improve resolution of small-size low-quality image from a single one. With popularity of consumer electronics in our daily life, this topic has become more and more attractive. In this paper, we argue that the curse of dimensionality is the underlying reason of limiting the performance of state-of-the-art algorithms. To address this issue, we propose Progressive Adversarial Network (PAN) that is capable of coping with this difficulty for domain-specific image super-resolution. The key principle of PAN is that we do not apply any distance-based reconstruction errors as the loss to be optimized, thus free from the restriction of the curse of dimensionality. To maintain faithful reconstruction precision, we resort to U-Net and progressive growing of neural architecture. The low-level features in encoder can be transferred into decoder to enhance textural details with U-Net. Progressive growing enhances image resolution gradually, thereby preserving precision of recovered image. Moreover, to obtain high-fidelity outputs, we leverage the framework of the powerful StyleGAN to perform adversarial learning. Without the curse of dimensionality, our model can super-resolve large-size images with remarkable photo-realistic details and few distortions. Extensive experiments demonstrate the superiority of our algorithm over state-of-the-arts both quantitatively and qualitatively.

preprint2020arXiv

Privacy for All: Demystify Vulnerability Disparity of Differential Privacy against Membership Inference Attack

Machine learning algorithms, when applied to sensitive data, pose a potential threat to privacy. A growing body of prior work has demonstrated that membership inference attack (MIA) can disclose specific private information in the training data to an attacker. Meanwhile, the algorithmic fairness of machine learning has increasingly caught attention from both academia and industry. Algorithmic fairness ensures that the machine learning models do not discriminate a particular demographic group of individuals (e.g., black and female people). Given that MIA is indeed a learning model, it raises a serious concern if MIA ``fairly'' treats all groups of individuals equally. In other words, whether a particular group is more vulnerable against MIA than the other groups. This paper examines the algorithmic fairness issue in the context of MIA and its defenses. First, for fairness evaluation, it formalizes the notation of vulnerability disparity (VD) to quantify the difference of MIA treatment on different demographic groups. Second, it evaluates VD on four real-world datasets, and shows that VD indeed exists in these datasets. Third, it examines the impacts of differential privacy, as a defense mechanism of MIA, on VD. The results show that although DP brings significant change on VD, it cannot eliminate VD completely. Therefore, fourth, it designs a new mitigation algorithm named FAIRPICK to reduce VD. An extensive set of experimental results demonstrate that FAIRPICK can effectively reduce VD for both with and without the DP deployment.

preprint2020arXiv

Protocol for an Observational Study on the Effects of Social Distancing on Influenza-Like Illness and COVID-19

The novel coronavirus disease (COVID-19) is a highly contagious respiratory disease that was first detected in Wuhan, China in December 2019, and has since spread around the globe, claiming more than 69,000 lives by the time this protocol is written. It has been widely acknowledged that the most effective public policy to mitigate the pandemic is \emph{social and physical distancing}: keeping at least six feet away from people, working from home, closing non-essential businesses, etc. There have been a lot of anecdotal evidences suggesting that social distancing has a causal effect on disease mitigation; however, few studies have investigated the effect of social distancing on disease mitigation in a transparent and statistically-sound manner. We propose to perform an optimal non-bipartite matching to pair counties with similar observed covariates but vastly different average social distancing scores during the first week (March 16th through Match 22nd) of President's \emph{15 Days to Slow the Spread} campaign. We have produced a total of $302$ pairs of two U.S. counties with good covariate balance on a total of $16$ important variables. Our primary outcome will be the average observed illness collected by Kinsa Inc. two weeks after the intervention period. Although the observed illness does not directly measure COVID-19, it reflects a real-time aspect of the pandemic, and unlike confirmed cases, it is much less confounded by counties' testing capabilities. We also consider observed illness three weeks after the intervention period as a secondary outcome. We will test a proportional treatment effect using a randomization-based test with covariance adjustment and conduct a sensitivity analysis.

preprint2020arXiv

Sex Differences in Severity and Mortality Among Patients With COVID-19: Evidence from Pooled Literature Analysis and Insights from Integrated Bioinformatic Analysis

Objective: To conduct a meta-analysis of current studies that examined sex differences in severity and mortality in patients with COVID-19, and identify potential mechanisms underpinning these differences. Methods: We performed a systematic review to collate data from observational studies examining associations of sex differences with clinical outcomes of COVID-19. PubMed, Web of Science and four preprint servers were searched for relevant studies. Data were extracted and analyzed using meta-analysis where possible, with summary data presented otherwise. Publicly available bulk RNA sequencing (RNA-seq), single-cell RNA sequencing (scRNA-seq), and chromatin immunoprecipitation sequencing (ChIP-seq) data were analyzed to explore the potential mechanisms underlying the observed association. Results: 39 studies met inclusion criteria, representing 77932 patients, of which 41510 (53.3%) were males. Men were at a markedly increased risk of developing severe cases compared with women. Furthermore, the pooled odds ratio (OR) of mortality for male group compared with the female group indicated significant higher mortality rate for male. Data from scRNA-seq suggest that men have a higher amount of ACE2-expressing pulmonary alveolar type II cells than women. Sex-based immunological differences exist. The expression of androgen receptor (AR) is positively correlated with ACE2, and there is evidence that AR may directly regulate the expression of ACE2. Conclusions: This meta-analysis detected an increased severity and mortality rate in the male populations with COVID-19, which might be attributable to the sex-based differences in cellular compositions and immunological microenvironments of the lung. The host cell receptor ACE2 is likely regulated by AR signaling pathway, which is identified as a potential target for prevention and treatment of SARS-Cov-2 infections in men.

preprint2020arXiv

Short-term oscillation and falling dynamics for a water drop dripping in quiescent air

The short-term transient falling dynamics of a dripping water drop in quiescent air has been investigated through both simulation and experiment. The focus is on the short term behavior and the time range considered covers about eight dominant second-mode oscillations of the drop after it is formed. Due to the small fluid inertia the growth of the drop is quasi-static and is well captured by the static pendant drop theory. Nevertheless, the pinching dynamics and the resulting post-formation state of the drop trigger a nonlinear oscillation when the drop falls. The initial shape of the drop when it is just formed is decomposed into spherical harmonic modes. The pinching dynamics such as interface overturning introduces small-scale variation on the drop contour, which in turn contributes to the finite amplitudes of the higher-order modes. Furthermore, the initial kinetic energy when the droplet is just formed is as important as the initial surface energy contained in the drop shape, and is found to amplify the initial oscillation amplitude and to induce a phase shift in the oscillation of all the modes. By incorporating both the initial surface and kinetic energy, the linear model for a free drop oscillation yields very good predictions for the second and third modes. The mode amplitude spectra show both the primary frequencies that are consistent with the Lamb's theory and the secondary frequencies arising from different modes due to nonlinear inter-mode coupling. The complex transient flow inside and outside the drop is induced by the interaction between the falling motion and the nonlinear oscillation. The streamlines indicate that the internal flow is substantially different from the Hill vortex for a falling drop without oscillation. The temporal evolutions of both the internal flow and the wake morphology follow the dominant second oscillation mode.

preprint2020arXiv

The atomic gas of star-forming galaxies at z$\sim$0.05 as revealed by the Five-hundred-meter Aperture Spherical Radio Telescope

We report new HI observations of four z$\sim$0.05 star-forming galaxies undertaken during the commissioning phase of the Five-hundred-meter Aperture Spherical Radio Telescope (FAST). FAST is the largest single-dish telescope with a 500 meter aperture and a 19-Beam receiver. Exploiting the unprecedented sensitivity provided by FAST, we aim to study the atomic gas, via the HI 21cm emission line, in low-$z$ star-forming galaxies taken from the Valparaíso ALMA/APEX Line Emission Survey (VALES) project. Together with previous ALMA CO($J=1-0$) observations, the HI data provides crucial information to measure the gas mass and dynamics. As a pilot HI survey, we targeted four local star-forming galaxies at $z\sim0.05$. In particular, one of them has already been detected in HI by the Arecibo Legacy Fast ALFA survey (ALFALFA), allowing a careful comparison. We use an ON-OFF observing approach that allowed us to reach an rms of 0.7mJy/beam at a 1.7km/s velocity resolution within only 20 minutes ON-target integration time. We demonstrate the great capabilities of the FAST 19-beam receiver for pushing the detectability of the HI emission line of extra-galactic sources. The HI emission line detected by FAST shows good consistency with the previous ALFALFA results. Our observations are put in context with previous multi-wavelength data to reveal the physical properties of these low-$z$ galaxies. We find that the CO($J=1-0$) and HI emission line profiles are similar. The dynamical mass estimated from the HI data is an order of magnitude higher than the baryon mass and the dynamical mass derived from the CO observations, implying that the mass probed by dynamics of HI is dominated by the dark matter halo. In one case, a target shows an excess of CO($J=1-0$) in the line centre, which can be explained by an enhanced CO($J=1-0$) emission induced by a nuclear starburst showing high velocity dispersion.

preprint2020arXiv

The radio properties of the OH megamaser galaxy IRAS 02524+2046

We present results from VLBI observations of continuum and OH line emission in IRAS 02524+2046 and also arcsecond-scale radio properties of this galaxy using VLA archive data. We found that there is no significant detection of radio continuum emission from VLBI observations. The arcsecond-scale radio images of this source show no clear extended emission, the total radio flux density at L and C band are around 2.9 mJy and 1.0 mJy respectively, which indicate a steep radio spectral index between the two band. Steep spectral index, low brightness temperature and high $q$-ratio (the FIR to the radio flux density), which are three critical indicators in classification of radio activity in the nuclei of galaxies, are all consistent with the classification of this source as a starburst galaxy from its optical spectrum. The high-resolution line profile show that both of \textbf{the 1665 and 1667 MHz OH maser} line have been detected which show three and two clear components respectively. The channel maps show that the maser emission are distributed in a region $\sim$ 210 pc $\times$ 90 pc, the detected maser components at different region show similar double spectral feature, which might be an evidence that this galaxy is at a stage of major merger as seen from the optical morphology.

preprint2020arXiv

To Relieve Your Headache of Training an MRF, Take AdVIL

We propose a black-box algorithm called {\it Adversarial Variational Inference and Learning} (AdVIL) to perform inference and learning on a general Markov random field (MRF). AdVIL employs two variational distributions to approximately infer the latent variables and estimate the partition function of an MRF, respectively. The two variational distributions provide an estimate of the negative log-likelihood of the MRF as a minimax optimization problem, which is solved by stochastic gradient descent. AdVIL is proven convergent under certain conditions. On one hand, compared with contrastive divergence, AdVIL requires a minimal assumption about the model structure and can deal with a broader family of MRFs. On the other hand, compared with existing black-box methods, AdVIL provides a tighter estimate of the log partition function and achieves much better empirical results.

preprint2020arXiv

Triple Generative Adversarial Networks

We propose a unified game-theoretical framework to perform classification and conditional image generation given limited supervision. It is formulated as a three-player minimax game consisting of a generator, a classifier and a discriminator, and therefore is referred to as Triple Generative Adversarial Network (Triple-GAN). The generator and the classifier characterize the conditional distributions between images and labels to perform conditional generation and classification, respectively. The discriminator solely focuses on identifying fake image-label pairs. Under a nonparametric assumption, we prove the unique equilibrium of the game is that the distributions characterized by the generator and the classifier converge to the data distribution. As a byproduct of the three-player mechanism, Triple-GAN is flexible to incorporate different semi-supervised classifiers and GAN architectures. We evaluate Triple-GAN in two challenging settings, namely, semi-supervised learning and the extreme low data regime. In both settings, Triple-GAN can achieve excellent classification results and generate meaningful samples in a specific class simultaneously. In particular, using a commonly adopted 13-layer CNN classifier, Triple-GAN outperforms extensive semi-supervised learning methods substantially on more than 10 benchmarks no matter data augmentation is applied or not.

preprint2020arXiv

Understanding and Stabilizing GANs' Training Dynamics with Control Theory

Generative adversarial networks (GANs) are effective in generating realistic images but the training is often unstable. There are existing efforts that model the training dynamics of GANs in the parameter space but the analysis cannot directly motivate practically effective stabilizing methods. To this end, we present a conceptually novel perspective from control theory to directly model the dynamics of GANs in the function space and provide simple yet effective methods to stabilize GANs' training. We first analyze the training dynamic of a prototypical Dirac GAN and adopt the widely-used closed-loop control (CLC) to improve its stability. We then extend CLC to stabilize the training dynamic of normal GANs, where CLC is implemented as a squared $L2$ regularizer on the output of the discriminator. Empirical results show that our method can effectively stabilize the training and obtain state-of-the-art performance on data generation tasks.

preprint2019arXiv

Deriving the stellar labels of LAMOST spectra with Stellar LAbel Machine (SLAM)

The LAMOST survey has provided 9 million spectra in its Data Release 5 (DR5) at R$\sim$1800. Extracting precise stellar labels is crucial for such a large sample. In this paper, we report the implementation of the Stellar LAbel Machine (SLAM), which is a data-driven method based on Support Vector Regression (SVR), a robust non-linear regression technique. Thanks to the capability to model highly non-linear problems with SVR, SLAM generally can derive stellar labels over a wide range of spectral types. This gives it a unique capability compared to other popular data-driven methods. To illustrate this capability, we test the performance of SLAM on stars ranging from Teff$\sim$4000 to $\sim$8000 K trained on LAMOST spectra and stellar labels. At g-band signal-to-noise ratio (SNRg) higher than 100, the random uncertainties of Teff, logg and [Fe/H] are 50 K, 0.09 dex, and 0.07 dex, respectively. We then set up another SLAM model trained by APOGEE and LAMOST common stars to demonstrate its capability of dealing with high dimensional problems. The spectra are from LAMOST DR5 and the stellar labels of the training set are from APOGEE DR15, including Teff, logg, [M/H],[$α$/M], [C/M], and [N/M]. The cross-validated scatters at SNRg$\sim$100 are 49 K, 0.10 dex, 0.037 dex,0.026 dex, 0.058 dex, and 0.106 dex for these parameters, respectively. This performance is at the same level as other up-to-date data-driven models. As a byproduct, we also provide the latest catalog of $\sim$1 million LAMOST DR5 K giant stars with SLAM-predicted stellar labels in this work.

preprint2019arXiv

Exploring the spectral \textit{information content} in the LAMOST medium-resolution survey (MRS)

Low-resolution spectra are proved competitive to high-resolution spectra in determining many stellar labels at comparable precision. It is useful to consider the spectral information content when assessing the capability of a stellar spectrum in deriving precise stellar labels. In this work, we quantify the information content brought by the LAMOST-II medium-resolution spectroscopic survey (MRS) using the gradient spectra and the coefficients-of-dependence (CODs). In general, the wavelength coverage of the MRS well constrains the stellar labels but the sensitivities of different stellar labels vary with spectral types and metallicity of the stars of interest and, therefore, affect the performance of the stellar label determination from the MRS spectra. Applying the SLAM to the synthetic spectra which mimic the MRS data, we find the precision of the fundamental stellar parameters Teff, logg and [M/H] are better when combining both the blue and red bands of the MRS. This is especially important for warm stars since the H$α$ line located in the red part plays a more important role in determining the effective temperature for warm stars. With blue and red parts together, we are able to reach similar performance to the low-resolution spectra except for warm stars. However, at [M/H]$\sim-2.0$ dex, the uncertainties of fundamental stellar labels estimated from MRS are substantially larger than those from low-resolution spectra. We also tested the uncertainties of Teff, logg and [M/H] of from MRS data induced from the radial velocity mismatch and find that a mismatch of about 1 km s$^{-1}$, which is typical for LAMOST MRS data, would not significantly affect the stellar label estimates. At last, reference precision limits are calculated using synthetic gradient spectra, according to which we expect abundances of at least 17 elements to be measured precisely from MRS spectra.

preprint2019arXiv

Tracing Kinematic and Chemical Properties of Sagittarius Stream by K-Giants, M-Giants, and BHB stars

We characterize the kinematic and chemical properties of $\sim$3,000 Sagittarius (Sgr) stream stars, including K-giants, M-giants, and BHBs, select from SEGUE-2, LAMOST, and SDSS separately in Integrals-of-Motion space. The orbit of Sgr stream is quite clear from the velocity vector in $X$-$Z$ plane. Stars traced by K-giants and M-giants present the apogalacticon of trailing steam is $\sim$ 100 kpc. The metallicity distributions of Sgr K-, M-giants, and BHBs present that the M-giants are on average the most metal-rich population, followed by K-giants and BHBs. All of the K-, M-giants, and BHBs indicate that the trailing arm is on average more metal-rich than leading arm, and the K-giants show that the Sgr debris is the most metal-poor part. The $α$-abundance of Sgr stars exhibits a similar trend with the Galactic halo stars at lower metallicity ([Fe/H] $<\sim$ $-$1.0 dex), and then evolve down to lower [$α$/Fe] than disk stars at higher metallicity, which is close to the evolution pattern of $α$-element of Milky Way dwarf galaxies. We find $V_Y$ and metallicity of K-giants have gradients along the direction of line-of-sight from the Galactic center in $X$-$Z$ plane, and the K-giants show that $V_Y$ increases with metallicity at [Fe/H] $>\sim-$1.5 dex. After dividing the Sgr stream into bright and faint stream according to their locations in equatorial coordinate, the K-giants and BHBs show that the bright and faint stream present different $V_Y$ and metallicities, the bright stream is on average higher in $V_Y$ and metallicity than the faint stream.

preprint2016arXiv

A Coherent Polariton Laser

The semiconductor polariton laser promises a new source of coherent light, which, compared to conventional semiconductor photon lasers, has input-energy threshold orders of magnitude lower. However, intensity stability, a defining feature of a coherent state, has remained poor. Intensity noise at many times of the shot-noise of a coherent state has persisted, which has been attributed to multiple mechanisms that are difficult to separate in conventional polariton systems. The large intensity noise in turn limited the phase coherence. These limit the capability of the polariton laser as a source of coherence light. Here, we demonstrate a polariton laser with shot-noise limited intensity stability, as expected of a fully coherent state. This is achieved by using an optical cavity with high mode selectivity to enforce single-mode lasing, suppress condensate depletion, and establish gain saturation. The absence of spurious intensity fluctuations moreover enabled measurement of a transition from exponential to Gaussian decay of the phase coherence of the polariton laser. It suggests large self-interaction energies in the polariton condensate, exceeding the laser bandwidth. Such strong interactions are unique to matter-wave laser and important for nonlinear polariton devices. The results will guide future development of polariton lasers and nonlinear polariton devices.

preprint2016arXiv

A New Manifold Distance Measure for Visual Object Categorization

Manifold distances are very effective tools for visual object recognition. However, most of the traditional manifold distances between images are based on the pixel-level comparison and thus easily affected by image rotations and translations. In this paper, we propose a new manifold distance to model the dissimilarities between visual objects based on the Complex Wavelet Structural Similarity (CW-SSIM) index. The proposed distance is more robust to rotations and translations of images than the traditional manifold distance and the CW-SSIM index based distance. In addition, the proposed distance is combined with the $k$-medoids clustering method to derive a new clustering method for visual object categorization. Experiments on Coil-20, Coil-100 and Olivetti Face Databases show that the proposed distance measure is better for visual object categorization than both the traditional manifold distances and the CW-SSIM index based distances.

preprint2016arXiv

A Novel Biologically Mechanism-Based Visual Cognition Model--Automatic Extraction of Semantics, Formation of Integrated Concepts and Re-selection Features for Ambiguity

Integration between biology and information science benefits both fields. Many related models have been proposed, such as computational visual cognition models, computational motor control models, integrations of both and so on. In general, the robustness and precision of recognition is one of the key problems for object recognition models. In this paper, inspired by features of human recognition process and their biological mechanisms, a new integrated and dynamic framework is proposed to mimic the semantic extraction, concept formation and feature re-selection in human visual processing. The main contributions of the proposed model are as follows: (1) Semantic feature extraction: Local semantic features are learnt from episodic features that are extracted from raw images through a deep neural network; (2) Integrated concept formation: Concepts are formed with local semantic information and structural information learnt through network. (3) Feature re-selection: When ambiguity is detected during recognition process, distinctive features according to the difference between ambiguous candidates are re-selected for recognition. Experimental results on hand-written digits and facial shape dataset show that, compared with other methods, the new proposed model exhibits higher robustness and precision for visual recognition, especially in the condition when input samples are smantic ambiguous. Meanwhile, the introduced biological mechanisms further strengthen the interaction between neuroscience and information science.

preprint2016arXiv

Analysis of Transient Acoustic-Elastic Interaction in an Unbounded Structure

Consider the wave propagation in a two-layered medium consisting of a homogeneous compressible air or fluid on top of a homogeneous isotropic elastic solid. The interface between the two layers is assumed to be an unbounded rough surface. This paper concerns the time-domain analysis of such an acoustic-elastic interaction problem in an unbounded structure in three dimensions. Using an exact transparent boundary condition and suitable interface conditions, we study an initial-boundary value problem for the coupling of the Helmholtz equation and the Navier equation. The well-posedness and stability are established for the reduced problem. Our proof is based on the method of energy, the Lax--Milgram lemma, and the inversion theorem of the Laplace transform. Moreover, a priori estimates with explicit dependence on the time are achieved for the quantities of acoustic pressure and elastic displacement by taking special test functions for the time-domain variational problem.

preprint2016arXiv

Bootstrapping Face Detection with Hard Negative Examples

Recently significant performance improvement in face detection was made possible by deeply trained convolutional networks. In this report, a novel approach for training state-of-the-art face detector is described. The key is to exploit the idea of hard negative mining and iteratively update the Faster R-CNN based face detector with the hard negatives harvested from a large set of background examples. We demonstrate that our face detector outperforms state-of-the-art detectors on the FDDB dataset, which is the de facto standard for evaluating face detection algorithms.

preprint2016arXiv

Carbon stars from LAMOST DR2 data

In this work, we present the new catalog of carbon stars from the LAMOST DR2 catalog. In total, 894 carbon stars are identified from multiple line indices measured from the stellar spectra. Combining the CN bands in the red end with \ctwo\ and other lines, we are able to identify the carbon stars. Moreover, we also classify the carbon stars into spectral sub-types of \ch, \CR, and \cn. These sub-types approximately show distinct features in the multi-dimensional line indices, implying that in the future we can use them to identify carbon stars from larger spectroscopic datasets. Meanwhile, from the line indices space, while the \cn\ stars are clearly separated from the others, we find no clear separation between \CR\ and \ch\ sub-types. The \CR\ and \ch\ stars seem to smoothly transition from one to another. This may hint that the \CR\ and \ch\ stars may not be different in their origins but look different in their spectra because of different metallicity. Due to the relatively low spectral resolution and lower signal-to-noise ratio, the ratio of $^{12}$C/$^{13}$C is not measured and thus the \cj\ stars are not identified.

preprint2016arXiv

Constraining the Mass of the Photon with Gamma-Ray Bursts

One of the cornerstones of modern physics is Einstein's special relativity, with its constant speed of light and zero photon mass assumptions. Constraint on the rest mass m_γ of photons is a fundamental way to test Einstein's theory, as well as other essential electromagnetic and particle theories. Since non-zero photon mass can give rise to frequency-(or energy-) dependent dispersions, measuring the time delay of photons with different frequencies emitted from explosive astrophysical events is an important and model-independent method to put such a constraint. The cosmological gamma-ray bursts (GRBs), with short time scales, high redshifts as well as broadband prompt and afterglow emissions, provide an ideal testbed for m_γ constraints. In this paper we calculate the upper limits of the photon mass with GRB early time radio afterglow observations as well as multi-band radio peaks, thus improve the results of Schaefer (1999) by nearly half an order of magnitude.

preprint2016arXiv

Effective Deterministic Initialization for $k$-Means-Like Methods via Local Density Peaks Searching

The $k$-means clustering algorithm is popular but has the following main drawbacks: 1) the number of clusters, $k$, needs to be provided by the user in advance, 2) it can easily reach local minima with randomly selected initial centers, 3) it is sensitive to outliers, and 4) it can only deal with well separated hyperspherical clusters. In this paper, we propose a Local Density Peaks Searching (LDPS) initialization framework to address these issues. The LDPS framework includes two basic components: one of them is the local density that characterizes the density distribution of a data set, and the other is the local distinctiveness index (LDI) which we introduce to characterize how distinctive a data point is compared with its neighbors. Based on these two components, we search for the local density peaks which are characterized with high local densities and high LDIs to deal with 1) and 2). Moreover, we detect outliers characterized with low local densities but high LDIs, and exclude them out before clustering begins. Finally, we apply the LDPS initialization framework to $k$-medoids, which is a variant of $k$-means and chooses data samples as centers, with diverse similarity measures other than the Euclidean distance to fix the last drawback of $k$-means. Combining the LDPS initialization framework with $k$-means and $k$-medoids, we obtain two novel clustering methods called LDPS-means and LDPS-medoids, respectively. Experiments on synthetic data sets verify the effectiveness of the proposed methods, especially when the ground truth of the cluster number $k$ is large. Further, experiments on several real world data sets, Handwritten Pendigits, Coil-20, Coil-100 and Olivetti Face Database, illustrate that our methods give a superior performance than the analogous approaches on both estimating $k$ and unsupervised object categorization.

preprint2016arXiv

Fast Sampling for Bayesian Max-Margin Models

Bayesian max-margin models have shown superiority in various practical applications, such as text categorization, collaborative prediction, social network link prediction and crowdsourcing, and they conjoin the flexibility of Bayesian modeling and predictive strengths of max-margin learning. However, Monte Carlo sampling for these models still remains challenging, especially for applications that involve large-scale datasets. In this paper, we present the stochastic subgradient Hamiltonian Monte Carlo (HMC) methods, which are easy to implement and computationally efficient. We show the approximate detailed balance property of subgradient HMC which reveals a natural and validated generalization of the ordinary HMC. Furthermore, we investigate the variants that use stochastic subsampling and thermostats for better scalability and mixing. Using stochastic subgradient Markov Chain Monte Carlo (MCMC), we efficiently solve the posterior inference task of various Bayesian max-margin models and extensive experimental results demonstrate the effectiveness of our approach.

preprint2016arXiv

Footprints of the weak s-process in the carbon-enhanced metal-poor star ET0097

Historically, the weak s-process contribution to metal-poor stars is thought to be extremely small, due to the effect of the secondary-like nature of the neutron source 22Ne(a;n)25Mg in massive stars, which means that metal-poor weak s-process stars could not be found. ET0097 is the first observed carbon-enhanced metal-poor (CEMP) star in the Sculptor dwarf spheroidal galaxy. Because C is enriched and the elements heavier than Ba are not overabundant, ET0097 can be classified as a CEMP-no star. However, this star shows overabundances of lighter n-capture elements (i.e., Sr, Y and Zr). In this work, having adopted the abundance decomposition approach, we investigate the astrophysical origins of the elements in ET0097. We find that the light elements and iron-peak elements (from O to Zn) of the star mainly originate from the primary process of massive stars and the heavier n-capture elements (heavier than Ba) mainly come from the main r-process. However, the lighter n-capture elements such as Sr, Y and Zr should mainly come from the primary weak s-process. The contributed fractions of the primary weak s-process to the Sr, Y and Zr abundances of ET0097 are about 82%, 84% and 58% respectively, suggesting that the CEMP star ET0097 should have the footprints of the weak s-process. The derived result should be a significant evidence that the weak s-process elements can be produced in metal-poor massive stars.

preprint2016arXiv

Ion-wake Field inside a Glass Box

The confinement provided by a glass box is proving ideal for the formation of vertically aligned structures and a convenient method for controlling the number of dust particles comprising these dust structures, as well as their size and shape. In this paper, the electronic confinement of the glass box is mapped and the particle interactions between the particle pairs inside the glass box are measured. The ion-wake field is shown to exist within the glass box and its vertical and horizontal extent is measured.

preprint2016arXiv

Learning Deep Generative Models with Doubly Stochastic MCMC

We present doubly stochastic gradient MCMC, a simple and generic method for (approximate) Bayesian inference of deep generative models (DGMs) in a collapsed continuous parameter space. At each MCMC sampling step, the algorithm randomly draws a mini-batch of data samples to estimate the gradient of log-posterior and further estimates the intractable expectation over hidden variables via a neural adaptive importance sampler, where the proposal distribution is parameterized by a deep neural network and learnt jointly. We demonstrate the effectiveness on learning various DGMs in a wide range of tasks, including density estimation, data generation and missing data imputation. Our method outperforms many state-of-the-art competitors.

preprint2016arXiv

Learning to Generate with Memory

Memory units have been widely used to enrich the capabilities of deep networks on capturing long-term dependencies in reasoning and prediction tasks, but little investigation exists on deep generative models (DGMs) which are good at inferring high-level invariant representations from unlabeled data. This paper presents a deep generative model with a possibly large external memory and an attention mechanism to capture the local detail information that is often lost in the bottom-up abstraction process in representation learning. By adopting a smooth attention model, the whole network is trained end-to-end by optimizing a variational bound of data likelihood via auto-encoding variational Bayesian methods, where an asymmetric recognition network is learnt jointly to infer high-level invariant representations. The asymmetric architecture can reduce the competition between bottom-up invariant feature extraction and top-down generation of instance details. Our experiments on several datasets demonstrate that memory can significantly boost the performance of DGMs and even achieve state-of-the-art results on various tasks, including density estimation, image generation, and missing value imputation.

preprint2016arXiv

Max-Margin Deep Generative Models for (Semi-)Supervised Learning

Deep generative models (DGMs) are effective on learning multilayered representations of complex data and performing inference of input data by exploring the generative ability. However, it is relatively insufficient to empower the discriminative ability of DGMs on making accurate predictions. This paper presents max-margin deep generative models (mmDGMs) and a class-conditional variant (mmDCGMs), which explore the strongly discriminative principle of max-margin learning to improve the predictive performance of DGMs in both supervised and semi-supervised learning, while retaining the generative capability. In semi-supervised learning, we use the predictions of a max-margin classifier as the missing labels instead of performing full posterior inference for efficiency; we also introduce additional max-margin and label-balance regularization terms of unlabeled data for effectiveness. We develop an efficient doubly stochastic subgradient algorithm for the piecewise linear objectives in different settings. Empirical results on various datasets demonstrate that: (1) max-margin learning can significantly improve the prediction performance of DGMs and meanwhile retain the generative ability; (2) in supervised learning, mmDGMs are competitive to the best fully discriminative networks when employing convolutional neural networks as the generative and recognition models; and (3) in semi-supervised learning, mmDCGMs can perform efficient inference and achieve state-of-the-art classification results on several benchmarks.

preprint2016arXiv

Molecular Lines of 13 Galactic Infrared Bubble Regions

We investigated the physical properties of molecular clouds and star formation processes around infrared bubbles which are essentially expanding HII regions. We performed observations of 13 galactic infrared bubble fields containing 18 bubbles. Five molecular lines, 12CO (J=1-0), 13CO (J=1-0), C18O(J=1-0), HCN (J=1-0), and HCO+ (J=1-0), were observed, and several publicly available surveys, GLIMPSE, MIPSGAL, ATLASGAL, BGPS, VGPS, MAGPIS, and NVSS, were used for comparison. We find that these bubbles are generally connected with molecular clouds, most of which are giant. Several bubble regions display velocity gradients and broad shifted profiles, which could be due to the expansion of bubbles. The masses of molecular clouds within bubbles range from 100 to 19,000 solar mass, and their dynamic ages are about 0.3-3.7 Myr, which takes into account the internal turbulence pressure of surrounding molecular clouds. Clumps are found in the vicinity of all 18 bubbles, and molecular clouds near four of these bubbles with larger angular sizes show shell-like morphologies, indicating that either collect-and-collapse or radiation-driven implosion processes may have occurred. Due to the contamination of adjacent molecular clouds, only six bubble regions are appropriate to search for outflows, and we find that four of them have outflow activities. Three bubbles display ultra-compact HII regions at their borders, and one of them is probably responsible for its outflow. In total, only six bubbles show star formation activities in the vicinity, and we suggest that star formation processes might have been triggered.

preprint2016arXiv

Scalable Discrete Supervised Hash Learning with Asymmetric Matrix Factorization

Hashing method maps similar data to binary hashcodes with smaller hamming distance, and it has received a broad attention due to its low storage cost and fast retrieval speed. However, the existing limitations make the present algorithms difficult to deal with large-scale datasets: (1) discrete constraints are involved in the learning of the hash function; (2) pairwise or triplet similarity is adopted to generate efficient hashcodes, resulting both time and space complexity are greater than O(n^2). To address these issues, we propose a novel discrete supervised hash learning framework which can be scalable to large-scale datasets. First, the discrete learning procedure is decomposed into a binary classifier learning scheme and binary codes learning scheme, which makes the learning procedure more efficient. Second, we adopt the Asymmetric Low-rank Matrix Factorization and propose the Fast Clustering-based Batch Coordinate Descent method, such that the time and space complexity is reduced to O(n). The proposed framework also provides a flexible paradigm to incorporate with arbitrary hash function, including deep neural networks and kernel methods. Experiments on large-scale datasets demonstrate that the proposed method is superior or comparable with state-of-the-art hashing algorithms.

preprint2016arXiv

Structures and dynamics of glass-forming colloidal liquids under spherical confinement

Recent theories predict that when a supercooled liquid approaches the glass transition, particle clusters with a special "amorphous order" nucleate within the liquid, which lead to static correlations dictating the dramatic slowdown of liquid relaxation. The prediction, however, has yet to be verified in 3D experiments. Here, we design a colloidal system, where particles are confined inside spherical cavities with an amorphous layer of particles pinned at the boundary. Using this novel system, we capture the amorphous-order particle clusters and demonstrate the development of a static correlation. Moreover, by investigating the dynamics of spherically confined samples, we reveal a profound influence of the static correlation on the relaxation of colloidal liquids. In analogy to glass-forming liquids with randomly pinned particles, we propose a simple relation for the change of the configurational entropy of confined colloidal liquids, which quantitatively explains our experimental findings and illustrates a divergent static length scale during the colloidal glass transition.

preprint2016arXiv

Testing Einstein's Equivalence Principle with Supercluster Laniakea's Gravitational Field

Comparing the parameterized post-Newtonian parameter $γ$ values for different types of particles, or the same type of particles with different energies is an important method to test the Einstein Equivalence Principle (EEP). Assuming that the observed time delays are dominated by the gravitational potential of the Laniakea supercluster of galaxies, better results of EEP constraints can be obtained. In this paper, we apply photons from three kinds of cosmic transients, including TeV blazars, gamma-ray bursts as well as fast radio bursts to constrain EEP. With a gravitational field far more stronger than a single galaxy, we obtain 4--5 orders of magnitude more stringent than the pervious results.

preprint2016arXiv

The odd-isotope fractions of Barium in CEMP-r/s star HE 0338-3945 and r-II star CS 31082-001

We report the first measurement of the odd-isotope fractions for barium, \fodd\, in two extremely metal-poor stars: a CEMP-r/s star \he\ (\feh\,$=-2.42\pm0.11$) and an r-II star \cs\ (\feh\,$=-2.90\pm0.13$). The measured \fodd\ values are $0.23\pm0.12$ corresponding to $34.3\pm34.3$\% of the r-process contributions for \he\ and $0.43\pm0.09$ corresponding to $91.4\pm25.7$\% of the r-process contribution to Ba production for \cs. The high r-process signature of barium in \cs\ ($91.4\pm25.7\%$) suggests that the majority of the heavy elements in this star were synthesised via an r-process path, while the lower r-process value ($34.3\pm34.3\%$) found in \he\ indicates that the heavy elements in this star formed through a mix of s-process and r-process synthesis. These conclusions are consistent with studies based on AGB model calculations to fit their abundance distributions.

preprint2015arXiv

Astrophysical Origins for the Unusual Chemical Abundance of the Globular Cluster Palomar 1

We study the abundances of α elements, Fe-peak elements, and neutron-capture elements in Pal 1. We found that the abundances of the SNe Ia and main s-process components of Pal 1 are larger than those of the disk stars and the abundances of the primary component of Pal 1 are smaller than those of the disk stars with similar metallicity. The Fe abundances of Pal 1 and the disk stars mainly originate from the SNe Ia and the primary component, respectively. Although the α abundances dominantly produced by the primary process for the disk stars and Pal 1, the contributions of the primary component to Pal 1 are smaller than the corresponding contributions to the disk stars. The Fe-peak elements V and Co mainly originate from the primary and secondary components for the disk stars and Pal 1, but the contributions of the massive stars to Pal 1 are lower than those of the massive stars to the disk stars. The Yabundances mainly originate from the weak r-component for the disk stars. However, the contributions of the main s-components and main r-components to Y are close to those of the weak r-component for Pal 1. The Ba abundances of Pal 1 and the disk stars mainly originate from the main s-component and the main r-component, respectively. Our calculated results imply that the unusual abundances of Pal could be explained by the top-light IMF for Pal 1 progenitor-system.

preprint2015arXiv

Concept for a Future Super Proton-Proton Collider

Following the discovery of the Higgs boson at LHC, new large colliders are being studied by the international high-energy community to explore Higgs physics in detail and new physics beyond the Standard Model. In China, a two-stage circular collider project CEPC-SPPC is proposed, with the first stage CEPC (Circular Electron Positron Collier, a so-called Higgs factory) focused on Higgs physics, and the second stage SPPC (Super Proton-Proton Collider) focused on new physics beyond the Standard Model. This paper discusses this second stage.

preprint2015arXiv

Deposition and characterization of TiZrV-Pd thin films by dc magnetron sputtering

TiZrV film is mainly applied in the ultra-high vacuum pipe of storage ring. Thin film coatings of palladium which was added onto the TiZrV film to increase the service life of nonevaporable getters and enhance pumping speed for H2, was deposited on the inner face of stainless steel pipes by dc magnetron sputtering using argon gas as the sputtering gas. The TiZrV-Pd film properties were investigated by atomic force microscope (AFM), scanning electron microscope (SEM), X-ray photoelectron spectroscopy (XPS) and X-Ray Diffraction (XRD). The grain size of TiZrV and Pd film were about 0.42~1.3 nm and 8.5~18.25 nm respectively. It was found that the roughness of TiZrV films was small, about 2~4 nm, for Pd film it is large, about 17~19 nm. PP At. % of Pd in TiZrV/Pd films varied from 86.84 to 87.56 according to the XPS test results.

preprint2015arXiv

Discriminative Nonparametric Latent Feature Relational Models with Data Augmentation

We present a discriminative nonparametric latent feature relational model (LFRM) for link prediction to automatically infer the dimensionality of latent features. Under the generic RegBayes (regularized Bayesian inference) framework, we handily incorporate the prediction loss with probabilistic inference of a Bayesian model; set distinct regularization parameters for different types of links to handle the imbalance issue in real networks; and unify the analysis of both the smooth logistic log-loss and the piecewise linear hinge loss. For the nonconjugate posterior inference, we present a simple Gibbs sampler via data augmentation, without making restricting assumptions as done in variational methods. We further develop an approximate sampler using stochastic gradient Langevin dynamics to handle large networks with hundreds of thousands of entities and millions of links, orders of magnitude larger than what existing LFRM models can process. Extensive studies on various real networks show promising performance.

preprint2015arXiv

Energy Spectrum Extraction and Optimal Imaging via Dual-Energy Material Decomposition

Inferior soft-tissue contrast resolution is a major limitation of current CT scanners. The aim of the study is to improve the contrast resolution of CT scanners using dual-energy acquisition. Based on dual-energy material decomposition, the proposed method starts with extracting the outgoing energy spectrum by polychromatic forward projecting the material-selective images. The extracted spectrum is then reweighted to boost the soft-tissue contrast. A simulated water cylinder phantom with inserts that contain a series of six solutions of varying iodine concentration (range, 0-20 mg/mL) is used to evaluate the proposed method. Results show the root mean square error (RMSE) and mean energy difference between the extracted energy spectrum and the spectrum acquired using an energy-resolved photon counting detector(PCD), are 0.044 and 0.01 keV, respectively. Compared to the method using the standard energy-integrating detectors, dose normalized contrast-to-noise ratio (CNRD) for the proposed method are improved from 1 to 2.15 and from 1 to 1.88 for the 8 mg/mL and 16 mg/mL iodine concentration inserts, respectively. The results show CT image reconstructed using the proposed method is superior to the image reconstructed using the standard method that using an energy-integrating detector.

preprint2015arXiv

Film Coating Process Research and Characterization of TiN Coated Racetrack-type Ceramic Pipe

TiN film was coated on the internal face of racetrack-type ceramic pipe by three different methods: radio-frequency sputtering, DC sputtering and DC magnetron sputtering. The deposition rates of TiN film under different coating methods were compared. According to the AFM, SEM, XPS test results,these properties were analyzed, such as TiN film roughness and surface morphology. At the same time, the deposition rates were studied under two types' cathode, Ti wires and Ti plate. According to the SEM test results, Ti plate cathode can improve the TiN/Ti film deposition rate obviously.

preprint2015arXiv

Fluorescence Imaging In Vivo at Wavelengths beyond 1500 nm

Compared to imaging in the visible and near-infrared regions below 900 nm, imaging in the second near-infrared window (NIR-II, 1000-1700 nm) is a promising method for deep-tissue high-resolution optical imaging in vivo mainly due to the reduced scattering of photons traversing through biological tissues. Herein, semiconducting single-walled carbon nanotubes with large diameters were used for in vivo fluorescence imaging in the long-wavelength NIR region (1500-1700 nm, NIR-IIb). With this imaging agent, 3-4 um wide capillary blood vessels at a depth of about 3 mm could be resolved. Meanwhile, the blood-flow speeds in multiple individual vessels could be mapped simultaneously. Furthermore, NIR-IIb tumor imaging of a live mouse was explored. NIR-IIb imaging can be generalized to a wide range of fluorophores emitting at up to 1700 nm for high-performance in vivo optical imaging.

preprint2015arXiv

Improved Direct Counterfactual Quantum Communication

Recently, a novel direct counterfactual quantum communication protocol was proposed using chained quantum Zeno effect. We found that this protocol is far from being widely used in practical channels, due to the side effect of 'chained', which leads to a dramatic increase of the equivalent optical distance between Alice and Bob. Therefore, not only the transmission time of a single bit increases in multiple times, but also the protocol is more sensitive to the noise. Here, we proposed an improved protocol, in which quantum interference is employed to destroy the nested structure induced by 'chained' effect. Moreover, we proved that a better counterfactuality is easier to be achieved, and showed that our protocol outperforms the former in the presence of noises.

preprint2015arXiv

Joint Communication-Motion Planning in Wireless-Connected Robotic Networks: Overview and Design Guidelines

Recent years have witnessed the prosperity of robots and in order to support consensus and cooperation for multi-robot system, wireless communications and networking among robots and the infrastructure have become indispensable. In this technical note, we first provide an overview of the research contributions on communication-aware motion planning (CAMP) in designing wireless-connected robotic networks (WCRNs), where the degree-of-freedom (DoF) provided by motion and communication capabilities embraced by the robots have not been fully exploited. Therefore, we propose the framework of joint communication-motion planning (JCMP) as well as the architecture for incorporating JCMP in WCRNs. The proposed architecture is motivated by the observe-orient-decision-action (OODA) model commonly adopted in robotic motion control and cognitive radio. Then, we provide an overview of the orient module that quantify the connectivity assessment. Afterwards, we highlight the JCMP module and compare it with the conventional communication-planning, where the necessity of the JCMP is validated via both theoretical analysis and simulation results of an illustrative example. Finally, a series of open problems are discussed, which picture the gap between the state-of-the-art and a practical WCRN.

preprint2015arXiv

Jointly Modeling Topics and Intents with Global Order Structure

Modeling document structure is of great importance for discourse analysis and related applications. The goal of this research is to capture the document intent structure by modeling documents as a mixture of topic words and rhetorical words. While the topics are relatively unchanged through one document, the rhetorical functions of sentences usually change following certain orders in discourse. We propose GMM-LDA, a topic modeling based Bayesian unsupervised model, to analyze the document intent structure cooperated with order information. Our model is flexible that has the ability to combine the annotations and do supervised learning. Additionally, entropic regularization can be introduced to model the significant divergence between topics and intents. We perform experiments in both unsupervised and supervised settings, results show the superiority of our model over several state-of-the-art baselines.

preprint2015arXiv

Magnetic dipolar interaction between correlated triplets created by singlet fission in tetracene crystals

Singlet fission (SF) can potentially break the Shockley-Queisser efficiency limit in single-junction solar cells by splitting one photo-excited singlet exciton (S1) into two triplets (2T1) in organic semiconductors. A dark multi-exciton (ME) state has been proposed as the intermediate connecting S1 to 2T1. However, the exact nature of this ME state, especially how the doubly-excited triplets interact, remains elusive. Here, we report a quantitative study on the magnetic dipolar interaction between SF-induced correlated triplets in tetracene crystals by monitoring quantum beats relevant to the ME sublevels at room temperature. The resonances of ME sublevels approached by tuning an external magnetic field are observed to be avoided, which agrees well with the theoretical predictions considering a magnetic dipolar interaction of ~ 0.008 GHz. Our work paves a way to quantify the magnetic dipolar interaction in organic materials and marks an important step towards understanding the underlying physics of the ME state.

preprint2015arXiv

Max-margin Deep Generative Models

preprint2015arXiv

Member candidates of the star clusters from LAMOST DR2 data

In this work, we provide 2189 photometric- and kinematic-selected member candidates of 24 star clusters from the LAMOST DR2 catalog. We perform two-step membership identification: selection along the stellar track in the color-magnitude diagram, i.e., photometric identification, and the selection from the distribution of radial velocities, i.e. the kinematic identification. We find that the radial velocity from the LAMOST data are very helpful in the membership identification. The mean probability of membership is 40\% for the radial velocity selected sample. With these 24 star clusters, we investigate the performance of the radial velocity and metallicity estimated in the LAMOST pipeline. We find that the systematic offset in radial velocity and metallicity are $0.85\pm1.26$\,\kms\ and $-0.08\pm0.04$\,dex, with dispersions of $5.47_{-0.71}^{+1.16}$\,\kms\ and $0.13_{-0.02}^{+0.04}$\,dex, respectively. Finally, we propose that the photometric member candidates of the clusters covered by the LAMOST footprints should be assigned higher priority so that more member stars can be observed.

preprint2015arXiv

Relativistic Hydrodynamics with Wavelets

Methods to solve the relativistic hydrodynamic equations are a key computational kernel in a large number of astrophysics simulations and are crucial to understanding the electromagnetic signals that originate from the merger of astrophysical compact objects. Because of the many physical length scales present when simulating such mergers, these methods must be highly adaptive and capable of automatically resolving numerous localized features and instabilities that emerge throughout the computational domain across many temporal scales. While this has been historically accomplished with adaptive mesh refinement (AMR) based methods, alternatives based on wavelet bases and the wavelet transformation have recently achieved significant success in adaptive representation for advanced engineering applications. This work presents a new method for the integration of the relativistic hydrodynamic equations using iterated interpolating wavelets and introduces a highly adaptive implementation for multidimensional simulation. The wavelet coefficients provide a direct measure of the local approximation error for the solution and place collocation points that naturally adapt to the fluid flow while providing good conservation of fluid quantities. The resulting implementation, OAHU, is applied to a series of demanding one- and two-dimensional problems which explore high Lorentz factor outflows and the formation of several instabilities, including the Kelvin-Helmholtz instability and the Rayleigh-Taylor instability.

preprint2015arXiv

Research on Pd film deposition rate calculation and simulation based on TiZrV/Pd film coating experiment

The vacuum chamber of accelerator storage ring need clean ultra-high vacuum environment. TiZrV getter film which was deposited on interior wall of vacuum chamber, can realize distributed pumping, effectively improve the vacuum degree and reduce the longitudinal gradient. But accumulation of pollutants such as N2, O2, will decrease the adsorption ability of non-evaporable getter (NEG), which leads to the reduction of NEG lifetime. Therefore, NEG thin film coated with a layer of Pd which has high diffusion rate and absorption ability for H2, can extend the service life of NEG, and improve the pumping rate of H2 at the same time. With argon as discharge gas, magnetron sputtering method was adopted to prepare TiZrV-Pd film in long straight pipe. According to the experimental results of the scanning electron microscope (SEM), deposition rates of TiZrV-Pd films were analyzed under different deposition parameters, the magnetic field strength, the gas flow rate, discharge current, discharge voltage and working pressure. Moreover, comparing the simulation results based on Sigmund's theory and experimental results, it was shown that the deposition rate C can be estimated by the depth sputtered, D for Pd film coatings in this experiment device.

preprint2015arXiv

Research on the secondary electron yield of tizrv-pd thin film coatings

In particle accelerators, the build-up of electron cloud may have important influence on beam quality. Especially for the positron and proton accelerators, massive electrons lead to electron cloud, which affects the stability, energy, emittance and beam life adversely. A secondary electron emission (SEE) measurement system has been designed and used to study the SEE of palladium (Pd), TiZrV and TiZrV-Pd with an independently adjustable energy from 50 eV to 5 keV. Here, we obtained the characteristics of the SEE from Pd, TiZrV and TiZrV-Pd film coatings with different thickness under ultrahigh-vacuum (UHV) conditions. Moreover, the maximum secondary electron yield (SEY), δmax, of the Pd, TiZrV and TiZrV-Pd film coatings under different primary electron doses were obtained, respectively. Finally, the variation of the secondary electron yield with the incident electron energy will be discussed for Pd, TiZrV and TiZrV-Pd thin film coatings. Low SEY is a new advantage of TiZrV-Pd films, besides high H2 absorption ability and prolonging the lifetime of TiZrV film, which will be of great value in the design of beam screen for Super Proton-Proton Collider (SPPC).

preprint2015arXiv

Spectral classification of stars based on LAMOST spectra

In this work, we select the high signal-to-noise ratio spectra of stars from the LAMOST data andmap theirMK classes to the spectral features. The equivalentwidths of the prominent spectral lines, playing the similar role as the multi-color photometry, form a clean stellar locus well ordered by MK classes. The advantage of the stellar locus in line indices is that it gives a natural and continuous classification of stars consistent with either the broadly used MK classes or the stellar astrophysical parameters. We also employ a SVM-based classification algorithm to assignMK classes to the LAMOST stellar spectra. We find that the completenesses of the classification are up to 90% for A and G type stars, while it is down to about 50% for OB and K type stars. About 40% of the OB and K type stars are mis-classified as A and G type stars, respectively. This is likely owe to the difference of the spectral features between the late B type and early A type stars or between the late G and early K type stars are very weak. The relative poor performance of the automatic MK classification with SVM suggests that the directly use of the line indices to classify stars is likely a more preferable choice.

preprint2015arXiv

Study of the element abundances in HD 140283: the abundance robustness of the weak r- and main r-process stars

Many works have attempted to investigate the astrophysical origin of the neutron-capture elements in the metalpoor star HD 140283. However, no definite conclusions have been drawn. In this work, using the abundancedecomposed approach, we find that the metal-poor star HD 140283 is a weak r-process star. Although this star is a weak r-process star, its Ba abundance mainly originates from the main r-process. This is the reason that the ratio [Ba/Eu]= -0.58+- 0.15 for HD 140283 is close to the ratio of the main r-process. Based on the comparison of the abundances in the six-weak r-process stars, we find that their element abundances possess a robust nature. On the other hand, we find that the robust nature of the abundance of the extreme main r-process stars ([r/Fe]>= 1.5) can be extended to the lighter neutron-capture elements. Furthermore, the abundance characteristics of the weak r-process and main r-process are investigated. The abundance robustness of the two category r-process stars could be used as the constraint of the r-process theory and could be used to investigate the astrophysical origins of the elements in the metal-poor stars and population I stars.

preprint2015arXiv

The K giant stars from the LAMOST survey data II: the Hercules stream in radial migration

We estimate the age for the individual stars located at the lower part of the red giant branch from the LAMOST DR2 K giant sample. Taking into account the selection effects and the volume completeness, the age--metallicity map for the stars located between 0.3 and 1.5 kpc from the Sun is obtained. A significant substructure (denoted as the \it{narrow stripe}) located from (age, [Fe/H])$\sim$(5, 0.4) to (10 Gyr, -0.4 dex) in the age--metallicity map is clearly identified. Moreover, the \it{narrow stripe} stars are found the dominate contributors to several velocity substructures, including the well-known Hercules stream. The substantially large difference between the observed guiding-center radii and the birth radii inferred from the age--metallicity relation is evident that the \it{narrow stripe} stars have been radially migrated from about R$\sim4$ kpc to the solar neighborhood. This implies that the Hercules stream may not be owe to the resonance associated with the bar, but may be the kinematic imprint of the inner disk and later moved out due to radial migration. We estimate that the traveling speed of the radial migration are roughly 1.1$\pm0.1$ kpc Gyr$^{-1}$, equivalent with about $1.1\pm0.1$ km s$^{-1}$. This is in agreement with the median $v_R$ of $2.6^{+1.8}_{-1.9}$ km s$^{-1}$ of the \it{narrow stripe}. We also obtain that about one third stars in the solar neighborhood are radially migrated from around 4 kpc. Finally, we find that the radial migration does not lead to additional disk thickening according to the distribution of $z_{max}$.

preprint2014arXiv

Dropout Training for Support Vector Machines

Dropout and other feature noising schemes have shown promising results in controlling over-fitting by artificially corrupting the training data. Though extensive theoretical and empirical studies have been performed for generalized linear models, little work has been done for support vector machines (SVMs), one of the most successful approaches for supervised learning. This paper presents dropout training for linear SVMs. To deal with the intractable expectation of the non-smooth hinge loss under corrupting distributions, we develop an iteratively re-weighted least square (IRLS) algorithm by exploring data augmentation techniques. Our algorithm iteratively minimizes the expectation of a re-weighted least square problem, where the re-weights have closed-form solutions. The similar ideas are applied to develop a new IRLS algorithm for the expected logistic loss under corrupting distributions. Our algorithms offer insights on the connection and difference between the hinge loss and logistic loss in dropout training. Empirical results on several real datasets demonstrate the effectiveness of dropout training on significantly boosting the classification accuracy of linear SVMs.

preprint2014arXiv

Estimating R-Process Yields from Abundances of the Metal-Poor Stars

The chemical abundances of metal-poor stars provide important clues to explore stellar formation history and set significant constraints on models of the r-process. In this work, we find that the abundance patterns of the light and iron group elements of the main r-process stars are very close to those of the weak r-process stars. Based on a detailed abundance comparison, we find that the weak r-process occurs in supernovae with a progenitor mass range of $\sim11-26M_{\odot}$. Using the SN yields given by Heger & Woosley and the abundances of the weak r-process stars, the weak r-process yields are derived. The SNe with a progenitor mass range of $15M_{\odot}<M<26M_{\odot}$ are the main sites of the weak r-process and their contributions are larger than 80%. Using the abundance ratios of the weak r-process and the main r-process in the solar system, the average yields of the main r-process are estimated. The observed correlations of the [neutron-capture/Eu] versus [Eu/Fe] can be explained by mixing of the two r-process abundances in various fractions.

preprint2014arXiv

Gamma-Ray Burst Prompt Emission Light Curves and Power Density Spectra in the ICMART Model

In this paper, we simulate the prompt emission light curves of gamma-ray bursts (GRBs) within the framework of the Internal-Collision-induced MAgnetic Reconnection and Turbulence (ICMART) model. This model applies to GRBs with a moderately-high magnetization parameter $σ$ in the emission region. We show that this model can produce highly variable light curves with both fast and slow components. The rapid variability is caused by many locally Doppler-boosted mini-emitters due to turbulent magnetic reconnection in a moderately-high-$σ$ flow. The run-away growth and subsequent depletion of these mini-emitters as a function time define a broad slow component for each ICMART event. A GRB light curve is usually composed of multiple ICMART events that are fundamentally driven by the erratic GRB central engine activity. Allowing variations of the model parameters, one is able to reproduce a variety of light curves and the power density spectra as observed.

preprint2014arXiv

Is Germanium (Ge, Z=32) A Neutron-Capture Element?

Historically,Ge has been considered to be a neutron-capture element. In this case, the r-process abundance of Ge is derived by subtracting the s-process abundance from the total abundance in the Solar system. However, the Ge abundance of the metal-poor star HD 108317 is lower than that of the scaled residual r-process abundance in the Solar system, about 1.2 dex. In this paper, based on a comparison of the Ge abundances of metal-poor stars and stellar yields, we find that the Ge abundances are not the result of the primary-like yields in massive stars and come mainly from the r-process. Based on the observed abundances of metal-poor stars, we derived the Ge abundances of the weak r-process and main r-process. The contributed percentage of the neutron-capture process to Ge in the Solar system is about 59 per cent, which means that the contributed percentage of the Ge residual abundance in the Solar system is about 41 per cent. We find that the Ge residual abundance is produced as secondary-like yields in massive stars. This implies that the element Ge in the Solar system is not produced solely by the neutron-capture process.

preprint2014arXiv

Polarization-dependent exciton dynamics in tetracene single crystals

We conduct polarization-dependent ultrafast spectroscopy to study the dynamics of singlet fission in tetracene single crystals. The spectrotemporal species for singlet and triplet excitons in transient absorption spectra are found to be strongly dependent on probe polarization. By carefully analyzing the polarization dependence, the signals contributed by different transitions related to singlet excitons have been disentangled, which is further applied to construct the correlation between dynamics of singlet and triplet excitons. The anisotropy of exciton dynamics provides an alternative approach to tackle the long-standing challenge in understanding the mechanism of singlet fission in organic semiconductors.

preprint2014arXiv

Superlinear density dependence of singlet fission rate in tetracene films

We experimentally show that the rate of singlet fission in tetracene films has a superlinear dependence on the density of photo-excited singlet excitons with ultrafast transient absorption spectroscopy. The spectrotemporal features of singlet and triplet dynamics can be disentangled from experimental data with the algorithm of singular value decomposition. The correlation between their temporal dynamics indicates a nonlinear density dependence of fission rate, which leads to a conjecture of coherent singlet fission process arising from superradiant excitons in crystalline tetracene. This hypothesis might be able to resolve some long-standing controversies.

preprint2014arXiv

Through Skull Fluorescence Imaging of the Brain in a New Near-Infrared Window

To date, brain imaging has largely relied on X-ray computed tomography and magnetic resonance angiography with limited spatial resolution and long scanning times. Fluorescence-based brain imaging in the visible and traditional near-infrared regions (400-900 nm) is an alternative but currently requires craniotomy, cranial windows and skull thinning techniques, and the penetration depth is limited to 1-2 mm due to light scattering. Here, we report through-scalp and through-skull fluorescence imaging of mouse cerebral vasculature without craniotomy utilizing the intrinsic photoluminescence of single-walled carbon nanotubes in the 1.3-1.4 micrometre near-infrared window. Reduced photon scattering in this spectral region allows fluorescence imaging reaching a depth of >2 mm in mouse brain with sub-10 micrometre resolution. An imaging rate of ~5.3 frames/s allows for dynamic recording of blood perfusion in the cerebral vessels with sufficient temporal resolution, providing real-time assessment of blood flow anomaly in a mouse middle cerebral artery occlusion stroke model.

preprint2014arXiv

Ultra-Fast Fluorescence Imaging in Vivo with Conjugated Polymer Fluorophores in the Second Near-Infrared Window

In vivo fluorescence imaging in the second near-infrared window (1.0-1.7 microns) can afford deep tissue penetration and high spatial resolution, owing to the reduced scattering of long-wavelength photons. Here, we synthesize a series of low-bandgap donor/acceptor copolymers with tunable emission wavelengths of 1050-1350 nm in this window. Non-covalent functionalization with phospholipid-polyethylene glycol results in water-soluble and biocompatible polymeric nanoparticles, allowing for live cell molecular imaging at > 1000 nm with polymer fluorophores for the first time. Importantly, the high quantum yield of the polymer allows for in vivo, deep-tissue and ultrafast imaging of mouse arterial blood flow with an unprecedented frame rate of > 25 frames per second. The high time resolution results in spatially and time resolved imaging of the blood flow pattern in cardiogram waveform over a single cardiac cycle (~ 200 ms) of a mouse, which has not been observed with fluorescence imaging in this window before.

preprint2014arXiv

Very Long Baseline Interferometry with the SKA

Adding VLBI capability to the SKA arrays will greatly broaden the science of the SKA, and is feasible within the current specifications. SKA-VLBI can be initially implemented by providing phased-array outputs for SKA1-MID and SKA1-SUR and using these extremely sensitive stations with other radio telescopes, and in SKA2 by realising a distributed configuration providing baselines up to thousands of km, merging it with existing VLBI networks. The motivation for and the possible realization of SKA-VLBI is described in this paper.

preprint2014arXiv

Zero Dimensional Polariton Laser in a Sub-Wavelength Grating Based Vertical Microcavity

Semiconductor exciton-polaritons in planar microcavities form coherent two-dimensional condensates in non-equilibrium. However, coupling of multiple lower-dimensional polariton quantum systems, critically needed for polaritonic quantum device applications and novel cavity-lattice physics, has been limited due to the conventional cavity structures. Here we demonstrate full confinement of the polaritons non-destructively using a hybrid cavity made of a single-layer sub-wavelength grating mirror and a distributed Bragg reflector. Single-mode polariton lasing was observed at a chosen polarization. Incorporation of a designable slab mirror into the conventional vertical cavity, when operating in the strong-coupling regime, enables confinement, control and coupling of polariton gasses in a scalable fashion. It may open a door to experimental implementation of polariton-based quantum photonic devices and coupled cavity quantum electrodynamics systems.

preprint2013arXiv

A novel integral equation for scattering by locally rough surfaces and application to the inverse problem

This paper is concerned with the direct and inverse acoustic or electromagnetic scattering problems by a locally perturbed, perfectly reflecting, infinite plane (which is called a locally rough surface in this paper). We propose a novel integral equation formulation for the direct scattering problem which is defined on a bounded curve (consisting of a bounded part of the infinite plane containing the local perturbation and the lower part of a circle) with two corners. This novel integral equation can be solved efficiently by using the Nystrom method with a graded mesh introduced previously by Kress and is capable of dealing with large wavenumber cases. For the inverse problem, we propose a Newton iteration method to reconstruct the local perturbation of the plane from multiple frequency far-field data, based on the novel integral equation formulation. Numerical examples are carried out to demonstrate that our reconstruction method is stable and accurate even for the case of multiple-scale profiles.

preprint2013arXiv

Discriminative Relational Topic Models

Many scientific and engineering fields involve analyzing network data. For document networks, relational topic models (RTMs) provide a probabilistic generative process to describe both the link structure and document contents, and they have shown promise on predicting network structures and discovering latent topic representations. However, existing RTMs have limitations in both the restricted model expressiveness and incapability of dealing with imbalanced network data. To expand the scope and improve the inference accuracy of RTMs, this paper presents three extensions: 1) unlike the common link likelihood with a diagonal weight matrix that allows the-same-topic interactions only, we generalize it to use a full weight matrix that captures all pairwise topic interactions and is applicable to asymmetric networks; 2) instead of doing standard Bayesian inference, we perform regularized Bayesian inference (RegBayes) with a regularization parameter to deal with the imbalanced link structure issue in common real networks and improve the discriminative ability of learned latent representations; and 3) instead of doing variational approximation with strict mean-field assumptions, we present collapsed Gibbs sampling algorithms for the generalized relational topic models by exploring data augmentation without making restricting assumptions. Under the generic RegBayes framework, we carefully investigate two popular discriminative loss functions, namely, the logistic log-loss and the max-margin hinge loss. Experimental results on several real network datasets demonstrate the significance of these extensions on improving the prediction performance, and the time efficiency can be dramatically improved with a simple fast approximation method.

preprint2013arXiv

Gibbs Max-margin Topic Models with Data Augmentation

Max-margin learning is a powerful approach to building classifiers and structured output predictors. Recent work on max-margin supervised topic models has successfully integrated it with Bayesian topic models to discover discriminative latent semantic structures and make accurate predictions for unseen testing data. However, the resulting learning problems are usually hard to solve because of the non-smoothness of the margin loss. Existing approaches to building max-margin supervised topic models rely on an iterative procedure to solve multiple latent SVM subproblems with additional mean-field assumptions on the desired posterior distributions. This paper presents an alternative approach by defining a new max-margin loss. Namely, we present Gibbs max-margin supervised topic models, a latent variable Gibbs classifier to discover hidden topic representations for various tasks, including classification, regression and multi-task learning. Gibbs max-margin supervised topic models minimize an expected margin loss, which is an upper bound of the existing margin loss derived from an expected prediction rule. By introducing augmented variables and integrating out the Dirichlet variables analytically by conjugacy, we develop simple Gibbs sampling algorithms with no restricting assumptions and no need to solve SVM subproblems. Furthermore, each step of the "augment-and-collapse" Gibbs sampling algorithms has an analytical conditional distribution, from which samples can be easily drawn. Experimental results demonstrate significant improvements on time efficiency. The classification performance is also significantly improved over competitors on binary, multi-class and multi-label classification tasks.

preprint2013arXiv

Improved Bayesian Logistic Supervised Topic Models with Data Augmentation

Supervised topic models with a logistic likelihood have two issues that potentially limit their practical use: 1) response variables are usually over-weighted by document word counts; and 2) existing variational inference methods make strict mean-field assumptions. We address these issues by: 1) introducing a regularization constant to better balance the two parts based on an optimization formulation of Bayesian inference; and 2) developing a simple Gibbs sampling algorithm by introducing auxiliary Polya-Gamma variables and collapsing out Dirichlet variables. Our augment-and-collapse sampling algorithm has analytical forms of each conditional distribution without making any restricting assumptions and can be easily parallelized. Empirical results demonstrate significant improvements on prediction performance and time efficiency.

preprint2013arXiv

Inverse electromagnetic scattering problems by a doubly periodic structure

Consider the problem of scattering of electromagnetic waves by a doubly periodic structure. The medium above the structure is assumed to be inhomogeneous characterized completely by an index of refraction. Below the structure is a perfect conductor or an imperfect conductor partially coated with a dielectric. Having established the well-posedness of the direct problem by the variational approach, we prove the uniqueness of the inverse problem, that is, the unique determination of the doubly periodic grating with its physical property and the index of refraction from a knowledge of the scattered near field by a countably infinite number of incident quasi-periodic electromagnetic waves. A key ingredient in our proofs is a novel mixed reciprocity relation derived in this paper.

preprint2013arXiv

Investigation of the Puzzling Abundance Pattern in the Stars of the Fornax Dwarf Spheroidal Galaxy

Many works have found unusual characteristics of elemental abundances in nearby dwarf galaxies. This implies that there is a key factor of galactic evolution that is different from that of the Milky Way (MW). The chemical abundances of the stars in the Fornax dwarf spheroidal galaxy (Fornax dSph) provide excellent information for setting constraints on the models of the galactic chemical evolution. In this work, adopting the five-component approach, we fit the abundances of the Fornax dSph stars, including $α$ elements, iron group elements and neutron-capture elements. For most sample stars, the relative contributions from the various processes to the elemental abundances are not usually in the MW proportions. We find that the contributions from massive stars to the primary $α$ elements and iron group elements increase monotonously with increasing [Fe/H]. This means that the effect of the galactic wind is not strong enough to halt star formation and the contributions from massive stars to $α$ elements did not halted for [Fe/H]$\lesssim$-0.5. The average contributed ratios of various processes between the dSph stars and the MW stars monotonously decrease with increasing progenitor mass. This is important evidence of a bottom-heavy initial mass function (IMF) for the Fonax dSph, compared to the MW. Considering a bottom-heavy IMF for the dSph, the observed relations of [$α$/Fe] versus [Fe/H], [iron group/Fe] versus [Fe/H] and [neutron-capture/Fe] versus [Fe/H] for the dSph stars can be explained.

preprint2013arXiv

Lattice Boltzmann based discrete simulation for gas-solid fluidization

Discrete particle simulation, a combined approach of computational fluid dynamics and discrete methods such as DEM (Discrete Element Method), DSMC (Direct Simulation Monte Carlo), SPH (Smoothed Particle Hydrodynamics), PIC (Particle-In-Cell), etc., is becoming a practical tool for exploring lab-scale gas-solid systems owing to the fast development of parallel computation. However, gas-solid coupling and the corresponding fluid flow solver remain immature. In this work, we propose a modified lattice Boltzmann approach to consider the effect of both the local solid volume fraction and the local relative velocity between particles and fluid, which is different from the traditional volume-averaged Navier-Stokes equations. A time-driven hard sphere algorithm is combined to simulate the motion of individual particles, in which particles interact with each other via hard-sphere collisions, the collision detection and motion of particles are performed at constant time intervals. The EMMS (energy minimization multi-scale) drag is coupled with the lattice Boltzmann based discrete particle simulation to improve the accuracy. Two typical fluidization processes, namely, a single bubble injection at incipient fluidization and particle clustering in a fast fluidized bed riser, are simulated with this approach, with the results showing a good agreement with published correlations and experimental data. The capability of the approach to capture more detailed and intrinsic characteristics of particle-fluid systems is demonstrated. The method can also be used straightforward with other solid phase solvers.

preprint2013arXiv

Magnetization Characteristic of Ferromagnetic Thin Strip by Measuring Anisotropic Magnetoresistance and Ferromagnetic Resonance

The magnetization characteristic in a permalloy thin strip is investigated by electrically measuring the anisotropic magnetoresistance and ferromagnetic resonance in in-plane and out-of-plane configurations. Our results indicate that the magnetization vector can rotate in the film plane as well as out of the film plane by changing the intensity of external magnetic field of certain direction. The magnetization characteristic can be explained by considering demagnetization and magnetic anisotropy. Our method can be used to obtain the demagnetization factor, saturated magnetic moment and the magnetic anisotropy.

preprint2013arXiv

Planning and Acting under Uncertainty: A New Model for Spoken Dialogue Systems

Uncertainty plays a central role in spoken dialogue systems. Some stochastic models like Markov decision process (MDP) are used to model the dialogue manager. But the partially observable system state and user intention hinder the natural representation of the dialogue state. MDP-based system degrades fast when uncertainty about a user's intention increases. We propose a novel dialogue model based on the partially observable Markov decision process (POMDP). We use hidden system states and user intentions as the state set, parser results and low-level information as the observation set, domain actions and dialogue repair actions as the action set. Here the low-level information is extracted from different input modals, including speech, keyboard, mouse, etc., using Bayesian networks. Because of the limitation of the exact algorithms, we focus on heuristic approximation algorithms and their applicability in POMDP for dialogue management. We also propose two methods for grid point selection in grid-based approximation algorithms.

preprint2013arXiv

Study of Neutron-Capture Element Abundances in Metal-Poor Stars

This work describes a study of elemental abundances for 30 metal-poor stars whose chemical abundances provide excellent information for setting constraints on models of neutron-capture processes. Based on the abundances of main r- and weak r-process stars, the abundance patterns of main r-process and weak r-process are obtained. The two r-process component coefficients are defined to determine the relative contributions from individual neutron-capture process to abundances of metal-poor stars. Based on the component coefficients, we find that metal-poor stars BD+4 2621 and HD 4306 are also weak r-process stars, which means that the abundance pattern produced by weak r-process is stable. All metal-poor star abundances contain the contributions of both main r-process and weak r-process. The elements produced by weak r-process have increased along with Fe over the polluted history. Most of the metal-poor star abundances do not follow the pattern observed in solar system, but there is a small fraction that do. For the low-[Sr/Fe] star BD-18 5550 ([Sr/Fe]$\lesssim-1$), neutron-capture element abundances can be explained by the mixture of two r-process components. Since lighter elements in this star cannot be fitted by the two components, the abundance pattern of P-component is estimated from those abundances.

preprint2013arXiv

The study of s-process nucleosynthesis based on barium stars, CEMP-s and CEMP-r/s stars

In order to get a broader view of the s-process nucleosynthesis we study the abundance distribution of heavy elements of 35 barium stars and 24 CEMP-stars, including nine CEMP-s stars and 15 CEMP-r/s stars. The similar distribution of [Pb/hs] between CEMP-s and CEMP-r/s stars indicate that the s-process material of both CEMP-s and CEMP-r/s stars should have a uniform origin, i.e. mass transfer from their predominant AGB companions. For the CEMP-r/s stars, we found that the r-process should provide similar proportional contributes to the second s-peak and the third s-peak elements, and also be responsible for the higher overabundance of heavy elements than those in CEMP-s stars. Which hints that the r-process origin of CEMP-r/s stars should be closely linked to the main r-process. The fact that some small $r$ values exist for both barium and CEMP-s stars, implies that the single exposure event of the s-process nucleosynthesis should be general in a wide metallicity range of our Galaxy. Based on the relation between $C_{r}$ and $C_{s}$, we suggest that the origin of r-elements for CEMP-r/s stars have more sources. A common scenario is that the formation of the binary system was triggered by only one or a few supernova. In addition, accretion-induced collapse(AIC) or SN 1.5 should be the supplementary scenario, especially for these whose pre-AGB companion with higher mass and smaller orbit radius, which support the higher values of both $C_{r}$ and $C_{s}$.

preprint2012arXiv

A Newton method for simultaneous reconstruction of an interface and a buried obstacle from far-field data

This paper is concerned with the inverse problem of scattering of time-harmonic acoustic waves from a penetrable and buried obstacles. By introducing a related transmission scattering problem, a Newton iteration method is proposed to simultaneously reconstruct both the penetrable interface and the buried obstacle inside from far-field data. A main feature of our method is that we do not need to know the type of boundary conditions on the buried obstacle. In particular, the boundary condition on the buried obstacle can also be determined simultaneously by the method. Finally, numerical examples using multi-frequency data are carried out to illustrate the effectiveness of our method.

preprint2012arXiv

Study of The Abundance Patterns in The Metal-Poor Stellar Stream

The chemical abundances of the metal-poor stars in the stellar stream provide important information for setting constraints on models of neutron-capture processes. The study of these stars could give us a better understanding of r-process nucleosynthesis and chemical composition of the early Galaxy. Using the updated main r-process and weak r-process patterns, we fit abundances in the stellar stream stars. The weak r-process component coefficients are almost constant for the sample stars, including r-rich stars, which means that both weak r-process and Fe are produced as primary elements from SNeII and their yields have nearly a constant mass fraction. The difference between the stream stars and r-rich stars is obvious. For the stream stars, that the increase trend in the main r-process component coefficients as metallicity increases means the gradual increase in the production of main r-process elements relative to iron. This behavior implies that the masses of progenitors for the main r-process are smaller than those of the weak r-process. Furthermore, we find metal-poor stream star HD 237846 is a weak r-process star.

preprint2012arXiv

Three-dimensional imaging of single nanotube molecule endocytosis on plasmonic substrates

Investigating the cellular internalization pathways of single molecules or single nano-objects is important to understanding cell-matter interactions and to applications in drug delivery and discovery. Imaging and tracking the motion of single molecules on cell plasma membrane require high spatial resolution in three dimensions (3D). Fluorescence imaging along the axial dimension with nanometer resolution has been highly challenging but critical to revealing displacements in trans-membrane events. Here, utilizing a plasmonic ruler based on the sensitive distance dependence of near-infrared fluorescence enhancement (NIR-FE) of carbon nanotubes on a gold plasmonic substrate, we probe ~10 nm scale trans-membrane displacements through changes in nanotube fluorescence intensity, enabling observations of single nanotube endocytosis in 3D. Cellular uptake and trans-membrane displacements show clear dependences to temperature and clathrin assembly on cell membrane, suggesting that the cellular entry mechanism for a nanotube molecule is via clathrin-dependent endocytosis through the formation of clathrin-coated pits on cell membrane.

preprint2012arXiv

Urban Freight Transportation Planning: A Dynamic Stackelberg Game-Theoretic Approach

In this paper we propose a dynamic Stackelberg game-theoretic model for urban freight transportation planning which is able to characterize the interaction between freight and personal transportation in an urban area. The problem is formulated as a bi-level dynamic mathematical program with equilibrium constraints (MPEC) which belongs to a class of computationally challenging problems. The lower level is dynamic user equilibrium (DUE) with inhomogeneous traffic that characterizes traffic system optimum (SO) freight transportation planning problem which aims at minimizing the total cost to a truck company. A mathematical program with complementarity constraints (MPCC) reformulation is derived and a projected gradient algorithm is designed to solve this computationally challenging problem. Numerical experiments are conducted to show that when planning freight transportation the background traffic is nonnegligible, even though the amount of trucks compared to other vehicles traveling on the same network is relatively small. What's more, in our proposed bi-level model for urban freight transportation planning, we find a dynamic case of a Braess-like Paradox which can provide managerial insights to a metropolitan planning organization (MPO) in increasing social welfare by restricting freight movement.

preprint2011arXiv

A new embedding quality assessment method for manifold learning

Manifold learning is a hot research topic in the field of computer science. A crucial issue with current manifold learning methods is that they lack a natural quantitative measure to assess the quality of learned embeddings, which greatly limits their applications to real-world problems. In this paper, a new embedding quality assessment method for manifold learning, named as Normalization Independent Embedding Quality Assessment (NIEQA), is proposed. Compared with current assessment methods which are limited to isometric embeddings, the NIEQA method has a much larger application range due to two features. First, it is based on a new measure which can effectively evaluate how well local neighborhood geometry is preserved under normalization, hence it can be applied to both isometric and normalized embeddings. Second, it can provide both local and global evaluations to output an overall assessment. Therefore, NIEQA can serve as a natural tool in model selection and evaluation tasks for manifold learning. Experimental results on benchmark data sets validate the effectiveness of the proposed method.

preprint2011arXiv

A unitary quantum lattice gas algorithm for two dimensional quantum turbulence

Quantum vortex structures and energy cascades are examined for two dimensional quantum turbulence (2D QT) at zero temperature. A special unitary evolution algorithm, the quantum lattice gas (QLG) algorithm, is employed to simulate the Bose-Einstein condensate (BEC) governed by the Gross-Pitaevskii (GP) equation. A parameter regime is uncovered in which, as in 3D QT, there is a short Poincaré recurrence time. It is demonstrated that such short recurrence times are destroyed as the nonlinear interaction is strengthened. The similar loss of Poincaré recurrence is also reported in 3D QT [1] Energy cascades for 2D QT are considered to examine whether 2D QT exhibits inverse cascades as in 2D classical turbulence. In the parameter regime considered, the spectra analysis reveals no such dual cascades-dual cascades being a hallmark of 2D classical turbulence.

preprint2011arXiv

Near-Infrared Fluorescence Enhanced (NIR-FE) Molecular Imaging of Live Cells on Gold Substrates

Low quantum yields of near infrared (NIR) fluorophores have limited their capabilities as imaging probes in a transparent, low background imaging window. Here for the first time we reported near-infrared fluorescence enhance (NIR-FE) cell imaging using nanostructured Au substrate, which was employed as a general platform for both single-walled carbon nanotubes (SWNTs) and organic fluorescent labels in the NIR region. Fluorescence intensity, as well as cell targeting specificity, was greatly improved by this novel imaging technique. With NIR-FE imaging, we were able to image SWNT-stained cells at short exposure time of 300ms, and push the detectable limit of SWNT staining of cells down to an ultralow concentration of ~50 pM. Further, different degrees of fluorescence enhancement for endocytosed, intracellular SWNTs vs. nanotubes on the cell membrane at the cell/gold interface were observed, suggesting the possibility of using this technique to track the transmembrane behavior of NIR fluorophores.

preprint2011arXiv

Optimal error estimates and energy conservation identities of the ADI-FDTD scheme on staggered grids for 3D Maxwell's equations

This paper is concerned with the optimal error estimates and energy conservation properties of the alternating direction implicit finite-difference time-domain (ADI-FDTD) method which is a popular scheme for solving the 3D Maxwell equations. Precisely, for the case with a perfectly electric conducting (PEC) boundary condition we establish the optimal second-order error estimates in both space and time in the discrete $H^1$-norm for the ADI-FDTD scheme and prove the approximate divergence preserving property that if the divergence of the initial electric and magnetic fields are zero then the discrete $L^2$-norm of the discrete divergence of the ADI-FDTD solution is approximately zero with the second-order accuracy in both space and time. A key ingredient is two new discrete energy norms which are second-order in time perturbations of two new energy conservation laws for the Maxwell equations introduced in this paper. Furthermore, we prove that, in addition to two known discrete energy identities which are second-order in time perturbations of two known energy conservation laws, the ADI-FDTD scheme also satisfies two new discrete energy identities which are second-order in time perturbations of the two new energy conservation laws. This means that the ADI-FDTD scheme is unconditionally stable under the four discrete energy norms. Experimental results are presented which confirm the theoretical results.

preprint2011arXiv

Poincare recurrence and intermittent loss of quantum Kelvin wave cascades in quantum turbulence

The evolution of the ground state wave function of a zero-temperature Bose-Einstein condensate (BEC) is well described by the Hamiltonian Gross-Pitaevskii (GP) equation. Using a set of appropriately interleaved unitary collision-streaming operators, a quantum lattice gas algorithm is devised which on taking moments recovers the Gross-Pitaevskii (GP) equation in diffusion ordering (time scales as square of length). Unexpectedly, there is a class of initial conditions in which their Poincare recurrence is extremely short. Further it is shown that the Poincare recurrence time scales with diffusion ordering as the the grid is increased. The spectral results of Yepez et.al. [1] for quantum turbulence are corrected and it is found that it is the compressible kinetic energy spectrum that exhibits the 3 cascade regions: a small k classical Kolmogorov k^(-5/3) spectrum, a steep semi-classical cascade region, and a large k quantum Kelvin wave cascade k^(-3) spectrum. The incompressible kinetic energy spectrum exhibits basically a single cascade power law of k^(-3). For winding number 1 linear vortices it is also shown that there is an intermittent loss of Kelvin wave cascade with its signature seen in the time evolution of the kinetic energy, the loss of the k^(-3) spectrum in the incompressible kinetic energy spectrum as well as the minimization of the vortex core isosurfaces that inhibits the Kelvin wave cascade.

preprint2011arXiv

The Bar and Spiral Structure Legacy (BeSSeL) Survey: Mapping the Milky Way with VLBI Astrometry

Astrometric Very Long Baseline Interferometry (VLBI) observations of maser sources in the Milky Way are used to map the spiral structure of our Galaxy and to determine fundamental parameters such as the rotation velocity ($Θ_0$) and curve and the distance to the Galactic center (R$_0$). Here, we present an update on our first results, implementing a recent change in the knowledge about the Solar motion. It seems unavoidable that the IAU recommended values for R$_0$ and $Θ_0$ need a substantial revision. In particular the combination of 8.5 kpc and 220 \kms\, can be ruled out with high confidence. Combining the maser data with the distance to the Galactic center from stellar orbits and the proper motion of Sgr\,A* gives best values of R$_0$ = 8.3 $\pm$ 0.23 kpc and $Θ_0$ = 239 or 246 $\pm$ 7 \kms, for Solar motions of V$_ \odot$ = 12.23 and 5.25 \kms, respectively. Finally, we give an outlook to future observations in the Bar and Spiral Structure Legacy (BeSSeL) Survey.

preprint2010arXiv

A holistic abundance analysis of r-rich stars

The chemical abundances of metal-poor stars are an excellent test bed by which to set new constraints on models of neutron-capture processes at low metallicity. Some r-process-rich (hereafter r-rich) metal-poor stars, such as HD221170, show an overabundance of the heavier neutron-capture elements and excesses of lighter neutron-capture elements. The study of these r-rich stars could give us a better understanding of weak and main r-process nucleosynthesis at low metallicity. Based on conclusions from the observation of metal-poor stars and neutron-capture element nucleosynthesis theory, we set up a model to determine the relative contributions from weak and main r-processes to the heavy-element abundances in metal-poor stars. Using this model, we find that the abundance patterns of light elements for most sample stars are close to the pattern of weak r-process stars, and those of heavier neutron-capture elements very similar to the pattern of main r-process stars, while the lighter neutron-capture elements can be fitted by the mixing of weak and main r-process material. The production of weak r-process elements appears to be associated with the light elements, while the production of main r-process elements is almost decoupled from that of the light elements. We compare our results with the observed data at low metallicities, showing that the predicted trends are in good agreement with the observed trends, at least for the metallicity range [Fe/H] < -2.1. For most sample stars, the abundance patterns of both neutron-capture elements and light elements could be best explained by a star formed in a molecular cloud that has been polluted by both weak and main r-process material.

preprint2010arXiv

An Explicit Nonlinear Mapping for Manifold Learning

Manifold learning is a hot research topic in the field of computer science and has many applications in the real world. A main drawback of manifold learning methods is, however, that there is no explicit mappings from the input data manifold to the output embedding. This prohibits the application of manifold learning methods in many practical problems such as classification and target detection. Previously, in order to provide explicit mappings for manifold learning methods, many methods have been proposed to get an approximate explicit representation mapping with the assumption that there exists a linear projection between the high-dimensional data samples and their low-dimensional embedding. However, this linearity assumption may be too restrictive. In this paper, an explicit nonlinear mapping is proposed for manifold learning, based on the assumption that there exists a polynomial mapping between the high-dimensional data samples and their low-dimensional representations. As far as we know, this is the first time that an explicit nonlinear mapping for manifold learning is given. In particular, we apply this to the method of Locally Linear Embedding (LLE) and derive an explicit nonlinear manifold learning algorithm, named Neighborhood Preserving Polynomial Embedding (NPPE). Experimental results on both synthetic and real-world data show that the proposed mapping is much more effective in preserving the local neighborhood information and the nonlinear geometry of the high-dimensional data samples than previous work.

preprint2010arXiv

An inverse electromagnetic scattering problem for a bi-periodic inhomogeneous layer on a perfectly conducting plate

This paper is concerned with uniqueness for reconstructing a periodic inhomogeneous medium covered on a perfectly conducting plate. We deal with the problem in the frame of time-harmonic Maxwell systems without TE or TM polarization. An orthogonal relation for two refractive indices is obtained, and then inspired by Kirsch's idea, the refractive index can be identified by utilizing the eigenvalues and eigenfunctions of a quasi-periodic Sturm-Liouville eigenvalue problem.

preprint2010arXiv

Dual condensate and QCD phase transition

The dual condensate is a new QCD phase transition order parameter, which connnects confinement and chiral symmetry breaking as different mass limits. We discuss the relation between the fermion spectrum at general boundary conditions and the dual condensate and show numerical results for the latter from unquenched SU(3) lattice configurations.

preprint2010arXiv

Intrinsic dimension estimation of data by principal component analysis

Estimating intrinsic dimensionality of data is a classic problem in pattern recognition and statistics. Principal Component Analysis (PCA) is a powerful tool in discovering dimensionality of data sets with a linear structure; it, however, becomes ineffective when data have a nonlinear structure. In this paper, we propose a new PCA-based method to estimate intrinsic dimension of data with nonlinear structures. Our method works by first finding a minimal cover of the data set, then performing PCA locally on each subset in the cover and finally giving the estimation result by checking up the data variance on all small neighborhood regions. The proposed method utilizes the whole data set to estimate its intrinsic dimension and is convenient for incremental learning. In addition, our new PCA procedure can filter out noise in data and converge to a stable estimation with the neighborhood region size increasing. Experiments on synthetic and real world data sets show effectiveness of the proposed method.

preprint2010arXiv

Investigation for the enrichment pattern of the element abundances in r+s star HE 0338-3945: a special r-II star?

The very metal-poor star HE 0338-3945 shows a double-enhanced pattern of the neutron-capture elements. The study to this sample could make people gain a better understanding of s- and r-process nucleosynthesis at low metallicity. Using a parametric model,we find that the abundance pattern of the neutron-capture elements could be best explained by a binary system formed in a molecular cloud, which had been polluted by r-process material. The observed abundance pattern of C and N can be explained by an AGB model(Karakas & Lattanzio 2007), . Combing with the parameters obtained from Cui & Zhang (2006), we suggest that the initial mass of the AGB companion is most likely to be about 2.5Msun, which excludes the possibility of forming a type-1.5 supernova. By comparing with the observational abundance pattern of CS 22892-052, we find that the dominating production of O should accompany with the production of the heavy r-process elements of r+s stars. Similar to r-II stars, the heavy r-process elements are not produced in conjunction with all the light elements from Na to Fe group. The abundance pattern of the light and r-process elements for HE 0338-3945 is very close to the pattern of the r-II star CS 22892-052. So, we suggest that this star HE 0338-3945 should be a special r-II star.

preprint2010arXiv

Magnetic Field Control of the Quantum Chaotic Dynamics of Hydrogen Analogues in an Anisotropic Crystal Field

We report magnetic field control of the quantum chaotic dynamics of hydrogen analogues in an anisotropic solid state environment. The chaoticity of the system dynamics was quantified by means of energy level statistics. We analyzed the magnetic field dependence of the statistical distribution of the impurity energy levels and found a smooth transition between the Poisson limit and the Wigner limit, i.e. transition between regular Poisson and fully chaotic Wigner dynamics. Effect of the crystal field anisotropy on the quantum chaotic dynamics, which manifests itself in characteristic transitions between regularity and chaos for different field orientations, was demonstrated.

preprint2010arXiv

Study of isotopic fractions and abundances of the neutron-capture elements in HD 175305

The chemical abundances of metal-poor stars are excellent sources of information for setting new constraints on models of Galactic chemical evolution at low metallicities. In this paper we present an attempt to fit the elemental abundances observed in the bright, metal-poor giant HD 175305, and derive isotopic fractions using a parametric model. The observed abundances can be wellmatched by the combined contributions froms- and r-processmaterial. The component coefficients of the r- and s-processes are C1 = 3.220 and C3 = 1.134, respectively. The Smisotopic fraction in this star where the observed neutron-capture elements are produced is predicted to be f 152+154 =0.582,which suggests that, even though the r-process is predominantly responsible for the synthesis of the neutron-capture elements in the early Galaxy, the onset of the s-process had already occurred at this metallicity of [Fe/H] = -1.6.

preprint2010arXiv

Synchro-Curvature Self-Compton Radiation of Electrons in Curved Magnetic Fields

In this paper we present the spectrum of synchro-curvature self-Compton (SCSC) radiation of relativistic electrons with a power-law distribution of Lorentz factors. We find that the resulting spectrum is significantly different from that of either synchrotron self-Compton or curvature self-Compton radiation if both the curvature radius of the magnetic field and the cyclotron radius of the electrons are within some proper ranges. The effects of electrons' cooling and drifting, the low-energy self absorption in seed spectra, and the Klein-Nishina cutoff are also discussed, in order to get an accurate picture. We take gamma-ray bursts (GRBs) as our example environment for discussions. The results would be considered as a universal approach of the self-Compton emission of relativistic electrons moving in curved magnetic fields, and thus could be applied to many astrophysical phenomena, including GRBs, active galactic nuclei (AGNs), and pulsars.

preprint2010arXiv

The inverse electromagnetic scattering problem in a piecewise homogeneous medium

This paper is concerned with the problem of scattering of time-harmonic electromagnetic waves from an impenetrable obstacle in a piecewise homogeneous medium. The well-posedness of the direct problem is established, employing the integral equation method. Inspired by a novel idea developed by Hahner [11], we prove that the penetrable interface between layers can be uniquely determined from a knowledge of the electric far field pattern for incident plane waves. Then, using the idea developed by Liu and Zhang [21], a new mixed reciprocity relation is obtained and used to show that the impenetrable obstacle with its physical property can also be recovered. Note that the wave numbers in the corresponding medium may be different and therefore this work can be considered as a generalization of the uniqueness result of [20].

preprint2010arXiv

The linear sampling method for the inverse electromagnetic scattering by a partially coated bi-periodic structure

In this paper, we consider the inverse problem of recovering a doubly periodic Lipschitz structure through the measurement of the scattered field above the structure produced by point sources lying above the structure. The medium above the structure is assumed to be homogenous and lossless with a positive dielectric coefficient. Below the structure is a perfect conductor partially coated with a dielectric. A periodic version of the linear sampling method is developed to reconstruct the doubly periodic structure using the near field data. In this case, the far field equation defined on the unit ball of R^3 is replaced by the near field equation which is a linear integral equation of the first kind defined on a plane above the periodic surface.

preprint2010arXiv

Vortex content of calorons and deconfinement mechanism

We reveal the center vortex content of SU(2) calorons and ensembles of them. While one part of the vortex connects the constituent dyons of a single caloron, another part is predominantly spatial and can be related to the twist that exists in the caloron gauge field. The latter part depends strongly on the caloron holonomy and degenerates to a plane between the dyons when the asymptotic Polyakov loop is traceless. Correspondingly, the spatial vortex in caloron ensembles is percolating in this case. This finding fits perfectly in the confinement scenario of vortices and shows that calorons are suitable to facilitate the vortex (de)confinement mechanism.

preprint2009arXiv

Heating rate and spin flip lifetime due to near field noise in layered superconducting atom chips

We theoretically investigate the heating rate and spin flip lifetimes due to near field noise for atoms trapped close to layered superconducting structures. In particular, we compare the case of a gold layer deposited above a superconductor with the case of a bare superconductor. We study a niobium-based and a YBCO-based chip. For both niobium and YBCO chips at a temperature of 4.2 K, we find that the deposition of the gold layer can have a significant impact on the heating rate and spin flip lifetime, as a result of the increase of the near field noise. At a chip temperature of 77 K, this effect is less pronounced for the YBCO chip.

preprint2009arXiv

The Vortex Structure of SU(2) Calorons

We reveal the center vortex content of SU(2) calorons and ensembles of them. We use Laplacian Center Gauge as well as Maximal Center Gauges to show that the vortex in a single caloron consists of two parts. The first one connects the constituent dyons of the caloron (which are monopoles in Laplacian Abelian Gauge) and extends in time. The second part is predominantly spatial, encloses one of the dyons and can be related to the twist in the caloron gauge field. This part depends strongly on the caloron holonomy and degenerates to a plane when the holonomy is maximally nontrivial, i.e. when the asymptotic Polyakov loop is traceless. Correspondingly, we find the spatial vortices in caloron ensembles to percolate in this case. This finding fits perfectly in the confinement scenario of vortices and shows that calorons are suitable to facilitate the vortex confinement mechanism.

preprint2009arXiv

Vortex Content of SU(2) Calorons and Multi-Calorons

We use Laplacian Center Gauge to reveal the vortex content of single SU(2) calorons and multi-caloron systems at different holonomies. The vortex surfaces in a single SU(2) caloron consist of two parts that are induced by the constituent dyon charges and by the twist between the dyons, respectively. The latter part percolates in a caloron ensemble at maximal nontrivial holonomy. This finding fits perfectly in the confinement scenario of vortices and shows that calorons are suitable to facilitate the vortex confinement mechanism.

preprint2008arXiv

Superconducting atom chips: advantages and challenges

Superconductors are considered in view of applications to atom chip devices. The main features of magnetic traps based on superconducting wires in the Meissner and mixed states are discussed. The former state may mainly be interesting for improved atom optics, while in the latter, cold atoms may provide a probe of superconductor phenomena. The properties of a magnetic side guide based on a single superconducting strip wire placed in an external magnetic field are calculated analytically and numerically. In the mixed state of type II superconductors, inhomogeneous trapped magnetic flux, relaxation processes and noise caused by vortex motion are posing specific challenges for atom trapping.

Bo Zhang

What is connected

Connect this record

See the researcher in context

Building this map preview

199 published item(s)

AEGIS: A Holistic Benchmark for Evaluating Forensic Analysis of AI-Generated Academic Images

GeoBench: Rethinking Multimodal Geometric Problem-Solving via Hierarchical Evaluation

SCP: Accelerating Discovery with a Global Web of Autonomous Scientific Agents

An Event-Oriented Diffusion-Refinement Method for Sparse Events Completion

Outer-space branch-and-bound algorithm for generalized linear multiplicative programs

AI of Brain and Cognitive Sciences: From the Perspective of First Principles

Danlu Tongdu tablets treat lumbar spinal stenosis through reducing reactive oxygen species and apoptosis by regulating CDK2/CDK4/CDKN1A expression

DarkVision: A Benchmark for Low-light Image/Video Perception

Generalizing the intention-to-treat effect of an active control against placebo from historical placebo-controlled trials to an active-controlled trial: A case study of the efficacy of daily oral TDF/FTC in the HPTN 084 study

MobileVLM : A Fast, Strong and Open Vision Language Assistant for Mobile Devices

Towards simultaneous coherent radiation in the visible and microwave bands with doped molecular crystals

YOLOv6 v3.0: A Full-Scale Reloading

A new model for preferential attachment scheme with time-varying parameters

A new preferential model with homophily for recommender systems

A physical perturbation based study on the prediction of free-fall disks with chaotic modes in the water

A Semiparametric Approach to Model-based Sensitivity Analysis in Observational Studies

A VLBA Trigonometric Parallax for RR Aql and the Mira PL Relation

Adaptable Text Matching via Meta-Weight Regulator

Adversarial Texture for Fooling Person Detectors in the Physical World

Aspect-specific Context Modeling for Aspect-based Sentiment Analysis

Asymptotic Inference for Infinitely Imbalanced Logistic Regression

Blind Source Separation over Space

Bringing Old Films Back to Life

Contrastive Cross-domain Recommendation in Matching

Diagnosing Circumburst Environment with Multiband Gamma-Ray Burst Radio Afterglows

Diagnosis of ultrafast ultraintense laser pulse characteristics by machine-learning-assisted electron spin

Disentangled Inference for GANs with Latently Invertible Autoencoder

Enhanced quantum sensing with room-temperature solid-state masers

Estimating the Optimal Covariance with Imperfect Mean in Diffusion Probabilistic Models

Factor Modelling for Clustering High-dimensional Time Series

Fast Density Estimation for Density-based Clustering Methods

Fast Lossless Neural Compression with Integer-Only Discrete Flows

Guiding self-assembly of active colloids by temporal modulation of activity

Human-centric Image Cropping with Partition-aware and Content-preserving Features

Hyperuniform Active Chiral Fluids with Tunable Internal Structure

Joint Distribution Alignment via Adversarial Learning for Domain Adaptive Object Detection

LAMOST medium-resolution spectroscopic survey of binarity and exotic star (LAMOST-MRS-B): Observation strategy and target selection

Learning Cross-Image Object Semantic Relation in Transformer for Few-Shot Fine-Grained Image Classification

Li-rich Giants in LAMOST Survey. III. The statistical analysis of Li-rich giants

Machine learning for percolation utilizing auxiliary Ising variables

Mining Error Templates for Grammatical Error Correction

MuCGEC: a Multi-Reference Multi-Source Evaluation Dataset for Chinese Grammatical Error Correction

Multi-granularity Item-based Contrastive Recommendation

Neutron spectroscopy evidence for a possible magnetic-field-induced gapless quantum-spin-liquid phase in a Kitaev material $α$-RuCl$_3$

On the HI Content of MaNGA Major Merger Pairs

OPA: Object Placement Assessment Dataset

Pretraining is All You Need for Image-to-Image Translation

Radio properties of the OH megamaser galaxy IIZw 096

Real-Time Neural Character Rendering with Pose-Guided Multiplane Images

Robust PCA for High Dimensional Data based on Characteristic Transformation

Robust quantum control for the manipulation of solid-state spins

Some Reflections on Drawing Causal Inference using Textual Data: Parallels Between Human Subjects and Organized Texts

Spatial Transformation for Image Composition via Correspondence Learning

Statistical matching and subclassification with a continuous dose: characterization, algorithm, and application to a health outcomes study

StyleSwin: Transformer-based GAN for High-resolution Image Generation

Testing Biased Randomization Assumptions and Quantifying Imperfect Matching and Residual Confounding in Matched Observational Studies

The Eclipsing Binaries from the LAMOST Medium-resolution Survey.III. A High-precision Empirical Stellar Mass Library

The Properties and Evolutions of Starspots on Three Detached Eclipsing Binaries in the LAMOST-Kepler survey

The Role of Placebo Samples in Observational Studies

TransLog: A Unified Transformer-based Framework for Log Anomaly Detection

Uniqueness in inverse diffraction grating problems with infinitely many plane waves at a fixed frequency

Water Maser Survey towards off-plane O-rich AGBs around the orbital plane of the Sagittarius Stellar Stream

YOLOv6: A Single-Stage Object Detection Framework for Industrial Applications

AutoKWS: Keyword Spotting with Differentiable Architecture Search

Convergence of the uniaxial PML method for time-domain electromagnetic scattering problems

DARTS-: Robustly Stepping out of Performance Collapse Without Indicators

Data completion algorithms and their applications in inverse acoustic scattering with limited-aperture backscattering data

Deep Sketch-guided Cartoon Video Inbetweening

Efficient Compressed Sensing Based Image Coding by Using Gray Transformation

LAMOST Time-Domain Survey: First Results of four $K$2 plates

LTD064402+245919: A Subgiant with a 1-3 M$_{\odot}$ Undetected Companion Identified from LAMOST-TD Data

Polar state memory in active fluids

Prototypical Pseudo Label Denoising and Target Structure Learning for Domain Adaptive Semantic Segmentation

Robust Dynamical Decoupling for the Manipulation of a Spin Network via a Single Spin