Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
40works
0followers
25topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

40 published item(s)

preprint2026arXiv

TRAP: Tail-aware Ranking Attack for World-Model Planning

World models enable long-horizon planning by internally generating and evaluating imagined trajectories, making them a promising foundation for generalist agents. However, this imagination-driven decision process also introduces new security risks. Existing backdoor attacks typically aim to manipulate local features, one-step predictions, or instantaneous policy outputs. While such objectives may suffice for weaker reactive models, they are often ineffective against world models, where the learned dynamics prior and planning process can absorb or wash out the effects of shallow perturbations. More importantly, we find that world models exhibit a distinct backdoor vulnerability rooted in the long-tailed ranking structure of imagined trajectories, where disrupting the ordering of a few decision-critical trajectories can systematically hijack planning. To exploit this vulnerability, we propose TRAP, a backdoor attack framework for world models that targets imagined trajectory ranking. TRAP combines a tail-aware ranking loss to focus optimization on decision-critical trajectories with dual gating mechanisms that stabilize optimization and regulate when and where the attack penalty is applied. Under trigger conditions, TRAP alters the relative ranking of imagined trajectories to redirect planning outcomes, while largely maintaining the normal ranking structure on clean inputs. Experiments on DreamerV3 and TD-MPC2 across diverse tasks show that TRAP consistently induces sustained behavioral deviations and significant performance degradation, highlighting the need for dedicated security evaluation of world-model-based agents.

preprint2024arXiv

Integrated lithium niobate microwave photonic processing engine

Integrated microwave photonics is an intriguing field that leverages integrated photonic technologies for the generation, transmission, and manipulation of microwave signals in chip-scale optical systems. In particular, ultrafast processing and computation of analog electronic signals in the optical domain with high fidelity and low latency could enable a variety of applications such as MWP filters, microwave signal processing, and image recognition. An ideal photonic platform for achieving these integrated MWP processing tasks shall simultaneously offer an efficient, linear and high-speed electro-optic modulation block to faithfully perform microwave-optic conversion at low power, and a low-loss functional photonic network that can be configured for a variety of signal processing tasks, as well as large-scale, low-cost manufacturability to monolithically integrate the two building blocks on the same chip. In this work, we demonstrate such an integrated MWP processing engine based on a thin-film lithium niobate platform capable of performing multi-purpose processing and computation tasks of analog signals up to 92 giga samples per second at CMOS-compatible voltages. We demonstrate high-speed analog computation, i.e., first- and second-order temporal integration and differentiation with computing accuracies up to 98.1 %, and deploy these functions to showcase three proof-of-concept applications, namely, ordinary differential equation solving, ultra-wideband signal generation and high-speed edge detection of images. We further leverage the image edge detector to enable a photonic-assisted image segmentation model that could effectively outline the boundaries of melanoma lesion in medical diagnostic images, achieving orders of magnitude faster processing speed and lower power consumption than conventional electronic processors.

preprint2024arXiv

Spectral engineering of optical microresonators in anisotropic lithium niobate crystal

On-chip optical microresonators are essential building blocks in integrated optics. The ability to arbitrarily engineer their resonant frequencies is crucial for exploring novel physics in synthetic frequency dimensions and practical applications like nonlinear optical parametric processes and dispersion-engineered frequency comb generation. Photonic crystal ring (PhCR) resonators are a versatile tool for such arbitrary frequency engineering, by controllably creating mode splitting at selected resonances. To date, these PhCRs have mostly been demonstrated in isotropic photonic materials, while such engineering could be significantly more complicated in anisotropic platforms that often offer more fruitful optical properties. Here, we realize the spectral engineering of chip-scale optical microresonators in the anisotropic lithium niobate (LN) crystal by a gradient design that precisely compensates for variations in both refractive index and perturbation strength. We experimentally demonstrate controllable frequency splitting at single and multiple selected resonances in LN PhCR resonators with different sizes, while maintaining high Q-factors up to 1 million. Moreover, we experimentally construct a sharp boundary in the synthetic frequency dimension based on an actively modulated x-cut LN gradient-PhCR, opening up new paths toward the arbitrary control of electro-optic comb spectral shapes and exploration of novel physics in the frequency degree of freedom.

preprint2023arXiv

CameraPose: Weakly-Supervised Monocular 3D Human Pose Estimation by Leveraging In-the-wild 2D Annotations

To improve the generalization of 3D human pose estimators, many existing deep learning based models focus on adding different augmentations to training poses. However, data augmentation techniques are limited to the "seen" pose combinations and hard to infer poses with rare "unseen" joint positions. To address this problem, we present CameraPose, a weakly-supervised framework for 3D human pose estimation from a single image, which can not only be applied on 2D-3D pose pairs but also on 2D alone annotations. By adding a camera parameter branch, any in-the-wild 2D annotations can be fed into our pipeline to boost the training diversity and the 3D poses can be implicitly learned by reprojecting back to 2D. Moreover, CameraPose introduces a refinement network module with confidence-guided loss to further improve the quality of noisy 2D keypoints extracted by 2D pose estimators. Experimental results demonstrate that the CameraPose brings in clear improvements on cross-scenario datasets. Notably, it outperforms the baseline method by 3mm on the most challenging dataset 3DPW. In addition, by combining our proposed refinement network module with existing 3D pose estimators, their performance can be improved in cross-scenario evaluation.

preprint2023arXiv

ZScribbleSeg: Zen and the Art of Scribble Supervised Medical Image Segmentation

Curating a large scale fully-annotated dataset can be both labour-intensive and expertise-demanding, especially for medical images. To alleviate this problem, we propose to utilize solely scribble annotations for weakly supervised segmentation. Existing solutions mainly leverage selective losses computed solely on annotated areas and generate pseudo gold standard segmentation by propagating labels to adjacent areas. However, these methods could suffer from the inaccurate and sometimes unrealistic pseudo segmentation due to the insufficient supervision and incomplete shape features. Different from previous efforts, we first investigate the principle of ''good scribble annotations'', which leads to efficient scribble forms via supervision maximization and randomness simulation. Furthermore, we introduce regularization terms to encode the spatial relationship and shape prior, where a new formulation is developed to estimate the mixture ratios of label classes. These ratios are critical in identifying the unlabeled pixels for each class and correcting erroneous predictions, thus the accurate estimation lays the foundation for the incorporation of spatial prior. Finally, we integrate the efficient scribble supervision with the prior into a unified framework, denoted as ZScribbleSeg, and apply the method to multiple scenarios. Leveraging only scribble annotations, ZScribbleSeg set new state-of-the-arts on four segmentation tasks using ACDC, MSCMRseg, MyoPS and PPSS datasets.

preprint2022arXiv

A power-efficient integrated lithium niobate electro-optic comb generator

Integrated electro-optic (EO) frequency combs are essential components for future applications in optical communications, light detection and ranging, optical computation, sensing and spectroscopy. To date, broadband on-chip EO combs are typically generated in high-quality-factor micro-resonators, while the more straightforward and flexible non-resonant method, usually using single or cascaded EO phase modulators, often requires high driving power to realize a reasonably strong modulation index. Here, we show that the phase modulation efficiency of an integrated lithium niobate modulator could be dramatically enhanced by passing optical signals through the modulation electrodes for a total of 4 round trips, via multiple low-loss TE0/TE1 mode multiplexers and waveguide crossings, reducing electrical power consumption by more than one order of magnitude. Using devices fabricated from a wafer-scale stepper lithography process, we demonstrate a broadband optical frequency comb featuring 47 comb lines at a 25-GHz repetition rate, using a moderate RF driving power of 28 dBm (0.63 W). Leveraging the excellent tunability in repetition rate and operation wavelength, our power-efficient EO comb generator could serve as a compact low-cost solution for future high-speed data transmission, sensing and spectroscopy, as well as classical and quantum optical computation systems.

preprint2022arXiv

CEKD:Cross Ensemble Knowledge Distillation for Augmented Fine-grained Data

Data augmentation has been proved effective in training deep models. Existing data augmentation methods tackle the fine-grained problem by blending image pairs and fusing corresponding labels according to the statistics of mixed pixels, which produces additional noise harmful to the performance of networks. Motivated by this, we present a simple yet effective cross ensemble knowledge distillation (CEKD) model for fine-grained feature learning. We innovatively propose a cross distillation module to provide additional supervision to alleviate the noise problem, and propose a collaborative ensemble module to overcome the target conflict problem. The proposed model can be trained in an end-to-end manner, and only requires image-level label supervision. Extensive experiments on widely used fine-grained benchmarks demonstrate the effectiveness of our proposed model. Specifically, with the backbone of ResNet-101, CEKD obtains the accuracy of 89.59%, 95.96% and 94.56% in three datasets respectively, outperforming state-of-the-art API-Net by 0.99%, 1.06% and 1.16%.

preprint2022arXiv

CycleMix: A Holistic Strategy for Medical Image Segmentation from Scribble Supervision

Curating a large set of fully annotated training data can be costly, especially for the tasks of medical image segmentation. Scribble, a weaker form of annotation, is more obtainable in practice, but training segmentation models from limited supervision of scribbles is still challenging. To address the difficulties, we propose a new framework for scribble learning-based medical image segmentation, which is composed of mix augmentation and cycle consistency and thus is referred to as CycleMix. For augmentation of supervision, CycleMix adopts the mixup strategy with a dedicated design of random occlusion, to perform increments and decrements of scribbles. For regularization of supervision, CycleMix intensifies the training objective with consistency losses to penalize inconsistent segmentation, which results in significant improvement of segmentation performance. Results on two open datasets, i.e., ACDC and MSCMRseg, showed that the proposed method achieved exhilarating performance, demonstrating comparable or even better accuracy than the fully-supervised methods. The code and expert-made scribble annotations for MSCMRseg are publicly available at https://github.com/BWGZK/CycleMix.

preprint2022arXiv

Deep Compatible Learning for Partially-Supervised Medical Image Segmentation

Partially-supervised learning can be challenging for segmentation due to the lack of supervision for unlabeled structures, and the methods directly applying fully-supervised learning could lead to incompatibility, meaning ground truth is not in the solution set of the optimization problem given the loss function. To address the challenge, we propose a deep compatible learning (DCL) framework, which trains a single multi-label segmentation network using images with only partial structures annotated. We first formulate the partially-supervised segmentation as an optimization problem compatible with missing labels, and prove its compatibility. Then, we equip the model with a conditional segmentation strategy, to propagate labels from multiple partially-annotated images to the target. Additionally, we propose a dual learning strategy, which learns two opposite mappings of label propagation simultaneously, to provide substantial supervision for unlabeled structures. The two strategies are formulated into compatible forms, termed as conditional compatibility and dual compatibility, respectively. We show this framework is generally applicable for conventional loss functions. The approach attains significant performance improvement over existing methods, especially in the situation where only a small training dataset is available. Results on three segmentation tasks have shown that the proposed framework could achieve performance matching fully-supervised models.

preprint2022arXiv

InsCon:Instance Consistency Feature Representation via Self-Supervised Learning

Feature representation via self-supervised learning has reached remarkable success in image-level contrastive learning, which brings impressive performances on image classification tasks. While image-level feature representation mainly focuses on contrastive learning in single instance, it ignores the objective differences between pretext and downstream prediction tasks such as object detection and instance segmentation. In order to fully unleash the power of feature representation on downstream prediction tasks, we propose a new end-to-end self-supervised framework called InsCon, which is devoted to capturing multi-instance information and extracting cell-instance features for object recognition and localization. On the one hand, InsCon builds a targeted learning paradigm that applies multi-instance images as input, aligning the learned feature between corresponding instance views, which makes it more appropriate for multi-instance recognition tasks. On the other hand, InsCon introduces the pull and push of cell-instance, which utilizes cell consistency to enhance fine-grained feature representation for precise boundary localization. As a result, InsCon learns multi-instance consistency on semantic feature representation and cell-instance consistency on spatial feature representation. Experiments demonstrate the method we proposed surpasses MoCo v2 by 1.1% AP^{bb} on COCO object detection and 1.0% AP^{mk} on COCO instance segmentation using Mask R-CNN R50-FPN network structure with 90k iterations, 2.1% APbb on PASCAL VOC objection detection using Faster R-CNN R50-C4 network structure with 24k iterations.

preprint2022arXiv

Mapping the Design Space of Human-AI Interaction in Text Summarization

Automatic text summarization systems commonly involve humans for preparing data or evaluating model performance, yet, there lacks a systematic understanding of humans' roles, experience, and needs when interacting with or being assisted by AI. From a human-centered perspective, we map the design opportunities and considerations for human-AI interaction in text summarization and broader text generation tasks. We first conducted a systematic literature review of 70 papers, developing a taxonomy of five interactions in AI-assisted text generation and relevant design dimensions. We designed text summarization prototypes for each interaction. We then interviewed 16 users, aided by the prototypes, to understand their expectations, experience, and needs regarding efficiency, control, and trust with AI in text summarization and propose design considerations accordingly.

preprint2022arXiv

Molecules with ALMA at Planet-forming Scales (MAPS) III: Characteristics of Radial Chemical Substructures

The Molecules with ALMA at Planet-forming Scales (MAPS) Large Program provides a detailed, high resolution (${\sim}$10-20 au) view of molecular line emission in five protoplanetary disks at spatial scales relevant for planet formation. Here, we present a systematic analysis of chemical substructures in 18 molecular lines toward the MAPS sources: IM Lup, GM Aur, AS 209, HD 163296, and MWC 480. We identify more than 200 chemical substructures, which are found at nearly all radii where line emission is detected. A wide diversity of radial morphologies - including rings, gaps, and plateaus - is observed both within each disk and across the MAPS sample. This diversity in line emission profiles is also present in the innermost 50 au. Overall, this suggests that planets form in varied chemical environments both across disks and at different radii within the same disk. Interior to 150 au, the majority of chemical substructures across the MAPS disks are spatially coincident with substructures in the millimeter continuum, indicative of physical and chemical links between the disk midplane and warm, elevated molecular emission layers. Some chemical substructures in the inner disk and most chemical substructures exterior to 150 au cannot be directly linked to dust substructure, however, which indicates that there are also other causes of chemical substructures, such as snowlines, gradients in UV photon fluxes, ionization, and radially-varying elemental ratios. This implies that chemical substructures could be developed into powerful probes of different disk characteristics, in addition to influencing the environments within which planets assemble. This paper is part of the MAPS special issue of the Astrophysical Journal Supplement.

preprint2022arXiv

Molecules with ALMA at Planet-forming Scales (MAPS). A Circumplanetary Disk Candidate in Molecular Line Emission in the AS 209 Disk

We report the discovery of a circumplanetary disk (CPD) candidate embedded in the circumstellar disk of the T Tauri star AS 209 at a radial distance of about 200 au (on-sky separation of 1."4 from the star at a position angle of $161^\circ$), isolated via $^{13}$CO $J=2-1$ emission. This is the first instance of CPD detection via gaseous emission capable of tracing the overall CPD mass. The CPD is spatially unresolved with a $117\times82$ mas beam and manifests as a point source in $^{13}$CO, indicating that its diameter is $\lesssim14$ au. The CPD is embedded within an annular gap in the circumstellar disk previously identified using $^{12}$CO and near-infrared scattered light observations, and is associated with localized velocity perturbations in $^{12}$CO. The coincidence of these features suggests that they have a common origin: an embedded giant planet. We use the $^{13}$CO intensity to constrain the CPD gas temperature and mass. We find that the CPD temperature is $\gtrsim35$ K, higher than the circumstellar disk temperature at the radial location of the CPD, 22 K, suggesting that heating sources localized to the CPD must be present. The CPD gas mass is $\gtrsim 0.095 M_{\rm Jup} \simeq 30 M_{\rm Earth}$ adopting a standard $^{13}$CO abundance. From the non-detection of millimeter continuum emission at the location of the CPD ($3σ$ flux density $\lesssim26.4~μ$Jy), we infer that the CPD dust mass is $\lesssim 0.027 M_{\rm Earth} \simeq 2.2$ lunar masses, indicating a low dust-to-gas mass ratio of $\lesssim9\times10^{-4}$. We discuss the formation mechanism of the CPD-hosting giant planet on a wide orbit in the framework of gravitational instability and pebble accretion.

preprint2022arXiv

New Constraints on Protoplanetary Disk Gas Masses in Lupus

Gas mass is a fundamental quantity of protoplanetary disks that directly relates to their ability to form planets. Because we are unable to observe the bulk H$_2$ content of disks directly, we rely on indirect tracers to provide quantitative mass estimates. Current estimates for the gas masses of the observed disk population in the Lupus star-forming region are based on measurements of isotopologues of CO. However, without additional constraints, the degeneracy between H$_2$ mass and the elemental composition of the gas leads to large uncertainties in such estimates. Here we explore the gas compositions of seven disks from the Lupus sample representing a range of CO-to-dust ratios. With Band 6 and 7 ALMA observations, we measure line emission for HCO$^+$, HCN, and N$_2$H$^+$. We find a tentative correlation among the line fluxes for these three molecular species across the sample, but no correlation with $^{13}$CO or sub-mm continuum fluxes. For the three disks where N$_2$H$^+$ is detected, we find that a combination of high disk gas masses and sub-interstellar C/H and O/H are needed to reproduce the observed values. We find increases of $\sim$10-100$\times$ previous mass estimates are required to match the observed line fluxes. This study highlights how multi-molecular studies are essential for constraining the physical and chemical properties of the gas in populations of protoplanetary disks and that CO isotopologues alone are not sufficient for determining the mass of many observed disks.

preprint2022arXiv

Parallel Network with Channel Attention and Post-Processing for Carotid Arteries Vulnerable Plaque Segmentation in Ultrasound Images

Carotid arteries vulnerable plaques are a crucial factor in the screening of atherosclerosis by ultrasound technique. However, the plaques are contaminated by various noises such as artifact, speckle noise, and manual segmentation may be time-consuming. This paper proposes an automatic convolutional neural network (CNN) method for plaque segmentation in carotid ultrasound images using a small dataset. First, a parallel network with three independent scale decoders is utilized as our base segmentation network, pyramid dilation convolutions are used to enlarge receptive fields in the three segmentation sub-networks. Subsequently, the three decoders are merged to be rectified in channels by SENet. Thirdly, in test stage, the initially segmented plaque is refined by the max contour morphology post-processing to obtain the final plaque. Moreover, three loss function Dice loss, SSIM loss and cross-entropy loss are compared to segment plaques. Test results show that the proposed method with dice loss function yields a Dice value of 0.820, an IoU of 0.701, Acc of 0.969, and modified Hausdorff distance (MHD) of 1.43 for 30 vulnerable cases of plaques, it outperforms some of the conventional CNN-based methods on these metrics. Additionally, we apply an ablation experiment to show the validity of each proposed module. Our study provides some reference for similar researches and may be useful in actual applications for plaque segmentation of ultrasound carotid arteries.

preprint2022arXiv

Parallel Spatio-Temporal Attention-Based TCN for Multivariate Time Series Prediction

As industrial systems become more complex and monitoring sensors for everything from surveillance to our health become more ubiquitous, multivariate time series prediction is taking an important place in the smooth-running of our society. A recurrent neural network with attention to help extend the prediction windows is the current-state-of-the-art for this task. However, we argue that their vanishing gradients, short memories, and serial architecture make RNNs fundamentally unsuited to long-horizon forecasting with complex data. Temporal convolutional networks (TCNs) do not suffer from gradient problems and they support parallel calculations, making them a more appropriate choice. Additionally, they have longer memories than RNNs, albeit with some instability and efficiency problems. Hence, we propose a framework, called PSTA-TCN, that combines a parallel spatio-temporal attention mechanism to extract dynamic internal correlations with stacked TCN backbones to extract features from different window sizes. The framework makes full use parallel calculations to dramatically reduce training times, while substantially increasing accuracy with stable prediction windows up to 13 times longer than the status quo.

preprint2022arXiv

Recent advances and clinical applications of deep learning in medical image analysis

Deep learning has received extensive research interest in developing new medical image processing algorithms, and deep learning based models have been remarkably successful in a variety of medical imaging tasks to support disease detection and diagnosis. Despite the success, the further improvement of deep learning models in medical image analysis is majorly bottlenecked by the lack of large-sized and well-annotated datasets. In the past five years, many studies have focused on addressing this challenge. In this paper, we reviewed and summarized these recent studies to provide a comprehensive overview of applying deep learning methods in various medical image analysis tasks. Especially, we emphasize the latest progress and contributions of state-of-the-art unsupervised and semi-supervised deep learning in medical image analysis, which are summarized based on different application scenarios, including classification, segmentation, detection, and image registration. We also discuss the major technical challenges and suggest the possible solutions in future research efforts.

preprint2022arXiv

ShapePU: A New PU Learning Framework Regularized by Global Consistency for Scribble Supervised Cardiac Segmentation

Cardiac segmentation is an essential step for the diagnosis of cardiovascular diseases. However, pixel-wise dense labeling is both costly and time-consuming. Scribble, as a form of sparse annotation, is more accessible than full annotations. However, it's particularly challenging to train a segmentation network with weak supervision from scribbles. To tackle this problem, we propose a new scribble-guided method for cardiac segmentation, based on the Positive-Unlabeled (PU) learning framework and global consistency regularization, and termed as ShapePU. To leverage unlabeled pixels via PU learning, we first present an Expectation-Maximization (EM) algorithm to estimate the proportion of each class in the unlabeled pixels. Given the estimated ratios, we then introduce the marginal probability maximization to identify the classes of unlabeled pixels. To exploit shape knowledge, we apply cutout operations to training images, and penalize the inconsistent segmentation results. Evaluated on two open datasets, i.e, ACDC and MSCMRseg, our scribble-supervised ShapePU surpassed the fully supervised approach respectively by 1.4% and 9.8% in average Dice, and outperformed the state-of-the-art weakly supervised and PU learning methods by large margins. Our code is available at https://github.com/BWGZK/ShapePU.

preprint2022arXiv

Transformers Improve Breast Cancer Diagnosis from Unregistered Multi-View Mammograms

Deep convolutional neural networks (CNNs) have been widely used in various medical imaging tasks. However, due to the intrinsic locality of convolution operation, CNNs generally cannot model long-range dependencies well, which are important for accurately identifying or mapping corresponding breast lesion features computed from unregistered multiple mammograms. This motivates us to leverage the architecture of Multi-view Vision Transformers to capture long-range relationships of multiple mammograms from the same patient in one examination. For this purpose, we employ local Transformer blocks to separately learn patch relationships within four mammograms acquired from two-view (CC/MLO) of two-side (right/left) breasts. The outputs from different views and sides are concatenated and fed into global Transformer blocks, to jointly learn patch relationships between four images representing two different views of the left and right breasts. To evaluate the proposed model, we retrospectively assembled a dataset involving 949 sets of mammograms, which include 470 malignant cases and 479 normal or benign cases. We trained and evaluated the model using a five-fold cross-validation method. Without any arduous preprocessing steps (e.g., optimal window cropping, chest wall or pectoral muscle removal, two-view image registration, etc.), our four-image (two-view-two-side) Transformer-based model achieves case classification performance with an area under ROC curve (AUC = 0.818), which significantly outperforms AUC = 0.784 achieved by the state-of-the-art multi-view CNNs (p = 0.009). It also outperforms two one-view-two-side models that achieve AUC of 0.724 (CC view) and 0.769 (MLO view), respectively. The study demonstrates the potential of using Transformers to develop high-performing computer-aided diagnosis schemes that combine four mammograms.

preprint2022arXiv

Virtual Adversarial Training for Semi-supervised Breast Mass Classification

This study aims to develop a novel computer-aided diagnosis (CAD) scheme for mammographic breast mass classification using semi-supervised learning. Although supervised deep learning has achieved huge success across various medical image analysis tasks, its success relies on large amounts of high-quality annotations, which can be challenging to acquire in practice. To overcome this limitation, we propose employing a semi-supervised method, i.e., virtual adversarial training (VAT), to leverage and learn useful information underlying in unlabeled data for better classification of breast masses. Accordingly, our VAT-based models have two types of losses, namely supervised and virtual adversarial losses. The former loss acts as in supervised classification, while the latter loss aims at enhancing model robustness against virtual adversarial perturbation, thus improving model generalizability. To evaluate the performance of our VAT-based CAD scheme, we retrospectively assembled a total of 1024 breast mass images, with equal number of benign and malignant masses. A large CNN and a small CNN were used in this investigation, and both were trained with and without the adversarial loss. When the labeled ratios were 40% and 80%, VAT-based CNNs delivered the highest classification accuracy of 0.740 and 0.760, respectively. The experimental results suggest that the VAT-based CAD scheme can effectively utilize meaningful knowledge from unlabeled data to better classify mammographic breast mass images.

preprint2021arXiv

Effect of MHD wind-driven disk evolution on the observed sizes of protoplanetary disks

It is still unclear whether the evolution of protoplanetary disks, a key ingredient in the theory of planet formation, is driven by viscous turbulence or magnetic disk winds. As viscously evolving disks expand outward over time, the evolution of disk sizes is a discriminant test for studying disk evolution. However, it is unclear how the observed disk size changes over time if disk evolution is driven by magnetic disk winds. Combining the thermochemical code DALI with the analytical wind-driven disk evolution model presented in Tabone et al. (2021a), we study the time evolution of the observed gas outer radius as measured from CO rotational emission ($R_{\rm CO, 90\%}$). The evolution of $R_{\rm CO, 90\%}$ is driven by the evolution of the disk mass, as the physical radius stays constant over time. For a constant $α_{\rm DW}$, an extension of the $α-$Shakura-Sunyaev parameter to wind-driven accretion, $R_{\rm CO, 90\%}$ decreases linearly with time. Its initial size is set by the disk mass and the characteristic radius $R_c$, but only $R_c$ affects the evolution of $R_{\rm CO, 90\%}$, with a larger $R_c$ resulting in a steeper decrease of $R_{\rm CO, 90\%}$. For a time-dependent $α_{\rm DW}$ $R_{\rm CO, 90\%}$ stays approximately constant during most of the disk lifetime until $R_{\rm CO, 90\%}$ rapidly shrinks as the disk dissipates. The constant $α_{\rm DW}$-models are able to reproduce the observed gas disk sizes in the $\sim1-3$ Lupus and $\sim5-11$ Myr old Upper Sco star-forming regions. However, they likely overpredict the gas disk size of younger $(\lessapprox0.7\ \mathrm{Myr})$ disks.

preprint2021arXiv

Efficient erbium-doped thin-film lithium niobate waveguide amplifiers

Lithium niobate on insulator (LNOI) is an emerging photonic platform with great promises for future optical communications, nonlinear optics and microwave photonics. An important integrated photonic building block, active waveguide amplifiers, however, is still missing in the LNOI platform. Here we report an efficient and compact waveguide amplifier based on erbium-doped LNOI waveguides, realized by a sequence of erbium-doped crystal growth, ion slicing and lithography-based waveguide fabrication. Using a compact 5-mm-long waveguide, we demonstrate on-chip net gain of > 5 dB for 1530-nm signal light with a relatively low pump power of 21 mW at 980 nm. The efficient LNOI waveguide amplifiers could become an important fundamental element in future lithium niobate photonic integrated circuits.

preprint2021arXiv

Probing fast oscillating scalar dark matter with atoms and molecules

Light scalar Dark Matter with scalar couplings to matter is expected within several scenarios to induce variations in the fundamental constants of nature. Such variations can be searched for, among other ways, via atomic spectroscopy. Sensitive atomic observables arise primarily due to possible changes in the fine-structure constant or the electron mass. Most of the searches to date have focused on slow variations of the constants (i.e. modulation frequencies $<$ 1 Hz). In a recent experiment \mbox{[Phys. Rev. Lett. 123, 141102 (2019)]} called WReSL (Weekend Relaxion-Search Laboratory), we reported on a direct search for rapid variations in the radio-frequency band. Such a search is particularly motivated within a class of relaxion Dark Matter models. We discuss the WReSL experiment, report on progress towards improved measurements of rapid fundamental constant variations, and discuss the planned extension of the work to molecules, in which rapid variations of the nuclear mass can be sensitively searched for.

preprint2021arXiv

Synthesizing five-body interaction in a superconducting quantum circuit

Synthesizing many-body interaction Hamiltonian is a central task in quantum simulation. However, it is challenging to synthesize interactions including more than two spins. Borrowing tools from quantum optics, we synthesize five-body spin-exchange interaction in a superconducting quantum circuit by simultaneously exciting four independent qubits with time-energy correlated photon quadruples generated from a qudit. During the dynamic evolution of the five-body interaction, a Greenberger-Horne-Zeilinger state is generated in a single step with fidelity estimated to be $0.685$. We compare the influence of noise on the three-, four- and five-body interaction as a step toward answering the question on the quantum origin of chiral molecules. We also demonstrate a many-body Mach-Zehnder interferometer which potentially has a Heisenberg-limit sensitivity. This study paves a way for quantum simulation involving many-body interactions and high excited states of quantum circuits.

preprint2020arXiv

A novel method for constructing high accurate and robust WENO-Z type scheme

A novel method for constructing robust and high-order accurate weighted essentially non-oscillatory (WENO) scheme is proposed in this paper. The method is mainly based on the WENO-Z type scheme, in which, an eighth-order global smoothness indicator (the square of the approximation of the fourth-order derivative on the five-point stencil used by the fifth-order WENO scheme) is used, and in order to keep the ENO property and robustness, the constant 1 used to calculate the un-normalized weights is replaced by a function of local smoothness indicators of candidate sub-stencils. This function is designed to have following adaptive property: if the five-point stencil contains a discontinuity, then the function approaches to a small value, otherwise, it approaches to a large value. Analysis and numerical results show that the resulted WENO-Z type (WENO-ZN) scheme is robust for capturing shock waves and, in smooth regions, achieves fifth-order accuracy at first-order critical point and fourth-order accuracy at second-order critical point.

preprint2020arXiv

CO Depletion in Protoplanetary Disks: A Unified Picture Combining Physical Sequestration and Chemical Processing

The gas-phase CO abundance (relative to hydrogen) in protoplanetary disks decreases by up to 2 orders of magnitude from its ISM value ${\sim}10^{-4}$, even after accounting for freeze-out and photo-dissociation. Previous studies have shown that while local chemical processing of CO and the sequestration of CO ice on solids in the midplane can both contribute, neither of these processes appears capable of consistently reaching the observed depletion factors on the relevant timescale of $1{-}3\mathrm{~Myr}$. In this study, we model these processes simultaneously by including a compact chemical network (centered on carbon and oxygen) to 2D ($r+z$) simulations of the outer ($r>20\mathrm{~au}$) disk regions that include turbulent diffusion, pebble formation, and pebble dynamics. In general, we find that the CO/H$_2$ abundance is a complex function of time and location. Focusing on CO in the warm molecular layer, we find that only the most complete model (with chemistry and pebble evolution included) can reach depletion factors consistent with observations. In the absence of pressure traps, highly-efficient planetesimal formation, or high cosmic ray ionization rates, this model also predicts a resurgence of CO vapor interior to the CO snowline. We show the impact of physical and chemical processes on the elemental (C/O) and (C/H) ratios (in the gas and ice phases), discuss the use of CO as a disk mass tracer, and, finally, connect our predicted pebble ice compositions to those of pristine planetesimals as found in the Cold Classical Kuiper Belt and debris disks.

preprint2020arXiv

Detection Method Based on Automatic Visual Shape Clustering for Pin-Missing Defect in Transmission Lines

Bolts are the most numerous fasteners in transmission lines and are prone to losing their split pins. How to realize the automatic pin-missing defect detection for bolts in transmission lines so as to achieve timely and efficient trouble shooting is a difficult problem and the long-term research target of power systems. In this paper, an automatic detection model called Automatic Visual Shape Clustering Network (AVSCNet) for pin-missing defect is constructed. Firstly, an unsupervised clustering method for the visual shapes of bolts is proposed and applied to construct a defect detection model which can learn the difference of visual shape. Next, three deep convolutional neural network optimization methods are used in the model: the feature enhancement, feature fusion and region feature extraction. The defect detection results are obtained by applying the regression calculation and classification to the regional features. In this paper, the object detection model of different networks is used to test the dataset of pin-missing defect constructed by the aerial images of transmission lines from multiple locations, and it is evaluated by various indicators and is fully verified. The results show that our method can achieve considerably satisfactory detection effect.

preprint2020arXiv

Detection of missing low-lying atomic states in actinium

Two lowest-energy odd-parity atomic levels of actinium, 7s^27p 2P^o_1/2, 7s^27p 2P^o_3/2, were observed via two-step resonant laser-ionization spectroscopy and their respective energies were measured to be 7477.36(4) cm^-1 and 12 276.59(2) cm^-1. The lifetimes of these states were determined as 668(11) ns and 255(7) ns, respectively. In addition, these properties were calculated using a hybrid approach that combines configuration interaction and coupled-cluster methods in good agreement. The data are of relevance for understanding the complex atomic spectra of actinides and for developing efficient laser-cooling and ionization schemes for actinium, with possible applications for high-purity medicalisotope production and future fundamental physics experiments with this atom.

preprint2020arXiv

Excess C/H in Protoplanetary Disk Gas from Icy Pebble Drift across the CO Snowline

The atmospheric composition of giant planets carries the information of their formation history. Superstellar C/H ratios are seen in atmospheres of Jupiter, Saturn, and various giant exoplanets. Also, giant exoplanets show a wide range of C/O ratio. To explain these ratios, one hypothesis is that protoplanets accrete carbon-enriched gas when a large number of icy pebbles drift across the CO snowline. Here we report the first direct evidence of an elevated C/H ratio in disk gas. We use two thermo-chemical codes to model the $^{13}$C$^{18}$O, C$^{17}$O, and C$^{18}$O (2-1) line spectra of the HD 163296 disk. We show that the gas inside the CO snowline ($\sim$70 au) has a C/H ratio of 1-2 times higher than the stellar value. This ratio exceeds the expected value substantially, as only 25-60% of the carbon should be in gas at these radii. Although we cannot rule out the case of a normal C/H ratio inside 70 au, the most probable solution is an elevated C/H ratio of 2-8 times higher than the expectation. Our model also shows that the gas outside 70 au has a C/H ratio of 0.1$\times$ the stellar value. This picture of enriched C/H gas at the inner region and depleted gas at the outer region is consistent with numerical simulations of icy pebble growth and drift in protoplanetary disks. Our results demonstrate that the large-scale drift of icy pebble can occur in disks and may significantly change the disk gas composition for planet formation.

preprint2020arXiv

Half-Magnetic Topological Insulator

Topological magnets are a new family of quantum materials providing great potential to realize emergent phenomena, such as quantum anomalous Hall effect and axion-insulator state. Here we present our discovery that stoichiometric ferromagnet MnBi8Te13 with natural heterostructure MnBi2Te4-(Bi2Te3)3 is an unprecedented half-magnetic topological insulator, with the magnetization existing at the MnBi2Te4 surface but not at the opposite surface terminated by triple Bi2Te3 layers. Our angle-resolved photoemission spectroscopy measurements unveil a massive Dirac gap at the MnBi2Te4 surface, and gapless Dirac cone on the other side. Remarkably, the Dirac gap (~28 meV) at MnBi2Te4 surface decreases monotonically with increasing temperature and closes right at the Curie temperature, thereby representing the first smoking-gun spectroscopic evidence of magnetization-induced topological surface gap among all known magnetic topological materials. We further demonstrate theoretically that the half-magnetic topological insulator is desirable to realize the half-quantized surface anomalous Hall effect, which serves as a direct proof of the general concept of axion electrodynamics in condensed matter systems.

preprint2020arXiv

Hints of a Population of Solar System Analog Planets from ALMA

The recent ALMA DSHARP survey provided illuminating results on the diversity of substructures in planet forming disks. These substructures trace pebble-sized grains accumulated at local pressure maxima, possibly due to planet-disk interactions or other planet formation processes. DSHARP sources are heavily biased to large and massive disks that only represent the high (dust flux) tail end of the disk population. Thus it is unclear whether similar substructures and corresponding physical processes also occur in the majority of disks which are fainter and more compact. Here we explore the presence and characteristics of features in a compact disk around GQ Lup A, the effective radius of which is 1.5 to 10 times smaller than those of DSHARP disks. We present our analysis of ALMA 1.3mm continuum observations of the GQ Lup system. By fitting visibility profiles of the continuum emission, we find substructures including a gap at ~ 10 au. The compact disk around GQ Lup exhibits similar substructures to those in the DSHARP sample, suggesting that mechanisms of trapping pebble-sized grains are at work in small disks as well. Characteristics of the feature at ~ 10 au, if due to a hidden planet, are evidence of planet formation at Saturnian distances. Our results hint at a rich world of substructures to be identified within the common population of compact disks, and subsequently a population of solar system analogs within these disks. Such study is critical to understanding the formation mechanisms and planet populations in the majority of protoplanetary disks.

preprint2020arXiv

Machine Learning at the Edge: A Data-Driven Architecture with Applications to 5G Cellular Networks

The fifth generation of cellular networks (5G) will rely on edge cloud deployments to satisfy the ultra-low latency demand of future applications. In this paper, we argue that such deployments can also be used to enable advanced data-driven and Machine Learning (ML) applications in mobile networks. We propose an edge-controller-based architecture for cellular networks and evaluate its performance with real data from hundreds of base stations of a major U.S. operator. In this regard, we will provide insights on how to dynamically cluster and associate base stations and controllers, according to the global mobility patterns of the users. Then, we will describe how the controllers can be used to run ML algorithms to predict the number of users in each base station, and a use case in which these predictions are exploited by a higher-layer application to route vehicular traffic according to network Key Performance Indicators (KPIs). We show that the prediction accuracy improves when based on machine learning algorithms that rely on the controllers&#39; view and, consequently, on the spatial correlation introduced by the user mobility, with respect to when the prediction is based only on the local data of each single base station.

preprint2020arXiv

MU-GAN: Facial Attribute Editing based on Multi-attention Mechanism

Facial attribute editing has mainly two objectives: 1) translating image from a source domain to a target one, and 2) only changing the facial regions related to a target attribute and preserving the attribute-excluding details. In this work, we propose a Multi-attention U-Net-based Generative Adversarial Network (MU-GAN). First, we replace a classic convolutional encoder-decoder with a symmetric U-Net-like structure in a generator, and then apply an additive attention mechanism to build attention-based U-Net connections for adaptively transferring encoder representations to complement a decoder with attribute-excluding detail and enhance attribute editing ability. Second, a self-attention mechanism is incorporated into convolutional layers for modeling long-range and multi-level dependencies across image regions. experimental results indicate that our method is capable of balancing attribute editing ability and details preservation ability, and can decouple the correlation among attributes. It outperforms the state-of-the-art methods in terms of attribute manipulation accuracy and image quality.

preprint2020arXiv

Rapid Evolution of Volatile CO from the Protostellar Disk Stage to the Protoplanetary Disk Stage

Recent observations show that the CO gas abundance, relative to H$_2$, in many 1-10 Myr old protoplanetary disks may be heavily depleted, by a factor of 10-100 compared to the canonical interstellar medium value of 10$^{-4}$. When and how this depletion happens can significantly affect compositions of planetesimals and atmospheres of giant planets. It is therefore important to constrain if the depletion occurs already at the earliest protostellar disk stage. Here we present spatially resolved observations of C$^{18}$O, C$^{17}$O, and $^{13}$C$^{18}$O $J$=2-1 lines in three protostellar disks. We show that the C$^{18}$O line emits from both the disk and the inner envelope, while C$^{17}$O and $^{13}$C$^{18}$O lines are consistent with a disk origin. The line ratios indicate that both C$^{18}$O and C$^{17}$O lines are optically thick in the disk region, and only $^{13}$C$^{18}$O line is optically thin. The line profiles of the $^{13}$C$^{18}$O emissions are best reproduced by Keplerian gaseous disks at similar sizes as their mm-continuum emissions, suggesting small radial separations between the gas and mm-sized grains in these disks, in contrast to the large separation commonly seen in protoplanetary disks. Assuming a gas-to-dust ratio of 100, we find that the CO gas abundances in these protostellar disks are consistent with the ISM abundance within a factor of 2, nearly one order of magnitude higher than the average value of 1-10 Myr old disks. These results suggest that there is a fast, $\sim$1 Myr, evolution of the abundance of CO gas from the protostellar disk stage to the protoplanetary disk stage.

preprint2020arXiv

The TW Hya Rosetta Stone Project III: Resolving the Gaseous Thermal Profile of the Disk

The thermal structure of protoplanetary disks is a fundamental characteristic of the system that has wide reaching effects on disk evolution and planet formation. In this study, we constrain the 2D thermal structure of the protoplanetary disk TW Hya structure utilizing images of seven CO lines. This includes new ALMA observations of 12CO J=2-1 and C18O J=2-1 as well as archival ALMA observations of 12CO J=3-2, 13CO J=3-2, 6-5, C18O J= 3-2, 6-5. Additionally, we reproduce a Herschel observation of the HD J=1-0 line flux, the spectral energy distribution, and utilize a recent quantification of CO radial depletion in TW Hya. These observations were modeled using the thermochemical code RAC2D, and our best fit model reproduces all spatially resolved CO surface brightness profiles. The resulting thermal profile finds a disk mass of 0.025 Msun and a thin upper layer of gas depleted of small dust with a thickness of approx 1.2% of the corresponding radius. Using our final thermal structure, we find that CO alone is not a viable mass tracer as its abundance is degenerate with the total H2 surface density. Different mass models can readily match the spatially resolved CO line profiles with disparate abundance assumptions. Mass determination requires additional knowledge and, in this work, HD provides the additional constraint to derive the gas mass and supports the inference of CO depletion in the TW Hya disk. Our final thermal structure confirms the use of HD as a powerful probe of protoplanetary disk mass. Additionally, the method laid out in this paper is an employable strategy for extraction of disk temperatures and masses in the future.

preprint2020arXiv

Visualizing the Kinematics of Planet Formation

A stunning range of substructures in the dust of protoplanetary disks is routinely observed across a range of wavelengths. These gaps, rings and spirals are highly indicative of a population of unseen planets, hinting at the possibility of current observational facilities being able to capture planet-formation in action. Over the last decade, our understanding of the influence of a young planet on the dynamical structure of its parental disk has progressed significantly, revealing a host of potentially observable features which would betray the presence of a deeply embedded planet. In concert, recent observations have shown that subtle perturbations in the kinematic structure of protoplanetary disks are found in multiple sources, potentially the characteristic disturbances associated with embedded planets. In this work, we review the theoretical background of planet-disk interactions, focusing on the kinematical features, and the current methodologies used to observe these interactions in spatially and spectrally resolved observations. We discuss the potential pit falls of such kinematical detections of planets, providing best-practices for imaging and analysing interferometric data, along with a set of criteria to use as a benchmark for any claimed detection of embedded planets. We finish with a discussion on the current state of simulations in regard to planet-disk interactions, highlighting areas of particular interest and future directions which will provide the most significant impact in our search for embedded planets. This work is the culmination of the &#39;Visualizing the Kinematics of Planet Formation&#39; workshop, held in October 2019 at the Center for Computational Astrophysics at the Flatiron Institute in New York City.

preprint2019arXiv

DeepAuto: A Hierarchical Deep Learning Framework for Real-Time Prediction in Cellular Networks

Accurate real-time forecasting of key performance indicators (KPIs) is an essential requirement for various LTE/5G radio access network (RAN) automation. However, an accurate prediction can be very challenging in large-scale cellular environments due to complex spatio-temporal dynamics, network configuration changes and unavailability of real-time network data. In this work, we introduce a reusable analytics framework that enables real-time KPI prediction using a hierarchical deep learning architecture. Our prediction approach, namely DeepAuto, stacks multiple long short-term memory (LSTM) networks horizontally to capture instantaneous, periodic and seasonal patterns in KPI time-series. It further merge with feed-forward networks to learn the impact of network configurations and other external factors. We validate the approach by predicting two important KPIs, including cell load and radio channel quality, using large-scale real network streaming measurement data from the operator. For cell load prediction, DeepAuto model showed up to 15% improvement in Root Mean Square Error (RMSE) compared to naive method of using recent measurements for short-term horizon and up to 32% improvement for longer-term prediction.

preprint2019arXiv

Nanoconfined, dynamic electrolyte gating and memory effects in multilayered graphene-based membranes

Multilayered graphene-based nanoporous membranes with electrolyte incorporated between individual sheets is a unique nano-heterostructure system in which nanoconfined electrons in graphene and ions confined in between sheets are intimately coupled throughout the entire membrane. In contrast to the general notion that the electrolyte gating is unlikely to appear in multilayered graphene stacks, it is demonstrated in this work that the electrolyte gating effect in monolayer graphene can be transferred to its corresponding multilayered porous membranes. This gating effect presented on each individual graphene sheets through electrolyte confined in nanopores provides a real-time, electrical approach for probing the complex dynamics of nanoconfined electrical double layer. This has enabled the observation of the ionic memory effect in supercapacitors and produces new insights into the charging dynamics of supercapacitors. Such discoveries may stimulate the design of novel nanoionic devices.

preprint2019arXiv

Unsupervised Detection of Sub-events in Large Scale Disasters

Social media plays a major role during and after major natural disasters (e.g., hurricanes, large-scale fires, etc.), as people ``on the ground&#39;&#39; post useful information on what is actually happening. Given the large amounts of posts, a major challenge is identifying the information that is useful and actionable. Emergency responders are largely interested in finding out what events are taking place so they can properly plan and deploy resources. In this paper we address the problem of automatically identifying important sub-events (within a large-scale emergency ``event&#39;&#39;, such as a hurricane). In particular, we present a novel, unsupervised learning framework to detect sub-events in Tweets for retrospective crisis analysis. We first extract noun-verb pairs and phrases from raw tweets as sub-event candidates. Then, we learn a semantic embedding of extracted noun-verb pairs and phrases, and rank them against a crisis-specific ontology. We filter out noisy and irrelevant information then cluster the noun-verb pairs and phrases so that the top-ranked ones describe the most important sub-events. Through quantitative experiments on two large crisis data sets (Hurricane Harvey and the 2015 Nepal Earthquake), we demonstrate the effectiveness of our approach over the state-of-the-art. Our qualitative evaluation shows better performance compared to our baseline.

preprint2018arXiv

Diffusion limit for a slow-fast standard map

Consider the map $(x, y) \mapsto (x + ε^{-α} \sin (2πx) + ε^{-1-α}z, z + ε\sin(2πx))$, which is conjugate to the Chirikov standard map with a large parameter. The parameter value $α= 1$ is related to &#34;scattering by resonance&#34; phenomena. For suitable $α$, we obtain a central limit theorem for the slow variable $z$ for a (Lebesgue) random initial condition. The result is proved by conjugating to the Chirikov standard map and utilizing the formalism of standard pairs. Our techniques also yield for the Chirikov standard map a related limit theorem and a &#34;finite-time&#34; decay of correlations result.