Source author record

Hao Yang

Hao Yang appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Catalog footprint

What is connected

73works

33topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

Judge, Then Drive: A Critic-Centric Vision Language Action Framework for Autonomous Driving

Recent advances in vision language action (VLA) models have shown remarkable potential for autonomous driving by directly mapping multimodal inputs to control signals. However, previous VLA-based methods have not explicitly exploited the critic capability of VLAs to refine driving decisions, even though such capability has been well demonstrated in other LLM-based domains, thereby limiting their performance in complex closed-loop scenarios. In this work, we present a theoretically inspired two-stage framework, CriticVLA, which extends the role of VLAs from acting to judging. CriticVLA first generates a rough trajectory and then refines it through multimodal evaluation and single-step optimization guided by a VLA-based critic, yielding higher-quality driving behaviors. To support this process, we construct a large-scale synthetic dataset of 12.9 million annotated trajectories covering diverse driving scenarios, which enhances the critic's reasoning and refinement abilities. Extensive closed-loop experiments on the Bench2Drive benchmark show that CriticVLA significantly surpasses state-of-the-art baselines, achieving a 73.33% total success rate and delivering about 30% improvement in challenging scenarios.

preprint2026arXiv

ViSRA: A Video-based Spatial Reasoning Agent for Multi-modal Large Language Models

Recent advances in Multi-modal Large Language Models (MLLMs) target 3D spatial intelligence, yet the progress has been largely driven by post-training on curated benchmarks, leaving the inference-time approach relatively underexplored. In this paper, we take a training-free perspective and introduce ViSRA, a human-aligned Video-based Spatial Reasoning Agent, as a framework to probe the spatial reasoning mechanism of MLLMs. ViSRA elicits spatial reasoning in a modular and extensible manner by leveraging explicit spatial information from expert models, enabling a plug-and-play flexible paradigm. ViSRA offers two key advantages: (1) human-aligned and transferable 3D understanding rather than task-specific overfitting; and (2) no post-training computational cost along with heavy manual curation of spatial reasoning datasets. Experimental results demonstrate consistent improvement across a set of MLLMs on both existing benchmarks and unseen 3D spatial reasoning tasks, with ViSRA outperforming baselines by up to a 15.6% and 28.9% absolute margin respectively.

preprint2024arXiv

MLIP: Medical Language-Image Pre-training with Masked Local Representation Learning

Existing contrastive language-image pre-training aims to learn a joint representation by matching abundant image-text pairs. However, the number of image-text pairs in medical datasets is usually orders of magnitude smaller than that in natural datasets. Besides, medical image-text pairs often involve numerous complex fine-grained correspondences. This paper aims to enhance the data efficiency by introducing multiple-to-multiple local relationship modeling to capture denser supervisions. More specifically, we propose a Medical Language-Image Pre-training (MLIP) framework, which exploits the limited image-text medical data more efficiently through patch-sentence matching. Furthermore, we introduce a masked contrastive learning strategy with semantic integrity estimation to reduce redundancy in images while preserving the underlying semantics. Our evaluation results show that MLIP outperforms previous work in zero/few-shot classification and few-shot segmentation tasks by a large margin.

preprint2022arXiv

A GNSS Aided Initial Alignment Method for MEMS-IMU Based on Backtracking Algorithm and Backward Filtering

To obtain a high-accuracy position with SINS(Strapdown Inertial Navigation System), initial alignment needs to determine initial attitude rapidly and accurately. High-accuracy grade IMU(Inertial Measurement Uint) can obtain the initial attitude indenpendently, however, the low-accuracy grade gyroscope doesn't adapt to determine the heading angle, hence the initial attitude matrix will not be obtained. If using large misalignment angle model to estiamting heading angle, the convergence time will become much longer. For solving these two problems, a novel alignment algorithm combined backtracking algorithm and reverse navigation updating method with GNSS(Global Navigation Satellite System) aiding is proposed herein. The simulation and land vehicle test were finished to evaluate the alignment accuracy of the proposed algorithm. The horizontal misalignment is less than 2.3 arcmin and the heading misalignment is less than 10.1 arcmin in test. The proposed algorithm is a feasible and practical alignment method for low-cost IMU to obtain initial attitude in short term and large misalignment condition aided by GNSS.

preprint2022arXiv

Accessibility of SPDEs driven by pure jump noise and its applications

In this paper, we develop a new method to obtain the accessibility of stochastic partial differential equations driven by additive pure jump noise. An important novelty of this paper is to allow the driving noises to be degenerate. As an application, for the first time, we obtain the accessibility of a class of stochastic equations driven by pure jump degenerate noise, which cover 2D stochastic Navier-Stokes equations, stochastic Burgers type equations, singular stochastic p-Laplace equations, stochastic fast diffusion equations, etc. As a further application, we establish the ergodicity of singular stochastic p-Laplace equations and stochastic fast diffusion equations driven by additive pure jump noise, and we remark that the driving noises could be Levy processes with heavy tails.

preprint2022arXiv

Boosting 3D Object Detection via Object-Focused Image Fusion

3D object detection has achieved remarkable progress by taking point clouds as the only input. However, point clouds often suffer from incomplete geometric structures and the lack of semantic information, which makes detectors hard to accurately classify detected objects. In this work, we focus on how to effectively utilize object-level information from images to boost the performance of point-based 3D detector. We present DeMF, a simple yet effective method to fuse image information into point features. Given a set of point features and image feature maps, DeMF adaptively aggregates image features by taking the projected 2D location of the 3D point as reference. We evaluate our method on the challenging SUN RGB-D dataset, improving state-of-the-art results by a large margin (+2.1 mAP@0.25 and +2.3mAP@0.5). Code is available at https://github.com/haoy945/DeMF.

preprint2022arXiv

ComplETR: Reducing the cost of annotations for object detection in dense scenes with vision transformers

Annotating bounding boxes for object detection is expensive, time-consuming, and error-prone. In this work, we propose a DETR based framework called ComplETR that is designed to explicitly complete missing annotations in partially annotated dense scene datasets. This reduces the need to annotate every object instance in the scene thereby reducing annotation cost. ComplETR augments object queries in DETR decoder with patch information of objects in the image. Combined with a matching loss, it can effectively find objects that are similar to the input patch and complete the missing annotations. We show that our framework outperforms the state-of-the-art methods such as Soft Sampling and Unbiased Teacher by itself, while at the same time can be used in conjunction with these methods to further improve their performance. Our framework is also agnostic to the choice of the downstream object detectors; we show performance improvement for several popular detectors such as Faster R-CNN, Cascade R-CNN, CenterNet2, and Deformable DETR on multiple dense scene datasets.

preprint2022arXiv

Exploring Entity Interactions for Few-Shot Relation Learning (Student Abstract)

Few-shot relation learning refers to infer facts for relations with a limited number of observed triples. Existing metric-learning methods for this problem mostly neglect entity interactions within and between triples. In this paper, we explore this kind of fine-grained semantic meanings and propose our model TransAM. Specifically, we serialize reference entities and query entities into sequence and apply transformer structure with local-global attention to capture both intra- and inter-triple entity interactions. Experiments on two public benchmark datasets NELL-One and Wiki-One with 1-shot setting prove the effectiveness of TransAM.

preprint2022arXiv

Forecast-based Multi-aspect Framework for Multivariate Time-series Anomaly Detection

Today's cyber-world is vastly multivariate. Metrics collected at extreme varieties demand multivariate algorithms to properly detect anomalies. However, forecast-based algorithms, as widely proven approaches, often perform sub-optimally or inconsistently across datasets. A key common issue is they strive to be one-size-fits-all but anomalies are distinctive in nature. We propose a method that tailors to such distinction. Presenting FMUAD - a Forecast-based, Multi-aspect, Unsupervised Anomaly Detection framework. FMUAD explicitly and separately captures the signature traits of anomaly types - spatial change, temporal change and correlation change - with independent modules. The modules then jointly learn an optimal feature representation, which is highly flexible and intuitive, unlike most other models in the category. Extensive experiments show our FMUAD framework consistently outperforms other state-of-the-art forecast-based anomaly detectors.

preprint2022arXiv

General Facial Representation Learning in a Visual-Linguistic Manner

How to learn a universal facial representation that boosts all face analysis tasks? This paper takes one step toward this goal. In this paper, we study the transfer performance of pre-trained models on face analysis tasks and introduce a framework, called FaRL, for general Facial Representation Learning in a visual-linguistic manner. On one hand, the framework involves a contrastive loss to learn high-level semantic meaning from image-text pairs. On the other hand, we propose exploring low-level information simultaneously to further enhance the face representation, by adding a masked image modeling. We perform pre-training on LAION-FACE, a dataset containing large amount of face image-text pairs, and evaluate the representation capability on multiple downstream tasks. We show that FaRL achieves better transfer performance compared with previous pre-trained models. We also verify its superiority in the low-data regime. More importantly, our model surpasses the state-of-the-art methods on face analysis tasks including face parsing and face alignment.

preprint2022arXiv

Gravitational-Wave Signatures of Chiral-Symmetric Technicolor

A chiral-symmetric technicolor model successfully reconciles the tension between electroweak precision tests and traditional technicolor models. Focusing on its simplest realization preserving the conventional Higgs mechanism, we study its primordial gravitational wave signatures originating from first order phase transitions in the early Universe. We found that abundant phase transition patterns arise from a physically viable parameter space. Besides, we have also found gravitational wave signals possibly visible by future experiments, such as LISA, BBO and u-DECIGO. Our results stress the importance of gravitational wave detectors in exploring new physics complementary to ground colliders in the multi-messenger astronomy era.

preprint2022arXiv

High sensitivity air-coupled MHz frequency ultrasound detection using on-chip microcavities

Owing to their dual-resonance enhanced sensitivity, cavity optomechanical systems provide an ideal platform for ultrasound sensing. In this work, we realize high sensitivity air-coupled ultrasound sensing from kilohertz (kHz) to megahertz (MHz) frequency range based on whispering gallery mode microcavities. Using a 57 um-diameter microtoroid with high optical Q factor (~10^7) and mechanical Q factor (~700), we achieve sensitivities of 46 uPa Hz^{-1/2}-10 mPa Hz^{-1/2} in a frequency range of 0.25-3.2 MHz. Thermal-noise-limited sensitivity is realized around the mechanical resonance at 2.56 MHz, in a frequency range of 0.6 MHz. We also observe the second- and third-order mechanical sidebands, and quantitatively study the intensities of each mechanical sideband as a function of the mechanical displacement. Measuring the combination of signal to noise ratios at all sidebands has the potential to extend the dynamic range of ultrasound sensing. In addition, to improve the ultrasound sensitivity in the kHz frequency range, we use a microdisk with a diameter of 200 um, and achieve sensitivities of 1.83 uPa Hz^{-1/2}-10.4 mPa Hz^{-1/2} in 30 kHz-1.65 MHz range.

preprint2022arXiv

Instance-wise Prompt Tuning for Pretrained Language Models

Prompt Learning has recently gained great popularity in bridging the gap between pretraining tasks and various downstream tasks. It freezes Pretrained Language Models (PLMs) and only tunes a few task-related parameters (prompts) for downstream tasks, greatly reducing the cost of tuning giant models. The key enabler of this is the idea of querying PLMs with task-specific knowledge implicated in prompts. This paper reveals a major limitation of existing methods that the indiscriminate prompts for all input data in a task ignore the intrinsic knowledge from input data, resulting in sub-optimal performance. We introduce Instance-wise Prompt Tuning (IPT), the first prompt learning paradigm that injects knowledge from the input data instances to the prompts, thereby providing PLMs with richer and more concrete context information. We devise a series of strategies to produce instance-wise prompts, addressing various concerns like model quality and cost-efficiency. Across multiple tasks and resource settings, IPT significantly outperforms task-based prompt learning methods, and achieves comparable performance to conventional finetuning with only 0.5% - 1.5% of tuned parameters.

preprint2022arXiv

K-Detector: Identifying Duplicate Crash Failures in Large-Scale Software Delivery

After a developer submits code, corresponding test cases arise to ensure the quality of software delivery. Test failures would occur during this period, such as crash, error, and timeout. Since it takes time for developers to resolve them, many duplicate failures will happen during this period. In the delivery practice of SAP HANA, crash triage is considered as the most time-consuming task. If duplicate crash failures can be automatically identified, the degree of automation will be significantly enhanced. To find such duplicates, we propose a training-based mathematical model that utilizes component information of SAP HANA to achieve better crash similarity comparison. We implement our approach in a tool named Knowledge-based Detector (K-Detector), which is verified by 11,208 samples and performs 0.986 in AUC. Furthermore, we have deployed K-Detector to the production environment, and it can save 97% human efforts in crash triage as statistics.

preprint2022arXiv

Large-Scale Pre-training for Person Re-identification with Noisy Labels

This paper aims to address the problem of pre-training for person re-identification (Re-ID) with noisy labels. To setup the pre-training task, we apply a simple online multi-object tracking system on raw videos of an existing unlabeled Re-ID dataset "LUPerson" nd build the Noisy Labeled variant called "LUPerson-NL". Since theses ID labels automatically derived from tracklets inevitably contain noises, we develop a large-scale Pre-training framework utilizing Noisy Labels (PNL), which consists of three learning modules: supervised Re-ID learning, prototype-based contrastive learning, and label-guided contrastive learning. In principle, joint learning of these three modules not only clusters similar examples to one prototype, but also rectifies noisy labels based on the prototype assignment. We demonstrate that learning directly from raw videos is a promising alternative for pre-training, which utilizes spatial and temporal correlations as weak supervision. This simple pre-training task provides a scalable way to learn SOTA Re-ID representations from scratch on "LUPerson-NL" without bells and whistles. For example, by applying on the same supervised Re-ID method MGN, our pre-trained model improves the mAP over the unsupervised pre-training counterpart by 5.7%, 2.2%, 2.3% on CUHK03, DukeMTMC, and MSMT17 respectively. Under the small-scale or few-shot setting, the performance gain is even more significant, suggesting a better transferability of the learned representation. Code is available at https://github.com/DengpanFu/LUPerson-NL

preprint2022arXiv

LogStamp: Automatic Online Log Parsing Based on Sequence Labelling

Logs are one of the most critical data for service management. It contains rich runtime information for both services and users. Since size of logs are often enormous in size and have free handwritten constructions, a typical log-based analysis needs to parse logs into structured format first. However, we observe that most existing log parsing methods cannot parse logs online, which is essential for online services. In this paper, we present an automatic online log parsing method, name as LogStamp. We extensively evaluate LogStamp on five public datasets to demonstrate the effectiveness of our proposed method. The experiments show that our proposed method can achieve high accuracy with only a small portion of the training set. For example, it can achieve an average accuracy of 0.956 when using only 10% of the data training.

preprint2022arXiv

M-Adapter: Modality Adaptation for End-to-End Speech-to-Text Translation

End-to-end speech-to-text translation models are often initialized with pre-trained speech encoder and pre-trained text decoder. This leads to a significant training gap between pre-training and fine-tuning, largely due to the modality differences between speech outputs from the encoder and text inputs to the decoder. In this work, we aim to bridge the modality gap between speech and text to improve translation quality. We propose M-Adapter, a novel Transformer-based module, to adapt speech representations to text. While shrinking the speech sequence, M-Adapter produces features desired for speech-to-text translation via modelling global and local dependencies of a speech sequence. Our experimental results show that our model outperforms a strong baseline by up to 1 BLEU score on the Must-C En$\rightarrow$DE dataset.\footnote{Our code is available at https://github.com/mingzi151/w2v2-st.}

preprint2022arXiv

MACSA: A Multimodal Aspect-Category Sentiment Analysis Dataset with Multimodal Fine-grained Aligned Annotations

Multimodal fine-grained sentiment analysis has recently attracted increasing attention due to its broad applications. However, the existing multimodal fine-grained sentiment datasets most focus on annotating the fine-grained elements in text but ignore those in images, which leads to the fine-grained elements in visual content not receiving the full attention they deserve. In this paper, we propose a new dataset, the Multimodal Aspect-Category Sentiment Analysis (MACSA) dataset, which contains more than 21K text-image pairs. The dataset provides fine-grained annotations for both textual and visual content and firstly uses the aspect category as the pivot to align the fine-grained elements between the two modalities. Based on our dataset, we propose the Multimodal ACSA task and a multimodal graph-based aligned model (MGAM), which adopts a fine-grained cross-modal fusion method. Experimental results show that our method can facilitate the baseline comparison for future research on this corpus. We will make the dataset and code publicly available.

preprint2022arXiv

New criterions on nonexistence of periodic orbits of planar dynamical systems and their applications

Characterizing existence or not of periodic orbit is a classical problem and it has both theoretical importance and many real applications. Here, several new criterions on nonexistence of periodic orbits of the planar dynamical system $\dot x=y,~\dot y=-g(x)-f(x,y)y$ are obtained in this paper, and by examples showing that these criterions are applicable, but the known ones are invalid to them. Based on these criterions, we further characterize the local topological structures of its equilibrium, which also show that one of the classical results by A.F. Andreev [Amer. Math. Soc. Transl. 8 (1958), 183--207] on local topological classification of the degenerate equilibrium is incomplete. Finally, as another application of these results, we classify the global phase portraits of a planar differential system, which comes from the third question in the list of the 33 questions posed by A. Gasull and also from a mechanical oscillator under suitable restriction to its parameters.

preprint2022arXiv

NLO QCD corrections to pseudoscalar quarkonium production with two heavy flavors in photon-photon collision

We calculate the next-to-leading order (NLO) quantum chromodynamics (QCD) corrections to $γ+γ\to η_c+c+\bar{c}$, $γ+γ\to η_b+b+\bar{b}$ and $γ+γ\to B_c+b+\bar{c}$ processes in the framework of non-relativistic QCD (NRQCD) factorization formalism. The cross sections at the SuperKEKB electron-positron collider, as well as the future collider like the Circular Electron Positron Collider (CEPC), are evaluated. Numerical results indicate that the NLO corrections are significant, and the uncertainties in theoretical predictions with NLO corrections are reduced as expected. Due to the high luminosity of the SuperKEKB collider, the $η_c+c+\bar{c}$ production is hopefully observable in the near future.

preprint2022arXiv

Omni-DETR: Omni-Supervised Object Detection with Transformers

We consider the problem of omni-supervised object detection, which can use unlabeled, fully labeled and weakly labeled annotations, such as image tags, counts, points, etc., for object detection. This is enabled by a unified architecture, Omni-DETR, based on the recent progress on student-teacher framework and end-to-end transformer based object detection. Under this unified architecture, different types of weak labels can be leveraged to generate accurate pseudo labels, by a bipartite matching based filtering mechanism, for the model to learn. In the experiments, Omni-DETR has achieved state-of-the-art results on multiple datasets and settings. And we have found that weak annotations can help to improve detection performance and a mixture of them can achieve a better trade-off between annotation cost and accuracy than the standard complete annotation. These findings could encourage larger object detection datasets with mixture annotations. The code is available at https://github.com/amazon-research/omni-detr.

preprint2022arXiv

Prompt Tuning for Generative Multimodal Pretrained Models

Prompt tuning has become a new paradigm for model tuning and it has demonstrated success in natural language pretraining and even vision pretraining. In this work, we explore the transfer of prompt tuning to multimodal pretraining, with a focus on generative multimodal pretrained models, instead of contrastive ones. Specifically, we implement prompt tuning on the unified sequence-to-sequence pretrained model adaptive to both understanding and generation tasks. Experimental results demonstrate that the light-weight prompt tuning can achieve comparable performance with finetuning and surpass other light-weight tuning methods. Besides, in comparison with finetuned models, the prompt-tuned models demonstrate improved robustness against adversarial attacks. We further figure out that experimental factors, including the prompt length, prompt depth, and reparameteratization, have great impacts on the model performance, and thus we empirically provide a recommendation for the setups of prompt tuning. Despite the observed advantages, we still find some limitations in prompt tuning, and we correspondingly point out the directions for future studies. Codes are available at \url{https://github.com/OFA-Sys/OFA}

preprint2022arXiv

Quantum phase transition in magnetic nanographenes on a lead superconductor

Quantum spins, referred to the spin operator preserved by full SU(2) symmetry in the absence of the magnetic anistropy, have been proposed to host exotic interactions with superconductivity4. However, spin orbit coupling and crystal field splitting normally cause a significant magnetic anisotropy for d/f-shell spins on surfaces6,9, breaking SU(2) symmetry and fabricating the spins with Ising properties10. Recently, magnetic nanographenes have been proven to host intrinsic quantum magnetism due to their negligible spin orbital coupling and crystal field splitting. Here, we fabricate three atomically precise nanographenes with the same magnetic ground state of spin S=1/2 on Pb(111) through engineering sublattice imbalance in graphene honeycomb lattice. Scanning tunneling spectroscopy reveals the coexistence of magnetic bound states and Kondo screening in such hybridized system. Through engineering the magnetic exchange strength between the unpaired spin in nanographenes and cooper pairs, quantum phase transition from the singlet to the doublet state has been observed, in consistent with quantum models of spins on superconductors. Our work demonstrates delocalized graphene magnetism host highly tunable magnetic bound states with cooper pairs, which can be further developed to study the Majorana bound states and other rich quantum physics of low-dimensional quantum spins on superconductors.

preprint2022arXiv

Real-Time Neural Character Rendering with Pose-Guided Multiplane Images

We propose pose-guided multiplane image (MPI) synthesis which can render an animatable character in real scenes with photorealistic quality. We use a portable camera rig to capture the multi-view images along with the driving signal for the moving subject. Our method generalizes the image-to-image translation paradigm, which translates the human pose to a 3D scene representation -- MPIs that can be rendered in free viewpoints, using the multi-views captures as supervision. To fully cultivate the potential of MPI, we propose depth-adaptive MPI which can be learned using variable exposure images while being robust to inaccurate camera registration. Our method demonstrates advantageous novel-view synthesis quality over the state-of-the-art approaches for characters with challenging motions. Moreover, the proposed method is generalizable to novel combinations of training poses and can be explicitly controlled. Our method achieves such expressive and animatable character rendering all in real time, serving as a promising solution for practical applications.

preprint2022arXiv

Remarkably Enhanced Dynamic Oxygen Migration on Graphene Oxide Supported by Copper Substrate

The dynamic covalent properties of graphene oxide (GO) are of fundamental interest to a broad range of scientific areas and technological applications. It remains a challenge to access the feasible dynamic reactions for reversibly breaking/reforming covalent bonds of oxygen functional groups on GO, although these reactions can be induced by photonic or mechanical routes, or mediated by adsorbed water. Here, using the density functional theory calculations, we demonstrate the remarkably enhanced dynamic oxygen migration along the basal plane of GO supported by copper substrate (GO@copper), with the C-O bond breaking reaction and proton transfer between the neighboring epoxy and hydroxyl groups. Compared to that on GO, the energy barrier of oxygen migration on GO@copper is sharply reduced to be less than or comparable to thermal fluctuations, and meanwhile the crystallographic match between GO and copper substrate induces new oxygen migration paths on GO@copper. This work sheds light on the understanding of metal substrate-enhanced dynamic properties of GO, and evidences the strategy to tune the activity of two-dimension-interfacial oxygen groups for various potential applications.

preprint2022arXiv

Rethinking Feature Uncertainty in Stochastic Neural Networks for Adversarial Robustness

It is well-known that deep neural networks (DNNs) have shown remarkable success in many fields. However, when adding an imperceptible magnitude perturbation on the model input, the model performance might get rapid decrease. To address this issue, a randomness technique has been proposed recently, named Stochastic Neural Networks (SNNs). Specifically, SNNs inject randomness into the model to defend against unseen attacks and improve the adversarial robustness. However, existed studies on SNNs mainly focus on injecting fixed or learnable noises to model weights/activations. In this paper, we find that the existed SNNs performances are largely bottlenecked by the feature representation ability. Surprisingly, simply maximizing the variance per dimension of the feature distribution leads to a considerable boost beyond all previous methods, which we named maximize feature distribution variance stochastic neural network (MFDV-SNN). Extensive experiments on well-known white- and black-box attacks show that MFDV-SNN achieves a significant improvement over existing methods, which indicates that it is a simple but effective method to improve model robustness.

preprint2022arXiv

Text Steganalysis with Attentional LSTM-CNN

With the rapid development of Natural Language Processing (NLP) technologies, text steganography methods have been significantly innovated recently, which poses a great threat to cybersecurity. In this paper, we propose a novel attentional LSTM-CNN model to tackle the text steganalysis problem. The proposed method firstly maps words into semantic space for better exploitation of the semantic feature in texts and then utilizes a combination of Convolutional Neural Networks (CNNs) and Long Short-Term Memory (LSTM) recurrent neural networks to capture both local and long-distance contextual information in steganography texts. In addition, we apply attention mechanism to recognize and attend to important clues within suspicious sentences. After merge feature clues from Convolutional Neural Networks (CNNs) and Recurrent Neural Networks (RNNs), we use a softmax layer to categorize the input text as cover or stego. Experiments showed that our model can achieve the state-of-art result in the text steganalysis task.

preprint2021arXiv

A coarse-to-fine framework for unsupervised multi-contrast MR image deformable registration with dual consistency constraint

Multi-contrast magnetic resonance (MR) image registration is useful in the clinic to achieve fast and accurate imaging-based disease diagnosis and treatment planning. Nevertheless, the efficiency and performance of the existing registration algorithms can still be improved. In this paper, we propose a novel unsupervised learning-based framework to achieve accurate and efficient multi-contrast MR image registrations. Specifically, an end-to-end coarse-to-fine network architecture consisting of affine and deformable transformations is designed to improve the robustness and achieve end-to-end registration. Furthermore, a dual consistency constraint and a new prior knowledge-based loss function are developed to enhance the registration performances. The proposed method has been evaluated on a clinical dataset containing 555 cases, and encouraging performances have been achieved. Compared to the commonly utilized registration methods, including VoxelMorph, SyN, and LT-Net, the proposed method achieves better registration performance with a Dice score of 0.8397 in identifying stroke lesions. With regards to the registration speed, our method is about 10 times faster than the most competitive method of SyN (Affine) when testing on a CPU. Moreover, we prove that our method can still perform well on more challenging tasks with lacking scanning information data, showing high robustness for the clinical application.

preprint2021arXiv

Diformer: Directional Transformer for Neural Machine Translation

Autoregressive (AR) and Non-autoregressive (NAR) models have their own superiority on the performance and latency, combining them into one model may take advantage of both. Current combination frameworks focus more on the integration of multiple decoding paradigms with a unified generative model, e.g. Masked Language Model. However, the generalization can be harmful to the performance due to the gap between training objective and inference. In this paper, we aim to close the gap by preserving the original objective of AR and NAR under a unified framework. Specifically, we propose the Directional Transformer (Diformer) by jointly modelling AR and NAR into three generation directions (left-to-right, right-to-left and straight) with a newly introduced direction variable, which works by controlling the prediction of each token to have specific dependencies under that direction. The unification achieved by direction successfully preserves the original dependency assumption used in AR and NAR, retaining both generalization and performance. Experiments on 4 WMT benchmarks demonstrate that Diformer outperforms current united-modelling works with more than 1.5 BLEU points for both AR and NAR decoding, and is also competitive to the state-of-the-art independent AR and NAR models.

preprint2021arXiv

Gaussian State-Based Quantum Illumination with Simple Photodetection

Proofs of the quantum advantage available in imaging or detecting objects under quantum illumination can rely on optimal measurements without specifying what they are. We use the continuous-variable Gaussian quantum information formalism to show that quantum illumination is better for object detection compared with coherent states of the same mean photon number, even for simple direct photodetection. The advantage persists if signal energy and object reflectivity are low and background thermal noise is high. The advantage is even greater if we match signal beam detection probabilities rather than mean photon number. We perform all calculations with thermal states, even for non-Gaussian conditioned states with negative Wigner functions. We simulate repeated detection using a Monte Carlo process that clearly shows the advantages obtainable.

preprint2021arXiv

Large-Scale Training System for 100-Million Classification at Alibaba

In the last decades, extreme classification has become an essential topic for deep learning. It has achieved great success in many areas, especially in computer vision and natural language processing (NLP). However, it is very challenging to train a deep model with millions of classes due to the memory and computation explosion in the last output layer. In this paper, we propose a large-scale training system to address these challenges. First, we build a hybrid parallel training framework to make the training process feasible. Second, we propose a novel softmax variation named KNN softmax, which reduces both the GPU memory consumption and computation costs and improves the throughput of training. Then, to eliminate the communication overhead, we propose a new overlapping pipeline and a gradient sparsification method. Furthermore, we design a fast continuous convergence strategy to reduce total training iterations by adaptively adjusting learning rate and updating model parameters. With the help of all the proposed methods, we gain 3.9$\times$ throughput of our training system and reduce almost 60\% of training iterations. The experimental results show that using an in-house 256 GPUs cluster, we could train a classifier of 100 million classes on Alibaba Retail Product Dataset in about five days while achieving a comparable accuracy with the naive softmax training process.

preprint2021arXiv

Quasinormal Modes and Thermodynamics of Regular Black Holes

By applying the dimensionless scheme, we investigate the quasinormal modes and phase transitions analytically for three types of regular black holes. The universal deviations to the first law of mechanics in regular black holes are proved. Meanwhile, we verify that second order phase transitions and Davies points still exist in these three models. In addition, we calculate their quasinormal modes in the eikonal limit by applying the light ring/quasinormal mode correspondence, and discuss the spiral-like shapes and the relations between the quasinormal modes and phase transitions. As the main result, we show that spiral-like shapes in the complex frequency plane are closely related to the parameterization, namely in some particular units the spiral-like shapes will emerge in the models, which may not be of the spiral behaviors reported by other authors. We also discover a universal property of regular black holes, i.e., the imaginary parts of their QNMs do not vanish for the extreme cases, which does not appear in singular black holes, such as the Reissner-Nordström and Kerr black holes, etc.

preprint2020arXiv

Adversarial Light Projection Attacks on Face Recognition Systems: A Feasibility Study

Deep learning-based systems have been shown to be vulnerable to adversarial attacks in both digital and physical domains. While feasible, digital attacks have limited applicability in attacking deployed systems, including face recognition systems, where an adversary typically has access to the input and not the transmission channel. In such setting, physical attacks that directly provide a malicious input through the input channel pose a bigger threat. We investigate the feasibility of conducting real-time physical attacks on face recognition systems using adversarial light projections. A setup comprising a commercially available web camera and a projector is used to conduct the attack. The adversary uses a transformation-invariant adversarial pattern generation method to generate a digital adversarial pattern using one or more images of the target available to the adversary. The digital adversarial pattern is then projected onto the adversary's face in the physical domain to either impersonate a target (impersonation) or evade recognition (obfuscation). We conduct preliminary experiments using two open-source and one commercial face recognition system on a pool of 50 subjects. Our experimental results demonstrate the vulnerability of face recognition systems to light projection attacks in both white-box and black-box attack settings.

preprint2020arXiv

Application of light diffraction theory to qualify the downstream light field modulation property of mitigated KDP crystals

Micro-milling can effectively remove laser damage sites on a KDP (potassium dihydrogen phosphate) surface and then improve the laser damage resistance of the components. However, the repaired KDP surface could cause light propagating turbulence and downstream light intensification with the potential risk to damage downstream optics. In order to analyze the downstream light field modulation caused by Gaussian mitigation pits on KDP crystals, a computational model of the downstream light diffraction based on the angular spectrum theory and the Gaussian repair contour is established. The results show that the phase offset caused by the repaired surface produces a large light field modulation near the rear KDP surface. The modulation generated in the whole downstream light field is greater than that caused by the amplitude change. Therefore, the phase characteristics of the outgoing light could be suggested as a vital research topic for future research on the downstream light field modulation caused by mitigation contours. Significantly, the experimental results on the downstream light intensity distribution have good agreement with the simulation ones, which proves the validity of the established downstream light diffraction model. The phase characterization of the outgoing light is proposed as an evaluation tool in the repair of KDP crystals. The developed analytical method and numerical discrete algorithm could be also applicable in qualifying the repair quality of other optical components applied in high-power laser systems.

preprint2020arXiv

Beating Attackers At Their Own Games: Adversarial Example Detection Using Adversarial Gradient Directions

Adversarial examples are input examples that are specifically crafted to deceive machine learning classifiers. State-of-the-art adversarial example detection methods characterize an input example as adversarial either by quantifying the magnitude of feature variations under multiple perturbations or by measuring its distance from estimated benign example distribution. Instead of using such metrics, the proposed method is based on the observation that the directions of adversarial gradients when crafting (new) adversarial examples play a key role in characterizing the adversarial space. Compared to detection methods that use multiple perturbations, the proposed method is efficient as it only applies a single random perturbation on the input example. Experiments conducted on two different databases, CIFAR-10 and ImageNet, show that the proposed detection method achieves, respectively, 97.9% and 98.6% AUC-ROC (on average) on five different adversarial attacks, and outperforms multiple state-of-the-art detection methods. Results demonstrate the effectiveness of using adversarial gradient directions for adversarial example detection.

preprint2020arXiv

Category-Specific CNN for Visual-aware CTR Prediction at JD.com

As one of the largest B2C e-commerce platforms in China, JD com also powers a leading advertising system, serving millions of advertisers with fingertip connection to hundreds of millions of customers. In our system, as well as most e-commerce scenarios, ads are displayed with images.This makes visual-aware Click Through Rate (CTR) prediction of crucial importance to both business effectiveness and user experience. Existing algorithms usually extract visual features using off-the-shelf Convolutional Neural Networks (CNNs) and late fuse the visual and non-visual features for the finally predicted CTR. Despite being extensively studied, this field still face two key challenges. First, although encouraging progress has been made in offline studies, applying CNNs in real systems remains non-trivial, due to the strict requirements for efficient end-to-end training and low-latency online serving. Second, the off-the-shelf CNNs and late fusion architectures are suboptimal. Specifically, off-the-shelf CNNs were designed for classification thus never take categories as input features. While in e-commerce, categories are precisely labeled and contain abundant visual priors that will help the visual modeling. Unaware of the ad category, these CNNs may extract some unnecessary category-unrelated features, wasting CNN's limited expression ability. To overcome the two challenges, we propose Category-specific CNN (CSCNN) specially for CTR prediction. CSCNN early incorporates the category knowledge with a light-weighted attention-module on each convolutional layer. This enables CSCNN to extract expressive category-specific visual patterns that benefit the CTR prediction. Offline experiments on benchmark and a 10 billion scale real production dataset from JD, together with an Online A/B test show that CSCNN outperforms all compared state-of-the-art algorithms.

preprint2020arXiv

Eco-Driving at Signalized Intersections: A Multiple Signal Optimization Approach

Consecutive traffic signalized intersections can increase vehicle stops, producing vehicle accelerations on arterial roads and potentially increasing vehicle fuel consumption levels. Eco-driving systems are one method to improve vehicle energy efficiency with the help of vehicle connectivity. In this paper, an eco-driving system is developed that computes a fuel-optimized vehicle trajectory while traversing more than one signalized intersection. The system is designed in a modular and scalable fashion allowing it to be implemented in large networks without significantly increasing the computational complexity. The proposed system utilizes signal phasing and timing (SPaT) data that are communicated to connected vehicles (CVs) together with real-time vehicle dynamics to compute fuel-optimum trajectories. The proposed algorithm is incorporated in the INTEGRATION microscopic traffic assignment and simulation software to conduct a comprehensive sensitivity analysis of various variables, including: system market penetration rates (MPRs), demand levels, phase splits, offsets and traffic signal spacings on the system performance. The analysis shows that at 100\% MPR, fuel consumption can be reduced by as high as 13.8\%. Moreover, higher MPRs and shorter phase lengths result in larger fuel savings. Optimum demand levels and traffic signal spacings exist that maximize the effectiveness of the algorithm. Furthermore, the study demonstrates that the algorithm works less effective when the traffic signal offset is closer to its optimal value. Finally, the study highlights the need for further work to enhance the algorithm to deal with over-saturated traffic conditions.

preprint2020arXiv

Face X-ray for More General Face Forgery Detection

In this paper we propose a novel image representation called face X-ray for detecting forgery in face images. The face X-ray of an input face image is a greyscale image that reveals whether the input image can be decomposed into the blending of two images from different sources. It does so by showing the blending boundary for a forged image and the absence of blending for a real image. We observe that most existing face manipulation methods share a common step: blending the altered face into an existing background image. For this reason, face X-ray provides an effective way for detecting forgery generated by most existing face manipulation algorithms. Face X-ray is general in the sense that it only assumes the existence of a blending step and does not rely on any knowledge of the artifacts associated with a specific face manipulation technique. Indeed, the algorithm for computing face X-ray can be trained without fake images generated by any of the state-of-the-art face manipulation methods. Extensive experiments show that face X-ray remains effective when applied to forgery generated by unseen face manipulation techniques, while most existing face forgery detection or deepfake detection algorithms experience a significant performance drop.

preprint2020arXiv

FaceShifter: Towards High Fidelity And Occlusion Aware Face Swapping

In this work, we propose a novel two-stage framework, called FaceShifter, for high fidelity and occlusion aware face swapping. Unlike many existing face swapping works that leverage only limited information from the target image when synthesizing the swapped face, our framework, in its first stage, generates the swapped face in high-fidelity by exploiting and integrating the target attributes thoroughly and adaptively. We propose a novel attributes encoder for extracting multi-level target face attributes, and a new generator with carefully designed Adaptive Attentional Denormalization (AAD) layers to adaptively integrate the identity and the attributes for face synthesis. To address the challenging facial occlusions, we append a second stage consisting of a novel Heuristic Error Acknowledging Refinement Network (HEAR-Net). It is trained to recover anomaly regions in a self-supervised way without any manual annotations. Extensive experiments on wild faces demonstrate that our face swapping results are not only considerably more perceptually appealing, but also better identity preserving in comparison to other state-of-the-art methods.

preprint2020arXiv

Fashion Recommendation and Compatibility Prediction Using Relational Network

Fashion is an inherently visual concept and computer vision and artificial intelligence (AI) are playing an increasingly important role in shaping the future of this domain. Many research has been done on recommending fashion products based on the learned user preferences. However, in addition to recommending single items, AI can also help users create stylish outfits from items they already have, or purchase additional items that go well with their current wardrobe. Compatibility is the key factor in creating stylish outfits from single items. Previous studies have mostly focused on modeling pair-wise compatibility. There are a few approaches that consider an entire outfit, but these approaches have limitations such as requiring rich semantic information, category labels, and fixed order of items. Thus, they fail to effectively determine compatibility when such information is not available. In this work, we adopt a Relation Network (RN) to develop new compatibility learning models, Fashion RN and FashionRN-VSE, that addresses the limitations of existing approaches. FashionRN learns the compatibility of an entire outfit, with an arbitrary number of items, in an arbitrary order. We evaluated our model using a large dataset of 49,740 outfits that we collected from Polyvore website. Quantitatively, our experimental results demonstrate state of the art performance compared with alternative methods in the literature in both compatibility prediction and fill-in-the-blank test. Qualitatively, we also show that the item embedding learned by FashionRN indicate the compatibility among fashion items.

preprint2020arXiv

FCEM: A Novel Fast Correlation Extract Model For Real Time Steganalysis of VoIP Stream via Multi-head Attention

Extracting correlation features between codes-words with high computational efficiency is crucial to steganalysis of Voice over IP (VoIP) streams. In this paper, we utilized attention mechanisms, which have recently attracted enormous interests due to their highly parallelizable computation and flexibility in modeling correlation in sequence, to tackle steganalysis problem of Quantization Index Modulation (QIM) based steganography in compressed VoIP stream. We design a light-weight neural network named Fast Correlation Extract Model (FCEM) only based on a variant of attention called multi-head attention to extract correlation features from VoIP frames. Despite its simple form, FCEM outperforms complicated Recurrent Neural Networks (RNNs) and Convolutional Neural Networks (CNNs) models on both prediction accuracy and time efficiency. It significantly improves the best result in detecting both low embedded rates and short samples recently. Besides, the proposed model accelerates the detection speed as twice as before when the sample length is as short as 0.1s, making it a excellent method for online services.

preprint2020arXiv

Feedback Graph Convolutional Network for Skeleton-based Action Recognition

Skeleton-based action recognition has attracted considerable attention in computer vision since skeleton data is more robust to the dynamic circumstance and complicated background than other modalities. Recently, many researchers have used the Graph Convolutional Network (GCN) to model spatial-temporal features of skeleton sequences by an end-to-end optimization. However, conventional GCNs are feedforward networks which are impossible for low-level layers to access semantic information in the high-level layers. In this paper, we propose a novel network, named Feedback Graph Convolutional Network (FGCN). This is the first work that introduces the feedback mechanism into GCNs and action recognition. Compared with conventional GCNs, FGCN has the following advantages: (1) a multi-stage temporal sampling strategy is designed to extract spatial-temporal features for action recognition in a coarse-to-fine progressive process; (2) A dense connections based Feedback Graph Convolutional Block (FGCB) is proposed to introduce feedback connections into the GCNs. It transmits the high-level semantic features to the low-level layers and flows temporal information stage by stage to progressively model global spatial-temporal features for action recognition; (3) The FGCN model provides early predictions. In the early stages, the model receives partial information about actions. Naturally, its predictions are relatively coarse. The coarse predictions are treated as the prior to guide the feature learning of later stages for a accurate prediction. Extensive experiments on the datasets, NTU-RGB+D, NTU-RGB+D120 and Northwestern-UCLA, demonstrate that the proposed FGCN is effective for action recognition. It achieves the state-of-the-art performance on the three datasets.

preprint2020arXiv

GroupIM: A Mutual Information Maximization Framework for Neural Group Recommendation

We study the problem of making item recommendations to ephemeral groups, which comprise users with limited or no historical activities together. Existing studies target persistent groups with substantial activity history, while ephemeral groups lack historical interactions. To overcome group interaction sparsity, we propose data-driven regularization strategies to exploit both the preference covariance amongst users who are in the same group, as well as the contextual relevance of users' individual preferences to each group. We make two contributions. First, we present a recommender architecture-agnostic framework GroupIM that can integrate arbitrary neural preference encoders and aggregators for ephemeral group recommendation. Second, we regularize the user-group latent space to overcome group interaction sparsity by: maximizing mutual information between representations of groups and group members; and dynamically prioritizing the preferences of highly informative members through contextual preference weighting. Our experimental results on several real-world datasets indicate significant performance improvements (31-62% relative NDCG@20) over state-of-the-art group recommendation techniques.

preprint2020arXiv

NLO QCD corrections to $B_c$-pair production in photon-photon collision

The $B_c$ meson pair, including pairs of both pseudoscalar states and vector states, productions in high energy photon-photon interaction are investigated at the next-to-leading order (NLO) accuracy in the nonrelativistic quantum chromodynamics (NRQCD) factorization formalism. The corresponding cross sections at the future $e^+e^-$ colliders with $\sqrt{s}=250$ GeV and $500$ GeV are evaluated. Numerical result indicates that the inclusion of the NLO corrections shall greatly suppress the scale dependence and enhance the prediction reliability. In addition to the phenomenological meaning, the NLO QCD calculation of this process subjects to certain technical issues, which are elucidated in details and might be applicable to other relevant investigations.

preprint2020arXiv

NLO QCD corrections to $J/ψ$ pair production in photon-photon collision

We calculate the next-to-leading order (NLO) quantum chromodynamics (QCD) correction to the exclusive process $γ+γ\to J/ψ+J/ψ$ in the framework of non-relativistic QCD (NRQCD) factorization formalism. The cross sections at the SuperKEKB electron-positron collider, as well as at the future colliders, like the Circular Electron Positron Collider (CEPC) and the International Linear Collider (ILC), are evaluated. Numerical result indicates that the process will be hopefully to be seen by the Belle II detector within the next decade.

preprint2020arXiv

Rethinking the Hyperparameters for Fine-tuning

Fine-tuning from pre-trained ImageNet models has become the de-facto standard for various computer vision tasks. Current practices for fine-tuning typically involve selecting an ad-hoc choice of hyperparameters and keeping them fixed to values normally used for training from scratch. This paper re-examines several common practices of setting hyperparameters for fine-tuning. Our findings are based on extensive empirical evaluation for fine-tuning on various transfer learning benchmarks. (1) While prior works have thoroughly investigated learning rate and batch size, momentum for fine-tuning is a relatively unexplored parameter. We find that the value of momentum also affects fine-tuning performance and connect it with previous theoretical findings. (2) Optimal hyperparameters for fine-tuning, in particular, the effective learning rate, are not only dataset dependent but also sensitive to the similarity between the source domain and target domain. This is in contrast to hyperparameters for training from scratch. (3) Reference-based regularization that keeps models close to the initial model does not necessarily apply for "dissimilar" datasets. Our findings challenge common practices of fine-tuning and encourages deep learning practitioners to rethink the hyperparameters for fine-tuning.

preprint2020arXiv

Transfer Learning via Contextual Invariants for One-to-Many Cross-Domain Recommendation

The rapid proliferation of new users and items on the social web has aggravated the gray-sheep user/long-tail item challenge in recommender systems. Historically, cross-domain co-clustering methods have successfully leveraged shared users and items across dense and sparse domains to improve inference quality. However, they rely on shared rating data and cannot scale to multiple sparse target domains (i.e., the one-to-many transfer setting). This, combined with the increasing adoption of neural recommender architectures, motivates us to develop scalable neural layer-transfer approaches for cross-domain learning. Our key intuition is to guide neural collaborative filtering with domain-invariant components shared across the dense and sparse domains, improving the user and item representations learned in the sparse domains. We leverage contextual invariances across domains to develop these shared modules, and demonstrate that with user-item interaction context, we can learn-to-learn informative representation spaces even with sparse interaction data. We show the effectiveness and scalability of our approach on two public datasets and a massive transaction dataset from Visa, a global payments technology company (19% Item Recall, 3x faster vs. training separate models for each domain). Our approach is applicable to both implicit and explicit feedback settings.

preprint2019arXiv

CLCI-Net: Cross-Level fusion and Context Inference Networks for Lesion Segmentation of Chronic Stroke

Segmenting stroke lesions from T1-weighted MR images is of great value for large-scale stroke rehabilitation neuroimaging analyses. Nevertheless, there are great challenges with this task, such as large range of stroke lesion scales and the tissue intensity similarity. The famous encoder-decoder convolutional neural network, which although has made great achievements in medical image segmentation areas, may fail to address these challenges due to the insufficient uses of multi-scale features and context information. To address these challenges, this paper proposes a Cross-Level fusion and Context Inference Network (CLCI-Net) for the chronic stroke lesion segmentation from T1-weighted MR images. Specifically, a Cross-Level feature Fusion (CLF) strategy was developed to make full use of different scale features across different levels; Extending Atrous Spatial Pyramid Pooling (ASPP) with CLF, we have enriched multi-scale features to handle the different lesion sizes; In addition, convolutional long short-term memory (ConvLSTM) is employed to infer context information and thus capture fine structures to address the intensity similarity issue. The proposed approach was evaluated on an open-source dataset, the Anatomical Tracings of Lesions After Stroke (ATLAS) with the results showing that our network outperforms five state-of-the-art methods. We make our code and models available at https://github.com/YH0517/CLCI_Net.

preprint2019arXiv

Fast Steganalysis Method for VoIP Streams

In this letter, we present a novel and extremely fast steganalysis method of Voice over IP (VoIP) streams, driven by the need for a quick and accurate detection of possible steganography in VoIP streams. We firstly analyzed the correlations in carriers. To better exploit the correlation in code-words, we mapped vector quantization code-words into a semantic space. In order to achieve high detection efficiency, only one hidden layer is utilized to extract the correlations between these code-words. Finally, based on the extracted correlation features, we used the softmax classifier to categorize the input stream carriers. To boost the performance of this proposed model, we incorporate a simple knowledge distillation framework into the training process. Experimental results show that the proposed method achieves state-of-the-art performance both in detection accuracy and efficiency. In particular, the processing time of this method on average is only about 0.05\% when sample length is as short as 0.1s, attaching strong practical value to online serving of steganography monitor.

preprint2019arXiv

NLO QCD Corrections to Inclusive Charmonium and $B_c$ Meson Production in $W^+$ Decays

We calculate the next-to-leading order (NLO) quantum chromodynamics (QCD) corrections to inclusive processes $W^+\to J/ψ(η_c)+c+\bar{s}+X$ and $W^+\to B_c(B_c^{*})+b+\bar{s}+X$ in the framework of nonrelativistic QCD (NRQCD) factorization formalism. Result indicates that the NLO corrections are significant, and the uncertainties in theoretical predictions with NLO corrections are greatly reduced. The charmonium and $B_c$ meson yielding rates at the Large Hadron Collider (LHC) are given.

preprint2019arXiv

X-Net: Brain Stroke Lesion Segmentation Based on Depthwise Separable Convolution and Long-range Dependencies

The morbidity of brain stroke increased rapidly in the past few years. To help specialists in lesion measurements and treatment planning, automatic segmentation methods are critically required for clinical practices. Recently, approaches based on deep learning and methods for contextual information extraction have served in many image segmentation tasks. However, their performances are limited due to the insufficient training of a large number of parameters, which sometimes fail in capturing long-range dependencies. To address these issues, we propose a depthwise separable convolution based X-Net that designs a nonlocal operation namely Feature Similarity Module (FSM) to capture long-range dependencies. The adopted depthwise convolution allows to reduce the network size, while the developed FSM provides a more effective, dense contextual information extraction and thus facilitates better segmentation. The effectiveness of X-Net was evaluated on an open dataset Anatomical Tracings of Lesions After Stroke (ATLAS) with superior performance achieved compared to other six state-of-the-art approaches. We make our code and models available at https://github.com/Andrewsher/X-Net.

preprint2016arXiv

$Extrasolar~Storms$: Pressure-dependent Changes In Light Curve Phase In Brown Dwarfs From Simultaneous $Hubble$ and $Spitzer$ Observations

We present $Spitzer$/IRAC Ch1 and Ch2 monitoring of six brown dwarfs during 8 different epochs over the course of 20 months. For four brown dwarfs, we also obtained simulataneous $HST$/WFC3 G141 Grism spectra during two epochs and derived light curves in five narrow-band filters. Probing different pressure levels in the atmospheres, the multi-wavelength light curves of our six targets all exhibit variations, and the shape of the light curves evolves over the timescale of a rotation period, ranging from 1.4 h to 13 h. We compare the shapes of the light curves and estimate the phase shifts between the light curves observed at different wavelengths by comparing the phase of the primary Fourier components. We use state-of-the-art atmosphere models to determine the flux contribution of different pressure layers to the observed flux in each filter. We find that the light curves that probe higher pressures are similar and in phase, but are offset and often different from the light curves that probe lower pressures. The phase differences between the two groups of light curves suggest that the modulations seen at lower and higher pressures may be introduced by different cloud layers.

preprint2016arXiv

A Comparative Study of Object Trackers for Infrared Flying Bird Tracking

Bird strikes present a huge risk for aircraft, especially since traditional airport bird surveillance is mainly dependent on inefficient human observation. Computer vision based technology has been proposed to automatically detect birds, determine bird flying trajectories, and predict aircraft takeoff delays. However, the characteristics of bird flight using imagery and the performance of existing methods applied to flying bird task are not well known. Therefore, we perform infrared flying bird tracking experiments using 12 state-of-the-art algorithms on a real BIRDSITE-IR dataset to obtain useful clues and recommend feature analysis. We also develop a Struck-scale method to demonstrate the effectiveness of multiple scale sampling adaption in handling the object of flying bird with varying shape and scale. The general analysis can be used to develop specialized bird tracking methods for airport safety, wildness and urban bird population studies.

preprint2016arXiv

Cloud Atlas: Discovery of Patchy Clouds and High-amplitude Rotational Modulations In a Young, Extremely Red L-type Brown Dwarf

Condensate clouds fundamentally impact the atmospheric structure and spectra of exoplanets and brown dwarfs but the connections between surface gravity, cloud structure, dust in the upper atmosphere, and the red colors of some brown dwarfs remain poorly understood. Rotational modulations enable the study of different clouds in the same atmosphere, thereby providing a method to isolate the effects of clouds. Here we present the discovery of high peak-to-peak amplitude (8%) rotational modulations in a low-gravity, extremely red (J-Ks=2.55) L6 dwarf WISEP J004701.06+680352.1 (W0047). Using the Hubble Space Telescope (HST) time-resolved grism spectroscopy we find a best-fit rotational period (13.20$\pm$0.14 hours) with a larger amplitude at 1.1 micron than at 1.7 micron. This is the third largest near-infrared variability amplitude measured in a brown dwarf, demonstrating that large-amplitude variations are not limited to the L/T transition but are present in some extremely red L-type dwarfs. We report a tentative trend between the wavelength dependence of relative amplitude, possibly proxy for small dust grains lofted in the upper atmosphere, and the likelihood of large-amplitude variability. By assuming forsterite as haze particle, we successfully explain the wavelength dependent amplitude with submicron-sized haze particles sizes of around 0.4 μm. W0047 links the earlier spectral and later spectral type brown dwarfs in which rotational modulations have been observed, the large amplitude variations in this object make this a benchmark brown dwarf for the study of cloud properties close to the L/T transition.

preprint2016arXiv

Exploit Bounding Box Annotations for Multi-label Object Recognition

Convolutional neural networks (CNNs) have shown great performance as general feature representations for object recognition applications. However, for multi-label images that contain multiple objects from different categories, scales and locations, global CNN features are not optimal. In this paper, we incorporate local information to enhance the feature discriminative power. In particular, we first extract object proposals from each image. With each image treated as a bag and object proposals extracted from it treated as instances, we transform the multi-label recognition problem into a multi-class multi-instance learning problem. Then, in addition to extracting the typical CNN feature representation from each proposal, we propose to make use of ground-truth bounding box annotations (strong labels) to add another level of local information by using nearest-neighbor relationships of local regions to form a multi-view pipeline. The proposed multi-view multi-instance framework utilizes both weak and strong labels effectively, and more importantly it has the generalization ability to even boost the performance of unseen categories by partial strong labels from other categories. Our framework is extensively compared with state-of-the-art hand-crafted feature based methods and CNN based methods on two multi-label benchmark datasets. The experimental results validate the discriminative power and the generalization ability of the proposed framework. With strong labels, our framework is able to achieve state-of-the-art results in both datasets.

preprint2016arXiv

Improving Multi-label Learning with Missing Labels by Structured Semantic Correlations

Multi-label learning has attracted significant interests in computer vision recently, finding applications in many vision tasks such as multiple object recognition and automatic image annotation. Associating multiple labels to a complex image is very difficult, not only due to the intricacy of describing the image, but also because of the incompleteness nature of the observed labels. Existing works on the problem either ignore the label-label and instance-instance correlations or just assume these correlations are linear and unstructured. Considering that semantic correlations between images are actually structured, in this paper we propose to incorporate structured semantic correlations to solve the missing label problem of multi-label learning. Specifically, we project images to the semantic space with an effective semantic descriptor. A semantic graph is then constructed on these images to capture the structured correlations between them. We utilize the semantic graph Laplacian as a smooth term in the multi-label learning formulation to incorporate the structured semantic correlations. Experimental results demonstrate the effectiveness of the proposed semantic descriptor and the usefulness of incorporating the structured semantic correlations. We achieve better results than state-of-the-art multi-label learning methods on four benchmark datasets.

preprint2016arXiv

Weyl semimetals as catalysts

The search for highly efficient and low-cost catalysts is one of the main driving forces in catalytic chemistry. Current strategies for the catalyst design focus on increasing the number and activity of local catalytic sites, such as the edge-sites of molybdenum disulfides in the hydrogen evolution reaction (HER). Here, we propose and demonstrate a different principle that goes beyond local site optimization by utilizing topological electronic states, a global property of the material, to spur catalytic activity. For HER, we have found excellent catalysts among the transition-metal monopnictides - NbP, TaP, NbAs, and TaAs - which were recently discovered to be topological Weyl semimetals. In addition to the free energy considerations we explore the role of metallicity, carrier mobility and topological electronic states for remarkable HER performance of these materials. The combination of robust topological surface states and large room temperature carrier mobility both of which originate from bulk Dirac bands of the Weyl semimetal appears to be the recipe for good HER catalyst. Our work provides a guiding principle for the discovery of novel catalysts from the emerging field of topological materials.

preprint2014arXiv

A Parallel Way to Select the Parameters of SVM Based on the Ant Optimization Algorithm

A large number of experimental data shows that Support Vector Machine (SVM) algorithm has obvious advantages in text classification, handwriting recognition, image classification, bioinformatics, and some other fields. To some degree, the optimization of SVM depends on its kernel function and Slack variable, the determinant of which is its parameters $δ$ and c in the classification function. That is to say,to optimize the SVM algorithm, the optimization of the two parameters play a huge role. Ant Colony Optimization (ACO) is optimization algorithm which simulate ants to find the optimal path.In the available literature, we mix the ACO algorithm and Parallel algorithm together to find a well parameters.

preprint2014arXiv

HST Rotational Spectral Mapping of Two L-Type Brown Dwarfs: Variability In and Out of Water Bands Indicates High-Altitude Haze Layers

We present time-resolved near-infrared spectroscopy of two L5 dwarfs, 2MASS J18212815+1414010 and 2MASS J15074759-1627386, observed with the Wide Field Camera 3 instrument on the Hubble Space Telescope (HST). We study the wavelength dependence of rotation-modulated flux variations between 1.1 $μ$m and 1.7 $μ$m. We find that the water absorption bands of the two L5 dwarfs at 1.15 $μ$m and 1.4 $μ$m vary at similar amplitudes as the adjacent continuum. This differs from the results of previous HST observations of L/T transition dwarfs, in which the water absorption at 1.4 $μ$m displays variations of about half of the amplitude at other wavelengths. We find that the relative amplitude of flux variability out of the water band with respect to that in the water band shows a increasing trend from the L5 dwarfs toward the early T dwarfs. We utilize the models of Saumon & Marley (2008) and find that the observed variability of the L5 dwarfs can be explained by the presence of spatially varying high-altitude haze layers above the condensate clouds. Therefore, our observations show that the heterogeneity of haze layers - the driver of the variability - must be located at very low pressures, where even the water opacity is negligible. In the near future, the rotational spectral mapping technique could be utilized for other atomic and molecular species to probe different pressure levels in the atmospheres of brown dwarfs and exoplanets and uncover both horizontal and vertical cloud structures.

preprint2013arXiv

Achieving Both High Power and Energy Density in Electrochemical Supercapacitors with Nanoporous Graphene Materials

Supercapacitors, based on the fast ion transportation, are specialized to provide high power, long stability, and efficient energy storage with highly porous electrode materials. However, their low energy density and specific capacitance prevent them from many applications that require long duration. Using a scalable nanoporous graphene synthesis method involving a simple annealing process in hydrogen, here we show graphene supercapacitors capable of achieving a high energy density comparable to what Li-ion batteries can offer, but a much higher power density. Ultra-high specific gravimetric and volumetric capacitances are achieved with highly porous graphene electrodes. Moreover, the supercapacitors assembled with graphene electrodes show excellent stability. Our results demonstrate that by synthesizing graphene materials with an ideal pore size, uniformity, and good ion accessibility, the performance of supercapacitors can be revolutionized.

preprint2013arXiv

Graphene based Supercapacitors with Improved Specific Capacitance and Fast Charging Time at High Current Density

Graphene is a promising material for energy storage, especially for high performance supercapacitors. For real time high power applications, it is critical to have high specific capacitance with fast charging time at high current density. Using a modified Hummer's method and tip sonication for graphene synthesis, here we show graphene-based supercapacitors with high stability and significantly-improved electrical double layer capacitance and energy density with fast charging and discharging time at a high current density, due to enhanced ionic electrolyte accessibility in deeper regions. The discharge capacitance and energy density values, 195 Fg-1 and 83.4 Whkg-1, are achieved at a current density of 2.5 Ag-1. The time required to discharge 64.18 Whkg-1 at 5 A/g is around 25 sec. At 7.5 Ag-1 current density, the cell can deliver a specific capacitance of about 137 Fg-1 and maintain 98 % of its initial value after 10,000 cycles, suggesting that the stable performance of supercapacitors at high current rates is suitable for fast charging-discharging applications. We attribute this superior performance to the highly porous nature of graphene prepared with minimum restacking due to crimple nature wrinkles and the improved current collecting method.

preprint2013arXiv

Oxygen-Vacancy-Induced Antiferromagnetism to Ferromagnetism Transformation in Multiferroic Thin Films

Oxygen vacancies (VOs) effects on magnetic ordering in Eu0.5Ba0.5TiO3-δ (EBTO3-δ) thin films have been investigated using a combination of experimental measurements and first-principles density-functional calculations. Two kinds of EBTO3-δthin films with different oxygen deficiency have been fabricated. A nuclear resonance backscattering spectrometry technique has been used to quantitatively measure contents of the VOs. Eu0.5Ba0.5TiO3 ceramics have been known to exhibit ferroelectric (FE) and G-type antiferromagnetic (AFM) properties. While, a ferromagnetic (FM) behavior with a Curie temperature of 1.85 K has been found in the EBTO3-δ thin films. Spin-polarized Ti3+ ions, which originated from the VOs, has been proven to mediate a FM coupling between the local Eu 4f spins and were believed to be responsible for the great change of the magnetic ordering. Our work opens up a new avenue for developing FM-FE materials by manipulating the oxygen deficiency in AFM-FE multiferroics.

preprint2013arXiv

Predicting Intermediate Storage Performance for Workflow Applications

Configuring a storage system to better serve an application is a challenging task complicated by a multidimensional, discrete configuration space and the high cost of space exploration (e.g., by running the application with different storage configurations). To enable selecting the best configuration in a reasonable time, we design an end-to-end performance prediction mechanism that estimates the turn-around time of an application using storage system under a given configuration. This approach focuses on a generic object-based storage system design, supports exploring the impact of optimizations targeting workflow applications (e.g., various data placement schemes) in addition to other, more traditional, configuration knobs (e.g., stripe size or replication level), and models the system operation at data-chunk and control message level. This paper presents our experience to date with designing and using this prediction mechanism. We evaluate this mechanism using micro- as well as synthetic benchmarks mimicking real workflow applications, and a real application.. A preliminary evaluation shows that we are on a good track to meet our objectives: it can scale to model a workflow application run on an entire cluster while offering an over 200x speedup factor (normalized by resource) compared to running the actual application, and can achieve, in the limited number of scenarios we study, a prediction accuracy that enables identifying the best storage system configuration.

preprint2013arXiv

The Case for Cross-Layer Optimizations in Storage: A Workflow-Optimized Storage System

This paper proposes using file system custom metadata as a bidirectional communication channel between applications and the storage system. This channel can be used to pass hints that enable cross-layer optimizations, an option hindered today by the ossified file-system interface. We study this approach in context of storage system support for large-scale workflow execution systems: Our workflow optimized storage system (WOSS), exploits application hints to provide per-file optimized operations, and exposes data location to enable location-aware scheduling. This paper argues that an incremental adoption path for adopting cross-layer optimizations in storage systems exists, presents the system architecture for a workflow-optimized storage system and its integration with a workflow runtime engine, and evaluates the proposed approach using synthetic as well as real applications workloads.

preprint2012arXiv

A Far-ultraviolet Atlas of Low-resolution Hubble Space Telescope Spectra of T Tauri Stars

We present a far-ultraviolet (FUV) spectral atlas consisting of spectra of 91 pre-main sequence stars. Most stars in this sample were observed with the Space Telescope Imaging Spectrograph (STIS) and Advanced Camera for Surveys (ACS) on the \emph{Hubble Space Telescope} (\emph{HST}). We find strong correlations among the \ion{O}{1} $λ$1304 triplet, %\ion{C}{2} $λ$1335, the \ion{Si}{4} $λλ$1394/1403 doublet, the \ion{C}{4} $λ$1549 doublet, and the \ion{He}{2} $λ$1640 line luminosities. For classical T Tauri stars (CTTSs), we also find strong correlations between these lines and the accretion luminosity, suggesting that these lines form in processes related to accretion. These FUV line fluxes and X-ray luminosity correlate loosely with large scatters. The FUV emission also correlates well with H$α$, H$β$, and \ion{Ca}{2} K line luminosities. These correlations between FUV and optical diagostics can be used to obtain rough estimates of FUV line fluxes from optical observations. Molecular hydrogen (H$_{2}$) emission is generally present in the spectra of actively accreting CTTSs but not the weak-lined T Tauri stars (WTTSs) that are not accreting. The presence of H$_2$ emission in the spectrum of HD 98800 N suggests that the disk should be classified as actively accreting rather than a debris disk. The spectra in the atlas are available at http://archive.stsci.edu/prepds/ttauriatlas.

preprint2011arXiv

HST/COS Spectra of DF Tau and V4046 SGR: First Detection of Molecular Hydrogen Absorption Against the Ly$α$ Emission Line

We report the first detection of molecular hydrogen (H$_{2}$) absorption in the Lyman-$α$ emission line profiles of two classical T Tauri stars (CTTSs), DF Tau and V4046 Sgr, observed by \emph{HST}/COS. This absorption is the energy source for many of the Lyman-band H$_{2}$ fluorescent lines commonly seen in the far-ultraviolet spectra of CTTSs. We find that the absorbed energy in the H$_{2}$ pumping transitions from a portion of the Lyman-$α$ line significantly differ from the amount of energy in the resulting fluorescent emission. By assuming additional absorption in the H I Lyman-$α$ profile along our light of sight, we can correct the H$_{2}$ absorption/emission ratios so that they are close to unity. The required H I absorption for DF Tau is at a velocity close to the radial velocity of the star, consistent with H I absorption in the edge-on disk and interstellar medium. For V4046 Sgr, a nearly face-on system, the required absorption is between +100 km/s and +290 km/s, most likely resulting from H I gas in the accretion columns falling onto the star.

preprint2011arXiv

Magnetic-field measurements of T Tauri stars in the Orion Nebula cluster

We present an analysis of high-resolution ($R \sim 50,000$) infrared K-band echelle spectra of 14 T Tauri stars in the Orion Nebula Cluster. We model Zeeman broadening in three magnetically sensitive \ion{Ti}{1} lines near $2.2\ μ$m and consistently detect kilogauss-level magnetic fields in the stellar photospheres. The data are consistent in each case with the entire stellar surface being covered with magnetic fields, suggesting that magnetic pressure likely dominates over gas pressure in the photospheres of these stars. These very strong magnetic fields might themselves be responsible for the underproduction of X-ray emission of T Tauri stars relative to what is expected based on main-sequence star calibrations. We combine these results with previous measurements of 14 stars in Taurus and 5 stars in the TW Hydrae association to study the potential variation of magnetic-field properties during the first 10 million years of stellar evolution, finding a steady decline in total magnetic flux with age.

preprint2011arXiv

NUV Excess in Slowly Accreting T Tauri Stars: Limits Imposed by Chromospheric Emission

Young stars surrounded by disks with very low mass accretion rates are likely in the final stages of inner disk evolution and therefore particularly interesting to study. We present ultraviolet (UV) observations of the ~5-9 Myr old stars RECX-1 and RECX-11, obtained with the Cosmic Origins Spectrograph (COS) and Space Telescope Imaging Spectrograph (STIS) on the Hubble Space Telescope (HST), as well as optical and near infrared spectroscopic observations. The two stars have similar levels of near UV emission, although spectroscopic evidence indicates that RECX-11 is accreting and RECX-1 is not. The line profiles of Halpha and He I 10830 in RECX-11 show both broad and narrow redshifted absorption components that vary with time, revealing the complexity of the accretion flows. We show that accretion indicators commonly used to measure mass accretion rates, e.g. U band excess luminosity or the Ca II triplet line luminosity are unreliable for low accretors, at least in the middle K spectral range. Using RECX-1 as a template for the intrinsic level of photospheric and chromospheric emission, we determine an upper limit of 3 x 10^-10 solar masses/ year for RECX-11. At this low accretion rate, recent photoevaporation models predict that an inner hole should have developed in the disk. However, the spectral energy distribution of RECX-11 shows fluxes comparable to the median of Taurus in the near infrared, indicating that substantial dust remains. Fluorescent H_2 emission lines formed in the innermost disk are observed in RECX-11, showing that gas is present in the inner disk, along with the dust.

preprint2011arXiv

Optical tuning and ultrafast dynamics of high-temperature superconducting terahertz metamaterials

Through the integration of semiconductors or complex oxides into metal resonators, tunable metamaterials have been achieved by a change of environment using an external stimulus. Metals provide high conductivity to realize a strong resonant response in metamaterials; however, they contribute very little to the tunability. The complex conductivity in high-temperature superconducting films is highly sensitive to external perturbations, which provides new opportunities in achieving tunable metamaterials resulting directly from the resonant elements. Here we demonstrate ultrafast dynamical tuning of resonance in the terahertz (THz) frequency range in YBa_2Cu_3O_7-δ(YBCO) split-ring resonator arrays excited by near-infrared femtosecond laser pulses. The photoexcitation breaks the superconducting Cooper pairs to create the quasiparticle state. This dramatically modifies the imaginary part of the complex conductivity and consequently the metamaterial resonance in an ultrafast timescale. We observed resonance switching accompanied with a wide range frequency tuning as a function of photoexcitation fluence, which also strongly depend on the nano-scale thickness of the superconducting films. All of our experimental results are well reproduced through calculations using an analytical model, which takes into account the SRR resistance and kinetic inductance contributed from the complex conductivity of YBCO films. The theoretical calculations reveal that the increasing SRR resistance upon increasing photoexcitation fluence is responsible for the reduction of resonance strength, and both the resistance and kinetic inductance contribute to the tuning of resonance frequency.

preprint2010arXiv

Observations of Mass Loss from the Transiting Exoplanet HD 209458b

Using the new Cosmic Origins Spectrograph (COS) on the {\it Hubble Space Telescope (HST)}, we obtained moderate-resolution, high signal/noise ultraviolet spectra of HD 209458 and its exoplanet HD 209458b during transit, both orbital quadratures, and secondary eclipse. We compare transit spectra with spectra obtained at non-transit phases to identify spectral features due to the exoplanet's expanding atmosphere. We find that the mean flux decreased by $7.8\pm 1.3$% for the C II 1334.5323Å and 1335.6854Å lines and by $8.2\pm 1.4$% for the Si III 1206.500Å line during transit compared to non-transit times in the velocity interval --50 to +50 km s$^{-1}$. Comparison of the C II and Si III line depths and transit/non-transit line ratios shows deeper absorption features near --10 and +15 km s$^{-1}$ and less certain features near --40 and +30--70 km s$^{-1}$, but future observations are needed to verify this first detection of velocity structure in the expanding atmosphere of an exoplanet. Our results for the C II lines and the non-detection of Si IV 1394.76Å absorption are in agreement with \citet{Vidal-Madjar2004}, but we find absorption during transit in the Si III line contrary to the earlier result. The $8\pm 1$% obscuration of the star during transit is far larger than the 1.5% obscuration by the exoplanet's disk. Absorption during transit at velocities between --50 and +50 km s$^{-1}$ in the C II and Si III lines requires high-velocity ion absorbers, but models that assume that the absorbers are high-temperature thermal ions are inconsistent with the COS spectra. Assuming hydrodynamic model values for the gas temperature and outflow velocity at the limb of the outflow as seen in the C II lines, we find mass-loss rates in the range (8--40)$\times 10^{10}$ g s$^{-1}$.

preprint2010arXiv

Searching for Far-Ultraviolet Auroral/Dayglow Emission from HD209458b

We present recent observations from the HST-Cosmic Origins Spectrograph aimed at characterizing the auroral emission from the extrasolar planet HD209458b. We obtained medium-resolution (R~18-20,000) far-ultraviolet (1150-1700A) spectra at both the Phase 0.25 and Phase 0.75 quadrature positions as well as a stellar baseline measurement at secondary eclipse. This analysis includes a catalog of stellar emission lines and a star-subtracted spectrum of the planet. We present an emission model for planetary H2 emission, and compare this model to the planetary spectrum. No unambiguously identifiable atomic or molecular features are detected, and upper limits are presented for auroral/dayglow line strengths. An orbital velocity cross-correlation analysis finds a statistically significant (3.8 sigma) feature at +15 (+/- 20) km/s in the rest frame of the planet, at 1582 A. This feature is consistent with emission from H2 B-X (2-9) P(4) (lambda_{rest} = 1581.11 A), however the physical mechanism required to excite this transition is unclear. We compare limits on relative line strengths seen in the exoplanet spectrum with models of ultraviolet fluorescence to constrain the atmospheric column density of neutral hydrogen between the star and the planetary surface. These results support models of short period extrasolar giant planets with weak magnetic fields and extended atomic atmospheres.

preprint2010arXiv

The Far-Ultraviolet "Continuum" in Protoplanetary Disk Systems I: Electron-Impact H2 and Accretion Shocks

We present deep spectroscopic observations of the classical T Tauri stars DF Tau and V4046 Sgr in order to better characterize two important sources of far-ultraviolet continuum emission in protoplanetary disks. These new Hubble Space Telescope-Cosmic Origins Spectrograph observations reveal a combination of line and continuum emission from collisionally excited H2 and emission from accretion shocks. H2 is the dominant emission in the 1400-1650 A band spectrum of V4046 Sgr, while an accretion continuum contributes strongly across the far-ultraviolet spectrum of DF Tau. We compare the spectrum of V4046 Sgr to models of electron-impact induced H2 emission to constrain the physical properties of the emitting region, after making corrections for attenuation within the disk. We find reasonable agreement with the broad spectral characteristics of the H2 model, implying N(H2) ~ 10^{18} cm^{-2}, T(H2) = 3000^{+1000}_{-500} K, and a characteristic electron energy in the range of ~ 50 - 100 eV. We propose that self-absorption and hydrocarbons provide the dominant attenuation for H2 line photons originating within the disk. For both DF Tau and V4046 Sgr, we find that a linear fit to the far-UV data can reproduce near-UV/optical accretion spectra. We discuss outstanding issues concerning how these processes operate in protostellar/protoplanetary disks, including the effective temperature and absolute strength of the radiation field in low-mass protoplanetary environments. We find that the 912-2000A continuum in low-mass systems has an effective temperature of ~10^{4} K with fluxes 10^{5-7} times the interstellar level at 1 AU.

preprint2010arXiv

Tuning the Resonance in High Temperature Superconducting Terahertz Metamaterials

In this Letter we present resonance properties in terahertz metamaterials consisting of a split-ring resonator array made from high temperature superconducting films. By varying the temperature, we observed efficient metamaterial resonance switching and frequency tuning with some features not revealed before. The results were well reproduced by numerical simulations of metamaterial resonance using the experimentally measured complex conductivity of the superconducting film. We developed a theoretical model that explains the tuning features, which takes into account the resistive resonance damping and additional split-ring inductance contributed from both the real and imaginary parts of the temperature-dependent complex conductivity. The theoretical model further predicted more efficient resonance switching and frequency shifting in metamaterials consisting of a thinner superconducting split-ring resonator array, which were also verified in experiments.

Hao Yang

What is connected

Connect this record

See the researcher in context

Building this map preview

73 published item(s)

Judge, Then Drive: A Critic-Centric Vision Language Action Framework for Autonomous Driving

ViSRA: A Video-based Spatial Reasoning Agent for Multi-modal Large Language Models

MLIP: Medical Language-Image Pre-training with Masked Local Representation Learning

A GNSS Aided Initial Alignment Method for MEMS-IMU Based on Backtracking Algorithm and Backward Filtering

Accessibility of SPDEs driven by pure jump noise and its applications

Boosting 3D Object Detection via Object-Focused Image Fusion

ComplETR: Reducing the cost of annotations for object detection in dense scenes with vision transformers

Exploring Entity Interactions for Few-Shot Relation Learning (Student Abstract)

Forecast-based Multi-aspect Framework for Multivariate Time-series Anomaly Detection

General Facial Representation Learning in a Visual-Linguistic Manner

Gravitational-Wave Signatures of Chiral-Symmetric Technicolor

High sensitivity air-coupled MHz frequency ultrasound detection using on-chip microcavities

Instance-wise Prompt Tuning for Pretrained Language Models

K-Detector: Identifying Duplicate Crash Failures in Large-Scale Software Delivery

Large-Scale Pre-training for Person Re-identification with Noisy Labels

LogStamp: Automatic Online Log Parsing Based on Sequence Labelling

M-Adapter: Modality Adaptation for End-to-End Speech-to-Text Translation

MACSA: A Multimodal Aspect-Category Sentiment Analysis Dataset with Multimodal Fine-grained Aligned Annotations

New criterions on nonexistence of periodic orbits of planar dynamical systems and their applications

NLO QCD corrections to pseudoscalar quarkonium production with two heavy flavors in photon-photon collision

Omni-DETR: Omni-Supervised Object Detection with Transformers

Prompt Tuning for Generative Multimodal Pretrained Models

Quantum phase transition in magnetic nanographenes on a lead superconductor

Real-Time Neural Character Rendering with Pose-Guided Multiplane Images

Remarkably Enhanced Dynamic Oxygen Migration on Graphene Oxide Supported by Copper Substrate

Rethinking Feature Uncertainty in Stochastic Neural Networks for Adversarial Robustness

Text Steganalysis with Attentional LSTM-CNN

A coarse-to-fine framework for unsupervised multi-contrast MR image deformable registration with dual consistency constraint

Diformer: Directional Transformer for Neural Machine Translation

Gaussian State-Based Quantum Illumination with Simple Photodetection

Large-Scale Training System for 100-Million Classification at Alibaba

Quasinormal Modes and Thermodynamics of Regular Black Holes

Adversarial Light Projection Attacks on Face Recognition Systems: A Feasibility Study

Application of light diffraction theory to qualify the downstream light field modulation property of mitigated KDP crystals

Beating Attackers At Their Own Games: Adversarial Example Detection Using Adversarial Gradient Directions

Category-Specific CNN for Visual-aware CTR Prediction at JD.com

Eco-Driving at Signalized Intersections: A Multiple Signal Optimization Approach

Face X-ray for More General Face Forgery Detection

FaceShifter: Towards High Fidelity And Occlusion Aware Face Swapping

Fashion Recommendation and Compatibility Prediction Using Relational Network

FCEM: A Novel Fast Correlation Extract Model For Real Time Steganalysis of VoIP Stream via Multi-head Attention

Feedback Graph Convolutional Network for Skeleton-based Action Recognition

GroupIM: A Mutual Information Maximization Framework for Neural Group Recommendation

NLO QCD corrections to $B_c$-pair production in photon-photon collision

NLO QCD corrections to $J/ψ$ pair production in photon-photon collision

Rethinking the Hyperparameters for Fine-tuning

Transfer Learning via Contextual Invariants for One-to-Many Cross-Domain Recommendation

CLCI-Net: Cross-Level fusion and Context Inference Networks for Lesion Segmentation of Chronic Stroke

Fast Steganalysis Method for VoIP Streams

NLO QCD Corrections to Inclusive Charmonium and $B_c$ Meson Production in $W^+$ Decays

X-Net: Brain Stroke Lesion Segmentation Based on Depthwise Separable Convolution and Long-range Dependencies

$Extrasolar~Storms$: Pressure-dependent Changes In Light Curve Phase In Brown Dwarfs From Simultaneous $Hubble$ and $Spitzer$ Observations

A Comparative Study of Object Trackers for Infrared Flying Bird Tracking

Cloud Atlas: Discovery of Patchy Clouds and High-amplitude Rotational Modulations In a Young, Extremely Red L-type Brown Dwarf

Exploit Bounding Box Annotations for Multi-label Object Recognition

Improving Multi-label Learning with Missing Labels by Structured Semantic Correlations

Weyl semimetals as catalysts

A Parallel Way to Select the Parameters of SVM Based on the Ant Optimization Algorithm

HST Rotational Spectral Mapping of Two L-Type Brown Dwarfs: Variability In and Out of Water Bands Indicates High-Altitude Haze Layers

Achieving Both High Power and Energy Density in Electrochemical Supercapacitors with Nanoporous Graphene Materials

Graphene based Supercapacitors with Improved Specific Capacitance and Fast Charging Time at High Current Density

Oxygen-Vacancy-Induced Antiferromagnetism to Ferromagnetism Transformation in Multiferroic Thin Films

Predicting Intermediate Storage Performance for Workflow Applications

The Case for Cross-Layer Optimizations in Storage: A Workflow-Optimized Storage System

A Far-ultraviolet Atlas of Low-resolution Hubble Space Telescope Spectra of T Tauri Stars

HST/COS Spectra of DF Tau and V4046 SGR: First Detection of Molecular Hydrogen Absorption Against the Ly$α$ Emission Line

Magnetic-field measurements of T Tauri stars in the Orion Nebula cluster

NUV Excess in Slowly Accreting T Tauri Stars: Limits Imposed by Chromospheric Emission

Optical tuning and ultrafast dynamics of high-temperature superconducting terahertz metamaterials

Observations of Mass Loss from the Transiting Exoplanet HD 209458b

Searching for Far-Ultraviolet Auroral/Dayglow Emission from HD209458b

The Far-Ultraviolet "Continuum" in Protoplanetary Disk Systems I: Electron-Impact H2 and Accretion Shocks

Tuning the Resonance in High Temperature Superconducting Terahertz Metamaterials