Source author record

Yi Zhu

Yi Zhu appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher, unclaimed source record

Catalog footprint

What is connected

68 works

41 topics

4 close collaborators

preprint 2026 arXiv

Acting Flatterers via LLMs Sycophancy: Combating Clickbait with LLMs Opposing-Stance Reasoning

The widespread proliferation of online content has intensified concerns about clickbait, deceptive or exaggerated headlines designed to attract attention. While Large Language Models (LLMs) offer a promising avenue for addressing this issue, their effectiveness is often hindered by Sycophancy, a tendency to produce reasoning that matches users' beliefs over truthful ones, which deviates from instruction-following principles. Rather than treating sycophancy as a flaw to be eliminated, this work proposes a novel approach that initially harnesses this behavior to generate contrastive reasoning from opposing perspectives. Specifically, we design a Self-renewal Opposing-stance Reasoning Generation (SORG) framework that prompts LLMs to produce high-quality agree and disagree reasoning pairs for a given news title without requiring ground-truth labels. To utilize the generated reasoning, we develop a local Opposing Reasoning-based Clickbait Detection (ORCD) model that integrates three BERT encoders to represent the title and its associated reasoning. The model leverages contrastive learning, guided by soft labels derived from LLM-generated credibility scores, to enhance detection robustness. Experimental evaluations on three benchmark datasets demonstrate that our method consistently outperforms LLM prompting, fine-tuned smaller language models, and state-of-the-art clickbait detection baselines.

preprint 2026 arXiv

Alethia: A Foundational Encoder for Voice Deepfakes

Existing voice deepfake detection and localization models rely heavily on representations extracted from speech foundation models (SFMs). However, downstream finetuning has now reached a state of diminishing returns. In this paper, we shift the focus to pretraining and propose a novel recipe that combines bottleneck masked embedding prediction with flow-matching based spectrogram reconstruction. The outcome, Alethia, is the first foundational audio encoder for various voice deepfake detection and localization tasks. We evaluate on $5$ different tasks with $56$ benchmark datasets, and note Alethia significantly outperforms state-of-the-art SFMs with superior robustness to real-world perturbations and zero-shot generalization to unseen domains (e.g., singing deepfakes). We also demonstrate the limitation of discrete targets in masked token prediction, and show the importance of continuous embedding prediction and generative pretraining for capturing deepfake artifacts.

preprint 2023 arXiv

MixGen: A New Multi-Modal Data Augmentation

Data augmentation is a necessity to enhance data efficiency in deep learning. For vision-language pre-training, data is only augmented either for images or for text in previous works. In this paper, we present MixGen: a joint data augmentation for vision-language representation learning to further improve data efficiency. It generates new image-text pairs with semantic relationships preserved by interpolating images and concatenating text. It is simple and can be plugged into existing pipelines. We evaluate MixGen on four architectures, including CLIP, ViLT, ALBEF and TCL, across five downstream vision-language tasks to show its versatility and effectiveness. For example, adding MixGen in ALBEF pre-training leads to absolute performance improvements on downstream tasks: image-text retrieval (+6.2% on COCO fine-tuned and +5.3% on Flickr30K zero-shot), visual grounding (+0.9% on RefCOCO+), visual reasoning (+0.9% on NLVR2), visual question answering (+0.3% on VQA2.0), and visual entailment (+0.4% on SNLI-VE).
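The augmentation described in this abstract (interpolate two images, concatenate their captions) can be illustrated with a minimal sketch. The function name `mixgen`, the nested-list image layout, and the default mixing weight `lam=0.5` are assumptions made for illustration; they are not taken from the abstract or from the authors' code.

```python
def mixgen(image_a, image_b, text_a, text_b, lam=0.5):
    """Blend two same-sized images pixel by pixel and join their captions.

    Images are nested lists of floats (rows of pixel values); this layout
    and the default lam=0.5 are illustrative assumptions.
    """
    if len(image_a) != len(image_b) or any(
        len(row_a) != len(row_b) for row_a, row_b in zip(image_a, image_b)
    ):
        raise ValueError("images must have the same shape")

    # Linear interpolation of the two images.
    mixed_image = [
        [lam * pa + (1.0 - lam) * pb for pa, pb in zip(row_a, row_b)]
        for row_a, row_b in zip(image_a, image_b)
    ]
    # Concatenation of the two captions.
    mixed_text = f"{text_a} {text_b}"
    return mixed_image, mixed_text


if __name__ == "__main__":
    img, txt = mixgen([[0.0, 1.0]], [[1.0, 0.0]], "a dog", "a cat")
    print(img, txt)
```

In a real pipeline the images would be tensors and the mixing would be applied per batch; the sketch only shows the pairing of the two operations.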

preprint 2022 arXiv

ADAPT: Vision-Language Navigation with Modality-Aligned Action Prompts

Vision-Language Navigation (VLN) is a challenging task that requires an embodied agent to perform action-level modality alignment, i.e., make instruction-asked actions sequentially in complex visual environments. Most existing VLN agents learn the instruction-path data directly and cannot sufficiently explore action-level alignment knowledge inside the multi-modal inputs. In this paper, we propose modAlity-aligneD Action PrompTs (ADAPT), which provides the VLN agent with action prompts to enable the explicit learning of action-level modality alignment to pursue successful navigation. Specifically, an action prompt is defined as a modality-aligned pair of an image sub-prompt and a text sub-prompt, where the former is a single-view observation and the latter is a phrase like "walk past the chair". When starting navigation, the instruction-related action prompt set is retrieved from a pre-built action prompt base and passed through a prompt encoder to obtain the prompt feature. Then the prompt feature is concatenated with the original instruction feature and fed to a multi-layer transformer for action prediction. To collect high-quality action prompts into the prompt base, we use the Contrastive Language-Image Pretraining (CLIP) model which has powerful cross-modality alignment ability. A modality alignment loss and a sequential consistency loss are further introduced to enhance the alignment of the action prompt and enforce the agent to focus on the related prompt sequentially. Experimental results on both R2R and RxR show the superiority of ADAPT over state-of-the-art methods.

preprint 2022 arXiv

BigDetection: A Large-scale Benchmark for Improved Object Detector Pre-training

Multiple datasets and open challenges for object detection have been introduced in recent years. To build more general and powerful object detection systems, in this paper, we construct a new large-scale benchmark termed BigDetection. Our goal is to simply leverage the training data from existing datasets (LVIS, OpenImages and Objects365) with carefully designed principles, and curate a larger dataset for improved detector pre-training. Specifically, we generate a new taxonomy which unifies the heterogeneous label spaces from different sources. Our BigDetection dataset has 600 object categories and contains over 3.4M training images with 36M bounding boxes. It is much larger in multiple dimensions than previous benchmarks, which offers both opportunities and challenges. Extensive experiments demonstrate its validity as a new benchmark for evaluating different object detection methods, and its effectiveness as a pre-training dataset.

preprint 2022 arXiv

Building Robust Spoken Language Understanding by Cross Attention between Phoneme Sequence and ASR Hypothesis

Building Spoken Language Understanding (SLU) robust to Automatic Speech Recognition (ASR) errors is an essential issue for various voice-enabled virtual assistants. Considering that most ASR errors are caused by phonetic confusion between similar-sounding expressions, intuitively, leveraging the phoneme sequence of speech can complement the ASR hypothesis and enhance the robustness of SLU. This paper proposes a novel model with Cross Attention for SLU (denoted as CASLU). The cross attention block is devised to capture the fine-grained interactions between phoneme and word embeddings, so that the joint representations capture the phonetic and semantic features of the input simultaneously and overcome ASR errors in downstream natural language understanding (NLU) tasks. Extensive experiments are conducted on three datasets, showing the effectiveness and competitiveness of our approach. Additionally, we validate the universality of CASLU and demonstrate its complementarity when combined with other robust SLU techniques.

preprint 2022 arXiv

Chinese Idiom Paraphrasing

Idioms are a kind of idiomatic expression in Chinese, most of which consist of four Chinese characters. Because of their non-compositionality and metaphorical meaning, Chinese idioms are hard for children and non-native speakers to understand. This study proposes a novel task, denoted as Chinese Idiom Paraphrasing (CIP). CIP aims to rephrase sentences containing idioms into non-idiomatic ones while preserving the original sentence's meaning. Since sentences without idioms are more easily handled by Chinese NLP systems, CIP can be used to pre-process Chinese datasets, thereby facilitating and improving the performance of Chinese NLP tasks, e.g., machine translation, Chinese idiom cloze, and Chinese idiom embeddings. In this study, the CIP task is treated as a special paraphrase generation task. To circumvent difficulties in acquiring annotations, we first establish a large-scale CIP dataset based on human and machine collaboration, which consists of 115,530 sentence pairs. We further deploy three baselines and two novel CIP approaches to deal with CIP problems. The results show that the proposed methods perform better than the baselines on the established CIP dataset.

preprint 2022 arXiv

Gated Multimodal Fusion with Contrastive Learning for Turn-taking Prediction in Human-robot Dialogue

Turn-taking, aiming to decide when the next speaker can start talking, is an essential component in building human-robot spoken dialogue systems. Previous studies indicate that multimodal cues can facilitate this challenging task. However, due to the paucity of public multimodal datasets, current methods are mostly limited to either utilizing unimodal features or simplistic multimodal ensemble models. Besides, the inherent class imbalance in real scenarios, e.g., a sentence ending with a short pause will mostly be regarded as the end of a turn, also poses a great challenge to the turn-taking decision. In this paper, we first collect a large-scale annotated corpus for turn-taking with over 5,000 real human-robot dialogues in speech and text modalities. Then, a novel gated multimodal fusion mechanism is devised to utilize various information seamlessly for turn-taking prediction. More importantly, to tackle the data imbalance issue, we design a simple yet effective data augmentation method to construct negative instances without supervision and apply contrastive learning to obtain better feature representations. Extensive experiments are conducted and the results demonstrate the superiority and competitiveness of our model over several state-of-the-art baselines.

preprint 2022 arXiv

Global classical solutions of 3D compressible viscoelastic system near equilibrium

In this paper, we prove the global existence of general small solutions to the compressible viscoelastic system. We remove the "initial state" assumption ($\tilde{\rho}_0 \det F_0 = 1$) and the "div-curl" structure assumption made in previous works. This broadens the class of solutions considerably: the initial density need not be constant, and no further structure is needed for global well-posedness. This is quite different from the elasticity system, in which structure plays an important role. Since we cannot obtain any dissipation information for the density and the deformation tensor, we introduce a new effective flux, with the idea of regarding the wildest "nonlinear term" as a "linear term". Although the norms of the solution may now increase, we can still derive global existence.

preprint 2022 arXiv

Global well-posedness for 2D non-resistive compressible MHD system in periodic domain

This paper focuses on the 2D compressible magnetohydrodynamic (MHD) equations without magnetic diffusion in a periodic domain. We present a systematic approach to establishing the global existence of smooth solutions when the initial data is close to a background magnetic field. In addition, stability and large-time decay rates are also obtained. When there is no magnetic diffusion, the magnetic field and the density are governed by forced transport equations and the problem considered here is difficult. This paper implements several key observations and ideas to maximize the enhanced dissipation due to hidden structures and interactions. In particular, the weak smoothing and stabilization generated by the background magnetic field and the extra regularization in the divergence part of the velocity field are fully exploited. Compared with previous works, this paper appears to be the first to investigate such a system on bounded domains and the first to solve this problem by pure energy estimates, which help reduce the complexity found in other approaches. In addition, this paper combines the well-posedness with the precise large-time behavior, a strategy that can be extended to higher dimensions.

preprint 2022 arXiv

ImpDet: Exploring Implicit Fields for 3D Object Detection

Conventional 3D object detection approaches concentrate on bounding boxes representation learning with several parameters, i.e., localization, dimension, and orientation. Despite its popularity and universality, such a straightforward paradigm is sensitive to slight numerical deviations, especially in localization. By exploiting the property that point clouds are naturally captured on the surface of objects along with accurate location and intensity information, we introduce a new perspective that views bounding box regression as an implicit function. This leads to our proposed framework, termed Implicit Detection or ImpDet, which leverages implicit field learning for 3D object detection. Our ImpDet assigns specific values to points in different local 3D spaces, thereby high-quality boundaries can be generated by classifying points inside or outside the boundary. To solve the problem of sparsity on the object surface, we further present a simple yet efficient virtual sampling strategy to not only fill the empty region, but also learn rich semantic features to help refine the boundaries. Extensive experimental results on KITTI and Waymo benchmarks demonstrate the effectiveness and robustness of unifying implicit fields into object detection.

preprint 2022 arXiv

Large-scale Dynamics of Winds Driven by Line Force from a Thin Accretion Disk

Winds play a significant role in the active galactic nuclei feedback process. Previous simulations studying winds only focus on a small dynamical range. Therefore, it is unknown how far the winds can go and what the properties of the winds will be if they can move to large radii. We perform simulations to study the large-scale dynamics of winds driven by line force. We find that the properties of the winds depend on both black hole mass ($M_{BH}$) and accretion disk luminosity. When the accretion disk luminosity is $0.6L_{edd}$ ($L_{edd}$ being the Eddington luminosity), independent of $M_{BH}$, the winds have kinetic energy flux exceeding $1\% L_{edd}$ and can escape from the black hole potential. For the case with the accretion disk luminosity equaling $0.3L_{edd}$, the strength of the winds decreases with decreasing $M_{BH}$. If $M_{BH}$ decreases from $10^9$ to $10^6$ solar masses ($M_\odot$), the kinetic energy flux of the winds decreases from $\sim 0.01 L_{edd}$ to $\sim 10^{-6} L_{edd}$. In the case of $M_{BH}\geq 10^7 M_\odot$, the winds can escape from the black hole potential. In the case of $M_{BH}=10^6 M_\odot$, the winds cannot escape. We find that for the ultra-fast winds observed in hard X-ray bands (Gofford et al. 2015), the observed dependence of the mass flux and the kinetic energy flux on accretion disk luminosity can be well reproduced by the line-force-driven wind model. We also find that the properties of the ultra-fast winds observed in soft X-ray bands can be explained by the line-force-driven wind model.
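To give a sense of scale for the fractions of $L_{edd}$ quoted in this abstract, the following sketch converts them to erg/s. It assumes the standard textbook value of the Eddington luminosity for ionized hydrogen, roughly $1.26\times10^{38}$ erg/s per solar mass; that constant and the helper names are not taken from the abstract.

```python
# Assumed textbook constant: Eddington luminosity per solar mass for
# ionized hydrogen, in erg/s. Not stated in the abstract.
L_EDD_PER_SOLAR_MASS = 1.26e38


def eddington_luminosity(mass_solar):
    """Eddington luminosity in erg/s for a black hole mass in solar masses."""
    return L_EDD_PER_SOLAR_MASS * mass_solar


def kinetic_energy_flux(mass_solar, fraction_of_l_edd):
    """Wind kinetic energy flux in erg/s, given as a fraction of L_edd."""
    return fraction_of_l_edd * eddington_luminosity(mass_solar)


if __name__ == "__main__":
    # The two endpoints quoted for a disk luminosity of 0.3 L_edd.
    for mass, fraction in [(1e9, 1e-2), (1e6, 1e-6)]:
        flux = kinetic_energy_flux(mass, fraction)
        print(f"M_BH = {mass:.0e} M_sun: {flux:.2e} erg/s")
```

Under this assumption the quoted endpoints differ by about seven orders of magnitude in absolute power, because both the fraction and $L_{edd}$ itself drop with $M_{BH}$.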

preprint 2022 arXiv

NTIRE 2022 Challenge on Efficient Super-Resolution: Methods and Results

This paper reviews the NTIRE 2022 challenge on efficient single image super-resolution with a focus on the proposed solutions and results. The task of the challenge was to super-resolve an input image with a magnification factor of $\times$4 based on pairs of low and corresponding high resolution images. The aim was to design a network for single image super-resolution that improved efficiency, measured according to several metrics including runtime, parameters, FLOPs, activations, and memory consumption, while at least maintaining a PSNR of 29.00dB on the DIV2K validation set. IMDN is set as the baseline for efficiency measurement. The challenge had 3 tracks: the main track (runtime), sub-track one (model complexity), and sub-track two (overall performance). In the main track, the practical runtime performance of the submissions was evaluated. The ranking of the teams was determined directly by the absolute value of the average runtime on the validation set and test set. In sub-track one, the number of parameters and FLOPs were considered, and the individual rankings of the two metrics were summed up to determine a final ranking in this track. In sub-track two, all five metrics mentioned in the description of the challenge, namely runtime, parameter count, FLOPs, activations, and memory consumption, were considered. Similar to sub-track one, the rankings of the five metrics were summed up to determine a final ranking. The challenge had 303 registered participants, and 43 teams made valid submissions. They gauge the state-of-the-art in efficient single image super-resolution.
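The rank-sum scoring described for the two sub-tracks can be sketched as follows. The team names and metric values are hypothetical, and the tie handling (tied values share the best rank) is an assumption, since the abstract does not say how ties were resolved.

```python
def rank_positions(values):
    """Return 1-based ranks where a lower value is better.

    Tied values share the best rank; this tie rule is an assumption.
    """
    ordered = sorted(values)
    return [ordered.index(v) + 1 for v in values]


def rank_sum_order(teams, metrics):
    """Sum each team's per-metric ranks and sort by the total (lower is better).

    teams:   list of team names
    metrics: dict mapping metric name -> list of values aligned with teams
    """
    totals = [0] * len(teams)
    for values in metrics.values():
        for i, rank in enumerate(rank_positions(values)):
            totals[i] += rank
    return sorted(zip(teams, totals), key=lambda pair: pair[1])


if __name__ == "__main__":
    # Hypothetical sub-track one data: parameters (M) and FLOPs (G).
    teams = ["A", "B", "C"]
    metrics = {"params": [0.5, 0.3, 0.9], "flops": [30.0, 20.0, 40.0]}
    print(rank_sum_order(teams, metrics))
```

Sub-track two would use the same function with five metric lists instead of two.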

preprint 2022 arXiv

Partial and Asymmetric Contrastive Learning for Out-of-Distribution Detection in Long-Tailed Recognition

Existing out-of-distribution (OOD) detection methods are typically benchmarked on training sets with balanced class distributions. However, in real-world applications, it is common for the training sets to have long-tailed distributions. In this work, we first demonstrate that existing OOD detection methods commonly suffer from significant performance degradation when the training set is long-tail distributed. Through analysis, we posit that this is because the models struggle to distinguish the minority tail-class in-distribution samples, from the true OOD samples, making the tail classes more prone to be falsely detected as OOD. To solve this problem, we propose Partial and Asymmetric Supervised Contrastive Learning (PASCL), which explicitly encourages the model to distinguish between tail-class in-distribution samples and OOD samples. To further boost in-distribution classification accuracy, we propose Auxiliary Branch Finetuning, which uses two separate branches of BN and classification layers for anomaly detection and in-distribution classification, respectively. The intuition is that in-distribution and OOD anomaly data have different underlying distributions. Our method outperforms previous state-of-the-art method by $1.29\%$, $1.45\%$, $0.69\%$ anomaly detection false positive rate (FPR) and $3.24\%$, $4.06\%$, $7.89\%$ in-distribution classification accuracy on CIFAR10-LT, CIFAR100-LT, and ImageNet-LT, respectively. Code and pre-trained models are available at https://github.com/amazon-research/long-tailed-ood-detection.

preprint 2022 arXiv

Pixel-level Correspondence for Self-Supervised Learning from Video

While self-supervised learning has enabled effective representation learning in the absence of labels, for vision, video remains a relatively untapped source of supervision. To address this, we propose Pixel-level Correspondence (PiCo), a method for dense contrastive learning from video. By tracking points with optical flow, we obtain a correspondence map which can be used to match local features at different points in time. We validate PiCo on standard benchmarks, outperforming self-supervised baselines on multiple dense prediction tasks, without compromising performance on image classification.

preprint 2022 arXiv

Prompt-Learning for Short Text Classification

In short text, the extremely short length, feature sparsity, and high ambiguity pose huge challenges to classification tasks. Recently, as an effective method for tuning Pre-trained Language Models for specific downstream tasks, prompt-learning has attracted a vast amount of attention and research. The main intuition behind prompt-learning is to insert a template into the input and convert the text classification task into an equivalent cloze-style task. However, most prompt-learning methods expand label words manually or only consider the class name when incorporating knowledge into cloze-style prediction, which will inevitably incur omissions and bias in short text classification tasks. In this paper, we propose a simple short text classification approach that makes use of prompt-learning based on knowledgeable expansion. Taking the special characteristics of short text into consideration, the method considers both the short text itself and the class name when expanding the label word space. Specifically, the top $N$ concepts related to the entity in the short text are retrieved from an open Knowledge Graph such as Probase, and we further refine the expanded label words by calculating the distance between the selected concepts and the class labels. Experimental results show that our approach obtains a clear improvement compared with other fine-tuning, prompt-learning, and knowledgeable prompt-tuning methods, outperforming the state-of-the-art by up to 6 Accuracy points on three well-known datasets.
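The cloze-style reformulation mentioned in this abstract can be sketched in a few lines. Everything concrete here is hypothetical: the template wording, the label words, and the `[MASK]` probabilities, which in practice would come from a pre-trained language model. Averaging the probabilities of each class's label words is one common way to score classes and is assumed here rather than taken from the paper.

```python
def build_cloze(text, template="{text} This topic is about [MASK]."):
    """Wrap a short text in a cloze template (template wording is hypothetical)."""
    return template.format(text=text)


def classify(mask_word_probs, label_words):
    """Score each class by the mean [MASK] probability of its label words.

    mask_word_probs: dict word -> probability at the [MASK] position
                     (hypothetical values; a language model would supply them)
    label_words:     dict class name -> expanded list of label words
    """
    scores = {}
    for label, words in label_words.items():
        if not words:
            raise ValueError(f"class {label!r} has no label words")
        probs = [mask_word_probs.get(word, 0.0) for word in words]
        scores[label] = sum(probs) / len(probs)
    best = max(scores, key=scores.get)
    return best, scores


if __name__ == "__main__":
    print(build_cloze("Messi scores twice"))
    label_words = {
        "sports": ["sports", "football", "athlete"],
        "business": ["business", "finance", "company"],
    }
    mask_word_probs = {"football": 0.30, "sports": 0.20, "finance": 0.05, "company": 0.10}
    print(classify(mask_word_probs, label_words))
```

The knowledge-graph retrieval and distance-based refinement described in the abstract would determine the contents of `label_words`; they are not modeled in this sketch.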

preprint 2022 arXiv

Traveling edge states in massive Dirac equations along slowly varying edges

Topologically protected wave motion has attracted considerable interest due to its novel properties and potential applications in many different fields. In this work, we study edge modes and traveling edge states via the linear Dirac equations with so-called domain wall masses. The unidirectional edge state provides a heuristic approach to more general traveling edge states through the localized behavior along slowly varying edges. We show the leading asymptotic solutions of two typical edge states that follow the circular and curved edges with small curvature by analytic and quantitative arguments.

preprint 2021 arXiv

A phase field model for mass transport with semi-permeable interfaces

In this paper, a thermodynamically consistent model for mass transfer across permeable moving interfaces is proposed by using the energy variation method. We consider a restricted diffusion problem where the flux across the interface depends on its conductance and the difference of the concentration on each side. The diffusive interface phase-field framework used here has several advantages over the sharp interface method. First of all, explicit tracking of the interface is no longer necessary. Secondly, the interfacial condition can be incorporated with a variable diffusion coefficient. A detailed asymptotic analysis confirms that the diffusive interface model converges to the existing sharp interface model as the interface thickness goes to zero. A decoupled energy stable numerical scheme is developed to solve this system efficiently. Numerical simulations first illustrate the consistency of theoretical results on the sharp interface limit. Then a convergence study and energy decay test are conducted to ensure the efficiency and stability of the numerical scheme. To illustrate the effectiveness of our phase-field approach, several examples are provided, including a study of a two-phase mass transfer problem where drops with deformable interfaces are suspended in a moving fluid.

preprint 2021 arXiv

Combining Deep Generative Models and Multi-lingual Pretraining for Semi-supervised Document Classification

Semi-supervised learning through deep generative models and multi-lingual pretraining techniques have achieved tremendous success across different areas of NLP. Nonetheless, their development has happened in isolation, while the combination of both could potentially be effective for tackling task-specific labelled data shortage. To bridge this gap, we combine semi-supervised deep generative models and multi-lingual pretraining to form a pipeline for the document classification task. Compared to strong supervised learning baselines, our semi-supervised classification framework is highly competitive and outperforms the state-of-the-art counterparts in low-resource settings across several languages.

preprint 2021 arXiv

Rapid Multi-Physics Simulation for Electro-Thermal Origami Systems

Electro-thermally actuated origami provides a novel method for creating 3-D systems with advanced morphing and functional capabilities. However, it is currently difficult to simulate the multi-physical behavior of such systems because the electro-thermal actuation and large folding deformations are highly interdependent. In this work, we introduce a rapid multi-physics simulation framework for electro-thermally actuated origami systems that can simultaneously capture: thermo-mechanically coupled actuation, inter-panel contact, heat transfer, large deformation folding, and other complex loading applied onto the origami. Comparisons with finite element models validate the proposed framework for simulating origami heat transfer with different system geometries, materials, and surrounding environments. Verification of the simulated folding behaviors against physical electro-thermal micro-origami further demonstrates the validity of the proposed model. Simulations of more complex origami patterns and a case study for origami optimization are provided as application examples to show the capability and efficiency of the model. The framework provides a novel simulation tool for analysis, design, control, and optimization of active origami systems, pushing the boundary for feasible shape morphing and functional capability.

preprint 2021 arXiv

Three-fold Weyl points in the Schrödinger operator with periodic potentials

Weyl points are degenerate points on the spectral bands at which energy bands intersect conically. They are the origins of many novel physical phenomena and have attracted much attention recently. In this paper, we investigate the existence of such points in the spectrum of the 3-dimensional Schrödinger operator $H = -\Delta + V(\mathbf{x})$ with $V(\mathbf{x})$ being in a large class of periodic potentials. Specifically, we give very general conditions on the potentials which ensure the existence of 3-fold Weyl points on the associated energy bands. Unlike 2-dimensional honeycomb structures, which possess Dirac points where two adjacent band surfaces touch each other conically, the 3-fold Weyl points are conical intersection points of two energy bands with an extra band sandwiched in between. To ensure the 3-fold and 3-dimensional conical structures, more delicate, new symmetries are required. As a consequence, new techniques combining more symmetries are used to justify the existence of such conical points under the conditions proposed. This paper provides a comprehensive proof of such 3-fold Weyl points. In particular, the role of each symmetry endowed to the potential is carefully analyzed. Our proof extends the analysis on the conical spectral points to a higher dimension and higher multiplicities. We also provide some numerical simulations on typical potentials to demonstrate our analysis.

preprint 2020 arXiv

Direct Measurement of Folding Angle and Strain Vector in Atomically thin WS$_2$ using Second Harmonic Generation

Structural engineering techniques such as local strain engineering and folding provide functional control over critical optoelectronic properties of 2D materials. Accurate monitoring of the local strain vector (both strain amplitude and direction) and folding angle in 2D materials is important to optimize device performance. Conventionally, the accurate measurement of both strain amplitude and direction requires the combined usage of multiple tools, such as atomic force microscopy (AFM), electron microscopy, Raman spectroscopy, etc. Here, we demonstrated the usage of a single tool, polarization-dependent second harmonic generation (SHG) imaging, to determine the folding angle and strain vector accurately in atomically thin tungsten disulfide (WS2). We find that trilayer WS2 folds with a folding angle of 60° show 9 times SHG enhancement due to vector superposition of SH wave vectors coming from the individual folding layers. Strain-dependent SHG quenching and enhancement are found parallel and perpendicular, respectively, to the direction of the compressive strain vector. However, despite a variation in strain angle, the total SHG remains constant, which allows us to determine the local strain vector accurately using a photoelastic approach. We also demonstrate that the band-nesting induced transition (C peak) can highly enhance SHG, which can be significantly modulated by strain. Our results would pave the way to enable novel applications of the TMDs in nonlinear optical devices.

preprint 2020 arXiv

Global existence in critical spaces for non Newtonian compressible viscoelastic flows

We are interested in the multi-dimensional compressible viscoelastic flows of Oldroyd type, which are among the non-Newtonian fluids exhibiting elastic behavior. In order to capture the damping effect of the additional deformation tensor, to the best of our knowledge, the "div-curl" structural condition plays a key role in previous efforts. Our aim in this paper is to remove the structural condition and prove the global existence of strong solutions to compressible viscoelastic flows in critical spaces. The new ingredient lies in the introduction of the effective flux $(\theta,\mathcal{G})$, which enables us to capture the dissipation arising from the combination of density and deformation tensor. In the absence of compatibility conditions, only partial dissipation is found in non-Newtonian compressible fluids, which is weaker than that of the usual Navier-Stokes equations.

preprint 2020 arXiv

GluonCV and GluonNLP: Deep Learning in Computer Vision and Natural Language Processing

We present GluonCV and GluonNLP, the deep learning toolkits for computer vision and natural language processing based on Apache MXNet (incubating). These toolkits provide state-of-the-art pre-trained models, training scripts, and training logs, to facilitate rapid prototyping and promote reproducible research. We also provide modular APIs with flexible building blocks to enable efficient customization. Leveraging the MXNet ecosystem, the deep learning models in GluonCV and GluonNLP can be deployed onto a variety of platforms with different programming languages. The Apache 2.0 license has been adopted by GluonCV and GluonNLP to allow for software distribution, modification, and usage.

preprint 2020 arXiv

High Reynolds number and high Weissenberg number Oldroyd-B model with dissipation

We give a small data global well-posedness result for an incompressible Oldroyd-B model with wave number dissipation in the equation of stress tensor. The result is uniform in solvent Reynolds numbers, and requires only fractional wave-number dependent dissipation $(-\Delta)^{\beta}$, $\beta \ge \frac{1}{2}$, in the added stress.
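For readers unfamiliar with the notation, the fractional operator in this abstract is usually understood as a Fourier multiplier. The definition below is the standard one and is stated here as background; it is not quoted from the paper, and the symbol $f$ is a placeholder for a component of the added stress.

```latex
\widehat{(-\Delta)^{\beta} f}(\xi) = |\xi|^{2\beta}\,\widehat{f}(\xi),
\qquad \beta \ge \tfrac{1}{2}.
```

At the threshold $\beta = \tfrac{1}{2}$ the multiplier is $|\xi|$, so the dissipation grows only linearly with the wave number, compared with $|\xi|^{2}$ for the full Laplacian ($\beta = 1$).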

preprint 2020 arXiv

Identification of hydrodynamic instability by convolutional neural networks

The onset of hydrodynamic instabilities is of great importance in both industry and daily life, due to the dramatic mechanical and thermodynamic changes for different types of flow motions. In this paper, modern machine learning techniques, especially convolutional neural networks (CNN), are applied to identify the transition between different flow motions caused by hydrodynamic instability, as well as the critical non-dimensionalized parameters characterizing this transition. The CNN not only correctly predicts the critical transition values for both Taylor-Couette (TC) flow and Rayleigh-Bénard (RB) convection under various setups and conditions, but also shows outstanding robustness and noise tolerance. In addition, key spatial features used for classifying different flow patterns are revealed by principal component analysis.

preprint2020arXiv

Improving Semantic Segmentation via Self-Training

Deep learning usually achieves the best results with complete supervision. In the case of semantic segmentation, this means that large amounts of pixelwise annotations are required to learn accurate models. In this paper, we show that we can obtain state-of-the-art results using a semi-supervised approach, specifically a self-training paradigm. We first train a teacher model on labeled data, and then generate pseudo labels on a large set of unlabeled data. Our robust training framework can digest human-annotated and pseudo labels jointly and achieve top performance on the Cityscapes, CamVid and KITTI datasets while requiring significantly less supervision. We also demonstrate the effectiveness of self-training on a challenging cross-domain generalization task, outperforming conventional finetuning methods by a large margin. Lastly, to alleviate the computational burden caused by the large amount of pseudo labels, we propose a fast training schedule to accelerate the training of segmentation models by up to 2x without performance degradation.
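The teacher, pseudo-label, student loop described in this abstract can be illustrated on a toy problem. The sketch below is only an illustration under strong assumptions: it swaps the segmentation network for a one-dimensional nearest-centroid classifier, and the helper names `fit_centroids`, `predict` and `self_train` are hypothetical, not part of the authors' code.

```python
def fit_centroids(xs, ys):
    """Toy 'model': the mean feature value of each class."""
    centroids = {}
    for label in set(ys):
        members = [x for x, y in zip(xs, ys) if y == label]
        centroids[label] = sum(members) / len(members)
    return centroids


def predict(centroids, xs):
    """Assign each point to the class with the nearest centroid."""
    return [min(centroids, key=lambda c: abs(x - centroids[c])) for x in xs]


def self_train(labeled_x, labeled_y, unlabeled_x):
    # Step 1: train a teacher on the human-annotated data only.
    teacher = fit_centroids(labeled_x, labeled_y)
    # Step 2: let the teacher generate pseudo labels for the unlabeled data.
    pseudo_y = predict(teacher, unlabeled_x)
    # Step 3: train a student jointly on real and pseudo labels.
    student = fit_centroids(labeled_x + unlabeled_x, labeled_y + pseudo_y)
    return teacher, pseudo_y, student


if __name__ == "__main__":
    teacher, pseudo, student = self_train(
        [0.0, 1.0, 4.0, 5.0], [0, 0, 1, 1], [0.5, 0.2, 4.5, 4.8, 1.2, 3.9]
    )
    print("pseudo labels:", pseudo)
    print("student centroids:", student)
```

A real pipeline would typically filter or weight pseudo labels by confidence; that step is omitted here to keep the sketch short.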

preprint2020arXiv

LSBert: A Simple Framework for Lexical Simplification

Lexical simplification (LS) aims to replace complex words in a given sentence with their simpler alternatives of equivalent meaning, to simplify the sentence. Recent unsupervised lexical simplification approaches only rely on the complex word itself regardless of the given sentence to generate candidate substitutions, which will inevitably produce a large number of spurious candidates. In this paper, we propose a lexical simplification framework LSBert based on the pretrained representation model Bert, that is capable of (1) making use of the wider context when both detecting the words in need of simplification and generating substitute candidates, and (2) taking five high-quality features into account for ranking candidates, including Bert prediction order, Bert-based language model, and the paraphrase database PPDB, in addition to the word frequency and word similarity commonly used in other LS methods. We show that our system outputs lexical simplifications that are grammatically correct and semantically appropriate, and obtains a clear improvement over the baselines, outperforming the state-of-the-art by 29.8 Accuracy points on three well-known benchmarks.

preprint2020arXiv

On Constructing Confidence Region for Model Parameters in Stochastic Gradient Descent via Batch Means

In this paper, we study a simple algorithm to construct asymptotically valid confidence regions for model parameters using the batch means method. The main idea is to cancel out the covariance matrix, which is hard or costly to estimate. In the process of developing the algorithm, we establish a process-level functional central limit theorem for Polyak-Ruppert averaging based stochastic gradient descent estimators. We also extend the batch means method to accommodate more general batch size specifications.
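The batch means idea can be sketched on a scalar toy problem. The code below is an illustration under assumptions, not the authors' algorithm: it estimates a Gaussian mean with SGD, splits the iterates into ten equal batches, and builds a Student-t interval from the spread of the batch means, so no covariance estimate is needed. The function name, the step-size schedule and the fixed batch count are all choices made here for illustration.

```python
import random
import statistics

N_BATCHES = 10
T_CRIT_9DF = 2.262  # 97.5% Student-t quantile with N_BATCHES - 1 = 9 degrees of freedom


def sgd_batch_means_interval(theta_true=2.0, n_steps=20000, seed=0):
    """Return (estimate, lower, upper) for the mean of N(theta_true, 1)."""
    rng = random.Random(seed)
    x = 0.0
    iterates = []
    for t in range(1, n_steps + 1):
        y = rng.gauss(theta_true, 1.0)
        step = 0.5 * t ** -0.6      # decaying step size (an assumed schedule)
        x -= step * (x - y)         # gradient of 0.5 * (x - y)**2 with respect to x
        iterates.append(x)

    # Split the trajectory into equal batches and average within each batch.
    size = n_steps // N_BATCHES
    batch_means = [
        statistics.fmean(iterates[k * size:(k + 1) * size]) for k in range(N_BATCHES)
    ]

    # The mean of the batch means is the Polyak-Ruppert average of the iterates;
    # the spread of the batch means supplies the scale of the interval.
    estimate = statistics.fmean(batch_means)
    std_err = statistics.stdev(batch_means) / N_BATCHES ** 0.5
    return estimate, estimate - T_CRIT_9DF * std_err, estimate + T_CRIT_9DF * std_err


if __name__ == "__main__":
    est, lo, hi = sgd_batch_means_interval()
    print(f"estimate={est:.4f}, interval=({lo:.4f}, {hi:.4f})")
```

Using a fixed number of batches keeps the critical value a constant; the abstract mentions more general batch size specifications, which this sketch does not attempt.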

preprint2020arXiv

ResNeSt: Split-Attention Networks

It is well known that feature-map attention and multi-path representation are important for visual recognition. In this paper, we present a modularized architecture, which applies the channel-wise attention on different network branches to leverage their success in capturing cross-feature interactions and learning diverse representations. Our design results in a simple and unified computation block, which can be parameterized using only a few variables. Our model, named ResNeSt, outperforms EfficientNet in accuracy and latency trade-off on image classification. In addition, ResNeSt has achieved superior transfer learning results on several public benchmarks serving as the backbone, and has been adopted by the winning entries of COCO-LVIS challenge. The source code for the complete system and the pretrained models are publicly available.

preprint2020arXiv

Super-transport of Excitons in Atomically Thin Organic Semiconductors at the 2D Quantum Limit

Long-range and fast transport of coherent excitons is important for development of high-speed excitonic circuits and quantum computing applications. However, most of these coherent excitons have only been observed in some low-dimensional semiconductors when coupled with cavities, as there are large inhomogeneous broadening and dephasing effects on the exciton transport in the native states of the materials. Here, by confining coherent excitons at the 2D quantum limit, we observed for the first time molecular aggregation enabled super-transport of excitons in atomically thin two-dimensional (2D) organic semiconductors between coherent states, with a measured high effective exciton diffusion coefficient of 346.9 cm$^2$/s at room temperature. This value is one to several orders of magnitude higher than the reported values from other organic molecular aggregates and low-dimensional inorganic materials. Without coupling to any optical cavities, the monolayer pentacene sample, a very clean 2D quantum system (1.2 nm thick) with high crystallinity (J type aggregation) and minimal interfacial states, showed superradiant emissions from the Frenkel excitons, which was experimentally confirmed by the temperature-dependent photoluminescence (PL) emission, highly enhanced radiative decay rate, significantly narrowed PL peak width and strongly directional in-plane emission. The coherence in monolayer pentacene samples was observed to be delocalized over 135 molecules, which is significantly larger than the values (a few molecules) observed from other organic thin films. In addition, the super-transport of excitons in monolayer pentacene samples showed highly anisotropic behaviour. Our results pave the way for the development of future high-speed excitonic circuits, fast OLEDs, and other opto-electronic devices.

preprint2020arXiv

Testing kinetically coupled inflation models with CMB distortions

Inflation scenarios kinetically coupled with the Einstein tensor have been widely studied. They can be consistent with current observational data. Future measurements of CMB distortions will potentially extend information about the scalar spectrum to small scales $1\,\mathrm{Mpc}^{-1} \lesssim k \lesssim 2 \times 10^4\,\mathrm{Mpc}^{-1}$. By taking the sensitivity of the PIXIE experiment as the criterion, we perform a model-oriented analysis of the observational prospects of spectral distortions for kinetically coupled inflation. There are five models that possibly generate a detectable level of distortions, among the 49 single-field inflation models listed in Ref. [Martin2013a]. These models are: hybrid inflation in the valley (VHI), non-canonical Kähler inflation (NCKI), generalized MSSM inflation (GMSSMI), generalized renormalization point inflation (GRIPI), and running-mass inflation (RMI). Each of these models can satisfy the Planck constraints on spectral tilt and lead to increased power on scales relevant for CMB distortions in a tuned region of their parameter space. The existence of kinetic coupling suppresses the value of the model parameters with mass dimension for VHI, GMSSMI, and GRIPI, such that these three models can be in agreement with their theoretical considerations. However, the tuned regions for all these models fail to satisfy the constraints on tensor modes.

preprint2020arXiv

Unfitted Nitsche's method for computing wave modes in topological materials

In this paper, we propose an unfitted Nitsche's method for computing wave modes in topological materials. The proposed method is based on Nitsche's technique to study the performance-enhanced topological materials which have strongly heterogeneous structures (e.g., the refractive index is piecewise constant with high contrasts). For periodic bulk materials, we use Floquet-Bloch theory and solve an eigenvalue problem on a torus with unfitted meshes. For the materials with a line defect, a sufficiently large domain with zero boundary conditions is used to compute the localized eigenfunctions corresponding to the edge modes. The interfaces are handled by Nitsche's method on an unfitted uniform mesh. We prove the proposed methods converge optimally, and present numerical examples to validate the theoretical results and demonstrate the capability of simulating topological materials.

preprint2020arXiv

Vision-Dialog Navigation by Exploring Cross-modal Memory

Vision-dialog navigation, posed as a new holy-grail task in the vision-language discipline, aims at learning an agent endowed with the capability of constantly conversing in natural language to ask for help and of navigating according to human responses. Besides the common challenges faced in visual language navigation, vision-dialog navigation also requires handling the language intentions of a series of questions about the temporal context from the dialogue history and co-reasoning over both dialogs and visual scenes. In this paper, we propose the Cross-modal Memory Network (CMN) for remembering and understanding the rich information relevant to historical navigation actions. Our CMN consists of two memory modules, the language memory module (L-mem) and the visual memory module (V-mem). Specifically, L-mem learns latent relationships between the current language interaction and a dialog history by employing a multi-head attention mechanism. V-mem learns to associate the current visual views and the cross-modal memory about the previous navigation actions. The cross-modal memory is generated via a vision-to-language attention and a language-to-vision attention. Benefiting from the collaborative learning of the L-mem and the V-mem, our CMN is able to explore the memory of decisions made in historical navigation actions that is relevant to the current step. Experiments on the CVDN dataset show that our CMN outperforms the previous state-of-the-art model by a significant margin on both seen and unseen environments.

preprint2020arXiv

Vision-Language Navigation with Self-Supervised Auxiliary Reasoning Tasks

Vision-Language Navigation (VLN) is a task where agents learn to navigate following natural language instructions. The key to this task is to perceive both the visual scene and natural language sequentially. Conventional approaches exploit the vision and language features in cross-modal grounding. However, the VLN task remains challenging, since previous works have neglected the rich semantic information contained in the environment (such as implicit navigation graphs or sub-trajectory semantics). In this paper, we introduce Auxiliary Reasoning Navigation (AuxRN), a framework with four self-supervised auxiliary reasoning tasks to take advantage of the additional training signals derived from the semantic information. The auxiliary tasks have four reasoning objectives: explaining the previous actions, estimating the navigation progress, predicting the next orientation, and evaluating the trajectory consistency. As a result, these additional training signals help the agent to acquire knowledge of semantic representations in order to reason about its activity and build a thorough perception of the environment. Our experiments indicate that auxiliary reasoning tasks improve both the performance of the main task and the model generalizability by a large margin. Empirically, we demonstrate that an agent trained with self-supervised auxiliary reasoning tasks substantially outperforms the previous state-of-the-art method, being the best existing approach on the standard benchmark.

preprint2020arXiv

Wave packets in the fractional nonlinear Schrödinger equation with a honeycomb potential

In this article, we study wave dynamics in the fractional nonlinear Schrödinger equation with a modulated honeycomb potential. This problem arises from recent research interests in the interplay between topological materials and nonlocal governing equations, both of which are current focuses of scientific research. We first develop the Floquet-Bloch spectral theory of the linear fractional Schrödinger operator with a honeycomb potential. In particular, we prove the existence of conical degenerate points, i.e., Dirac points, at which two dispersion band functions intersect. We then investigate the dynamics of wave packets spectrally localized at a Dirac point and derive the leading effective envelope equation. It turns out the envelope can be described by a nonlinear Dirac equation with a varying mass. With rigorous error estimates, we demonstrate that the asymptotic solution based on the effective envelope equation approximates the true solution well in the weighted-$H^s$ space.

preprint2020arXiv

When Machine Learning Meets Multiscale Modeling in Chemical Reactions

Due to the intrinsic complexity and nonlinearity of chemical reactions, direct applications of traditional machine learning algorithms may face many difficulties. In this study, through two concrete examples with biological background, we illustrate how the key ideas of multiscale modeling can help to substantially reduce the computational cost of machine learning, as well as how machine learning algorithms perform model reduction automatically in a time-scale separated system. Our study highlights the necessity and effectiveness of an integration of machine learning algorithms and multiscale modeling during the study of chemical reactions.

preprint2019arXiv

Wave packet dynamics in slowly modulated photonic graphene

Mathematical analysis of electromagnetic waves in photonic graphene, a photonic topological material which has a honeycomb structure, is one of the most important current research topics. By modulating the honeycomb structure, numerous topological phenomena have been observed recently. The electromagnetic waves in such media are generally described by the 2-dimensional wave equation. It has been shown that the corresponding elliptic operator with a honeycomb material weight has Dirac points in its dispersion surfaces. In this paper, we study the time evolution of the wave packets spectrally concentrated at such Dirac points in a modulated honeycomb material weight. We prove that such wave packet dynamics is governed by the Dirac equation with a varying mass in a large but finite time. Our analysis provides mathematical insights into those topological phenomena in photonic graphene.

preprint2016arXiv

$\mathbb{A}^1$-curves on affine complete intersections

We generalize the results of Clemens, Ein, and Voisin regarding rational curves and zero cycles on generic projective complete intersections to the logarithmic setup.

preprint2016arXiv

$\mathcal{PT}$ symmetry in a fractional Schrödinger equation

We investigate the fractional Schrödinger equation with a periodic $\mathcal{PT}$-symmetric potential. In the inverse space, the problem transfers into a first-order nonlocal frequency-delay partial differential equation. We show that at a critical point, the band structure becomes linear and symmetric in the one-dimensional case, which results in a nondiffracting propagation and conical diffraction of input beams. If only one channel in the periodic potential is excited, adjacent channels become uniformly excited along the propagation direction, which can be used to generate laser beams of high power and narrow width. In the two-dimensional case, there appears conical diffraction that depends on the competition between the fractional Laplacian operator and the $\mathcal{PT}$-symmetric potential. This investigation may find applications in novel on-chip optical devices.

preprint2016arXiv

Depth2Action: Exploring Embedded Depth for Large-Scale Action Recognition

This paper performs the first investigation into depth for large-scale human action recognition in video where the depth cues are estimated from the videos themselves. We develop a new framework called depth2action and experiment thoroughly into how best to incorporate the depth information. We introduce spatio-temporal depth normalization (STDN) to enforce temporal consistency in our estimated depth sequences. We also propose modified depth motion maps (MDMM) to capture the subtle temporal changes in depth. These two components significantly improve the action recognition performance. We evaluate our depth2action framework on three large-scale action recognition video benchmarks. Our model achieves state-of-the-art performance when combined with appearance and motion information thus demonstrating that depth2action is indeed complementary to existing approaches.

preprint2016arXiv

Land Use Classification using Convolutional Neural Networks Applied to Ground-Level Images

Land use mapping is a fundamental yet challenging task in geographic science. In contrast to land cover mapping, it is generally not possible using overhead imagery. The recent, explosive growth of online geo-referenced photo collections suggests an alternate approach to geographic knowledge discovery. In this work, we present a general framework that uses ground-level images from Flickr for land use mapping. Our approach benefits from several novel aspects. First, we address the noisiness of the online photo collections, such as imprecise geolocation and uneven spatial distribution, by performing location and indoor/outdoor filtering, and semi-supervised dataset augmentation. Our indoor/outdoor classifier achieves state-of-the-art performance on several benchmark datasets and approaches human-level accuracy. Second, we utilize high-level semantic image features extracted using deep learning, specifically convolutional neural networks, which allow us to achieve upwards of 76% accuracy on a challenging eight class land use mapping problem.

preprint2016arXiv

Mesoscopic structural phase progression in photo-excited VO2 revealed by time-resolved x-ray diffraction microscopy

Dynamical phase separation during a solid-solid phase transition poses a challenge for understanding the fundamental processes in correlated materials. Critical information underlying a phase transition, such as localized phase competition, is difficult to reveal by measurements that are spatially averaged over many phase separated regions. The ability to simultaneously track the spatial and temporal evolution of such systems is essential to understanding mesoscopic processes during a phase transition. Using state-of-the-art time-resolved hard x-ray diffraction microscopy, we directly visualize the structural phase progression in a VO2 film upon photoexcitation. Following a homogeneous in-plane optical excitation, the phase transformation is initiated at discrete sites and completed by the growth of one lattice structure into the other, instead of a simultaneous isotropic lattice symmetry change. The time-dependent x-ray diffraction spatial maps show that the in-plane phase progression in laser-superheated VO2 is via a displacive lattice transformation as a result of relaxation from an excited monoclinic phase into a rutile phase. The speed of the phase front progression is quantitatively measured, and is faster than the process driven by in-plane thermal diffusion but slower than the sound speed in VO2. The direct visualization of localized structural changes in the time domain opens a new avenue to study mesoscopic processes in driven systems.

preprint2016arXiv

Picosecond electric-field-induced threshold switching in phase-change materials

Many chalcogenide glasses undergo a breakdown in electronic resistance above a critical field strength. Known as threshold switching, this mechanism enables field-induced crystallization in emerging phase-change memory. Purely electronic as well as crystal nucleation assisted models have been employed to explain the electronic breakdown. Here, picosecond electric pulses are used to excite amorphous Ag$_4$In$_3$Sb$_{67}$Te$_{26}$. Field-dependent reversible changes in conductivity and pulse-driven crystallization are observed. The present results show that threshold switching can take place within the electric pulse on sub-picosecond time-scales - faster than crystals can nucleate. This supports purely electronic models of threshold switching and reveals potential applications as an ultrafast electronic switch.

preprint2016arXiv

Spatio-Temporal Sentiment Hotspot Detection Using Geotagged Photos

We perform spatio-temporal analysis of public sentiment using geotagged photo collections. We develop a deep learning-based classifier that predicts the emotion conveyed by an image. This allows us to associate sentiment with place. We perform spatial hotspot detection and show that different emotions have distinct spatial distributions that match expectations. We also perform temporal analysis using the capture time of the photos. Our spatio-temporal hotspot detection correctly identifies emerging concentrations of specific emotions and year-by-year analyses of select locations show there are strong temporal correlations between the predicted emotions and known events.

preprint2016arXiv

Thermal fission rates with temperature dependent fission barriers

The fission processes of thermally excited nuclei are conventionally studied by statistical models which rely on inputs of phenomenological level densities and potential barriers. Therefore the microscopic descriptions of spontaneous fission and induced fission are very desirable for a unified understanding of various fission processes. We propose to study the fission rates, at both low and high temperatures, with microscopically calculated temperature-dependent fission barriers and collective mass parameters. The fission barriers are calculated by the finite-temperature Skyrme-Hartree-Fock+BCS method. The mass parameters are calculated by the temperature-dependent cranking approximation. The thermal fission rates can be obtained by the imaginary free energy approach at all temperatures, in which fission barriers are naturally temperature dependent. The fission at low temperatures can be described mainly as a barrier-tunneling process, while the fission at high temperatures has to incorporate the reflection above barriers. Our results of spontaneous fission rates reasonably agree with other studies and experiments. The temperature dependencies of fission barrier heights and curvatures have been discussed. The temperature dependent behaviors of mass parameters have also been discussed. The thermal fission rates from low to high temperatures with a smooth connection have been given by different approaches. Since the temperature dependencies of fission barrier heights and curvatures, and the mass parameters can vary rapidly for different nuclei, the microscopic descriptions of thermal fission rates are very valuable. Our studies without free parameters provide a consistent picture to study various fissions such as that in fast-neutron reactors, astrophysical environments and fusion reactions for superheavy nuclei.

preprint2015arXiv

$\mathbb{A}^1$-connected varieties over non-closed fields

In this paper, we prove two results regarding the arithmetic of separably $\mathbb{A}^1$-connected varieties of rank one. First, we prove that over a large field there is an $\mathbb{A}^1$-curve through any rational point of the boundary, if the boundary divisor is smooth and separably rationally connected. Second, we generalize a theorem of Hassett-Tschinkel for the Zariski density of integral points over function fields of curves.

preprint2015arXiv

A Novel Dissipation Property of the Master Equation

Recent studies have shown that the entropy production rate for the master equation consists of two nonnegative terms: the adiabatic and non-adiabatic parts, where the non-adiabatic part is also known as the dissipation rate of a Boltzmann-Shannon relative entropy. In this paper, we provide some nonzero lower bounds for the relative entropy, the entropy production rate, and its adiabatic and non-adiabatic parts. These nonzero lower bounds not only reveal some novel dissipative properties for general nonequilibrium processes which are much stronger than the second law of thermodynamics, but also impose some new constraints on thermodynamic constitutive relations. Moreover, we also provide a mathematical application of these nonzero lower bounds by studying the long-time behavior of the master equation. Extensions to the Tsallis statistics are also discussed, including the nonzero lower bounds for the Tsallis-type relative entropy and its dissipation rate.
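The two-term decomposition mentioned at the start of this abstract can be checked numerically on a small chain. The sketch below uses the standard stochastic-thermodynamics formulas for a master equation with transition rates $q_{ij}$, probabilities $p_i$ and stationary distribution $\pi_i$; it does not reproduce the paper's lower bounds. The function names, the example rates and the simple power-iteration solver are assumptions made for illustration.

```python
import math


def stationary(q, h=0.1, iters=5000):
    """Approximate the stationary distribution by iterating pi <- pi (I + h Q).

    Assumes an irreducible chain and h small enough that 1 - h * exit_rate > 0.
    """
    n = len(q)
    exit_rate = [sum(q[i][j] for j in range(n) if j != i) for i in range(n)]
    pi = [1.0 / n] * n
    for _ in range(iters):
        pi = [
            pi[i] + h * (sum(pi[j] * q[j][i] for j in range(n) if j != i)
                         - pi[i] * exit_rate[i])
            for i in range(n)
        ]
    return pi


def entropy_production_parts(p, q):
    """Return (total, adiabatic, non_adiabatic) entropy production rates."""
    pi = stationary(q)
    n = len(p)
    total = adiabatic = non_adiabatic = 0.0
    for i in range(n):
        for j in range(n):
            if i == j:
                continue
            net_flux = p[i] * q[i][j] - p[j] * q[j][i]
            total += 0.5 * net_flux * math.log(p[i] * q[i][j] / (p[j] * q[j][i]))
            adiabatic += 0.5 * net_flux * math.log(pi[i] * q[i][j] / (pi[j] * q[j][i]))
            non_adiabatic += 0.5 * net_flux * math.log(p[i] * pi[j] / (p[j] * pi[i]))
    return total, adiabatic, non_adiabatic


if __name__ == "__main__":
    rates = [[0.0, 2.0, 1.0], [1.0, 0.0, 3.0], [2.0, 1.0, 0.0]]  # no detailed balance
    probs = [0.7, 0.2, 0.1]
    print(entropy_production_parts(probs, rates))
```

The example rates deliberately violate detailed balance, so the adiabatic part is strictly positive; for a chain satisfying detailed balance it vanishes and only the non-adiabatic part remains.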

preprint2015arXiv

A rigorous derivation of multicomponent diffusion laws

This article is concerned with the dynamics of a mixture of gases. Under the assumption that all the gases are isothermal and inviscid, we show that the governing equations have an elegant conservation-dissipation structure. With the help of this structure, a multicomponent diffusion law is derived with mathematical rigor. This clarifies a long-standing non-uniqueness issue in the field for the first time. The multicomponent diffusion law derived here takes the spatial gradient of an entropic variable as the thermodynamic force and satisfies a nonlinear version of the Onsager reciprocal relations.

preprint2015arXiv

Layer-dependent surface potential of phosphorene and anisotropic/layer-dependent charge transfer in phosphorene-gold hybrid system

The surface potential and the efficiency of interfacial charge transfer are extremely important for designing future semiconductor devices based on the emerging two-dimensional (2D) phosphorene. Here, we directly measured the strongly layer-dependent surface potential of mono- and few-layer phosphorene on gold, which agrees with the reported theoretical prediction. At the same time, we used an optical method, photoluminescence (PL) spectroscopy, to probe the charge transfer in the phosphorene-gold hybrid system. We observed, for the first time, highly anisotropic and layer-dependent PL quenching in the phosphorene-gold hybrid system, which is attributed to the highly anisotropic/layer-dependent interfacial charge transfer.

preprint2015arXiv

On the irreducibility of the space of genus zero stable log maps to wonderful compactifications

In this paper, we prove the moduli spaces of genus zero stable log maps to a large class of wonderful compactifications are irreducible and unirational.

preprint2015arXiv

Strong approximation over function fields

By studying $\mathbb{A}^1$-curves on varieties, we propose a geometric approach to the strong approximation problem over function fields of complex curves. We prove that strong approximation holds for smooth, low degree affine complete intersections with the boundary smooth at infinity.

preprint2015arXiv

Study of weakly-bound odd-A nuclei with quasiparticle blocking

The coordinate-space Hartree-Fock-Bogoliubov (HFB) approach with quasiparticle blocking has been applied to study the odd-A weakly bound nuclei $^{17,19}$B and $^{37}$Mg, in which halo structures have been reported in experiments. The Skyrme nuclear forces SLy4 and UNEDF1 have been adopted in our calculations. The results with and without blocking have been compared to demonstrate the emergence of deformed halo structures due to blocking effects. In our calculations, $^{19}$B and $^{37}$Mg have remarkable features of deformed halos.

preprint2015arXiv

Very free curves on Fano Complete Intersections

In this paper, we show that general Fano complete intersections over an algebraically closed field of arbitrary characteristic are separably rationally connected. Our proof also implies that general log Fano complete intersections with smooth tame boundary divisors admit very free $\mathbb{A}^1$-curves.

preprint2014arXiv

Boltzmann-Equation Based Derivation of Balance Laws in Irreversible Thermodynamics

In this paper we propose a novel approach to construct macroscopic balance equations and constitutive equations describing various irreversible phenomena. It is based on the general principles of non-equilibrium thermodynamics and consists of four basic steps: picking suitable state variables, choosing a strictly concave entropy function, separating entropy fluxes and production rates properly, and determining a dissipation matrix. Our approach takes advantage of both the EIT and GENERIC formalisms, and shows a direct correspondence with Levermore's moment closure hierarchies for the Boltzmann equation. This result may put various macroscopic modeling approaches starting from the general principles of non-equilibrium thermodynamics on a solid microscopic foundation based on the Boltzmann equation.

preprint2014arXiv

Conservation-dissipation formalism of irreversible thermodynamics

We propose a conservation-dissipation formalism (CDF) for coarse-grained descriptions of irreversible processes. This formalism is based on a stability criterion for non-equilibrium thermodynamics. The criterion ensures that non-equilibrium states tend to equilibrium in the long-time limit. As a systematic methodology, CDF provides a feasible procedure for choosing non-equilibrium state variables and determining their evolution equations. The equations derived in CDF have a unified elegant form. They are globally hyperbolic, allow a convenient definition of weak solutions, and are amenable to existing numerics. More importantly, CDF is a genuinely nonlinear formalism and works for systems far away from equilibrium. With this formalism, we formulate novel thermodynamics theories for heat conduction in rigid bodies and non-isothermal compressible Maxwell fluid flows as two typical examples. In these examples, the non-equilibrium variables are exactly the conjugate variables of the heat fluxes or stress tensors. The new theory generalizes Cattaneo's law or Maxwell's law in a regularized and nonlinear fashion.
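As a point of reference for the final sentence, the classical linear form of Cattaneo's law is sketched below next to Fourier's law. This is the textbook relation, not the regularized nonlinear generalization derived in the paper; the symbols $\mathbf{q}$ (heat flux), $T$ (temperature), $\kappa$ (thermal conductivity) and $\tau$ (relaxation time) are assumed here for illustration.

```latex
% Fourier's law (instantaneous response) versus the classical Cattaneo law
% (finite relaxation time \tau); symbols are illustrative assumptions.
\[
  \mathbf{q} = -\kappa \nabla T
  \qquad\text{versus}\qquad
  \tau\,\partial_t \mathbf{q} + \mathbf{q} = -\kappa \nabla T .
\]
```

Setting $\tau = 0$ recovers Fourier's law, while $\tau > 0$ makes the resulting heat equation hyperbolic, which is consistent with the hyperbolicity the abstract attributes to equations derived in CDF.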

preprint2014arXiv

Feynman Diagrams for Stochastic Inflation and Quantum Field Theory in de Sitter Space

We consider a massive scalar field with quartic self-interaction $λ/4!\,ϕ^4$ in de Sitter spacetime and present a diagrammatic expansion that describes the field as driven by stochastic noise. This is compared with the Feynman diagrams in the Keldysh basis of the Amphichronous (Closed-Time-Path) Field Theoretical formalism. For all orders in the expansion, we find that the diagrams agree when evaluated in the leading infrared approximation, i.e. to leading order in $m^2/H^2$, where $m$ is the mass of the scalar field and $H$ is the Hubble rate. As a consequence, the correlation functions computed in both approaches also agree to leading infrared order. This perturbative correspondence shows that the stochastic Theory is exactly equivalent to the Field Theory in the infrared. The former can then offer a non-perturbative resummation of the Field Theoretical Feynman diagram expansion, including fields with $0\leq m^2\ll\sqrt λH^2$ for which the perturbation expansion fails at late times.

preprint2014arXiv

Microscopic description of neutron emission rates in compound nuclei

The neutron emission rates in thermally excited nuclei are conventionally described by statistical models with a phenomenological level density parameter that depends on excitation energies, deformations and mass regions. In the microscopic view of hot nuclei, the neutron emission rates can be determined by the external neutron gas densities without any free parameters. A microscopic description of thermal neutron emission is therefore desirable, as it can affect the understanding of, for example, survival probabilities of superheavy compound nuclei and neutron emissivity in reactors. To describe the neutron emission rates microscopically, the external thermal neutron gases are self-consistently obtained based on the Finite-Temperature Hartree-Fock-Bogoliubov (FT-HFB) approach. The results are compared with the statistical model to explore the connections between the FT-HFB approach and the statistical model. The Skyrme FT-HFB equation is solved by HFB-AX in deformed coordinate spaces. Based on the FT-HFB approach, the thermal properties and external neutron gas are properly described with the self-consistent gas subtraction procedure. Then neutron emission rates can be obtained based on the densities of external neutron gases. The thermal statistical properties of $^{238}$U and $^{258}$U are studied in detail in terms of excitation energies. The thermal neutron emission rates in $^{238, 258}$U and superheavy compound nuclei $_{112}^{278}$Cn and $_{114}^{292}$Fl are calculated, which agree well with the statistical model by adopting an excitation-energy-dependent level density parameter. The coordinate-space FT-HFB approach can provide reliable microscopic descriptions of neutron emission rates in hot nuclei, as well as microscopic constraints on the excitation energy dependence of level density parameters for statistical models.

preprint2014arXiv

Testing Inflation and Curvaton Scenarios with CMB Distortions

Prior to recombination, Silk damping causes the dissipation of energy from acoustic waves into the monopole of the Cosmic Microwave Background (CMB), resulting in spectral distortions. These can be used to probe the primordial scalar power spectrum on smaller scales than is possible with CMB anisotropies. An enhancement of power on these scales is nevertheless required for the resulting distortions to be detectable by future experiments like PIXIE. In this paper, we examine all 49 single-field inflation models listed by Martin et al. in the Encyclopaedia Inflationaris [1] and find that only one of these, the original hybrid model, may lead to a detectable level of distortions in a tuned region of its parameter space. Three effective multi-field scenarios are also studied: with softly and suddenly turning trajectories, and with a mild waterfall trajectory. Softly turning trajectories do not induce distortions at any detectable level, whereas a sudden turn in the field space or a mild waterfall trajectory predicts a peak (plus damped oscillations in the sudden turn case) in the scalar power spectrum, which can lead to an observable amount of CMB distortions. Finally, another scenario leading to potentially detectable distortions involves a curvaton whose blue spectrum is subdominant on CMB angular scales and overtakes the inflaton spectrum on smaller scales. In this case, however, we show that the bounds from ultra compact minihaloes are not satisfied. Expectations for an ultimate PRISM-class experiment characterized by an improvement in sensitivity by a factor of ten are discussed for some models.

preprint2014arXiv

Unveiling pseudospin and angular momentum in photonic graphene

Pseudospin, an additional degree of freedom inherent in graphene, plays a key role in understanding many fundamental phenomena such as the anomalous quantum Hall effect, electron chirality and the Klein paradox. Unlike the electron spin, the pseudospin was traditionally considered an unmeasurable quantity, immune to Stern-Gerlach-type experiments. Recently, however, it has been suggested that graphene pseudospin is a real angular momentum that might manifest itself as an observable quantity, but so far direct tests of such a momentum have remained unfruitful. Here, by selective excitation of two sublattices of an artificial photonic graphene, we demonstrate pseudospin-mediated vortex generation and topological charge flipping in otherwise uniform optical beams with Bloch momentum traversing through the Dirac points. Corroborated by numerical solutions of the linear massless Dirac-Weyl equation, we show that pseudospin can turn into orbital angular momentum completely, thus upholding the belief that pseudospin is not merely for theoretical elegance but rather physically measurable.

preprint2013arXiv

Infrared Correlations in de Sitter Space: Field Theoretic vs. Stochastic Approach

We consider massive λϕ^4 theory in de Sitter background. The mass of the scalar field ϕ is chosen small enough, such that the amplification of superhorizon momentum modes leads to a significant enhancement of infrared correlations, but large enough such that perturbation theory remains valid. Using the Closed-Time-Path approach, we calculate the infrared corrections to the two-point function of ϕ to 2-loop order. To this approximation, we find agreement with the correlation found using stochastic methods. When breaking the results down to individual Feynman diagrams obtained by the two different methods, we observe that these agree as well.

preprint2013arXiv

Non-Gaussianities and Curvature Perturbations from Hybrid Inflation

For the original hybrid inflation as well as the supersymmetric F-term and D-term hybrid models, we calculate the level of non-Gaussianities and the power spectrum of curvature perturbations generated during the waterfall, taking into account the contribution of entropic modes. We focus on the regime of mild waterfall, in which inflation continues for more than about 60 e-folds $N$ during the waterfall. We find that the associated $f_{\rm nl}$ parameter typically goes from $f_{\rm nl} \simeq -1/N_{\rm exit}$ in the regime with $N \gg 60$, where $N_{\rm exit}$ is the number of e-folds between the time of Hubble exit of a pivot scale and the end of inflation, down to $f_{\rm nl} \sim -0.3$ when $N \gtrsim 60$, i.e. much smaller in magnitude than the current bound from Planck. Considering only the adiabatic perturbations, the power spectrum is red, with a spectral index $n_s = 1 - 4/N_{\rm exit}$, in the case $N \gg 60$, whereas in the case $N \gtrsim 60$, it increases up to unity. Including the contribution of entropic modes does not change the observable predictions in the first case. However, in the second case, they are a relevant source for the power spectrum of curvature perturbations, of which the amplitude increases by several orders of magnitude and can lead to black hole formation. We conclude that due to the important contribution of entropic modes, the parameter space leading to a mild waterfall phase is excluded by CMB observations for all the considered models.

preprint2012arXiv

Nonlinear dynamics of wave packets in PT-symmetric optical lattices near the phase transition point

Nonlinear dynamics of wave packets in PT-symmetric optical lattices near the phase-transition point are analytically studied. A nonlinear Klein-Gordon equation is derived for the envelope of these wave packets. A variety of novel phenomena known to exist in this envelope equation are shown to also exist in the full equation including wave blowup, periodic bound states and solitary wave solutions.

preprint2011arXiv

Can the Reissner-Nordström black hole or Schwarzschild black hole be the stable Planck-scale particle accelerator?

It is shown that the extremal Reissner-Nordström black hole, the non-extremal one with multiple scattering particles, and the Schwarzschild black hole with radial head-on particles are stable under the collision of the particles near the horizon, if the back-reaction effect and the effect generated by the gravity of the particles are taken into account. Moreover, collisions near Reissner-Nordström black holes with astrophysically typical mass cannot generate Planck-scale center-of-mass energy. However, a head-on collision near a typical primordial black hole could just occur at the Planck-energy scale.

preprint2011arXiv

Fano Hypersurfaces in Positive Characteristic

We prove that a general Fano hypersurface in a projective space over an algebraically closed field of arbitrary characteristic is separably rationally connected.

preprint2011arXiv

General stationary charged black holes as charged particle accelerators

We study the possibility of getting infinite energy in the center-of-mass frame of colliding charged particles in a general stationary charged black hole. For black holes with a two-fold degenerate horizon, it is found that arbitrarily high center-of-mass energy can be attained, provided that one of the particles has critical angular momentum or critical charge, and the remaining parameters of the particles and black holes satisfy certain restrictions. For black holes with multiple-fold degenerate event horizons, the restriction is relaxed. For non-degenerate black holes, ultra-high center-of-mass energy can be reached by invoking the multiple scattering mechanism. We obtain a condition for the existence of an innermost stable circular orbit with critical angular momentum or charge on any-fold degenerate horizons, which is essential to get ultra-high center-of-mass energy without a fine-tuning problem. We also discuss the proper time spent by the particle to reach the horizon and the duality between the frame dragging effect and the electromagnetic interaction. Some of these general results are applied to braneworld small black holes.

preprint2009arXiv

Separatrix Map Analysis for Fractal Scatterings in Weak Interactions of Solitary Waves

Previous studies have shown that fractal scatterings in weak interactions of solitary waves in the generalized nonlinear Schrödinger equations are described by a universal second-order separatrix map. In this paper, this separatrix map is analyzed in detail, and hence a complete characterization of fractal scatterings in these weak interactions is obtained. In particular, scaling laws of these fractals are derived analytically for different initial conditions, and these laws are confirmed by direct numerical simulations. In addition, an analytical criterion for the occurrence of fractal scatterings is given explicitly.

preprint2008arXiv

A Universal Map for Fractal Structures in Weak Solitary Wave Interactions

Fractal scatterings in weak solitary wave interactions are analyzed for generalized nonlinear Schrödinger equations (GNLS). Using asymptotic methods, these weak interactions are reduced to a universal second-order map. This map gives the same fractal scattering patterns as those in the GNLS equations both qualitatively and quantitatively. Scaling laws of these fractals are also derived.

68 published items

Acting Flatterers via LLMs Sycophancy: Combating Clickbait with LLMs Opposing-Stance Reasoning

Alethia: A Foundational Encoder for Voice Deepfakes

MixGen: A New Multi-Modal Data Augmentation

ADAPT: Vision-Language Navigation with Modality-Aligned Action Prompts

BigDetection: A Large-scale Benchmark for Improved Object Detector Pre-training

Building Robust Spoken Language Understanding by Cross Attention between Phoneme Sequence and ASR Hypothesis

Chinese Idiom Paraphrasing

Gated Multimodal Fusion with Contrastive Learning for Turn-taking Prediction in Human-robot Dialogue

Global classical solutions of 3D compressible viscoelastic system near equilibrium

Global well-posedness for 2D non-resistive compressible MHD system in periodic domain

ImpDet: Exploring Implicit Fields for 3D Object Detection

Large-scale Dynamics of Winds Driven by Line Force from a Thin Accretion Disk

NTIRE 2022 Challenge on Efficient Super-Resolution: Methods and Results

Partial and Asymmetric Contrastive Learning for Out-of-Distribution Detection in Long-Tailed Recognition

Pixel-level Correspondence for Self-Supervised Learning from Video

Prompt-Learning for Short Text Classification

Traveling edge states in massive Dirac equations along slowly varying edges

A phase field model for mass transport with semi-permeable interfaces

Combining Deep Generative Models and Multi-lingual Pretraining for Semi-supervised Document Classification

Rapid Multi-Physics Simulation for Electro-Thermal Origami Systems

Three-fold Weyl points in the Schrödinger operator with periodic potentials

Direct Measurement of Folding Angle and Strain Vector in Atomically thin WS$_2$ using Second Harmonic Generation

Global existence in critical spaces for non Newtonian compressible viscoelastic flows

GluonCV and GluonNLP: Deep Learning in Computer Vision and Natural Language Processing

High Reynolds number and high Weissenberg number Oldroyd-B model with dissipation

Identification of hydrodynamic instability by convolutional neural networks

Improving Semantic Segmentation via Self-Training

LSBert: A Simple Framework for Lexical Simplification

On Constructing Confidence Region for Model Parameters in Stochastic Gradient Descent via Batch Means

ResNeSt: Split-Attention Networks

Super-transport of Excitons in Atomically Thin Organic Semiconductors at the 2D Quantum Limit

Testing kinetically coupled inflation models with CMB distortions

Unfitted Nitsche's method for computing wave modes in topological materials

Vision-Dialog Navigation by Exploring Cross-modal Memory

Vision-Language Navigation with Self-Supervised Auxiliary Reasoning Tasks

Wave packets in the fractional nonlinear Schrödinger equation with a honeycomb potential

When Machine Learning Meets Multiscale Modeling in Chemical Reactions

Wave packet dynamics in slowly modulated photonic graphene

$\mathbb{A}^1$-curves on affine complete intersections

$\mathcal{PT}$ symmetry in a fractional Schrödinger equation

Depth2Action: Exploring Embedded Depth for Large-Scale Action Recognition

Land Use Classification using Convolutional Neural Networks Applied to Ground-Level Images

Mesoscopic structural phase progression in photo-excited VO2 revealed by time-resolved x-ray diffraction microscopy

Picosecond electric-field-induced threshold switching in phase-change materials

Spatio-Temporal Sentiment Hotspot Detection Using Geotagged Photos

Thermal fission rates with temperature dependent fission barriers

$\mathbb{A}^1$-connected varieties over non-closed fields

A Novel Dissipation Property of the Master Equation

A rigorous derivation of multicomponent diffusion laws

Layer-dependent surface potential of phosphorene and anisotropic/layer-dependent charge transfer in phosphorene-gold hybrid system

On the irreducibility of the space of genus zero stable log maps to wonderful compactifications

Strong approximation over function fields

Study of weakly-bound odd-A nuclei with quasiparticle blocking

Very free curves on Fano Complete Intersections

Boltzmann-Equation Based Derivation of Balance Laws in Irreversible Thermodynamics

Conservation-dissipation formalism of irreversible thermodynamics

Feynman Diagrams for Stochastic Inflation and Quantum Field Theory in de Sitter Space

Microscopic description of neutron emission rates in compound nuclei

Testing Inflation and Curvaton Scenarios with CMB Distortions

Unveiling pseudospin and angular momentum in photonic graphene

Infrared Correlations in de Sitter Space: Field Theoretic vs. Stochastic Approach

Non-Gaussianities and Curvature Perturbations from Hybrid Inflation

Nonlinear dynamics of wave packets in PT-symmetric optical lattices near the phase transition point

Can the Reissner-Nordström black hole or Schwarzschild black hole be the stable Planck-scale particle accelerator?

Fano Hypersurfaces in Positive Characteristic

General stationary charged black holes as charged particle accelerators

Separatrix Map Analysis for Fractal Scatterings in Weak Interactions of Solitary Waves

A Universal Map for Fractal Structures in Weak Solitary Wave Interactions