Source author record

Hao Jiang

Hao Jiang appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Catalog footprint

What is connected

56 works

32 topics

4 close collaborators

preprint, 2026, arXiv

Diffusion-APO: Trajectory-Aware Direct Preference Alignment for Video Diffusion Transformers

Efficiently aligning large-scale video diffusion models with human intent requires a scalable and trajectory-aware pathway that bridges the inherent discrepancy between training noise distributions and practical inference trajectories. While existing paradigms such as Direct Preference Optimization (DPO) and Group Relative Policy Optimization (GRPO) attempt to address this, they are often hindered by either reliance on bias-prone, complex reward models or suboptimal timestep sampling. In this paper, we propose Diffusion-APO (Aligned Preference Optimization), a trajectory-aware algorithm that resolves this misalignment by synchronizing training noise with inference-time denoising paths to maximize gradient signal efficacy. To translate this algorithmic innovation into a practical solution, we introduce a unified and modular RLHF framework that integrates online ranking, half-online anchoring, offline refinement, and distillation-aware drift correction. This framework enables flexible, multi-stage preference alignment across diverse data and computational constraints without relying on scalar-reward-based policy gradients. Through extensive experiments, we demonstrate that Diffusion-APO consistently outperforms standard baselines in visual quality and instruction following, while effectively preserving generative fidelity during model acceleration, providing a robust, end-to-end pathway for scalable video diffusion alignment.

preprint, 2026, arXiv

Electric field switching of altermagnetic spin-splitting in multiferroic skyrmions

Magnetic skyrmions are localized magnetic structures that retain their shape and stability over time, thanks to their topological nature. Recent theoretical and experimental progress has laid the groundwork for understanding magnetic skyrmions characterized by negligible net magnetization and ultrafast dynamics. Notably, skyrmions emerging in materials with altermagnetism, a novel magnetic phase featuring lifted Kramers degeneracy, have remained unreported until now. In this study, we demonstrate that BiFeO3, a multiferroic renowned for its strong coupling between ferroelectricity and magnetism, can transition from a spin cycloid to a Néel-type skyrmion under antidamping spin-orbit torque at room temperature. Strikingly, the altermagnetic spin splitting within the BiFeO3 skyrmion can be reversed through the application of an electric field, as revealed via the circular photogalvanic effect. This quasiparticle, which possesses a neutral topological charge, holds substantial promise for diverse applications, most notably enabling the development of unconventional computing systems with low power consumption and magnetoelectric controllability.

preprint, 2026, arXiv

Exposing and Mitigating Temporal Attack in Deepfake Video Detection

While spatiotemporal deepfake detectors achieve high AUC, our experiments reveal their susceptibility to evasion attacks. These models tend to overfit on fragile temporal spectrum cues, rather than learning robust semantic causality. To mitigate this vulnerability, we propose SpInShield, a temporal spectral-invariant defense framework explicitly designed to decouple semantic motion from manipulatable spectral artifacts. We propose a learnable spectral adversary that dynamically synthesizes severe spectral deformations, simulating extreme attack scenarios. By employing a shortcut suppression optimization strategy, SpInShield compels the encoder to extract reliable forensic cues while purging unstable spectral statistics from the latent space. Experiments show that SpInShield obtains competitive performance on widely used datasets and outperforms the strongest baseline by 21.30 percentage points in AUC under simulated amplitude spectral attacks.

preprint, 2026, arXiv

Stereo Audio Rendering for Personal Sound Zones Using a Binaural Spatially Adaptive Neural Network (BSANN)

A binaural rendering framework for personal sound zones (PSZs) is proposed to enable multiple head-tracked listeners to receive fully independent stereo audio programs. Current PSZ systems typically rely on monophonic rendering and therefore cannot control the left and right ears separately, which limits the quality and accuracy of spatial imaging. The proposed method employs a Binaural Spatially Adaptive Neural Network (BSANN) to generate ear-optimized loudspeaker filters that reconstruct the desired acoustic field at each ear of multiple listeners. The framework integrates anechoically measured loudspeaker frequency responses, analytically modeled transducer directivity, and rigid-sphere head-related transfer functions (HRTFs) to enhance acoustic accuracy and spatial rendering fidelity. An explicit active crosstalk cancellation (XTC) stage further improves three-dimensional spatial perception. Experiments show significant gains in measured objective performance metrics, including inter-zone isolation (IZI), inter-program isolation (IPI), and crosstalk cancellation (XTC), with log-frequency-weighted values of 10.23/10.03 dB (IZI), 11.11/9.16 dB (IPI), and 10.55/11.13 dB (XTC), respectively, over 100-20,000 Hz. The combined use of ear-wise control, accurate acoustic modeling, and integrated active XTC produces a unified rendering method that delivers greater isolation performance, increased robustness to room asymmetry, and more faithful spatial reproduction in real acoustic environments.

preprint, 2026, arXiv

TEA: Temporal Adaptive Satellite Image Semantic Segmentation

Crop mapping based on satellite image time series (SITS) holds substantial economic value in agricultural production settings, in which parcel segmentation is an essential step. Existing approaches have achieved notable advancements in SITS segmentation with predetermined sequence lengths. However, we found that these approaches overlook the generalization capability of models across scenarios with varying temporal lengths, leading to markedly poor segmentation results in such cases. To address this issue, we propose TEA, a TEmporal Adaptive SITS semantic segmentation method that enhances the model's resilience under varying sequence lengths. We introduce a teacher model that encapsulates the global sequence knowledge to guide a student model with adaptive temporal input lengths. Specifically, the teacher shapes the student's feature space from the perspectives of intermediate embeddings, prototypes, and soft labels to realize knowledge transfer, while dynamically aggregating the student model to mitigate knowledge forgetting. Finally, we introduce full-sequence reconstruction as an auxiliary task to further enhance the quality of representations across inputs of varying temporal lengths. Through extensive experiments, we demonstrate that our method brings remarkable improvements across inputs of different temporal lengths on common benchmarks. Our code will be publicly available.

preprint, 2026, arXiv

The Extrapolation Cliff in On-Policy Distillation of Near-Deterministic Structured Outputs

On-policy distillation (OPD) is widely used for LLM post-training. When pushed with a reward-extrapolation coefficient lambda > 1, the student can lift past the teacher in domain, but past a threshold lambda* the same step violates the output contract on structured-output tasks. In a single-position Bernoulli reduction, we derive a closed-form base-relative clip-safety threshold lambda*(p,b,c) determined by three measurable quantities: the teacher modal probability, the warm-start mass, and the importance-sampling clip strength. Above lambda*, the extrapolated fixed point exits the clip-safe region, changing training from format-preserving to format-collapsing. We extend the rule to calibrated K-ary listwise JSON tasks where a single binding equivalence class dominates the output contract and SFT retains parse headroom. On Amazon Fashion, three pre-registered tests--a fine-grid cliff interval, a budget-extension test, and a small-clip cross-prediction--fall within their locked prediction windows, with the small-clip value matching the closed-form prediction below grid resolution. Operating just below lambda*, ListOPD brings a 1.7B Qwen3 student to in-domain parity with an 8B-SFT baseline at one-fifth the parameters. The gain is driven primarily by format adherence: NDCG@1 on parsed outputs remains flat across lambda, while parse validity sharply changes at the predicted boundary. The cliff diagnostic is rubric-independent, whereas the parity claim uses a Gemini-graded rubric and inherits that evaluator's exposure.

preprint, 2026, arXiv

Think When Needed: Adaptive Reasoning-Driven Multimodal Embeddings with a Dual-LoRA Architecture

Multimodal large language models (MLLMs) have emerged as a powerful backbone for multimodal embeddings. Recent methods introduce chain-of-thought (CoT) reasoning into the embedding pipeline to improve retrieval quality, but remain costly in both model size and inference cost. They typically employ separate reasoner and embedder with substantial parameter overhead, and generate CoT indiscriminately for every input. However, we observe that for simple inputs, discriminative embeddings already perform well, and redundant reasoning can even mislead the model, degrading performance. To address these limitations, we propose Think When Needed (TWN), a unified multimodal embedding framework with adaptive reasoning. TWN introduces a dual-LoRA architecture that attaches reasoning and embedding adapters to a shared frozen backbone, detaching gradients at their interface to mitigate gradient conflicts introduced by joint optimization while keeping parameters close to a single model. Building on this, an adaptive think mechanism uses a self-supervised routing gate to decide per input whether to generate CoT, skipping unnecessary reasoning to reduce inference overhead and even improve retrieval quality. We further explore embedding-guided RL to optimize CoT quality beyond supervised training. On the 78 tasks of MMEB-V2, TWN achieves state-of-the-art embedding quality while being substantially more efficient than existing generative methods, requiring only 3-5% additional parameters relative to the backbone and up to 50% fewer reasoning tokens compared to the full generative mode.

preprint, 2026, arXiv

Towards Customized Multimodal Role-Play

Unified multimodal understanding and generation models enable richer human-AI interaction. Yet jointly customizing a character's persona, dialogue style, and visual identity while maintaining output consistency across modalities remains largely unexplored. To mitigate this gap, we introduce a new task, Customized Multimodal Role-Play (CMRP). We construct the RoleScape-20 dataset comprising 20 characters, including training and evaluation data that cover persona, stylistic descriptions, visual/expressive cues, and text-image interactions. Building on a unified model, we devise UniCharacter, a two-stage training framework containing Unified Supervised Finetuning (Unified-SFT) and character-specific group relative policy optimization (Character-GRPO). Given only 10 images plus corresponding interaction examples, the model acquires the target character and exhibits coherent persona, style, and visual identity in both generated text and images. This process takes about 100 GPU hours. Experiments on the RoleScape-20 dataset show that the proposed method substantially outperforms prior approaches. Ablation studies further validate the effectiveness of our cross-modal consistency design and few-shot customization strategy. We argue that CMRP, coupled with unified modeling, provides a basis for next-generation characterful and immersive interactive agents.

preprint, 2026, arXiv

Unified Personalized Understanding, Generating and Editing

Unified large multimodal models (LMMs) have achieved remarkable progress in general-purpose multimodal understanding and generation. However, they still operate under a "one-size-fits-all" paradigm and struggle to model user-specific concepts (e.g., generate a photo of <maeve>) in a consistent and controllable manner. Existing personalization methods typically rely on external retrieval, which is inefficient and poorly integrated into unified multimodal pipelines. Recent personalized unified models introduce learnable soft prompts to encode concept information, yet they either couple understanding and generation or depend on complex multi-stage training, leading to cross-task interference and ultimately to fuzzy or misaligned personalized knowledge. We present OmniPersona, an end-to-end personalization framework for unified LMMs that, for the first time, integrates personalized understanding, generation, and image editing within a single architecture. OmniPersona introduces structurally decoupled concept tokens, allocating dedicated subspaces for different tasks to minimize interference, and incorporates an explicit knowledge replay mechanism that propagates personalized attribute knowledge across tasks, enabling consistent personalized behavior. To systematically evaluate unified personalization, we propose OmniPBench, extending the public UnifyBench concept set with personalized editing tasks and cross-task evaluation protocols integrating understanding, generation, and editing. Experimental results demonstrate that OmniPersona delivers competitive and robust performance across diverse personalization tasks. We hope OmniPersona will serve as a strong baseline and spur further research on controllable, unified personalization.

preprint, 2024, arXiv

Hybrid Vector Message Passing for Generalized Bilinear Factorization

In this paper, we propose a new message passing algorithm that utilizes hybrid vector message passing (HVMP) to solve the generalized bilinear factorization (GBF) problem. The proposed GBF-HVMP algorithm integrates expectation propagation (EP) and variational message passing (VMP) via variational free energy minimization, yielding tractable Gaussian messages. Furthermore, GBF-HVMP enables vector/matrix variables rather than scalar ones in message passing, resulting in a loop-free Bayesian network that improves convergence. Numerical results show that GBF-HVMP significantly outperforms state-of-the-art methods in terms of NMSE performance and computational complexity.

preprint, 2022, arXiv

Bayesian Inverse Uncertainty Quantification of the Physical Model Parameters for the Spallation Neutron Source First Target Station

The reliability of the mercury spallation target is mission-critical for the neutron science program of the spallation neutron source at the Oak Ridge National Laboratory. We present an inverse uncertainty quantification (UQ) study using the Bayesian framework for the mercury equation of state model parameters, with the assistance of polynomial chaos expansion surrogate models. By leveraging high-fidelity structural mechanics simulations and real measured strain data, the inverse UQ results reveal a maximum-a-posteriori estimate, mean, and standard deviation of $6.5\times 10^4$ ($6.49\times 10^4 \pm 2.39\times 10^3$) Pa for the tensile cutoff threshold, $12112.1$ ($12111.8 \pm 14.9$) kg/m$^3$ for the mercury density, and $1850.4$ ($1849.7 \pm 5.3$) m/s for the mercury speed of sound. These values do not necessarily represent the nominal mercury physical properties, but the ones that fit the strain data and the solid mechanics model we have used, and can be explained by three reasons: The limitations of the computer model or what is known as the model-form uncertainty, the biases and errors in the experimental data, and the mercury cavitation damage that also contributes to the change in mercury behavior. Consequently, the equation of state model parameters try to compensate for these effects to improve fitness to the data. The mercury target simulations using the updated parametric values result in an excellent agreement with 88% average accuracy compared to experimental data, 6% average increase compared to reference parameters, with some sensors experiencing an increase of more than 25%. With a more accurate simulated strain response, the component fatigue analysis can utilize the comprehensive strain history data to evaluate the target vessel's lifetime closer to its real limit, saving tremendous target costs.

preprint, 2022, arXiv

ConfPred: A layered intergrowth structure prediction model based on confinement self-assembly in two-dimensional interlayer space

We constructed a simple but effective model to predict layered intergrowth structures by combining the self-assembly phenomenon in confined space with the sandwich configuration of layered materials. In this model, a two-dimensional confined space is constructed by two known block layers, such as the Fe$_2$As$_2$ block of iron-based superconductors. Then, the crystal structure prediction is carried out only inside the confined space to search for brand-new block layers. We realized this model on the basis of the USPEX9.4 code. In the test, the already existing iron-based superconductors can always be successfully found, such as Ba$_2$Ti$_2$Fe$_2$As$_4$O, Sr$_3$Sc$_2$Fe$_2$As$_2$O$_5$, Sr$_4$V$_2$Fe$_2$As$_2$O$_6$, and so on. The comparison test suggests that our model has remarkable advantages in searching for intergrowth structures. With this space confinement prediction model, a structure prediction of layered intergrowth materials even with up to six elements can be performed with an acceptable machine time consumption. So far, we have done some multi-composition crystal structure predictions of iron-based superconductors and found several stable and metastable structures, such as Ba$_2$Fe$_2$As$_3$, Eu$_2$Fe$_2$As$_3$, La$_2$O$_2$ClFeAs, LiOMn$_2$As, and Li$_4$OFe$_2$As$_2$.

preprint, 2022, arXiv

Ego4D: Around the World in 3,000 Hours of Egocentric Video

We introduce Ego4D, a massive-scale egocentric video dataset and benchmark suite. It offers 3,670 hours of daily-life activity video spanning hundreds of scenarios (household, outdoor, workplace, leisure, etc.) captured by 931 unique camera wearers from 74 worldwide locations and 9 different countries. The approach to collection is designed to uphold rigorous privacy and ethics standards with consenting participants and robust de-identification procedures where relevant. Ego4D dramatically expands the volume of diverse egocentric video footage publicly available to the research community. Portions of the video are accompanied by audio, 3D meshes of the environment, eye gaze, stereo, and/or synchronized videos from multiple egocentric cameras at the same event. Furthermore, we present a host of new benchmark challenges centered around understanding the first-person visual experience in the past (querying an episodic memory), present (analyzing hand-object manipulation, audio-visual conversation, and social interactions), and future (forecasting activities). By publicly sharing this massive annotated dataset and benchmark suite, we aim to push the frontier of first-person perception. Project page: https://ego4d-data.org/

preprint, 2022, arXiv

Egocentric Deep Multi-Channel Audio-Visual Active Speaker Localization

Augmented reality devices have the potential to enhance human perception and enable other assistive functionalities in complex conversational environments. Effectively capturing the audio-visual context necessary for understanding these social interactions first requires detecting and localizing the voice activities of the device wearer and the surrounding people. These tasks are challenging due to their egocentric nature: the wearer's head motion may cause motion blur, surrounding people may appear in difficult viewing angles, and there may be occlusions, visual clutter, audio noise, and bad lighting. Under these conditions, previous state-of-the-art active speaker detection methods do not give satisfactory results. Instead, we tackle the problem from a new setting using both video and multi-channel microphone array audio. We propose a novel end-to-end deep learning approach that is able to give robust voice activity detection and localization results. In contrast to previous methods, our method localizes active speakers from all possible directions on the sphere, even outside the camera's field of view, while simultaneously detecting the device wearer's own voice activity. Our experiments show that the proposed method gives superior results, can run in real time, and is robust against noise and clutter.

preprint, 2022, arXiv

Hyperlink-induced Pre-training for Passage Retrieval in Open-domain Question Answering

To alleviate the data scarcity problem in training question answering systems, recent works propose additional intermediate pre-training for dense passage retrieval (DPR). However, there still remains a large discrepancy between the provided upstream signals and the downstream question-passage relevance, which leads to less improvement. To bridge this gap, we propose the HyperLink-induced Pre-training (HLP), a method to pre-train the dense retriever with the text relevance induced by hyperlink-based topology within Web documents. We demonstrate that the hyperlink-based structures of dual-link and co-mention can provide effective relevance signals for large-scale pre-training that better facilitate downstream passage retrieval. We investigate the effectiveness of our approach across a wide range of open-domain QA datasets under zero-shot, few-shot, multi-hop, and out-of-domain scenarios. The experiments show our HLP outperforms the BM25 by up to 7 points as well as other pre-training methods by more than 10 points in terms of top-20 retrieval accuracy under the zero-shot scenario. Furthermore, HLP significantly outperforms other pre-training methods under the other scenarios.

preprint, 2022, arXiv

KMIR: A Benchmark for Evaluating Knowledge Memorization, Identification and Reasoning Abilities of Language Models

Previous works show the great potential of pre-trained language models (PLMs) for storing a large amount of factual knowledge. However, to figure out whether PLMs can be reliable knowledge sources and used as alternative knowledge bases (KBs), we need to further explore some critical features of PLMs. Firstly, knowledge memorization and identification abilities: traditional KBs can store various types of entities and relationships; do PLMs have a high knowledge capacity to store different types of knowledge? Secondly, reasoning ability: a qualified knowledge source should not only provide a collection of facts, but also support a symbolic reasoner. Can PLMs derive new knowledge based on the correlations between facts? To evaluate these features of PLMs, we propose a benchmark, named Knowledge Memorization, Identification, and Reasoning test (KMIR). KMIR covers 3 types of knowledge, including general knowledge, domain-specific knowledge, and commonsense, and provides 184,348 well-designed questions. Preliminary experiments with various representative pre-trained language models on KMIR reveal many interesting phenomena: 1) The memorization ability of PLMs depends more on the number of parameters than on training schemes. 2) Current PLMs struggle to robustly remember the facts. 3) Model compression technology retains the amount of knowledge well, but hurts the identification and reasoning abilities. We hope KMIR can facilitate the design of PLMs as better knowledge sources.

preprint, 2022, arXiv

Model Calibration of the Liquid Mercury Spallation Target using Evolutionary Neural Networks and Sparse Polynomial Expansions

The mercury constitutive model predicting the strain and stress in the target vessel plays a central role in improving the lifetime prediction and future target designs of the mercury targets at the Spallation Neutron Source (SNS). We leverage the experimental strain data collected over multiple years to improve the mercury constitutive model through a combination of large-scale simulations of the target behavior and the use of machine learning tools for parameter estimation. We present two interdisciplinary approaches for surrogate-based model calibration of expensive simulations using evolutionary neural networks and sparse polynomial expansions. The experiments and results of the two methods show very good agreement for the solid mechanics simulation of the mercury spallation target. The proposed methods are used to calibrate the tensile cutoff threshold, mercury density, and mercury speed of sound during intense proton pulse experiments. Using experimental strain data from the mercury target sensors, the newly calibrated simulations achieve a 7% average improvement in signal prediction accuracy and an 8% reduction in mean absolute error compared to previously reported reference parameters, with some sensors experiencing up to 30% improvement. The proposed calibrated simulations can significantly aid in fatigue analysis to estimate the mercury target lifetime and integrity, which reduces abrupt target failure and saves a tremendous amount of costs. However, an important conclusion from this work points to a deficiency in the current constitutive model based on the equation of state in capturing the full physics of the spallation reaction. Given that some of the calibrated parameters that show good agreement with the experimental data can be nonphysical mercury properties, we need a more advanced two-phase flow model to capture bubble dynamics and mercury cavitation.

preprint, 2022, arXiv

PReGAN: Answer Oriented Passage Ranking with Weakly Supervised GAN

Beyond topical relevance, passage ranking for open-domain factoid question answering also requires a passage to contain an answer (answerability). While a few recent studies have incorporated some reading capability into a ranker to account for answerability, the ranker is still hindered by the noisy nature of the training data typically available in this area, which considers any passage containing an answer entity as a positive sample. However, the answer entity in a passage is not necessarily mentioned in relation to the given question. To address the problem, we propose an approach called PReGAN for Passage Reranking based on Generative Adversarial Neural networks, which incorporates a discriminator on answerability, in addition to a discriminator on topical relevance. The goal is to force the generator to rank higher a passage that is topically relevant and contains an answer. Experiments on five public datasets show that PReGAN can better rank appropriate passages, which in turn boosts the effectiveness of QA systems, and outperforms the existing approaches without using external data.

preprint, 2022, arXiv

Residual-Aided End-to-End Learning of Communication System without Known Channel

Leveraging powerful deep learning techniques, end-to-end (E2E) learning of a communication system is able to outperform the classical communication system. Unfortunately, such a system cannot be trained by deep learning without a known channel. To deal with this problem, a generative adversarial network (GAN) based training scheme has recently been proposed to imitate the real channel. However, the gradient vanishing and overfitting problems of GANs result in serious performance degradation of E2E learning of the communication system. To mitigate these two problems, we propose a residual-aided GAN (RA-GAN) based training scheme in this paper. In particular, inspired by the idea of residual learning, we propose a residual generator to mitigate the gradient vanishing problem by realizing more robust gradient backpropagation. Moreover, to cope with the overfitting problem, we reconstruct the loss function for training by adding a regularizer, which limits the representation ability of RA-GAN. Simulation results show that the trained residual generator has better generation performance than the conventional generator, and that the proposed RA-GAN based training scheme can achieve near-optimal block error rate (BLER) performance with a negligible increase in computational complexity in both the theoretical channel model and the ray-tracing based channel dataset.

preprint, 2022, arXiv

Towards Efficient NLP: A Standard Evaluation and A Strong Baseline

Supersized pre-trained language models have pushed the accuracy of various natural language processing (NLP) tasks to a new state-of-the-art (SOTA). Rather than pursuing unreachable SOTA accuracy, more and more researchers are paying attention to model efficiency and usability. Unlike accuracy, the metric for efficiency varies across studies, making them hard to compare fairly. To that end, this work presents ELUE (Efficient Language Understanding Evaluation), a standard evaluation and a public leaderboard for efficient NLP models. ELUE is dedicated to depicting the Pareto frontier for various language understanding tasks, such that it can tell whether and how much a method achieves Pareto improvement. Along with the benchmark, we also release a strong baseline, ElasticBERT, which allows BERT to exit at any layer in both static and dynamic ways. We demonstrate that ElasticBERT, despite its simplicity, outperforms or performs on par with SOTA compressed and early-exiting models. With ElasticBERT, the proposed ELUE has a strong Pareto frontier and provides a better evaluation for efficient NLP models.
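
As a rough illustration of what a Pareto frontier over efficiency and accuracy means in general, the sketch below keeps only the models that no cheaper model matches or beats. It is not ELUE's scoring code; the function name `pareto_front`, the `(name, cost, score)` tuple layout, and the sample numbers are all hypothetical.

```python
from typing import List, Tuple

Model = Tuple[str, float, float]  # (name, cost, score); hypothetical layout


def pareto_front(models: List[Model]) -> List[Model]:
    """Return the models not dominated in (cost, score).

    Lower cost is better, higher score is better. Exact duplicates of an
    already-kept point are dropped.
    """
    # Sort by ascending cost; at equal cost, put the higher score first.
    ordered = sorted(models, key=lambda m: (m[1], -m[2]))
    front: List[Model] = []
    best_score = float("-inf")
    for name, cost, score in ordered:
        # Keep a model only if it beats every model that costs no more.
        if score > best_score:
            front.append((name, cost, score))
            best_score = score
    return front


# Made-up numbers, for illustration only.
candidates = [("A", 1.0, 70.0), ("B", 2.0, 75.0), ("C", 3.0, 74.0), ("D", 4.0, 80.0)]
front_names = [name for name, _, _ in pareto_front(candidates)]
```

In this made-up example, model C is dropped because B is both cheaper and more accurate. A new method would count as a Pareto improvement if adding it changed the returned list.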

preprint, 2021, arXiv

Feasible Computationally Efficient Path Planning for UAV Collision Avoidance

This paper presents a robust, computationally efficient real-time collision avoidance algorithm for Unmanned Aerial Vehicles (UAVs), namely the Memory-based Wall Following-Artificial Potential Field (MWF-APF) method. The new algorithm switches between the Wall-Following Method (WFM) and the Artificial Potential Field (APF) method with improved situation awareness capability. The historical trajectory is taken into account to avoid repeating wrong decisions. Furthermore, it can be effectively applied to platforms with low computing capability. As an example, a quad-rotor equipped with a limited number of Time-of-Flight (TOF) rangefinders is adopted to validate the effectiveness and efficiency of this algorithm. Both software simulation and physical flight tests have been conducted to demonstrate the capability of the MWF-APF method in complex scenarios.
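
For readers unfamiliar with the APF component, a minimal sketch of the classic artificial potential field force in 2D follows. It covers only the textbook attractive and repulsive terms, not the wall-following switch or the memory mechanism of MWF-APF. The function name `apf_force`, the gains `k_att` and `k_rep`, and the `influence` radius are illustrative assumptions, not values from the paper.

```python
import math
from typing import Iterable, Tuple

Point = Tuple[float, float]


def apf_force(
    pos: Point,
    goal: Point,
    obstacles: Iterable[Point],
    k_att: float = 1.0,     # attractive gain (assumed value)
    k_rep: float = 1.0,     # repulsive gain (assumed value)
    influence: float = 2.0  # obstacles farther than this are ignored (assumed)
) -> Point:
    """Classic potential-field force: pull toward the goal, push from obstacles."""
    # Attractive term: gradient of 0.5 * k_att * |goal - pos|^2.
    fx = k_att * (goal[0] - pos[0])
    fy = k_att * (goal[1] - pos[1])
    for ox, oy in obstacles:
        dx, dy = pos[0] - ox, pos[1] - oy
        d = math.hypot(dx, dy)
        if 0.0 < d < influence:
            # Repulsive term: negative gradient of
            # 0.5 * k_rep * (1/d - 1/influence)^2, directed away from the obstacle.
            mag = k_rep * (1.0 / d - 1.0 / influence) / (d * d)
            fx += mag * dx / d
            fy += mag * dy / d
    return fx, fy


free_fx, free_fy = apf_force((0.0, 0.0), (1.0, 0.0), [])
blocked_fx, blocked_fy = apf_force((0.0, 0.0), (3.0, 0.0), [(1.0, 0.0)])
```

Pure APF can stall in local minima where the two terms cancel, which is the usual reason for pairing it with a second behavior such as wall following.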

preprint, 2021, arXiv

Overcoming Long-term Catastrophic Forgetting through Adversarial Neural Pruning and Synaptic Consolidation

Artificial neural networks face the well-known problem of catastrophic forgetting. What's worse, the degradation of previously learned skills becomes more severe as the task sequence increases, known as the long-term catastrophic forgetting. It is due to two facts: first, as the model learns more tasks, the intersection of the low-error parameter subspace satisfying for these tasks becomes smaller or even does not exist; second, when the model learns a new task, the cumulative error keeps increasing as the model tries to protect the parameter configuration of previous tasks from interference. Inspired by the memory consolidation mechanism in mammalian brains with synaptic plasticity, we propose a confrontation mechanism in which Adversarial Neural Pruning and synaptic Consolidation (ANPyC) is used to overcome the long-term catastrophic forgetting issue. The neural pruning acts as long-term depression to prune task-irrelevant parameters, while the novel synaptic consolidation acts as long-term potentiation to strengthen task-relevant parameters. During the training, this confrontation achieves a balance in that only crucial parameters remain, and non-significant parameters are freed to learn subsequent tasks. ANPyC avoids forgetting important information and makes the model efficient to learn a large number of tasks. Specifically, the neural pruning iteratively relaxes the current task's parameter conditions to expand the common parameter subspace of the task; the synaptic consolidation strategy, which consists of a structure-aware parameter-importance measurement and an element-wise parameter updating strategy, decreases the cumulative error when learning new tasks. The full source code is available at https://github.com/GeoX-Lab/ANPyC.

preprint2021arXiv

Robopheus: A Virtual-Physical Interactive Mobile Robotic Testbed

The mobile robotic testbed is an essential and critical support to verify the effectiveness of mobile robotics research. This paper introduces a novel multi-robot testbed, named Robopheus, which exploits the ideas of virtual-physical modeling in digital-twin. Unlike most existing testbeds, the developed Robopheus constructs a bridge that connects the traditional physical hardware and virtual simulation testbeds, providing scalable, interactive, and high-fidelity simulations-tests on both sides. Another salient feature of the Robopheus is that it enables a new form to learn the actual models from the physical environment dynamically and is compatible with heterogeneous robot chassis and controllers. In turn, the virtual world's learned models are further leveraged to approximate the robot dynamics online on the physical side. Extensive experiments demonstrate the extraordinary performance of the Robopheus. Significantly, the physical-virtual interaction design increases the trajectory accuracy of a real robot by 300%, compared with that of not using the interaction.

preprint2020arXiv

Design, Control, and Applications of a Soft Robotic Arm

This paper presents the design, control, and applications of a multi-segment soft robotic arm. In order to design a soft arm with large load capacity, several design principles are proposed by analyzing two kinds of buckling issues, under which we present a novel structure named Honeycomb Pneumatic Networks (HPN). A parameter optimization method, based on the finite element method (FEM), is proposed to optimize the HPN Arm design parameters. Through a quick fabrication process, several prototypes with different performance are made, one of which can achieve a transverse load capacity of 3 kg under 3 bar pressure. Next, considering different internal and external conditions, we develop three controllers according to different model precision. Specifically, based on an accurate model, an open-loop controller is realized by combining the piece-wise constant curvature (PCC) modeling method and a machine learning method. Based on an inaccurate model, a feedback controller, using an estimated Jacobian, is realized in 3D space. A model-free controller, using reinforcement learning to learn a control policy rather than a model, is realized in the 2D plane, with minimal training data. Then, these three control methods are compared on the same experimental platform to explore the applicability of different methods under different conditions. Lastly, we find that the soft arm can greatly simplify the perception, planning, and control of interaction tasks through its compliance, which is its main advantage over a rigid arm. Through extensive experiments in three interaction application scenarios (human-robot interaction, a free space interaction task, and a confined space interaction task), we demonstrate the potential application prospects of the soft arm.

preprint2020arXiv

Learning Differential Diagnosis of Skin Conditions with Co-occurrence Supervision using Graph Convolutional Networks

Skin conditions are reported to be the 4th leading cause of nonfatal disease burden worldwide. However, given the colossal spectrum of skin disorders defined clinically and the shortage of dermatology expertise, diagnosing skin conditions in a timely and accurate manner remains a challenging task. Using computer vision technologies, deep learning systems have proven effective in assisting clinicians with image diagnostics in radiology, ophthalmology and more. In this paper, we propose a deep learning system (DLS) that may predict differential diagnosis of skin conditions using clinical images. Our DLS formulates the differential diagnostics as a multi-label classification task over 80 conditions when only incomplete image labels are available. We tackle the label incompleteness problem by combining a classification network with a Graph Convolutional Network (GCN) that characterizes label co-occurrence and effectively regularizes it towards a sparse representation. Our approach is demonstrated on 136,462 clinical images, and we conclude that the classification accuracy greatly benefits from the co-occurrence supervision. Our DLS achieves 93.6% top-5 accuracy on 12,378 test images and consistently outperforms the baseline classification network.

preprint2020arXiv

Real-time 3D Deep Multi-Camera Tracking

Tracking a crowd in 3D using multiple RGB cameras is a challenging task. Most previous multi-camera tracking algorithms are designed for offline setting and have high computational complexity. Robust real-time multi-camera 3D tracking is still an unsolved problem. In this work, we propose a novel end-to-end tracking pipeline, Deep Multi-Camera Tracking (DMCT), which achieves reliable real-time multi-camera people tracking. Our DMCT consists of 1) a fast and novel perspective-aware Deep GroudPoint Network, 2) a fusion procedure for ground-plane occupancy heatmap estimation, 3) a novel Deep Glimpse Network for person detection and 4) a fast and accurate online tracker. Our design fully unleashes the power of deep neural network to estimate the "ground point" of each person in each color image, which can be optimized to run efficiently and robustly. Our fusion procedure, glimpse network and tracker merge the results from different views, find people candidates using multiple video frames and then track people on the fused heatmap. Our system achieves the state-of-the-art tracking results while maintaining real-time performance. Apart from evaluation on the challenging WILDTRACK dataset, we also collect two more tracking datasets with high-quality labels from two different environments and camera settings. Our experimental results confirm that our proposed real-time pipeline gives superior results to previous approaches.

preprint2020arXiv

Review of data analysis in vision inspection of power lines with an in-depth discussion of deep learning technology

The widespread popularity of unmanned aerial vehicles enables an immense amount of power lines inspection data to be collected. How to employ massive inspection data especially the visible images to maintain the reliability, safety, and sustainability of power transmission is a pressing issue. To date, substantial works have been conducted on the analysis of power lines inspection data. With the aim of providing a comprehensive overview for researchers who are interested in developing a deep-learning-based analysis system for power lines inspection data, this paper conducts a thorough review of the current literature and identifies the challenges for future research. Following the typical procedure of inspection data analysis, we categorize current works in this area into component detection and fault diagnosis. For each aspect, the techniques and methodologies adopted in the literature are summarized. Some valuable information is also included such as data description and method performance. Further, an in-depth discussion of existing deep-learning-related analysis methods in power lines inspection is proposed. Finally, we conclude the paper with several research trends for the future of this area, such as data quality problems, small object detection, embedded application, and evaluation baseline.

preprint2020arXiv

Variance Reduction for Deep Q-Learning using Stochastic Recursive Gradient

Deep Q-learning algorithms often suffer from poor gradient estimates with excessive variance, resulting in unstable training and poor sampling efficiency. Stochastic variance-reduced gradient methods such as SVRG have been applied to reduce the estimation variance (Zhao et al. 2019). However, due to the online instance generation nature of reinforcement learning, directly applying SVRG to deep Q-learning faces the problem of inaccurate estimation of the anchor points, which dramatically limits the potential of SVRG. To address this issue, and inspired by the recursive gradient variance reduction algorithm SARAH (Nguyen et al. 2017), this paper proposes to introduce the recursive framework for updating the stochastic gradient estimates in deep Q-learning, yielding a novel algorithm called SRG-DQN. Unlike the SVRG-based algorithms, SRG-DQN designs a recursive update of the stochastic gradient estimate. The parameter update is along an accumulated direction using the past stochastic gradient information, and can therefore dispense with the estimation of the full gradients as the anchors. Additionally, SRG-DQN involves the Adam process for further accelerating the training process. Theoretical analysis and the experimental results on well-known reinforcement learning tasks demonstrate the efficiency and effectiveness of the proposed SRG-DQN algorithm.
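
The recursive estimate referred to here can be illustrated on a toy problem. The sketch below applies the SARAH recursion, v_t = g_i(w_t) - g_i(w_{t-1}) + v_{t-1}, to a least-squares objective; it is not SRG-DQN. The function name, step size and problem are assumptions chosen for illustration, there is no Q-network or Adam step, and the initial estimate is a full gradient as in plain SARAH, which is exactly the anchor the abstract says SRG-DQN avoids.

```python
import numpy as np

def sarah_least_squares(A, b, step=0.01, inner_steps=300, seed=0):
    """SARAH-style recursion on 0.5 * mean((A w - b)^2) (illustrative only)."""
    rng = np.random.default_rng(seed)
    n, d = A.shape
    w_prev = np.zeros(d)
    # Initial direction: one full gradient (plain SARAH, an assumption here).
    v = A.T @ (A @ w_prev - b) / n
    w = w_prev - step * v
    for _ in range(inner_steps):
        i = rng.integers(n)
        a_i = A[i]
        g_new = a_i * (a_i @ w - b[i])        # sample gradient at current iterate
        g_old = a_i * (a_i @ w_prev - b[i])   # same sample at previous iterate
        v = g_new - g_old + v                 # recursive gradient estimate
        w_prev, w = w, w - step * v
    return w
```

Because each update only corrects the previous estimate with a gradient difference on a single sample, no stored anchor point needs to be re-evaluated inside the loop.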

preprint2016arXiv

A note on the convergence of nonconvex line search

In this note, we consider the line search for a class of abstract nonconvex algorithms which have been deeply studied in the Kurdyka-Lojasiewicz theory. We provide a weak convergence result for the line search in general. When the objective function satisfies the Kurdyka-Lojasiewicz property and certain additional assumptions, a global convergence result can be derived. An application to L0-regularized least squares minimization is presented at the end of the paper.

preprint2016arXiv

Accelerated atomistic simulation study on the stability and mobility of carbon tri-interstitial cluster in cubic SiC

Using a combination of kinetic Activation Relaxation Technique with empirical potential and ab initio based climbing image nudged elastic band method, we perform an extensive search of the migration and rotation paths of the most stable carbon tri-interstitial cluster in cubic SiC. Our research reveals paths with the lowest energy barriers to migration, rotation, and dissociation of the most stable cluster. The kinetic properties of the most stable cluster, including its mobility, rotation behavior at different temperatures and stability against high temperature annealing, are discussed based on the calculated transition barriers. In addition to fundamental insights, our study provides a methodology for investigation of other extended defects in a technologically important material.

preprint2016arXiv

Crystal Chemistry and Structural Design of Iron-Based Superconductors

The second class of high-temperature superconductors (HTSCs), iron-based pnictides and chalcogenides, necessarily contain Fe$_2$$X_2$ ("$X$" refers to a pnictogen or a chalcogen element) layers, just like the first class of HTSCs which possess the essential CuO$_2$ sheets. So far, dozens of iron-based HTSCs, classified into nine groups, have been discovered. In this article, the crystal-chemistry aspects of the known iron-based superconductors are reviewed and summarized by employing "hard and soft acids and bases (HSAB)" concept. Based on these understandings, we propose an alternative route to exploring new iron-based superconductors via rational structural design.

preprint2016arXiv

Detangling People: Individuating Multiple Close People and Their Body Parts via Region Assembly

Today's person detection methods work best when people are in common upright poses and appear reasonably well spaced out in the image. However, in many real images, that's not what people do. People often appear quite close to each other, e.g., with limbs linked or heads touching, and their poses are often not pedestrian-like. We propose an approach to detangle people in multi-person images. We formulate the task as a region assembly problem. Starting from a large set of overlapping regions from body part semantic segmentation and generic object proposals, our optimization approach reassembles those pieces together into multiple person instances. It enforces that the composed body part regions of each person instance obey constraints on relative sizes, mutual spatial relationships, foreground coverage, and exclusive label assignments when overlapping. Since optimal region assembly is a challenging combinatorial problem, we present a Lagrangian relaxation method to accelerate the lower bound estimation, thereby enabling a fast branch and bound solution for the global optimum. As output, our method produces a pixel-level map indicating both 1) the body part labels (arm, leg, torso, and head), and 2) which parts belong to which individual person. Our results on three challenging datasets show our method is robust to clutter, occlusion, and complex poses. It outperforms a variety of competing methods, including existing detector CRF methods and region CNN approaches. In addition, we demonstrate its impact on a proxemics recognition task, which demands a precise representation of "whose body part is where" in crowded images.

preprint2016arXiv

Radiation-induced mobility of small defect clusters in covalent materials

Although defect clusters are detrimental to electronic and mechanical properties of semiconductor materials, annihilation of such clusters is limited by their lack of thermal mobility due to high migration barriers. Here, we find that small clusters in bulk SiC (a covalent material of importance for both electronic and nuclear applications) can become mobile at room temperature under the influence of electron radiation. So far, direct observation of radiation-induced diffusion of defect clusters in bulk materials has not been demonstrated yet. This finding was made possible by low angle annular dark field (LAADF) scanning transmission electron microscopy (STEM) combined with non-rigid registration technique to remove sample instability, which enables atomic resolution imaging of small migrating defect clusters. We show that the underlying mechanism of this athermal diffusion is ballistic collision between incoming electrons and cluster atoms. Our findings suggest that defect clusters may be mobile under certain irradiation conditions, changing current understanding of cluster annealing process in irradiated covalent materials.

preprint2016arXiv

Seeing Invisible Poses: Estimating 3D Body Pose from Egocentric Video

Understanding the camera wearer's activity is central to egocentric vision, yet one key facet of that activity is inherently invisible to the camera--the wearer's body pose. Prior work focuses on estimating the pose of hands and arms when they come into view, but this 1) gives an incomplete view of the full body posture, and 2) prevents any pose estimate at all in many frames, since the hands are only visible in a fraction of daily life activities. We propose to infer the "invisible pose" of a person behind the egocentric camera. Given a single video, our efficient learning-based approach returns the full body 3D joint positions for each frame. Our method exploits cues from the dynamic motion signatures of the surrounding scene--which changes predictably as a function of body pose--as well as static scene structures that reveal the viewpoint (e.g., sitting vs. standing). We further introduce a novel energy minimization scheme to infer the pose sequence. It uses soft predictions of the poses per time instant together with a non-parametric model of human pose dynamics over longer windows. Our method outperforms an array of possible alternatives, including deep learning approaches for direct pose regression from images.

preprint2016arXiv

Superconductivity and Ferromagnetism in Hole-Doped RbEuFe$_4$As$_4$

We discover a robust coexistence of superconductivity and ferromagnetism in an iron arsenide RbEuFe$_4$As$_4$. The new material crystallizes in an intergrowth structure of RbFe$_2$As$_2$ and EuFe$_2$As$_2$, such that the Eu sublattice turns out to be primitive instead of being body-centered in EuFe$_2$As$_2$. The FeAs layers, featured by asymmetric As coordinations, are hole doped due to charge homogenization. Our combined measurements of electrical transport, magnetization and heat capacity unambiguously and consistently indicate bulk superconductivity at 36.5 K in the FeAs layers and ferromagnetism at 15 K in the Eu sublattice. Interestingly, the Eu-spin ferromagnetic ordering belongs to a rare third-order transition, according to the Ehrenfest classification of phase transition. We also identify an additional anomaly at $\sim$ 5 K, which is possibly associated with the interplay between superconductivity and ferromagnetism.

preprint2015arXiv

Electronic structure of quasi-one-dimensional superconductor K$_2$Cr$_3$As$_3$ from first-principles calculations

The electronic structure of quasi-one-dimensional superconductor K$_2$Cr$_3$As$_3$ is studied through systematic first-principles calculations. The ground state of K$_2$Cr$_3$As$_3$ is paramagnetic but very close to a ferromagnetic instability. Close to the Fermi level, the Cr-3d$_{z^2}$, d$_{xy}$, and d$_{x^2-y^2}$ orbitals dominate the electronic states, and three bands cross $E_F$ to form one 3D Fermi surface sheet and two quasi-1D sheets. The electron DOS at $E_F$ is less than 1/3 of the experimental value, indicating an intermediate electron renormalization factor around $E_F$. Despite the relatively small atomic numbers, the antisymmetric spin-orbit coupling splitting is sizable ($\approx$ 60 meV) on the 3D Fermi surface sheet as well as on one of the quasi-1D sheets. Finally, the imaginary part of bare electron susceptibility shows large peaks at $Γ$, suggesting the existence of large ferromagnetic spin fluctuation in the compound.

preprint2015arXiv

New fast divide-and-conquer algorithms for the symmetric tridiagonal eigenvalue problem

In this paper, two accelerated divide-and-conquer algorithms are proposed for the symmetric tridiagonal eigenvalue problem, which cost $O(N^2r)$ flops in the worst case, where $N$ is the dimension of the matrix and $r$ is a modest number depending on the distribution of eigenvalues. Both of these algorithms use hierarchically semiseparable (HSS) matrices to approximate some intermediate eigenvector matrices which are Cauchy-like matrices and are off-diagonally low-rank. The difference between these two versions lies in using different HSS construction algorithms: one (denoted by ADC1) uses a structured low-rank approximation method and the other (ADC2) uses a randomized HSS construction algorithm. For the ADC2 algorithm, a method is proposed to estimate the off-diagonal rank. Numerous experiments have been done to show their stability and efficiency. These algorithms are implemented in parallel in a shared memory environment, and some parallel implementation details are included. Comparing the ADCs with highly optimized multithreaded libraries such as Intel MKL, we find that ADCs could be more than 6x faster for some large matrices with few deflations.
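
The divide step that such algorithms build on can be checked numerically. The sketch below shows the standard rank-one tearing of a symmetric tridiagonal matrix and the resulting merge problem; the function names are hypothetical. It uses a dense eigenvalue solver for the merge instead of a secular-equation solver, and it contains none of the HSS acceleration that ADC1 and ADC2 add.

```python
import numpy as np

def tridiag(d, e):
    """Dense symmetric tridiagonal matrix from diagonal d and off-diagonal e."""
    return np.diag(d) + np.diag(e, 1) + np.diag(e, -1)

def tear(d, e, k):
    """Write T = blockdiag(T1, T2) + beta * v v^T, split between rows k-1 and k."""
    beta = e[k - 1]
    d1, e1 = d[:k].copy(), e[:k - 1].copy()
    d2, e2 = d[k:].copy(), e[k:].copy()
    d1[-1] -= beta          # compensate the rank-one term on both corner entries
    d2[0] -= beta
    v = np.zeros(len(d))
    v[k - 1] = v[k] = 1.0
    return (d1, e1), (d2, e2), beta, v

def merged_eigenvalues(d, e, k):
    """Solve both halves, then the diagonal-plus-rank-one merge problem."""
    (d1, e1), (d2, e2), beta, _ = tear(d, e, k)
    w1, Q1 = np.linalg.eigh(tridiag(d1, e1))
    w2, Q2 = np.linalg.eigh(tridiag(d2, e2))
    D = np.concatenate([w1, w2])
    # z = Q^T v: last row of Q1 stacked with first row of Q2.
    z = np.concatenate([Q1[-1, :], Q2[0, :]])
    return np.linalg.eigvalsh(np.diag(D) + beta * np.outer(z, z))
```

For any valid split index `k`, `merged_eigenvalues(d, e, k)` should agree with `np.linalg.eigvalsh(tridiag(d, e))` up to rounding.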

preprint2015arXiv

On the solution of stochastic optimization and variational problems in imperfect information regimes

We consider the solution of a stochastic convex optimization problem $\mathbb{E}[f(x;θ^*,ξ)]$ over a closed and convex set $X$ in a regime where $θ^*$ is unavailable and $ξ$ is a suitably defined random variable. Instead, $θ^*$ may be obtained through the solution of a learning problem that requires minimizing a metric $\mathbb{E}[g(θ;η)]$ in $θ$ over a closed and convex set $Θ$. Traditional approaches have been either sequential or direct variational approaches. In the case of the former, this entails the following steps: (i) a solution to the learning problem, namely $θ^*$, is obtained; and (ii) a solution is obtained to the associated computational problem which is parametrized by $θ^*$. Such avenues prove difficult to adopt, particularly since the learning process has to be terminated finitely and consequently, in large-scale instances, sequential approaches may often be corrupted by error. On the other hand, a variational approach requires that the problem be recast as a possibly non-monotone stochastic variational inequality problem in the $(x,θ)$ space, but no first-order stochastic approximation schemes are currently known for the solution of this problem. To resolve the absence of convergent efficient schemes, we present a coupled stochastic approximation scheme which simultaneously solves both the computational and the learning problems. The obtained schemes are shown to be equipped with almost sure convergence properties in regimes where the function $f$ is either strongly convex or merely convex.
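
The idea of updating both iterates at once can be illustrated with a toy problem. The sketch below is a generic pair of projected stochastic gradient sequences and is not claimed to be the authors' exact scheme. The choices $g(θ;η) = \frac{1}{2}(θ-η)^2$ and $f(x;θ,ξ) = \frac{1}{2}(x-θ)^2 + ξx$, the sets $X = Θ = [-10, 10]$, and the $1/k$ step size are all assumptions made for illustration.

```python
import numpy as np

def coupled_sa(theta_mean=2.0, iters=5000, seed=0):
    """Run the computational (x) and learning (theta) updates together."""
    rng = np.random.default_rng(seed)
    x, theta = 0.0, 0.0
    for k in range(1, iters + 1):
        step = 1.0 / k
        eta = theta_mean + rng.normal()   # noisy observation for the learning problem
        xi = rng.normal()                 # zero-mean noise in the computational problem
        grad_x = (x - theta) + xi         # uses the current estimate theta, not theta*
        grad_theta = theta - eta
        # Projected steps onto X = Theta = [-10, 10] (assumed sets).
        x = float(np.clip(x - step * grad_x, -10.0, 10.0))
        theta = float(np.clip(theta - step * grad_theta, -10.0, 10.0))
    return x, theta
```

In this toy setting both `x` and `theta` approach `theta_mean`, even though the `x` update never sees the true parameter, which is the behaviour a sequential approach only gets after the learning problem has been solved to completion.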

preprint2015arXiv

Physical properties and electronic structure of Sr$_2$Cr$_3$As$_2$O$_2$ containing CrO$_2$ and Cr$_2$As$_2$ square-planar lattices

We report the physical properties and electronic structure calculations of a layered chromium oxypnictide, Sr$_2$Cr$_3$As$_2$O$_2$, which crystallizes in a Sr$_2$Mn$_3$As$_2$O$_2$-type structure containing both CrO$_2$ planes and Cr$_2$As$_2$ layers. The newly synthesized material exhibits a metallic conduction with a dominant electron-magnon scattering. Magnetic and specific-heat measurements indicate at least two intrinsic magnetic transitions below room temperature. One is an antiferromagnetic transition at 291 K, probably associated with a spin ordering in the Cr$_2$As$_2$ layers. Another transition is broad, occurring at around 38 K, and possibly due to a short-range spin order in the CrO$_2$ planes. Our first-principles calculations indicate predominant two-dimensional antiferromagnetic exchange couplings, and suggest a KG-type (i.e. K$_2$NiF$_4$ type for CrO$_2$ planes and G type for Cr$_2$As$_2$ layers) magnetic structure, with reduced moments for both Cr sublattices. The corresponding electronic states near the Fermi energy are mostly contributed from Cr-3$d$ orbitals which weakly (modestly) hybridize with the O-2$p$ (As-4$p$) orbitals in the CrO$_2$ (Cr$_2$As$_2$) layers. The bare bandstructure density of states at the Fermi level is only $\sim$1/4 of the experimental value derived from the low-temperature specific-heat data, consistent with the remarkable electron-magnon coupling. The title compound is argued to be a possible candidate to host superconductivity.

preprint2015arXiv

Reduced Dimensionality and Magnetic Frustration in KCr$_3$As$_3$

We study the electronic and magnetic structures of the newly-discovered compound KCr$_3$As$_3$. The non-magnetic state has five Fermi surface sheets involving respectively three quasi-one-dimensional and two three-dimensional energy bands. However, the ground state is magnetic, exhibiting a novel interlayer antiferromagnetic order where the basic block-spin state of a unit Cr triangle retains a high spin magnitude. Moreover, its Fermi surface involves three one-dimensional sheets only, providing evidence for local moments in this compound due to the reduced dimensionality. By fitting a twisted spin tube model the magnetic frustrations caused by local moments are found to be relaxed, leading to gapless spin excitations. A frustration-induced transition to the disordered low block-spin state is expected upon increasing the intralayer exchange interaction.

preprint2015arXiv

Superconductivity in quasi-one-dimensional Cs2Cr3As3 with large interchain distance

Since the discovery of high-temperature superconductivity (SC) in quasi-two-dimensional copper oxides, a few layered compounds, which bear similarities to the cuprates, have also been found to host unconventional SC. Our recent observation of SC at 6.1 K in correlated electron material K2Cr3As3 (J. K. Bao et al., arXiv: 1412.0067) represents an obviously different paradigm, primarily because of its quasi-one-dimensional (Q1D) nature. The new material is structurally featured by the (Cr3As3)2- double-walled subnano-tubes composed of face-sharing Cr6/2 (As6/2) octahedron linear chains, which are well separated by columns of K+ counterions. Later, an isostructural superconducting Rb2Cr3As3 was synthesized, thus forming a new superconducting family. Here we report the third member, Cs2Cr3As3, which possesses the largest interchain distance. SC appears below 2.2 K. Similar to the former two sister compounds, Cs2Cr3As3 exhibits a non-Fermi liquid behavior with a linear temperature dependence of resistivity in the normal state, and a high upper critical field beyond the Pauli limit as well, suggesting common unconventional SC in the Q1D Cr-based material.

preprint2015arXiv

Superconductivity in quasi-one-dimensional K$_2$Cr$_3$As$_3$ with significant electron correlations

We report the discovery of bulk superconductivity (SC) at 6.1 K in a quasi-one-dimensional (Q1D) chromium pnictide K$_2$Cr$_3$As$_3$ which contains [(Cr$_3$As$_3$)$^{2-}$]$_{\infty}$ double-walled subnano-tubes with face-sharing Cr$_{6/2}$ (As$_{6/2}$) octahedron linear chains in the inner (outer) wall. The material has a large electronic specific-heat coefficient of 70$\sim$75 mJ K$^{-2}$ mol$^{-1}$, indicating significantly strong electron correlations. Signature of non-Fermi liquid behavior is shown by the linear temperature dependence of resistivity in a broad temperature range from 7 to 300 K. Unconventional SC is preliminarily manifested by the estimated upper critical field exceeding the Pauli limit by a factor of three to four. The title compound represents a rare example that possibly unconventional SC emerges in a Q1D system with strong electron correlations.

preprint2015arXiv

Synthesis, crystal structure and physical properties of quasi-one-dimensional ACr$_3$As$_3$ (A = Rb, Cs)

Recently, new Cr-based superconductors, A$_2$Cr$_3$As$_3$ (A = K, Rb, Cs), have gained strong interest because of their one-dimensional crystal structures and electron correlations. Here we report the crystal structure and physical properties of two related materials ACr$_3$As$_3$ (A = Rb, Cs) which are synthesized via a soft-chemical A+ deintercalation in A$_2$Cr$_3$As$_3$. The new compounds retain one-dimensional (Cr$_3$As$_3$)$_{\infty}$ linear chains, and the interchain distance can be tuned by the incorporation of alkali-metal cations with different sizes. The physical-property measurements indicate a local-moment behavior at high temperatures, and the moments freeze into a cluster spin-glass state below 5$\sim$6 K. No superconductivity was observed in either material. We also found that, with increasing interchain distance, the Cr effective moments increase monotonically, accompanied by an enhancement of semi-conductivity. Our results shed light on the understanding of the occurrence of superconductivity in A$_2$Cr$_3$As$_3$.

preprint2015arXiv

Unconventional superconductivity in quasi-one-dimensional Rb$_2$Cr$_3$As$_3$

Following the discovery of superconductivity in quasi-one-dimensional K$_2$Cr$_3$As$_3$ containing [(Cr$_3$As$_3$)$^{2-}$]$_{\infty}$ chains [J. K. Bao et al., arXiv: 1412.0067 (2014)], we succeeded in synthesizing an analogous compound, Rb$_2$Cr$_3$As$_3$, which also crystallizes in a hexagonal lattice. The replacement of K by Rb results in an expansion of $a$ axis by 3%, indicating a weaker interchain coupling in Rb$_2$Cr$_3$As$_3$. Bulk superconductivity emerges at 4.8 K, above which the normal-state resistivity shows a linear temperature dependence up to 35 K. The estimated upper critical field at zero temperature exceeds the Pauli paramagnetic limit by a factor of two. Furthermore, the electronic specific-heat coefficient extrapolated to zero temperature in the mixed state increases with $\sqrt{H}$, suggesting existence of nodes in the superconducting energy gap. Hence Rb$_2$Cr$_3$As$_3$ manifests itself as another example of unconventional superconductor in the Cr$_3$As$_3$-chain based system.

preprint2014arXiv

Adaptive Augmented Lagrangian Methods: Algorithms and Practical Numerical Experience

In this paper, we consider augmented Lagrangian (AL) algorithms for solving large-scale nonlinear optimization problems that execute adaptive strategies for updating the penalty parameter. Our work is motivated by the recently proposed adaptive AL trust region method by Curtis, Jiang, and Robinson [Math. Prog., DOI: 10.1007/s10107-014-0784-y, 2013]. The first focal point of this paper is a new variant of the approach that employs a line search rather than a trust region strategy, where a critical algorithmic feature for the line search strategy is the use of convexified piecewise quadratic models of the AL function for computing the search directions. We prove global convergence guarantees for our line search algorithm that are on par with those for the previously proposed trust region method. A second focal point of this paper is the practical performance of the line search and trust region algorithm variants in Matlab software, as well as that of an adaptive penalty parameter updating strategy incorporated into the Lancelot software. We test these methods on problems from the CUTEst and COPS collections, as well as on challenging test problems related to optimal power flow. Our numerical experience suggests that the adaptive algorithms outperform traditional AL methods in terms of efficiency and reliability. As with traditional AL algorithms, the adaptive methods are matrix-free and thus represent a viable option for solving extreme-scale problems.
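
For readers unfamiliar with the basic loop, a conventional augmented Lagrangian iteration looks roughly as follows. This is a minimal sketch on a toy equality-constrained problem (minimize $x_1^2 + x_2^2$ subject to $x_1 + x_2 = 1$), using one common sign convention for the AL function. The penalty rule shown (halve `mu` when the constraint violation shrinks too slowly) is the traditional heuristic, not the adaptive strategy proposed in the paper, and all names and constants are assumptions.

```python
import numpy as np

def augmented_lagrangian_demo(mu=1.0, iters=20):
    """Textbook AL loop for min x1^2 + x2^2 subject to x1 + x2 - 1 = 0."""
    a = np.array([1.0, 1.0])
    y = 0.0                     # Lagrange multiplier estimate
    prev_violation = np.inf
    for _ in range(iters):
        # Minimize x.x - y*c(x) + c(x)^2 / (2*mu) exactly; the subproblem is quadratic.
        H = 2.0 * np.eye(2) + np.outer(a, a) / mu
        rhs = (y + 1.0 / mu) * a
        x = np.linalg.solve(H, rhs)
        c = a @ x - 1.0         # constraint violation
        y = y - c / mu          # first-order multiplier update
        if abs(c) > 0.25 * prev_violation:
            mu *= 0.5           # insufficient progress: tighten the penalty
        prev_violation = abs(c)
    return x, y, mu
```

On this toy problem the iterates approach `x = (0.5, 0.5)` with multiplier `y = 1`, and the penalty parameter stops shrinking once the violation decreases fast enough.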

preprint2014arXiv

Anomalous Eu Valence State and Superconductivity in Undoped Eu3Bi2S4F4

We have synthesized a novel europium bismuth sulfofluoride, Eu3Bi2S4F4, by solid-state reactions in sealed evacuated quartz ampoules. The compound crystallizes in a tetragonal lattice (space group I4/mmm, a = 4.0771(1) Å, c = 32.4330(6) Å, and Z = 2), in which CaF2-type Eu3F4 layers and NaCl-like BiS2 bilayers stack alternately along the crystallographic c axis. There are two crystallographically distinct Eu sites, Eu(1) and Eu(2) at the Wyckoff positions 4e and 2a, respectively. Our bond-valence-sum calculation, based on the refined structural data, indicates that Eu(1) is essentially divalent, whilst Eu(2) has an average valence of +2.64(5). This anomalous Eu valence state is further confirmed and supported, respectively, by Mössbauer and magnetization measurements. The Eu3+ components donate electrons into the conduction bands that are mainly composed of Bi-6px and 6py states. Consequently, the material itself shows metallic conduction, and superconducts at 1.5 K without extrinsic chemical doping.

preprint2014arXiv

Black Phosphorus Radio-Frequency Transistors

Few-layer and thin film forms of layered black phosphorus (BP) have recently emerged as a promising material for applications in high performance nanoelectronics and infrared optoelectronics. Layered BP thin film offers a moderate bandgap of around 0.3 eV and high carrier mobility, leading to transistors with decent on-off ratio and high on-state current density. Here, we demonstrate the gigahertz frequency operation of black phosphorus field-effect transistors for the first time. The BP transistors demonstrated here show excellent current saturation with an on-off ratio exceeding 2000. We achieved a current density in excess of 270 mA/mm and DC transconductance above 180 mS/mm for hole conduction. Using standard high frequency characterization techniques, we measured a short-circuit current-gain cut-off frequency fT of 12 GHz and a maximum oscillation frequency fmax of 20 GHz in 300 nm channel length devices. BP devices may offer advantages over graphene transistors for high frequency electronics in terms of voltage and power gain due to the good current saturation properties arising from their finite bandgap, thus enabling the future ubiquitous transistor technology that can operate in the multi-GHz frequency range and beyond.

preprint2014arXiv

Charge-density wave, superconductivity and $f$-electron valence instability in EuBiS$_2$F

Superconductivity (SC) and charge-density wave (CDW) are two contrasting yet relevant collective electronic states which have received sustained interest for decades. Here we report that, in a layered europium bismuth sulfofluoride, EuBiS$_2$F, a CDW-like transition occurs at 280 K, below which SC emerges at 0.3 K, without any extrinsic doping. The Eu ions were found to exhibit an anomalously temperature-independent mixed valence of about +2.2, associated with the formation of CDW. The mixed valence of Eu gives rise to self electron doping into the conduction bands mainly consisting of the in-plane Bi-6$p$ states, which in turn brings about the CDW and SC. In particular, the electronic specific-heat coefficient is enhanced by ~ 50 times, owing to the significant hybridizations between Eu-4$f$ and Bi-6$p$ electrons, as verified by band-structure calculations. Thus, EuBiS$_2$F manifests itself as an unprecedented material that simultaneously accommodates SC, CDW and $f$-electron valence instability.

preprint2014arXiv

Study of the material photon and electron background and the liquid argon detector veto efficiency of the CDEX-10 experiment

The China Dark Matter Experiment (CDEX) is located at the China Jinping underground laboratory (CJPL) and aims to directly detect the WIMP flux with high sensitivity in the low mass region. Here we present a study of the predicted photon and electron backgrounds, including the background contributions of the structure materials of the germanium detector, the passive shielding materials, and the intrinsic radioactivity of the liquid argon that serves as an anti-Compton active shielding detector. A detailed geometry is modeled and the background contribution has been simulated within the GEANT4 program, based on the measured radioactivities of all possible components. Then the photon and electron background level in the energy region of interest ($<10^{-2}$ events kg$^{-1}$ day$^{-1}$ keV$^{-1}$ (cpkkd)) is predicted based on Monte Carlo simulations. The simulated result is consistent with the design goal of the CDEX-10 experiment, 0.1 cpkkd, which shows that the active and passive shield design of CDEX-10 is effective and feasible.

preprint2013arXiv

Direct and full-scale experimental verifications towards ground-satellite quantum key distribution

Quantum key distribution (QKD) provides the only intrinsically unconditionally secure method for communication based on the principles of quantum mechanics. Compared with fiber-based demonstrations, free-space links could provide the most appealing solution for much larger distances. Despite significant efforts, all realizations so far rely on stationary sites. Experimental verifications are therefore crucial for applications via a typical Low Earth Orbit Satellite (LEOS). To achieve direct and full-scale verifications, we demonstrate here three independent experiments with a decoy-state QKD system overcoming all the demanding conditions. The system is operated on a moving platform using a turntable, on a floating platform using a hot-air balloon, and over a high-loss channel, respectively, to substantiate its performance under rapid motion, attitude change, vibration and random movement of satellites, and in the high-loss regime. The experiments cover expanded ranges for all the leading parameters of LEOS. Our results pave the way towards ground-satellite QKD and a global quantum communication network.

preprint2013arXiv

Introduction of the CDEX experiment

Weakly Interacting Massive Particles (WIMPs) are candidates for dark matter in our universe. So far, no direct interaction of WIMPs with nuclei has been observed. The experimentally obtained exclusion limits on the spin-independent WIMP-nucleon cross section are about $10^{-7}$ pb in the high mass region and only $10^{-5}$ pb in the low mass region. The China Jin-Ping underground laboratory (CJPL) is the deepest underground lab in the world and provides a very promising environment for direct observation of dark matter. The China Dark Matter Experiment (CDEX) is going to directly detect the WIMP flux with high sensitivity in the low mass region. Both CJPL and CDEX have made remarkable progress in the last two years. CDEX employs a point-contact germanium semiconductor detector (PCGe) whose detection threshold is less than 300 eV. We report the results of measurements of the muon flux and the monitoring of radioactivity and radon concentration carried out in CJPL, and describe the structure and performance of the 1 kg PCGe detector CDEX-1 and the 10 kg detector array CDEX-10, including the detectors, electronics, shielding and cooling systems. Finally we discuss the physics goals of the CDEX-1, CDEX-10 and the future CDEX-1T detectors.

preprint2013arXiv

K and Mn co-doped BaCd2As2: a hexagonal structured bulk diluted magnetic semiconductor with large magnetoresistance

A bulk diluted magnetic semiconductor was found in the K and Mn co-doped BaCd2As2 system. Unlike the recently reported tetragonal ThCr2Si2-structured II-II-V based (Ba,K)(Zn,Mn)2As2, the Ba1-yKyCd2-xMnxAs2 system has a hexagonal CaAl2Si2-type structure with the Cd2As2 layer forming a honeycomb-like network. The Mn concentration reaches up to x ~ 0.4. Magnetization measurements show that the samples undergo ferromagnetic transitions with Curie temperatures up to 16 K. With a low coercive field of less than 10 Oe and a large magnetoresistance of about -70%, the hexagonal structured Ba1-yKyCd2-xMnxAs2 can serve as a promising candidate for spin manipulations.

preprint2013arXiv

Superconductivity, charge- or spin-density wave, and metal-nonmetal transition in BaTi$_{2}$(Sb$_{1-x}$Bi$_{x}$)$_{2}$O

We have performed an isovalent substitution study in a layered titanium oxypnictide system BaTi$_{2}$(Sb$_{1-x}$Bi$_{x}$)$_{2}$O (0$\leq x\leq$ 0.40) by measurements of x-ray diffraction, electrical resistivity and magnetic susceptibility. The parent compound BaTi$_{2}$Sb$_{2}$O is confirmed to exhibit superconductivity at 1.5 K as well as charge- or spin-density wave (CDW/SDW) ordering below 55 K. With the partial substitution of Sb by Bi, the lattice parameters $a$, $c$ and $c/a$ all increase monotonically, indicating negative chemical pressure and lattice distortion on the (super)conducting Ti$_2$Sb$_2$O layers. The Bi doping elevates the superconducting transition temperature to its maximum $T_c$=3.7 K at $x=$0.17, and then $T_c$ decreases gradually with additional Bi doping. A metal-to-nonmetal transition takes place around $x$=0.3, and superconductivity at $\sim$1 K exists on the nonmetal side. The CDW/SDW anomaly, in comparison, is rapidly suppressed by the Bi doping, and vanishes for $x\geq$0.17. The results are discussed in terms of negative chemical pressure and disorder effects.

preprint2013arXiv

The CDEX-1 1 kg Point-Contact Germanium Detector for Low Mass Dark Matter Searches

The CDEX Collaboration has been established for direct detection of light dark matter particles, using ultra-low energy threshold p-type point-contact germanium detectors, in China JinPing underground Laboratory (CJPL). The first 1 kg point-contact germanium detector with a sub-keV energy threshold has been tested in a passive shielding system located in CJPL. The outputs from both the point-contact p+ electrode and the outside n+ electrode make it possible to scan the lower energy range of less than 1 keV and at the same time to detect the higher energy range up to 3 MeV. The outputs from both p+ and n+ electrode may also provide a more powerful method for signal discrimination for dark matter experiment. Some key parameters, including energy resolution, dead time, decay times of internal X-rays, and system stability, have been tested and measured. The results show that the 1 kg point-contact germanium detector, together with its shielding system and electronics, can run smoothly with good performances. This detector system will be deployed for dark matter search experiments.

preprint2012arXiv

Novel weak-ferromagnetic metallic state in heavily doped Ba$_{1-x}$K$_{x}$Mn$_{2}$As$_{2}$

Heavily doped Ba$_{1-x}$K$_{x}$Mn$_{2}$As$_{2}$ ($x$=0.19 and 0.26) single crystals were successfully grown and investigated by measurements of resistivity and anisotropic magnetic susceptibility. In contrast to the antiferromagnetic insulating ground state of the undoped BaMn$_{2}$As$_{2}$, the K-doped crystals show metallic conduction with weak ferromagnetism below $\sim$50 K and Curie-Weiss-like in-plane magnetic susceptibility above $\sim$50 K. Under high pressures up to 6 GPa, the low-temperature metallicity changes into a state characterized by a Kondo-like resistivity minimum without any signature of superconductivity above 2.5 K. Electronic structure calculations for $x$=0.25 using a $2\times2\times1$ supercell reproduce the hole-doped metallic state. The density of states at the Fermi energy has significant As 4$p$ components, suggesting that the 4$p$ holes are mainly responsible for the metallic conduction. Our results suggest that the interplay between itinerant 4$p$ holes and local 3$d$ moments is mostly responsible for the novel metallic state.

preprint2012arXiv

Self-doping effect and possible antiferromagnetism at titanium-layers in the iron-based superconductor Ba$_2$Ti$_2$Fe$_2$As$_4$O

The electronic structure of Ba$_2$Ti$_2$Fe$_2$As$_4$O, a newly discovered superconductor, is investigated using first-principles calculations based on local density approximations. Multiple Fermi surface sheets originating from Ti-3$d$ and Fe-3$d$ states are present corresponding to the conducting Ti$_2$As$_2$O and Fe$_2$As$_2$ layers respectively. Compared with BaFe$_2$As$_2$, sizeable changes in the related Fermi surface sheets indicate significant electron transfer (about 0.12$e$) from Ti to Fe, which suppresses the stripe-like antiferromagnetism at the Fe sites and simultaneously induces superconductivity. Our calculations also suggest that an additional Néel-type antiferromagnetic instability at the Ti sites is relatively robust against the electron transfer, which accounts for the anomaly at 125 K in the superconducting Ba$_2$Ti$_2$Fe$_2$As$_4$O.

Institution

Affiliation not imported yet

This author record came from a source that does not expose affiliation metadata. Once the author claims the profile or we enrich the record from another provider, this section will link to the concrete institution.

Source provenance

Where this author record came from

arxiv, confidence 95%

external id: arxiv:2605.08737:author:2:hao-jiang

Imported May 20, 2026; synced May 21, 2026

arxiv, confidence 95%

external id: arxiv:2605.08129:author:6:hao-jiang

Imported May 20, 2026; synced May 20, 2026

arxiv, confidence 95%

external id: arxiv:2605.07503:author:5:hao-jiang

Imported May 20, 2026; synced May 20, 2026

arxiv, confidence 95%

external id: arxiv:2605.07398:author:7:hao-jiang

Imported May 20, 2026; synced May 20, 2026

arxiv, confidence 95%

external id: arxiv:2605.14448:author:4:hao-jiang

Imported May 20, 2026; synced May 20, 2026

12 works

Guang-Han Cao

Researcher

Guang-Han Cao contributes to research discovery and scholarly infrastructure.

Open to collaborate

12 works

Zhu-An Xu

Researcher

Zhu-An Xu contributes to research discovery and scholarly infrastructure.

Open to collaborate

9 works

Jin-Ke Bao

Researcher

Jin-Ke Bao contributes to research discovery and scholarly infrastructure.

Open to collaborate

9 works

Yun-Lei Sun

Researcher

Yun-Lei Sun contributes to research discovery and scholarly infrastructure.

Open to collaborate

56 published item(s)

Diffusion-APO: Trajectory-Aware Direct Preference Alignment for Video Diffusion Transformers

Electric field switching of altermagnetic spin-splitting in multiferroic skyrmions

Exposing and Mitigating Temporal Attack in Deepfake Video Detection

Stereo Audio Rendering for Personal Sound Zones Using a Binaural Spatially Adaptive Neural Network (BSANN)

TEA: Temporal Adaptive Satellite Image Semantic Segmentation

The Extrapolation Cliff in On-Policy Distillation of Near-Deterministic Structured Outputs

Think When Needed: Adaptive Reasoning-Driven Multimodal Embeddings with a Dual-LoRA Architecture

Towards Customized Multimodal Role-Play

Unified Personalized Understanding, Generating and Editing

Hybrid Vector Message Passing for Generalized Bilinear Factorization

Bayesian Inverse Uncertainty Quantification of the Physical Model Parameters for the Spallation Neutron Source First Target Station

ConfPred: A layered intergrowth structure prediction model based on confinement self-assembly in two-dimensional interlayer space

Ego4D: Around the World in 3,000 Hours of Egocentric Video

Egocentric Deep Multi-Channel Audio-Visual Active Speaker Localization

Hyperlink-induced Pre-training for Passage Retrieval in Open-domain Question Answering

KMIR: A Benchmark for Evaluating Knowledge Memorization, Identification and Reasoning Abilities of Language Models

Model Calibration of the Liquid Mercury Spallation Target using Evolutionary Neural Networks and Sparse Polynomial Expansions

PReGAN: Answer Oriented Passage Ranking with Weakly Supervised GAN

Residual-Aided End-to-End Learning of Communication System without Known Channel

Towards Efficient NLP: A Standard Evaluation and A Strong Baseline

Feasible Computationally Efficient Path Planning for UAV Collision Avoidance

Overcoming Long-term Catastrophic Forgetting through Adversarial Neural Pruning and Synaptic Consolidation

Robopheus: A Virtual-Physical Interactive Mobile Robotic Testbed

Design, Control, and Applications of a Soft Robotic Arm

Learning Differential Diagnosis of Skin Conditions with Co-occurrence Supervision using Graph Convolutional Networks

Real-time 3D Deep Multi-Camera Tracking

Review of data analysis in vision inspection of power lines with an in-depth discussion of deep learning technology

Variance Reduction for Deep Q-Learning using Stochastic Recursive Gradient

A note on the convergence of nonconvex line search

Accelerated atomistic simulation study on the stability and mobility of carbon tri-interstitial cluster in cubic SiC

Crystal Chemistry and Structural Design of Iron-Based Superconductors

Detangling People: Individuating Multiple Close People and Their Body Parts via Region Assembly

Radiation-induced mobility of small defect clusters in covalent materials

Seeing Invisible Poses: Estimating 3D Body Pose from Egocentric Video

Superconductivity and Ferromagnetism in Hole-Doped RbEuFe$_4$As$_4$

Electronic structure of quasi-one-dimensional superconductor K$_2$Cr$_3$As$_3$ from first-principles calculations

New fast divide-and-conquer algorithms for the symmetric tridiagonal eigenvalue problem

On the solution of stochastic optimization and variational problems in imperfect information regimes

Physical properties and electronic structure of Sr$_2$Cr$_3$As$_2$O$_2$ containing CrO$_2$ and Cr$_2$As$_2$ square-planar lattices

Reduced Dimensionality and Magnetic Frustration in KCr$_3$As$_3$

Superconductivity in quasi-one-dimensional Cs2Cr3As3 with large interchain distance

Superconductivity in quasi-one-dimensional K$_2$Cr$_3$As$_3$ with significant electron correlations

Synthesis, crystal structure and physical properties of quasi-one-dimensional ACr$_3$As$_3$ (A = Rb, Cs)

Unconventional superconductivity in quasi-one-dimensional Rb$_2$Cr$_3$As$_3$

Adaptive Augmented Lagrangian Methods: Algorithms and Practical Numerical Experience

Anomalous Eu Valence State and Superconductivity in Undoped Eu3Bi2S4F4

Black Phosphorus Radio-Frequency Transistors

Charge-density wave, superconductivity and $f$-electron valence instability in EuBiS$_2$F

Study of the material photon and electron background and the liquid argon detector veto efficiency of the CDEX-10 experiment

Direct and full-scale experimental verifications towards ground-satellite quantum key distribution

Introduction of the CDEX experiment

K and Mn co-doped BaCd2As2: a hexagonal structured bulk diluted magnetic semiconductor with large magnetoresistance

Superconductivity, charge- or spin-density wave, and metal-nonmetal transition in BaTi$_{2}$(Sb$_{1-x}$Bi$_{x}$)$_{2}$O

The CDEX-1 1 kg Point-Contact Germanium Detector for Low Mass Dark Matter Searches

Novel weak-ferromagnetic metallic state in heavily doped Ba$_{1-x}$K$_{x}$Mn$_{2}$As$_{2}$

Self-doping effect and possible antiferromagnetism at titanium-layers in the iron-based superconductor Ba$_2$Ti$_2$Fe$_2$As$_4$O