Source author record

Xu Han

Xu Han appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher (unclaimed source record)

Catalog footprint

What is connected

48 works

29 topics

4 close collaborators

preprint, 2026, arXiv

MiniCPM-o 4.5: Towards Real-Time Full-Duplex Omni-Modal Interaction

Recent progress in multimodal large language models (MLLMs) has brought AI capabilities from static offline data processing to real-time streaming interaction, yet these models remain far from human-level multimodal interaction. The key bottlenecks are no longer modality coverage or latency alone, but the interaction paradigm itself. First, perception and response are still separated into alternating phases, preventing models from incorporating new inputs for timely adjustment during generation. Second, most current models remain reactive, responding only to explicit user requests instead of acting proactively in the evolving multimodal environment. We present MiniCPM-o 4.5, our latest effort towards human-like multimodal interaction, which mitigates these gaps through real-time full-duplex omni-modal interaction. It can see, listen, and speak simultaneously in real time, while also exhibiting proactive behaviors such as issuing reminders or comments based on its continuous understanding of the live scene. The key technique behind MiniCPM-o 4.5 is Omni-Flow, a unified streaming framework that aligns omni-modal inputs and outputs along a shared temporal axis. This formulation converts conventional turn-based interaction into a full-duplex, time-aligned process, enabling simultaneous perception and response and allowing proactive behavior to arise within the same framework. With a total of 9B parameters, MiniCPM-o 4.5 approaches Gemini 2.5 Flash in vision-language capabilities, delivering state-of-the-art open-source performance at its scale. It also surpasses Qwen3-Omni-30B-A3B in omni-modal understanding and delivers better speech generation, with significantly higher computational efficiency. Driven by its efficient architecture design and inference optimization, the model can perform real-time full-duplex omni-modal interaction on edge devices with less than 12 GB of RAM.

preprint, 2022, arXiv

A Roadmap for Big Model

With the rapid development of deep learning, training Big Models (BMs) for multiple downstream tasks has become a popular paradigm. Researchers have achieved various outcomes in the construction of BMs and their application in many fields. At present, there is a lack of research that surveys the overall progress of BMs and guides follow-up research. In this paper, we cover not only the BM technologies themselves but also the prerequisites for BM training and applications with BMs, dividing the BM review into four parts: Resource, Models, Key Technologies and Application. We introduce 16 specific BM-related topics in those four parts: Data, Knowledge, Computing System, Parallel Training System, Language Model, Vision Model, Multi-modal Model, Theory & Interpretability, Commonsense Reasoning, Reliability & Security, Governance, Evaluation, Machine Translation, Text Generation, Dialogue and Protein Research. For each topic, we clearly summarize the current studies and propose some future research directions. At the end of this paper, we discuss the further development of BMs from a more general view.

preprint, 2022, arXiv

Atomic-Scale Visualization of Chiral Charge Density Wave States and Their Reversible Transition

Chirality is essential for various amazing phenomena in life and matter. However, chirality and its switching in electronic superlattices, such as charge density wave (CDW) arrays, remain elusive. In this study, we characterize the chirality transition with atomic-resolution imaging in a single-layer NbSe2 CDW pattern using scanning tunneling microscopy. The atomic lattice of the CDW array is found to be continuous and intact even though its chirality is switched. Several intermediate states are tracked by time-resolved imaging, revealing the fast and dynamic chirality transition. Importantly, the switching is reversibly realized with an external electric field. Our findings unveil the delicate transition process of the chiral CDW array in a 2D crystal down to the atomic scale and may be applicable to future nanoscale devices.

preprint, 2022, arXiv

Exfoliation of 2D van der Waals crystals in ultrahigh vacuum for interface engineering

Two-dimensional (2D) materials and their heterostructures have been intensively studied in recent years due to their potential applications in electronic, optoelectronic, and spintronic devices. Nonetheless, the realization of 2D heterostructures with atomically flat and clean interfaces remains challenging, especially for air-sensitive materials, which hinders the in-depth investigation of interface-induced phenomena and the fabrication of high-quality devices. Here, we circumvented this challenge by exfoliating 2D materials in an ultrahigh vacuum. Remarkably, ultraflat and clean substrate surfaces can assist the exfoliation of 2D materials, regardless of the substrate and 2D material, thus providing a universal method for the preparation of heterostructures with ideal interfaces. In addition, we studied the properties of two prototypical systems that could not be achieved previously, including the electronic structure of monolayer phosphorene and the optical responses of transition metal dichalcogenides on different metal substrates. Our work paves the way to engineer rich interface-induced phenomena, such as proximity effects and moiré superlattices.

preprint, 2022, arXiv

Fully Hyperbolic Neural Networks

Hyperbolic neural networks have shown great potential for modeling complex data. However, existing hyperbolic networks are not completely hyperbolic, as they encode features in a hyperbolic space yet formalize most of their operations in the tangent space (a Euclidean subspace) at the origin of the hyperbolic space. This hybrid method greatly limits the modeling ability of networks. In this paper, we propose a fully hyperbolic framework to build hyperbolic networks based on the Lorentz model by adapting the Lorentz transformations (including boost and rotation) to formalize essential operations of neural networks. Moreover, we also prove that linear transformation in tangent spaces used by existing hyperbolic networks is a relaxation of the Lorentz rotation and does not include the boost, implicitly limiting the capabilities of existing hyperbolic networks. The experimental results on four NLP tasks show that our method has better performance for building both shallow and deep networks. Our code will be released to facilitate follow-up research.

preprint, 2022, arXiv

Knowledge Inheritance for Pre-trained Language Models

Recent explorations of large-scale pre-trained language models (PLMs) have revealed the power of PLMs with huge numbers of parameters, setting off a wave of training ever-larger PLMs. However, training a large-scale PLM requires tremendous computational resources, which may be practically unaffordable. In addition, existing large-scale PLMs are mainly trained from scratch individually, ignoring that many well-trained PLMs are available. To this end, we explore the question of how existing PLMs could benefit the training of large-scale PLMs in the future. Specifically, we introduce a pre-training framework named "knowledge inheritance" (KI) and explore how knowledge distillation could serve as auxiliary supervision during pre-training to efficiently learn larger PLMs. Experimental results demonstrate the superiority of KI in training efficiency. We also conduct empirical analyses to explore the effects of teacher PLMs' pre-training settings, including model architecture, pre-training data, etc. Finally, we show that KI could be applied to domain adaptation and knowledge transfer.

preprint, 2022, arXiv

Meta Reinforcement Learning with Successor Feature Based Context

Most reinforcement learning (RL) methods focus only on learning a single task from scratch and are not able to use prior knowledge to learn other tasks more effectively. Context-based meta-RL techniques have recently been proposed as a possible solution. However, they are usually less efficient than conventional RL and may require many trial-and-error interactions during training. To address this, we propose a novel meta-RL approach that achieves competitive performance compared to existing meta-RL algorithms while requiring significantly fewer environmental interactions. By combining context variables with the idea of decomposing reward in the successor feature framework, our method not only learns high-quality policies for multiple tasks simultaneously but can also quickly adapt to new tasks with a small amount of training. Compared with state-of-the-art meta-RL baselines, we empirically show the effectiveness and data efficiency of our method on several continuous control tasks.

preprint, 2022, arXiv

One-step exfoliation method for plasmonic activation of large-area 2D crystals

Advanced exfoliation techniques are crucial for exploring the intrinsic properties and applications of 2D materials. Though the recently discovered Au-enhanced exfoliation technique provides an effective strategy for the preparation of large-scale 2D crystals, the high cost of gold hinders this method from being widely adopted in industrial applications. In addition, direct Au contact could significantly quench photoluminescence (PL) emission in 2D semiconductors. It is therefore crucial to find alternative metals that can replace gold to achieve efficient exfoliation of 2D materials. Here, we present a one-step Ag-assisted method that can efficiently exfoliate many large-area 2D monolayers, where the yield ratio is comparable to that of the Au-enhanced exfoliation method. Differing from Au film, however, the surface roughness of as-prepared Ag films on SiO2/Si substrates is much higher, which facilitates the generation of surface plasmons resulting from the nanostructures formed on the rough Ag surface. More interestingly, the strong coupling between 2D semiconductor crystals (e.g. MoS2, MoSe2) and Ag film leads to a unique PL enhancement that has not been observed in other mechanical exfoliation techniques, which can be mainly attributed to enhanced light-matter interaction as a result of the extended propagation of surface plasmon polaritons (SPPs). Our work provides a lower-cost and universal Ag-assisted exfoliation method, while at the same time offering enhanced SPP-matter interactions.

preprint, 2022, arXiv

OPV2V: An Open Benchmark Dataset and Fusion Pipeline for Perception with Vehicle-to-Vehicle Communication

Employing Vehicle-to-Vehicle communication to enhance perception performance in self-driving technology has attracted considerable attention recently; however, the absence of a suitable open dataset for benchmarking algorithms has made it difficult to develop and assess cooperative perception technologies. To this end, we present the first large-scale open simulated dataset for Vehicle-to-Vehicle perception. It contains over 70 interesting scenes, 11,464 frames, and 232,913 annotated 3D vehicle bounding boxes, collected from 8 towns in CARLA and a digital town of Culver City, Los Angeles. We then construct a comprehensive benchmark with a total of 16 implemented models to evaluate several information fusion strategies (i.e., early, late, and intermediate fusion) with state-of-the-art LiDAR detection algorithms. Moreover, we propose a new Attentive Intermediate Fusion pipeline to aggregate information from multiple connected vehicles. Our experiments show that the proposed pipeline can be easily integrated with existing 3D LiDAR detectors and achieve outstanding performance even with large compression rates. To encourage more researchers to investigate Vehicle-to-Vehicle perception, we will release the dataset, benchmark methods, and all related code at https://mobility-lab.seas.ucla.edu/opv2v/.

preprint, 2022, arXiv

PPT: Pre-trained Prompt Tuning for Few-shot Learning

Prompts for pre-trained language models (PLMs) have shown remarkable performance by bridging the gap between pre-training tasks and various downstream tasks. Among these methods, prompt tuning, which freezes PLMs and only tunes soft prompts, provides an efficient and effective solution for adapting large-scale PLMs to downstream tasks. However, prompt tuning is yet to be fully explored. In our pilot experiments, we find that prompt tuning performs comparably with conventional full-model fine-tuning when downstream data are sufficient, whereas it performs much worse under few-shot learning settings, which may hinder the application of prompt tuning in practice. We attribute this low performance to the manner of initializing soft prompts. Therefore, in this work, we propose to pre-train prompts by adding soft prompts into the pre-training stage to obtain a better initialization. We name this Pre-trained Prompt Tuning framework "PPT". To ensure the generalization of PPT, we formulate similar classification tasks into a unified task form and pre-train soft prompts for this unified task. Extensive experiments show that tuning pre-trained prompts for downstream tasks can reach or even outperform full-model fine-tuning under both full-data and few-shot settings. Our approach is effective and efficient for using large-scale PLMs in practice.

preprint, 2022, arXiv

Predicting Physics in Mesh-reduced Space with Temporal Attention

Graph-based next-step prediction models have recently been very successful in modeling complex high-dimensional physical systems on irregular meshes. However, due to their short temporal attention span, these models suffer from error accumulation and drift. In this paper, we propose a new method that captures long-term dependencies through a transformer-style temporal attention model. We introduce an encoder-decoder structure to summarize features and create a compact mesh representation of the system state, allowing the temporal model to operate on a low-dimensional mesh representation in a memory-efficient manner. Our method outperforms a competitive GNN baseline on several complex fluid dynamics prediction tasks, from sonic shocks to vascular flow. We demonstrate stable rollouts without the need for training noise and show perfectly phase-stable predictions even for very long sequences. More broadly, we believe our approach paves the way to bringing the benefits of attention-based sequence models to solving high-dimensional complex physics tasks.

preprint, 2022, arXiv

Quantum transduction with microwave and optical entanglement

Quantum transduction refers to the coherent conversion between microwave and optical states, which can be achieved by quantum teleportation given high-fidelity microwave-optical entanglement; this is known as entanglement-based quantum transduction. Reliable microwave-optical entanglement can be generated using various platforms. In this paper, we base the discussion on a piezo-optomechanical system and make the teleportation-induced conversion scheme more concrete in the framework of quantum channel theory. By comparing the quantum capacity of the entanglement-based conversion channel with that of the traditional direct quantum transduction channel, we show that the entanglement-based scheme indeed admits a positive transduction rate when direct quantum transduction has zero quantum capacity. Given two piezo-optomechanical systems, we also investigate the generation of microwave-microwave entanglement from entanglement swapping within continuous-variable and discrete-variable settings, showing the potential of directly connecting microwave quantum processors by microwave-microwave quantum teleportation.

preprint, 2022, arXiv

Sampling-based Fast Gradient Rescaling Method for Highly Transferable Adversarial Attacks

Deep neural networks have been shown to be very vulnerable to adversarial examples crafted by adding human-imperceptible perturbations to benign inputs. After achieving impressive attack success rates in the white-box setting, more focus has shifted to black-box attacks. In either case, the common gradient-based approaches generally use the $sign$ function to generate perturbations at the end of the process. However, only a few works pay attention to the limitation of the $sign$ function. Deviation between the original gradient and the generated noise may lead to inaccurate gradient update estimation and suboptimal solutions for adversarial transferability, which is crucial for black-box attacks. To address this issue, we propose a Sampling-based Fast Gradient Rescaling Method (S-FGRM) to improve the transferability of the crafted adversarial examples. Specifically, we use data rescaling to substitute the inefficient $sign$ function in gradient-based attacks without extra computational cost. We also propose a Depth First Sampling method to eliminate the fluctuation of rescaling and stabilize the gradient update. Our method can be used in any gradient-based optimization and can be integrated with various input transformation or ensemble methods to further improve adversarial transferability. Extensive experiments on the standard ImageNet dataset show that our S-FGRM could significantly boost the transferability of gradient-based attacks and outperform the state-of-the-art baselines.

preprint, 2022, arXiv

Single electrons on solid neon as a solid-state qubit platform

Progress toward the realization of quantum computers requires persistent advances in their constituent building blocks: qubits. Novel qubit platforms that simultaneously embody long coherence, fast operation, and large scalability offer compelling advantages in the construction of quantum computers and many other quantum information systems. Electrons, ubiquitous elementary particles of nonzero charge, spin, and mass, have commonly been perceived as paradigmatic local quantum information carriers. Despite superior controllability and configurability, their practical performance as qubits via either motional or spin states depends critically on their material environment. Here we report our experimental realization of a new qubit platform based upon isolated single electrons trapped on an ultraclean solid neon surface in vacuum. By integrating an electron trap in a circuit quantum electrodynamics architecture, we achieve strong coupling between the motional states of a single electron and a single microwave photon in an on-chip superconducting resonator. Qubit gate operations and dispersive readout are implemented to measure the energy relaxation time $T_1$ of 15 μs and phase coherence time $T_2$ over 200 ns. These results indicate that the electron-on-solid-neon qubit already performs near the state of the art as a charge qubit.

preprint, 2022, arXiv

Skilled Mutual Fund Selection: False Discovery Control under Dependence

Selecting skilled mutual funds through the multiple testing framework has received increasing attention from finance researchers and statisticians. The intercept $α$ of the Carhart four-factor model is commonly used to measure the true performance of mutual funds, and funds with positive $α$ are considered skilled. We observe that the standardized OLS estimates of $α$ across the funds possess strong dependence and nonnormality structures, indicating that the conventional multiple testing methods are inadequate for selecting the skilled funds. We start from a decision-theoretic perspective and propose an optimal testing procedure to minimize a combination of false discovery rate and false non-discovery rate. Our proposed testing procedure is constructed based on the probability of each fund not being skilled conditional on the information across all of the funds in our study. To model the distribution of the information used for the testing procedure, we consider a mixture model under dependence and propose a new method called "approximate empirical Bayes" to fit the parameters. Empirical studies show that our selected skilled funds have superior long-term and short-term performance, e.g., our selection strongly outperforms the S&P 500 index during the same period.

preprint, 2022, arXiv

Superconducting cavity piezo-electromechanics: the realization of an acoustic frequency comb at microwave frequencies

We present a nonlinear multimode superconducting electroacoustic system, where the interplay between superconducting kinetic inductance and piezoelectric strong coupling establishes an effective Kerr nonlinearity among multiple acoustic modes at 10 GHz that could hardly be achieved via intrinsic mechanical nonlinearity. By exciting this multimode Kerr system with a single microwave tone, we further demonstrate a coherent electroacoustic frequency comb and provide theoretical understanding of multimode nonlinear interaction in the superstrong coupling limit. This nonlinear superconducting electroacoustic system sheds light on the active control of multimode resonator systems and offers an enabling platform for the dynamic study of microcombs at microwave frequencies.

preprint, 2021, arXiv

Bridging the gap between atomically thin semiconductors and metal leads

Electrically interfacing atomically thin transition metal dichalcogenide semiconductors (TMDSCs) with metal leads is challenging because of undesired interface barriers, which have drastically constrained the electrical performance of TMDSC devices for exploring their unconventional physical properties and realizing potential electronic applications. Here we demonstrate a strategy to achieve nearly barrier-free electrical contacts with few-layer TMDSCs by engineering interfacial bonding distortion. The carrier-injection efficiency of such an electrical junction is substantially increased, with robust ohmic behavior from room to cryogenic temperatures. The performance enhancements of TMDSC field-effect transistors are well reflected by the ultralow contact resistance (down to 90 Ω·μm in MoS2, towards the quantum limit), the ultrahigh field-effect mobility (up to 358,000 cm² V⁻¹ s⁻¹ in WSe2) and the prominent transport characteristics at cryogenic temperatures. This method also offers new possibilities for the local manipulation of structures and electronic properties in TMDSC device design.

preprint, 2021, arXiv

Identity-aware Facial Expression Recognition in Compressed Video

This paper explores facial expression representations free of inter-subject variation in the compressed video domain. Most previous methods process the RGB images of a sequence, even though valuable expression-related muscle movement is already embedded, off the shelf, in the compression format. In the compressed domain, which is up to two orders of magnitude smaller, we can explicitly infer the expression from the residual frames and extract identity factors from the I-frame with a pre-trained face recognition network. By enforcing marginal independence between them, the expression feature is expected to be purer and more robust to identity shifts. We do not need the identity label or multiple expression samples from the same person for identity elimination. Moreover, when the apex frame is annotated in the dataset, a complementary constraint can be added to regularize the feature-level game. In testing, only the compressed residual frames are required to achieve expression prediction. Our solution can achieve comparable or better performance than recent decoded-image-based methods on the typical FER benchmarks, with about 3× faster inference on compressed data.

preprint, 2021, arXiv

Inverse Faraday Effect in an Optomagnonic Waveguide

Single-mode high-index-contrast waveguides have been ubiquitously exploited in optical, microwave, and phononic structures for achieving enhanced wave-matter interactions. Although micro-scale optomechanical and electro-optical devices have been widely studied, optomagnonic devices remain a grand challenge at the microscale. Here, we introduce a planar optomagnonic waveguide platform based on a ferrimagnetic insulator that simultaneously supports a single transverse mode of spin waves (magnons) and highly confined optical modes. The co-localization of spin and light waves gives rise to an enhanced inverse Faraday effect, and as a result, magnons are excited by an effective magnetic field generated by interacting optical photons. Moreover, the strongly enhanced optomagnonic interaction allows us to observe this effect using low-power (milliwatt-level) light signals in continuous-wave form, as opposed to the high-intensity (megawatt peak power) light pulses that are typically required in magnetic bulk materials or thin films. The optically driven magnons are detected electrically with preserved phase coherence, showing the feasibility of launching spin waves with low-power continuous optical fields.

preprint, 2020, arXiv

A Deep Network for Joint Registration and Reconstruction of Images with Pathologies

Registration of images with pathologies is challenging due to tissue appearance changes and missing correspondences caused by the pathologies. Moreover, mass effects as observed for brain tumors may displace tissue, creating larger deformations over time than what is observed in a healthy brain. Deep learning models have successfully been applied to image registration to offer dramatic speed-ups and to use surrogate information (e.g., segmentations) during training. However, existing approaches focus on learning registration models using images from healthy patients. They are therefore not designed for the registration of images with strong pathologies, for example, in the context of brain tumors and traumatic brain injuries. In this work, we explore a deep learning approach to register images with brain tumors to an atlas. Our model learns an appearance mapping from images with tumors to the atlas, while simultaneously predicting the transformation to atlas space. Using separate decoders, the network disentangles the tumor mass effect from the reconstruction of quasi-normal images. Results on both synthetic and real brain tumor scans show that our approach outperforms cost function masking for registration to the atlas and that reconstructed quasi-normal images can be used for better longitudinal registrations.

preprint, 2020, arXiv

An Upper Bound for Functions of Estimators in High Dimensions

We provide an upper bound as a random variable for the functions of estimators in high dimensions. This upper bound may help establish the rate of convergence of functions in high dimensions. The upper bound random variable may converge faster, slower, or at the same rate as estimators depending on the behavior of the partial derivative of the function. We illustrate this via three examples. The first two examples use the upper bound for testing in high dimensions, and the third example derives the estimated out-of-sample variance of large portfolios. All our results allow for a larger number of parameters, p, than the sample size, n.

preprint, 2020, arXiv

Cavity piezo-mechanics for superconducting-nanophotonic quantum interface

Hybrid quantum systems are essential for the realization of distributed quantum networks. In particular, piezo-mechanics operating at typical superconducting qubit frequencies features low thermal excitations and offers an appealing platform to bridge superconducting quantum processors and optical telecommunication channels. However, integrating superconducting and optomechanical elements at cryogenic temperatures with sufficiently strong interactions remains a tremendous challenge. Here, we report an integrated superconducting cavity piezo-optomechanical platform where 10-GHz phonons are resonantly coupled with photons in a superconducting cavity and a nanophotonic cavity at the same time. Benefiting from the achieved large piezo-mechanical cooperativity ($C_\mathrm{em}\sim7$) and the enhanced optomechanical coupling boosted by a pulsed optical pump, we demonstrate coherent interactions at cryogenic temperatures via the observation of efficient microwave-optical photon conversion. This hybrid interface makes a substantial step towards quantum communication at large scale, as well as novel explorations in microwave-optical photon entanglement and quantum sensing mediated by gigahertz phonons.

preprint, 2020, arXiv

Magnon-photon strong coupling for tunable microwave circulators

We present a generic theoretical framework to describe non-reciprocal microwave circulation in a multimode cavity magnonic system and assess the optimal performance of practical circulator devices. We show that high isolation (> 56 dB), extremely low insertion loss (< 0.05 dB), and flexible bandwidth control can potentially be realized in high-quality-factor superconducting-cavity-based magnonic platforms. These circulation characteristics are analyzed with materials of different spin densities. For high-spin-density materials such as yttrium iron garnet, the strong-coupling operation regime can be harnessed to obtain a broader circulation bandwidth. We also provide practical design principles for a highly integrable low-spin-density material (vanadium tetracyanoethylene) for narrow-band circulator operation, which could benefit noise-sensitive quantum microwave measurements. This theory can be extended to other coupled systems and provides design guidelines for achieving tunable microwave non-reciprocity for both classical and quantum applications.

preprint, 2020, arXiv

Microwave-Optical entanglement in a Strongly Coupled Electro-Optomechanical System

Quantum transduction between microwave and optics can be realized by quantum teleportation given reliable microwave-optical entanglement; this is known as entanglement-based quantum transduction. To realize this protocol, an entangled source with high fidelity between the two frequencies is necessary. In this paper, we study microwave and optical entanglement generation based on a generic cavity electro-optomechanical system in the strong coupling regime. Splittings are shown in the microwave and optical output spectra, and the frequency entanglement between the two modes is quantified. We show that entanglement can be straightforwardly encoded in the frequency-bin degree of freedom and propose a feasible experiment to verify entangled photon pairs. The experimental implementation is systematically analyzed, and the preferable parameter regime for entanglement verification is identified. An inequality is given as a criterion for good entanglement verification with analysis of practical imperfections.

preprint, 2020, arXiv

Multi-stage Power Scheduling Framework for Data Center with Chilled Water Storage in Energy and Regulation Markets

Leveraging electrochemical and thermal energy storage systems has been proposed as a strategy to reduce peak power in data centers. Thermal energy storage systems, such as chilled water tanks, have gained increasing attention in data centers for load shifting due to their relatively small capital and operational costs compared to electrochemical energy storage. However, there are few studies investigating the possibility of utilizing a thermal energy storage system with resources to provide ancillary services (e.g., frequency regulation) to the grid. This paper proposes a synergistic control strategy for a data center with chilled water storage that provides frequency regulation service by adjusting the chiller capacity, storage charging rate, and IT server CPU frequency. Then, a three-stage multi-market scheduling framework based on a model predictive control scheme is developed to minimize the operational costs of data centers participating in both energy and regulation markets. The framework solves a power baseline scheduling problem, a regulation reserve problem, and a real-time power signal tracking problem sequentially. Simulation results show that utilizing the thermal energy storage can increase the regulation capacity bid, reduce energy costs and demand charges, and also harvest frequency regulation revenues. Over a span of two days, the proposed multi-market scheduling framework can reduce operational costs by up to 8.8% ($1,606.4) compared to the baseline, with 0.2% ($38.7) from energy cost reduction, 6.5% ($1,179.4) from demand reduction, and 2.1% ($338.3) from regulation revenues.

preprint2020arXiv

On-chip sensing of hotspots in superconducting terahertz emitters

Intrinsic Josephson junctions in the high-temperature superconductor Bi2Sr2CaCu2O8 are known for their capability to emit high-power terahertz photons with widely tunable frequencies. Hotspots, as inhomogeneous temperature distributions across the junctions, are believed to play a critical role in synchronizing the gauge-invariant phase difference among the junctions, so as to achieve coherent strong emission. Previous optical imaging techniques have indirectly suggested that the hotspot temperature can exceed the superconductor critical temperature. However, such optical approaches often disturb the local temperature profile and are too slow for device applications. In this paper, we demonstrate an on-chip in situ sensing technique that can precisely quantify the local temperature profile. This is achieved by fabricating a series of micro "sensor" junctions on top of an "emitter" junction and measuring the critical current of the sensors versus the bias current applied to the emitter. This fully electronic on-chip design could enable efficient closed-loop control of hotspots in BSCCO junctions and significantly enhance the functionality of superconducting terahertz emitters.

preprint2020arXiv

VoteNet+ : An Improved Deep Learning Label Fusion Method for Multi-atlas Segmentation

In this work, we improve the performance of multi-atlas segmentation (MAS) by integrating the recently proposed VoteNet model with the joint label fusion (JLF) approach. Specifically, we first illustrate that using a deep convolutional neural network to predict atlas probabilities can better distinguish correct atlas labels from incorrect ones than relying on image intensity difference, as is typical in JLF. Motivated by this finding, we propose VoteNet+, an improved deep network that locally predicts the probability that an atlas label differs from the label of the target image. Furthermore, we show that JLF is more suitable for the VoteNet framework as a label fusion method than plurality voting. Lastly, we use Platt scaling to calibrate the probabilities of our new model. Results on LPBA40 3D MR brain images show that our proposed method can achieve better performance than VoteNet.

preprint2020arXiv

Waveguide cavity optomagnonics for broadband multimode microwave-to-optics conversion

Cavity optomagnonics has emerged as a promising platform for studying coherent photon-spin interactions as well as tunable microwave-to-optical conversion. However, current implementations of cavity optomagnonics in ferrimagnetic crystals remain orders of magnitude larger in volume than state-of-the-art cavity optomechanical devices, resulting in very limited magneto-optical interaction strength. Here, we demonstrate a cavity optomagnonic device based on integrated waveguides and its application to microwave-to-optical conversion. By designing a ferrimagnetic rib waveguide to support multiple magnon modes with maximal mode overlap with the optical field, we realize a magneto-optical cooperativity that is three orders of magnitude higher than previous records obtained on polished YIG spheres. Furthermore, we achieve tunable conversion of microwave photons at around 8.45 GHz to 1550 nm light with a conversion bandwidth as large as 16.1 MHz. The unique features of the system point to novel applications at the crossroads of quantum optics and magnonics.

preprint2019arXiv

Coulomb focusing in retrapped ionization with near-circularly polarized laser field

The full three-dimensional photoelectron momentum distributions of argon are measured in intense near-circularly polarized laser fields. We observe that the transverse momentum distribution of electrons ejected by a 410-nm near-circularly polarized field unexpectedly narrows with increasing laser intensity, contrary to the conventional rules predicted by adiabatic theory. By analyzing the experimentally measured momentum-resolved angular momentum distribution and the corresponding semiclassical trajectories of ejected electrons, the narrowing can be attributed to a temporary trapping, and thereby focusing, of a photoelectron by the atomic potential in a quasibound state. With the near-circularly polarized laser field, the strong Coulomb interaction with the rescattering electrons is avoided, and thus the Coulomb focusing in the retrapping process is highlighted. We believe that these findings will facilitate understanding and steering electron dynamics in Coulomb-coupled systems.

preprint2019arXiv

Efficient training and design of photonic neural network through neuroevolution

Recently, optical neural networks (ONNs) integrated in photonic chips have received extensive attention because they are expected to implement the same pattern recognition tasks as electronic platforms with high efficiency and low power consumption. However, the current lack of learning algorithms for training ONNs obstructs their further development. In this article, we propose a novel learning strategy based on neuroevolution to design and train ONNs. Two typical neuroevolution algorithms are used to determine the hyper-parameters of the ONNs and to optimize the weights (phase shifters) in the connections. To demonstrate the effectiveness of the training algorithms, the trained ONNs are applied to classification tasks on the iris plants dataset and the wine recognition dataset, and to modulation format recognition. The calculated results show that the training algorithms based on neuroevolution are competitive with other traditional learning algorithms in both accuracy and stability. Compared with previous works, we introduce an efficient training method for ONNs and demonstrate their broad application prospects in pattern recognition, reinforcement learning and so on.

preprint2019arXiv

Heralded Generation and Detection of Entangled Microwave-Optical Photon Pairs

Quantum state transfer between microwave and optical frequencies is essential for connecting superconducting quantum circuits to coherent optical systems and extending microwave quantum networks over long distances. To build such a hybrid "quantum Internet," an important experiment in the quantum regime is to entangle microwave and optical modes. Based on the model of a generic cavity electro-optomechanical system, we present a heralded scheme to generate entangled microwave-optical photon pairs, which can bypass the efficiency threshold for quantum channel capacity in direct transfer protocols. The parameter regime for entanglement verification is identified that is compatible with realistic experimental settings. Our scheme is feasible given the latest experimental progress on electro-optomechanics, and can be potentially generalized to various physical systems.

preprint2019arXiv

Machine learning and evolutionary algorithm studies of graphene metamaterials for optimized plasmon-induced transparency

Machine learning and optimization algorithms have been widely applied in the design and optimization of photonic devices. In this article, we briefly review recent progress in this field of research and show some data-driven applications (e.g. spectrum prediction, inverse design and performance optimization) for novel graphene metamaterials (GMs). The structure of the GMs is designed to achieve the wideband plasmon-induced transparency effect, which is taken as the optimization objective and can be theoretically demonstrated using the transfer matrix method. Some classical machine learning algorithms, including k-nearest neighbours, decision trees, random forests and artificial neural networks, are used to substitute for the numerical simulation in the forward spectrum prediction and to complete the inverse design of the GMs. The calculated results demonstrate that all the algorithms are effective and that the random forest has advantages in terms of accuracy and training speed. Moreover, single-objective and multi-objective optimization algorithms are used to achieve steep transmission characteristics by jointly taking many performance metrics into consideration. The maximum difference between the transmission peaks and dips in the optimized transmission spectrum can reach 0.97. In comparison to previous works, we provide guidance for the intelligent design of photonic devices and advanced materials based on machine learning and evolutionary algorithms.

preprint2019arXiv

Radiative cooling of a superconducting resonator

Cooling microwave resonators to near the quantum ground state, crucial for their operation in the quantum regime, is typically achieved by direct device refrigeration to a few tens of millikelvin. However, in quantum experiments that require high operation power, such as microwave-to-optics quantum transduction, it is desirable to operate at higher temperatures with non-negligible environmental thermal excitations, where larger cooling power is available. In this Letter, we present a radiative cooling protocol to prepare a superconducting microwave mode near its quantum ground state in spite of a warm environment temperature for the resonator. In this proof-of-concept experiment, the mode occupancy of a 10-GHz superconducting resonator thermally anchored at 1.02 K is reduced to $0.44\pm0.05$ by radiatively coupling to a 70-mK cold load. This radiative cooling scheme allows high-operation-power microwave experiments to work in the quantum regime, and opens possibilities for routing microwave quantum states to elevated temperatures.
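For context on the occupancy figure quoted above, the following minimal sketch evaluates the textbook Bose-Einstein mean occupancy of a single mode at the two temperatures mentioned in the abstract. It is only an illustration under the assumption of ideal thermal equilibrium; the function name `thermal_occupancy` is hypothetical and nothing here reproduces the experiment's analysis.

```python
import math

H = 6.62607015e-34   # Planck constant in J*s (exact SI value)
K_B = 1.380649e-23   # Boltzmann constant in J/K (exact SI value)


def thermal_occupancy(freq_hz: float, temp_k: float) -> float:
    """Bose-Einstein mean occupancy of a mode at freq_hz in equilibrium at temp_k.

    Hypothetical helper for illustration: n = 1 / (exp(h f / k_B T) - 1).
    """
    return 1.0 / math.expm1(H * freq_hz / (K_B * temp_k))


# Temperatures and frequency taken from the abstract; equilibrium is assumed.
n_warm = thermal_occupancy(10e9, 1.02)    # resonator anchored at 1.02 K
n_cold = thermal_occupancy(10e9, 0.070)   # 70-mK cold load

print(f"occupancy at 1.02 K: {n_warm:.3f}")
print(f"occupancy at 70 mK:  {n_cold:.5f}")
```

Under these assumptions the equilibrium occupancy at 1.02 K is above one photon and the cold load is very close to vacuum, so the reported value of $0.44\pm0.05$ sits between the two limits, as expected for a mode coupled to both baths.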

preprint2019arXiv

Strong coupling-enabled broadband non-reciprocity

Non-reciprocity of signal transmission enhances capacity of communication channels and protects transmission quality against possible signal instabilities, thus becoming an important component ensuring coherent information processing. However, non-reciprocal transmission requires breaking time-reversal symmetry (TRS) which poses challenges of both practical and fundamental character hindering the progress. Here we report a new scheme for achieving broadband non-reciprocity using a specially engineered hybrid microwave cavity. The TRS breaking is realized via strong coherent coupling between a selected chiral mode in the microwave cavity and a single collective spin excitation (magnon) in a ferromagnetic yttrium iron garnet (YIG) sphere. The non-reciprocity in transmission is observed spanning nearly a 0.5 GHz frequency band, which outperforms by two orders of magnitude the previously achieved bandwidths. Our findings open new directions for robust coherent information processing in a broad range of systems in both classical and quantum regimes.

preprint2018arXiv

Sentence Segmentation for Classical Chinese Based on LSTM with Radical Embedding

In this paper, we develop a sub-character-level feature embedding called radical embedding, and apply it to an LSTM model for sentence segmentation of pre-modern Chinese texts. The dataset includes over 150 classical Chinese books from 3 different dynasties and contains different literary styles. The LSTM-CRF model is a state-of-the-art method for the sequence labeling problem. Our new model adds a radical embedding component, which leads to improved performance. Experimental results on the aforementioned Chinese books demonstrate better accuracy than earlier methods on sentence segmentation, especially on Tang Epitaph texts.

preprint2016arXiv

Cavity piezomechanical strong coupling and frequency conversion on an aluminum nitride chip

Schemes to achieve strong coupling between mechanical modes of aluminum nitride microstructures and microwave cavity modes via the piezoelectric effect are proposed. We show that the strong coupling regime is feasible for an on-chip aluminum nitride device that is either enclosed by a three-dimensional microwave cavity or integrated with a superconducting coplanar resonator. Combined with optomechanics, the piezomechanical strong coupling permits coherent conversion between microwave and optical modes with high efficiency. Hence, the piezomechanical system will be an efficient transducer for applications in hybrid quantum systems.

preprint2016arXiv

Joint Representation Learning of Text and Knowledge for Knowledge Graph Completion

Joint representation learning of text and knowledge within a unified semantic space enables us to perform knowledge graph completion more accurately. In this work, we propose a novel framework to embed words, entities and relations into the same continuous vector space. In this model, both entity and relation embeddings are learned by taking knowledge graph and plain text into consideration. In experiments, we evaluate the joint learning model on three tasks including entity prediction, relation prediction and relation classification from text. The experiment results show that our model can significantly and consistently improve the performance on the three tasks as compared with other baselines.

preprint2016arXiv

Multimode Strong Coupling in Superconducting Cavity Piezo-electromechanics

High-frequency mechanical resonators with low thermal phonon occupancy are easier to prepare in the ground state by direct cryogenic cooling. Their extreme stiffness, however, poses a significant challenge for external interrogation. Here we demonstrate a superconducting cavity piezo-electromechanical system in which multiple modes of a bulk acoustic resonator oscillating at $10\,\textrm{GHz}$ are coupled to a planar microwave superconducting resonator with a cooperativity exceeding $2\times10^{3}$, deep in the strong coupling regime. By implementing a non-contact coupling scheme to reduce mechanical dissipation, the system exhibits excellent coherence, characterized by a frequency-quality factor product of $7.5\times10^{15}\,\textrm{Hz}$. Interesting dynamics of temporal oscillations of the microwave energy are observed, implying coherent conversion between phonons and photons. The demonstrated high-frequency cavity piezo-electromechanics is compatible with superconducting qubits, representing an important step towards hybrid quantum systems.
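As a rough reading of the frequency-quality factor product quoted above, the short calculation below backs out the implied mechanical quality factor. It assumes the nominal mode frequency $f = 10\,\textrm{GHz}$ stated in the abstract applies to the mode for which the product is reported; the symbols $f$ and $Q$ are introduced here only for illustration.

```latex
% Assumption: f is the nominal 10 GHz mode frequency from the abstract.
\[
  Q \;=\; \frac{f\,Q}{f}
    \;=\; \frac{7.5\times10^{15}\,\textrm{Hz}}{1\times10^{10}\,\textrm{Hz}}
    \;=\; 7.5\times10^{5},
  \qquad
  \frac{f}{Q} \;\approx\; 13\,\textrm{kHz}.
\]
```

Under that assumption the product corresponds to a quality factor of order $10^{5}$ to $10^{6}$ and a mechanical linewidth on the order of ten kilohertz.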

preprint2016arXiv

Noise-free quantum optical frequency shifting driven by mechanics

The ability to manipulate single photons is of critical importance for fundamental quantum optics studies and practical implementations of quantum communications. While extraordinary progress has been made in controlling spatial, temporal, spin and orbital angular momentum degrees of freedom, frequency-domain control of single photons so far relies on nonlinear optical effects, which have faced obstacles such as noise photons, narrow bandwidth and demanding optical filtering. Here we demonstrate the first integrated near-unity efficiency frequency manipulation of single photons, by stretching and compressing a waveguide at 8.3 billion cycles per second. Frequency shifts of up to 150 GHz at telecom wavelength are realized without measurable added noise, and the preservation of quantum coherence is verified through quantum interference between twin photons of different colors. This single-photon frequency control approach will be invaluable for increasing the channel capacity of quantum communications and compensating for frequency mismatch between quantum systems, paving the road toward hybrid quantum networks.

preprint2016arXiv

Phase-only signal reconstruction by MagnitudeCut

In this paper, we present a new algorithm, called MagnitudeCut, for recovering a signal from the phase of its Fourier transform. We cast the recovery problem as a new convex optimization problem, and then solve it by the block coordinate descent algorithm and the interior point algorithm, in which each iteration consists of matrix-vector products and inner products. We use the new method to reconstruct a set of signals and images. The simulation results reveal that the proposed MagnitudeCut method can reconstruct the original signal from fewer samples of the phase information than the Greedy algorithm and the iterative method under the same reconstruction error. Moreover, our algorithm can also reconstruct a symmetric image from its Fourier phase.

preprint2014arXiv

Microwave assisted coherent and nonlinear control in cavity piezo-optomechanical system

We present a cavity piezo-optomechanical system where microwave and optical degrees of freedom are coupled through an ultra-high frequency mechanical resonator. By utilizing the coherence among the three interacting modes, we demonstrate optical amplification, coherent absorption and a more general asymmetric Fano resonance. The strong piezoelectric drive further allows access to the large-amplitude-induced optomechanical nonlinearity, with which optical transparency at higher harmonics through multi-phonon scattering is demonstrated.

preprint2014arXiv

Performance evaluation of wavelet scattering network in image texture classification in various color spaces

Texture plays an important role in many image analysis applications. In this paper, we give a performance evaluation of color texture classification by applying a wavelet scattering network in various color spaces. Experimental results on the KTH_TIPS_COL database show that the opponent RGB based wavelet scattering network outperforms those based on other color spaces. Therefore, when dealing with the problem of color texture classification, the opponent RGB based wavelet scattering network is recommended.

preprint2014arXiv

Phase noise of self-sustained optomechanical oscillators

In this paper we present a theory that predicts the phase noise characteristics of self-sustained optomechanical oscillators. By treating the cavity optomechanical system as a feedback loop consisting of an optical cavity and a mechanical resonator, we analytically derive the transfer functions relating the amplitude/phase noise of all the relevant dynamical quantities from the quantum Langevin equations, and obtain closed-form expressions for the phase noise spectral densities contributed by thermomechanical noise, photon shot noise, and low-frequency technical laser noise. We numerically calculate the phase noise for various situations and perform a sample calculation for an experimentally demonstrated system. We also show that the presented model reduces to the well-known Leeson's phase noise model when the amplitude noise and the amplitude/phase noise inter-transfers are ignored.

preprint2012arXiv

Confirming the 115.5-day periodicity in the X-ray light curve of ULX NGC 5408 X-1

The Swift/XRT light curve of the ultraluminous X-ray (ULX) source NGC 5408 X-1 was re-analyzed with two numerical approaches not used in previous studies, the Weighted Wavelet $Z$-transform (WWZ) and CLEANest. Both techniques detected a prominent periodicity with a time scale of $115.5\pm1.5$ days, in excellent agreement with the detection of the same periodicity first reported by Strohmayer (2009). Monte Carlo simulation was employed to test the statistical confidence of the 115.5-day periodicity, yielding a statistical significance of $> 99.98\%$ (or $>3.8\sigma$). The robust detection of the 115.5-day quasi-periodic oscillations (QPOs), if due to the orbital motion of the binary, would indicate a mass of a few thousand $M_\odot$ for the central black hole, implying an intermediate-mass black hole in NGC 5408 X-1.

preprint2012arXiv

Multiple periodic oscillations in the radio light curves of NRAO 530

In this paper, the time series analysis method CLEANest is employed to search for characteristic periodicities in the radio light curves of the blazar NRAO 530 at 4.8, 8.0 and 14.5 GHz over a time baseline of three decades. Two prominent periodicities on time scales of about 6.3 and 9.5 yr are identified at all three frequencies, in agreement with previous results derived from different numerical techniques, confirming the multiplicity of the periodicities in NRAO 530. In addition to these two significant periods, there is also evidence of shorter-timescale periodicities of about 5.0 yr, 4.2 yr, 3.4 yr and 2.8 yr showing lower amplitude in the periodograms. The physical mechanisms responsible for the radio quasi-periodic oscillations and the multiplicity of the periods are discussed.

preprint2011arXiv

Estimating False Discovery Proportion Under Arbitrary Covariance Dependence

Multiple hypothesis testing is a fundamental problem in high dimensional inference, with wide applications in many scientific fields. In genome-wide association studies, tens of thousands of tests are performed simultaneously to find if any SNPs are associated with some traits and those tests are correlated. When test statistics are correlated, false discovery control becomes very challenging under arbitrary dependence. In the current paper, we propose a novel method based on principal factor approximation, which successfully subtracts the common dependence and weakens significantly the correlation structure, to deal with an arbitrary dependence structure. We derive an approximate expression for false discovery proportion (FDP) in large scale multiple testing when a common threshold is used and provide a consistent estimate of realized FDP. This result has important applications in controlling FDR and FDP. Our estimate of realized FDP compares favorably with Efron (2007)'s approach, as demonstrated in the simulated examples. Our approach is further illustrated by some real data applications. We also propose a dependence-adjusted procedure, which is more powerful than the fixed threshold procedure.

preprint2011arXiv

The effect of winning an Oscar Award on survival: Correcting for healthy performer survivor bias with a rank preserving structural accelerated failure time model

We study the causal effect of winning an Oscar Award on an actor or actress's survival. Does the increase in social rank from a performer winning an Oscar increase the performer's life expectancy? Previous studies of this issue have suffered from healthy performer survivor bias, that is, candidates who are healthier will be able to act in more films and have more chance to win Oscar Awards. To correct this bias, we adapt Robins' rank preserving structural accelerated failure time model and $g$-estimation method. We show in simulation studies that this approach corrects the bias contained in previous studies. We estimate that the effect of winning an Oscar Award on survival is 4.2 years, with a 95% confidence interval of $[-0.4,8.4]$ years. There is not strong evidence that winning an Oscar increases life expectancy.

preprint2010arXiv

Control of the False Discovery Rate Under Arbitrary Covariance Dependence

Multiple hypothesis testing is a fundamental problem in high dimensional inference, with wide applications in many scientific fields. In genome-wide association studies, tens of thousands of tests are performed simultaneously to find if any genes are associated with some traits and those tests are correlated. When test statistics are correlated, false discovery control becomes very challenging under arbitrary dependence. In the current paper, we propose a new methodology based on principal factor approximation, which successfully subtracts the common dependence and weakens significantly the correlation structure, to deal with an arbitrary dependence structure. We derive the theoretical distribution for false discovery proportion (FDP) in large scale multiple testing when a common threshold is used and provide a consistent estimate of FDP. This result has important applications in controlling FDR and FDP. Our estimate of FDP compares favorably with Efron (2007)'s approach, as demonstrated in the simulated examples. Our approach is further illustrated by some real data applications.

48 published item(s)

MiniCPM-o 4.5: Towards Real-Time Full-Duplex Omni-Modal Interaction

A Roadmap for Big Model

Atomic-Scale Visualization of Chiral Charge Density Wave States and Their Reversible Transition

Exfoliation of 2D van der Waals crystals in ultrahigh vacuum for interface engineering

Fully Hyperbolic Neural Networks

Knowledge Inheritance for Pre-trained Language Models

Meta Reinforcement Learning with Successor Feature Based Context

One-step exfoliation method for plasmonic activation of large-area 2D crystals

OPV2V: An Open Benchmark Dataset and Fusion Pipeline for Perception with Vehicle-to-Vehicle Communication

PPT: Pre-trained Prompt Tuning for Few-shot Learning

Predicting Physics in Mesh-reduced Space with Temporal Attention

Quantum transduction with microwave and optical entanglement

Sampling-based Fast Gradient Rescaling Method for Highly Transferable Adversarial Attacks

Single electrons on solid neon as a solid-state qubit platform

Skilled Mutual Fund Selection: False Discovery Control under Dependence

Superconducting cavity piezo-electromechanics: the realization of an acoustic frequency comb at microwave frequencies

Bridging the gap between atomically thin semiconductors and metal leads

Identity-aware Facial Expression Recognition in Compressed Video

Inverse Faraday Effect in an Optomagnonic Waveguide

A Deep Network for Joint Registration and Reconstruction of Images with Pathologies

An Upper Bound for Functions of Estimators in High Dimensions

Cavity piezo-mechanics for superconducting-nanophotonic quantum interface

Magnon-photon strong coupling for tunable microwave circulators

Microwave-Optical entanglement in a Strongly Coupled Electro-Optomechanical System

Multi-stage Power Scheduling Framework for Data Center with Chilled Water Storage in Energy and Regulation Markets

On-chip sensing of hotspots in superconducting terahertz emitters

VoteNet+ : An Improved Deep Learning Label Fusion Method for Multi-atlas Segmentation

Waveguide cavity optomagnonics for broadband multimode microwave-to-optics conversion

Coulomb focusing in retrapped ionization with near-circularly polarized laser field

Efficient training and design of photonic neural network through neuroevolution

Heralded Generation and Detection of Entangled Microwave-Optical Photon Pairs

Machine learning and evolutionary algorithm studies of graphene metamaterials for optimized plasmon-induced transparency

Radiative cooling of a superconducting resonator

Strong coupling-enabled broadband non-reciprocity

Sentence Segmentation for Classical Chinese Based on LSTM with Radical Embedding

Cavity piezomechanical strong coupling and frequency conversion on an aluminum nitride chip

Joint Representation Learning of Text and Knowledge for Knowledge Graph Completion

Multimode Strong Coupling in Superconducting Cavity Piezo-electromechanics

Noise-free quantum optical frequency shifting driven by mechanics

Phase-only signal reconstruction by MagnitudeCut

Microwave assisted coherent and nonlinear control in cavity piezo-optomechanical system

Performance evaluation of wavelet scattering network in image texture classification in various color spaces

Phase noise of self-sustained optomechanical oscillators

Confirming the 115.5-day periodicity in the X-ray light curve of ULX NGC 5408 X-1

Multiple periodic oscillations in the radio light curves of NRAO 530

Estimating False Discovery Proportion Under Arbitrary Covariance Dependence

The effect of winning an Oscar Award on survival: Correcting for healthy performer survivor bias with a rank preserving structural accelerated failure time model

Control of the False Discovery Rate Under Arbitrary Covariance Dependence