Source author record

Peng Zhou

Peng Zhou appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher · Unclaimed source record

Catalog footprint

What is connected

40 works

32 topics

4 close collaborators

preprint · 2026 · arXiv

Non-Prehensile Tool-Object Manipulation by Integrating LLM-Based Planning and Manoeuvrability-Driven Controls

The ability to wield tools was once considered exclusive to human intelligence, but it's now known that many other animals, like crows, possess this capability. Yet, robotic systems still fall short of matching biological dexterity. In this paper, we investigate the use of Large Language Models (LLMs), tool affordances, and object manoeuvrability for non-prehensile tool-based manipulation tasks. Our novel method leverages LLMs based on scene information and natural language instructions to enable symbolic task planning for tool-object manipulation. This approach allows the system to convert a human language sentence into a sequence of feasible motion functions. We have developed a novel manoeuvrability-driven controller using a new tool affordance model derived from visual feedback. This controller helps guide the robot's tool utilization and manipulation actions, even within confined areas, using a stepping incremental approach. The proposed methodology is evaluated with experiments to prove its effectiveness under various manipulation scenarios.

preprint · 2026 · arXiv

UniBiDex: A Unified Teleoperation Framework for Robotic Bimanual Dexterous Manipulation

We present UniBiDex, a unified teleoperation framework for robotic bimanual dexterous manipulation that supports both VR-based and leader-follower input modalities. UniBiDex enables real-time, contact-rich dual-arm teleoperation by integrating heterogeneous input devices into a shared control stack with consistent kinematic treatment and safety guarantees. The framework employs null-space control to optimize bimanual configurations, ensuring smooth, collision-free, and singularity-aware motion across tasks. We validate UniBiDex on a long-horizon kitchen-tidying task involving five sequential manipulation subtasks, demonstrating higher task success rates, smoother trajectories, and improved robustness compared to strong baselines. By releasing all hardware and software components as open-source, we aim to lower the barrier to collecting large-scale, high-quality human demonstration datasets and accelerate progress in robot learning.

preprint · 2023 · arXiv

Resolving Task Confusion in Dynamic Expansion Architectures for Class Incremental Learning

The dynamic expansion architecture is becoming popular in class incremental learning, mainly due to its advantages in alleviating catastrophic forgetting. However, task confusion is not well assessed within this framework, e.g., the discrepancy between classes of different tasks is not well learned (i.e., inter-task confusion, ITC), and certain priority is still given to the latest class batch (i.e., old-new confusion, ONC). We empirically validate the side effects of the two types of confusion. Meanwhile, a novel solution called Task Correlated Incremental Learning (TCIL) is proposed to encourage discriminative and fair feature utilization across tasks. TCIL performs a multi-level knowledge distillation to propagate knowledge learned from old tasks to the new one. It establishes information flow paths at both feature and logit levels, enabling the learning to be aware of old classes. Besides, an attention mechanism and classifier re-scoring are applied to generate fairer classification scores. We conduct extensive experiments on CIFAR100 and ImageNet100 datasets. The results demonstrate that TCIL consistently achieves state-of-the-art accuracy. It mitigates both ITC and ONC, while showing advantages in combating catastrophic forgetting even when no rehearsal memory is reserved.

preprint · 2022 · arXiv

A Dual-Arm Collaborative Framework for Dexterous Manipulation in Unstructured Environments with Contrastive Planning

Most object manipulation strategies for robots are based on the assumption that the object is rigid (i.e., with fixed geometry) and the goal's details have been fully specified (e.g., the exact target pose). However, there are many tasks that involve spatial relations in human environments where these conditions may be hard to satisfy, e.g., bending and placing a cable inside an unknown container. To develop advanced robotic manipulation capabilities in unstructured environments that avoid these assumptions, we propose a novel long-horizon framework that exploits contrastive planning in finding promising collaborative actions. Using simulation data collected by random actions, we learn an embedding model in a contrastive manner that encodes the spatio-temporal information from successful experiences, which facilitates the subgoal planning through clustering in the latent space. Based on the keypoint correspondence-based action parameterization, we design a leader-follower control scheme for the collaboration between dual arms. All models of our policy are automatically trained in simulation and can be directly transferred to real-world environments. To validate the proposed framework, we conduct a detailed experimental study on a complex scenario subject to environmental and reachability constraints in both simulation and real environments.

preprint · 2022 · arXiv

A Fully Memristive Spiking Neural Network with Unsupervised Learning

We present a fully memristive spiking neural network (MSNN) consisting of physically-realizable memristive neurons and memristive synapses to implement an unsupervised Spiking Time Dependent Plasticity (STDP) learning rule. The system is fully memristive in that both neuronal and synaptic dynamics can be realized by using memristors. The neuron is implemented using the SPICE-level memristive integrate-and-fire (MIF) model, which consists of a minimal number of circuit elements necessary to achieve distinct depolarization, hyperpolarization, and repolarization voltage waveforms. The proposed MSNN uniquely implements STDP learning by using cumulative weight changes in memristive synapses from the voltage waveform changes across the synapses, which arise from the presynaptic and postsynaptic spiking voltage signals during the training process. Two types of MSNN architectures are investigated: 1) a biologically plausible memory retrieval system, and 2) a multi-class classification system. Our circuit simulation results verify the MSNN's unsupervised learning efficacy by replicating biological memory retrieval mechanisms, and achieving 97.5% accuracy in a 4-pattern recognition problem in a large scale discriminative MSNN.

preprint · 2022 · arXiv

A Multi-Sensor Interface to Improve the Learning Experience in Arc Welding Training Tasks

This paper presents the development of a multi-sensor user interface to facilitate the instruction of arc welding tasks. Traditional methods to acquire hand-eye coordination skills are typically conducted through one-to-one instruction where trainees must wear protective helmets and conduct several tests. This approach is inefficient as the harmful light emitted from the electric arc impedes the close monitoring of the process; practitioners can only observe a small bright spot. To tackle these problems, recent training approaches have leveraged virtual reality to safely simulate the process and visualize the geometry of the workpieces. However, the synthetic nature of these types of simulation platforms reduces their effectiveness as they fail to capture actual welding interactions with the environment, which hinders the trainees' learning process. To provide users with a real welding experience, we have developed a new multi-sensor extended reality platform for arc welding training. Our system is composed of: (1) an HDR camera, monitoring the real welding spot in real-time; (2) a depth sensor, capturing the 3D geometry of the scene; and (3) a head-mounted VR display, visualizing the process safely. Our innovative platform provides users with a "bot trainer", virtual cues of the seam geometry, automatic spot tracking, and performance scores. To validate the platform's feasibility, we conduct extensive experiments with several welding training tasks. We show that compared with the traditional training practice and recent virtual reality approaches, our automated multi-sensor method achieves better performance in terms of accuracy, learning curve, and effectiveness.

preprint · 2022 · arXiv

Gradient-based Neuromorphic Learning on Dynamical RRAM Arrays

We present MEMprop, the adoption of gradient-based learning to train fully memristive spiking neural networks (MSNNs). Our approach harnesses intrinsic device dynamics to trigger naturally arising voltage spikes. These spikes emitted by memristive dynamics are analog in nature, and thus fully differentiable, which eliminates the need for surrogate gradient methods that are prevalent in the spiking neural network (SNN) literature. Memristive neural networks typically either integrate memristors as synapses that map offline-trained networks, or otherwise rely on associative learning mechanisms to train networks of memristive neurons. We instead apply the backpropagation through time (BPTT) training algorithm directly on analog SPICE models of memristive neurons and synapses. Our implementation is fully memristive, in that synaptic weights and spiking neurons are both integrated on resistive RAM (RRAM) arrays without the need for additional circuits to implement spiking dynamics, e.g., analog-to-digital converters (ADCs) or thresholded comparators. As a result, higher-order electrophysical effects are fully exploited to use the state-driven dynamics of memristive neurons at run time. By moving towards non-approximate gradient-based learning, we obtain highly competitive accuracy amongst previously reported lightweight dense fully MSNNs on several benchmarks.

preprint · 2022 · arXiv

Logical and Physical Reversibility of Conservative Skyrmion Logic

Magnetic skyrmions are nanoscale whirls of magnetism that can be propagated with electrical currents. The repulsion between skyrmions inspires their use for reversible computing based on the elastic billiard ball collisions proposed for conservative logic in 1982. Here we evaluate the logical and physical reversibility of this skyrmion logic paradigm, as well as the limitations that must be addressed before dissipation-free computation can be realized.

preprint · 2022 · arXiv

Model Predictive Manipulation of Compliant Objects with Multi-Objective Optimizer and Adversarial Network for Occlusion Compensation

The robotic manipulation of compliant objects is currently one of the most active problems in robotics due to its potential to automate many important applications. Despite the progress achieved by the robotics community in recent years, the 3D shaping of these types of materials remains an open research problem. In this paper, we propose a new vision-based controller to automatically regulate the shape of compliant objects with robotic arms. Our method uses an efficient online surface/curve fitting algorithm that quantifies the object's geometry with a compact vector of features; this feedback-like vector makes it possible to establish an explicit shape servo-loop. To coordinate the motion of the robot with the computed shape features, we propose a receding-time estimator that approximates the system's sensorimotor model while satisfying various performance criteria. A deep adversarial network is developed to robustly compensate for visual occlusions in the camera's field of view, which makes it possible to guide the shaping task even with partial observations of the object. Model predictive control is utilized to compute the robot's shaping motions subject to workspace and saturation constraints. A detailed experimental study is presented to validate the effectiveness of the proposed control framework.

preprint · 2022 · arXiv

SPICEprop: Backpropagating Errors Through Memristive Spiking Neural Networks

We present a fully memristive spiking neural network (MSNN) consisting of novel memristive neurons trained using the backpropagation through time (BPTT) learning rule. Gradient descent is applied directly to the memristive integrated-and-fire (MIF) neuron designed using analog SPICE circuit models, which generates distinct depolarization, hyperpolarization, and repolarization voltage waveforms. Synaptic weights are trained by BPTT using the membrane potential of the MIF neuron model and can be processed on memristive crossbars. The natural spiking dynamics of the MIF neuron model are fully differentiable, eliminating the need for gradient approximations that are prevalent in the spiking neural network literature. Despite the added complexity of training directly on SPICE circuit models, we achieve 97.58% accuracy on the MNIST testing dataset and 75.26% on the Fashion-MNIST testing dataset, the highest accuracies among all fully MSNNs.

preprint · 2022 · arXiv

WE model: A Machine Learning Model Based on Data-Driven Movie Derivatives Market Prediction

The mature development and the extension of the industry chain make the income structure of the film industry. The income of the traditional film industry depends on the box office and also includes movie merchandising, advertisement, home entertainment, book sales etc. Movie merchandising can even become more profitable than the box office. Therefore, market analysis and forecasting methods for multi-feature merchandising of multi-type films are particularly important. Traditional market research is time-consuming and labour-intensive, and its practical value is restricted. Due to the limited research method, more effective predictive analysis technology needs to be formed. With the rapid development of machine learning and big data, a large number of machine learning algorithms for predictive regression and classification recognition have been proposed and widely used in product design and industry analysis. This paper proposes a high-precision movie merchandising prediction model based on machine learning technology: WE model. This model integrates three machine learning algorithms to accurately predict the movie merchandising market. The WE model learns the relationship between the movie merchandising market and movie features by analyzing the main feature information of movies. After testing, the accuracy rate of prediction and evaluation in the merchandising market reaches 72.5%, and it has achieved a strong market control effect.

preprint · 2021 · arXiv

Deep Video Inpainting Detection

This paper studies video inpainting detection, which localizes an inpainted region in a video both spatially and temporally. In particular, we introduce VIDNet, Video Inpainting Detection Network, which contains a two-stream encoder-decoder architecture with attention module. To reveal artifacts encoded in compression, VIDNet additionally takes in Error Level Analysis frames to augment RGB frames, producing multimodal features at different levels with an encoder. Exploring spatial and temporal relationships, these features are further decoded by a Convolutional LSTM to predict masks of inpainted regions. In addition, when detecting whether a pixel is inpainted or not, we present a quad-directional local attention module that borrows information from its surrounding pixels from four directions. Extensive experiments are conducted to validate our approach. We demonstrate, among other things, that VIDNet not only outperforms by clear margins alternative inpainting detection methods but also generalizes well on novel videos that are unseen during training.

preprint · 2021 · arXiv

On the Coupling Effects between Elastic and Electromagnetic Fields from the Perspective of Conservation of Energy

Coupling effects among different physical fields substantially reflect the conversion of energies from one form into another. For simple physical processes, their governing or constitutive equations all satisfy the law of conservation of energy. Then, analysis is extended to coupling effects. First, it is found for the linear direct and converse piezoelectric and piezomagnetic effects, their constitutive equations guarantee that the total energy is conserved during the process of energy conversion between the elastic and electromagnetic fields; however, energies are converted via work terms, $(β_{ijk} E_i )_{,k} v_j$ and $(γ_{ijk} H_i)_{,k} v_j$, rather than via energy terms, $β_{ijk} E_i e_{jk}$ and $γ_{ijk} H_i e_{jk}$. Second, for the generalized Villari effects, the electromagnetic energy can be treated as an extra contribution to the generalized elastic energy. Third, for electrostriction and magnetostriction, it is argued both effects are induced by the Maxwell stress; moreover, their energy is purely electromagnetic and thus both have no converse effects. During these processes, energy can be converted in three different ways, i.e., via non-potential forces, via cross-dependence of energy terms and directly via the electromagnetic interactions of ions and electrons. In the end, general coupling processes which involve elastic, electromagnetic fields and diffusion are also analyzed. The advantage of using this energy formulation is that it facilitates discussions of the conversion of energies and provides better physical insights into the mechanisms of these coupling effects.

preprint · 2020 · arXiv

DeepStrip: High Resolution Boundary Refinement

In this paper, we target refining the boundaries in high resolution images given low resolution masks. For memory and computation efficiency, we propose to convert the regions of interest into strip images and compute a boundary prediction in the strip domain. To detect the target boundary, we present a framework with two prediction layers. First, all potential boundaries are predicted as an initial prediction and then a selection layer is used to pick the target boundary and smooth the result. To encourage accurate prediction, a loss which measures the boundary distance in the strip domain is introduced. In addition, we enforce a matching consistency and C0 continuity regularization to the network to reduce false alarms. Extensive experiments on both public and a newly created high resolution dataset strongly validate our approach.

preprint · 2020 · arXiv

FastBERT: a Self-distilling BERT with Adaptive Inference Time

Pre-trained language models like BERT have proven to be highly performant. However, they are often computationally expensive in many practical scenarios, for such heavy models can hardly be readily implemented with limited resources. To improve their efficiency with an assured model performance, we propose a novel speed-tunable FastBERT with adaptive inference time. The speed at inference can be flexibly adjusted under varying demands, while redundant calculation of samples is avoided. Moreover, this model adopts a unique self-distillation mechanism at fine-tuning, further enabling a greater computational efficacy with minimal loss in performance. Our model achieves promising results in twelve English and Chinese datasets. It is able to speed up by a wide range from 1 to 12 times than BERT if given different speedup thresholds to make a speed-performance tradeoff.

preprint · 2020 · arXiv

Inclusive GAN: Improving Data and Minority Coverage in Generative Models

Generative Adversarial Networks (GANs) have brought about rapid progress towards generating photorealistic images. Yet the equitable allocation of their modeling capacity among subgroups has received less attention, which could lead to potential biases against underrepresented minorities if left uncontrolled. In this work, we first formalize the problem of minority inclusion as one of data coverage, and then propose to improve data coverage by harmonizing adversarial training with reconstructive generation. The experiments show that our method outperforms the existing state-of-the-art methods in terms of data coverage on both seen and unseen data. We develop an extension that allows explicit control over the minority subgroups that the model should ensure to include, and validate its effectiveness at little compromise from the overall performance on the entire dataset. Code, models, and supplemental videos are available at GitHub.

preprint · 2020 · arXiv

Log D-modules and Index theorems

We study log D-modules on smooth log pairs and construct a comparison theorem of log de Rham complexes. The proof uses Sabbah's generalized b-functions. As applications, we deduce a log index theorem and a Riemann-Roch type formula for perverse sheaves on smooth quasi-projective varieties. The log index theorem naturally generalizes the Dubson-Kashiwara index theorem on smooth projective varieties.

preprint · 2020 · arXiv

M2KD: Multi-model and Multi-level Knowledge Distillation for Incremental Learning

Incremental learning targets at achieving good performance on new categories without forgetting old ones. Knowledge distillation has been shown critical in preserving the performance on old classes. Conventional methods, however, sequentially distill knowledge only from the last model, leading to performance degradation on the old classes in later incremental learning steps. In this paper, we propose a multi-model and multi-level knowledge distillation strategy. Instead of sequentially distilling knowledge only from the last model, we directly leverage all previous model snapshots. In addition, we incorporate an auxiliary distillation to further preserve knowledge encoded at the intermediate feature levels. To make the model more memory efficient, we adapt mask based pruning to reconstruct all previous models with a small memory footprint. Experiments on standard incremental learning benchmarks show that our method preserves the knowledge on old classes better and improves the overall performance over standard distillation techniques.

preprint · 2020 · arXiv

Occlusion-Adaptive Deep Network for Robust Facial Expression Recognition

Recognizing the expressions of partially occluded faces is a challenging computer vision problem. Previous expression recognition methods either overlooked this issue or resolved it using extreme assumptions. Motivated by the fact that the human visual system is adept at ignoring occlusion and focusing on non-occluded facial areas, we propose a landmark-guided attention branch to find and discard corrupted features from occluded regions so that they are not used for recognition. An attention map is first generated to indicate if a specific facial part is occluded and guide our model to attend to non-occluded regions. To further improve robustness, we propose a facial region branch to partition the feature maps into non-overlapping facial blocks and task each block to predict the expression independently. This results in more diverse and discriminative features, enabling the expression recognition system to recover even though the face is partially occluded. Depending on the synergistic effects of the two branches, our occlusion-adaptive deep network significantly outperforms state-of-the-art methods on two challenging in-the-wild benchmark datasets and three real-world occluded expression datasets.

preprint · 2020 · arXiv

Quasi-parallel X-ray microbeam obtained using a parabolic monocapillary X-ray lens with an embedded square-shaped lead occluder

A parabolic monocapillary X-ray lens (PMXRL) is designed to effectively constrain a laboratory point X-ray source into a parallel beam. A square-shaped lead occluder (SSLO) is used to block direct X-rays in the PMXRL. To design the PMXRL, we use Python to simulate the conic parameter (p = 0.001 mm) of the lens and then use a drawing machine to draw a corresponding lens (p = 0.000939 mm) with a total length of 60.8 mm. We place the SSLO at the lens inlet for optical testing. The results show that the controlled outgoing beam has a divergence of less than 0.4 mrad in the range of 15-45 mm of the lens outlet, which achieves excellent optical performance in X-ray imaging methodology. The design details are reported in this paper.

preprint · 2020 · arXiv

Reservoir Computing with Planar Nanomagnet Arrays

Reservoir computing is an emerging methodology for neuromorphic computing that is especially well-suited for hardware implementations in size, weight, and power (SWaP) constrained environments. This work proposes a novel hardware implementation of a reservoir computer using a planar nanomagnet array. A small nanomagnet reservoir is demonstrated via micromagnetic simulations to be able to identify simple waveforms with 100% accuracy. Planar nanomagnet reservoirs are a promising new solution to the growing need for dedicated neuromorphic hardware.

preprint · 2018 · arXiv

Central Limit theorem for toric Kähler manifolds

Associated to the Bergman kernels of a polarized toric Kähler manifold $(M, ω, L, h)$ are sequences of measures $\{μ_k^z\}_{k=1}^{\infty}$ parametrized by the points $z \in M$. For each $z$ in the open orbit, we prove a central limit theorem for $μ_k^z$. The center of mass of $μ_k^z$ is the image of $z$ under the moment map; after re-centering at $0$ and dilating by $\sqrt{k}$, the re-normalized measure tends to a centered Gaussian whose variance is the Hessian of the Kähler potential at $z$. We further give a remainder estimate of Berry-Esseen type. The sequence $\{μ_k^z\}$ is generally not a sequence of convolution powers and the proofs only involve Kähler analysis.

preprint · 2018 · arXiv

Interface asymptotics of Partial Bergman kernels around a critical level

In a recent series of articles (arXiv:1604.06655, arXiv:1708.09267), the authors have studied the transition behavior of partial Bergman kernels $Π_{k, [E_1, E_2]}(z,w)$ and the associated DOS (density of states) $Π_{k, [E_1, E_2]}(z)$ across the interface $\mathcal{C}$ between the allowed and forbidden regions. Partial Bergman kernels are Toeplitz Hamiltonians quantizing Morse functions $H: M \to \mathbb{R}$ on a Kähler manifold. The allowed region is $H^{-1}([E_1, E_2])$ and the interface $\mathcal{C}$ is its boundary. In prior articles it was assumed that the endpoints $E_j$ were regular values of $H$. This article completes the series by giving parallel results when an endpoint is a critical value of $H$. In place of the Erf scaling asymptotics in a $k^{-\frac{1}{2}}$ tube around $\mathcal{C}$ for regular interfaces, one obtains $δ$-asymptotics in $k^{-\frac{1}{4}}$-tubes around singular points of a critical interface. In $k^{-\frac{1}{2}}$ tubes, the transition law is given by the osculating metaplectic propagator.

preprint · 2017 · arXiv

Interface asymptotics of partial Bergman kernels on $S^1$-symmetric Kaehler manifolds

This article is concerned with asymptotics of equivariant Bergman kernels and partial Bergman kernels for polarized projective Kahler manifolds invariant under a Hamiltonian holomorphic $S^1$ action. Asymptotics of partial Bergman kernel are obtained in the allowed region $\mathcal{A}$ resp. forbidden region $\mathcal{F}$, generalizing results of Shiffman-Zelditch, Shiffman-Tate-Zelditch and Pokorny-Singer for toric Kahler manifolds. The main result gives scaling asymptotics of equivariant Bergman kernels and partial Bergman kernels in the transition region around the interface $\partial \mathcal{A}$, generalizing recent work of Ross-Singer on partial Bergman kernels, and refining the Ross-Singer transition asymptotics to apply to equivariant Bergman kernels.

preprint · 2017 · arXiv

Twisted Polytope Sheaves and Coherent-Constructible Correspondence for Toric Varieties

Given a smooth projective toric variety $X_Σ$ of complex dimension $n$, Fang-Liu-Treumann-Zaslow [FLTZ] showed that there is a quasi-embedding of the differential graded (dg) derived category of coherent sheaves $Coh(X_Σ)$ into the dg derived category of constructible sheaves on a torus $Sh(T^n, Λ_Σ)$. Recently, Kuwagaki [Ku2] proved that the quasi-embedding is a quasi-equivalence, and generalized the result to toric stacks. Here we give a different proof in the smooth projective case, using non-characteristic deformation of sheaves to find twisted polytope sheaves that co-represent the stalk functors.

preprint · 2016 · arXiv

Arrayed van der Waals Vertical Heterostructures based on 2D GaSe Grown by Molecular Beam Epitaxy

Vertically stacking two-dimensional (2D) materials can enable the design of novel electronic and optoelectronic devices and realize complex functionality. However, the fabrication of such artificial heterostructures in wafer scale with an atomically-sharp interface poses an unprecedented challenge. Here, we demonstrate a convenient and controllable approach for the production of wafer-scale 2D GaSe thin films by molecular beam epitaxy. In-situ reflection high-energy electron diffraction oscillations and Raman spectroscopy reveal a layer-by-layer van der Waals epitaxial growth mode. Highly-efficient photodetector arrays were fabricated based on few-layer GaSe on Si. These photodiodes show steady rectifying characteristics and a relatively high external quantum efficiency of 23.6%. The resultant photoresponse is super-fast and robust with a response time of 60 μs. Importantly, the device shows no sign of degradation after 1 million cycles of operation. Our study establishes a new approach to produce controllable, robust and large-area 2D heterostructures and presents a crucial step for further practical applications.

preprint · 2016 · arXiv

Scaling of Harmonic Oscillator Eigenfunctions and Their Nodal Sets Around the Caustic

We study the scaling asymptotics of the eigenspace projection kernels $Π_{\hbar, E}(x,y)$ of the isotropic Harmonic Oscillator $- \hbar ^2 Δ+ |x|^2$ of eigenvalue $E = \hbar(N + \frac{d}{2})$ in the semi-classical limit $\hbar \to 0$. The principal result is an explicit formula for the scaling asymptotics of $Π_{\hbar, E}(x,y)$ for $x,y$ in a $\hbar^{2/3}$ neighborhood of the caustic $\mathcal C_E$ as $\hbar \to 0.$ The scaling asymptotics are applied to the distribution of nodal sets of Gaussian random eigenfunctions around the caustic as $\hbar \to 0$. In previous work we proved that the density of zeros of Gaussian random eigenfunctions of $\hat{H}_{\hbar}$ have different orders in the Planck constant $\hbar$ in the allowed and forbidden regions: In the allowed region the density is of order $\hbar^{-1}$ while it is $\hbar^{-1/2}$ in the forbidden region. Our main result on nodal sets is that the density of zeros is of order $\hbar^{-\frac{2}{3}}$ in an $\hbar^{\frac{2}{3}}$-tube around the caustic. This tube radius is the `critical radius'. For annuli of larger inner and outer radii $\hbar^α$ with $0< α< \frac{2}{3}$ we obtain density results which interpolate between this critical radius result and our prior ones in the allowed and forbidden region. We also show that the Hausdorff $(d-2)$-dimensional measure of the intersection of the nodal set with the caustic is of order $\hbar^{- \frac{2}{3}}$.

preprint · 2016 · arXiv

Text Classification Improved by Integrating Bidirectional LSTM with Two-dimensional Max Pooling

Recurrent Neural Network (RNN) is one of the most popular architectures used in Natural Language Processing (NLP) tasks because its recurrent structure is very suitable to process variable-length text. RNN can utilize distributed representations of words by first converting the tokens comprising each text into vectors, which form a matrix. And this matrix includes two dimensions: the time-step dimension and the feature vector dimension. Then most existing models usually utilize one-dimensional (1D) max pooling operation or attention-based operation only on the time-step dimension to obtain a fixed-length vector. However, the features on the feature vector dimension are not mutually independent, and simply applying 1D pooling operation over the time-step dimension independently may destroy the structure of the feature representation. On the other hand, applying two-dimensional (2D) pooling operation over the two dimensions may sample more meaningful features for sequence modeling tasks. To integrate the features on both dimensions of the matrix, this paper explores applying 2D max pooling operation to obtain a fixed-length representation of the text. This paper also utilizes 2D convolution to sample more meaningful information of the matrix. Experiments are conducted on six text classification tasks, including sentiment analysis, question classification, subjectivity classification and newsgroup classification. Compared with the state-of-the-art models, the proposed models achieve excellent performance on 4 out of 6 tasks. Specifically, one of the proposed models achieves highest accuracy on Stanford Sentiment Treebank binary classification and fine-grained classification tasks.

preprint2016arXiv

Wafer-scale arrayed p-n junctions based on few-layer epitaxial GaTe

Two-dimensional (2D) materials have attracted substantial attention in electronic and optoelectronic applications with superior advantages of being flexible, transparent and highly tunable. Gapless graphene exhibits ultra-broadband and fast photoresponse while the 2D semiconducting MoS2 and GaTe unveil high sensitivity and tunable responsivity to visible light. However, the device yield and the repeatability call for a further improvement of the 2D materials to render large-scale uniformity. Here we report a layer-by-layer growth of wafer-scale GaTe with a hole mobility of 28.4 cm²/Vs by molecular beam epitaxy. The arrayed p-n junctions were developed by growing few-layer GaTe directly on three-inch Si wafers. The resultant diodes reveal good rectifying characteristics, photoresponse with a maximum photoresponsivity of 2.74 A/W and a high photovoltaic external quantum efficiency up to 62%. The photocurrent reaches saturation fast enough to capture a time constant of 22 μs and shows no sign of device degradation after 1.37 million cycles of operation. Most strikingly, such high performance has been achieved across the entire wafer, making the volume production of devices accessible. Finally, several photo-images were acquired by the GaTe/Si photodiodes with a reasonable contrast and spatial resolution, demonstrating for the first time the potential of integrating the 2D materials with the silicon technology for novel optoelectronic devices.

preprint2015arXiv

Magnetotransport properties of Cd3As2 nanostructures

The three-dimensional (3D) topological Dirac semimetal is a new kind of material that has a linear energy dispersion in 3D momentum space and can be viewed as an analog of graphene. Extensive efforts have been devoted to the understanding of bulk materials, yet it remains a challenge to explore the intriguing physics in low-dimensional Dirac semimetals. Here, we report on the synthesis of Cd3As2 nanowires and nanobelts and a systematic investigation of their magnetotransport properties. Temperature-dependent ambipolar behavior is clearly demonstrated, suggesting the presence of a finite-size bandgap in nanowires. Cd3As2 nanobelts, however, exhibit metallic characteristics with a high carrier mobility exceeding 32,000 cm² V⁻¹ s⁻¹ and pronounced anomalous double-period Shubnikov-de Haas (SdH) oscillations. Unlike the bulk counterpart, the Cd3As2 nanobelts reveal the possibility of an unusual change of the Fermi sphere owing to the suppression of the dimensionality. More importantly, their SdH oscillations can be effectively tuned by the gate voltage. The successful synthesis of Cd3As2 nanostructures and their rich physics open up exciting nanoelectronic applications of 3D Dirac semimetals.

preprint2014arXiv

Analysis on Irreversible Processes using the Phase-Field Variational Approach with the Entropy or Energy Functional

The variational approach usually used in phase field models (PFVA) is applied here to analyse complex irreversible processes such as thermoelectric (TE) effects and thermally driven mass transport (TDMT). Complex irreversible processes arise from the coupling effects between simple irreversible processes. Each simple irreversible process is associated with an entropy or energy density function. During complex irreversible processes with multiple fields present, this entropy or energy density function is assumed to be dependent on all independent field variables. Using the total entropy functionals, the TE effects and TDMT are analysed and important kinetic coefficients such as the Seebeck coefficient and the heat of transport are determined with straightforward physical contents. Using the total energy functionals, the linear irreversible processes are analysed with the Onsager approach and the nonlinear irreversible processes with PFVA. It is found that both Onsager's relations and the fluxes defined using PFVA guarantee the satisfaction of the first law of thermodynamics during the process of conversion of energies. To analyse the diffusion process under the influence of elasticity, PFVA is also modified to incorporate the reversible evolution of elastic fields. It is shown that energies are conserved via both the irreversible diffusion process and the reversible evolution of the elastic fields. Finally, PFVA is generalized to study nonequilibrium thermodynamics using an extra kinetic contribution to the entropy density function. The analyses can be extended to a nonequilibrium thermodynamic system with multiple physical fields present. Thus, it is believed that PFVA has the potential of not only significantly advancing our understanding of the thermodynamics of irreversible processes, but also making thermodynamics as a discipline and the study of it truly dynamic.

preprint2014arXiv

Tunable charge-trap memory based on few-layer MoS2

Charge-trap memory with high-κ dielectric materials is considered to be a promising candidate for next-generation memory devices. Ultrathin layered two-dimensional (2D) materials like graphene and MoS2 have been receiving much attention because of their novel physical properties and potential applications in electronic devices. Here, we report on a dual-gate charge-trap memory device composed of a few-layer MoS2 channel and a three-dimensional (3D) Al2O3/HfO2/Al2O3 charge-trap gate stack. Owing to the extraordinary trapping ability of both electrons and holes in HfO2, the MoS2 memory device exhibits an unprecedented memory window exceeding 20 V. More importantly, with a back gate the window size can be effectively tuned from 15.6 to 21 V; the program/erase current ratio can reach up to 10⁴, far beyond Si-based flash memory, which allows for multi-bit information storage. Furthermore, the device shows a high mobility of 170 cm² V⁻¹ s⁻¹, a good endurance of hundreds of cycles and a stable retention of ~28% charge loss after 10 years, which is drastically lower than in any previously reported MoS2 flash memory. The combination of 2D materials with traditional high-κ charge-trap gate stacks opens up an exciting field of novel nonvolatile memory devices.

preprint2013arXiv

Fermionic Isocurvature Perturbations

Isocurvature perturbations in the inflationary literature typically involve quantum fluctuations of bosonic field degrees of freedom. In this work, we consider isocurvature perturbations from fermionic quantum fluctuations during inflation. When a stable massive fermion is coupled to a non-conformal sector different from the scalar metric perturbations, observably large amplitude scale invariant isocurvature perturbations can be generated. In addition to the computation of the isocurvature two-point function, an estimate of the local non-Gaussianities is also given and found to be promising for observations in a corner of the parameter space. The results provide a new class of cosmological probes for theories with stable massive fermions. On the technical side, we explicitly renormalize the composite operator in curved spacetime and show that gravitational Ward identities play an important role in suppressing certain contributions to the fermionic isocurvature perturbations.

preprint2013arXiv

Nodal Sets of Random Eigenfunctions for the Isotropic Harmonic Oscillator

We consider Gaussian random eigenfunctions (Hermite functions) of fixed energy level of the isotropic semi-classical Harmonic Oscillator on ${\bf R}^n$. We calculate the expected density of zeros of a random eigenfunction in the semi-classical limit $h \to 0.$ In the allowed region the density is of order $h^{-1},$ while in the forbidden region the density is of order $h^{-\frac{1}{2}}$. The computer graphics due to E.J. Heller illustrate this difference in "frequency" between the allowed and forbidden nodal sets.

preprint2013arXiv

Quadratic Isocurvature Cross-Correlation, Ward Identity, and Dark Matter

Sources of isocurvature perturbations and large non-Gaussianities include field degrees of freedom whose vacuum expectation values are smaller than the expansion rate of inflation. The inhomogeneities in the energy density of such fields are quadratic in the fields to leading order in the inhomogeneity expansion. Although it is often assumed that such isocurvature perturbations and inflaton-driven curvature perturbations are uncorrelated, this is not obvious from a direct computational point of view due to the form of the minimal gravitational interactions. We thus compute the irreducible gravitational contributions to the quadratic isocurvature-curvature cross-correlation. We find a small but non-decaying cross-correlation, which in principle serves as a consistency prediction of this large class of isocurvature perturbations. We apply our cross-correlation result to two dark matter isocurvature perturbation scenarios: QCD axions and WIMPZILLAs. On the technical side, we utilize a gravitational Ward identity in a novel manner to demonstrate the gauge invariance of the computation. Furthermore, the detailed computation is interpreted in terms of a soft-ζ theorem and a gravitational Ward identity. Finally, we also identify explicitly all the counterterms that are necessary for renormalizing the isocurvature perturbation composite operator in inflationary cosmological backgrounds.

preprint2013arXiv

STor: Social Network based Anonymous Communication in Tor

Anonymity networks hide user identities with the help of relayed anonymity routers. However, the state-of-the-art anonymity networks do not provide an effective trust model. As a result, users cannot circumvent malicious or vulnerable routers, which leaves them susceptible to malicious-router-based attacks (e.g., correlation attacks). In this paper, we propose a novel social network based trust model to help anonymity networks circumvent malicious routers and obtain secure anonymity. In particular, we design an input independent fuzzy model to determine trust relationships between friends based on qualitative and quantitative social attributes, both of which can be readily obtained from existing social networks. Moreover, we design an algorithm for propagating trust over an anonymity network. We integrate these two elements in STor, a novel social network based Tor. We have implemented STor by modifying Tor's source code and conducted experiments on PlanetLab to evaluate the effectiveness of STor. Both simulation and PlanetLab experiment results have demonstrated that STor can achieve secure anonymity by establishing trust-based circuits in a distributed way. Although the design of STor is based on the Tor network, the social network based trust model can be adopted by other anonymity networks.

preprint2012arXiv

Energy Stable and Efficient Finite-Difference Nonlinear Multigrid Schemes for the Modified Phase Field Crystal Equation

In this paper we present two unconditionally energy stable finite difference schemes for the Modified Phase Field Crystal (MPFC) equation, a sixth-order nonlinear damped wave equation, of which the purely parabolic Phase Field Crystal (PFC) model can be viewed as a special case. The first is a convex splitting scheme based on an appropriate decomposition of the discrete energy and is first order accurate in time and second order accurate in space. The second is a new, fully second-order scheme that also respects the convex splitting of the energy. Both schemes are nonlinear but may be formulated from the gradients of strictly convex, coercive functionals. Thus, both are uniquely solvable regardless of the time and space step sizes. The schemes are solved by efficient nonlinear multigrid methods. Numerical results are presented demonstrating the accuracy, energy stability, efficiency, and practical utility of the schemes. In particular, we show that our multigrid solvers enjoy optimal, or nearly optimal complexity in the solution of the nonlinear schemes.
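The idea behind a first-order convex splitting scheme can be sketched on a much simpler problem than the MPFC equation. The example below is an assumption-laden toy, not the paper's scheme: it uses the scalar gradient flow $u' = -(u^3 - u)$ of the double-well energy $F(u) = u^4/4 - u^2/2$, treats the convex part $u^4/4$ implicitly and the concave part $-u^2/2$ explicitly, and solves the resulting scalar equation with Newton's method rather than multigrid. All function names are hypothetical.

```python
def energy(u: float) -> float:
    """Double-well energy F(u) = u^4/4 - u^2/2 (u^4/4 is convex, -u^2/2 is concave)."""
    return 0.25 * u**4 - 0.5 * u**2


def convex_splitting_step(u: float, dt: float) -> float:
    """One step of (v - u)/dt = -(v^3 - u): implicit in u^4/4, explicit in -u^2/2.

    The update v solves g(v) = v + dt*v^3 - (1 + dt)*u = 0. Because
    g'(v) = 1 + 3*dt*v^2 > 0, the root is unique for every dt > 0.
    """
    v = u
    for _ in range(100):  # Newton iteration on the strictly increasing function g
        step = (v + dt * v**3 - (1.0 + dt) * u) / (1.0 + 3.0 * dt * v**2)
        v -= step
        if abs(step) < 1e-14:
            break
    return v


def run(u0: float, dt: float, n_steps: int):
    """Return the trajectory and the energy at every step."""
    us = [u0]
    for _ in range(n_steps):
        us.append(convex_splitting_step(us[-1], dt))
    return us, [energy(u) for u in us]


# Deliberately large time step, well beyond what explicit Euler tolerates here.
trajectory, energies = run(u0=0.1, dt=10.0, n_steps=50)
energy_is_non_increasing = all(
    b <= a + 1e-12 for a, b in zip(energies, energies[1:])
)
```

In this toy setting the discrete energy does not increase from one step to the next regardless of `dt`, and each step has a unique solution because it is the root of a strictly increasing function. These are the scalar analogues of the unconditional energy stability and unique solvability properties claimed in the abstract.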

preprint2012arXiv

Gravity waves and non-Gaussian features from particle production in a sector gravitationally coupled to the inflaton

We study the possibility that particle production during inflation could source observable gravity waves on scales relevant for Cosmic Microwave Background experiments. A crucial constraint on such scenarios arises because particle production can also source inflaton perturbations, and might ruin the usual predictions for a nearly scale invariant spectrum of nearly Gaussian curvature fluctuations. To minimize this effect, we consider two models of particle production in a sector that is only gravitationally coupled to the inflaton. For a single instantaneous burst of massive particle production, we find that localized features in the scalar spectrum and bispectrum might be observable, but gravitational wave signatures are unlikely to be detectable (due to the suppressed quadrupole moment of non-relativistic quanta) without invoking some additional effects. We also consider a model with a rolling pseudoscalar that leads to a continuous production of relativistic gauge field fluctuations during inflation. Here we find that gravitational waves from particle production can actually exceed the usual inflationary vacuum fluctuations in a regime where non-Gaussianity is consistent with observational limits. In this model observable B-mode polarization can be obtained for any choice of inflaton potential, and the amplitude of the signal is not necessarily correlated with the scale of inflation.

preprint2011arXiv

Gravitational Fermion Production in Inflationary Cosmology

We revisit the gravitational production of massive Dirac fermions in inflationary cosmology with a focus on clarifying the analytic computation of the particle number density in both the large and the small mass regimes. For the case in which the masses of the gravitationally produced fermions are small compared to the Hubble expansion rate at the end of inflation, we obtain a universal result for the number density that is nearly independent of the details of the inflationary model. The result is identical to the case of conformally coupled scalars up to an overall multiplicative factor of order unity for reasons other than just counting the fermionic degrees of freedom.

preprint2010arXiv

Gravity Waves as a Probe of Hubble Expansion Rate During An Electroweak Scale Phase Transition

Just as big bang nucleosynthesis allows us to probe the expansion rate when the temperature of the universe was around 1 MeV, the measurement of gravity waves from electroweak scale first order phase transitions may allow us to probe the expansion rate when the temperature of the universe was at the electroweak scale. We compute the simple transformation rule for the gravity wave spectrum under the scaling transformation of the Hubble expansion rate. We then apply this directly to the scenario of quintessence kination domination and show how gravity wave spectra would shift relative to LISA and BBO projected sensitivities.

40 published item(s)

Non-Prehensile Tool-Object Manipulation by Integrating LLM-Based Planning and Manoeuvrability-Driven Controls

UniBiDex: A Unified Teleoperation Framework for Robotic Bimanual Dexterous Manipulation

Resolving Task Confusion in Dynamic Expansion Architectures for Class Incremental Learning

A Dual-Arm Collaborative Framework for Dexterous Manipulation in Unstructured Environments with Contrastive Planning

A Fully Memristive Spiking Neural Network with Unsupervised Learning

A Multi-Sensor Interface to Improve the Learning Experience in Arc Welding Training Tasks

Gradient-based Neuromorphic Learning on Dynamical RRAM Arrays

Logical and Physical Reversibility of Conservative Skyrmion Logic

Model Predictive Manipulation of Compliant Objects with Multi-Objective Optimizer and Adversarial Network for Occlusion Compensation

SPICEprop: Backpropagating Errors Through Memristive Spiking Neural Networks

WE model: A Machine Learning Model Based on Data-Driven Movie Derivatives Market Prediction

Deep Video Inpainting Detection

On the Coupling Effects between Elastic and Electromagnetic Fields from the Perspective of Conservation of Energy

DeepStrip: High Resolution Boundary Refinement

FastBERT: a Self-distilling BERT with Adaptive Inference Time

Inclusive GAN: Improving Data and Minority Coverage in Generative Models

Log D-modules and Index theorems

M2KD: Multi-model and Multi-level Knowledge Distillation for Incremental Learning

Occlusion-Adaptive Deep Network for Robust Facial Expression Recognition

Quasi-parallel X-ray microbeam obtained using a parabolic monocapillary X-ray lens with an embedded square-shaped lead occluder

Reservoir Computing with Planar Nanomagnet Arrays

Central Limit theorem for toric Kähler manifolds

Interface asymptotics of Partial Bergman kernels around a critical level

Interface asymptotics of partial Bergman kernels on $S^1$-symmetric Kaehler manifolds

Twisted Polytope Sheaves and Coherent-Constructible Correspondence for Toric Varieties

Arrayed van der Waals Vertical Heterostructures based on 2D GaSe Grown by Molecular Beam Epitaxy

Scaling of Harmonic Oscillator Eigenfunctions and Their Nodal Sets Around the Caustic

Text Classification Improved by Integrating Bidirectional LSTM with Two-dimensional Max Pooling

Wafer-scale arrayed p-n junctions based on few-layer epitaxial GaTe

Magnetotransport properties of Cd3As2 nanostructures

Analysis on Irreversible Processes using the Phase-Field Variational Approach with the Entropy or Energy Functional

Tunable charge-trap memory based on few-layer MoS2

Fermionic Isocurvature Perturbations

Nodal Sets of Random Eigenfunctions for the Isotropic Harmonic Oscillator

Quadratic Isocurvature Cross-Correlation, Ward Identity, and Dark Matter

STor: Social Network based Anonymous Communication in Tor

Energy Stable and Efficient Finite-Difference Nonlinear Multigrid Schemes for the Modified Phase Field Crystal Equation

Gravity waves and non-Gaussian features from particle production in a sector gravitationally coupled to the inflaton

Gravitational Fermion Production in Inflationary Cosmology

Gravity Waves as a Probe of Hubble Expansion Rate During An Electroweak Scale Phase Transition