Trust snapshot

Quick read

Trust 21 - Emerging
Verification L1
Unclaimed author
16 works
0 followers
17 topics
4 close collaborators

Published work

16 published item(s)

preprint 2026 arXiv

Cooperative concurrence of 4f and 3d flat bands in kagome heavy-fermion metal YbCr6Ge6

Flat-band (FB) systems originating from special lattice geometry like in kagome metals as well as localized orbitals in the materials such as heavy-fermion (HF) compounds have induced intensive interest due to their band topology and strong electron correlation effects, leading to emergent quantum states of matter. However, the question of how these two distinct FBs coexist and interact remains unsettled. Here, we report that YbCr6Ge6 hosting both Cr-kagome lattice and Yb-4f electrons exhibits HF behaviors and a robust antiferromagnetic ground state with transition temperature TN = 3 K, significantly higher than other similar kagome metals with Yb ions. Angle-resolved photoemission spectroscopy measurements reveal the coexistence of FBs originating from both Cr-kagome lattice and localized Yb-4f electrons near Fermi energy level EF. More importantly, the clear spectroscopic signatures of a hybridization of Yb-4f FB with kagome-lattice-derived conduction bands and the high density of states of Cr-kagome FB near EF provide the underlying microscopic mechanisms of HF behaviors and enhanced antiferromagnetism in YbCr6Ge6. Our findings demonstrate that the novel kagome HF metals can not only host the cooperative coexistence of two different types of FBs, but also provide a paradigm material platform to explore the exotic correlated topological quantum phenomena.

preprint 2023 arXiv

AccidentGPT: Accident Analysis and Prevention from V2X Environmental Perception with Multi-modal Large Model

Traffic accidents, being a significant contributor to both human casualties and property damage, have long been a focal point of research for many scholars in the field of traffic safety. However, previous studies, whether focusing on static environmental assessments or dynamic driving analyses, as well as pre-accident predictions or post-accident rule analyses, have typically been conducted in isolation. There has been a lack of an effective framework for developing a comprehensive understanding and application of traffic safety. To address this gap, this paper introduces AccidentGPT, a comprehensive accident analysis and prevention multi-modal large model. AccidentGPT establishes a multi-modal information interaction framework grounded in multi-sensor perception, thereby enabling a holistic approach to accident analysis and prevention in the field of traffic safety. Specifically, our capabilities can be categorized as follows: for autonomous driving vehicles, we provide comprehensive environmental perception and understanding to control the vehicle and avoid collisions. For human-driven vehicles, we offer proactive long-range safety warnings and blind-spot alerts while also providing safety driving recommendations and behavioral norms through human-machine dialogue and interaction. Additionally, for traffic police and management agencies, our framework supports intelligent and real-time analysis of traffic safety, encompassing pedestrians, vehicles, roads, and the environment through collaborative perception from multiple vehicles and road testing devices. The system is also capable of providing a thorough analysis of accident causes and liability after vehicle collisions. Our framework stands as the first large model to integrate comprehensive scene understanding into traffic safety studies. Project page: https://accidentgpt.github.io

preprint 2023 arXiv

Fast-BEV: Towards Real-time On-vehicle Bird's-Eye View Perception

Recently, pure camera-based Bird's-Eye-View (BEV) perception has removed the need for expensive Lidar sensors, making it a feasible solution for economical autonomous driving. However, most existing BEV solutions either suffer from modest performance or require considerable resources to execute on-vehicle inference. This paper proposes a simple yet effective framework, termed Fast-BEV, which is capable of performing real-time BEV perception on on-vehicle chips. Towards this goal, we first empirically find that the BEV representation can be sufficiently powerful without expensive view transformation or depth representation. Starting from the M2BEV baseline, we further introduce (1) a strong data augmentation strategy for both image and BEV space to avoid over-fitting, (2) a multi-frame feature fusion mechanism to leverage the temporal information, and (3) an optimized deployment-friendly view transformation to speed up the inference. Through experiments, we show that the Fast-BEV model family achieves considerable accuracy and efficiency on edge. In particular, our M1 model (R18@256x704) can run over 50FPS on the Tesla T4 platform, with 47.0% NDS on the nuScenes validation set. Our largest model (R101@900x1600) establishes a new state-of-the-art 53.5% NDS on the nuScenes validation set. The code is released at: https://github.com/Sense-GVT/Fast-BEV.

preprint 2022 arXiv

Charge Carrier Mediation and Ferromagnetism induced in MnBi6Te10 Magnetic Topological Insulators by antimony doping

The MnBi2Te4 family, a new kind of intrinsic magnetic topological insulator (MTI), has shed light on the observation of novel topological quantum effects such as the quantum anomalous Hall effect (QAHE). However, the strong anti-ferromagnetic (AFM) coupling and high carrier concentration in the bulk hinder practical applications. In the closely related materials MnBi4Te7 and MnBi6Te10, the interlayer magnetic coupling is greatly suppressed by Bi2Te3 layer intercalation. However, AFM is still the ground state in these compounds. Here, by magnetic and transport measurements, we demonstrate that the Sb substitutional dopant plays a dual role in MnBi6Te10, which can not only adjust the charge carrier type and the concentration, but also induce the solid into a ferromagnetic (FM) ground state. AFM ground state region which is also close to the charge neutral point can be found in the phase diagram of Mn(SbxBi1-x)6Te10 when x ~ 0.25. An intrinsic FM-MTI candidate is thus demonstrated, and it may take a step further for the realization of high-quality and high-temperature QAHE and the related topological quantum effects in the future.

preprint 2022 arXiv

Clutter Edges Detection Algorithms for Structured Clutter Covariance Matrices

This letter deals with the problem of clutter edge detection and localization in training data. To this end, the problem is formulated as a binary hypothesis test assuming that the ranks of the clutter covariance matrix are known, and adaptive architectures are designed based on the generalized likelihood ratio test to decide whether the training data within a sliding window contains a homogeneous set or two heterogeneous subsets. In the design stage, we utilize four different covariance matrix structures (i.e., Hermitian, persymmetric, symmetric, and centrosymmetric) to exploit the a priori information. Then, for the case of unknown ranks, the architectures are extended by devising a preliminary estimation stage resorting to the model order selection rules. Numerical examples based on both synthetic and real data highlight that the proposed solutions possess superior detection and localization performance with respect to the competitors that do not use any a priori information.
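The test described above (one homogeneous window versus two heterogeneous subsets) can be illustrated with a heavily simplified sketch. It assumes real-valued, zero-mean scalar Gaussian samples, so the covariance matrix collapses to a single variance and none of the structured (persymmetric, centrosymmetric, and so on) or rank information from the letter is used. The function name `variance_edge_glrt` and the `min_seg` parameter are hypothetical.

```python
import math


def variance_edge_glrt(samples, min_seg=4):
    """Scalar GLRT for a variance change inside one window.

    Returns (statistic, split): the largest log generalized likelihood
    ratio over all admissible split points, and the split index k that
    achieves it (samples[:k] versus samples[k:]). A split of None means
    no split improved on the homogeneous hypothesis.
    """
    n = len(samples)
    sq = [x * x for x in samples]
    total = sum(sq)
    var_all = total / n          # ML variance under the homogeneous hypothesis
    best_stat, best_split = 0.0, None
    left = sum(sq[:min_seg - 1])
    for k in range(min_seg, n - min_seg + 1):
        left += sq[k - 1]        # running sum of squares for samples[:k]
        var_l = left / k
        var_r = (total - left) / (n - k)
        # Log-likelihood ratio of "two variances" against "one variance"
        stat = 0.5 * (n * math.log(var_all)
                      - k * math.log(var_l)
                      - (n - k) * math.log(var_r))
        if stat > best_stat:
            best_stat, best_split = stat, k
    return best_stat, best_split
```

In use, the statistic would be compared with a threshold chosen for a desired false-alarm rate, and the returned split serves as the edge location estimate within the sliding window.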

preprint 2022 arXiv

EEG-based Emotion Recognition with Spatial and Functional Brain Mapping of CNS and PNS Signals

Emotion plays a significant role in our daily life. Recognition of emotion is widespread in the field of health care and human-computer interaction. Emotion is the result of the coordinated activities of cortical and subcortical neural processes, which correlate to specific physiological responses. However, existing emotion recognition techniques have failed to combine various physiological signals into one integrated feature representation. Meanwhile, many researchers have ignored the problem of over-fitted models whose high accuracy is actually a false result of improper pre-processing. In this paper, sigmoid baseline filtering is conducted to solve the over-fitting problem at its source. To construct a physiological-based algorithm, a 3D spatial and functional brain mapping is proposed based on human physiological mechanism and international electrode system, which combines the signals of the central and peripheral nervous system together. By combining the baseline filtering, 3D brain mapping, and simple 4D-CNN, a novel emotion recognition model is finally proposed. Experiment results demonstrate that the performance of the proposed model is comparable to the state-of-the-art algorithms.

preprint 2022 arXiv

Scale-Equivalent Distillation for Semi-Supervised Object Detection

Recent Semi-Supervised Object Detection (SS-OD) methods are mainly based on self-training, i.e., generating hard pseudo-labels by a teacher model on unlabeled data as supervisory signals. Although they achieved certain success, the limited labeled data in semi-supervised learning scales up the challenges of object detection. We analyze the challenges these methods meet with the empirical experiment results. We find that the massive False Negative samples and inferior localization precision lack consideration. Besides, the large variance of object sizes and class imbalance (i.e., the extreme ratio between background and object) hinder the performance of prior arts. Further, we overcome these challenges by introducing a novel approach, Scale-Equivalent Distillation (SED), which is a simple yet effective end-to-end knowledge distillation framework robust to large object size variance and class imbalance. SED has several appealing benefits compared to the previous works. (1) SED imposes a consistency regularization to handle the large scale variance problem. (2) SED alleviates the noise problem from the False Negative samples and inferior localization precision. (3) A re-weighting strategy can implicitly screen the potential foreground regions of the unlabeled data to reduce the effect of class imbalance. Extensive experiments show that SED consistently outperforms the recent state-of-the-art methods on different datasets with significant margins. For example, it surpasses the supervised counterpart by more than 10 mAP when using 5% and 10% labeled data on MS-COCO.

preprint 2021 arXiv

BaPipe: Exploration of Balanced Pipeline Parallelism for DNN Training

The size of deep neural networks (DNNs) grows rapidly as the complexity of the machine learning algorithm increases. To satisfy the requirement of computation and memory of DNN training, distributed deep learning based on model parallelism has been widely recognized. We propose a new pipeline parallelism training framework, BaPipe, which can automatically explore pipeline parallelism training methods and balanced partition strategies for DNN distributed training. In BaPipe, each accelerator calculates the forward propagation and backward propagation of different parts of networks to implement the intra-batch pipeline parallelism strategy. BaPipe uses a new load balancing automatic exploration strategy that considers the parameters of DNN models and the computation, memory, and communication resources of accelerator clusters. We have trained different DNNs such as VGG-16, ResNet-50, and GNMT on GPU clusters and simulated the performance of different FPGA clusters. Compared with state-of-the-art data parallelism and pipeline parallelism frameworks, BaPipe provides up to 3.2x speedup and 4x memory reduction in various platforms.
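The idea of a balanced partition for pipeline stages can be sketched as a classic linear-partition problem: split the layers into contiguous stages so that the slowest stage is as fast as possible. This is only an illustration under strong assumptions, not BaPipe's exploration strategy, which according to the abstract also weighs memory and communication resources. Here each layer has a single scalar cost, and the names `balanced_stages`, `layer_costs` and `num_stages` are hypothetical.

```python
def balanced_stages(layer_costs, num_stages):
    """Split layers into contiguous stages, minimising the slowest stage.

    Returns (bottleneck, bounds), where bounds is a list of (start, end)
    half-open index ranges, one per stage.
    """
    n = len(layer_costs)
    if not 1 <= num_stages <= n:
        raise ValueError("need 1 <= num_stages <= number of layers")

    # prefix[i] is the total cost of the first i layers
    prefix = [0]
    for cost in layer_costs:
        prefix.append(prefix[-1] + cost)

    inf = float("inf")
    # best[s][i]: smallest bottleneck when the first i layers form s stages
    best = [[inf] * (n + 1) for _ in range(num_stages + 1)]
    cut = [[0] * (n + 1) for _ in range(num_stages + 1)]
    best[0][0] = 0
    for s in range(1, num_stages + 1):
        for i in range(s, n + 1):
            for j in range(s - 1, i):
                cand = max(best[s - 1][j], prefix[i] - prefix[j])
                if cand < best[s][i]:
                    best[s][i], cut[s][i] = cand, j

    # Walk the recorded cuts backwards to recover the stage boundaries
    bounds, i = [], n
    for s in range(num_stages, 0, -1):
        j = cut[s][i]
        bounds.append((j, i))
        i = j
    bounds.reverse()
    return best[num_stages][n], bounds
```

Calling `balanced_stages([4, 2, 3, 7, 1, 5], 3)` would assign each of three accelerators a contiguous run of layers. A fuller model would replace the scalar cost with per-layer compute, memory and inter-stage transfer estimates.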

preprint 2020 arXiv

AWB-GCN: A Graph Convolutional Network Accelerator with Runtime Workload Rebalancing

Deep learning systems have been successfully applied to Euclidean data such as images, video, and audio. In many applications, however, information and their relationships are better expressed with graphs. Graph Convolutional Networks (GCNs) appear to be a promising approach to efficiently learn from graph data structures, having shown advantages in many critical applications. As with other deep learning modalities, hardware acceleration is critical. The challenge is that real-world graphs are often extremely large and unbalanced; this poses significant performance demands and design challenges. In this paper, we propose Autotuning-Workload-Balancing GCN (AWB-GCN) to accelerate GCN inference. To address the issue of workload imbalance in processing real-world graphs, three hardware-based autotuning techniques are proposed: dynamic distribution smoothing, remote switching, and row remapping. In particular, AWB-GCN continuously monitors the sparse graph pattern, dynamically adjusts the workload distribution among a large number of processing elements (up to 4K PEs), and, after converging, reuses the ideal configuration. Evaluation is performed using an Intel D5005 FPGA with five commonly-used datasets. Results show that 4K-PE AWB-GCN can significantly elevate PE utilization by 7.7x on average and demonstrate considerable performance speedups over CPUs (3255x), GPUs (80.3x), and a prior GCN accelerator (5.1x).

preprint 2020 arXiv

Effects of paramagnetic pair-breaking and spin-orbital coupling on multi-band superconductivity

The BCS picture of superconductivity describes pairing between electrons originating from a single band. A generalization of this picture occurs in multi-band superconductors, where electrons from two or more bands contribute to superconductivity. The contributions of the different bands can result in an overall enhancement of the critical field and can lead to qualitative changes in the temperature dependence of the upper critical field when compared to the single-band case. While the role of orbital pair-breaking on the critical field of multi-band superconductors has been explored extensively, paramagnetic and spin-orbital scattering effects have received comparatively little attention. Here we investigate this problem using thin films of Nd-doped SrTiO3. We furthermore propose a model for analyzing the temperature-dependence of the critical field in the presence of orbital, paramagnetic and spin-orbital effects, and find a very good agreement with our data. Interestingly, we also observe a dramatic enhancement in the out-of-plane critical field to values well in excess of the Chandrasekhar-Clogston (Pauli) paramagnetic limit, which can be understood as a consequence of multi-band effects in the presence of spin-orbital scattering.
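For orientation, the Chandrasekhar-Clogston (Pauli) limit referred to in this abstract is usually estimated with the standard textbook expression below. It assumes a weak-coupling BCS gap and a g-factor of 2, and it is not taken from the paper itself, whose model may differ.

```latex
\mu_0 H_P = \frac{\Delta_0}{\sqrt{2}\,\mu_B},
\qquad
\Delta_0 \approx 1.764\,k_B T_c
\;\;\Longrightarrow\;\;
\mu_0 H_P \approx 1.86\ \mathrm{T/K} \times T_c
```

Here $\Delta_0$ is the zero-temperature gap, $\mu_B$ the Bohr magneton and $T_c$ the critical temperature. A measured critical field well above this value is what "in excess of the paramagnetic limit" means.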

preprint 2020 arXiv

FPDeep: Scalable Acceleration of CNN Training on Deeply-Pipelined FPGA Clusters

Deep Neural Networks (DNNs) have revolutionized numerous applications, but the demand for ever more performance remains unabated. Scaling DNN computations to larger clusters is generally done by distributing tasks in batch mode using methods such as distributed synchronous SGD. Among the issues with this approach is that to make the distributed cluster work with high utilization, the workload distributed to each node must be large, which implies nontrivial growth in the SGD mini-batch size. In this paper, we propose a framework called FPDeep, which uses a hybrid of model and layer parallelism to configure distributed reconfigurable clusters to train DNNs. This approach has numerous benefits. First, the design does not suffer from batch size growth. Second, novel workload and weight partitioning leads to balanced loads of both among nodes. And third, the entire system is a fine-grained pipeline. This leads to high parallelism and utilization and also minimizes the time features need to be cached while waiting for back-propagation. As a result, storage demand is reduced to the point where only on-chip memory is used for the convolution layers. We evaluate FPDeep with the Alexnet, VGG-16, and VGG-19 benchmarks. Experimental results show that FPDeep has good scalability to a large number of FPGAs, with the limiting factor being the FPGA-to-FPGA bandwidth. With 6 transceivers per FPGA, FPDeep shows linearity up to 83 FPGAs. Energy efficiency is evaluated with respect to GOPs/J. FPDeep provides, on average, 6.36x higher energy efficiency than comparable GPU servers.

preprint 2020 arXiv

Precursor Selection in Hybrid Molecular Beam Epitaxy of Alkaline-Earth Stannates

One of the challenges of oxide molecular beam epitaxy (MBE) is the synthesis of oxides containing metals with high electronegativity (metals that are hard to oxidize). The use of reactive organometallic precursors can potentially address this issue. To investigate the formation of radicals in MBE, we explored three carefully chosen metal-organic precursors of tin for SnO2 and BaSnO3 growth: tetramethyltin (TMT), tetraethyltin (TET), and hexamethylditin (HMDT). All three precursors produced single-crystalline, atomically smooth, and epitaxial SnO2 (101) films on r-Al2O3 in the presence of an oxygen plasma. The study of growth kinetics revealed reaction-limited and flux-limited regimes except for TET, which also exhibited a decrease in deposition rate with increasing temperature above 800 °C. Contrary to these similarities, the performance of these precursors was dramatically different for BaSnO3 growth. TMT and TET were ineffective in supplying adequate tin whereas HMDT yielded phase-pure, stoichiometric BaSnO3 films. Significantly, HMDT resulted in phase-pure and stoichiometric BaSnO3 films even without the use of an oxygen plasma (i.e., with molecular oxygen alone). These results are discussed in terms of the ability of HMDT to form tin radicals and thereby assist the Sn to Sn4+ oxidation reaction. Structural and electronic transport properties of films grown using HMDT with and without oxygen plasma are compared. This study provides guidelines for the choice of precursors that will enable synthesis of metal oxides containing hard-to-oxidize metals using reactive radicals in MBE.

preprint 2020 arXiv

Self-Assembled Periodic Nanostructures Using Martensitic Phase Transformations

We describe a novel approach for the rational design and synthesis of self-assembled periodic nanostructures using martensitic phase transformations. We demonstrate this approach in a thin film of perovskite SrSnO3 with reconfigurable periodic nanostructures consisting of regularly spaced regions of sharply contrasted dielectric properties. The films can be designed to have different periodicities and relative phase fractions via chemical doping or strain engineering. The dielectric contrast within a single film can be tuned using temperature and laser wavelength, effectively creating a variable photonic crystal. Our results show the realistic possibility of designing large-area self-assembled periodic structures using martensitic phase transformations with the potential of implementing "built-to-order" nanostructures for tailored optoelectronic functionalities.

preprint 2019 arXiv

Separating Electrons and Donors in BaSnO3 via Band Engineering

Through a combination of thin film growth, hard X-ray photoelectron spectroscopy (HAXPES), scanning transmission electron microscopy/electron energy loss spectroscopy (STEM/EELS), magneto-transport measurements, and transport modeling, we report on the demonstration of modulation-doping of BaSnO3 (BSO) using a wider bandgap La-doped SrSnO3 (LSSO) layer. Hard X-ray photoelectron spectroscopy (HAXPES) revealed a valence band offset of 0.71 +/- 0.02 eV between LSSO and BSO resulting in a favorable conduction band offset for remote doping of BSO using LSSO. Nonlinear Hall effect of LSSO/BSO heterostructure confirmed two-channel conduction owing to electron transfer from LSSO to BSO and remained in good agreement with the results of self-consistent solution to one-dimensional Poisson and Schrödinger equations. Angle-dependent HAXPES measurements revealed a spatial distribution of electrons over 2-3 unit cells in BSO. These results bring perovskite oxides a step closer to room-temperature oxide electronics by establishing modulation-doping approaches in non-SrTiO3-based oxide heterostructure.

preprint 2019 arXiv

Unraveling the Effect of Electron-Electron Interaction on Electronic Transport in High-Mobility Stannate Films

Contrary to the common belief that electron-electron interaction (EEI) should be negligible in s-orbital-based conductors, we demonstrated that the EEI effect could play a significant role on electronic transport leading to the misinterpretation of the Hall data. We show that the EEI effect is primarily responsible for an increase in the Hall coefficient in the La-doped SrSnO3 films below 50 K accompanied by an increase in the sheet resistance. The quantitative analysis of the magnetoresistance (MR) data yielded a large phase coherence length of electrons exceeding 450 nm at 1.8 K and revealed the electron-electron interaction being accountable for breaking of electron phase coherency in La-doped SrSnO3 films. These results while providing critical insights into the fundamental transport behavior in doped stannates also indicate the potential applications of stannates in quantum coherent electronic devices owing to their large phase coherence length.