Source author record

Xiaoli Ma

Xiaoli Ma appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

cond-mat.mtrl-sci cond-mat.str-el eess.SP Machine Learning cond-mat.mes-hall cond-mat.supr-con Cryptography and Security eess.AS Information Theory math.IT Sound Artificial Intelligence Computation and Language Distributed, Parallel, and Cluster Computing Neurons and Cognition quant-ph

Catalog footprint

What is connected

13works

16topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2022arXiv

Characterizing Speech Adversarial Examples Using Self-Attention U-Net Enhancement

Recent studies have highlighted adversarial examples as ubiquitous threats to the deep neural network (DNN) based speech recognition systems. In this work, we present a U-Net based attention model, U-Net$_{At}$, to enhance adversarial speech signals. Specifically, we evaluate the model performance by interpretable speech recognition metrics and discuss the model performance by the augmented adversarial training. Our experiments show that our proposed U-Net$_{At}$ improves the perceptual evaluation of speech quality (PESQ) from 1.13 to 2.78, speech transmission index (STI) from 0.65 to 0.75, short-term objective intelligibility (STOI) from 0.83 to 0.96 on the task of speech enhancement with adversarial speech examples. We conduct experiments on the automatic speech recognition (ASR) task with adversarial audio attacks. We find that (i) temporal features learned by the attention network are capable of enhancing the robustness of DNN based ASR models; (ii) the generalization power of DNN based ASR model could be enhanced by applying adversarial training with an additive adversarial data augmentation. The ASR metric on word-error-rates (WERs) shows that there is an absolute 2.22 $\%$ decrease under gradient-based perturbation, and an absolute 2.03 $\%$ decrease, under evolutionary-optimized perturbation, which suggests that our enhancement models with adversarial training can further secure a resilient ASR system.

preprint2020arXiv

A General Difficulty Control Algorithm for Proof-of-Work Based Blockchains

Designing an efficient difficulty control algorithm is an essential problem in Proof-of-Work (PoW) based blockchains because the network hash rate is randomly changing. This paper proposes a general difficulty control algorithm and provides insights for difficulty adjustment rules for PoW based blockchains. The proposed algorithm consists a two-layer neural network. It has low memory cost, meanwhile satisfying the fast-updating and low volatility requirements for difficulty adjustment. Real data from Ethereum are used in the simulations to prove that the proposed algorithm has better performance for the control of the block difficulty.

preprint2020arXiv

Analyzing Upper Bounds on Mean Absolute Errors for Deep Neural Network Based Vector-to-Vector Regression

In this paper, we show that, in vector-to-vector regression utilizing deep neural networks (DNNs), a generalized loss of mean absolute error (MAE) between the predicted and expected feature vectors is upper bounded by the sum of an approximation error, an estimation error, and an optimization error. Leveraging upon error decomposition techniques in statistical learning theory and non-convex optimization theory, we derive upper bounds for each of the three aforementioned errors and impose necessary constraints on DNN models. Moreover, we assess our theoretical results through a set of image de-noising and speech enhancement experiments. Our proposed upper bounds of MAE for DNN based vector-to-vector regression are corroborated by the experimental results and the upper bounds are valid with and without the "over-parametrization" technique.

preprint2020arXiv

Crystalline Electric-Field Excitations in Quantum Spin Liquids Candidate $NaYbSe_{2}$

Very recently we revealed a large family of triangular lattice quantum spin liquid candidates named rare-earth chalcogenides, which features a high-symmetry structure without structural/charge disorders and spin impurities, and may serve as an ideal platform exploring spin liquid physics. The knowledge of crystalline electric-field (CEF) excitations is an essential step to explore the fundamental magnetism of rare-earth spin systems. Here we employed inelastic neutron scattering (INS) and Raman scattering (RS) to carry out a comprehensive CFE investigation on $NaYbSe_{2}$, a promising representative of the family. By comparison with its nonmagnetic compound $NaLuSe_{2}$, we are able to identify the CEF excitations at 15.8, 24.3 and 30.5 meV at 5K. The selected cuts of the INS spectra are well re-produced with a large anisotropy of $g$ factors ($g_{ab}:g_{c}\sim3:1$). Further, the CEF excitations are explained well by our calculations based on the point charge model. Interestingly, $NaYbSe_{2}$ exhibits an unusual CEF shift to higher energies with increasing temperatures, and the Raman mode close to the first CEF excitation shows an anomalously large softening with decreasing temperatures. The absence of the anomalies in $NaLuSe_{2}$ clearly demonstrates a CEF-phonon coupling not reported in the family. It can be understood in term of the weaker electronegativity of Se. The fact that the smallest first CEF excitation in the sub-family of $NaYbCh_{2}$ is $\sim$ 180K (Ch=O, S, Se), guarantees that the sub-family can be strictly described with an effective S=1/2 picture at sufficiently low temperatures. Interestingly the CEF-phonon coupling revealed here may present alternative possibilities to manipulate the spin systems.

preprint2020arXiv

Deep Learning Based FDD Non-Stationary Massive MIMO Downlink Channel Reconstruction

This paper proposes a model-driven deep learning-based downlink channel reconstruction scheme for frequency division duplexing (FDD) massive multi-input multi-output (MIMO) systems. The spatial non-stationarity, which is the key feature of the future extremely large aperture massive MIMO system, is considered. Instead of the channel matrix, the channel model parameters are learned by neural networks to save the overhead and improve the accuracy of channel reconstruction. By viewing the channel as an image, we introduce You Only Look Once (YOLO), a powerful neural network for object detection, to enable a rapid estimation process of the model parameters, including the detection of angles and delays of the paths and the identification of visibility regions of the scatterers. The deep learning-based scheme avoids the complicated iterative process introduced by the algorithm-based parameter extraction methods. A low-complexity algorithm-based refiner further refines the YOLO estimates toward high accuracy. Given the efficiency of model-driven deep learning and the combination of neural network and algorithm, the proposed scheme can rapidly and accurately reconstruct the non-stationary downlink channel. Moreover, the proposed scheme is also applicable to widely concerned stationary systems and achieves comparable reconstruction accuracy as an algorithm-based method with greatly reduced time consumption.

preprint2020arXiv

On Mean Absolute Error for Deep Neural Network Based Vector-to-Vector Regression

In this paper, we exploit the properties of mean absolute error (MAE) as a loss function for the deep neural network (DNN) based vector-to-vector regression. The goal of this work is two-fold: (i) presenting performance bounds of MAE, and (ii) demonstrating new properties of MAE that make it more appropriate than mean squared error (MSE) as a loss function for DNN based vector-to-vector regression. First, we show that a generalized upper-bound for DNN-based vector- to-vector regression can be ensured by leveraging the known Lipschitz continuity property of MAE. Next, we derive a new generalized upper bound in the presence of additive noise. Finally, in contrast to conventional MSE commonly adopted to approximate Gaussian errors for regression, we show that MAE can be interpreted as an error modeled by Laplacian distribution. Speech enhancement experiments are conducted to corroborate our proposed theorems and validate the performance advantages of MAE over MSE for DNN based regression.

preprint2020arXiv

Pressure induced metallization and possible unconventional superconductivity in spin liquid $NaYbSe_{2}$

Beyond the conventional electron pairing mediated by phonons, high-temperature superconductivity in cuprates is believed to stem from quantum spin liquid (QSL). The unconventional superconductivity by doping a spin liquid/Mott insulator, is a long-sought goal but a principal challenge in condensed matter physics because of the lack of an ideal QSL platform. Here we report the pressure induced metallization and possible unconventional superconductivity in $NaYbSe_{2}$, which belongs to a large and ideal family of triangular lattice spin liquid we revealed recently and is evidenced to possess a QSL ground state. The charge gap of NaYbSe2 is gradually reduced by applying pressures, and at ~20 GPa the crystal jumps into a superconducting (SC) phase with Tc ~ 5.8 K even before the insulating gap is completely closed. The metallization is confirmed by further high-pressure experiments but the sign of superconductivity is not well repeated. No symmetry breaking accompanies the SC transition, as indicated by X-ray diffraction and low-temperature Raman experiments under high pressures. This intrinsically connects QSL and SC phases, and suggests an unconventional superconductivity developed from QSL. We further observed the magnetic-field-tuned superconductor-insulator transition which is analogous to that found in the underdoped cuprate superconductor $La_{2-x}Sr_{x}CuO_{4}$. The study is expected to inspire interest in exploring new types of superconductors and sheds light into the intriguing physics from a spin liquid/Mott insulator to a superconductor.

preprint2020arXiv

Variational Quantum Circuits for Deep Reinforcement Learning

The state-of-the-art machine learning approaches are based on classical von Neumann computing architectures and have been widely used in many industrial and academic domains. With the recent development of quantum computing, researchers and tech-giants have attempted new quantum circuits for machine learning tasks. However, the existing quantum computing platforms are hard to simulate classical deep learning models or problems because of the intractability of deep quantum circuits. Thus, it is necessary to design feasible quantum algorithms for quantum machine learning for noisy intermediate scale quantum (NISQ) devices. This work explores variational quantum circuits for deep reinforcement learning. Specifically, we reshape classical deep reinforcement learning algorithms like experience replay and target network into a representation of variational quantum circuits. Moreover, we use a quantum information encoding scheme to reduce the number of model parameters compared to classical neural networks. To the best of our knowledge, this work is the first proof-of-principle demonstration of variational quantum circuits to approximate the deep $Q$-value function for decision-making and policy-selection reinforcement learning with experience replay and target network. Besides, our variational quantum circuits can be deployed in many near-term NISQ machines.

preprint2016arXiv

Category specificity of N170 response recovery speeds for faces and Chinese characters

Neural selectivity of N170 responses is an important phenomenon in perceptual processing; however, the recovery times of neural selective responses remain unclear. In the present study, we used an adaptation paradigm to test the recovery speeds of N170 responses to faces and Chinese characters. The results showed that recovery of N170 responses elicited by faces occurred between 1400 and 1800 ms after stimuli onset, whereas recovery of N170 responses elicited by Chinese characters occurred between 600 and 800 ms after stimuli onset. These results demonstrate category-specific recovery speeds of N170 responses involved in the processing of faces and Chinese characters.

preprint2016arXiv

Raman scattering in transition metal dichalcogenides MTe2 (M = Mo, W)

We performed comparable polarized Raman scattering studies of MoTe2 and WTe2. By rotating crystals to tune the angle between the principal axis of the crystals and the polarization of the incident/scattered light, we obtained the angle dependence of the intensities for all the observed modes, which is perfectly consistent with careful symmetry analysis. Combining these results with first-principles calculations, we clearly identified the observed phonon modes in the different phases of both crystals. Fifteen Raman-active phonon modes (10Ag+5Bg) in the high-symmetry phase 1T'-MoTe2 (300 K) were well assigned, and all the symmetry-allowed Raman modes (11A1+6A2) in the low-symmetry phase Td-MoTe2 (10 K) and 12 Raman phonons (8A1+4A2) in Td-WTe2 were observed and identified. The present work provides basic information about the lattice dynamics in transition-metal dichalcogenides and may shed some light on the understanding of the extremely large magnetoresistance (MR) in this class of materials.

preprint2015arXiv

Raman scattering in superconducting NdO1-xFxBiS2 crystals

The recently discovered layered BiS2-based superconductors have attracted a great deal of interest due to their structural similarity to cuprate and iron-pnictide superconductors. We have performed Raman scattering measurements on two superconducting crystals NdO0.5F0.5BiS2 (Tc = 4.5 K) and NdO0.7F0.3BiS2 (Tc = 4.8 K). The observed Raman phonon modes are assigned with the aid of first-principles calculations. The asymmetrical phonon mode around 118 cm-1 reveals a small electron-phonon (e-ph) coupling constant 0.16, which is insufficient to generate superconductivity at ~ 4.5 K. In the Raman spectra there exists a clear temperature-dependent hump around 100 cm-1, which can be well understood in term of inter-band vertical transitions around Fermi surface. The transitions get boosted when the particular rectangular-like Fermi surface meets band splitting caused by spin-orbit coupling. It enables a unique and quantitative insight into the band splitting.

preprint2015arXiv

Ultralow-frequency collective compression mode and strong interlayer coupling in multilayer black phosphorus

The recent renaissance of black phosphorus (BP) as a two-dimensional 2D layered material has generated tremendous interest in its tunable electronic band gap and highly anisotropic transport properties that offer new opportunities for device applications. Many of these outstanding properties are attributed to its unique structural characters that still need elucidation. Here we show Raman measurements that reveal an ultralow-frequency collective compression mode (CCM), which is unprecedented among similar 2D layered materials. This novel CCM indicates an unusually strong interlayer coupling in BP, which is quantitatively supported by a phonon frequency analysis and first-principles calculations. Moreover, the CCM and another branch of low-frequency Raman modes shift sensitively with changing number of layers, allowing an accurate determination of the thickness up to tens of atomic layers, which is considerably higher than those previously achieved by using high-frequency Raman modes. These results offer fundamental insights and practical tools for exploring multilayer BP in new device applications.

preprint2014arXiv

UWB Signal Detection by Cyclic Features

Ultra-wideband (UWB) impulse radio (IR) systems are well known for low transmission power, low probability of detection, and overlaying with narrowband (NB) systems. These merits in fact make UWB signal detection challenging, since several high-power wireless communication systems coexist with UWB signals. In the literature, cyclic features are exploited for signal detection. However, the high computational complexity of conventional cyclic feature based detectors burdens the receivers. In this paper, we propose computationally efficient detectors using the specific cyclic features of UWB signals. The closed-form relationships between the cyclic features and the system parameters are revealed. Then, some constant false alarm rate detectors are proposed based on the estimated cyclic autocorrelation functions (CAFs). The proposed detectors have low complexities compared to the existing ones. Extensive simulation results indicate that the proposed detectors achieve a good balance between the detection performance and the computational complexity in various scenarios, such as multipath environments, colored noise, and NB interferences.

Xiaoli Ma

What is connected

Connect this record

See the researcher in context

Building this map preview

13 published item(s)

Characterizing Speech Adversarial Examples Using Self-Attention U-Net Enhancement

A General Difficulty Control Algorithm for Proof-of-Work Based Blockchains

Analyzing Upper Bounds on Mean Absolute Errors for Deep Neural Network Based Vector-to-Vector Regression

Crystalline Electric-Field Excitations in Quantum Spin Liquids Candidate $NaYbSe_{2}$

Deep Learning Based FDD Non-Stationary Massive MIMO Downlink Channel Reconstruction

On Mean Absolute Error for Deep Neural Network Based Vector-to-Vector Regression

Pressure induced metallization and possible unconventional superconductivity in spin liquid $NaYbSe_{2}$

Variational Quantum Circuits for Deep Reinforcement Learning

Category specificity of N170 response recovery speeds for faces and Chinese characters

Raman scattering in transition metal dichalcogenides MTe2 (M = Mo, W)

Raman scattering in superconducting NdO1-xFxBiS2 crystals

Ultralow-frequency collective compression mode and strong interlayer coupling in multilayer black phosphorus

UWB Signal Detection by Cyclic Features