Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
27works
0followers
22topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

27 published item(s)

preprint2026arXiv

Online identification of nonlinear time-varying systems with uncertain information

Digital twins (DTs), serving as the core enablers for real-time monitoring and predictive maintenance of complex cyber-physical systems, impose critical requirements on their virtual models: high predictive accuracy, strong interpretability, and online adaptive capability. However, existing techniques struggle to meet these demands simultaneously: Bayesian methods excel in uncertainty quantification but lack model interpretability, while interpretable symbolic identification methods (e.g., SINDy) are constrained by their offline, batch-processing nature, which make real-time updates challenging. To bridge this semantic and computational gap, this paper proposes a novel Bayesian Regression-based Symbolic Learning (BRSL) framework. The framework formulates online symbolic discovery as a unified probabilistic state-space model. By incorporating sparse horseshoe priors, model selection is transformed into a Bayesian inference task, enabling simultaneous system identification and uncertainty quantification. Furthermore, we derive an online recursive algorithm with a forgetting factor and establish precise recursive conditions that guarantee the well-posedness of the posterior distribution. These conditions also function as real-time monitors for data utility, enhancing algorithmic robustness. Additionally, a rigorous convergence analysis is provided, demonstrating the convergence of parameter estimates under persistent excitation conditions. Case studies validate the effectiveness of the proposed framework in achieving interpretable, probabilistic prediction and online learning.

preprint2023arXiv

Sparse Progressive Distillation: Resolving Overfitting under Pretrain-and-Finetune Paradigm

Conventional wisdom in pruning Transformer-based language models is that pruning reduces the model expressiveness and thus is more likely to underfit rather than overfit. However, under the trending pretrain-and-finetune paradigm, we postulate a counter-traditional hypothesis, that is: pruning increases the risk of overfitting when performed at the fine-tuning phase. In this paper, we aim to address the overfitting problem and improve pruning performance via progressive knowledge distillation with error-bound properties. We show for the first time that reducing the risk of overfitting can help the effectiveness of pruning under the pretrain-and-finetune paradigm. Ablation studies and experiments on the GLUE benchmark show that our method outperforms the leading competitors across different tasks.

preprint2022arXiv

A Length Adaptive Algorithm-Hardware Co-design of Transformer on FPGA Through Sparse Attention and Dynamic Pipelining

Transformers are considered one of the most important deep learning models since 2018, in part because it establishes state-of-the-art (SOTA) records and could potentially replace existing Deep Neural Networks (DNNs). Despite the remarkable triumphs, the prolonged turnaround time of Transformer models is a widely recognized roadblock. The variety of sequence lengths imposes additional computing overhead where inputs need to be zero-padded to the maximum sentence length in the batch to accommodate the parallel computing platforms. This paper targets the field-programmable gate array (FPGA) and proposes a coherent sequence length adaptive algorithm-hardware co-design for Transformer acceleration. Particularly, we develop a hardware-friendly sparse attention operator and a length-aware hardware resource scheduling algorithm. The proposed sparse attention operator brings the complexity of attention-based models down to linear complexity and alleviates the off-chip memory traffic. The proposed length-aware resource hardware scheduling algorithm dynamically allocates the hardware resources to fill up the pipeline slots and eliminates bubbles for NLP tasks. Experiments show that our design has very small accuracy loss and has 80.2 $\times$ and 2.6 $\times$ speedup compared to CPU and GPU implementation, and 4 $\times$ higher energy efficiency than state-of-the-art GPU accelerator optimized via CUBLAS GEMM.

preprint2022arXiv

M-estimation in GARCH Models in the Absence of Higher-Order Moments

We consider a class of M-estimators of the parameters of a GARCH (p,q) model. These estimators involve score functions and, for adequate choices of the score functions, are asymptotically normal under milder moment assumptions than the usual quasi maximum likelihood, which makes them more reliable in the presence of heavy tails. We also consider weighted bootstrap approximations of the distributions of these M-estimators and establish their validity. Through extensive simulations, we demonstrate the robustness of these M-estimators under heavy tails and conduct a comparative study of the performance (bias and mean squared errors) of various score functions and the accuracy (confidence interval coverage rates) of their bootstrap approximations. In addition to the GARCH (1, 1) model, our simulations also involve higher-order models such as GARCH~(2, 1) and GARCH~(1,~\!2) which so far have received relatively little attention in the literature. We also consider the case of order-misspecified models. Finally, we use our M-estimators in the analysis of two real financial time series fitted with GARCH (1, 1) or GARCH (2, 1) models.

preprint2022arXiv

Motif-based Graph Representation Learning with Application to Chemical Molecules

This work considers the task of representation learning on the attributed relational graph (ARG). Both the nodes and edges in an ARG are associated with attributes/features allowing ARGs to encode rich structural information widely observed in real applications. Existing graph neural networks offer limited ability to capture complex interactions within local structural contexts, which hinders them from taking advantage of the expression power of ARGs. We propose Motif Convolution Module (MCM), a new motif-based graph representation learning technique to better utilize local structural information. The ability to handle continuous edge and node features is one of MCM's advantages over existing motif-based models. MCM builds a motif vocabulary in an unsupervised way and deploys a novel motif convolution operation to extract the local structural context of individual nodes, which is then used to learn higher-level node representations via multilayer perceptron and/or message passing in graph neural networks. When compared with other graph learning approaches to classifying synthetic graphs, our approach is substantially better in capturing structural context. We also demonstrate the performance and explainability advantages of our approach by applying it to several molecular benchmarks.

preprint2022arXiv

Negative Interatomic Spring Constant Manifested by Topological Phonon Flat Band

Phonons as bosons are different from electrons as fermions. Unlike interatomic electron hopping that can be either positive or negative and further tuned by spin-orbit coupling, interatomic spring constant is positive, or the structure of atomic lattices would be dynamically unstable. Surprisingly, we found that topological phonon flat bands (FBs) can manifest either a positive or negative interatomic spring constant that couples the FB-modes of opposite chirality, as exemplified by first-principles calculations of a 2D material of Kagome-BN. To reveal its physical origin, we first establish a fundamental correspondence between a collective lattice-coupling (CLC) variable of two quasi-particle states (e.g., electronic states or phonon modes) of opposite parity in a periodic lattice with band topology. Topological semimetals arise with zero CLC at special k-points protected by symmetry; while positive and negative CLC at these k-points gives rise to normal and topological insulators, respectively. Then, we show topological FB has a special form of CLC that vanishes at all k-points as characterized by its real-space wave function, and multi-atom FB phonon mode can manifest effectively a negative interatomic spring constant. Our findings shed new light on our fundamental understanding of topology and provide a practical design principle for creating artificial bosonic topological states.

preprint2022arXiv

Orbital Design of Flat Bands in Non-Line-Graph Lattices via Line-Graph Wavefunctions

Line-graph (LG) lattices are known for having flat bands (FBs) from the destructive interference of Bloch wavefunctions encoded in pure lattice symmetry. Here, we develop a generic atomic/molecular orbital design principle for FBs in non-LG lattices. Based on linear-combination-of-atomic-orbital (LCAO) theory, we demonstrate that the underlying wavefunction symmetry of FBs in a LG lattice can be transformed into the atomic/molecular orbital symmetry in a non-LG lattice. We illustrate such orbital-designed topological FBs in three 2D non-LG, square, trigonal, and hexagonal lattices, where the designed orbitals faithfully reproduce the corresponding lattice symmetries of checkerboard, Kagome, and diatomic-Kagome lattices, respectively. Interestingly, systematic design of FBs with a high Chern number is also achieved based on the same principle. Fundamentally our theory enriches the FB physics; practically it significantly expands the scope of FB materials, since most materials have multiple atomic/molecular orbitals at each lattice site, rather than a single s orbital mandated in graph theory and generic lattice models.

preprint2022arXiv

Relay-Assisted Cooperative Federated Learning

Federated learning (FL) has recently emerged as a promising technology to enable artificial intelligence (AI) at the network edge, where distributed mobile devices collaboratively train a shared AI model under the coordination of an edge server. To significantly improve the communication efficiency of FL, over-the-air computation allows a large number of mobile devices to concurrently upload their local models by exploiting the superposition property of wireless multi-access channels. Due to wireless channel fading, the model aggregation error at the edge server is dominated by the weakest channel among all devices, causing severe straggler issues. In this paper, we propose a relay-assisted cooperative FL scheme to effectively address the straggler issue. In particular, we deploy multiple half-duplex relays to cooperatively assist the devices in uploading the local model updates to the edge server. The nature of the over-the-air computation poses system objectives and constraints that are distinct from those in traditional relay communication systems. Moreover, the strong coupling between the design variables renders the optimization of such a system challenging. To tackle the issue, we propose an alternating-optimization-based algorithm to optimize the transceiver and relay operation with low complexity. Then, we analyze the model aggregation error in a single-relay case and show that our relay-assisted scheme achieves a smaller error than the one without relays provided that the relay transmit power and the relay channel gains are sufficiently large. The analysis provides critical insights on relay deployment in the implementation of cooperative FL. Extensive numerical results show that our design achieves faster convergence compared with state-of-the-art schemes.

preprint2022arXiv

SimNet: Accurate and High-Performance Computer Architecture Simulation using Deep Learning

While discrete-event simulators are essential tools for architecture research, design, and development, their practicality is limited by an extremely long time-to-solution for realistic applications under investigation. This work describes a concerted effort, where machine learning (ML) is used to accelerate discrete-event simulation. First, an ML-based instruction latency prediction framework that accounts for both static instruction properties and dynamic processor states is constructed. Then, a GPU-accelerated parallel simulator is implemented based on the proposed instruction latency predictor, and its simulation accuracy and throughput are validated and evaluated against a state-of-the-art simulator. Leveraging modern GPUs, the ML-based simulator outperforms traditional simulators significantly.

preprint2021arXiv

Conversational Query Rewriting with Self-supervised Learning

Context modeling plays a critical role in building multi-turn dialogue systems. Conversational Query Rewriting (CQR) aims to simplify the multi-turn dialogue modeling into a single-turn problem by explicitly rewriting the conversational query into a self-contained utterance. However, existing approaches rely on massive supervised training data, which is labor-intensive to annotate. And the detection of the omitted important information from context can be further improved. Besides, intent consistency constraint between contextual query and rewritten query is also ignored. To tackle these issues, we first propose to construct a large-scale CQR dataset automatically via self-supervised learning, which does not need human annotation. Then we introduce a novel CQR model Teresa based on Transformer, which is enhanced by self-attentive keywords detection and intent consistency constraint. Finally, we conduct extensive experiments on two public datasets. Experimental results demonstrate that our proposed model outperforms existing CQR baselines significantly, and also prove the effectiveness of self-supervised learning on improving the CQR performance.

preprint2021arXiv

Semi-Blind Cascaded Channel Estimation for Reconfigurable Intelligent Surface Aided Massive MIMO

Reconfigurable intelligent surface (RIS) is envisioned to be a promising green technology to reduce the energy consumption and improve the coverage and spectral efficiency of massive multiple-input multiple-output (MIMO) wireless networks. In a RIS-aided MIMO system, the acquisition of channel state information (CSI) is important for achieving passive beamforming gains of the RIS, but is also challenging due to the cascaded property of the transmitter-RIS-receiver channel and the lack of signal processing capability of the passive RIS elements. The state-of-the-art approach for CSI acquisition in such a system is a pure training-based strategy that depends on a long sequence of pilot symbols. In this paper, we investigate semi-blind cascaded channel estimation for RIS-aided massive MIMO systems, in which the receiver simultaneously estimates the channel coefficients and the partially unknown transmit signal with a small number of pilot sequences. Specifically, we formulate the semi-blind cascaded channel estimation as a trilinear matrix factorization task. Under the Bayesian inference framework, we develop a computationally efficient iterative algorithm using the approximate message passing principle to resolve the trilinear inference problem. Meanwhile, we present an analytical framework to characterize the theoretical performance bound of the proposed approach in the large-system limit via the replica method developed in statistical physics. Extensive simulation results demonstrate the effectiveness of the proposed semi-blind cascaded channel estimation algorithm.

preprint2021arXiv

SuperNeurons: FFT-based Gradient Sparsification in the Distributed Training of Deep Neural Networks

The performance and efficiency of distributed training of Deep Neural Networks highly depend on the performance of gradient averaging among all participating nodes, which is bounded by the communication between nodes. There are two major strategies to reduce communication overhead: one is to hide communication by overlapping it with computation, and the other is to reduce message sizes. The first solution works well for linear neural architectures, but latest networks such as ResNet and Inception offer limited opportunity for this overlapping. Therefore, researchers have paid more attention to minimizing communication. In this paper, we present a novel gradient compression framework derived from insights of real gradient distributions, and which strikes a balance between compression ratio, accuracy, and computational overhead. Our framework has two major novel components: sparsification of gradients in the frequency domain, and a range-based floating point representation to quantize and further compress gradients frequencies. Both components are dynamic, with tunable parameters that achieve different compression ratio based on the accuracy requirement and systems' platforms, and achieve very high throughput on GPUs. We prove that our techniques guarantee the convergence with a diminishing compression ratio. Our experiments show that the proposed compression framework effectively improves the scalability of most popular neural networks on a 32 GPU cluster to the baseline of no compression, without compromising the accuracy and convergence speed.

preprint2020arXiv

Acoustic black hole in Schwarzschild spacetime: quasi-normal modes, analogous Hawking radiation and shadows

Various properties of acoustic black holes constructed in Minkowski spacetime have been widely studied in the past decades. Recently the acoustic black holes in general spacetime were proposed . In this paper, we first investigate the basic characteristics of `curved' acoustic black hole in Schwarzschild spacetime, including the quasi-normal modes, grey-body factor and analogous Hawking radiation. We find that the signal of quasi-normal mode is weaker than that of Schwarzschild black hole. Moreover, as the tuning parameter increases, both the positive real part and negative imaginal part of the quasi-normal frequency approach to the horizonal axis, but they will not change sign. This means that all the perturbations could die off and the system is stable under those perturbations. Since the larger tuning parameter suppresses the effective potential barrier, so it enhances the grey-body factor. The energy emission rate of Hawking radiation does not monotonically increase of the tuning parameter because of the non-monotonicity of the Hawking temperature. Finally, as a first attempt, we study the acoustic black hole shadow. The radius of acoustic shadow becomes larger as the tuning parameter increases, because both the related acoustic horizon and the acoustic sphere become larger. Our studies could help us to further understand the near horizon geometrical features of the black hole. We also expect that our observations could be detected experimentally in the near future.

preprint2020arXiv

Center-Outward R-Estimation for Semiparametric VARMA Models

We propose a new class of R-estimators for semiparametric VARMA models in which the innovation density plays the role of the nuisance parameter. Our estimators are based on the novel concepts of multivariate center-outward ranks and signs. We show that these concepts, combined with Le Cam's asymptotic theory of statistical experiments, yield a class of semiparametric estimation procedures, which are efficient (at a given reference density), root-$n$ consistent, and asymptotically normal under a broad class of (possibly non elliptical) actual innovation densities. No kernel density estimation is required to implement our procedures. A Monte Carlo comparative study of our R-estimators and other routinely-applied competitors demonstrates the benefits of the novel methodology, in large and small sample. Proofs, computational aspects, and further numerical results are available in the supplementary material.

preprint2020arXiv

Echoes from phantom wormholes

We study the time evolution of the test scalar and electromagnetic fields perturbations in configurations of phantom wormholes surrounded by dark energy with state parameter $ω< -1$. We observe obvious signals of echoes reflecting wormholes properties and disclose the physical reasons behind such phenomena. In particular, we find that the dark energy equation of state has a clear imprint in echoes in wave perturbations. When $ω$ approaches the phantom divide $ω=-1$ from below, the delay time of echoes becomes longer. The echo of gravitational wave is likely to be detected in the near future, the signature of the dark energy equation of state in the echo spectrum can serve as a local measurement of the dark energy.

preprint2020arXiv

Efficient decoy-states for the reference-frame-independent measurement-device-independent quantum key distribution

Reference-frame-independent measurement-device-independent quantum key distribution (RFI-MDI-QKD) is a novel protocol which eliminates all possible attacks on detector side and necessity of reference-frame alignment in source sides. However, its performance may degrade notably due to statistical fluctuations, since more parameters, e.g. yields and error rates for mismatched-basis events, must be accumulated to monitor the security. In this work, we find that the original decoy-states method estimates these yields over pessimistically since it ignores the potential relations between different bases. Through processing parameters of different bases jointly, the performance of RFI-MDI-QKD is greatly improved in terms of secret key rate and achievable distance when statistical fluctuations are considered. Our results pave an avenue towards practical RFI-MDI-QKD.

preprint2020arXiv

EZLDA: Efficient and Scalable LDA on GPUs

LDA is a statistical approach for topic modeling with a wide range of applications. However, there exist very few attempts to accelerate LDA on GPUs which come with exceptional computing and memory throughput capabilities. To this end, we introduce EZLDA which achieves efficient and scalable LDA training on GPUs with the following three contributions: First, EZLDA introduces three-branch sampling method which takes advantage of the convergence heterogeneity of various tokens to reduce the redundant sampling task. Second, to enable sparsity-aware format for both D and W on GPUs with fast sampling and updating, we introduce hybrid format for W along with corresponding token partition to T and inverted index designs. Third, we design a hierarchical workload balancing solution to address the extremely skewed workload imbalance problem on GPU and scaleEZLDA across multiple GPUs. Taken together, EZLDA achieves superior performance over the state-of-the-art attempts with lower memory consumption.

preprint2020arXiv

FTRANS: Energy-Efficient Acceleration of Transformers using FPGA

In natural language processing (NLP), the &#34;Transformer&#34; architecture was proposed as the first transduction model replying entirely on self-attention mechanisms without using sequence-aligned recurrent neural networks (RNNs) or convolution, and it achieved significant improvements for sequence to sequence tasks. The introduced intensive computation and storage of these pre-trained language representations has impeded their popularity into computation and memory-constrained devices. The field-programmable gate array (FPGA) is widely used to accelerate deep learning algorithms for its high parallelism and low latency. However, the trained models are still too large to accommodate to an FPGA fabric. In this paper, we propose an efficient acceleration framework, Ftrans, for transformer-based large scale language representations. Our framework includes enhanced block-circulant matrix (BCM)-based weight representation to enable model compression on large-scale language representations at the algorithm level with few accuracy degradation, and an acceleration design at the architecture level. Experimental results show that our proposed framework significantly reduces the model size of NLP models by up to 16 times. Our FPGA design achieves 27.07x and 81x improvement in performance and energy efficiency compared to CPU, and up to 8.80x improvement in energy efficiency compared to GPU.

preprint2020arXiv

Information Freshness-Aware Task Offloading in Air-Ground Integrated Edge Computing Systems

This paper studies the problem of information freshness-aware task offloading in an air-ground integrated multi-access edge computing system, which is deployed by an infrastructure provider (InP). A third-party real-time application service provider provides computing services to the subscribed mobile users (MUs) with the limited communication and computation resources from the InP based on a long-term business agreement. Due to the dynamic characteristics, the interactions among the MUs are modelled by a non-cooperative stochastic game, in which the control policies are coupled and each MU aims to selfishly maximize its own expected long-term payoff. To address the Nash equilibrium solutions, we propose that each MU behaves in accordance with the local system states and conjectures, based on which the stochastic game is transformed into a single-agent Markov decision process. Moreover, we derive a novel online deep reinforcement learning (RL) scheme that adopts two separate double deep Q-networks for each MU to approximate the Q-factor and the post-decision Q-factor. Using the proposed deep RL scheme, each MU in the system is able to make decisions without a priori statistical knowledge of dynamics. Numerical experiments examine the potentials of the proposed scheme in balancing the age of information and the energy consumption.

preprint2020arXiv

Matrix-Calibration-Based Cascaded Channel Estimation for Reconfigurable Intelligent Surface Assisted Multiuser MIMO

Reconfigurable intelligent surface (RIS) is envisioned to be an essential component of the paradigm for beyond 5G networks as it can potentially provide similar or higher array gains with much lower hardware cost and energy consumption compared with the massive multiple-input multiple-output (MIMO) technology. In this paper, we focus on one of the fundamental challenges, namely the channel acquisition, in an RIS-assisted multiuser MIMO system. The state-of-the-art channel acquisition approach in such a system with fully passive RIS elements estimates the cascaded transmitter-to-RIS and RIS-to-receiver channels by adopting excessively long training sequences. To estimate the cascaded channels with an affordable training overhead, we formulate the channel estimation problem in the RIS-assisted multiuser MIMO system as a matrix-calibration based matrix factorization task. By exploiting the information on the slow-varying channel components and the hidden channel sparsity, we propose a novel message-passing based algorithm to factorize the cascaded channels. Furthermore, we present an analytical framework to characterize the theoretical performance bound of the proposed estimator in the large-system limit. Finally, we conduct simulations to verify the high accuracy and efficiency of the proposed algorithm.

preprint2020arXiv

Quantum key distribution with dissipative Kerr soliton generated by on-chip microresonators

Quantum key distribution (QKD) can distribute symmetric key bits between remote legitimate users with the guarantee of quantum mechanics principles. For practical applications, the compact and robust photonic components for QKD are essential, and there are increasing attention to integrate the source, detector and modulators on a photonic chip. However, the massive and parallel QKD based on wavelength multiplexing are still challenge, due to the limited coherent light sources on the chip. Here, we introduce the Kerr dissipative soliton in a microresonator, which provides the locked coherent frequency comb with 49GHz frequency spacing, for QKD. We demonstrate the parallel QKD by demulplexing the coherent comb lines form the soliton, and showing the potential of Gbps secret key rate if the hundreds of channels covering C and L bands are fully exploited. The demonstrated soliton based QKD architecture are compatible with the efforts of quantum photonic integrated circuits, which are compact, robust and low-cost, and provides a competitive platform of practical QKD chip.

preprint2020arXiv

R-estimators in GARCH models; asymptotics, applications and bootstrapping

The quasi-maximum likelihood estimation is a commonly-used method for estimating GARCH parameters. However, such estimators are sensitive to outliers and their asymptotic normality is proved under the finite fourth moment assumption on the underlying error distribution. In this paper, we propose a novel class of estimators of the GARCH parameters based on ranks, called R-estimators, with the property that they are asymptotic normal under the existence of a more than second moment of the errors and are highly efficient. We also consider the weighted bootstrap approximation of the finite sample distributions of the R-estimators. We propose fast algorithms for computing the R-estimators and their bootstrap replicates. Both real data analysis and simulations show the superior performance of the proposed estimators under the normal and heavy-tailed distributions. Our extensive simulations also reveal excellent coverage rates of the weighted bootstrap approximations. In addition, we discuss empirical and simulation results of the R-estimators for the higher order GARCH models such as the GARCH~($2, 1$) and asymmetric models such as the GJR model.

preprint2020arXiv

Reconfigurable-Intelligent-Surface Empowered Wireless Communications: Challenges and Opportunities

Reconfigurable intelligent surfaces (RISs) are regarded as a promising emerging hardware technology to improve the spectrum and energy efficiency of wireless networks by artificially reconfiguring the propagation environment of electromagnetic waves. Due to the unique advantages in enhancing wireless channel capacity, RISs have recently become a hot research topic. In this article, we focus on three fundamental physical-layer challenges for the incorporation of RISs into wireless networks, namely, channel state information acquisition, passive information transfer, and low-complexity robust system design. We summarize the state-of-the-art solutions and explore potential research directions. Furthermore, we discuss other promising research directions of RISs, including edge intelligence and physical-layer security.

preprint2020arXiv

SAPAG: A Self-Adaptive Privacy Attack From Gradients

Distributed learning such as federated learning or collaborative learning enables model training on decentralized data from users and only collects local gradients, where data is processed close to its sources for data privacy. The nature of not centralizing the training data addresses the privacy issue of privacy-sensitive data. Recent studies show that a third party can reconstruct the true training data in the distributed machine learning system through the publicly-shared gradients. However, existing reconstruction attack frameworks lack generalizability on different Deep Neural Network (DNN) architectures and different weight distribution initialization, and can only succeed in the early training phase. To address these limitations, in this paper, we propose a more general privacy attack from gradient, SAPAG, which uses a Gaussian kernel based of gradient difference as a distance measure. Our experiments demonstrate that SAPAG can construct the training data on different DNNs with different weight initializations and on DNNs in any training phases.

preprint2020arXiv

Statistical Beamforming for FDD Downlink Massive MIMO via Spatial Information Extraction and Beam Selection

In this paper, we study the beamforming design problem in frequency-division duplexing (FDD) downlink massive MIMO systems, where instantaneous channel state information (CSI) is assumed to be unavailable at the base station (BS). We propose to extract the information of the angle-of-departures (AoDs) and the corresponding large-scale fading coefficients (a.k.a. spatial information) of the downlink channel from the uplink channel estimation procedure, based on which a novel downlink beamforming design is presented. By separating the subpaths for different users based on the spatial information and the hidden sparsity of the physical channel, we construct near-orthogonal virtual channels in the beamforming design. Furthermore, we derive a sum-rate expression and its approximations for the proposed system. Based on these closed-form rate expressions, we develop two low-complexity beam selection schemes and carry out asymptotic analysis to provide valuable insights on the system design. Numerical results demonstrate a significant performance improvement of our proposed algorithm over the state-of-the-art beamforming approach.

preprint2019arXiv

Cooperative evolution of intraband and interband excitations for high harmonic generation in strained MoS2

Modulating electronic structure of two-dimensional (2D) materials represents an exciting avenue for tailoring their optoelectronic properties. Here, we identify a strain-induced, cooperative effect of intraband and interband excitations contributing to high harmonic generation (HHG) in prototype dichalcogenide MoS2 monolayer. We find that besides the dominant intraband contributions, interband current is also indispensable in modulating HHG. The HHG yields increase linearly with the compressive strain since flatter band dispersion and Berry curvature enhance both interband and intraband dynamics. Band structure can be retrieved with high reliability by monitoring the strain-induced evolution of HHG spectra, suggesting that strain not only provides an additional knob to control HHG in solids, but also marks a way towards a complete understanding of underlying microscopic mechanisms.

preprint2019arXiv

Strong Cosmic Censorship in Charged de Sitter spacetime with Scalar Field Non-minimally Coupled to Curvature

We examine the stability and the strong cosmic censorship in the Reissner-Nordstrom-de Sitter (RN-dS) black hole by investigating the evolution of a scalar field non-minimally coupled to the curvature. We find that when the coupling parameter is negative, the RN-dS black hole experiences instability. The instability disappears when the coupling parameter becomes non-negative. With the increase of the coupling parameter, the violation of the strong cosmic censorship occurs at a larger critical charge ratio. But such an increase of the critical charge is suppressed by the increase of the cosmological constant. Different from the minimal coupling situation, it is possible to accommodate $β\ge1$ in the near extremal black hole when the scalar field is non-minimally coupled to curvature. The increase of the cosmological constant can allow $β\ge1$ to be satisfied for even smaller value of the coupling parameter. The existence of $β\ge1$ implies that the resulting curvature can continuously cross the Cauchy horizon.