Trust snapshot

Quick read

Trust 21 - Emerging
Verification L1
Unclaimed author
26 works
0 followers
24 topics
4 close collaborators

Published work

26 published item(s)

preprint 2022 arXiv

$H$-factors in graphs with small independence number

Let $H$ be an $h$-vertex graph. The vertex arboricity $ar(H)$ of $H$ is the least integer $r$ such that $V(H)$ can be partitioned into $r$ parts and each part induces a forest in $H$. We show that for sufficiently large $n\in h\mathbb{N}$, every $n$-vertex graph $G$ with $\delta(G)\geq \max\left\{\left(1-\frac{2}{f(H)}+o(1)\right)n, \left(\frac{1}{2}+o(1)\right)n\right\}$ and $\alpha(G)=o(n)$ contains an $H$-factor, where $f(H)=2ar(H)$ or $2ar(H)-1$. The result can be viewed as an analogue of the Alon--Yuster theorem \cite{MR1376050} in Ramsey--Turán theory, which generalises the results of Balogh--Molla--Sharifzadeh~\cite{MR3570984} and Knierim--Su~\cite{MR4193066} on clique factors. In particular, the degree conditions are asymptotically sharp for infinitely many graphs $H$ which are not cliques.

preprint 2022 arXiv

A DNS Tunnel Sliding Window Differential Detection Method Based on Normal Distribution Reasonable Range Filtering

A covert attack method often used by APT organizations is the DNS tunnel, which is used to pass information by constructing C2 networks. These organizations often change domain names and server IP addresses frequently to evade monitoring, which makes them extremely difficult to detect. However, they carry DNS tunnel traffic inside normal DNS communication, which inevitably introduces anomalies in some statistical characteristics of DNS traffic and gives security personnel the opportunity to find them. Based on these considerations, this paper studies a statistical methodology for discovering the high-frequency query behavior typical of DNS tunnels. Firstly, we analyze the distribution of DNS domain name length and query times and find that both follow a normal distribution. Secondly, based on this distribution, we propose a method for detecting high-frequency DNS query behaviors of non-single domain names based on the statistical rules of domain name length and frequency, and we give three theorems as theoretical support. Thirdly, we design a sliding window difference scheme based on the above method. Experimental results show that our method has a higher detection rate. At the same time, since our method does not need to construct a data set, it has better practicability in detecting unknown DNS tunnels. This also shows that our detection method based on mathematical models can effectively avoid the dilemma of machine learning methods, which must have useful training data sets, and has strong practical significance.
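
The range-filtering idea in this abstract can be illustrated with a minimal sketch. It assumes that differences between consecutive window counts are roughly normal under benign traffic, and it flags windows whose jump exceeds the mean plus k standard deviations. The function names, the parameter `k`, and the sample numbers are hypothetical choices for illustration; this is not the paper's exact scheme or its three theorems.

```python
from statistics import mean, stdev

def window_counts(timestamps, width, step):
    """Count events falling in sliding windows [start, start + width)."""
    if not timestamps:
        return []
    counts = []
    start, end = min(timestamps), max(timestamps)
    while start <= end:
        counts.append(sum(1 for t in timestamps if start <= t < start + width))
        start += step
    return counts

def reasonable_range(values, k=3.0):
    """Return (low, high) = mean -/+ k sample standard deviations."""
    mu, sigma = mean(values), stdev(values)
    return mu - k * sigma, mu + k * sigma

def flag_jumps(baseline_counts, observed_counts, k=3.0):
    """Indices of observed windows whose increase over the previous window
    lies above the range estimated from baseline window-to-window differences."""
    base_diffs = [b - a for a, b in zip(baseline_counts, baseline_counts[1:])]
    _, high = reasonable_range(base_diffs, k)
    obs_diffs = [b - a for a, b in zip(observed_counts, observed_counts[1:])]
    return [i + 1 for i, d in enumerate(obs_diffs) if d > high]

if __name__ == "__main__":
    baseline = [10, 12, 11, 9, 10, 11, 12, 10]   # made-up benign query counts
    observed = [10, 11, 60, 58]                  # made-up counts with a burst
    print(flag_jumps(baseline, observed))
```

Working on differences rather than raw counts means a slowly drifting query volume is not flagged, while an abrupt burst is.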

preprint 2022 arXiv

A Novel 3D Non-Stationary Channel Model for 6G Indoor Visible Light Communication Systems

The visible light communication (VLC) technology has attracted much attention in the research of the sixth generation (6G) communication systems. In this paper, a novel three dimensional (3D) space-time-frequency non-stationary geometry-based stochastic model (GBSM) is proposed for indoor VLC channels. The proposed VLC GBSM can capture unique indoor VLC channel characteristics such as the space-time-frequency non-stationarity caused by large light-emitting diode (LED) arrays in indoor scenarios, long travelling paths, and large bandwidths of visible light waves, respectively. In addition, the proposed model can support special radiation patterns of LEDs, 3D translational and rotational motions of the optical receiver (Rx), and can be applied to angle diversity receivers (ADRs). Key channel properties are simulated and analyzed, including the space-time-frequency correlation function (STFCF), received power, root mean square (RMS) delay spread, and path loss (PL). Simulation results verify the space-time-frequency non-stationarity in indoor VLC channels. Finally, the accuracy and practicality of the proposed model are validated by comparing the simulation result of channel 3dB bandwidth with the existing measurement data. The proposed channel model will play a supporting role in the design of future 6G VLC systems.

preprint 2022 arXiv

AIM 2022 Challenge on Super-Resolution of Compressed Image and Video: Dataset, Methods and Results

This paper reviews the Challenge on Super-Resolution of Compressed Image and Video at AIM 2022. This challenge includes two tracks. Track 1 aims at the super-resolution of compressed images, and Track 2 targets the super-resolution of compressed video. In Track 1, we use the popular dataset DIV2K as the training, validation and test sets. In Track 2, we propose the LDV 3.0 dataset, which contains 365 videos, including the LDV 2.0 dataset (335 videos) and 30 additional videos. In this challenge, 12 teams and 2 teams submitted final results to Track 1 and Track 2, respectively. The proposed methods and solutions gauge the state-of-the-art of super-resolution on compressed image and video. The proposed LDV 3.0 dataset is available at https://github.com/RenYang-home/LDV_dataset. The homepage of this challenge is at https://github.com/RenYang-home/AIM22_CompressSR.

preprint 2022 arXiv

Autonomous Electric Vehicle Battery Disassembly Based on NeuroSymbolic Computing

The booming of electric vehicles demands efficient battery disassembly for recycling to be environment-friendly. Due to the unstructured environment and high uncertainties, battery disassembly is still primarily done by humans, probably assisted by robots. It is highly desirable to design autonomous solutions to improve work efficiency and lower human risks in high voltage and toxic environments. This paper proposes a novel framework of the NeuroSymbolic task and motion planning method to disassemble batteries in an unstructured environment using robots automatically. It enables robots to independently locate and disassemble battery bolts, with or without obstacles. This study not only provides a solution for intelligently disassembling electric vehicle batteries but also verifies its feasibility through a set of test results with the robot accomplishing the disassembly tasks in a complex and dynamic environment.

preprint 2022 arXiv

Displacement calibration of optical tweezers with absolute gravitational acceleration

In recent years, particles levitated in optical traps in vacuum have shown enormous potential in precision sensor development and in the search for new physics. The accuracy of the calibration relating the detected signal to the absolute displacement of the trapped particle is a critical factor for absolute measurement performance. In this paper, we suggest and experimentally demonstrate a novel calibration method for optical tweezers based on free-falling particles in vacuum, where the gravitational acceleration is introduced as an absolute reference. Our work provides a calibration protocol with great certainty and traceability, which is significant in improving the accuracy of precision sensing based on optically levitated particles.

preprint 2022 arXiv

Field-wise Embedding Size Search via Structural Hard Auxiliary Mask Pruning for Click-Through Rate Prediction

Feature embeddings are one of the most essential steps when training deep learning based Click-Through Rate prediction models, which map high-dimensional sparse features to dense embedding vectors. Classic human-crafted embedding size selection methods are shown to be "sub-optimal" in terms of the trade-off between memory usage and model capacity. The trending methods in Neural Architecture Search (NAS) have demonstrated their efficiency to search for embedding sizes. However, most existing NAS-based works suffer from expensive computational costs, the curse of dimensionality of the search space, and the discrepancy between continuous search space and discrete candidate space. Other works that prune embeddings in an unstructured manner fail to reduce the computational costs explicitly. In this paper, to address those limitations, we propose a novel strategy that searches for the optimal mixed-dimension embedding scheme by structurally pruning a super-net via Hard Auxiliary Mask. Our method aims to directly search candidate models in the discrete space using a simple and efficient gradient-based method. Furthermore, we introduce orthogonal regularity on embedding tables to reduce correlations within embedding columns and enhance representation capacity. Extensive experiments demonstrate it can effectively remove redundant embedding dimensions without great performance loss.

preprint 2022 arXiv

MSE-Based Transceiver Designs for RIS-Aided Communications With Hardware Impairments

It is challenging to precisely configure the phase shifts of the reflecting elements at the reconfigurable intelligent surface (RIS) due to inherent hardware impairments (HIs). In this paper, the mean square error (MSE) performance is investigated in an RIS-aided single-user multiple-input multiple-output (MIMO) communication system with transceiver HIs and RIS phase noise. We aim to jointly optimize the transmit precoder, linear received equalizer, and RIS reflecting matrices to minimize the MSE. To tackle this problem, an iterative algorithm is proposed, wherein the beamforming matrices are alternately optimized. Specifically, for the beamforming optimization subproblem, we derive the closed-form expression of the optimal precoder and equalizer matrices. Then, for the phase shift optimization subproblem, an efficient algorithm based on the majorization-minimization (MM) method is proposed. Simulation results show that the proposed MSE-based RIS-aided transceiver scheme dramatically outperforms the conventional system algorithms that do not consider HIs at both the transceiver and the RIS.

preprint 2022 arXiv

Online Time Series Anomaly Detection with State Space Gaussian Processes

We propose r-ssGPFA, an unsupervised online anomaly detection model for uni- and multivariate time series building on the efficient state space formulation of Gaussian processes. For high-dimensional time series, we propose an extension of Gaussian process factor analysis to identify the common latent processes of the time series, allowing us to detect anomalies efficiently in an interpretable manner. We gain explainability while speeding up computations by imposing an orthogonality constraint on the mapping from the latent to the observed. Our model's robustness is improved by using a simple heuristic to skip Kalman updates when encountering anomalous observations. We investigate the behaviour of our model on synthetic data and show on standard benchmark datasets that our method is competitive with state-of-the-art methods while being computationally cheaper.
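
The heuristic of skipping Kalman updates on anomalous observations can be sketched in one dimension. The example below assumes a local-level (random walk plus noise) state space model and a hypothetical threshold on the normalized innovation; the function name, the noise parameters `q` and `r`, and the threshold are illustrative assumptions, not the r-ssGPFA model itself.

```python
import math

def robust_local_level_filter(observations, q=0.01, r=1.0,
                              m0=0.0, p0=1.0, threshold=3.0):
    """Scalar Kalman filter that skips the update step for outliers.

    q: process noise variance, r: observation noise variance,
    m0/p0: prior mean and variance, threshold: cut-off on the
    absolute normalized innovation (all hypothetical defaults).
    Returns (filtered means, anomaly flags).
    """
    m, p = m0, p0
    means, flags = [], []
    for y in observations:
        # Predict: the random-walk state keeps its mean, variance grows by q.
        p = p + q
        # Innovation and its predicted variance.
        v = y - m
        s = p + r
        is_anomaly = abs(v) / math.sqrt(s) > threshold
        if not is_anomaly:
            # Standard Kalman update.
            k = p / s
            m = m + k * v
            p = (1.0 - k) * p
        # When the point is anomalous the update is skipped, so the
        # outlier does not contaminate the state estimate.
        means.append(m)
        flags.append(is_anomaly)
    return means, flags

if __name__ == "__main__":
    ys = [0.1, -0.2, 0.0, 0.1, 25.0, 0.0]   # made-up series with one spike
    means, flags = robust_local_level_filter(ys)
    print(flags)
```

Because the variance still grows during skipped steps, a genuine level shift eventually falls back inside the threshold and the filter re-locks onto the data.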

preprint 2021 arXiv

Infant Cry Classification with Graph Convolutional Networks

We propose a graph convolutional network approach for robust infant cry classification. We construct non-fully connected graphs based on the similarities among the relevant nodes in both supervised and semi-supervised node classification with convolutional neural networks to consider the short-term and long-term effects of infant cry signals related to inner-class and inter-class messages. The approach captures the diversity of variations within infant cries, especially for limited training samples. The effectiveness of this approach is evaluated on the Baby Chillanto database and the Baby2020 database. With as little as 20% of labeled training data, our model outperforms the CNN model trained with 80% labeled training data, and the accuracy improves steadily as the number of labeled training samples increases. The best results give significant improvements of 7.36% and 3.59% compared with the results of the CNN models on the Baby Chillanto database and the Baby2020 database, respectively.

preprint 2020 arXiv

AutoEmb: Automated Embedding Dimensionality Search in Streaming Recommendations

Deep learning based recommender systems (DLRSs) often have embedding layers, which are utilized to lessen the dimensionality of categorical variables (e.g. user/item identifiers) and meaningfully transform them in the low-dimensional space. The majority of existing DLRSs empirically pre-define a fixed and unified dimension for all user/item embeddings. It is evident from recent research that different embedding sizes are highly desired for different users/items according to their popularity. However, manually selecting embedding sizes in recommender systems can be very challenging due to the large number of users/items and the dynamic nature of their popularity. Thus, in this paper, we propose an AutoML based end-to-end framework (AutoEmb), which can enable various embedding dimensions according to the popularity in an automated and dynamic manner. To be specific, we first enhance a typical DLRS to allow various embedding dimensions; then we propose an end-to-end differentiable framework that can automatically select different embedding dimensions according to user/item popularity; finally we propose an AutoML based optimization algorithm in a streaming recommendation setting. The experimental results based on widely used benchmark datasets demonstrate the effectiveness of the AutoEmb framework.

preprint 2020 arXiv

Cooperative Rate-Splitting for Secrecy Sum-Rate Enhancement in Multi-antenna Broadcast Channels

In this paper, we employ the Cooperative Rate-Splitting (CRS) technique to enhance the Secrecy Sum Rate (SSR) for the Multiple Input Single Output (MISO) Broadcast Channel (BC), consisting of two legitimate users and one eavesdropper, with perfect Channel State Information (CSI) available at all nodes. For CRS based on the three-node relay channel, the transmitter splits and encodes the messages of legitimate users into common and private streams based on Rate-Splitting (RS). With the goal of maximizing SSR, the proposed CRS strategy opportunistically asks the relaying legitimate user to forward its decoded common message. During the transmission, the eavesdropper keeps wiretapping silently. To ensure secure transmission, the common message serves a dual purpose, acting both as a desired message and as Artificial Noise (AN), without consuming extra transmit power compared to the conventional AN design. Taking into account the total power constraint and the Physical Layer (PHY) security, the precoders and time-slot allocation are jointly optimized by solving the non-convex SSR maximization problem based on the Sequential Convex Approximation (SCA) algorithm. Numerical results show that the proposed CRS secure transmission scheme outperforms existing Multi-User Linear Precoding (MU-LP) and Cooperative Non-Orthogonal Multiple Access (C-NOMA) strategies. Therefore, CRS is a promising strategy to enhance the PHY security in multi-antenna BC systems.

preprint 2020 arXiv

Dopant levels in large nanocrystals using stochastic optimally tuned range-separated hybrid density functional theory

We apply a stochastic version of an optimally tuned range-separated hybrid functional to provide insight on the electronic properties of P- and B- doped Si nanocrystals of experimentally relevant sizes. We show that we can use the range-separation parameter for undoped systems to calculate accurate results for dopant activation energies. We apply this strategy for tuning functionals to study doped nanocrystals up to 2.5 nm in diameter at the hybrid functional level. In this confinement regime, the P- and B- dopants have large activation energies and have strongly localized states that lie deep within the energy gaps. Structural relaxation plays a greater role for B-substituted dopants and contributes to the increase in activation energy when the B dopant is near the nanocrystal surface.

preprint 2020 arXiv

Enforced freedom: electric-field-induced declustering of ionic-liquid ions in the double layer

Whereas the majority of ions in the bulk of a solvent-free ionic liquid is bound into clusters, this is expected to change in the electrical double layer (EDL), in which the resulting electric field 'prefers' to interact with electrical monopoles, that is, free, unclustered ions. The competition between the propensity of ions to stay in a clustered state and the reduction of the energy of ions in the electric field in the free state determines the resulting portion of free ions in the EDL. We present a study of this effect, based on the simplest possible mean-field theory. 'Cracking' of ion clusters into individual ions in the electric field is accompanied by a change in the dielectric response of the ionic liquid, which is different in clustered and unclustered states. The predictions of the theory are verified and further explored by specially performed molecular dynamics simulations. A particular finding of the theory is that the differential capacitance vs potential curve displays a bell shape despite the low concentration of free charge carriers, because the dielectric response of bound ions reduces the threshold concentration of the bell- to camel-shape transition. Whereas qualitatively these findings make perfect sense, in reality the exact numbers and criteria might be different, as the presented simple theory does not take into account overscreening and oscillating charge and electrostatic potential distributions near the electrode. This is why testing the theory with computer simulations is essential, but the latter basically reproduce the qualitative conclusions of the theory.

preprint 2020 arXiv

Improve black-box sequential anomaly detector relevancy with limited user feedback

Anomaly detectors are often designed to catch statistical anomalies. End-users typically are not interested in all of the detected outliers, but only in those relevant to their application. Given an existing black-box sequential anomaly detector, this paper proposes a method to improve its user relevancy using a small amount of human feedback. As our first contribution, the method is agnostic to the detector: it only assumes access to its anomaly scores and requires no additional information from inside it. Inspired by the fact that anomalies are of different types, our approach identifies these types and utilizes user feedback to assign relevancy to types. This relevancy score, as our second contribution, is used to adjust the subsequent anomaly selection process. Empirical results on synthetic and real-world datasets show that our approach yields significant improvements in precision and recall over a range of anomaly detectors.

preprint 2020 arXiv

Nanoscale magnetic resonance imaging of proteins in a single cell

Magnetic resonance imaging (MRI) is a non-invasive and label-free technique widely used in medical diagnosis and life science research, and its success has benefited greatly from continuing efforts on enhancing contrast and resolution. Here we report nanoscale MRI in a single cell using an atomic-size quantum sensor. With a nitrogen-vacancy center in diamond, the intracellular protein ferritin has been imaged with a spatial resolution of ~10 nanometers, and ferritin-containing organelles were co-localized by correlative MRI and electron microscopy. Compared to the micrometer resolution of current state-of-the-art conventional MRI, our approach represents a 100-fold enhancement and paves the way for MRI of intracellular proteins.

preprint 2020 arXiv

Range-Separated Stochastic Resolution of Identity: Formulation and Application to Second Order Green's Function Theory

We develop a range-separated stochastic resolution of identity approach for the $4$-index electron repulsion integrals, where the larger terms (above a predefined threshold) are treated using a deterministic resolution of identity and the remaining terms are treated using a stochastic resolution of identity. The approach is implemented within a second-order Green's function formalism with an improved $O(N^3)$ scaling with the size of the basis set, $N$. Moreover, the range-separated approach greatly reduces the statistical error compared to the full stochastic version (J. Chem. Phys. 151, 044144 (2019)), resulting in computational speedups of ground and excited state energies of nearly two orders of magnitude, as demonstrated for hydrogen dimer chains.

preprint 2020 arXiv

Reflections in the Sky: Joint Trajectory and Passive Beamforming Design for Secure UAV Networks with Reconfigurable Intelligent Surface

This paper investigates the problem of secure energy efficiency maximization for a reconfigurable intelligent surface (RIS) assisted uplink wireless communication system, where an unmanned aerial vehicle (UAV) equipped with an RIS works as a mobile relay between the base station (BS) and a group of users. We focus on maximizing the secure energy efficiency of the system via jointly optimizing the UAV's trajectory, the RIS's phase shift, users' association and transmit power. To tackle this problem, we divide the original problem into three sub-problems, and propose an efficient iterative algorithm. In particular, the successive convex approximation method (SCA) is applied to solve the nonconvex UAV trajectory, the RIS's phase shift, and transmit power optimization sub-problems. We further provide two schemes to simplify the solution of the phase and trajectory sub-problems. Simulation results demonstrate that the proposed algorithm converges fast, and the proposed design can enhance the secure energy efficiency by up to 38% compared to the traditional schemes without any RIS.

preprint 2020 arXiv

Remarks on Replica Method and Sachdev-Ye-Kitaev Model

By tracing back to the EA/SK models, we disentangle the construction logic of the SYK model. During the construction, we recall the technical essence of the replica method. Then we point out the redundancy of the flavor group and the slave indices as well as the color group via the generalization from the 2-index SY model to its 4-index version and the simplification of the latter in terms of Majorana fermions. Spectacularly, such a simplification preserves the system's self-consistency condition. After getting rid of the redundancy, the model itself and its effective action have the same $O(N_s)$ structural symmetry, while it realizes $AdS_2/CFT_1$ holography by the emergent conformal symmetry. We also clarify the model transmutation to a matrix model and argue that the disorder-averaged SYK model naturally preserves a holographic nature. Based on the transmutation and the correspondence with the vector model, the SYK model is formally a hybrid of these two models. Its quantum and semiclassical properties are also discussed respectively.

preprint 2020 arXiv

SBAT: Video Captioning with Sparse Boundary-Aware Transformer

In this paper, we focus on the problem of applying the transformer structure to video captioning effectively. The vanilla transformer was proposed for uni-modal language generation tasks such as machine translation. However, video captioning is a multimodal learning problem, and the video features have much redundancy between different time steps. Based on these concerns, we propose a novel method called sparse boundary-aware transformer (SBAT) to reduce the redundancy in video representation. SBAT employs a boundary-aware pooling operation for scores from multihead attention and selects diverse features from different scenarios. Also, SBAT includes a local correlation scheme to compensate for the local information loss brought by the sparse operation. Based on SBAT, we further propose an aligned cross-modal encoding scheme to boost the multimodal interaction. Experimental results on two benchmark datasets show that SBAT outperforms the state-of-the-art methods under most of the metrics.

preprint 2020 arXiv

Simple and Deep Graph Convolutional Networks

Graph convolutional networks (GCNs) are a powerful deep learning approach for graph-structured data. Recently, GCNs and subsequent variants have shown superior performance in various application areas on real-world datasets. Despite their success, most of the current GCN models are shallow, due to the over-smoothing problem. In this paper, we study the problem of designing and analyzing deep graph convolutional networks. We propose the GCNII, an extension of the vanilla GCN model with two simple yet effective techniques: initial residual and identity mapping. We provide theoretical and empirical evidence that the two techniques effectively relieve the problem of over-smoothing. Our experiments show that the deep GCNII model outperforms the state-of-the-art methods on various semi- and full-supervised tasks. Code is available at https://github.com/chennnM/GCNII .
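
The two techniques named in this abstract can be illustrated with a small dependency-free sketch of a single layer of the form H' = ReLU(((1 - alpha) P H + alpha H0)((1 - beta) I + beta W)), where P is the symmetrically normalized adjacency matrix with self-loops. The layer form is written from the published GCNII formulation as understood here; the helper names, the plain-list matrix representation, and the sample values are assumptions for illustration, and the authors' repository remains the reference implementation.

```python
import math

def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)]
            for row in a]

def normalized_adjacency(adj):
    """Return D^{-1/2} (A + I) D^{-1/2} for a square adjacency matrix."""
    n = len(adj)
    a = [[adj[i][j] + (1.0 if i == j else 0.0) for j in range(n)]
         for i in range(n)]
    deg = [sum(row) for row in a]
    return [[a[i][j] / math.sqrt(deg[i] * deg[j]) for j in range(n)]
            for i in range(n)]

def gcnii_layer(p, h, h0, w, alpha, beta):
    """One layer with initial residual (alpha) and identity mapping (beta)."""
    ph = matmul(p, h)
    # Initial residual: mix propagated features with the first-layer features.
    support = [[(1 - alpha) * ph[i][j] + alpha * h0[i][j]
                for j in range(len(h0[0]))] for i in range(len(h0))]
    # Identity mapping: shrink the weight matrix towards the identity.
    d = len(w)
    mix = [[(1 - beta) * (1.0 if i == j else 0.0) + beta * w[i][j]
            for j in range(d)] for i in range(d)]
    out = matmul(support, mix)
    return [[max(0.0, v) for v in row] for row in out]

if __name__ == "__main__":
    p = normalized_adjacency([[0, 1], [1, 0]])
    h0 = [[1.0, -2.0], [0.5, 3.0]]          # made-up input features
    w = [[0.2, 0.1], [0.0, 0.3]]            # made-up weights
    print(gcnii_layer(p, h0, h0, w, alpha=0.1, beta=0.5))
```

With alpha = 1 and beta = 0 the layer returns ReLU(H0) regardless of depth, which shows how the two terms keep a deep stack anchored to the input representation.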

preprint 2020 arXiv

Single DNA Electron Spin Resonance Spectroscopy in Aqueous Solutions

Magnetic resonance spectroscopy of single biomolecules under near-physiological conditions may substantially advance understanding of biological function, yet remains very challenging. Here we use nitrogen-vacancy centers in diamonds to detect electron spin resonance spectra of individual, tethered DNA duplexes labeled with a nitroxide spin label in aqueous buffer solutions at ambient temperatures. This paves the way for magnetic resonance studies on single biomolecules and their inter-molecular interactions in a native-like environment.

preprint 2019 arXiv

Molecular understanding of charge storage and charging dynamics in supercapacitors with MOF electrodes and ionic liquid electrolytes

We present a computational microscopy analysis (targeted molecular dynamics simulations) of the structure and performance of conductive metal organic framework (MOF) electrodes in supercapacitors with room temperature ionic liquids. The molecular modeling predicts the characteristic shapes of the potential dependence of electrode capacitance, relying on the structure of MOF electrodes and particularly how ions transport and reside in MOFs under polarization. A transmission line model was adopted to characterize the charging dynamics and to build a bridge from the simulation-obtained data at the nanoscale to the capacitive performance of practical supercapacitor devices at the macroscale. Such nanoscale-to-macroscale analysis demonstrates the potential of MOF supercapacitors for achieving unprecedentedly high volumetric energy and power densities. The investigation gives molecular insights into the preferred structures of MOF for achieving these results, which could provide a blueprint for future experimental characterization of these new systems.

preprint 2019 arXiv

Photonic hooks from Janus microcylinders

Recently, a type of curved light beam, the photonic hook (PH), was theoretically predicted and experimentally observed. The production of a PH is due to the breaking of structural symmetry of a plane-wave illuminated microparticle. Herein, we present and implement a new approach, utilizing symmetry breaking in the material composition of the microparticles, for the generation of PHs from Janus microcylinders. Numerical simulation based on the finite element method and theoretical analysis based on energy flow diagrams were used to investigate the field distribution characteristics and formation mechanism of the PHs. The full width at half-maximum (FWHM) of the PH (~0.29$\lambda$) is smaller than the FWHM of the photonic nanojet (~0.35$\lambda$) formed from a circular microcylinder with the same geometric radius. By changing the refractive index contrasts between upper and lower half-cylinders, or rotating the Janus microcylinder relative to the central axis, the shape profiles of the PHs can be efficiently modulated. The tunability of the PHs through simple stretching or compression operations, for a Janus microcylinder constituted by one solid inorganic half-cylinder and one flexible polymer half-cylinder, was studied and discussed as well.

preprint 2019 arXiv

Stochastic embedding DFT: theory and application to p-nitroaniline

Over this past decade, we combined the idea of stochastic resolution of identity with a variety of electronic structure methods. In our stochastic Kohn-Sham DFT method, the density is an average over multiple stochastic samples, with stochastic errors that decrease as the inverse square root of the number of sampling orbitals. Here we develop a stochastic embedding density functional theory method (se-DFT) that selectively reduces the stochastic error (specifically on the forces) for a selected sub-system(s). The motivation, similar to that of other quantum embedding methods, is that for many systems of practical interest the properties are often determined by only a small sub-system. In stochastic embedding DFT two sets of orbitals are used: a deterministic one associated with the embedded subspace, and the rest which is described by a stochastic set. The method is exact in the limit of large number of stochastic samples. We apply se-DFT to study a p-nitroaniline molecule in water, where the statistical errors in the forces on the system (the p-nitroaniline molecule) are reduced by an order of magnitude compared with non-embedding stochastic DFT.
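
The inverse-square-root behaviour of the stochastic error mentioned in this abstract can be seen in a toy stochastic resolution of identity. The sketch below averages outer products of random vectors with entries +1 or -1 and measures how far the off-diagonal elements are from zero; the function name, dimension, and sample sizes are hypothetical, and the example says nothing about the se-DFT embedding itself.

```python
import math
import random

def stochastic_identity_rms_error(dim, n_samples, seed=0):
    """RMS off-diagonal error of the estimate I ~ mean(chi chi^T).

    chi has independent entries +1 or -1, so the diagonal of the
    estimate is exactly 1 and only off-diagonal elements carry noise.
    """
    rng = random.Random(seed)
    acc = [[0.0] * dim for _ in range(dim)]
    for _ in range(n_samples):
        chi = [rng.choice((-1.0, 1.0)) for _ in range(dim)]
        for i in range(dim):
            for j in range(dim):
                acc[i][j] += chi[i] * chi[j]
    squared, count = 0.0, 0
    for i in range(dim):
        for j in range(dim):
            if i != j:
                squared += (acc[i][j] / n_samples) ** 2
                count += 1
    return math.sqrt(squared / count)

if __name__ == "__main__":
    for n in (100, 10000):
        # Expected scale of the error is about 1 / sqrt(n).
        print(n, stochastic_identity_rms_error(8, n), 1 / math.sqrt(n))
```

Increasing the number of samples by a factor of 100 should shrink the error by roughly a factor of 10, which is why selectively treating a subsystem deterministically can be cheaper than simply adding samples.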

preprint 2018 arXiv

Kinetics of Ion Transport in Ionic Liquids: Two Dynamical Diffusion States

Using classical molecular dynamics simulations, we investigate the mobility of ions in [Bmim][TFSI], a typical room temperature ionic liquid. Analyzing the trajectories of individual cations and anions, we estimate the time that ions spend in bound, clustered states, and when the ions move quasi-freely. Using this information, we evaluate the average portion of free ions that dominate conductivity. The amount of thus defined free ions comprises 15-25%, monotonically increasing with temperature in the range of 300-600 K, with the rest of the ions being temporarily bound, moving rather in local potentials. The conductivities as a function of temperature, calculated from electric current autocorrelation functions, reproduce reported experimental data well. Interestingly, for free ions the Nernst-Einstein relationship between the mobility and diffusion coefficient holds fairly well. In analogy with electronic semiconductors, one can speak about an ionic semiconductor model for ionic liquids with valence (or excitonic) and conduction band states for ions, separated by an energy gap. The obtained band gap for the ionic liquid is, however, very small, about 0.026 eV, allowing for easy interchanges between the two dynamic states.
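
To put the quoted gap of about 0.026 eV in context, it can be compared with the thermal energy k_B T over the stated temperature range. The short calculation below uses the Boltzmann constant in eV/K; the function names are hypothetical, and the Boltzmann factor is shown only as a scale comparison, not as the paper's estimate of the free-ion fraction.

```python
import math

# Boltzmann constant in eV/K (from the 2019 SI exact values of k and e).
BOLTZMANN_EV_PER_K = 8.617333262e-5

def thermal_energy_ev(temperature_k):
    """Thermal energy k_B * T in electronvolts."""
    return BOLTZMANN_EV_PER_K * temperature_k

def boltzmann_factor(gap_ev, temperature_k):
    """exp(-gap / (k_B T)), a rough measure of how easily a gap is crossed."""
    return math.exp(-gap_ev / thermal_energy_ev(temperature_k))

if __name__ == "__main__":
    gap = 0.026  # eV, value quoted in the abstract
    for t in (300.0, 600.0):
        print(t, thermal_energy_ev(t), boltzmann_factor(gap, t))
```

At 300 K the thermal energy is already close to the quoted gap, which is consistent with the abstract's remark that interchange between the two dynamic states is easy.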