Source author record

Qiang Xu

Qiang Xu appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Catalog footprint

What is connected

41works

23topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

MappingEvolve: LLM-Driven Code Evolution for Technology Mapping

Technology mapping is a critical yet challenging stage in logic synthesis. While Large Language Models (LLMs) have been applied to generate optimization scripts, their potential for core algorithm enhancement remains untapped. We introduce MappingEvolve, an open-source framework that pioneers the use of LLMs to directly evolve technology mapping code. Our method abstracts the mapping process into distinct optimization operators and employs a hierarchical agent-based architecture, comprising a Planner, Evolver, and Evaluator, to guide the evolutionary search. This structured approach enables strategic and effective code modifications. Experiments show our method significantly outperforms direct evolution and strong baselines, achieving 10.04\% area reduction versus ABC and 7.93\% versus mockturtle, with 46.6\%--96.0\% $S_{overall}$ improvement on EPFL benchmarks, while explicitly navigating the area--delay trade-off. Our code and data are available at https://github.com/Flians/MappingEvolve.

preprint2024arXiv

FITS: Modeling Time Series with $10k$ Parameters

In this paper, we introduce FITS, a lightweight yet powerful model for time series analysis. Unlike existing models that directly process raw time-domain data, FITS operates on the principle that time series can be manipulated through interpolation in the complex frequency domain. By discarding high-frequency components with negligible impact on time series data, FITS achieves performance comparable to state-of-the-art models for time series forecasting and anomaly detection tasks, while having a remarkably compact size of only approximately $10k$ parameters. Such a lightweight model can be easily trained and deployed in edge devices, creating opportunities for various applications. The code is available in: \url{https://github.com/VEWOXIC/FITS}

preprint2023arXiv

Agglomeration Drives the Reversed Fractionation of Aqueous Carbonate and Bicarbonate at the Air-water Interface

In the course of our investigations of the adsorption of ions to the air-water interface, we previously reported the surprising result that doubly-charged carbonate anions exhibit a stronger surface affinity than do singly-charged bicarbonate anions. In contrast to monovalent, weakly hydrated anions, which generally show enhanced concentrations in the interfacial region, multivalent (and strongly hydrated) anions are expected to show much weaker surface propensity. In the present work, we use resonantly enhanced deep-UV second harmonic generation spectroscopy to measure the Gibbs free energy of adsorption of both carbonate ($CO_3^{2-}$) and bicarbonate $(HCO_3^-)$ anions to the air-water interface. Contrasting the predictions of classical electrostatic theory, and in support of our previous findings from X-ray photoelectron spectroscopy, we find that carbonate anions do indeed exhibit much stronger surface affinity than do the bicarbonate anions. Molecular dynamics simulation reveals that strong ion pairing of $CO_3^{2-}$ with the $Na^+$ counter-cation in the interfacial region, resulting in formation of near-neutral agglomerates of $Na^+$ and $CO_3^{2-}$ clusters, is responsible for this counterintuitive behavior. These findings not only advance our fundamental understanding of ion adsorption chemistry, but will also impact important practical processes such as ocean acidification, sea-spray aerosol chemistry, and mammalian respiration physiology.

preprint2023arXiv

DeepSAT: An EDA-Driven Learning Framework for SAT

We present DeepSAT, a novel end-to-end learning framework for the Boolean satisfiability (SAT) problem. Unlike existing solutions trained on random SAT instances with relatively weak supervision, we propose applying the knowledge of the well-developed electronic design automation (EDA) field for SAT solving. Specifically, we first resort to logic synthesis algorithms to pre-process SAT instances into optimized and-inverter graphs (AIGs). By doing so, the distribution diversity among various SAT instances can be dramatically reduced, which facilitates improving the generalization capability of the learned model. Next, we regard the distribution of SAT solutions being a product of conditional Bernoulli distributions. Based on this observation, we approximate the SAT solving procedure with a conditional generative model, leveraging a novel directed acyclic graph neural network (DAGNN) with two polarity prototypes for conditional SAT modeling. To effectively train the generative model, with the help of logic simulation tools, we obtain the probabilities of nodes in the AIG being logic `1' as rich supervision. We conduct comprehensive experiments on various SAT problems. Our results show that, DeepSAT achieves significant accuracy improvements over state-of-the-art learning-based SAT solutions, especially when generalized to SAT instances that are relatively large or with diverse distributions.

preprint2022arXiv

An Empirical Study on the Impact of Deep Parameters on Mobile App Energy Usage

Improving software performance through configuration parameter tuning is a common activity during software maintenance. Beyond traditional performance metrics like latency, mobile app developers are interested in reducing app energy usage. Some mobile apps have centralized locations for parameter tuning, similar to databases and operating systems, but it is common for mobile apps to have hundreds of parameters scattered around the source code. The correlation between these "deep" parameters and app energy usage is unclear. Researchers have studied the energy effects of deep parameters in specific modules, but we lack a systematic understanding of the energy impact of mobile deep parameters. In this paper we empirically investigate this topic, combining a developer survey with systematic energy measurements. Our motivational survey of 25 Android developers suggests that developers do not understand, and largely ignore, the energy impact of deep parameters. To assess the potential implications of this practice, we propose a deep parameter energy profiling framework that can analyze the energy impact of deep parameters in an app. Our framework identifies deep parameters, mutates them based on our parameter value selection scheme, and performs reliable energy impact analysis. Applying the framework to 16 popular Android apps, we discovered that deep parameter-induced energy inefficiency is rare. We found only 2 out of 1644 deep parameters for which a different value would significantly improve its app's energy efficiency. A detailed analysis found that most deep parameters have either no energy impact, limited energy impact, or an energy impact only under extreme values. Our study suggests that it is generally safe for developers to ignore the energy impact when choosing deep parameter values in mobile apps.

preprint2022arXiv

Are Transformers Effective for Time Series Forecasting?

Recently, there has been a surge of Transformer-based solutions for the long-term time series forecasting (LTSF) task. Despite the growing performance over the past few years, we question the validity of this line of research in this work. Specifically, Transformers is arguably the most successful solution to extract the semantic correlations among the elements in a long sequence. However, in time series modeling, we are to extract the temporal relations in an ordered set of continuous points. While employing positional encoding and using tokens to embed sub-series in Transformers facilitate preserving some ordering information, the nature of the \emph{permutation-invariant} self-attention mechanism inevitably results in temporal information loss. To validate our claim, we introduce a set of embarrassingly simple one-layer linear models named LTSF-Linear for comparison. Experimental results on nine real-life datasets show that LTSF-Linear surprisingly outperforms existing sophisticated Transformer-based LTSF models in all cases, and often by a large margin. Moreover, we conduct comprehensive empirical studies to explore the impacts of various design elements of LTSF models on their temporal relation extraction capability. We hope this surprising finding opens up new research directions for the LTSF task. We also advocate revisiting the validity of Transformer-based solutions for other time series analysis tasks (e.g., anomaly detection) in the future. Code is available at: \url{https://github.com/cure-lab/LTSF-Linear}.

preprint2022arXiv

Be Your Own Neighborhood: Detecting Adversarial Example by the Neighborhood Relations Built on Self-Supervised Learning

Deep Neural Networks (DNNs) have achieved excellent performance in various fields. However, DNNs' vulnerability to Adversarial Examples (AE) hinders their deployments to safety-critical applications. This paper presents a novel AE detection framework, named BEYOND, for trustworthy predictions. BEYOND performs the detection by distinguishing the AE's abnormal relation with its augmented versions, i.e. neighbors, from two prospects: representation similarity and label consistency. An off-the-shelf Self-Supervised Learning (SSL) model is used to extract the representation and predict the label for its highly informative representation capacity compared to supervised learning models. For clean samples, their representations and predictions are closely consistent with their neighbors, whereas those of AEs differ greatly. Furthermore, we explain this observation and show that by leveraging this discrepancy BEYOND can effectively detect AEs. We develop a rigorous justification for the effectiveness of BEYOND. Furthermore, as a plug-and-play model, BEYOND can easily cooperate with the Adversarial Trained Classifier (ATC), achieving the state-of-the-art (SOTA) robustness accuracy. Experimental results show that BEYOND outperforms baselines by a large margin, especially under adaptive attacks. Empowered by the robust relation net built on SSL, we found that BEYOND outperforms baselines in terms of both detection ability and speed. Our code will be publicly available.

preprint2022arXiv

Bias in the representative volume element method: periodize the ensemble instead of its realizations

We study the Representative Volume Element (RVE) method, which is a method to approximately infer the effective behavior $a_{\text{hom}}$ of a stationary random medium. The latter is described by a coefficient field $a(x)$ generated from a given ensemble $\langle\cdot\rangle$ and the corresponding linear elliptic operator $-\nabla\cdot a\nabla$. In line with the theory of homogenization, the method proceeds by computing $d = 3$ correctors (d denoting the space dimension).To be numerically tractable, this computation has to be done on a finite domain: the so-called "representative" volume element, i. e. a large box with, say, periodic boundary conditions. The main message of this article is: Periodize the ensemble instead of its realizations. By this we mean that it is better to sample from a suitably periodized ensemble than to periodically extend the restriction of a realization $a(x)$ from the whole-space ensemble $\langle\cdot\rangle$. We make this point by investigating the bias (or systematic error), i. e. the difference between $a_{\text{hom}}$ and the expected value of the RVE method, in terms of its scaling w. r. t. the lateral size $L$ of the box. In case of periodizing $a(x)$, we heuristically argue that this error is generically $O(L^{-1})$. In case of a suitable periodization of $\langle\cdot\rangle$, we rigorously show that it is $O(L^{-d})$. In fact, we give a characterization of the leading-order error term for both strategies, and argue that even in the isotropic case it is generically non-degenerate. We carry out the rigorous analysis in the convenient setting of ensembles $\langle\cdot\rangle$ of Gaussian type with integrable covariance, which allow for a straightforward periodization and which make the Price theorem and the Malliavin calculus available for optimal stochastic estimates of correctors.

preprint2022arXiv

DeciWatch: A Simple Baseline for 10x Efficient 2D and 3D Pose Estimation

This paper proposes a simple baseline framework for video-based 2D/3D human pose estimation that can achieve 10 times efficiency improvement over existing works without any performance degradation, named DeciWatch. Unlike current solutions that estimate each frame in a video, DeciWatch introduces a simple yet effective sample-denoise-recover framework that only watches sparsely sampled frames, taking advantage of the continuity of human motions and the lightweight pose representation. Specifically, DeciWatch uniformly samples less than 10% video frames for detailed estimation, denoises the estimated 2D/3D poses with an efficient Transformer architecture, and then accurately recovers the rest of the frames using another Transformer-based network. Comprehensive experimental results on three video-based human pose estimation and body mesh recovery tasks with four datasets validate the efficiency and effectiveness of DeciWatch. Code is available at https://github.com/cure-lab/DeciWatch.

preprint2022arXiv

DeepGate: Learning Neural Representations of Logic Gates

Applying deep learning (DL) techniques in the electronic design automation (EDA) field has become a trending topic. Most solutions apply well-developed DL models to solve specific EDA problems. While demonstrating promising results, they require careful model tuning for every problem. The fundamental question on "How to obtain a general and effective neural representation of circuits?" has not been answered yet. In this work, we take the first step towards solving this problem. We propose DeepGate, a novel representation learning solution that effectively embeds both logic function and structural information of a circuit as vectors on each gate. Specifically, we propose transforming circuits into unified and-inverter graph format for learning and using signal probabilities as the supervision task in DeepGate. We then introduce a novel graph neural network that uses strong inductive biases in practical circuits as learning priors for signal probability prediction. Our experimental results show the efficacy and generalization capability of DeepGate.

preprint2022arXiv

DeepTPI: Test Point Insertion with Deep Reinforcement Learning

Test point insertion (TPI) is a widely used technique for testability enhancement, especially for logic built-in self-test (LBIST) due to its relatively low fault coverage. In this paper, we propose a novel TPI approach based on deep reinforcement learning (DRL), named DeepTPI. Unlike previous learning-based solutions that formulate the TPI task as a supervised-learning problem, we train a novel DRL agent, instantiated as the combination of a graph neural network (GNN) and a Deep Q-Learning network (DQN), to maximize the test coverage improvement. Specifically, we model circuits as directed graphs and design a graph-based value network to estimate the action values for inserting different test points. The policy of the DRL agent is defined as selecting the action with the maximum value. Moreover, we apply the general node embeddings from a pre-trained model to enhance node features, and propose a dedicated testability-aware attention mechanism for the value network. Experimental results on circuits with various scales show that DeepTPI significantly improves test coverage compared to the commercial DFT tool. The code of this work is available at https://github.com/cure-lab/DeepTPI.

preprint2022arXiv

Joint Learning of Deep Texture and High-Frequency Features for Computer-Generated Image Detection

Distinguishing between computer-generated (CG) and natural photographic (PG) images is of great importance to verify the authenticity and originality of digital images. However, the recent cutting-edge generation methods enable high qualities of synthesis in CG images, which makes this challenging task even trickier. To address this issue, a joint learning strategy with deep texture and high-frequency features for CG image detection is proposed. We first formulate and deeply analyze the different acquisition processes of CG and PG images. Based on the finding that multiple different modules in image acquisition will lead to different sensitivity inconsistencies to the convolutional neural network (CNN)-based rendering in images, we propose a deep texture rendering module for texture difference enhancement and discriminative texture representation. Specifically, the semantic segmentation map is generated to guide the affine transformation operation, which is used to recover the texture in different regions of the input image. Then, the combination of the original image and the high-frequency components of the original and rendered images are fed into a multi-branch neural network equipped with attention mechanisms, which refines intermediate features and facilitates trace exploration in spatial and channel dimensions respectively. Extensive experiments on two public datasets and a newly constructed dataset with more realistic and diverse images show that the proposed approach outperforms existing methods in the field by a clear margin. Besides, results also demonstrate the detection robustness and generalization ability of the proposed approach to postprocessing operations and generative adversarial network (GAN) generated images.

preprint2022arXiv

Language-specific Characteristic Assistance for Code-switching Speech Recognition

Dual-encoder structure successfully utilizes two language-specific encoders (LSEs) for code-switching speech recognition. Because LSEs are initialized by two pre-trained language-specific models (LSMs), the dual-encoder structure can exploit sufficient monolingual data and capture the individual language attributes. However, most existing methods have no language constraints on LSEs and underutilize language-specific knowledge of LSMs. In this paper, we propose a language-specific characteristic assistance (LSCA) method to mitigate the above problems. Specifically, during training, we introduce two language-specific losses as language constraints and generate corresponding language-specific targets for them. During decoding, we take the decoding abilities of LSMs into account by combining the output probabilities of two LSMs and the mixture model to obtain the final predictions. Experiments show that either the training or decoding method of LSCA can improve the model's performance. Furthermore, the best result can obtain up to 15.4% relative error reduction on the code-switching test set by combining the training and decoding methods of LSCA. Moreover, the system can process code-switching speech recognition tasks well without extra shared parameters or even retraining based on two pre-trained LSMs by using our method.

preprint2022arXiv

MixDefense: A Defense-in-Depth Framework for Adversarial Example Detection Based on Statistical and Semantic Analysis

Machine learning with deep neural networks (DNNs) has become one of the foundation techniques in many safety-critical systems, such as autonomous vehicles and medical diagnosis systems. DNN-based systems, however, are known to be vulnerable to adversarial examples (AEs) that are maliciously perturbed variants of legitimate inputs. While there has been a vast body of research to defend against AE attacks in the literature, the performances of existing defense techniques are still far from satisfactory, especially for adaptive attacks, wherein attackers are knowledgeable about the defense mechanisms and craft AEs accordingly. In this work, we propose a multilayer defense-in-depth framework for AE detection, namely MixDefense. For the first layer, we focus on those AEs with large perturbations. We propose to leverage the `noise' features extracted from the inputs to discover the statistical difference between natural images and tampered ones for AE detection. For AEs with small perturbations, the inference result of such inputs would largely deviate from their semantic information. Consequently, we propose a novel learning-based solution to model such contradictions for AE detection. Both layers are resilient to adaptive attacks because there do not exist gradient propagation paths for AE generation. Experimental results with various AE attack methods on image classification datasets show that the proposed MixDefense solution outperforms the existing AE detection techniques by a considerable margin.

preprint2022arXiv

Nonlocal Pseudopotential Energy Density Functional for Orbital-Free Density Functional Theory

Orbital-free density functional theory (OF-DFT) runs at low computational cost that scales linearly with the number of simulated atoms, making it suitable for large-scale material simulations. It is generally considered that OF-DFT strictly requires the use of local pseudopotentials, rather than orbital-dependent nonlocal pseudopotentials, for the calculation of electron-ion interaction energies, as no orbitals are available. This is unfortunate situation since the nonlocal pseudopotentials are known to give much better transferability and calculation accuracy than local ones. We report here the development of a theoretical scheme that allows the direct use of nonlocal pseudopotentials in OF-DFT. In this scheme, a nonlocal pseudopotential energy density functional is derived by the projection of nonlocal pseudopotential onto the non-interacting density matrix (instead of 'orbitals') that can be approximated explicitly as a functional of electron density. Our development defies the belief that nonlocal pseudopotentials are not applicable to OF-DFT, leading to the creation of an alternate theoretical framework of OF-DFT that works superior to the traditional one.

preprint2022arXiv

Out-of-Distribution Detection with Semantic Mismatch under Masking

This paper proposes a novel out-of-distribution (OOD) detection framework named MoodCat for image classifiers. MoodCat masks a random portion of the input image and uses a generative model to synthesize the masked image to a new image conditioned on the classification result. It then calculates the semantic difference between the original image and the synthesized one for OOD detection. Compared to existing solutions, MoodCat naturally learns the semantic information of the in-distribution data with the proposed mask and conditional synthesis strategy, which is critical to identifying OODs. Experimental results demonstrate that MoodCat outperforms state-of-the-art OOD detection solutions by a large margin.

preprint2022arXiv

Relational Graph Neural Network Design via Progressive Neural Architecture Search

We propose a novel solution to addressing a long-standing dilemma in the representation learning of graph neural networks (GNNs): how to effectively capture and represent useful information embedded in long-distance nodes to improve the performance of nodes with low homophily without leading to performance degradation in nodes with high homophily. This dilemma limits the generalization capability of existing GNNs. Intuitively, interactions with distant nodes introduce more noise for a node than those with close neighbors. However, in most existing works, messages being passed among nodes are mingled together, which is inefficient from a communication perspective. Our solution is based on a novel, simple, yet effective aggregation scheme, resulting in a ladder-style GNN architecture, namely LADDER-GNN. Specifically, we separate messages from different hops, assign different dimensions for them, and then concatenate them to obtain node representations. Such disentangled representations facilitate improving the information-to-noise ratio of messages passed from different hops. To explore an effective hop-dimension relationship, we develop a conditionally progressive neural architecture search strategy. Based on the searching results, we further propose an efficient approximate hop-dimension relation function to facilitate the rapid configuration of the proposed LADDER-GNN. We verify the proposed LADDER-GNN on seven diverse semi-supervised node classification datasets. Experimental results show that our solution achieves better performance than most existing GNNs. We further analyze our aggregation scheme with two commonly used GNN architectures, and the results corroborate that our scheme outperforms existing schemes in classifying low homophily nodes by a large margin.

preprint2022arXiv

SmoothNet: A Plug-and-Play Network for Refining Human Poses in Videos

When analyzing human motion videos, the output jitters from existing pose estimators are highly-unbalanced with varied estimation errors across frames. Most frames in a video are relatively easy to estimate and only suffer from slight jitters. In contrast, for rarely seen or occluded actions, the estimated positions of multiple joints largely deviate from the ground truth values for a consecutive sequence of frames, rendering significant jitters on them. To tackle this problem, we propose to attach a dedicated temporal-only refinement network to existing pose estimators for jitter mitigation, named SmoothNet. Unlike existing learning-based solutions that employ spatio-temporal models to co-optimize per-frame precision and temporal smoothness at all the joints, SmoothNet models the natural smoothness characteristics in body movements by learning the long-range temporal relations of every joint without considering the noisy correlations among joints. With a simple yet effective motion-aware fully-connected network, SmoothNet improves the temporal smoothness of existing pose estimators significantly and enhances the estimation accuracy of those challenging frames as a side-effect. Moreover, as a temporal-only model, a unique advantage of SmoothNet is its strong transferability across various types of estimators and datasets. Comprehensive experiments on five datasets with eleven popular backbone networks across 2D and 3D pose estimation and body recovery tasks demonstrate the efficacy of the proposed solution. Code is available at https://github.com/cure-lab/SmoothNet.

preprint2022arXiv

What You See is Not What the Network Infers: Detecting Adversarial Examples Based on Semantic Contradiction

Adversarial examples (AEs) pose severe threats to the applications of deep neural networks (DNNs) to safety-critical domains, e.g., autonomous driving. While there has been a vast body of AE defense solutions, to the best of our knowledge, they all suffer from some weaknesses, e.g., defending against only a subset of AEs or causing a relatively high accuracy loss for legitimate inputs. Moreover, most existing solutions cannot defend against adaptive attacks, wherein attackers are knowledgeable about the defense mechanisms and craft AEs accordingly. In this paper, we propose a novel AE detection framework based on the very nature of AEs, i.e., their semantic information is inconsistent with the discriminative features extracted by the target DNN model. To be specific, the proposed solution, namely ContraNet, models such contradiction by first taking both the input and the inference result to a generator to obtain a synthetic output and then comparing it against the original input. For legitimate inputs that are correctly inferred, the synthetic output tries to reconstruct the input. On the contrary, for AEs, instead of reconstructing the input, the synthetic output would be created to conform to the wrong label whenever possible. Consequently, by measuring the distance between the input and the synthetic output with metric learning, we can differentiate AEs from legitimate inputs. We perform comprehensive evaluations under various AE attack scenarios, and experimental results show that ContraNet outperforms existing solutions by a large margin, especially under adaptive attacks. Moreover, our analysis shows that successful AEs that can bypass ContraNet tend to have much-weakened adversarial semantics. We have also shown that ContraNet can be easily combined with adversarial training techniques to achieve further improved AE defense capabilities.

preprint2020arXiv

High-Voltage Honeycomb Layered Oxide Positive Electrodes for Rechargeable Sodium Batteries

Natural abundance, impressive chemical characteristics and economic feasibility have rekindled the appeal for rechargeable sodium (Na) batteries as a practical solution for the growing energy demand, environmental sustainability and energy independence. However, the scarcity of viable positive electrode materials remains a huge impediment to the actualization of this technology. In this paper, we explore honeycomb layered oxides adopting the composition Na$_2$Ni$_{2-x}$Co$_x$TeO$_6$ ($x = 0, 0.25$ and $0.50$) as feasible positive electrode (cathode) materials for rechargeable sodium batteries at both room- and elevated temperatures using ionic liquids. Through standard galvanostatic assessments and analyses we demonstrate that substitution of nickel with cobalt in Na$_2$Ni$_2$TeO$_6$ leads to an increase in the discharge voltage to nearly $4$ V (versus Na$^+$ / Na) for the Na$_2$Ni$_{2-x}$Co$_x$TeO$_6$ family of honeycomb layered oxide materials, which surpasses the attained average voltages for most layered oxide positive electrode materials that facilitate Na-ion desertion. We also verify the increased kinetics within the Na$_2$Ni$_{2-x}$Co$_x$TeO$_6$ honeycomb layered oxides during operations at elevated temperatures which lead to an increase in reversible capacity of the rechargeable Na battery. This study underpins the doping of congener transition metal atoms to the honeycomb structure of Na$_2$Ni$_2$TeO$_6$ in addition to elevated-temperature operation as a judicious route to enhance the electrochemical performance of analogous layered oxides.

preprint2020arXiv

Nonlocal Kinetic Energy Density Functionals for Isolated Systems via Local Density Approximation Kernels

Despite a large number of nonlocal kinetic energy density functionals (KEDFs) available for large-scale calculations, most of those nonlocal KEDFs designed for the extended systems cannot be directly applied to isolated systems. In this manuscript, we proposed a generalized scheme to construct nonlocal KEDFs via the local density approximation kernels and construct a family of KEDFs for simulations of isolated systems within orbital-free density functional theory. The performance of KEDFs has been demonstrated by several clusters encompassing Mg, Si and GaAs. The results show that our constructed KEDFs can achieve high numerical accuracy and stability for random clusters, therefore, making orbital-free density functional theory accessible for practical simulations of isolated systems.

preprint2020arXiv

nuScenes: A multimodal dataset for autonomous driving

Robust detection and tracking of objects is crucial for the deployment of autonomous vehicle technology. Image based benchmark datasets have driven development in computer vision tasks such as object detection, tracking and segmentation of agents in the environment. Most autonomous vehicles, however, carry a combination of cameras and range sensors such as lidar and radar. As machine learning based methods for detection and tracking become more prevalent, there is a need to train and evaluate such methods on datasets containing range sensor data along with images. In this work we present nuTonomy scenes (nuScenes), the first dataset to carry the full autonomous vehicle sensor suite: 6 cameras, 5 radars and 1 lidar, all with full 360 degree field of view. nuScenes comprises 1000 scenes, each 20s long and fully annotated with 3D bounding boxes for 23 classes and 8 attributes. It has 7x as many annotations and 100x as many images as the pioneering KITTI dataset. We define novel 3D detection and tracking metrics. We also provide careful dataset analysis as well as baselines for lidar and image based detection and tracking. Data, development kit and more information are available online.

preprint2020arXiv

Quantitative estimates for homogenization of nonlinear elliptic operators in perforated domains

This paper was devoted to study the quantitative homogenization problems for nonlinear elliptic operators in perforated domains. We obtained a sharp error estimate $O(\varepsilon)$ when the problem was anchored in the reference domain $\varepsilonω$. If concerning a bounded perforated domain, one will see a bad influence from the boundary layers, which leads to the loss of the convergence rate by $O(\varepsilon^{1/2})$. Equipped with the error estimates, we developed both interior and boundary Lipschitz estimates at large-scales. As an application, we received the so-called quenched Calderón-Zygumund estimates by Shen's real arguments. To overcome some difficulties, we improved the extension theory from (\cite[Theorem 4.3]{OSY}) to $L^p$-versions with $\frac{2d}{d+1}-ε<p<\frac{2d}{d-1}+ε$ and $0<ε\ll1$. Appealing to this, we established Poincaré-Sobolev inequalities of local type on perforated domains. Some of results in the present literature are new even for related linear elliptic models.

preprint2020arXiv

Region-Wise Attack: On Efficient Generation of Robust Physical Adversarial Examples

Deep neural networks (DNNs) are shown to be susceptible to adversarial example attacks. Most existing works achieve this malicious objective by crafting subtle pixel-wise perturbations, and they are difficult to launch in the physical world due to inevitable transformations (e.g., different photographic distances and angles). Recently, there are a few research works on generating physical adversarial examples, but they generally require the details of the model a priori, which is often impractical. In this work, we propose a novel physical adversarial attack for arbitrary black-box DNN models, namely Region-Wise Attack. To be specific, we present how to efficiently search for regionwise perturbations to the inputs and determine their shapes, locations and colors via both top-down and bottom-up techniques. In addition, we introduce two fine-tuning techniques to further improve the robustness of our attack. Experimental results demonstrate the efficacy and robustness of the proposed Region-Wise Attack in real world.

preprint2020arXiv

SRNet: Improving Generalization in 3D Human Pose Estimation with a Split-and-Recombine Approach

Human poses that are rare or unseen in a training set are challenging for a network to predict. Similar to the long-tailed distribution problem in visual recognition, the small number of examples for such poses limits the ability of networks to model them. Interestingly, local pose distributions suffer less from the long-tail problem, i.e., local joint configurations within a rare pose may appear within other poses in the training set, making them less rare. We propose to take advantage of this fact for better generalization to rare and unseen poses. To be specific, our method splits the body into local regions and processes them in separate network branches, utilizing the property that a joint position depends mainly on the joints within its local body region. Global coherence is maintained by recombining the global context from the rest of the body into each branch as a low-dimensional vector. With the reduced dimensionality of less relevant body areas, the training set distribution within network branches more closely reflects the statistics of local poses instead of global body poses, without sacrificing information important for joint inference. The proposed split-and-recombine approach, called SRNet, can be easily adapted to both single-image and temporal models, and it leads to appreciable improvements in the prediction of rare and unseen poses.

preprint2020arXiv

suboptimal error estimates for homogenization of linear elasticity systems on perforated domains

In the present work, we established almost-sharp error estimates for linear elasticity systems in periodically perforated domains. The first result was $L^{\frac{2d}{d-1-τ}}$-error estimates $O\big(\varepsilon^{1-\fracτ{2}}\big)$ with $0<τ<1$ for a bounded smooth domain. It followed from weighted Hardy-Sobolev's inequalities and a suboptimal error estimate for the square function of the first-order approximating corrector (which was earliest investigated by C. Kenig, F. Lin, Z. Shen \cite{KLS} under additional regularity assumption on coefficients). The new approach relied on the weighted quenched Calderón-Zygmund estimate (initially appeared in A. Gloria, S. Neukamm, F. Otto's work \cite{Gloria_Neukamm_Otto_2015} for a quantitative stochastic homogenization theory). The second effort was $L^2$-error estimates $O\big(\varepsilon^{\frac{5}{6}}\ln^{\frac{2}{3}}(1/\varepsilon)\big)$ for a Lipschitz domain, followed from a new duality scheme coupled with interpolation inequalities. Also, we developed a new weighted extension theorem for perforated domains, and a real method imposed by Z. Shen \cite{S3} played a fundamental role in the whole project.

preprint2020arXiv

The Effect of the Multi-Layer Text Summarization Model on the Efficiency and Relevancy of the Vector Space-based Information Retrieval

The massive upload of text on the internet creates a huge inverted index in information retrieval systems, which hurts their efficiency. The purpose of this research is to measure the effect of the Multi-Layer Similarity model of the automatic text summarization on building an informative and condensed invert index in the IR systems. To achieve this purpose, we summarized a considerable number of documents using the Multi-Layer Similarity model, and we built the inverted index from the automatic summaries that were generated from this model. A series of experiments were held to test the performance in terms of efficiency and relevancy. The experiments include comparisons with three existing text summarization models; the Jaccard Coefficient Model, the Vector Space Model, and the Latent Semantic Analysis model. The experiments examined three groups of queries with manual and automatic relevancy assessment. The positive effect of the Multi-Layer Similarity in the efficiency of the IR system was clear without noticeable loss in the relevancy results. However, the evaluation showed that the traditional statistical models without semantic investigation failed to improve the information retrieval efficiency. Comparing with the previous publications that addressed the use of summaries as a source of the index, the relevancy assessment of our work was higher, and the Multi-Layer Similarity retrieval constructed an inverted index that was 58% smaller than the main corpus inverted index.

preprint2019arXiv

ICSTrace: A Malicious IP Traceback Model for Attacking Data of Industrial Control System

Considering the attacks against industrial control system are mostly organized and premeditated actions, IP traceback is significant for the security of industrial control system. Based on the infrastructure of the Internet, we have developed a novel malicious IP traceback model-ICSTrace, without deploying any new services. The model extracts the function codes and their parameters from the attack data according to the format of industrial control protocol, and employs a short sequence probability method to transform the function codes and their parameter into a vector, which characterizes the attack pattern of malicious IP addresses. Furthermore, a Partial Seeded K-Means algorithm is proposed for the pattern's clustering, which helps in tracing the attacks back to an organization. ICSTrace is evaluated basing on the attack data captured by the large-scale deployed honeypots for industrial control system, and the results demonstrate that ICSTrace is effective on malicious IP traceback in industrial control system.

preprint2016arXiv

A linear scaling method to evaluate the ion-electron potential of crystalline solids

We propose a simple linear scaling expression in reciprocal space for evaluating the ion--electron potential of crystalline solids. The expression replaces the long-range ion--electron potential with an equivalent localized charge distribution and corresponding boundary conditions on the unit cell. Given that no quadratic scaling structure factor is required---as used in traditional methods---the expression shows inherent linear behavior, and is well suited to simulating large-scale systems within orbital-free density functional theory. The scheme is implemented in the ATLAS software package and benchmarked by using a solid Mg bcc lattice containing tens of thousands of atoms in the unit cell. The test results show that the method can efficiently model large crystals with high computational accuracy.

preprint2016arXiv

Convergence rates and $W^{1,p}$ estimates in homogenization theory of Stokes systems in Lipschitz domains

Concerned with the Stokes systems with rapidly oscillating periodic coefficients, we mainly extend the recent works in \cite{SGZWS,G} to those in term of Lipschitz domains. The arguments employed here are quite different from theirs, and the basic idea comes from \cite{QX2}, originally motivated by \cite{SZW2,SZW12,TS}. We obtain an almost-sharp $O(\varepsilon\ln(r_0/\varepsilon))$ convergence rate in $L^2$ space, and a sharp $O(\varepsilon)$ error estimate in $L^{\frac{2d}{d-1}}$ space by a little stronger assumption. Under the dimensional condition $d=2$, we also establish the optimal $O(\varepsilon)$ convergence rate on pressure terms in $L^2$ space. Then utilizing the convergence rates we can derive the $W^{1,p}$ estimates uniformly down to microscopic scale $\varepsilon$ without any smoothness assumption on the coefficients, where $|\frac{1}{p}-\frac{1}{2}|<\frac{1}{2d}+ε$ and $ε$ is a positive constant independent of $\varepsilon$. Combining the local estimates, based upon $\text{VMO}$ coefficients, consequently leads to the uniform $W^{1,p}$ estimates. Here the proofs do not rely on the well known compactness methods.

preprint2016arXiv

Convergence rates for general elliptic homogenization problems in a bounded Lipschitz domain

The paper extends the results obtained by C. Kenig, F. Lin and Z. Shen in \cite{SZW2} to more general elliptic homogenization problems in two perspectives: lower order terms in the operator and no smoothness on the coefficients. We do not repeat their arguments. Instead we find the new weighted-type estimates for the smoothing operator at scale $\varepsilon$, and combining some techniques developed by Z. Shen in \cite{SZW12} leads to our main results. In addition, we also obtain sharp $O(\varepsilon)$ convergence rates in $L^{p}$ with $p=2d/(d-1)$, which were originally established by Z. Shen for elasticity systems in \cite{SZW12}. Also, this work may be regarded as the extension of \cite{TS,TS2} developed by T. Suslina concerned with the bounded Lipschitz domain.

preprint2016arXiv

Optimal Boundary Estimates for Stokes Systems in Homogenization Theory

The paper concerns the sharp boundary regularity estimates in homogenization of Dirichlet problem for Stokes systems. We obtain the Lipschitz estimates for velocity term and $L^\infty$ estimate for pressure term, under some reasonable smoothness assumption on rapidly oscillating periodic coefficients. The approach is based on convergence rates, originally investigated by S. Armstrong and Z. Shen in \cite{SZ,SZW12}, however the argument developed here does not rely on the Rellich estimates. In this sense, we find a new way to obtain the sharp uniform boundary estimates without imposing the symmetry assumption on coefficients. Additionally, we emphasize that $L^\infty$ estimate for the pressure term does require the $O(\varepsilon^{1/2})$ convergence rate, locally at least, compared to $O(\varepsilon^λ)$ for the velocity term, where $λ\in(0,1/2)$.

preprint2015arXiv

Device Fingerprinting in Wireless Networks: Challenges and Opportunities

Node forgery or impersonation, in which legitimate cryptographic credentials are captured by an adversary, constitutes one major security threat facing wireless networks. The fact that mobile devices are prone to be compromised and reverse engineered significantly increases the risk of such attacks in which adversaries can obtain secret keys on trusted nodes and impersonate the legitimate node. One promising approach toward thwarting these attacks is through the extraction of unique fingerprints that can provide a reliable and robust means for device identification. These fingerprints can be extracted from transmitted signal by analyzing information across the protocol stack. In this paper, the first unified and comprehensive tutorial in the area of wireless device fingerprinting for security applications is presented. In particular, we aim to provide a detailed treatment on developing novel wireless security solutions using device fingerprinting techniques. The objectives are three-fold: (i) to introduce a comprehensive taxonomy of wireless features that can be used in fingerprinting, (ii) to provide a systematic review on fingerprint algorithms including both white-list based and unsupervised learning approaches, and (iii) to identify key open research problems in the area of device fingerprinting and feature extraction, as applied to wireless security.

preprint2015arXiv

Uniform Regularity Estimates in Homogenization Theory of Elliptic System with Lower Order terms

In this paper, we extend the uniform regularity estimates obtained by M. Avellanda and F. Lin in the paper of Compactness methods in the theory of homogenization (Comm. Pure Appl. Math. 40(1987), no.6, 803-847) to the more general second order elliptic systems in divergences form, with rapidly oscillating periodic coefficients. We establish not only sharp $W^{1,p}$ estimates, Holder estimates, Lipschitz estimates and non-tangential maximal function estimates for the Dirichelt problem on a $C^{1,η}$ domain, but also a sharp $O(\varepsilon)$ convergence rate in $H_0^1(Ω)$ by virtue of the Dirichlet correctors. The well known compactness methods are not employed here, instead we construct the transformations to make full use of the corresponding results developed by M. Avellanda and F. Lin.

preprint2015arXiv

Uniform Regularity Estimates in Homogenization Theory of Elliptic Systems with Lower Order Terms on the Neumann Boundary Problem

In this paper, we mainly employed the idea of the previous paper to study the sharp uniform $W^{1,p}$ estimates with $1<p\leq \infty$ for more general elliptic systems with the Neumann boundary condition on a bounded $C^{1,η}$ domain, arising in homogenization theory. Based on the skills developed by Z. Shen and by T. Suslina for different purposes, we also established the $L^2$ convergence rates on a bounded $C^{1,1}$ domain and a Lipschitz domain, respectively. Here we found a "rough" version of the first order correctors (see Theorem 1.3), It allows us to skip the corresponding convergence results on $\mathbb{R}^d$ that are the preconditions in T. Suslina's papers.

preprint2014arXiv

Conditions for supersonic bent Marshak waves

Supersonic radiation diffusion approximation is a useful way to study the radiation transportation. Considering the bent Marshak wave theory in 2-dimensions, and an invariable source temperature, we get the supersonic radiation diffusion conditions which are about the Mach number $M>8(1+\sqrt{\ep})/3$, and the optical depth $τ>1$. A large Mach number requires a high temperature, while a large optical depth requires a low temperature. Only when the source temperature is in a proper region these conditions can be satisfied. Assuming the material opacity and the specific internal energy depend on the temperature and the density as a form of power law, for a given density, these conditions correspond to a region about source temperature and the length of the sample. This supersonic diffusion region involves both lower and upper limit of source temperature, while that in 1-dimension only gives a lower limit. Taking $\rm SiO_2$ and the Au for example, we show the supersonic region numerically.

preprint2012arXiv

A New Exponential Gravity

We propose a new exponential f(R) gravity model with f(R)=(R-λc)e^{λ(c/R)^n} and n>3, λ\geq 1, c>0 to explain late-time acceleration of the universe. At the high curvature region, the model behaves like the \LambdaCDM model. In the asymptotic future, it reaches a stable de-Sitter spacetime. It is a cosmologically viable model and can evade the local gravity constraints easily. This model share many features with other f(R) dark energy models like Hu-Sawicki model and Exponential gravity model. In it the dark energy equation of state is of an oscillating form and can cross phantom divide line ω_{de}=-1. In particular, in the parameter range 3< n\leq 4, λ\sim 1, the model is most distinguishable from other models. For instance, when n=4, λ=1, the dark energy equation of state will cross -1 in the earlier future and has a stronger oscillating form than the other models, the dark energy density in asymptotical future is smaller than the one in the high curvature region. This new model can evade the local gravity tests easily when n>3 and λ>1.

preprint2012arXiv

Singularity problem in f(R) model with non-minimal coupling

We consider the non-minimal coupling between matter and the geometry in the f(R) theory. In the new theory which we established, a new scalar $ψ$ has been defined and we give it a certain stability condition. We intend to take a closer look at the dark energy oscillating behavior in the de-Sitter universe and the matter era, from which we derive the oscillating frequency, and the oscillating condition. More importantly, we present the condition of coupling form that the singularity can be solved. We discuss several specific coupling forms, and find logarithmic coupling with an oscillating period $ΔT\simΔz$ in the matter era $z>4$, can improve singularity in the early universe. The result of numerical calculation verifies our theoretic calculation about the oscillating frequency. Considering two toy models, we find the cosmic evolution in the coupling model is nearly the same as that in the normal f(R) theory when $lna>4$. We also discuss the local tests of the non-minimal coupling f(R) model, and show the constraint on the coupling form.

preprint2011arXiv

Atomic-scale electron-beam sculpting of defect-free graphene nanostructures

In order to harvest the many promising properties of graphene in (electronic) applications, a technique is required to cut, shape or sculpt the material on a nanoscale without damage to its atomic structure, as this drastically influences the electronic properties of the nanostructure. Here, we reveal a temperature-dependent self-repair mechanism allowing damage-free atomic-scale sculpting of graphene using a focused electron beam. We demonstrate that by sculpting at temperatures above 600 °C, an intrinsic self-repair mechanism keeps the graphene single-crystalline during cutting, even thought the electron beam induces considerable damage. Self-repair is mediated by mobile carbon ad-atoms constantly repairing the defects caused by the electron beam. Our technique allows reproducible fabrication and simultaneous imaging of single-crystalline free-standing nanoribbons, nanotubes, nanopores, and single carbon chains.

preprint2011arXiv

Theoretical Study of Corundum as an Ideal Gate Dielectric Material for Graphene Transistors

Using physical insights and advanced first-principles calculations, we suggest that corundum is an ideal gate dielectric material for graphene transistors. Clean interface exists between graphene and Al-terminated (or hydroxylated) Al2O3 and the valence band offsets for these systems are large enough to create injection barrier. Remarkably, a band gap of » 180 meV can be induced in graphene layer adsorbed on Al-terminated surface, which could realize large ON/OFF ratio and high carrier mobility in graphene transistors without additional band gap engineering and significant reduction of transport properties. Moreover, the band gaps of graphene/Al2O3 system could be tuned by an external electric field for practical applications.

preprint2010arXiv

The Cosmological Constant as a Function of Extrinsic Curvature and Spatial Curvature

In this paper we suppose that the cosmological constant will change when the universe expends. For a general consideration, the cosmological constant is assumed to be a function of scale factor and Hubble constant. According to the ADM formulation, to the FRW metric, the extrinsic curvature $I$ equals $-6H^{2}$ and spatial curvature $R$ equals $6k/a^{2}$. Therefore we suppose cosmological constant is a function of extrinsic curvature and spatial curvature. We investigate the cosmological evolution of FRW universe in these models. At last we investigate two particular models which could fit the observation data about dark energy well. Actually a changeless cosmological constant is not essential. If when the universe expands, the cosmological constant changes slowly and gradually flows to a constant, the observation data about dark energy could also be fitted well by this kind of theory.

Qiang Xu

What is connected

Connect this record

See the researcher in context

Building this map preview

41 published item(s)

MappingEvolve: LLM-Driven Code Evolution for Technology Mapping

FITS: Modeling Time Series with $10k$ Parameters

Agglomeration Drives the Reversed Fractionation of Aqueous Carbonate and Bicarbonate at the Air-water Interface

DeepSAT: An EDA-Driven Learning Framework for SAT

An Empirical Study on the Impact of Deep Parameters on Mobile App Energy Usage

Are Transformers Effective for Time Series Forecasting?

Be Your Own Neighborhood: Detecting Adversarial Example by the Neighborhood Relations Built on Self-Supervised Learning

Bias in the representative volume element method: periodize the ensemble instead of its realizations

DeciWatch: A Simple Baseline for 10x Efficient 2D and 3D Pose Estimation

DeepGate: Learning Neural Representations of Logic Gates

DeepTPI: Test Point Insertion with Deep Reinforcement Learning

Joint Learning of Deep Texture and High-Frequency Features for Computer-Generated Image Detection

Language-specific Characteristic Assistance for Code-switching Speech Recognition

MixDefense: A Defense-in-Depth Framework for Adversarial Example Detection Based on Statistical and Semantic Analysis

Nonlocal Pseudopotential Energy Density Functional for Orbital-Free Density Functional Theory

Out-of-Distribution Detection with Semantic Mismatch under Masking

Relational Graph Neural Network Design via Progressive Neural Architecture Search

SmoothNet: A Plug-and-Play Network for Refining Human Poses in Videos

What You See is Not What the Network Infers: Detecting Adversarial Examples Based on Semantic Contradiction

High-Voltage Honeycomb Layered Oxide Positive Electrodes for Rechargeable Sodium Batteries

Nonlocal Kinetic Energy Density Functionals for Isolated Systems via Local Density Approximation Kernels

nuScenes: A multimodal dataset for autonomous driving

Quantitative estimates for homogenization of nonlinear elliptic operators in perforated domains

Region-Wise Attack: On Efficient Generation of Robust Physical Adversarial Examples

SRNet: Improving Generalization in 3D Human Pose Estimation with a Split-and-Recombine Approach

suboptimal error estimates for homogenization of linear elasticity systems on perforated domains

The Effect of the Multi-Layer Text Summarization Model on the Efficiency and Relevancy of the Vector Space-based Information Retrieval

ICSTrace: A Malicious IP Traceback Model for Attacking Data of Industrial Control System

A linear scaling method to evaluate the ion-electron potential of crystalline solids

Convergence rates and $W^{1,p}$ estimates in homogenization theory of Stokes systems in Lipschitz domains

Convergence rates for general elliptic homogenization problems in a bounded Lipschitz domain

Optimal Boundary Estimates for Stokes Systems in Homogenization Theory

Device Fingerprinting in Wireless Networks: Challenges and Opportunities

Uniform Regularity Estimates in Homogenization Theory of Elliptic System with Lower Order terms

Uniform Regularity Estimates in Homogenization Theory of Elliptic Systems with Lower Order Terms on the Neumann Boundary Problem

Conditions for supersonic bent Marshak waves

A New Exponential Gravity

Singularity problem in f(R) model with non-minimal coupling

Atomic-scale electron-beam sculpting of defect-free graphene nanostructures

Theoretical Study of Corundum as an Ideal Gate Dielectric Material for Graphene Transistors

The Cosmological Constant as a Function of Extrinsic Curvature and Spatial Curvature