Source author record

Da Li

Da Li appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Computer Vision cond-mat.supr-con Machine Learning cond-mat.mtrl-sci Artificial Intelligence Computation and Language cond-mat.mes-hall Cryptography and Security Distributed, Parallel, and Cluster Computing eess.SP math.NA Numerical Analysis physics.plasm-ph

Catalog footprint

What is connected

21works

13topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2023arXiv

An Electromagnetic-Information-Theory Based Model for Efficient Characterization of MIMO Systems in Complex Space

It is the pursuit of a multiple-input-multiple-output (MIMO) system to approach and even break the limit of channel capacity. However, it is always a big challenge to efficiently characterize the MIMO systems in complex space and get better propagation performance than the conventional MIMO systems considering only free space, which is important for guiding the power and phase allocation of antenna units. In this manuscript, an Electromagnetic-Information-Theory (EMIT) based model is developed for efficient characterization of MIMO systems in complex space. The group-T-matrix-based multiple scattering fast algorithm, the mode-decomposition-based characterization method, and their joint theoretical framework in complex space are discussed. Firstly, key informatics parameters in free electromagnetic space based on a dyadic Green's function are derived. Next, a novel group-T-matrix-based multiple scattering fast algorithm is developed to describe a representative inhomogeneous electromagnetic space. All the analytical results are validated by simulations. In addition, the complete form of the EMIT-based model is proposed to derive the informatics parameters frequently used in electromagnetic propagation, through integrating the mode analysis method with the dyadic Green's function matrix. Finally, as a proof-or-concept, microwave anechoic chamber measurements of a cylindrical array is performed, demonstrating the effectiveness of the EMIT-based model. Meanwhile, a case of image transmission with limited power is presented to illustrate how to use this EMIT-based model to guide the power and phase allocation of antenna units for real MIMO applications.

preprint2022arXiv

A Simple Test-Time Method for Out-of-Distribution Detection

Neural networks are known to produce over-confident predictions on input images, even when these images are out-of-distribution (OOD) samples. This limits the applications of neural network models in real-world scenarios, where OOD samples exist. Many existing approaches identify the OOD instances via exploiting various cues, such as finding irregular patterns in the feature space, logits space, gradient space or the raw space of images. In contrast, this paper proposes a simple Test-time Linear Training (ETLT) method for OOD detection. Empirically, we find that the probabilities of input images being out-of-distribution are surprisingly linearly correlated to the features extracted by neural networks. To be specific, many state-of-the-art OOD algorithms, although designed to measure reliability in different ways, actually lead to OOD scores mostly linearly related to their image features. Thus, by simply learning a linear regression model trained from the paired image features and inferred OOD scores at test-time, we can make a more precise OOD prediction for the test instances. We further propose an online variant of the proposed method, which achieves promising performance and is more practical in real-world applications. Remarkably, we improve FPR95 from $51.37\%$ to $12.30\%$ on CIFAR-10 datasets with maximum softmax probability as the base OOD detector. Extensive experiments on several benchmark datasets show the efficacy of ETLT for OOD detection task.

preprint2022arXiv

AMS_ADRN at SemEval-2022 Task 5: A Suitable Image-text Multimodal Joint Modeling Method for Multi-task Misogyny Identification

Women are influential online, especially in image-based social media such as Twitter and Instagram. However, many in the network environment contain gender discrimination and aggressive information, which magnify gender stereotypes and gender inequality. Therefore, the filtering of illegal content such as gender discrimination is essential to maintain a healthy social network environment. In this paper, we describe the system developed by our team for SemEval-2022 Task 5: Multimedia Automatic Misogyny Identification. More specifically, we introduce two novel system to analyze these posts: a multimodal multi-task learning architecture that combines Bertweet for text encoding with ResNet-18 for image representation, and a single-flow transformer structure which combines text embeddings from BERT-Embeddings and image embeddings from several different modules such as EfficientNet and ResNet. In this manner, we show that the information behind them can be properly revealed. Our approach achieves good performance on each of the two subtasks of the current competition, ranking 15th for Subtask A (0.746 macro F1-score), 11th for Subtask B (0.706 macro F1-score) while exceeding the official baseline results by high margins.

preprint2022arXiv

Attacking Adversarial Defences by Smoothing the Loss Landscape

This paper investigates a family of methods for defending against adversarial attacks that owe part of their success to creating a noisy, discontinuous, or otherwise rugged loss landscape that adversaries find difficult to navigate. A common, but not universal, way to achieve this effect is via the use of stochastic neural networks. We show that this is a form of gradient obfuscation, and propose a general extension to gradient-based adversaries based on the Weierstrass transform, which smooths the surface of the loss function and provides more reliable gradient estimates. We further show that the same principle can strengthen gradient-free adversaries. We demonstrate the efficacy of our loss-smoothing method against both stochastic and non-stochastic adversarial defences that exhibit robustness due to this type of obfuscation. Furthermore, we provide analysis of how it interacts with Expectation over Transformation; a popular gradient-sampling method currently used to attack stochastic defences.

preprint2022arXiv

Dynamic Instance Domain Adaptation

Most existing studies on unsupervised domain adaptation (UDA) assume that each domain's training samples come with domain labels (e.g., painting, photo). Samples from each domain are assumed to follow the same distribution and the domain labels are exploited to learn domain-invariant features via feature alignment. However, such an assumption often does not hold true -- there often exist numerous finer-grained domains (e.g., dozens of modern painting styles have been developed, each differing dramatically from those of the classic styles). Therefore, forcing feature distribution alignment across each artificially-defined and coarse-grained domain can be ineffective. In this paper, we address both single-source and multi-source UDA from a completely different perspective, which is to view each instance as a fine domain. Feature alignment across domains is thus redundant. Instead, we propose to perform dynamic instance domain adaptation (DIDA). Concretely, a dynamic neural network with adaptive convolutional kernels is developed to generate instance-adaptive residuals to adapt domain-agnostic deep features to each individual instance. This enables a shared classifier to be applied to both source and target domain data without relying on any domain annotation. Further, instead of imposing intricate feature alignment losses, we adopt a simple semi-supervised learning paradigm using only a cross-entropy loss for both labeled source and pseudo labeled target data. Our model, dubbed DIDA-Net, achieves state-of-the-art performance on several commonly used single-source and multi-source UDA datasets including Digits, Office-Home, DomainNet, Digit-Five, and PACS.

preprint2022arXiv

Fisher SAM: Information Geometry and Sharpness Aware Minimisation

Recent sharpness-aware minimisation (SAM) is known to find flat minima which is beneficial for better generalisation with improved robustness. SAM essentially modifies the loss function by reporting the maximum loss value within the small neighborhood around the current iterate. However, it uses the Euclidean ball to define the neighborhood, which can be inaccurate since loss functions for neural networks are typically defined over probability distributions (e.g., class predictive probabilities), rendering the parameter space non Euclidean. In this paper we consider the information geometry of the model parameter space when defining the neighborhood, namely replacing SAM's Euclidean balls with ellipsoids induced by the Fisher information. Our approach, dubbed Fisher SAM, defines more accurate neighborhood structures that conform to the intrinsic metric of the underlying statistical manifold. For instance, SAM may probe the worst-case loss value at either a too nearby or inappropriately distant point due to the ignorance of the parameter space geometry, which is avoided by our Fisher SAM. Another recent Adaptive SAM approach stretches/shrinks the Euclidean ball in accordance with the scale of the parameter magnitudes. This might be dangerous, potentially destroying the neighborhood structure. We demonstrate improved performance of the proposed Fisher SAM on several benchmark datasets/tasks.

preprint2022arXiv

Multi-task Pre-training Language Model for Semantic Network Completion

Semantic networks, such as the knowledge graph, can represent the knowledge leveraging the graph structure. Although the knowledge graph shows promising values in natural language processing, it suffers from incompleteness. This paper focuses on knowledge graph completion by predicting linkage between entities, which is a fundamental yet critical task. Semantic matching is a potential solution as it can deal with unseen entities, which the translational distance based methods struggle with. However, to achieve competitive performance as translational distance based methods, semantic matching based methods require large-scale datasets for the training purpose, which are typically unavailable in practical settings. Therefore, we employ the language model and introduce a novel knowledge graph architecture named LP-BERT, which contains two main stages: multi-task pre-training and knowledge graph fine-tuning. In the pre-training phase, three tasks are taken to drive the model to learn the relationship from triples by predicting either entities or relations. While in the fine-tuning phase, inspired by contrastive learning, we design a triple-style negative sampling in a batch, which greatly increases the proportion of negative sampling while keeping the training time almost unchanged. Furthermore, we propose a new data augmentation method utilizing the inverse relationship of triples to improve the performance and robustness of the model. To demonstrate the effectiveness of our method, we conduct extensive experiments on three widely-used datasets, WN18RR, FB15k-237, and UMLS. The experimental results demonstrate the superiority of our methods, and our approach achieves state-of-the-art results on WN18RR and FB15k-237 datasets. Significantly, Hits@10 indicator is improved by 5% from previous state-of-the-art result on the WN18RR dataset while reaching 100% on the UMLS dataset.

preprint2022arXiv

Pushing the Limits of Simple Pipelines for Few-Shot Learning: External Data and Fine-Tuning Make a Difference

Few-shot learning (FSL) is an important and topical problem in computer vision that has motivated extensive research into numerous methods spanning from sophisticated meta-learning methods to simple transfer learning baselines. We seek to push the limits of a simple-but-effective pipeline for more realistic and practical settings of few-shot image classification. To this end, we explore few-shot learning from the perspective of neural network architecture, as well as a three stage pipeline of network updates under different data supplies, where unsupervised external data is considered for pre-training, base categories are used to simulate few-shot tasks for meta-training, and the scarcely labelled data of an novel task is taken for fine-tuning. We investigate questions such as: (1) How pre-training on external data benefits FSL? (2) How state-of-the-art transformer architectures can be exploited? and (3) How fine-tuning mitigates domain shift? Ultimately, we show that a simple transformer-based pipeline yields surprisingly good performance on standard benchmarks such as Mini-ImageNet, CIFAR-FS, CDFSL and Meta-Dataset. Our code and demo are available at https://hushell.github.io/pmf.

preprint2022arXiv

Quasi-static magnetic compression of field-reversed configuration plasma: Amended scalings and limits from two-dimensional MHD equilibrium

In this work, several key scaling laws of the quasi-static magnetic compression of field reversed configuration (FRC) plasma [Spencer, Tuszewski, and Linford, 1983] are amended from a series of 2D FRC MHD equilibriums numerically obtained using the Grad-Shafranov equation solver NIMEQ. Based on the new scaling for the elongation and the magnetic fields at the separatrix and the wall, the empirically stable limits for the compression ratio, the fusion gain, and the neutron yield are evaluated, which may serve as a more accurate estimate for the upper ceiling of performance from the magnetic compression of FRC plasma as a potential fusion energy as well as neutron source devices.

preprint2020arXiv

Efficient and Stable Finite Difference Modelling of Acoustic Wave Propagation in Variable-density Media

In this paper, we consider the development and analysis of a new explicit compact high-order finite difference scheme for acoustic wave equation formulated in divergence form, which is widely used to describe seismic wave propagation through a heterogeneous media with variable media density and acoustic velocity. The new scheme is compact and of fourth-order accuracy in space and second-order accuracy in time. The compactness of the scheme is obtained by the so-called combined finite difference method, which utilizes the boundary values of the spatial derivatives and those boundary values are obtained by one-sided finite difference approximation. An empirical stability analysis has been conducted to obtain the Currant-Friedrichs-Lewy (CFL) condition, which confirmed the conditional stability of the new scheme. Four numerical examples have been conducted to validate the convergence and effectiveness of the new scheme. The application of the new scheme to a realistic wave propagation problem with Perfect Matched Layer boundary condition is also validated in this paper as well.

preprint2020arXiv

Online Meta-Learning for Multi-Source and Semi-Supervised Domain Adaptation

Domain adaptation (DA) is the topical problem of adapting models from labelled source datasets so that they perform well on target datasets where only unlabelled or partially labelled data is available. Many methods have been proposed to address this problem through different ways to minimise the domain shift between source and target datasets. In this paper we take an orthogonal perspective and propose a framework to further enhance performance by meta-learning the initial conditions of existing DA algorithms. This is challenging compared to the more widely considered setting of few-shot meta-learning, due to the length of the computation graph involved. Therefore we propose an online shortest-path meta-learning framework that is both computationally tractable and practically effective for improving DA performance. We present variants for both multi-source unsupervised domain adaptation (MSDA), and semi-supervised domain adaptation (SSDA). Importantly, our approach is agnostic to the base adaptation algorithm, and can be applied to improve many techniques. Experimentally, we demonstrate improvements on classic (DANN) and recent (MCD and MME) techniques for MSDA and SSDA, and ultimately achieve state of the art results on several DA benchmarks including the largest scale DomainNet.

preprint2020arXiv

Sequential Learning for Domain Generalization

In this paper we propose a sequential learning framework for Domain Generalization (DG), the problem of training a model that is robust to domain shift by design. Various DG approaches have been proposed with different motivating intuitions, but they typically optimize for a single step of domain generalization -- training on one set of domains and generalizing to one other. Our sequential learning is inspired by the idea lifelong learning, where accumulated experience means that learning the $n^{th}$ thing becomes easier than the $1^{st}$ thing. In DG this means encountering a sequence of domains and at each step training to maximise performance on the next domain. The performance at domain $n$ then depends on the previous $n-1$ learning problems. Thus backpropagating through the sequence means optimizing performance not just for the next domain, but all following domains. Training on all such sequences of domains provides dramatically more `practice' for a base DG learner compared to existing approaches, thus improving performance on a true testing domain. This strategy can be instantiated for different base DG algorithms, but we focus on its application to the recently proposed Meta-Learning Domain generalization (MLDG). We show that for MLDG it leads to a simple to implement and fast algorithm that provides consistent performance improvement on a variety of DG benchmarks.

preprint2019arXiv

Room temperature 2D ferromagnetism in few-layered 1$T$-CrTe$_{2}$

Spin-related electronics using two dimensional (2D) van der Waals (vdW) materials as a platform are believed to hold great promise for revolutionizing the next generation spintronics. Although many emerging new phenomena have been unravelled in 2D electronic systems with spin long-range orderings, the scarcely reported room temperature magnetic vdW material has thus far hindered the related applications. Here, we show that intrinsic ferromagnetically aligned spin polarization can hold up to 316 K in a metallic phase of 1$T$-CrTe$_{2}$ in the few-layer limit. This room temperature 2D long range spin interaction may be beneficial from an itinerant enhancement. Spin transport measurements indicate an in-plane room temperature negative anisotropic magnetoresistance (AMR) in few-layered CrTe$_{2}$, but a sign change in the AMR at lower temperature, with -0.6$\%$ at 300 K and +5$\%$ at 10 K, respectively. This behavior may originate from the specific spin polarized band structure of CrTe$_{2}$. Our findings provide insights into magnetism in few-layered CrTe$_{2}$, suggesting potential for future room temperature spintronic applications of such 2D vdW magnets.

preprint2016arXiv

A Large-scale Distributed Video Parsing and Evaluation Platform

Visual surveillance systems have become one of the largest data sources of Big Visual Data in real world. However, existing systems for video analysis still lack the ability to handle the problems of scalability, expansibility and error-prone, though great advances have been achieved in a number of visual recognition tasks and surveillance applications, e.g., pedestrian/vehicle detection, people/vehicle counting. Moreover, few algorithms explore the specific values/characteristics in large-scale surveillance videos. To address these problems in large-scale video analysis, we develop a scalable video parsing and evaluation platform through combining some advanced techniques for Big Data processing, including Spark Streaming, Kafka and Hadoop Distributed Filesystem (HDFS). Also, a Web User Interface is designed in the system, to collect users' degrees of satisfaction on the recognition tasks so as to evaluate the performance of the whole system. Furthermore, the highly extensible platform running on the long-term surveillance videos makes it possible to develop more intelligent incremental algorithms to enhance the performance of various visual recognition tasks.

preprint2016arXiv

Compiler-Assisted Workload Consolidation For Efficient Dynamic Parallelism on GPU

GPUs have been widely used to accelerate computations exhibiting simple patterns of parallelism - such as flat or two-level parallelism - and a degree of parallelism that can be statically determined based on the size of the input dataset. However, the effective use of GPUs for algorithms exhibiting complex patterns of parallelism, possibly known only at runtime, is still an open problem. Recently, Nvidia has introduced Dynamic Parallelism (DP) in its GPUs. By making it possible to launch kernels directly from GPU threads, this feature enables nested parallelism at runtime. However, the effective use of DP must still be understood: a naive use of this feature may suffer from significant runtime overhead and lead to GPU underutilization, resulting in poor performance. In this work, we target this problem. First, we demonstrate how a naive use of DP can result in poor performance. Second, we propose three workload consolidation schemes to improve performance and hardware utilization of DP-based codes, and we implement these code transformations in a directive-based compiler. Finally, we evaluate our framework on two categories of applications: algorithms including irregular loops and algorithms exhibiting parallel recursion. Our experiments show that our approach significantly reduces runtime overhead and improves GPU utilization, leading to speedup factors from 90x to 3300x over basic DP-based solutions and speedups from 2x to 6x over flat implementations.

preprint2015arXiv

Decomposition of solid hydrogen bromide at high pressure

The stability of different stoichiometric H$_n$Br ($n$=1-7) compounds under pressure are extensively studied using density functional theory calculations. Five new energetically stable stoichiometries of H$_2$Br, H$_3$Br, H$_4$Br, H$_5$Br, and H$_7$Br were uncovered at high pressure. The results show that HBr is stable below 64 GPa, then decomposes into new compound H$_2$Br and Br$_2$ molecular crystal. For H$_2$Br and H$_3$Br compounds, they were found to become stable above 30 GPa and 8 GPa, respectively. In addition, we accidentally discovered the triangular H$_3^+$ species in H$_5$Br compounds at 100 GPa. Further electron-phonon coupling calculations predicted that hydrogen-rich H$_2$Br and H$_4$Br compounds are superconductors with critical temperature of superconductivity $T_c$ of 12.1 K and 2.4 K at 240 GPa, respectively.

preprint2015arXiv

High-pressure structures and superconductivity of bismuth hydrides

We have systematically searched for the ground state structures of bismuth hydrides based on evolutionary algorithm method and particle swarm optimization algorithm method. Given only rich-hydrogen region, except for BiH$_{3}$, other hydrides (BiH, BiH$_{2}$, BiH$_{4}$, BiH$_{5}$, BiH$_{6}$) have been predicted to be stable with pressurization. With the increase of hydrogen content, hydrogen exists in bismuth hydrides with the different forms and presents the characteristics of ionicity. Under high pressure, the remarkable structural feature is the emergence of H$_{2}$ units in BiH$_{2}$, BiH$_{4}$ and BiH$_{6}$, and BiH$_{6}$ adopts a startling layered structure intercalated by H$_{2}$ and the linear H$_{3}$ units. Further calculations show these energetically stable hydrides are good metal and their metallic pressures are lower than that of pure solid hydrogen because of the doping impurities. The $T_{c}$ in the range of 20-119 K has been calculated by the Allen-Dynes modified McMillan equation, which indicates all these stable hydrides are potential high-temperature superconductors. Remarkably, it is the H-Bi-H and Bi atoms vibrations rather than the high-frequency H$_{2}$ or H$_{3}$ units that dominate the superconductivity. In addition, hydrogen content has a great influence on the superconducting transition temperature.

preprint2015arXiv

Phase diagram and superconductivity of polonium hydrides under high pressure

High pressure structures, phase diagram and superconductivity of polonium hydrides have been systematically investigated through the first-principles calculations based on the density functional theory. With the increasing pressure, several stoichiometries (PoH, $\textrm{PoH}_\textrm{2}$, $\textrm{PoH}_4$ and $\textrm{PoH}_6$) are predicted to stabilize in the excess hydrogen environment. All of the reported hydrides, exception of PoH, exhibit intriguing structural character with the appearing $\textrm{H}_2$ units. Moreover, our electronic band structure and the projected density of states (PDOS) demonstrate that these energetically stable phases are metallic. The application of the Allen-Dynes modified McMillan equation with the calculated electron-phonon coupling parameter reveals that $\textrm{PoH}_4$ is a superconductor with a critical temperature $T_c$ of 41.2-47.2 K at 300 GPa.

preprint2015arXiv

Pressure-induced decomposition of solid hydrogen sulfide

Solid hydrogen sulfide is well known as a typical molecular crystal but its stability under pressure is still under debate. Particularly, Eremets et al. found the high pressure superconductivity with $T_{c}\approx$ 190 K in a H$_{2}$S sample [arXiv: 1412.0460 (2014)] which is associates with the elemental decomposition into H$_{3}$S [Sci. Rep. 4, 6968 (2014)]. Therefore, on what pressure H$_{2}$S can decompose and which kind of the products of decomposition urgent need to be solved. In this paper, we have performed an extensive structural study on different stoichiometries H$_{n}$S with ${n> 1}$ under high pressure using $ab$ $initio$ calculations. Our results show that H$_{2}$S is stable below 50 GPa and decomposes into H$_3$S and sulfur at high pressure, while H$_{3}$S is stable at least up to 300 GPa. The other hydrogen-rich H$_{4}$S, H$_{5}$S, and H$_{6}$S are unstable in the pressure range from 20 to 300 GPa.

preprint2015arXiv

The unexpected binding and superconductivity in SbH4 at high pressure

The semimetal antimony (Sb) element doped into hydrogen has been performed theoretically to explored high-pressure crystal structure and superconductivity of antimony hydrides. The unexpected stoichiometry $\textrm{SbH}_\textrm{4}$ with $P6_3/mmc$ symmetry is found to have most negative enthalpy and embody the coexistence of covalent and ionic bonds. It is a metallic phase and stable in the pressure ranges of 127-300 GPa. Furthermore, a superconducting critical temperature ($T_c$) of 106 K is obtained at 150 GPa by employing the Allen-Dynes modified McMillan equation. In addition, an extrusive distinguishing feature is the presence of soft phonon modes, which is primary contribution to the strength of electron-phonon coupling.

preprint2012arXiv

Intermittent Josephson effect with feedback voltage and temperature oscillations in graphite-coated nanocapsules with superconducting TaC core

An intermittent Josephson effect in the form of voltage and temperature oscillations in the voltage - current curves near 2 K is observed in pellets consisting of superconducting TaC nanocapsules coated with graphite. This phenomenon is attributed to non-equilibrium conditions, when Cooper pairs across a junction, which stimulate the emission of photons and the feedback temperature change of the junction. It occurs in a three-dimensional granular framework composed of TaC/carbon/TaC tunneling junctions with a Mott metal-insulator transition, below the critical temperature Tc of non-ideal type-II superconductor TaC.

Da Li

What is connected

Connect this record

See the researcher in context

Building this map preview

21 published item(s)

An Electromagnetic-Information-Theory Based Model for Efficient Characterization of MIMO Systems in Complex Space

A Simple Test-Time Method for Out-of-Distribution Detection

AMS_ADRN at SemEval-2022 Task 5: A Suitable Image-text Multimodal Joint Modeling Method for Multi-task Misogyny Identification

Attacking Adversarial Defences by Smoothing the Loss Landscape

Dynamic Instance Domain Adaptation

Fisher SAM: Information Geometry and Sharpness Aware Minimisation

Multi-task Pre-training Language Model for Semantic Network Completion

Pushing the Limits of Simple Pipelines for Few-Shot Learning: External Data and Fine-Tuning Make a Difference

Quasi-static magnetic compression of field-reversed configuration plasma: Amended scalings and limits from two-dimensional MHD equilibrium

Efficient and Stable Finite Difference Modelling of Acoustic Wave Propagation in Variable-density Media

Online Meta-Learning for Multi-Source and Semi-Supervised Domain Adaptation

Sequential Learning for Domain Generalization

Room temperature 2D ferromagnetism in few-layered 1$T$-CrTe$_{2}$

A Large-scale Distributed Video Parsing and Evaluation Platform

Compiler-Assisted Workload Consolidation For Efficient Dynamic Parallelism on GPU

Decomposition of solid hydrogen bromide at high pressure

High-pressure structures and superconductivity of bismuth hydrides

Phase diagram and superconductivity of polonium hydrides under high pressure

Pressure-induced decomposition of solid hydrogen sulfide

The unexpected binding and superconductivity in SbH4 at high pressure

Intermittent Josephson effect with feedback voltage and temperature oscillations in graphite-coated nanocapsules with superconducting TaC core