Source author record

Dongfang Li

Dongfang Li appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.


Computation and Language, Information Theory, math.IT, math.NA, Numerical Analysis, physics.optics, Artificial Intelligence, Computer Vision, math.DS, Populations and Evolution

Catalog footprint

What is connected

14 works

10 topics

4 close collaborators


preprint, 2023, arXiv

Towards Faithful Explanations for Text Classification with Robustness Improvement and Explanation Guided Training

Feature attribution methods highlight the important input tokens as explanations of model predictions and have been widely applied to deep neural networks in pursuit of trustworthy AI. However, recent work shows that the explanations these methods provide are often neither faithful nor robust. In this paper, we propose a method with Robustness improvement and Explanation Guided training towards more faithful EXplanations (REGEX) for text classification. First, we improve model robustness through input gradient regularization and virtual adversarial training. Second, we use salient ranking to mask noisy tokens and maximize the similarity between model attention and feature attribution, which can be seen as a self-training procedure that imports no external information. We conduct extensive experiments on six datasets with five attribution methods, and also evaluate faithfulness in the out-of-domain setting. The results show that REGEX improves fidelity metrics of explanations in all settings and further achieves consistent gains based on two randomization tests. Moreover, we show that using highlight explanations produced by REGEX to train select-then-predict models results in task performance comparable to the end-to-end method.

preprint, 2022, arXiv

A Survey on Table Question Answering: Recent Advances

Table Question Answering (Table QA) refers to providing precise answers from tables in response to a user's question. In recent years, there has been a great deal of work on table QA, but comprehensive surveys of this research topic are lacking. Hence, we aim to provide an overview of available datasets and representative methods in table QA. We classify existing methods for table QA into five categories according to their techniques: semantic-parsing-based, generative, extractive, matching-based, and retriever-reader-based methods. Moreover, as table QA is still a challenging task for existing methods, we also identify and outline several key challenges and discuss potential future directions for table QA.

preprint, 2022, arXiv

Data-driven method to learn the most probable transition pathway and stochastic differential equations

Transition phenomena between metastable states play an important role in complex systems due to noisy fluctuations. In this paper, physics-informed neural networks (PINNs) are used to compute the most probable transition pathway. It is shown that the expected loss is bounded by the empirical loss, and a convergence result for the empirical loss is obtained. Then, a sampling method for rare events is presented to simulate the transition path via the Markovian bridge process. We also investigate the inverse problem of extracting the stochastic differential equation from the most probable transition pathway data and from the Markovian bridge process data, respectively. Finally, several numerical experiments are presented to verify the effectiveness of our methods.

preprint, 2022, arXiv

MSDF: A General Open-Domain Multi-Skill Dialog Framework

Dialog systems have achieved significant progress and have been widely used in various scenarios. Previous research has mainly focused on designing dialog generation models for a single scenario, while comprehensive abilities are required to handle tasks under various scenarios in the real world. In this paper, we propose a general Multi-Skill Dialog Framework, namely MSDF, which can be applied to different dialog tasks (e.g., knowledge-grounded dialog and persona-based dialog). Specifically, we propose a transferable response generator pre-trained on diverse large-scale dialog corpora as the backbone of MSDF, consisting of BERT-based encoders and a GPT-based decoder. To select the response consistent with the dialog history, we propose a consistency selector trained through negative sampling. Moreover, a flexible copy mechanism for external knowledge is employed to enhance the utilization of multiform knowledge in various scenarios. We conduct experiments on knowledge-grounded dialog, recommendation dialog, and persona-based dialog tasks. The experimental results indicate that MSDF outperforms the baseline models by a large margin. In the Multi-skill Dialog track of the 2021 Language and Intelligence Challenge, our general MSDF won the 3rd prize, which shows that MSDF is effective and competitive.

preprint, 2021, arXiv

Sharp pointwise-in-time error estimate of L1 scheme for nonlinear subdiffusion equations

An essential feature of subdiffusion equations with the $α$-order time fractional derivative is the weak singularity at the initial time. The weak regularity of the solution is usually characterized by a regularity parameter $σ\in (0,1)\cup(1,2)$. Under this general regularity assumption, we obtain a pointwise-in-time error estimate for the widely used L1 scheme for nonlinear subdiffusion equations. To this end, we present a refined discrete fractional-type Grönwall inequality and a rigorous analysis of the truncation errors. Numerical experiments are provided to demonstrate the effectiveness of our theoretical analysis.
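For readers unfamiliar with the L1 scheme named in this abstract, the following is a minimal sketch of its standard form on a uniform time mesh, approximating a Caputo derivative of order $α\in(0,1)$ by weighting the increments of the solution. It is not taken from the paper: the function name `l1_caputo`, the uniform step `tau`, and the test function $u(t)=t$ are assumptions made for illustration, and the paper's own mesh and nonlinear treatment may differ.

```python
import math


def l1_caputo(u, tau, alpha):
    """Approximate the Caputo derivative of order alpha at t_n = n * tau.

    u     : values u(t_0), ..., u(t_n) on a uniform mesh (hypothetical input)
    tau   : uniform time step
    alpha : fractional order, assumed to lie in (0, 1)
    """
    n = len(u) - 1
    if n < 1 or not (0.0 < alpha < 1.0):
        raise ValueError("need at least two samples and 0 < alpha < 1")
    total = 0.0
    for k in range(1, n + 1):
        j = n - k
        # Standard L1 weight a_j = (j+1)^(1-alpha) - j^(1-alpha).
        a_j = (j + 1) ** (1.0 - alpha) - j ** (1.0 - alpha)
        total += a_j * (u[k] - u[k - 1])
    return total * tau ** (-alpha) / math.gamma(2.0 - alpha)


if __name__ == "__main__":
    # The L1 scheme is exact for linear functions: for u(t) = t the Caputo
    # derivative is t^(1-alpha) / Gamma(2-alpha).
    tau, alpha, n = 0.1, 0.5, 10
    samples = [k * tau for k in range(n + 1)]
    exact = (n * tau) ** (1.0 - alpha) / math.gamma(2.0 - alpha)
    print(l1_caputo(samples, tau, alpha), exact)
```

Because the scheme interpolates the solution linearly on each subinterval, it reproduces the derivative of a linear function up to rounding, which makes $u(t)=t$ a convenient sanity check.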

preprint, 2021, arXiv

Unconditionally optimal convergence of an energy-conserving and linearly implicit scheme for nonlinear wave equations

In this paper, we present and analyze an energy-conserving and linearly implicit scheme for solving nonlinear wave equations. Optimal error estimates in time and superconvergent error estimates in space are established without any time-step restriction dependent on the spatial mesh size. The key is to estimate directly the solution bounds in the $H^2$-norm for both the nonlinear wave equation and the corresponding fully discrete scheme, whereas previous investigations rely on the temporal-spatial error splitting approach. Numerical examples are presented to confirm the energy-conserving properties, unconditional convergence, and optimal error estimates of the proposed fully discrete schemes.

preprint, 2020, arXiv

Semi-supervised Visual Feature Integration for Pre-trained Language Models

Integrating visual features has proved useful for natural language understanding tasks. Nevertheless, in most existing multimodal language models, the alignment of visual and textual data is expensive. In this paper, we propose a novel semi-supervised visual integration framework for pre-trained language models. In the framework, the visual features are obtained through a visualization and fusion mechanism. Its distinguishing features are: 1) the integration is conducted via a semi-supervised approach, which does not require aligned images for every sentence; 2) the visual features are integrated as an external component and can be directly used by pre-trained language models. To verify the efficacy of the proposed framework, we conduct experiments on both natural language inference and reading comprehension tasks. The results demonstrate that our mechanism brings improvement to two strong baseline models. Considering that our framework only requires an image database and does not require further alignment, it provides an efficient and feasible way for multimodal language learning.

preprint, 2016, arXiv

A Novel Sufficient Condition for Generalized Orthogonal Matching Pursuit

Generalized orthogonal matching pursuit (gOMP), also called orthogonal multi-matching pursuit, is an extension of OMP in the sense that $N\geq1$ indices are identified per iteration. In this paper, we show that if the restricted isometry constant (RIC) $δ_{NK+1}$ of a sensing matrix $\mathbf{A}$ satisfies $δ_{NK+1} < 1/\sqrt{K/N+1}$, then under a condition on the signal-to-noise ratio, gOMP identifies at least one index in the support of any $K$-sparse signal $\mathbf{x}$ from $\mathbf{y}=\mathbf{A}\mathbf{x}+\mathbf{v}$ at each iteration, where $\mathbf{v}$ is a noise vector. Surprisingly, this condition does not require $N\leq K$, which is needed in Wang et al. 2012 and Liu et al. 2012. Thus, $N$ can have more choices. When $N=1$, it reduces to a sufficient condition for OMP that is less restrictive than the one proposed in Wang 2015. Moreover, in the noise-free case, it is a sufficient condition for accurately recovering $\mathbf{x}$ in $K$ iterations that is less restrictive than the best known one. In particular, it reduces to the sharp condition proposed in Mo 2015 when $N=1$.
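To make the "$N$ indices per iteration" idea concrete, here is a minimal NumPy sketch of gOMP. It assumes the usual selection rule (pick the $N$ unselected columns with the largest absolute correlation with the residual, then refit by least squares on the accumulated support); the abstract itself does not spell out the rule. The function name `gomp`, the tolerance `tol`, and the demo dimensions are illustrative assumptions, and the sketch does not check the RIC condition discussed above.

```python
import numpy as np


def gomp(A, y, K, N=1, tol=1e-10):
    """Sketch of generalized OMP; with N=1 it behaves like plain OMP.

    A : (m, n) sensing matrix, y : (m,) measurements,
    K : maximum number of iterations (assumed >= 1),
    N : number of indices selected per iteration.
    """
    A = np.asarray(A, dtype=float)
    y = np.asarray(y, dtype=float)
    n = A.shape[1]
    selected = np.zeros(n, dtype=bool)
    residual = y.copy()
    for _ in range(K):
        corr = np.abs(A.T @ residual)
        corr[selected] = -np.inf           # never reselect a column
        picks = np.argsort(corr)[-N:]      # N largest correlations
        selected[picks] = True
        idx = np.flatnonzero(selected)
        # Least-squares refit on the whole support chosen so far.
        coef, *_ = np.linalg.lstsq(A[:, idx], y, rcond=None)
        residual = y - A[:, idx] @ coef
        if np.linalg.norm(residual) <= tol:
            break
    x_hat = np.zeros(n)
    x_hat[idx] = coef
    return x_hat, idx


if __name__ == "__main__":
    # Hypothetical demo: Gaussian sensing matrix, 3-sparse signal, no noise.
    rng = np.random.default_rng(0)
    A = rng.standard_normal((40, 100)) / np.sqrt(40)
    x = np.zeros(100)
    x[[5, 17, 60]] = [1.5, -2.0, 1.0]
    x_hat, idx = gomp(A, A @ x, K=3, N=2)
    print("selected support:", idx)
```

Selecting $N>1$ columns per iteration can finish in fewer iterations at the cost of a larger least-squares problem, which is why the returned support may contain more than $K$ indices.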

preprint, 2016, arXiv

Strong Amplitude and Phase Modulation of Optical Spatial Coherence with Surface Plasmon Polaritons

The degree of optical spatial coherence, a fundamental property of light that describes the mutual correlations between fluctuating electromagnetic fields, has proven challenging to control at the micrometer scale. Here we employ surface plasmon polaritons (evanescent waves excited on both surfaces of a thin metal film) as a means to entangle the random fluctuations of the incident electromagnetic fields at the slit locations of a Young's double-slit interferometer. Strong tunability of the complex degree of spatial coherence of light is achieved by finely varying the separation distance between the two slits. Continuous modulation of the degree of spatial coherence with amplitudes ranging from 0% up to 80% allows us to transform totally incoherent incident light into highly coherent light, and vice versa. These findings pave the way for alternative methods to engineer flat optical elements with multi-functional capabilities beyond conventional refractive- and diffractive-based photonic metasurfaces.

preprint, 2014, arXiv

Epidemic clones, oceanic gene pools and eco-LD in the free living marine pathogen Vibrio parahaemolyticus

We investigated global patterns of variation in 157 whole genome sequences of Vibrio parahaemolyticus, a free-living and seafood-associated marine bacterium. Pandemic clones, responsible for recent outbreaks of gastroenteritis in humans, have spread globally. However, there are oceanic gene pools, one located in the oceans surrounding Asia and another in the Mexican Gulf. Frequent recombination means that most isolates have acquired the genetic profile of their current location. We investigated the genetic structure in the Asian gene pool by calculating the effective population size in two different ways. Under standard neutral models, the two estimates should give similar answers, but we found a thirtyfold difference. We propose that this discrepancy is caused by the subdivision of the species into a hundred or more ecotypes which are maintained stably in the population. To investigate the genetic factors involved, we used 51 unrelated isolates to conduct a genome-wide scan for epistatically interacting loci. We found a single example of strong epistasis between distant genome regions. A majority of strains had a type VI secretion system associated with bacterial killing. The remaining strains had genes associated with biofilm formation and regulated by c-di-GMP signaling. All strains had one or the other of the two systems, and no isolate had a complete complement of both systems, although several strains had remnants. Further top-down analysis of patterns of linkage disequilibrium within frequently recombining species will allow a detailed understanding of how selection acts to structure the pattern of variation within natural bacterial populations.

preprint, 2014, arXiv

Improved Bounds on the Restricted Isometry Constant for Orthogonal Matching Pursuit

In this letter, we first construct a counterexample to show that for any given positive integer $K\geq 2$ and for any $\frac{1}{\sqrt{K+1}}\leq t<1$, there always exist a $K$-sparse $\mathbf{x}$ and a matrix $\mathbf{A}$ with the restricted isometry constant $δ_{K+1}=t$ such that the OMP algorithm fails in $K$ iterations. Secondly, we show that even when $δ_{K+1}=\frac{1}{\sqrt{K}+1}$, the OMP algorithm can also perfectly recover every $K$-sparse vector $\mathbf{x}$ from $\mathbf{y}=\mathbf{A}\mathbf{x}$ in $K$ iterations. This improves the best existing results, which were independently given by Mo et al. and Wang et al.

preprint, 2014, arXiv

Quantifying and controlling the magnetic dipole contribution to 1.5 $μ$m light emission in erbium-doped yttrium oxide

We experimentally quantify the contribution of magnetic dipole (MD) transitions to the near-infrared light emission from trivalent erbium-doped yttrium oxide (Er$^{3+}$:Y$_2$O$_3$). Using energy-momentum spectroscopy, we demonstrate that the $^4$I$_{13/2}{\to}^4$I$_{15/2}$ emission near 1.5 $μ$m originates from nearly equal contributions of electric dipole (ED) and MD transitions that exhibit distinct emission spectra. We then show how these distinct spectra, together with the differing local density of optical states (LDOS) for ED and MD transitions, can be leveraged to control Er$^{3+}$ emission in structured environments. We demonstrate that far-field emission spectra can be tuned to resemble almost pure emission from either ED or MD transitions, and show that the observed spectral modifications can be accurately predicted from the measured ED and MD intrinsic emission rates.

preprint, 2014, arXiv

Stable Recovery of Sparse Signals via $l_p$-Minimization

In this paper, we show that, under the assumption that $\|\mathbf{e}\|_2\leq ε$, every $k$-sparse signal $\mathbf{x}\in \mathbb{R}^n$ can be stably ($ε\neq0$) or exactly ($ε=0$) recovered from $\mathbf{y}=\mathbf{A}\mathbf{x}+\mathbf{e}$ via $l_p$-minimization with $p\in(0, \bar{p}]$, where

$$\bar{p}= \begin{cases} \frac{50}{31}(1-δ_{2k}), & δ_{2k}\in[\frac{\sqrt{2}}{2}, 0.7183)\\ 0.4541, & δ_{2k}\in[0.7183,0.7729)\\ 2(1-δ_{2k}), & δ_{2k}\in[0.7729,1) \end{cases}$$

even if the restricted isometry constant of $\mathbf{A}$ satisfies $δ_{2k}\in[\frac{\sqrt{2}}{2}, 1)$. Furthermore, under the assumption that $n\leq 4k$, we show that the range of $p$ can be further improved to $p\in(0,\frac{3+2\sqrt{2}}{2}(1-δ_{2k})]$. This not only extends some discussions of noiseless recovery only (Lai et al. and Wu et al.) to noisy recovery, but also greatly improves the best existing results, where $p\in(0,\min\{1, 1.0873(1-δ_{2k}) \})$ (Wu et al.).
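The piecewise bound $\bar{p}$ quoted in this abstract is easy to misread in running text, so the following small sketch evaluates it for a given $δ_{2k}$. The function name `p_bar` and the decision to raise an error outside $[\frac{\sqrt{2}}{2}, 1)$ are assumptions made for illustration; the constants are copied from the abstract as given.

```python
import math


def p_bar(delta_2k):
    """Upper end of the admissible range p in (0, p_bar] from the abstract.

    Only defined for delta_2k in [sqrt(2)/2, 1); other inputs are rejected
    (an illustrative choice, not something the abstract prescribes).
    """
    lower = math.sqrt(2.0) / 2.0
    if not (lower <= delta_2k < 1.0):
        raise ValueError("delta_2k must lie in [sqrt(2)/2, 1)")
    if delta_2k < 0.7183:
        return 50.0 / 31.0 * (1.0 - delta_2k)
    if delta_2k < 0.7729:
        return 0.4541
    return 2.0 * (1.0 - delta_2k)


if __name__ == "__main__":
    for d in (0.71, 0.75, 0.9):
        print(d, p_bar(d))
```

The three branches agree to roughly three decimal places at the breakpoints 0.7183 and 0.7729, so the bound is close to continuous across the whole interval.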

preprint, 2014, arXiv

Wide-angle energy-momentum spectroscopy

Light emission is defined by its distribution in energy, momentum, and polarization. Here, we demonstrate a method that resolves these distributions by means of wide-angle energy-momentum spectroscopy. Specifically, we image the back focal plane of a microscope objective through a Wollaston prism to obtain polarized Fourier-space momentum distributions, and disperse these two-dimensional radiation patterns through an imaging spectrograph without an entrance slit. The resulting measurements represent a convolution of individual radiation patterns at adjacent wavelengths, which can be readily deconvolved using any well-defined basis for light emission. As an illustrative example, we use this technique with the multipole basis to quantify the intrinsic emission rates for electric and magnetic dipole transitions in europium-doped yttrium oxide (Eu$^{3+}$:Y$_{2}$O$_{3}$) and chromium-doped magnesium oxide (Cr$^{3+}$:MgO). Once extracted, these rates allow us to reconstruct the full, polarized, two-dimensional radiation patterns at each wavelength.
