Researcher profile

Nan Ding

Nan Ding contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
10works
0followers
10topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

10 published item(s)

preprint2022arXiv

All You May Need for VQA are Image Captions

Visual Question Answering (VQA) has benefited from increasingly sophisticated models, but has not enjoyed the same level of engagement in terms of data creation. In this paper, we propose a method that automatically derives VQA examples at volume, by leveraging the abundance of existing image-caption annotations combined with neural models for textual question generation. We show that the resulting data is of high-quality. VQA models trained on our data improve state-of-the-art zero-shot accuracy by double digits and achieve a level of robustness that lacks in the same model trained on human-annotated VQA data.

preprint2022arXiv

PACTran: PAC-Bayesian Metrics for Estimating the Transferability of Pretrained Models to Classification Tasks

With the increasing abundance of pretrained models in recent years, the problem of selecting the best pretrained checkpoint for a particular downstream classification task has been gaining increased attention. Although several methods have recently been proposed to tackle the selection problem (e.g. LEEP, H-score), these methods resort to applying heuristics that are not well motivated by learning theory. In this paper we present PACTran, a theoretically grounded family of metrics for pretrained model selection and transferability measurement. We first show how to derive PACTran metrics from the optimal PAC-Bayesian bound under the transfer learning setting. We then empirically evaluate three metric instantiations of PACTran on a number of vision tasks (VTAB) as well as a language-and-vision (OKVQA) task. An analysis of the results shows PACTran is a more consistent and effective transferability measure compared to existing selection methods.

preprint2021arXiv

Detection of a possible high-confidence radio quasi-periodic oscillation in the BL Lac PKS J2134-0153

We have searched quasi-periodic oscillations (QPOs) for BL Lac PKS J2134-0153 in the 15 GHz radio light curve announced by the Owens Valley Radio Observatory 40-m telescope during the period from 2008-01-05 to 2019-05-18, utilizing the Lomb-Scargle periodogram (LSP) and the weighted wavelet Z-transform (WWZ) techniques. This is the first time that to search for periodic radio signal in BL Lac PKS J2134-0153 by these two methods. These two methods consistently reveal a QPO of 4.69 $\pm$ 0.14 years (>5 $σ$ confidence level). We discuss possible causes for this QPO, and we expected that the binary black holes scenario, where the QPO is caused by the precession of the binary black holes, is the most likely explanation. BL Lac PKS J2134-0153 thus could be a good binary black hole candidate. In the binary black holes scenario, the distance between the primary black hole and the secondary black hole is 1.83$\times$10$^{16}$ cm.

preprint2020arXiv

A two-zone blazar radiation model for "orphan" neutrino flares

In this work, we investigate the 2014-2015 neutrino flare associated with the blazar TXS 0506+056 and a recently discovered muon neutrino event IceCube-200107A in spatial coincidence with the blazar 4FGL J0955.1+3551, under the framework of a two-zone radiation model of blazars where an inner/outer blob close to/far from the supermassive black hole are invoked. An interesting feature that the two sources share in common is that no evidence of GeV gamma-ray activity is found during the neutrino detection period, probably implying a large opacity for GeV gamma rays in the neutrino production region. In our model, continuous particle acceleration/injection takes place in the inner blob at the jet base, where the hot X-ray corona of the supermassive black hole provides target photon fields for efficient neutrino production and strong GeV gamma-ray absorption. We show that this model can self-consistently interpret the neutrino emission from both two blazars in a large parameter space. In the meantime, the dissipation processes in outer blob are responsible for the simultaneous multi-wavelength emission of both sources. In agreement with previous studies of TXS 0506+056 and, an intense MeV emission from the induced electromagnetic cascade in the inner blob is robustly expected to accompany the neutrino flare in our model could be used to test the model with the next-generation MeV gamma-ray detector in the future.

preprint2020arXiv

From the Fermi blazar sequence to the relation between Fermi blazars and gamma-ray Narrow-line Seyfert 1 Galaxies

We use the third catalog of blazars detected by Fermi/LAT (3LAC) and gamma-ray Narrow-line Seyfert 1 Galaxies (gamma-NLSy1s) to study the blazar sequence and relationship between them. Our results are as follows: (i) There is a weak anti-correlation between synchrotron peak frequency and peak luminosity for both Fermi blazars and gamma-NLSy1s, which supports the blazar sequence. However, after Doppler correction, the inverse correlation disappeared, which suggests that anti-correlation between synchrotron peak frequency and peak luminosity is affected by the beaming effect. (ii) There is a significant anti-correlation between jet kinetic power and synchrotron peak frequency for both Fermi blazars and gamma-NLSy1s, which suggests that the gamma-NLSy1s could fit well into the original blazar sequence. (iii) According to previous work, the relationship between synchrotron peak frequency and synchrotron curvature can be explained by statistical or stochastic acceleration mechanisms. There are significant correlations between synchrotron peak frequency and synchrotron curvature for whole sample, Fermi blazars and BL Lacs, respectively. The slopes of the correlation are consistent with statistical acceleration. For FSRQs, LBLs, IBLs, HBLs, and gamma-NLS1s, we also find a significant correlation, but in these cases the slopes can not be explained by previous theoretical models. (iv) The slope of relation between synchrotron peak frequency and synchrotron curvature in gamma-NLS1s is large than that of FSRQs and BL Lacs. This result may imply that the cooling dominates over the acceleration process for FSRQs and BL Lacs, while gamma-NLS1s is the opposite.

preprint2020arXiv

iqiyi Submission to ActivityNet Challenge 2019 Kinetics-700 challenge: Hierarchical Group-wise Attention

In this report, the method for the iqiyi submission to the task of ActivityNet 2019 Kinetics-700 challenge is described. Three models are involved in the model ensemble stage: TSN, HG-NL and StNet. We propose the hierarchical group-wise non-local (HG-NL) module for frame-level features aggregation for video classification. The standard non-local (NL) module is effective in aggregating frame-level features on the task of video classification but presents low parameters efficiency and high computational cost. The HG-NL method involves a hierarchical group-wise structure and generates multiple attention maps to enhance performance. Basing on this hierarchical group-wise structure, the proposed method has competitive accuracy, fewer parameters and smaller computational cost than the standard NL. For the task of ActivityNet 2019 Kinetics-700 challenge, after model ensemble, we finally obtain an averaged top-1 and top-5 error percentage 28.444% on the test set.

preprint2020arXiv

LOGAN: High-Performance GPU-Based X-Drop Long-Read Alignment

Pairwise sequence alignment is one of the most computationally intensive kernels in genomic data analysis, accounting for more than 90% of the runtime for key bioinformatics applications. This method is particularly expensive for third-generation sequences due to the high computational cost of analyzing sequences of length between 1Kb and 1Mb. Given the quadratic overhead of exact pairwise algorithms for long alignments, the community primarily relies on approximate algorithms that search only for high-quality alignments and stop early when one is not found. In this work, we present the first GPU optimization of the popular X-drop alignment algorithm, that we named LOGAN. Results show that our high-performance multi-GPU implementation achieves up to 181.6 GCUPS and speed-ups up to 6.6x and 30.7x using 1 and 6 NVIDIA Tesla V100, respectively, over the state-of-the-art software running on two IBM Power9 processors using 168 CPU threads, with equivalent accuracy. We also demonstrate a 2.3x LOGAN speed-up versus ksw2, a state-of-art vectorized algorithm for sequence alignment implemented in minimap2, a long-read mapping software. To highlight the impact of our work on a real-world application, we couple LOGAN with a many-to-many long-read alignment software called BELLA, and demonstrate that our implementation improves the overall BELLA runtime by up to 10.6x. Finally, we adapt the Roofline model for LOGAN and demonstrate that our implementation is near-optimal on the NVIDIA Tesla V100s.

preprint2020arXiv

Multi-wavelength Selected Compton-thick AGNs in Chandra Deep Field-South Survey

Even in deep X-ray surveys, Compton-thick active galactic nuclei (CT AGNs, ${\rm N_H} \geqslant 1.5~\times~10^{24}~{\rm cm}^{-2}$) are difficult to be identified due to X-ray flux suppression and their complex spectral shape. However, the study of CT AGNs is vital for understanding the rapid growth of black holes and the origin of cosmic X-ray background. In the local universe, the fraction of CT AGNs accounts for 30% of the whole AGN population. We may expect a higher fraction of CT AGNs in deep X-ray surveys, however, only 10% of AGNs have been identified as CT AGNs in the 7 Ms \textit{Chandra} Deep Field-South (CDFS) survey. In this work, we select 51 AGNs with abundant multi-wavelength data. Using the method of the mid-infrared (mid-IR) excess, we select hitherto unknown 8 CT AGN candidates in our sample. Seven of these candidates can confirm as CT AGN based on the multi-wavelength identification approach, and a new CT AGN (XID 133) is identified through the mid-IR diagnostics. We also discuss the X-ray origin of these eight CT AGNs and the reason why their column densities were underestimated in previous studies. We find that the multi-wavelength approaches of selecting CT AGNs are highly efficient, provided the high quality of observational data. We also find that CT AGNs have a higher Eddington ratio than non-CT AGNs, and that both CT AGNs and non-CT AGNs show similar properties of host galaxies.

preprint2020arXiv

Multicolor Optical Monitoring of the Blazar S5 0716+714 from 2017 to 2019

We continuously monitored the blazar S5 0716+714 in the optical $g$, $r$ and $i$ bands from Nov. 10, 2017 to Jun. 06, 2019. The total number of observations is 201 nights including 26973 data points. This is a very large quasi-simultaneous multicolor sample for the blazar. The average time spans and time resolutions are 3.4 hours and 2.9 minutes per night, respectively. During the period of observations, the target source in the $r$ band brightens from $14^{\rm m}.16$ to $12^{\rm m}.29$ together with five prominent sub-flares, and then first becomes fainter to $14^{\rm m}.76$ and again brightens to $12^{\rm m}.94$ with seven prominent sub-flares. For the long-term variations, we find a strong flatter when brighter (FWB) trend at a low flux state and then a weak FWB trend at a higher flux state. A weak FWB trend at a low flux state and then a strong FWB trend at a higher flux state are also reported. Most of sub-flares show the strong FWB trends, except for two flares with a weak FWB trend. The particle acceleration and cooling mechanisms together with the superposition of different FWB-slopes from sub-flares are likely to explain the optical color behaviours. A scenario of bent jet is discussed.

preprint2020arXiv

Talking-Heads Attention

We introduce "talking-heads attention" - a variation on multi-head attention which includes linearprojections across the attention-heads dimension, immediately before and after the softmax operation.While inserting only a small number of additional parameters and a moderate amount of additionalcomputation, talking-heads attention leads to better perplexities on masked language modeling tasks, aswell as better quality when transfer-learning to language comprehension and question answering tasks.