Source author record

Lin Cheng

Lin Cheng appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher · Unclaimed source record

Computer Vision Machine Learning cond-mat.mtrl-sci eess.IV math.OC Networking and Internet Architecture physics.optics

Catalog footprint

What is connected

7 works

7 topics

4 close collaborators

preprint · 2022 · arXiv

Machine Learning Interatomic Potential for Anisotropic Thermal Transport in Bulk Hexagonal Boron Nitride

The highly anisotropic thermal conductivity in layered materials is crucial for a broad range of applications such as thermal management of electronic devices, thermal insulation, and thermoelectrics. Understanding of anisotropic thermal transport in layered materials largely depends on atomistic simulations based on density functional theory (DFT) or empirical potentials, which, however, suffer from either low computational efficiency or low accuracy. Recently, machine learning interatomic potentials (MLIPs) have emerged as a powerful tool to bridge the gap. Despite the recent progress in developing MLIPs, little attention has been paid to constructing a potential that can accurately predict the thermal properties of layered materials, which is more challenging than for isotropic materials because of the highly anisotropic bonding and weak van der Waals interactions in layered materials. Here, we introduce an MLIP within the Gaussian approximation potential (GAP) framework for bulk hexagonal boron nitride (h-BN) with a typical layered structure. The GAP accurately predicts the highly anisotropic phonon transport properties and thermal conductivity of bulk h-BN with DFT-level accuracy at a cost reduced by orders of magnitude. Our work demonstrates the ability of GAP to reproduce the subtle features of anisotropic potential energy surfaces of bulk h-BN and potentially other layered materials. Atomistic simulations based on MLIPs are expected to greatly advance the understanding of phonon transport and the prediction of thermophysical properties in layered materials.

preprint · 2022 · arXiv

TSGB: Target-Selective Gradient Backprop for Probing CNN Visual Saliency

The explanation of deep neural networks has drawn extensive attention in the deep learning community over the past few years. In this work, we study visual saliency, a.k.a. visual explanation, to interpret convolutional neural networks. Compared to iteration-based saliency methods, single-backward-pass saliency methods are faster, and they are widely used in downstream visual tasks. Thus, we focus on single-backward-pass methods. However, existing methods in this category struggle to produce fine-grained saliency maps concentrating on specific target classes. That said, producing faithful saliency maps that are both target-selective and fine-grained using a single backward pass is a challenging problem in the field. To mitigate this problem, we revisit the gradient flow inside the network and find that the entangled semantics and original weights may disturb the propagation of target-relevant saliency. Inspired by those observations, we propose a novel visual saliency method, termed Target-Selective Gradient Backprop (TSGB), which leverages rectification operations to effectively emphasize target classes and further efficiently propagate the saliency to the image space, thereby generating target-selective and fine-grained saliency maps. The proposed TSGB consists of two components, namely, TSGB-Conv and TSGB-FC, which rectify the gradients for convolutional layers and fully-connected layers, respectively. Extensive qualitative and quantitative experiments on the ImageNet and Pascal VOC datasets show that the proposed method achieves more accurate and reliable results than other competitive methods. Code is available at https://github.com/123fxdx/CNNvisualizationTSGB.

preprint · 2020 · arXiv

Exploring the parameter reusability of CNN

In recent times, using small data to train networks has become a hot topic in the field of deep learning. Reusing pre-trained parameters is one of the most important strategies to address the issue of semi-supervised and transfer learning. However, the fundamental reason for the success of these methods is still unclear. In this paper, we propose a solution that judges three things: whether a given network is reusable, based on the performance of reusing its convolution kernels; which layers' parameters of the given network can be reused, based on the performance of reusing the corresponding parameters; and, ultimately, whether those parameters are reusable in a target task, based on the root mean square error (RMSE) of the corresponding convolution kernels. Specifically, we define that the success of a CNN's parameter reuse depends upon two conditions: first, the network is a reusable network; and second, the RMSE between the convolution kernels from the source domain and target domain is small enough. The experimental results demonstrate that the performance of reused parameters applied to target tasks, when these conditions are met, is significantly improved.
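The second condition above, a small RMSE between source-domain and target-domain convolution kernels, can be illustrated with a minimal sketch. It is not the authors' code: the function names, the flat-list kernel representation and the threshold value of 0.1 are assumptions chosen for illustration, since the abstract does not say how small "small enough" is.

```python
import math


def kernel_rmse(source_kernel, target_kernel):
    """Root mean square error between two kernels given as flat lists of weights."""
    if len(source_kernel) != len(target_kernel):
        raise ValueError("kernels must have the same number of weights")
    if not source_kernel:
        raise ValueError("kernels must not be empty")
    squared_errors = [(s - t) ** 2 for s, t in zip(source_kernel, target_kernel)]
    return math.sqrt(sum(squared_errors) / len(squared_errors))


def is_reusable(source_kernel, target_kernel, threshold=0.1):
    """Hypothetical reuse test: accept the kernel if the RMSE is at most `threshold`.

    The threshold is an illustrative assumption, not a value from the paper.
    """
    return kernel_rmse(source_kernel, target_kernel) <= threshold


# A flattened 2x2 source kernel and two candidate target-domain kernels.
source = [0.20, -0.10, 0.05, 0.30]
close_target = [0.22, -0.12, 0.04, 0.31]
far_target = [0.90, 0.60, -0.70, -0.40]

print(is_reusable(source, close_target))
print(is_reusable(source, far_target))
```

In practice the kernels would be flattened from a trained network's layers, and the threshold would have to be chosen per task.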

preprint · 2020 · arXiv

Learning Object Scale With Click Supervision for Object Detection

Weakly-supervised object detection has recently attracted increasing attention since it only requires image-level annotations. However, the performance obtained by existing methods is still far from being satisfactory compared with fully-supervised object detection methods. To achieve a good trade-off between annotation cost and object detection performance, we propose a simple yet effective method which incorporates CNN visualization with click supervision to generate the pseudo ground-truths (i.e., bounding boxes). These pseudo ground-truths can be used to train a fully-supervised detector. To estimate the object scale, we first adopt a proposal selection algorithm to preserve high-quality proposals, and then generate Class Activation Maps (CAMs) for these preserved proposals by the proposed CNN visualization algorithm called Spatial Attention CAM. Finally, we fuse these CAMs together to generate pseudo ground-truths and train a fully-supervised object detector with these ground-truths. Experimental results on the PASCAL VOC 2007 and VOC 2012 datasets show that the proposed method can obtain much higher accuracy for estimating the object scale, compared with the state-of-the-art image-level based methods and the center-click based method.

preprint · 2019 · arXiv

Real-Time Optimal Control for Irregular Asteroid Landings Using Deep Neural Networks

Precise soft landings on asteroids are central to many deep space missions for surface exploration and resource exploitation. To improve the autonomy and intelligence of landing control, a real-time optimal control approach using deep neural networks (DNNs) is proposed for asteroid landing problems, wherein the developed DNN-based landing controller is capable of steering the lander to a preselected landing site with high robustness to initial conditions. First, to significantly reduce the time consumption of gravity calculation, DNNs are used to approximate the irregular gravitational field of the asteroid based on samples from a polyhedral method. Then, an approximate indirect method is presented to solve the time-optimal landing problems with high computational efficiency by taking advantage of the designed gravity approximation method and a homotopy technique. Furthermore, five DNNs are developed to learn the functional relationship between the state and the optimal actions obtained by the approximate indirect method, and the resulting DNNs can generate the optimal control instructions in real time because there is no longer a need to solve the optimal landing problems onboard. Finally, a DNN-based landing controller composed of these five DNNs is devised to achieve real-time optimal control for asteroid landings. Simulation results of the time-optimal landing for Eros are given to substantiate the effectiveness of these techniques and illustrate the real-time performance, control optimality, and robustness of the developed DNN-based optimal landing controller.

preprint · 2016 · arXiv

Near Optimal Adaptive Shortest Path Routing with Stochastic Links States under Adversarial Attack

We consider the shortest path routing (SPR) of a network with stochastically time-varying link metrics under potential adversarial attacks. Due to potential denial-of-service attacks, the distributions of link states could be stochastic (benign) or adversarial at different temporal and spatial locations. Without any a priori knowledge, designing an adaptive SPR protocol that copes optimally with all possible situations in practice is a very challenging issue. In this paper, we present the first solution by formulating it as a multi-armed bandit (MAB) problem. By introducing a novel control parameter into the exploration phase for each link, a martingale inequality is applied in our combinatorial adversarial MAB framework. As such, our proposed algorithms could automatically detect features of the environment within a unified framework and find the optimal SPR strategies with almost optimal learning performance in all possible cases over time. Moreover, we study important issues related to the practical implementation, such as decoupling route selection from multi-path route probing, cooperative learning among multiple sources, the "cold-start" issue and delayed feedback of our algorithm. Nonetheless, the proposed SPR algorithms can be implemented with low complexity and they are proved to scale very well with the network size. Compared to existing approaches in a typical network scenario under jamming attacks, our algorithm has a 65.3% improvement of network delay given a learning period and an 81.5% improvement of learning duration under a specified network delay.
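The idea of treating route selection as a multi-armed bandit problem can be illustrated with a minimal sketch. It uses plain EXP3 over a small, fixed set of candidate paths, which is a simpler stand-in and not the combinatorial algorithm the abstract describes. The function name exp3_route, the gamma value, the path labels and the assumption that observed delays are normalised to the range [0, 1] are all hypothetical.

```python
import math
import random


def exp3_route(paths, delay_of, rounds, gamma=0.1, seed=0):
    """Pick one path per round with EXP3 and return how often each path was used.

    `delay_of(path)` must return the observed delay, normalised to [0, 1].
    Only the chosen path's delay is observed each round (bandit feedback).
    """
    rng = random.Random(seed)
    k = len(paths)
    weights = [1.0] * k
    counts = [0] * k
    for _ in range(rounds):
        total = sum(weights)
        # Mix the weight-proportional distribution with uniform exploration.
        probs = [(1 - gamma) * w / total + gamma / k for w in weights]
        chosen = rng.choices(range(k), weights=probs)[0]
        reward = 1.0 - delay_of(paths[chosen])  # low delay means high reward
        # Importance-weighted estimate keeps the reward estimate unbiased.
        estimate = reward / probs[chosen]
        weights[chosen] *= math.exp(gamma * estimate / k)
        # Rescale so the weights never overflow; probabilities are unchanged.
        largest = max(weights)
        weights = [w / largest for w in weights]
        counts[chosen] += 1
    return counts


# Two hypothetical paths with fixed normalised delays; "B" is the faster one.
delays = {"A": 0.9, "B": 0.1}
usage = exp3_route(["A", "B"], lambda path: delays[path], rounds=2000)
print(dict(zip(["A", "B"], usage)))
```

With these fixed delays the learner shifts most of its traffic to the lower-delay path while still probing the other one at a rate governed by gamma.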

preprint · 2012 · arXiv

Near-Infrared Super Resolution Imaging with Metallic Nanoshell Particle Chain Array

We propose a near-infrared super resolution imaging system without a lens or a mirror but with an array of metallic nanoshell particle chains. The imaging array can plasmonically transfer the near-field components of dipole sources in both incoherent and coherent manners, and the super resolution images can be reconstructed in the output plane. By tuning the parameters of the metallic nanoshell particle, the plasmon resonance band of the isolated nanoshell particle red-shifts to the near-infrared region. The near-infrared super resolution images are obtained subsequently. We calculate the field intensity distribution at different planes of the imaging process using the finite element method and find that the array has super resolution imaging capability at near-infrared wavelengths. We also show that the image formation highly depends on the coherence of the dipole sources and the image-array distance.