Researcher profile

Lin Cheng

Lin Cheng contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 19 - UnverifiedVerification L1Unclaimed author
5works
0followers
5topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

5 published item(s)

preprint2022arXiv

Machine Learning Interatomic Potential for Anisotropic Thermal Transport in Bulk Hexagonal Boron Nitride

The highly anisotropic thermal conductivity in layered materials is crucial for a broad range of applications such as thermal management of electronic devices, thermal insulation, and thermoelectrics. Understanding of anisotropic thermal transport in layered materials largely depends on atomistic simulations based on density functional theory (DFT) or empirical potentials, which however suffer either low computational efficiency or accuracy. Recently, machine learning interatomic potentials (MLIPs) are emerging as a powerful tool to bridge the gap. Despite the recent progress in developing MLIPs, little attention has been paid to constructing a potential that can accurately predict the thermal properties of layered materials, which is more challenging compared with the case of isotropic materials because of the highly anisotropic bonding and weak van der Waals interactions in layered materials. Here, we introduce a MLIP within the Gaussian approximation potential (GAP) framework for bulk hexagonal boron nitride (h-BN) with a typical layered structure. The GAP can well predict the highly anisotropic phonon transport properties and thermal conductivity of bulk h-BN with DFT-level accuracy at orders of magnitude reduced cost. Our work demonstrates the ability of GAP to reproduce the subtle features of anisotropic potential energy surfaces of bulk h-BN and potentially other layered materials. Atomistic simulations based on MLIPs are expected to be able to greatly promote the understanding of phonon transport and the prediction of thermophysical properties in layered materials.

preprint2022arXiv

TSGB: Target-Selective Gradient Backprop for Probing CNN Visual Saliency

The explanation for deep neural networks has drawn extensive attention in the deep learning community over the past few years. In this work, we study the visual saliency, a.k.a. visual explanation, to interpret convolutional neural networks. Compared to iteration based saliency methods, single backward pass based saliency methods benefit from faster speed, and they are widely used in downstream visual tasks. Thus, we focus on single backward pass based methods. However, existing methods in this category struggle to uccessfully produce fine-grained saliency maps concentrating on specific target classes. That said, producing faithful saliency maps satisfying both target-selectiveness and fine-grainedness using a single backward pass is a challenging problem in the field. To mitigate this problem, we revisit the gradient flow inside the network, and find that the entangled semantics and original weights may disturb the propagation of target-relevant saliency. Inspired by those observations, we propose a novel visual saliency method, termed Target-Selective Gradient Backprop (TSGB), which leverages rectification operations to effectively emphasize target classes and further efficiently propagate the saliency to the image space, thereby generating target-selective and fine-grained saliency maps. The proposed TSGB consists of two components, namely, TSGB-Conv and TSGB-FC, which rectify the gradients for convolutional layers and fully-connected layers, respectively. Extensive qualitative and quantitative experiments on the ImageNet and Pascal VOC datasets show that the proposed method achieves more accurate and reliable results than the other competitive methods. Code is available at https://github.com/123fxdx/CNNvisualizationTSGB.

preprint2020arXiv

Exploring the parameter reusability of CNN

In recent times, using small data to train networks has become a hot topic in the field of deep learning. Reusing pre-trained parameters is one of the most important strategies to address the issue of semi-supervised and transfer learning. However, the fundamental reason for the success of these methods is still unclear. In this paper, we propose a solution that can not only judge whether a given network is reusable or not based on the performance of reusing convolution kernels but also judge which layers' parameters of the given network can be reused, based on the performance of reusing corresponding parameters and, ultimately, judge whether those parameters are reusable or not in a target task based on the root mean square error (RMSE) of the corresponding convolution kernels. Specifically, we define that the success of a CNN's parameter reuse depends upon two conditions: first, the network is a reusable network; and second, the RMSE between the convolution kernels from the source domain and target domain is small enough. The experimental results demonstrate that the performance of reused parameters applied to target tasks, when these conditions are met, is significantly improved.

preprint2020arXiv

Learning Object Scale With Click Supervision for Object Detection

Weakly-supervised object detection has recently attracted increasing attention since it only requires image-levelannotations. However, the performance obtained by existingmethods is still far from being satisfactory compared with fully-supervised object detection methods. To achieve a good trade-off between annotation cost and object detection performance,we propose a simple yet effective method which incorporatesCNN visualization with click supervision to generate the pseudoground-truths (i.e., bounding boxes). These pseudo ground-truthscan be used to train a fully-supervised detector. To estimatethe object scale, we firstly adopt a proposal selection algorithmto preserve high-quality proposals, and then generate ClassActivation Maps (CAMs) for these preserved proposals by theproposed CNN visualization algorithm called Spatial AttentionCAM. Finally, we fuse these CAMs together to generate pseudoground-truths and train a fully-supervised object detector withthese ground-truths. Experimental results on the PASCAL VOC2007 and VOC 2012 datasets show that the proposed methodcan obtain much higher accuracy for estimating the object scale,compared with the state-of-the-art image-level based methodsand the center-click based method

preprint2019arXiv

Real-Time Optimal Control for Irregular Asteroid Landings Using Deep Neural Networks

Precise soft landings on asteroids are central to many deep space missions for surface exploration and resource exploitation. To improve the autonomy and intelligence of landing control, a real-time optimal control approach is proposed using deep neural networks (DNN) for asteroid landing problems wherein the developed DNN-based landing controller is capable of steering the lander to a preselected landing site with high robustness to initial conditions. First, to significantly reduce the time consumption of gravity calculation, DNNs are used to approximate the irregular gravitational field of the asteroid based on the samples from a polyhedral method. Then, an approximate indirect method is presented to solve the time-optimal landing problems with high computational efficiency by taking advantage of the designed gravity approximation method and a homotopy technique. Furthermore, five DNNs are developed to learn the functional relationship between the state and optimal actions obtained by the approximate indirect method, and the resulting DNNs can generate the optimal control instructions in real time because there is no longer need to solve the optimal landing problems onboard. Finally, a DNN-based landing controller composed of these five DNNs is devised to achieve the real-time optimal control for asteroid landings. Simulation results of the time-optimal landing for Eros are given to substantiate the effectiveness of these techniques and illustrate the real-time performance, control optimality, and robustness of the developed DNN-based optimal landing controller.