Researcher profile

Xinlin Li

Xinlin Li contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
7works
0followers
11topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

7 published item(s)

preprint2022arXiv

Achievements and Lessons Learned from Successful Small Satellite Missions for Space Weather-Oriented Research

When the first CubeSats were launched nearly two decades ago, few people believed that the miniature satellites would likely prove to be a useful scientific tool. Skeptics abounded. However, the last decade has seen the highly successful implementation of space missions that make creative and innovative use of fast-advancing CubeSat and small satellite technology to carry out important science experiments and missions. Several projects now have used CubeSats to obtain first-of-their-kind observations and findings that have formed the basis for high-profile engineering and science publications, thereby establishing without doubt the scientific value and broad utility of CubeSats. In this paper, we describe recent achievements and lessons learned from a representative selection of successful CubeSat missions with a space weather focus. We conclude that these missions were successful in part because their limited resources promoted not only mission focus but also appropriate risk-taking for comparatively high science return. Quantitative analysis of refereed publications from these CubeSat missions and several larger missions reveals that mission outcome metrics compare favorably when publication number is normalized by mission cost or if expressed as a weighted net scientific impact of all mission publications.

preprint2022arXiv

Deep Neural Networks pruning via the Structured Perspective Regularization

In Machine Learning, Artificial Neural Networks (ANNs) are a very powerful tool, broadly used in many applications. Often, the selected (deep) architectures include many layers, and therefore a large amount of parameters, which makes training, storage and inference expensive. This motivated a stream of research about compressing the original networks into smaller ones without excessively sacrificing performances. Among the many proposed compression approaches, one of the most popular is \emph{pruning}, whereby entire elements of the ANN (links, nodes, channels, \ldots) and the corresponding weights are deleted. Since the nature of the problem is inherently combinatorial (what elements to prune and what not), we propose a new pruning method based on Operational Research tools. We start from a natural Mixed-Integer-Programming model for the problem, and we use the Perspective Reformulation technique to strengthen its continuous relaxation. Projecting away the indicator variables from this reformulation yields a new regularization term, which we call the Structured Perspective Regularization, that leads to structured pruning of the initial architecture. We test our method on some ResNet architectures applied to CIFAR-10, CIFAR-100 and ImageNet datasets, obtaining competitive performances w.r.t.~the state of the art for structured pruning.

preprint2022arXiv

Low-bit Shift Network for End-to-End Spoken Language Understanding

Deep neural networks (DNN) have achieved impressive success in multiple domains. Over the years, the accuracy of these models has increased with the proliferation of deeper and more complex architectures. Thus, state-of-the-art solutions are often computationally expensive, which makes them unfit to be deployed on edge computing platforms. In order to mitigate the high computation, memory, and power requirements of inferring convolutional neural networks (CNNs), we propose the use of power-of-two quantization, which quantizes continuous parameters into low-bit power-of-two values. This reduces computational complexity by removing expensive multiplication operations and with the use of low-bit weights. ResNet is adopted as the building block of our solution and the proposed model is evaluated on a spoken language understanding (SLU) task. Experimental results show improved performance for shift neural network architectures, with our low-bit quantization achieving 98.76 \% on the test set which is comparable performance to its full-precision counterpart and state-of-the-art solutions.

preprint2021arXiv

Relativistic electron flux model in the outer radiation belt using a neural network approach

We present a machine-learning-based model of relativistic electron fluxes >1.8 MeV using a neural network approach in the Earth's outer radiation belt. The Outer RadIation belt Electron Neural net model for Relativistic electrons (ORIENT-R) uses only solar wind conditions and geomagnetic indices as input. For the first time, we show that the state of the outer radiation belt can be determined using only solar wind conditions and geomagnetic indices, without any initial and boundary conditions. The most important features for determining outer radiation belt dynamics are found to be AL, solar wind flow speed and density, and SYM-H indices. ORIENT-R reproduces out-of-sample relativistic electron fluxes with a correlation coefficient of 0.95 and an uncertainty factor of ~2. ORIENT-R reproduces radiation belt dynamics during an out-of-sample geomagnetic storm with good agreement to the observations. In addition, ORIENT-R was run for a completely out-of-sample period between March 2018 and October 2019 when the AL index ended and was replaced with the predicted AL index (lasp.colorado.edu/~lix). It reproduces electron fluxes with a correlation coefficient of 0.92 and an out-of-sample uncertainty factor of ~3. Furthermore, ORIENT-R captured the trend in the electron fluxes from low-earth-orbit (LEO) SAMPEX, which is a completely out-of-sample dataset both temporally and spatially. In sum, the ORIENT-R model can reproduce transport, acceleration, decay, and dropouts of the outer radiation belt anywhere from short timescales (i.e., geomagnetic storms) and very long timescales (i.e., solar cycle) variations.

preprint2020arXiv

Importance of Data Loading Pipeline in Training Deep Neural Networks

Training large-scale deep neural networks is a long, time-consuming operation, often requiring many GPUs to accelerate. In large models, the time spent loading data takes a significant portion of model training time. As GPU servers are typically expensive, tricks that can save training time are valuable.Slow training is observed especially on real-world applications where exhaustive data augmentation operations are required. Data augmentation techniques include: padding, rotation, adding noise, down sampling, up sampling, etc. These additional operations increase the need to build an efficient data loading pipeline, and to explore existing tools to speed up training time. We focus on the comparison of two main tools designed for this task, namely binary data format to accelerate data reading, and NVIDIA DALI to accelerate data augmentation. Our study shows improvement on the order of 20% to 40% if such dedicated tools are used.

preprint2020arXiv

Random Bias Initialization Improves Quantized Training

Binary neural networks improve computationally efficiency of deep models with a large margin. However, there is still a performance gap between a successful full-precision training and binary training. We bring some insights about why this accuracy drop exists and call for a better understanding of binary network geometry. We start with analyzing full-precision neural networks with ReLU activation and compare it with its binarized version. This comparison suggests to initialize networks with random bias, a counter-intuitive remedy.

preprint2020arXiv

Tensor train decompositions on recurrent networks

Recurrent neural networks (RNN) such as long-short-term memory (LSTM) networks are essential in a multitude of daily live tasks such as speech, language, video, and multimodal learning. The shift from cloud to edge computation intensifies the need to contain the growth of RNN parameters. Current research on RNN shows that despite the performance obtained on convolutional neural networks (CNN), keeping a good performance in compressed RNNs is still a challenge. Most of the literature on compression focuses on CNNs using matrix product (MPO) operator tensor trains. However, matrix product state (MPS) tensor trains have more attractive features than MPOs, in terms of storage reduction and computing time at inference. We show that MPS tensor trains should be at the forefront of LSTM network compression through a theoretical analysis and practical experiments on NLP task.