Researcher profile

Yu Tang

Yu Tang contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
8works
0followers
8topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

8 published item(s)

preprint2022arXiv

A Novel Four-DOF Lagrangian Approach to Attitude Tracking for Rigid Spacecraft

This paper presents a novel Lagrangian approach to attitude tracking for rigid spacecraft using unit quaternions, where the motion equations of a spacecraft are described by a four degrees of freedom Lagrangian dynamics subject to a holonomic constraint imposed by the norm of a unit quaternion. The basic energy-conservation property as well as some additional useful properties of the Lagrangian dynamics are explored, enabling to develop quaternion-based attitude tracking controllers by taking full advantage of a broad class of tracking control designs for mechanical systems based on energy-shaping methodology. Global tracking of a desired attitude on the unit sphere is achieved by designing control laws that render the tracking error on the four-dimensional Euclidean space to converge to the origin. The topological constraints for globally exponentially tracking by a quaternion-based continuous controller and singularities in controller designs based on any three-parameter representation of the attitude are then avoided. Using this approach, a full-state feedback controller is first developed, and then several important issues, such as robustness to noise in quaternion measurements, unknown on-orbit torque disturbances, uncertainty in the inertial matrix, and lack of angular-velocity measurements are addressed progressively, by designing a hybrid state-feedback controller, an adaptive hybrid state-feedback controller, and an adaptive hybrid attitude-feedback controller. Global asymptotic stability is established for each controller. Simulations are included to illustrate the theoretical results.

preprint2022arXiv

Attitude Tracking for Rigid Bodies Using Vector and Biased Gyro Measurements

The rigid-body attitude tracking using vector and biased gyro measurements with unknown inertia matrix is studied in this note. First, a gyro-bias observer with global exponential stability is designed. Then, an attitude tracking controller based on this observer is devised, ensuring almost global asymptotic stability and almost semiglobal exponential stability. A separation property of the combined observer-controller is proved. Lastly, an adaptive attitude tracking controller relying on a modified gyro-bias observer and with no over-parametrization is developed to deal with the unknown inertia matrix. The proposed control schemes require neither an explicit attitude representation nor any attitude estimation, but only the measurement of at least two non-collinear known inertial reference vectors and biased gyro rate, which can be obtained by common low-cost IMU sensors. Simulations are included to illustrate the proposed adaptive controller under noisy measurements.

preprint2022arXiv

DELTA: Dynamically Optimizing GPU Memory beyond Tensor Recomputation

The further development of deep neural networks is hampered by the limited GPU memory resource. Therefore, the optimization of GPU memory resources is highly demanded. Swapping and recomputation are commonly applied to make better use of GPU memory in deep learning. However, as an emerging domain, several challenges remain:1)The efficiency of recomputation is limited for both static and dynamic methods. 2)Swapping requires offloading parameters manually, which incurs a great time cost. 3) There is no such dynamic and fine-grained method that involves tensor swapping together with tensor recomputation nowadays. To remedy the above issues, we propose a novel scheduler manager named DELTA(Dynamic tEnsor offLoad and recompuTAtion). To the best of our knowledge, we are the first to make a reasonable dynamic runtime scheduler on the combination of tensor swapping and tensor recomputation without user oversight. In DELTA, we propose a filter algorithm to select the optimal tensors to be released out of GPU memory and present a director algorithm to select a proper action for each of these tensors. Furthermore, prefetching and overlapping are deliberately considered to overcome the time cost caused by swapping and recomputing tensors. Experimental results show that DELTA not only saves 40%-70% of GPU memory, surpassing the state-of-the-art method to a great extent but also gets comparable convergence results as the baseline with acceptable time delay. Also, DELTA gains 2.04$\times$ maximum batchsize when training ResNet-50 and 2.25$\times$ when training ResNet-101 compared with the baseline. Besides, comparisons between the swapping cost and recomputation cost in our experiments demonstrate the importance of making a reasonable dynamic scheduler on tensor swapping and tensor recomputation, which refutes the arguments in some related work that swapping should be the first and best choice.

preprint2022arXiv

P-ADMMiRNN: Training RNN with Stable Convergence via An Efficient and Paralleled ADMM Approach

It is hard to train Recurrent Neural Network (RNN) with stable convergence and avoid gradient vanishing and exploding problems, as the weights in the recurrent unit are repeated from iteration to iteration. Moreover, RNN is sensitive to the initialization of weights and bias, which brings difficulties in training. The Alternating Direction Method of Multipliers (ADMM) has become a promising algorithm to train neural networks beyond traditional stochastic gradient algorithms with the gradient-free features and immunity to unsatisfactory conditions. However, ADMM could not be applied to train RNN directly since the state in the recurrent unit is repetitively updated over timesteps. Therefore, this work builds a new framework named ADMMiRNN upon the unfolded form of RNN to address the above challenges simultaneously. We also provide novel update rules and theoretical convergence analysis. We explicitly specify essential update rules in the iterations of ADMMiRNN with constructed approximation techniques and solutions to each sub-problem instead of vanilla ADMM. Numerical experiments are conducted on MNIST, IMDb, and text classification tasks. ADMMiRNN achieves convergent results and outperforms the compared baselines. Furthermore, ADMMiRNN trains RNN more stably without gradient vanishing or exploding than stochastic gradient algorithms. We also provide a distributed paralleled algorithm regarding ADMMiRNN, named P-ADMMiRNN, including Synchronous Parallel ADMMiRNN (SP-ADMMiRNN) and Asynchronous Parallel ADMMiRNN (AP-ADMMiRNN), which is the first to train RNN with ADMM in an asynchronous parallel manner. The source code is publicly available.

preprint2022arXiv

Resilient Ramp Control for Highways Facing Stochastic Perturbations

Highway capacity is often subject to stochastic perturbations due to the combined effects of weather, traffic mixture, driver behavior, etc. This paper is motivated by the need of a systematic approach to traffic control with performance guarantees in the face of such perturbations. We develop a novel control-theoretic method for designing perturbation-resilient ramp metering. We consider a cell-transmission model with 1) Markovian cell capacities and 2) buffers representing on-ramps and upstream mainline. Using this model, we analyze the stability of on-ramp queues by constructing piecewise Lyapunov functions that consider the nature of nonlinear traffic dynamics. Then, we design ramp controllers that guarantee bounds for throughput and queue sizes. We also formulate the problem of coordinated ramp metering as a bi-level optimization with non-convex inner sub-problems. To address the computational issue in solving this problem, we also consider localized and partially coordinated reformulations. A case study of a 18.1-km highway in Los Angeles, USA indicates a 8.3\% (resp. 9.9\%) reduction of vehicle-hours-traveled obtained by the localized (resp. partially coordinated) control, both outperforming the classical ALINEA and METALINE controllers.

preprint2022arXiv

SAD: A Large-scale Dataset towards Airport Detection in Synthetic Aperture Radar Images

Airports have an important role in both military and civilian domains. The synthetic aperture radar (SAR) based airport detection has received increasing attention in recent years. However, due to the high cost of SAR imaging and annotation process, there is no publicly available SAR dataset for airport detection. As a result, deep learning methods have not been fully used in airport detection tasks. To provide a benchmark for airport detection research in SAR images, this paper introduces a large-scale SAR Airport Dataset (SAD). In order to adequately reflect the demands of real world applications, it contains 624 SAR images from Sentinel 1B and covers 104 airfield instances with different scales, orientations and shapes. The experiments of multiple deep learning approach on this dataset proves its effectiveness. It developing state-of-the-art airport area detection algorithms or other relevant tasks.

preprint2021arXiv

Thermal stress around a smooth cavity in a plate subjected to uniform heat flux

The two-dimensional thermoelastic problem of an adiabatic cavity in an infinite isotropic homogeneous medium subjected to uniform heat flux is studied, where the shape of the cavity is characterized by the Laurent polynomial. By virtue of a novel tactics, the obtained K-M potentials can be explicitly worked out to satisfy the boundary conditions precisely, and the possible translation of the cavity is also available. The new and explicit analytical solutions are compared with the those reported in literature and some serious problems are found and corrected. Finally, some discussions on the thermal stress concentration around the tips of three typical cavities are provided.

preprint2020arXiv

Median regression with differential privacy

Median regression analysis has robustness properties which make it attractive compared with regression based on the mean, while differential privacy can protect individual privacy during statistical analysis of certain datasets. In this paper, three privacy preserving methods are proposed for median regression. The first algorithm is based on a finite smoothing method, the second provides an iterative way and the last one further employs the greedy coordinate descent approach. Privacy preserving properties of these three methods are all proved. Accuracy bound or convergence properties of these algorithms are also provided. Numerical calculation shows that the first method has better accuracy than the others when the sample size is small. When the sample size becomes larger, the first method needs more time while the second method needs less time with well-matched accuracy. For the third method, it costs less time in both cases, while it highly depends on step size.