Source author record

Tianyi Liu

Tianyi Liu appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher (unclaimed source record)

Topics: Machine Learning; Computer Vision; physics.app-ph; physics.optics; Applications; Artificial Intelligence; cond-mat.soft; cond-mat.str-el; cond-mat.supr-con; Cryptography and Security; Distributed, Parallel, and Cluster Computing; Graphics; math.OC; Neural and Evolutionary Computing; q-fin.RM

Catalog footprint: 15 works, 15 topics, 4 close collaborators

preprint, 2026, arXiv

DAOS: A Multimodal In-cabin Behavior Monitoring with Driver Action-Object Synergy Dataset

In driver activity monitoring, movements are mostly limited to the upper body, which makes many actions look similar. To tell these actions apart, humans often rely on the objects the driver is using, such as holding a phone compared with gripping the steering wheel. However, most existing driver-monitoring datasets lack accurate object-location annotations or do not link objects to their associated actions, leaving a critical gap for reliable action recognition. To address this, we introduce the Driver Action with Object Synergy (DAOS) dataset, comprising 9,787 video clips annotated with 36 fine-grained driver actions and 15 object classes, totaling more than 2.5 million corresponding object instances. DAOS offers multi-modal, multi-view data (RGB, IR, and depth) from front, face, left, and right perspectives. Although DAOS captures a wide range of cabin objects, only a few are directly relevant to each action for prediction, so focusing on task-specific human-object relations is essential. To tackle this challenge, we propose the Action-Object-Relation Network (AOR-Net). AOR-Net comprehends complex driver actions through multi-level reasoning and a chain-of-action prompting mechanism that models the logical relationships among actions, objects, and their relations. Additionally, the Mixture of Thoughts module is introduced to dynamically select essential knowledge at each stage, enhancing robustness in object-rich and object-scarce conditions. Extensive experiments demonstrate that our model outperforms other state-of-the-art methods on various datasets.

preprint, 2026, arXiv

Enhanced superconducting correlations in the Emery model and its connections to strange metallic transport and normal state coherence

Numerical evidence for superconductivity in the single-band Hubbard model is elusive or ambiguous despite extensive study, raising the question of whether the single-band Hubbard model is a faithful low energy effective model for cuprates, and whether explicitly including the oxygen ions will recover the properties necessary for a superconducting transition. Here we show, by using numerically exact determinant quantum Monte Carlo (DQMC) simulations of the doped Emery model, that while the single-band model exhibits strikingly T-linear resistivity, the three-band model crosses the resistivity of the single-band model from above, indicating a crossover to a more metallic transport regime. The enhanced conductivity is mainly contributed by a steep increase in the diffusivity of the three-band model at the crossover, suggesting that three-band transport is more coherent than single-band transport at lower temperatures. Below the same crossover temperature in the three-band model, the pair-field susceptibility increases more steeply than at higher temperatures or when compared to the single-band model. This suggests a possible connection between superconductivity and coherent transport, and further implies that coherent transport might be necessary for a model to capture the high-temperature superconductivity observed in hole-doped cuprates.

preprint, 2022, arXiv

Differentially Private Multi-Party Data Release for Linear Regression

Differentially Private (DP) data release is a promising technique to disseminate data without compromising the privacy of data subjects. However, the majority of prior work has focused on scenarios where a single party owns all the data. In this paper we focus on the multi-party setting, where different stakeholders own disjoint sets of attributes belonging to the same group of data subjects. Within the context of linear regression that allows all parties to train models on the complete data without the ability to infer private attributes or identities of individuals, we start with directly applying the Gaussian mechanism and show it has the small eigenvalue problem. We further propose our novel method and prove it asymptotically converges to the optimal (non-private) solutions with increasing dataset size. We substantiate the theoretical results through experiments on both artificial and real-world datasets.

preprint, 2022, arXiv

Noise Regularizes Over-parameterized Rank One Matrix Recovery, Provably

We investigate the role of noise in optimization algorithms for learning over-parameterized models. Specifically, we consider the recovery of a rank one matrix $Y^*\in \mathbb{R}^{d\times d}$ from a noisy observation $Y$ using an over-parameterized model. We parameterize the rank one matrix $Y^*$ by $XX^\top$, where $X\in \mathbb{R}^{d\times d}$. We then show that under mild conditions, the estimator, obtained by the randomly perturbed gradient descent algorithm using the square loss function, attains a mean square error of $O(\sigma^2/d)$, where $\sigma^2$ is the variance of the observational noise. In contrast, the estimator obtained by gradient descent without random perturbation only attains a mean square error of $O(\sigma^2)$. Our result partially justifies the implicit regularization effect of noise when learning over-parameterized models, and provides new understanding of training over-parameterized neural networks.

preprint, 2022, arXiv

Tag-assisted Multimodal Sentiment Analysis under Uncertain Missing Modalities

Multimodal sentiment analysis has been studied under the assumption that all modalities are available. However, such a strong assumption does not always hold in practice, and most multimodal fusion models may fail when some modalities are missing. Several works have addressed the missing modality problem, but most of them only considered the case of a single missing modality and ignored the practically more general case of multiple missing modalities. To this end, in this paper, we propose a Tag-Assisted Transformer Encoder (TATE) network to handle the problem of uncertain missing modalities. Specifically, we design a tag encoding module to cover both the single and multiple missing modality cases, so as to guide the network's attention to those missing modalities. Besides, we adopt a new space projection pattern to align common vectors. Then, a Transformer encoder-decoder network is utilized to learn the missing modality features. Finally, the outputs of the Transformer encoder are used for the final sentiment classification. Extensive experiments are conducted on the CMU-MOSI and IEMOCAP datasets, showing that our method can achieve significant improvements compared with several baselines.

preprint, 2021, arXiv

A Diffusion Approximation Theory of Momentum SGD in Nonconvex Optimization

The Momentum Stochastic Gradient Descent (MSGD) algorithm has been widely applied to many nonconvex optimization problems in machine learning, e.g., training deep neural networks and variational Bayesian inference. Despite its empirical success, there is still a lack of theoretical understanding of the convergence properties of MSGD. To fill this gap, we propose to analyze the algorithmic behavior of MSGD by diffusion approximations for nonconvex optimization problems with strict saddle points and isolated local optima. Our study shows that the momentum helps escape from saddle points, but hurts the convergence within the neighborhood of optima (without step size annealing or momentum annealing). Our theoretical discovery partially corroborates the empirical success of MSGD in training deep neural networks.
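As an illustration of the setting this abstract describes, here is a minimal sketch of heavy-ball momentum SGD on a toy objective with one strict saddle and two isolated minima. The objective, step size, momentum value, noise level and function names are all assumptions chosen for the example; none of them come from the paper, and the sketch does not reproduce its diffusion analysis.

```python
import random


def grad(x, y):
    """Gradient of f(x, y) = 0.5*x**2 + 0.25*y**4 - 0.5*y**2.

    f has a strict saddle at (0, 0) and isolated minima at (0, 1) and (0, -1).
    """
    return x, y**3 - y


def momentum_sgd(steps=5000, lr=0.01, momentum=0.9, noise_std=0.1, seed=0):
    """Heavy-ball SGD started exactly at the saddle (hypothetical settings)."""
    rng = random.Random(seed)
    x, y = 0.0, 0.0      # iterate, initialised at the saddle point
    vx, vy = 0.0, 0.0    # velocity (momentum buffer)
    for _ in range(steps):
        gx, gy = grad(x, y)
        # Stochastic gradient: exact gradient plus zero-mean Gaussian noise.
        gx += rng.gauss(0.0, noise_std)
        gy += rng.gauss(0.0, noise_std)
        # Heavy-ball update: decay the velocity, then step along it.
        vx = momentum * vx - lr * gx
        vy = momentum * vy - lr * gy
        x, y = x + vx, y + vy
    return x, y


final_x, final_y = momentum_sgd()
print(f"final iterate: x={final_x:.3f}, y={final_y:.3f}")
```

In this toy run the gradient noise pushes the iterate off the saddle and it settles near one of the two minima. Setting `momentum=0.0` gives plain SGD for comparison.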

preprint, 2021, arXiv

Noisy Gradient Descent Converges to Flat Minima for Nonconvex Matrix Factorization

Ample empirical evidence has corroborated the importance of noise in nonconvex optimization problems. The theory behind such empirical observations, however, is still largely unknown. This paper studies this fundamental problem by investigating the nonconvex rectangular matrix factorization problem, which has infinitely many global minima due to rotation and scaling invariance. Hence, gradient descent (GD) can converge to any optimum, depending on the initialization. In contrast, we show that a perturbed form of GD with an arbitrary initialization converges to a global optimum that is uniquely determined by the injected noise. Our result implies that the noise imposes implicit bias towards certain optima. Numerical experiments are provided to support our theory.
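The contrast between plain and perturbed GD can be seen in a scalar toy problem, sketched below under stated assumptions. The objective f(x, y) = 0.5*(x*y - 1)^2 is minimised everywhere on the curve x*y = 1, a one-dimensional analogue of scaling invariance. The perturbation scheme (evaluating the gradient at a randomly perturbed iterate), the noise level, the step size and the function names are illustrative choices, not the paper's algorithm or constants.

```python
import random


def grad(x, y):
    """Gradient of f(x, y) = 0.5 * (x*y - 1)**2, minimised on the curve x*y = 1."""
    r = x * y - 1.0
    return r * y, r * x


def descend(x, y, noise_std, steps=20000, lr=0.01, seed=0):
    """Gradient descent with the gradient evaluated at a perturbed iterate.

    noise_std = 0 recovers plain gradient descent.
    """
    rng = random.Random(seed)
    for _ in range(steps):
        # Perturb the current iterate, then take the gradient there.
        px = x + rng.gauss(0.0, noise_std)
        py = y + rng.gauss(0.0, noise_std)
        gx, gy = grad(px, py)
        x, y = x - lr * gx, y - lr * gy
    return x, y


start = (2.0, 0.1)
gd_x, gd_y = descend(*start, noise_std=0.0)
pgd_x, pgd_y = descend(*start, noise_std=0.3)
print(f"plain GD:     x={gd_x:.3f}, y={gd_y:.3f}, x*y={gd_x * gd_y:.3f}")
print(f"perturbed GD: x={pgd_x:.3f}, y={pgd_y:.3f}, x*y={pgd_x * pgd_y:.3f}")
```

Plain GD stops at an unbalanced point on the curve that depends on where it started. In this toy, averaging the perturbed gradient over the noise adds a term proportional to noise_std^2 * (x, y), which acts like a small penalty on x^2 + y^2 and drifts the iterate toward the balanced solution with x close to y. With a fixed noise level the product x*y settles slightly below 1.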

preprint, 2021, arXiv

Towards Understanding Acceleration Tradeoff between Momentum and Asynchrony in Nonconvex Stochastic Optimization

Asynchronous momentum stochastic gradient descent (Async-MSGD) is one of the most popular algorithms in distributed machine learning. However, its convergence properties for complicated nonconvex problems are still largely unknown because of current technical limits. Therefore, in this paper, we propose to analyze the algorithm through a simpler but nontrivial nonconvex problem, streaming PCA, which helps us to understand Async-MSGD better even for more general problems. Specifically, we establish the asymptotic rate of convergence of Async-MSGD for streaming PCA by diffusion approximation. Our results indicate a fundamental tradeoff between asynchrony and momentum: to ensure convergence and acceleration through asynchrony, we have to reduce the momentum (compared with Sync-MSGD). To the best of our knowledge, this is the first theoretical attempt at understanding Async-MSGD for distributed nonconvex stochastic optimization. Numerical experiments on both streaming PCA and training deep neural networks are provided to support our findings for Async-MSGD.

preprint, 2020, arXiv

A Benchmarking Framework for Interactive 3D Applications in the Cloud

With the growing popularity of cloud gaming and cloud virtual reality (VR), interactive 3D applications have become a major type of workload for the cloud. However, despite their growing importance, there is limited public research on how to design cloud systems to efficiently support these applications, due to the lack of an open and reliable research infrastructure, including benchmarks and performance analysis tools. The challenges of generating human-like inputs under various system/application randomness and dissecting the performance of complex graphics systems make it very difficult to design such an infrastructure. In this paper, we present the design of a novel cloud graphics rendering research infrastructure, Pictor. Pictor employs AI to mimic human interactions with complex 3D applications. It can also provide in-depth performance measurements for the complex software and hardware stack used for cloud 3D graphics rendering. With Pictor, we designed a benchmark suite with six interactive 3D applications. Performance analyses were conducted with these benchmarks to characterize 3D applications in the cloud and reveal new performance bottlenecks. To demonstrate the effectiveness of Pictor, we also implemented two optimizations to address two performance bottlenecks discovered in a state-of-the-art cloud 3D-graphics rendering system, which improved the frame rate by 57.7% on average.

preprint, 2020, arXiv

Monolithic piezoelectric control of soliton microcombs

High-speed laser frequency actuation is critical in all applications employing lasers and frequency combs, and is a prerequisite for phase locking, frequency stabilization and stability transfer among multiple optical carriers. Soliton microcombs have emerged as chip-scale, broadband and low-power-consumption frequency comb sources. Yet, integrated microcombs relying on thermal heaters for on-chip actuation all exhibit only kilohertz actuation bandwidth. Consequently, high-speed actuation and locking of microcombs have been attained only with off-chip bulk modulators. Here, we present high-speed microcomb actuation using integrated components. By monolithically integrating piezoelectric AlN actuators on ultralow-loss Si3N4 photonic circuits, we demonstrate voltage-controlled soliton tuning, modulation and stabilization. The integrated AlN actuators feature bi-directional tuning with high linearity and low hysteresis, operate with 300 nW power and exhibit flat actuation response up to megahertz frequencies, significantly exceeding bulk piezo tuning bandwidth. We use this novel capability to demonstrate a microcomb engine for parallel FMCW LiDAR, via synchronously tuning the laser and microresonator. By applying a triangular sweep at the modulation rate matching the frequency spacing of HBAR modes, we exploit the resonant build-up of bulk acoustic energy to significantly lower the required driving to a CMOS voltage of only 7 Volts. Our approach endows soliton microcombs with integrated, ultralow-power-consumption, and fast actuation, significantly expanding the repertoire of technological applications.

preprint, 2020, arXiv

On Computation and Generalization of Generative Adversarial Imitation Learning

Generative Adversarial Imitation Learning (GAIL) is a powerful and practical approach for learning sequential decision-making policies. Unlike Reinforcement Learning (RL), GAIL takes advantage of demonstration data from experts (e.g., humans), and learns both the policy and reward function of the unknown environment. Despite the significant empirical progress, the theory behind GAIL is still largely unknown. The major difficulty comes from the underlying temporal dependency of the demonstration data and the minimax computational formulation of GAIL without convex-concave structure. To bridge such a gap between theory and practice, this paper investigates the theoretical properties of GAIL. Specifically, we show: (1) For GAIL with general reward parameterization, the generalization can be guaranteed as long as the class of the reward functions is properly controlled; (2) For GAIL, where the reward is parameterized as a reproducing kernel function, GAIL can be efficiently solved by stochastic first order optimization algorithms, which attain sublinear convergence to a stationary solution. To the best of our knowledge, these are the first results on statistical and computational guarantees of imitation learning with reward/policy function approximation. Numerical experiments are provided to support our analysis.

preprint, 2020, arXiv

Online Quantification of Input Model Uncertainty by Two-Layer Importance Sampling

Stochastic simulation has been widely used to analyze the performance of complex stochastic systems and facilitate decision making in those systems. Stochastic simulation is driven by the input model, which is a collection of probability distributions that model the stochasticity in the system. The input model is usually estimated using a finite amount of data, which introduces the so-called input model uncertainty to the simulation output. How to quantify input uncertainty has been studied extensively, and many methods have been proposed for the batch data setting, i.e., when all the data are available at once. However, methods for "streaming data" arriving sequentially in time are still in demand, even though streaming data have become increasingly prevalent in modern applications. To fill this gap, we propose a two-layer importance sampling framework that incorporates streaming data for online input uncertainty quantification. Under this framework, we develop two algorithms that suit different application scenarios: the first scenario is when data come at a fast speed and there is no time for any new simulation in between updates; the second is when data come at a moderate speed and a few but limited simulations are allowed at each time stage. We prove the consistency and asymptotic convergence rate results, which theoretically show the efficiency of our proposed approach. We further demonstrate the proposed algorithms on a numerical example of the news vendor problem.
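The basic building block behind reusing simulations when the input model is updated is likelihood-ratio reweighting, sketched below on a news vendor example. This is a single-layer illustration only, not the two-layer framework proposed in the paper. The exponential demand model, the price, cost and order quantity, the two rate values and all function names are assumptions made for the example.

```python
import math
import random


def profit(demand, order_qty, price=5.0, cost=2.0):
    """News vendor profit: revenue on units sold minus purchase cost."""
    return price * min(demand, order_qty) - cost * order_qty


def likelihood_ratio(demand, new_rate, old_rate):
    """Density ratio of Exp(new_rate) to Exp(old_rate) at the sampled demand."""
    new_density = new_rate * math.exp(-new_rate * demand)
    old_density = old_rate * math.exp(-old_rate * demand)
    return new_density / old_density


def reweighted_mean_profit(demands, order_qty, new_rate, old_rate):
    """Estimate expected profit under new_rate from demands simulated under old_rate."""
    total = sum(
        profit(d, order_qty) * likelihood_ratio(d, new_rate, old_rate)
        for d in demands
    )
    return total / len(demands)


def exact_mean_profit(order_qty, rate, price=5.0, cost=2.0):
    """Closed form, using E[min(D, q)] = (1 - exp(-rate*q)) / rate for exponential demand."""
    return price * (1.0 - math.exp(-rate * order_qty)) / rate - cost * order_qty


rng = random.Random(0)
old_rate, new_rate, order_qty = 1.0, 1.2, 1.0
# Demands are simulated once under the old input model and then reused.
demands = [rng.expovariate(old_rate) for _ in range(100_000)]

estimate = reweighted_mean_profit(demands, order_qty, new_rate, old_rate)
exact = exact_mean_profit(order_qty, new_rate)
print(f"reweighted estimate: {estimate:.3f}, closed form: {exact:.3f}")
```

When new data shift the estimated demand rate from `old_rate` to `new_rate`, the stored demands are reweighted instead of being simulated again, which is the situation the fast-data scenario in the abstract describes.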

preprint, 2019, arXiv

Integrated turnkey soliton microcombs operated at CMOS frequencies

While soliton microcombs offer the potential for integration of powerful frequency metrology and precision spectroscopy systems, their operation requires complex startup and feedback protocols that necessitate difficult-to-integrate optical and electrical components. Moreover, CMOS-rate microcombs, required in nearly all comb systems, have resisted integration because of their power requirements. Here, a regime for turnkey operation of soliton microcombs co-integrated with a pump laser is demonstrated and theoretically explained. Significantly, a new operating point is shown to appear from which solitons are generated through binary turn-on and turn-off of the pump laser, thereby eliminating all photonic/electronic control circuitry. These features are combined with high-Q $Si_3N_4$ resonators to fully integrate into a butterfly package microcombs with CMOS frequencies as low as 15 GHz, offering compelling advantages for high-volume production.

preprint, 2016, arXiv

Length Scale of Correlated Dynamics in Ultra-thin Molecular Glasses

Physical vapor deposition (PVD) is widely used in manufacturing ultra-thin layers of amorphous organic solids. Here, we demonstrate that these films exhibit a sharp transition from glassy solid to liquid-like behavior with thickness below 30 nm. This liquid-like behavior persists even at temperatures well below the glass transition temperature, T$_{\mathrm{g}}$. The enhanced dynamics in these films can produce large scale morphological features during PVD and lead to a dewetting instability in films held at temperatures as low as T$_{\mathrm{g}}$-35 K. We measure the effective viscosity of organic glass films by monitoring the dewetting kinetics. These measurements combined with cooling rate-dependent T$_{\mathrm{g}}$ measurements show that the apparent activation barrier for rearrangement decreases sharply in films thinner than 30 nm. These observations suggest long-range facilitation of dynamics induced by the free surface, with dramatic effects on the properties of nano-scale amorphous materials.

preprint, 2015, arXiv

Implementation of Training Convolutional Neural Networks

Deep learning is a prominent branch of machine learning based on learning multiple levels of representation. A Convolutional Neural Network (CNN) is one kind of deep neural network, and it lends itself to concurrent computation. In this article, we give a detailed analysis of the CNN algorithm, covering both the forward pass and backpropagation. We then apply a particular convolutional neural network to a typical face recognition problem, implemented in Java. A parallel strategy is proposed in Section 4. In addition, by measuring the actual time of forward and backward computation, we analyse the maximal speedup and parallel efficiency theoretically.
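One common way to turn measured forward and backward times into a theoretical speedup bound is Amdahl's law, sketched below. Whether the paper uses this exact model is an assumption, and the timings and function names are hypothetical values chosen for illustration.

```python
def amdahl_speedup(parallel_fraction, workers):
    """Upper bound on speedup when only parallel_fraction of the runtime scales."""
    serial_fraction = 1.0 - parallel_fraction
    return 1.0 / (serial_fraction + parallel_fraction / workers)


def parallel_efficiency(speedup, workers):
    """Speedup per worker; 1.0 means perfect scaling."""
    return speedup / workers


# Hypothetical timings in seconds for one training pass (not from the paper).
forward_time, backward_time, serial_time = 3.0, 6.0, 1.0
parallel_time = forward_time + backward_time
parallel_fraction = parallel_time / (parallel_time + serial_time)

for workers in (2, 4, 8):
    s = amdahl_speedup(parallel_fraction, workers)
    e = parallel_efficiency(s, workers)
    print(f"{workers} workers: speedup <= {s:.2f}, efficiency <= {e:.2f}")
```

The bound flattens as workers are added because the serial part of each pass does not shrink, so efficiency falls even while speedup keeps rising.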
