Source author record

Miao Liu

Miao Liu appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher · Unclaimed source record

Computer Vision, Machine Learning, Artificial Intelligence, cond-mat.mtrl-sci, physics.comp-ph, Systems and Control, eess.SY, cond-mat.mes-hall, cond-mat.other, Multiagent Systems, Networking and Internet Architecture, eess.SP, Robotics

Catalog footprint

What is connected

24 works

13 topics

4 close collaborators

preprint · 2026 · arXiv

A Multimodal Pre-trained Network for Integrated EEG-Video Seizure Detection

Reliable seizure detection in mouse models is essential for preclinical epilepsy research, yet manual review of synchronized video-EEG recordings is labor-intensive and single-modality systems fail for complementary reasons: video-based methods are easily confounded by benign behaviors, whereas EEG-based methods are vulnerable to ictal motion artifacts. We present EEGVFusion, a multimodal framework that combines self-supervised EEG representation learning, spatio-temporal video encoding, optimal-transport alignment, and bidirectional cross-attention to integrate neural and behavioral evidence. We also curate an expert-annotated dataset of synchronized EEG and video recordings comprising 93 sessions from 15 mice for training and evaluation. In the random-session split, EEGVFusion achieved a Balanced Accuracy of 0.9957 with perfect event sensitivity and an Event FAR of 0.6250 FP/h, indicating strong seizure detection performance with a low false-alarm burden. In a single held-out-subject evaluation with Subject 110 reserved for testing, EEGVFusion achieved a Balanced Accuracy of 0.9718 and reduced Event FAR from 2.7250 FP/h for the EEG-only counterpart to 0.4833 FP/h while preserving perfect event sensitivity. Targeted ablations further showed that EEG pre-training and OT alignment help reduce false alarms while preserving event sensitivity.
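The two headline metrics in this abstract can be illustrated with a minimal sketch. It assumes the standard definitions (balanced accuracy as the mean of sensitivity and specificity, and event FAR as false-positive events per recording hour); the abstract does not say how windows or events are counted, and the function names and counts below are hypothetical.

```python
def balanced_accuracy(tp: int, fn: int, tn: int, fp: int) -> float:
    """Mean of sensitivity (true-positive rate) and specificity (true-negative rate)."""
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    return (sensitivity + specificity) / 2.0


def event_far(false_positive_events: int, recording_hours: float) -> float:
    """False-alarm rate in false-positive events per hour of recording."""
    return false_positive_events / recording_hours


if __name__ == "__main__":
    # Hypothetical counts, chosen only to illustrate the arithmetic;
    # they are not taken from the paper.
    ba = balanced_accuracy(tp=90, fn=10, tn=950, fp=50)
    far = event_far(false_positive_events=3, recording_hours=12.0)
    print(f"balanced accuracy = {ba:.4f}, event FAR = {far:.4f} FP/h")
```

Balanced accuracy is a natural choice when seizures are rare, because plain accuracy would be dominated by the non-seizure class.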

preprint · 2022 · arXiv

An efficient distributed scheduling algorithm for relay-assisted mmWave backhaul networks

In this paper, a novel distributed scheduling algorithm is proposed, which aims to efficiently schedule both the uplink and downlink backhaul traffic in the relay-assisted mmWave backhaul network with a tree topology. The handshaking of control messages, the calculation of local schedules, and the determination of the final valid schedule are all discussed. Simulation results show that the performance of the distributed algorithm can come very close to the maximum traffic demand of the backhaul network, and that it can adapt quickly and accurately to dynamic traffic with sharp changes in the traffic demand of small-cell BSs.

preprint · 2022 · arXiv

Context-Specific Representation Abstraction for Deep Option Learning

Hierarchical reinforcement learning has focused on discovering temporally extended actions, such as options, that can provide benefits in problems requiring extensive exploration. One promising approach that learns these options end-to-end is the option-critic (OC) framework. We examine and show in this paper that OC does not decompose a problem into simpler sub-problems, but instead increases the size of the search over policy space with each option considering the entire state space during learning. This issue can result in practical limitations of this method, including sample inefficient learning. To address this problem, we introduce Context-Specific Representation Abstraction for Deep Option Learning (CRADOL), a new framework that considers both temporal abstraction and context-specific representation abstraction to effectively reduce the size of the search over policy space. Specifically, our method learns a factored belief state representation that enables each option to learn a policy over only a subsection of the state space. We test our method against hierarchical, non-hierarchical, and modular recurrent neural network baselines, demonstrating significant sample efficiency improvements in challenging partially observable environments.

preprint · 2022 · arXiv

Ego4D: Around the World in 3,000 Hours of Egocentric Video

We introduce Ego4D, a massive-scale egocentric video dataset and benchmark suite. It offers 3,670 hours of daily-life activity video spanning hundreds of scenarios (household, outdoor, workplace, leisure, etc.) captured by 931 unique camera wearers from 74 worldwide locations and 9 different countries. The approach to collection is designed to uphold rigorous privacy and ethics standards with consenting participants and robust de-identification procedures where relevant. Ego4D dramatically expands the volume of diverse egocentric video footage publicly available to the research community. Portions of the video are accompanied by audio, 3D meshes of the environment, eye gaze, stereo, and/or synchronized videos from multiple egocentric cameras at the same event. Furthermore, we present a host of new benchmark challenges centered around understanding the first-person visual experience in the past (querying an episodic memory), present (analyzing hand-object manipulation, audio-visual conversation, and social interactions), and future (forecasting activities). By publicly sharing this massive annotated dataset and benchmark suite, we aim to push the frontier of first-person perception. Project page: https://ego4d-data.org/

preprint · 2022 · arXiv

Egocentric Activity Recognition and Localization on a 3D Map

Given a video captured from a first person perspective and the environment context of where the video is recorded, can we recognize what the person is doing and identify where the action occurs in the 3D space? We address this challenging problem of jointly recognizing and localizing actions of a mobile user on a known 3D map from egocentric videos. To this end, we propose a novel deep probabilistic model. Our model takes the inputs of a Hierarchical Volumetric Representation (HVR) of the 3D environment and an egocentric video, infers the 3D action location as a latent variable, and recognizes the action based on the video and contextual cues surrounding its potential locations. To evaluate our model, we conduct extensive experiments on a subset of the Ego4D dataset, in which both human naturalistic actions and photo-realistic 3D environment reconstructions are captured. Our method demonstrates strong results on both action recognition and 3D action localization across seen and unseen environments. We believe our work points to an exciting research direction at the intersection of egocentric vision and 3D scene understanding.

preprint · 2022 · arXiv

Generative Adversarial Network for Future Hand Segmentation from Egocentric Video

We introduce the novel problem of anticipating a time series of future hand masks from egocentric video. A key challenge is to model the stochasticity of future head motions, which globally impact the head-worn camera video analysis. To this end, we propose a novel deep generative model -- EgoGAN, which uses a 3D Fully Convolutional Network to learn a spatio-temporal video representation for pixel-wise visual anticipation, generates future head motion using a Generative Adversarial Network (GAN), and then predicts the future hand masks based on the video representation and the generated future head motion. We evaluate our method on both the EPIC-Kitchens and the EGTEA Gaze+ datasets. We conduct detailed ablation studies to validate the design choices of our approach. Furthermore, we compare our method with previous state-of-the-art methods on future image segmentation and show that our method can more accurately predict future hand masks.

preprint · 2022 · arXiv

Screening promising CsV3Sb5-like kagome materials from systematic first-principles evaluation

The CsV3Sb5 kagome lattice holds promise for manifesting electron correlation, topology, and superconductivity. However, so far only three CsV3Sb5-like kagome materials have been identified experimentally. In this work, we enlarge this family of materials to 1386 compounds via element species substitution, and a further screening process suggests that 28 promising candidates have superior thermodynamic stability, hence they are highly likely to be synthesized. Moreover, these compounds possess several identical electronic structures, and can be categorized into five non-magnetic and three magnetic groups accordingly. It is our hope that this work can greatly expand the viable phase space of the CsV3Sb5-like materials for investigating or tuning the novel quantum phenomena in the kagome lattice.

preprint · 2022 · arXiv

Towards the Maximum Traffic Demand and Throughput Supported by Relay-Assisted mmWave Backhaul Networks

This paper investigates the throughput performance issue of the relay-assisted mmWave backhaul network. The maximum traffic demand of small-cell base stations (BSs) and the maximum throughput at the macro-cell BS have been found in a tree-style backhaul network through linear programming under different network settings, which concern both the number of radio chains available on BSs and the interference relationship between logical links in the backhaul network. A novel interference model for the relay-assisted mmWave backhaul network in the dense urban environment is proposed, which demonstrates the limited interference footprint of mmWave directional communications. Moreover, a scheduling algorithm is developed to find the optimal scheduling for tree-style mmWave backhaul networks. Extensive numerical analysis and simulations are conducted to show and validate the network throughput performance and the scheduling algorithm.

preprint · 2022 · arXiv

Unconventional Materials: the mismatch between electronic charge centers and atomic positions

The complete band representations (BRs) have been constructed in the work of topological quantum chemistry. Each BR is expressed by either a localized orbital at a Wyckoff site in real space, or by a set of irreducible representations in momentum space. In this work, we define unconventional materials with a common feature of the mismatch between average electronic centers and atomic positions. They can be effectively diagnosed as materials whose occupied bands can be expressed as a sum of elementary BRs (eBRs), but not a sum of atomic-orbital-induced BRs (aBRs). The existence of an essential BR at an empty site is described by nonzero real-space invariants (RSIs). The "valence" states can be derived by the aBR decomposition, and unconventional materials are supposed to have an uncompensated total "valence" state. The high-throughput screening for unconventional materials has been performed through first-principles calculations. We have discovered 423 unconventional compounds, including thermoelectronic materials, higher-order topological insulators, electrides, hydrogen storage materials, hydrogen evolution reaction electrocatalysts, electrodes, and superconductors. The diversity of these interesting properties and applications would be widely studied in the future.

preprint · 2020 · arXiv

Attention Distillation for Learning Video Representations

We address the challenging problem of learning motion representations using deep models for video recognition. To this end, we make use of attention modules that learn to highlight regions in the video and aggregate features for recognition. Specifically, we propose to leverage output attention maps as a vehicle to transfer the learned representation from a motion (flow) network to an RGB network. We systematically study the design of attention modules, and develop a novel method for attention distillation. Our method is evaluated on major action benchmarks, and consistently improves the performance of the baseline RGB network by a significant margin. Moreover, we demonstrate that our attention maps can leverage motion cues in learning to identify the location of actions in video frames. We believe our method provides a step towards learning motion-aware representations in deep models. Our project page is available at https://aptx4869lm.github.io/AttentionDistillation/

preprint · 2020 · arXiv

Characterization and Thermal Management of a DC Motor-Driven Resonant Actuator for Miniature Mobile Robots with Oscillating Limbs

In this paper, we characterize the performance of and develop thermal management solutions for a DC motor-driven resonant actuator developed for flapping wing micro air vehicles. The actuator, a DC micro-gearmotor connected in parallel with a torsional spring, drives reciprocal wing motion. Compared to the gearmotor alone, this design increased torque and power density by 161.1% and 666.8%, respectively, while decreasing the drawn current by 25.8%. Characterization of the actuator, isolated from nonlinear aerodynamic loading, results in standard metrics directly comparable to other actuators. The micro-motor, selected for low weight considerations, operates at high power for limited duration due to thermal effects. To predict system performance, a lumped parameter thermal circuit model was developed. Critical model parameters for this micro-motor, two orders of magnitude smaller than those previously characterized, were identified experimentally. This included the effects of variable winding resistance, bushing friction, speed-dependent forced convection, and the addition of a heatsink. The model was then used to determine a safe operation envelope for the vehicle and to design a weight-optimal heatsink. This actuator design and thermal modeling approach could be applied more generally to improve the performance of any miniature mobile robot or device with motor-driven oscillating limbs or loads.
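A lumped parameter thermal circuit of the kind mentioned above can be sketched with a single thermal node. This is a simplified illustration, not the paper's model: it keeps only temperature-dependent winding resistance and one thermal resistance to ambient, and leaves out bushing friction, speed-dependent convection, and the heatsink. All parameter values and names are hypothetical, except that 0.0039 per kelvin is close to the usual temperature coefficient of copper.

```python
def simulate_winding_temperature(
    current_a: float,
    duration_s: float,
    dt_s: float = 0.01,
    r0_ohm: float = 10.0,         # winding resistance at ambient (hypothetical)
    alpha_per_k: float = 0.0039,  # resistance temperature coefficient (about copper)
    r_th_k_per_w: float = 50.0,   # thermal resistance to ambient (hypothetical)
    c_th_j_per_k: float = 0.5,    # thermal capacitance (hypothetical)
    t_amb_c: float = 25.0,
) -> float:
    """Forward-Euler integration of C * dT/dt = I^2 * R(T) - (T - T_amb) / R_th."""
    temp_c = t_amb_c
    steps = int(round(duration_s / dt_s))
    for _ in range(steps):
        # Winding resistance rises linearly with temperature.
        resistance = r0_ohm * (1.0 + alpha_per_k * (temp_c - t_amb_c))
        p_loss_w = current_a ** 2 * resistance
        p_out_w = (temp_c - t_amb_c) / r_th_k_per_w
        temp_c += (p_loss_w - p_out_w) / c_th_j_per_k * dt_s
    return temp_c


if __name__ == "__main__":
    for seconds in (10.0, 60.0, 600.0):
        temp = simulate_winding_temperature(current_a=0.2, duration_s=seconds)
        print(f"after {seconds:5.0f} s: {temp:.2f} C")
```

Running such a model until the winding reaches a temperature limit gives the maximum safe run time for a given current, which is one way a safe operation envelope could be mapped out.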

preprint · 2020 · arXiv

Forecasting Human-Object Interaction: Joint Prediction of Motor Attention and Actions in First Person Video

We address the challenging task of anticipating human-object interaction in first person videos. Most existing methods ignore how the camera wearer interacts with the objects, or simply consider body motion as a separate modality. In contrast, we observe that the intentional hand movement reveals critical information about the future activity. Motivated by this, we adopt intentional hand movement as a future representation and propose a novel deep network that jointly models and predicts the egocentric hand motion, interaction hotspots and future action. Specifically, we consider the future hand motion as the motor attention, and model this attention using latent variables in our deep model. The predicted motor attention is further used to characterise the discriminative spatial-temporal visual features for predicting actions and interaction hotspots. We present extensive experiments demonstrating the benefit of the proposed joint model. Importantly, our model produces new state-of-the-art results for action anticipation on both the EGTEA Gaze+ and EPIC-Kitchens datasets. Our project page is available at https://aptx4869lm.github.io/ForecastingHOI/

preprint · 2020 · arXiv

Learning Hierarchical Teaching Policies for Cooperative Agents

Collective learning can be greatly enhanced when agents effectively exchange knowledge with their peers. In particular, recent work studying agents that learn to teach other teammates has demonstrated that action advising accelerates team-wide learning. However, the prior work has simplified the learning of advising policies by using simple function approximations and only considered advising with primitive (low-level) actions, limiting the scalability of learning and teaching to complex domains. This paper introduces a novel learning-to-teach framework, called hierarchical multiagent teaching (HMAT), that improves scalability to complex environments by using the deep representation for student policies and by advising with more expressive extended action sequences over multiple levels of temporal abstraction. Our empirical evaluations demonstrate that HMAT improves team-wide learning progress in large, complex domains where previous approaches fail. HMAT also learns teaching policies that can effectively transfer knowledge to different teammates with knowledge of different tasks, even when the teammates have heterogeneous action spaces.

preprint · 2020 · arXiv

On the Role of Weight Sharing During Deep Option Learning

The options framework is a popular approach for building temporally extended actions in reinforcement learning. In particular, the option-critic architecture provides general purpose policy gradient theorems for learning actions from scratch that are extended in time. However, past work makes the key assumption that each of the components of option-critic has independent parameters. In this work we note that while this key assumption of the policy gradient theorems of option-critic holds in the tabular case, it is always violated in practice for the deep function approximation setting. We thus reconsider this assumption and consider more general extensions of option-critic and hierarchical option-critic training that optimize for the full architecture with each update. It turns out that not assuming parameter independence challenges a belief in prior work that training the policy over options can be disentangled from the dynamics of the underlying options. In fact, learning can be sped up by focusing the policy over options on states where options are actually likely to terminate. We put our new algorithms to the test in application to sample efficient learning of Atari games, and demonstrate significantly improved stability and faster convergence when learning long options.

preprint · 2019 · arXiv

Learning Abstract Options

Building systems that autonomously create temporal abstractions from data is a key challenge in scaling learning and planning in reinforcement learning. One popular approach for addressing this challenge is the options framework (Sutton et al., 1999). However, only recently in (Bacon et al., 2017) was a policy gradient theorem derived for online learning of general purpose options in an end to end fashion. In this work, we extend previous work on this topic that only focuses on learning a two-level hierarchy including options and primitive actions to enable learning simultaneously at multiple resolutions in time. We achieve this by considering an arbitrarily deep hierarchy of options where high level temporally extended options are composed of lower level options with finer resolutions in time. We extend results from (Bacon et al., 2017) and derive policy gradient theorems for a deep hierarchy of options. Our proposed hierarchical option-critic architecture is capable of learning internal policies, termination conditions, and hierarchical compositions over options without the need for any intrinsic rewards or subgoals. Our empirical results in both discrete and continuous environments demonstrate the efficiency of our framework.

preprint · 2016 · arXiv

Decentralized Non-communicating Multiagent Collision Avoidance with Deep Reinforcement Learning

Finding feasible, collision-free paths for multiagent systems can be challenging, particularly in non-communicating scenarios where each agent's intent (e.g. goal) is unobservable to the others. In particular, finding time efficient paths often requires anticipating interaction with neighboring agents, the process of which can be computationally prohibitive. This work presents a decentralized multiagent collision avoidance algorithm based on a novel application of deep reinforcement learning, which effectively offloads the online computation (for predicting interaction patterns) to an offline learning procedure. Specifically, the proposed approach develops a value network that encodes the estimated time to the goal given an agent's joint configuration (positions and velocities) with its neighbors. Use of the value network not only admits efficient (i.e., real-time implementable) queries for finding a collision-free velocity vector, but also considers the uncertainty in the other agents' motion. Simulation results show more than 26 percent improvement in path quality (i.e., time to reach the goal) when compared with optimal reciprocal collision avoidance (ORCA), a state-of-the-art collision avoidance strategy.

preprint · 2015 · arXiv

First-principles evaluation of Multi-valent cation insertion into Orthorhombic V$_2$O$_5$

A systematic first principles evaluation of the insertion behavior of multi-valent cations in orthorhombic V$_2$O$_5$ is performed. Layer spacing, voltage, phase stability, and ion mobility are computed for Li$^+$, Mg$^{2+}$, Zn$^{2+}$, Ca$^{2+}$, and Al$^{3+}$ intercalation in the $α$ and $δ$ polymorphs.

preprint · 2015 · arXiv

Stick-Breaking Policy Learning in Dec-POMDPs

Expectation maximization (EM) has recently been shown to be an efficient algorithm for learning finite-state controllers (FSCs) in large decentralized POMDPs (Dec-POMDPs). However, current methods use fixed-size FSCs and often converge to maxima that are far from optimal. This paper considers a variable-size FSC to represent the local policy of each agent. These variable-size FSCs are constructed using a stick-breaking prior, leading to a new framework called decentralized stick-breaking policy representation (Dec-SBPR). This approach learns the controller parameters with a variational Bayesian algorithm without having to assume that the Dec-POMDP model is available. The performance of Dec-SBPR is demonstrated on several benchmark problems, showing that the algorithm scales to large problems while outperforming other state-of-the-art methods.
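The stick-breaking construction behind such a prior can be illustrated with a short sketch. It shows the generic truncated construction with Beta(1, concentration) fractions, which is an assumption: the abstract does not specify how Dec-SBPR parameterizes its prior over controller nodes. The function name and arguments are hypothetical.

```python
import random


def stick_breaking_weights(concentration: float, num_sticks: int,
                           rng: random.Random) -> list:
    """Draw truncated stick-breaking weights that sum to 1.

    Each step breaks off a Beta(1, concentration) fraction of the stick
    that remains; the final weight takes whatever is left (truncation).
    """
    weights = []
    remaining = 1.0
    for _ in range(num_sticks - 1):
        fraction = rng.betavariate(1.0, concentration)
        weights.append(remaining * fraction)
        remaining *= 1.0 - fraction
    weights.append(remaining)
    return weights


if __name__ == "__main__":
    rng = random.Random(0)
    weights = stick_breaking_weights(concentration=2.0, num_sticks=8, rng=rng)
    print([round(w, 3) for w in weights])
```

Because later sticks tend to receive geometrically smaller weights, a prior of this kind favors controllers that use only a few nodes unless the data support more, which is what allows the controller size to vary.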

preprint · 2013 · arXiv

Dynamic Clustering via Asymptotics of the Dependent Dirichlet Process Mixture

This paper presents a novel algorithm, based upon the dependent Dirichlet process mixture model (DDPMM), for clustering batch-sequential data containing an unknown number of evolving clusters. The algorithm is derived via a low-variance asymptotic analysis of the Gibbs sampling algorithm for the DDPMM, and provides a hard clustering with convergence guarantees similar to those of the k-means algorithm. Empirical results from a synthetic test with moving Gaussian clusters and a test with real ADS-B aircraft trajectory data demonstrate that the algorithm requires orders of magnitude less computational time than contemporary probabilistic and hard clustering algorithms, while providing higher accuracy on the examined datasets.
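The flavor of a k-means-like hard clustering with an unknown number of clusters can be sketched as follows. This is the simpler static, single-batch rule in the style of DP-means, swapped in for illustration; it is not the paper's algorithm, which also handles clusters that move, appear, and disappear across batches. The function names and the `new_cluster_penalty` parameter are hypothetical.

```python
def squared_distance(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b))


def dp_means(points, new_cluster_penalty, max_iters=100):
    """Hard clustering that opens a new cluster when a point is too far away.

    A point whose squared distance to every center exceeds
    `new_cluster_penalty` becomes the center of a new cluster.
    """
    centers = [points[0]]
    assignments = [-1] * len(points)
    for _ in range(max_iters):
        changed = False
        # Assignment step, with the option of creating a cluster.
        for i, point in enumerate(points):
            dists = [squared_distance(point, c) for c in centers]
            k = min(range(len(centers)), key=dists.__getitem__)
            if dists[k] > new_cluster_penalty:
                centers.append(point)
                k = len(centers) - 1
            if assignments[i] != k:
                assignments[i] = k
                changed = True
        # Update step: move each center to the mean of its members.
        for k in range(len(centers)):
            members = [p for p, a in zip(points, assignments) if a == k]
            if members:
                centers[k] = tuple(sum(xs) / len(members) for xs in zip(*members))
        if not changed:
            break
    return centers, assignments


if __name__ == "__main__":
    data = [(0.0, 0.0), (0.0, 1.0), (1.0, 0.0),
            (10.0, 10.0), (10.0, 11.0), (11.0, 10.0)]
    centers, labels = dp_means(data, new_cluster_penalty=9.0)
    print(centers, labels)
```

The penalty plays the role that the number of clusters plays in k-means: a larger value yields fewer, broader clusters.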

preprint · 2012 · arXiv

Interplay between Quantum Size Effect and Strain Effect on Growth of Nanoscale Metal Thin Film

We develop a theoretical framework to investigate the interplay between quantum size effect (QSE) and strain effect on the stability of metal nanofilms. The QSE and strain effect are shown to be coupled through the concept of "quantum electronic stress". First-principles calculations reveal large quantum oscillations in the surface stress of metal nanofilms as a function of film thickness. This adds extrinsically additional strain-coupled quantum oscillations to the surface energy of strained metal nanofilms. Our theory enables a quantitative estimation of the amount of strain in experimental samples, and suggests strain be an important factor contributing to the discrepancies between the existing theories and experiments.

preprint · 2012 · arXiv

Quantum Manifestation of Elastic Constants in Nanostructures

Generally, there are two distinct effects in modifying the properties of low-dimensional nanostructures: surface effect (SS) due to increased surface-volume ratio and quantum size effect (QSE) due to quantum confinement in reduced dimension. The SS has been widely shown to affect the elastic constants and mechanical properties of nanostructures. Here, using Pb nanofilm and graphene nanoribbon as model systems, we demonstrate the QSE on the elastic constants of nanostructures by first-principles calculations. We show that generally QSE is dominant in affecting the elastic constants of metallic nanostructures while SS is more pronounced in semiconductor and insulator nanostructures. Our findings have broad implications in quantum aspects of nanomechanics.

preprint · 2011 · arXiv

Quantum Stress: Density Functional Theory Formulation and Physical Manifestation

The concept of "quantum stress (QS)" is introduced and formulated within density functional theory (DFT), to elucidate extrinsic electronic effects on the stress state of solids and thin films in the absence of lattice strain. A formal expression of QS (σ^Q) is derived in relation to deformation potential of electronic states (Ξ) and variation of electron density (Δn), σ^Q = ΞΔn, as a quantum analog of classical Hooke's law. Two distinct QS manifestations are demonstrated quantitatively by DFT calculations: (1) in the form of bulk stress induced by charge carriers; and (2) in the form of surface stress induced by quantum confinement. Implications of QS in some physical phenomena are discussed to underline its importance.

preprint · 2010 · arXiv

Quantum Manifestations of Graphene Edge Stress and Edge Instability: A First-Principles Study

We have performed first-principles calculations of graphene edge stresses, which display two interesting quantum manifestations absent from the classical interpretation: the armchair edge stress oscillates with nanoribbon width, and the zigzag edge stress is noticeably reduced by spin polarization. Such quantum stress effects in turn manifest in mechanical edge twisting and warping instability, showing features not captured by empirical potentials or continuum theory. Edge adsorption of H and Stone-Wales reconstruction are shown to provide alternative mechanisms in relieving the edge compression and hence to stabilize the planar edge structure.

preprint · 2008 · arXiv

Magnetic Graphene Nanohole Superlattices

We investigate the magnetic properties of nano-holes (NHs) patterned in graphene using first principles calculations. We show that superlattices consisting of a periodic array of NHs form a new family of 2D crystalline "bulk" magnets whose collective magnetic behavior is governed by inter-NH spin-spin interaction. They exhibit long-range magnetic order well above room temperature. Furthermore, magnetic semiconductors can be made by doping magnetic NHs into semiconducting NH superlattices. Our findings offer a new material system for fundamental studies of spin-spin interaction and magnetic ordering in low dimensions, and open up exciting opportunities for making engineered magnetic materials for storage media and spintronics applications.
