Researcher profile

Qianli Ma

Qianli Ma contributes to research discovery and scholarly infrastructure.

Researcher · Affiliation not imported · Open to collaborate

Trust snapshot

Quick read

Trust 21 - Emerging · Verification L1 · Unclaimed author
11 works
0 followers
8 topics
4 close collaborators

Published work

11 published items

preprint · 2026 · arXiv

A Unified Shape-Aware Foundation Model for Time Series Classification

Foundation models pre-trained on large-scale source datasets are reshaping the traditional training paradigm for time series classification. However, existing time series foundation models primarily focus on forecasting tasks and often overlook classification-specific challenges, such as modeling interpretable shapelets that capture class-discriminative temporal features. To bridge this gap, we propose UniShape, a unified shape-aware foundation model designed for time series classification. UniShape incorporates a shape-aware adapter that adaptively aggregates multiscale discriminative subsequences (shapes) into class tokens, effectively selecting the most relevant subsequence scales to enhance model interpretability. Meanwhile, a prototype-based pretraining module is introduced to jointly learn instance- and shape-level representations, enabling the capture of transferable shape patterns. Pre-trained on a large-scale multi-domain time series dataset comprising 1.89 million samples, UniShape exhibits superior generalization across diverse target domains. Experiments on 128 UCR datasets and 30 additional time series datasets demonstrate that UniShape achieves state-of-the-art classification performance, with interpretability and ablation analyses further validating its effectiveness.

preprint · 2026 · arXiv

Lifelong Learning of Large Language Model based Agents: A Roadmap

Lifelong learning, also known as continual or incremental learning, is a crucial component for advancing Artificial General Intelligence (AGI) by enabling systems to continuously adapt in dynamic environments. While large language models (LLMs) have demonstrated impressive capabilities in natural language processing, existing LLM agents are typically designed for static systems and lack the ability to adapt over time in response to new challenges. This survey is the first to systematically summarize the potential techniques for incorporating lifelong learning into LLM-based agents. We categorize the core components of these agents into three modules: the perception module for multimodal input integration, the memory module for storing and retrieving evolving knowledge, and the action module for grounded interactions with the dynamic environment. We highlight how these pillars collectively enable continuous adaptation, mitigate catastrophic forgetting, and improve long-term performance. This survey provides a roadmap for researchers and practitioners working to develop lifelong learning capabilities in LLM agents, offering insights into emerging trends, evaluation metrics, and application scenarios. Relevant literature and resources are available at https://github.com/qianlima-lab/awesome-lifelong-llm-agent.

preprint · 2026 · arXiv

The RoboSense Challenge: Sense Anything, Navigate Anywhere, Adapt Across Platforms

Autonomous systems are increasingly deployed in open and dynamic environments -- from city streets to aerial and indoor spaces -- where perception models must remain reliable under sensor noise, environmental variation, and platform shifts. However, even state-of-the-art methods often degrade under unseen conditions, highlighting the need for robust and generalizable robot sensing. The RoboSense 2025 Challenge is designed to advance robustness and adaptability in robot perception across diverse sensing scenarios. It unifies five complementary research tracks spanning language-grounded decision making, socially compliant navigation, sensor configuration generalization, cross-view and cross-modal correspondence, and cross-platform 3D perception. Together, these tasks form a comprehensive benchmark for evaluating real-world sensing reliability under domain shifts, sensor failures, and platform discrepancies. RoboSense 2025 provides standardized datasets, baseline models, and unified evaluation protocols, enabling large-scale and reproducible comparison of robust perception methods. The challenge attracted 143 teams from 85 institutions across 16 countries, reflecting broad community engagement. By consolidating insights from 23 winning solutions, this report highlights emerging methodological trends, shared design principles, and open challenges across all tracks, marking a step toward building robots that can sense reliably, act robustly, and adapt across platforms in real-world environments.

preprint · 2022 · arXiv

Dynamic Parallel Spin Stripes from the 1/8 anomaly to the End of Superconductivity in La$_{1.6-x}$Nd$_{0.4}$Sr$_x$CuO$_4$

We have carried out new neutron spectroscopic measurements on single crystals of La$_{1.6-x}$Nd$_{0.4}$Sr$_x$CuO$_4$ with x from 0.12 to 0.26 using time-of-flight techniques. These measurements allow us to follow the evolution of parallel spin stripe fluctuations with energies less than 33 meV, from x=0.12 to 0.26. Samples at these hole-doping levels are known to display static (on the neutron scattering time scale) parallel spin stripes at low temperature, with onset temperatures and intensities which decrease rapidly with increasing x. Nonetheless, we report remarkably similar dynamic spectral weight for the corresponding dynamic parallel spin stripes, between 5 meV and 33 meV, from the 1/8 anomaly near x=0.12, to optimal doping near x=0.19, to the quantum critical point for the pseudogap phase near x=0.24, and finally to the approximate end of superconductivity near x=0.26. This observed dynamic magnetic spectral weight is structured in energy with a peak near 17 meV at all dopings studied. Earlier neutron and resonant x-ray scattering measurements on related cuprate superconductors have reported both a disappearance with increasing doping of magnetic fluctuations at ($π$, $π$) wavevectors characterizing parallel spin stripe structures, and persistent paramagnon scattering away from this wavevector, respectively. Our new results on La$_{1.6-x}$Nd$_{0.4}$Sr$_x$CuO$_4$ for 0.12 < x < 0.26 clearly show persistent parallel spin stripe fluctuations at and around ($π$, $π$), and across the full range of doping studied. These results are also compared to recent theory. Together with a rapidly declining x-dependence to the static parallel spin stripe order, the persistent parallel spin stripe fluctuations show a remarkable similarity to the expectations of a quantum spin glass, random t-J model, recently introduced to describe strong local correlations in cuprates.

preprint · 2022 · arXiv

EgoBody: Human Body Shape and Motion of Interacting People from Head-Mounted Devices

Understanding social interactions from egocentric views is crucial for many applications, ranging from assistive robotics to AR/VR. Key to reasoning about interactions is to understand the body pose and motion of the interaction partner from the egocentric view. However, research in this area is severely hindered by the lack of datasets. Existing datasets are limited in terms of either size, capture/annotation modalities, ground-truth quality, or interaction diversity. We fill this gap by proposing EgoBody, a novel large-scale dataset for human pose, shape and motion estimation from egocentric views, during interactions in complex 3D scenes. We employ Microsoft HoloLens2 headsets to record rich egocentric data streams (including RGB, depth, eye gaze, head and hand tracking). To obtain accurate 3D ground truth, we calibrate the headset with a multi-Kinect rig and fit expressive SMPL-X body meshes to multi-view RGB-D frames, reconstructing 3D human shapes and poses relative to the scene, over time. We collect 125 sequences, spanning diverse interaction scenarios, and propose the first benchmark for 3D full-body pose and shape estimation of the social partner from egocentric views. We extensively evaluate state-of-the-art methods, highlight their limitations in the egocentric scenario, and address such limitations leveraging our high-quality annotations. Data and code are available at https://sanweiliti.github.io/egobody/egobody.html.

preprint · 2022 · arXiv

Magnetic Field Tuning of Parallel Spin Stripe Order and Fluctuations near the Pseudogap Quantum Critical Point in La$_{1.36}$Nd$_{0.4}$Sr$_{0.24}$CuO$_4$

A quantum critical point in the single layer, hole-doped cuprate system La$_{1.6-x}$Nd$_{0.4}$Sr$_x$CuO$_4$ (Nd-LSCO), near $x$ = 0.23 has been proposed as an organizing principle for understanding high temperature superconductivity. Our earlier neutron diffraction work on Nd-LSCO at optimal and high doping revealed static parallel spin stripes to exist out to the QCP and slightly beyond, at $x$ = 0.24 and 0.26. We examine more closely the parallel spin stripe order parameter in Nd-LSCO in both zero magnetic field and fields up to 8 T for H // c in these single crystals. In contrast to earlier studies at lower doping, we observe that H // c in excess of $\sim$ 2.5 T eliminates the incommensurate quasi-Bragg peaks associated with parallel spin stripes. But this elastic scattering is not destroyed by the field; rather it is transferred to commensurate $\mathbf{Q} = 0$ Bragg positions, implying that the spins participating in the spin stripes have been polarized. Inelastic neutron scattering measurements at high fields show an increase in the low energy, parallel spin stripe fluctuations and evidence for a spin gap, $Δ_{spin}$ = 3 $\pm$ 0.5 meV for Nd-LSCO with $x$ = 0.24. This is shown to be consistent with spin gap measurements as a function of superconducting T$_C$ over five different families of cuprate superconductors, which follow the approximate linear relation, $Δ_{spin}$ = 3.5 k$_B$T$_C$.
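
As a rough illustration of the linear relation quoted in this abstract, the sketch below converts between a spin gap and a superconducting T$_C$ using $Δ_{spin}$ = 3.5 k$_B$T$_C$. The function names are hypothetical, and the only inputs taken from the abstract are the ratio 3.5 and the gap of 3 $\pm$ 0.5 meV; the Boltzmann constant is the standard CODATA value. This is arithmetic on the stated relation, not a reproduction of the authors' analysis.

```python
# Boltzmann constant in meV per kelvin (CODATA: 8.617333262e-5 eV/K).
K_B_MEV_PER_K = 8.617333262e-2


def spin_gap_mev(t_c_kelvin, ratio=3.5):
    """Spin gap in meV implied by gap = ratio * k_B * T_C."""
    return ratio * K_B_MEV_PER_K * t_c_kelvin


def t_c_from_gap(gap_mev, ratio=3.5):
    """T_C in kelvin implied by inverting gap = ratio * k_B * T_C."""
    return gap_mev / (ratio * K_B_MEV_PER_K)


if __name__ == "__main__":
    gap, err = 3.0, 0.5  # meV, values quoted in the abstract
    low, mid, high = (t_c_from_gap(g) for g in (gap - err, gap, gap + err))
    print(f"T_C implied by the relation: {mid:.2f} K "
          f"(range {low:.2f} K to {high:.2f} K)")
```

Taken at face value, a 3 meV gap corresponds to a T$_C$ on the order of 10 K under this relation.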

preprint · 2022 · arXiv

MetaAvatar: Learning Animatable Clothed Human Models from Few Depth Images

In this paper, we aim to create generalizable and controllable neural signed distance fields (SDFs) that represent clothed humans from monocular depth observations. Recent advances in deep learning, especially neural implicit representations, have enabled human shape reconstruction and controllable avatar generation from different sensor inputs. However, to generate realistic cloth deformations from novel input poses, watertight meshes or dense full-body scans are usually needed as inputs. Furthermore, due to the difficulty of effectively modeling pose-dependent cloth deformations for diverse body shapes and cloth types, existing approaches resort to per-subject/cloth-type optimization from scratch, which is computationally expensive. In contrast, we propose an approach that can quickly generate realistic clothed human avatars, represented as controllable neural SDFs, given only monocular depth images. We achieve this by using meta-learning to learn an initialization of a hypernetwork that predicts the parameters of neural SDFs. The hypernetwork is conditioned on human poses and represents a clothed neural avatar that deforms non-rigidly according to the input poses. Meanwhile, it is meta-learned to effectively incorporate priors of diverse body shapes and cloth types and thus can be much faster to fine-tune, compared to models trained from scratch. We qualitatively and quantitatively show that our approach outperforms state-of-the-art approaches that require complete meshes as inputs while our approach requires only depth frames as inputs and runs orders of magnitude faster. Furthermore, we demonstrate that our meta-learned hypernetwork is very robust, being the first to generate avatars with realistic dynamic cloth deformations given as few as 8 monocular depth frames.

preprint · 2022 · arXiv

Neural Point-based Shape Modeling of Humans in Challenging Clothing

Parametric 3D body models like SMPL only represent minimally-clothed people and are hard to extend to clothing because they have a fixed mesh topology and resolution. To address these limitations, recent work uses implicit surfaces or point clouds to model clothed bodies. While not limited by topology, such methods still struggle to model clothing that deviates significantly from the body, such as skirts and dresses. This is because they rely on the body to canonicalize the clothed surface by reposing it to a reference shape. Unfortunately, this process is poorly defined when clothing is far from the body. Additionally, they use linear blend skinning to pose the body and the skinning weights are tied to the underlying body parts. In contrast, we model the clothing deformation in a local coordinate space without canonicalization. We also relax the skinning weights to let multiple body parts influence the surface. Specifically, we extend point-based methods with a coarse stage that replaces canonicalization with a learned pose-independent "coarse shape" that can capture the rough surface geometry of clothing like skirts. We then refine this using a network that infers the linear blend skinning weights and pose-dependent displacements from the coarse representation. The approach works well for garments that both conform to, and deviate from, the body. We demonstrate the usefulness of our approach by learning person-specific avatars from examples and then show how they can be animated in new poses and motions. We also show that the method can learn directly from raw scans with missing data, greatly simplifying the process of creating realistic avatars. Code is available for research purposes at https://qianlim.github.io/SkiRT.
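
For readers unfamiliar with linear blend skinning, which this abstract refers to, the sketch below shows the standard formulation: a surface point is moved by a weighted sum of per-part rigid transforms, so letting several weights be non-zero lets several body parts influence the same point. This is a generic textbook illustration, not the paper's network; the function names, the two-part setup, and the hand-picked weights are all assumptions made for the example (in the paper the weights are inferred by a network).

```python
import math


def rot_z(angle):
    """3x3 rotation matrix about the z-axis (angle in radians)."""
    c, s = math.cos(angle), math.sin(angle)
    return [[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]]


def linear_blend_skinning(point, weights, transforms):
    """Blend rigid transforms (R, t) of several parts for one 3D point.

    weights must sum to 1; each transform is a (3x3 rotation, translation) pair.
    """
    if abs(sum(weights) - 1.0) > 1e-9:
        raise ValueError("skinning weights must sum to 1")
    out = [0.0, 0.0, 0.0]
    for w, (rot, trans) in zip(weights, transforms):
        # Rigidly transform the point with this part, then accumulate.
        moved = [sum(rot[i][j] * point[j] for j in range(3)) + trans[i]
                 for i in range(3)]
        out = [o + w * m for o, m in zip(out, moved)]
    return out


if __name__ == "__main__":
    identity = (rot_z(0.0), [0.0, 0.0, 0.0])
    quarter_turn = (rot_z(math.pi / 2), [0.0, 0.0, 0.0])
    # A point influenced equally by a static part and a rotated part.
    print(linear_blend_skinning([1.0, 0.0, 0.0], [0.5, 0.5],
                                [identity, quarter_turn]))
```

With weights tied to a single part (for example `[1.0, 0.0]`), the point simply follows that part; intermediate weights interpolate between the parts' motions.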

preprint · 2021 · arXiv

Relativistic electron flux model in the outer radiation belt using a neural network approach

We present a machine-learning-based model of relativistic electron fluxes >1.8 MeV using a neural network approach in the Earth's outer radiation belt. The Outer RadIation belt Electron Neural net model for Relativistic electrons (ORIENT-R) uses only solar wind conditions and geomagnetic indices as input. For the first time, we show that the state of the outer radiation belt can be determined using only solar wind conditions and geomagnetic indices, without any initial and boundary conditions. The most important features for determining outer radiation belt dynamics are found to be AL, solar wind flow speed and density, and SYM-H indices. ORIENT-R reproduces out-of-sample relativistic electron fluxes with a correlation coefficient of 0.95 and an uncertainty factor of ~2. ORIENT-R reproduces radiation belt dynamics during an out-of-sample geomagnetic storm with good agreement to the observations. In addition, ORIENT-R was run for a completely out-of-sample period between March 2018 and October 2019 when the AL index ended and was replaced with the predicted AL index (lasp.colorado.edu/~lix). It reproduces electron fluxes with a correlation coefficient of 0.92 and an out-of-sample uncertainty factor of ~3. Furthermore, ORIENT-R captured the trend in the electron fluxes from low-earth-orbit (LEO) SAMPEX, which is a completely out-of-sample dataset both temporally and spatially. In sum, the ORIENT-R model can reproduce transport, acceleration, decay, and dropouts of the outer radiation belt anywhere from short timescales (i.e., geomagnetic storms) to very long timescales (i.e., the solar cycle).

preprint · 2020 · arXiv

Frontal Low-rank Random Tensors for Fine-grained Action Segmentation

Fine-grained action segmentation in long untrimmed videos is an important task for many applications such as surveillance, robotics, and human-computer interaction. To understand subtle and precise actions within a long time period, second-order information (e.g. feature covariance) or higher is reported to be effective in the literature. However, extracting such high-order information is considerably non-trivial. In particular, the dimensionality increases exponentially with the information order, and hence gaining more representation power also increases the computational cost and the risk of overfitting. In this paper, we propose an approach to representing high-order information for temporal action segmentation via a simple yet effective bilinear form. Specifically, our contributions are: (1) From the multilinear perspective, we derive a bilinear form of low complexity, assuming that the three-way tensor has low-rank frontal slices. (2) Rather than learning the tensor entries from data, we sample the entries from different underlying distributions, and prove that the underlying distribution influences the information order. (3) We employ our bilinear form as an intermediate layer in state-of-the-art deep neural networks, making it possible to represent high-order information in complex deep models effectively and efficiently. Our experimental results demonstrate that the proposed bilinear form outperforms the previous state-of-the-art methods on the challenging temporal action segmentation task. One can see our project page for data, model and code: https://vlg.inf.ethz.ch/projects/BilinearTCN/.

preprint · 2020 · arXiv

Learning to Dress 3D People in Generative Clothing

Three-dimensional human body models are widely used in the analysis of human pose and motion. Existing models, however, are learned from minimally-clothed 3D scans and thus do not generalize to the complexity of dressed people in common images and videos. Additionally, current models lack the expressive power needed to represent the complex non-linear geometry of pose-dependent clothing shapes. To address this, we learn a generative 3D mesh model of clothed people from 3D scans with varying pose and clothing. Specifically, we train a conditional Mesh-VAE-GAN to learn the clothing deformation from the SMPL body model, making clothing an additional term in SMPL. Our model is conditioned on both pose and clothing type, giving the ability to draw samples of clothing to dress different body shapes in a variety of styles and poses. To preserve wrinkle detail, our Mesh-VAE-GAN extends patchwise discriminators to 3D meshes. Our model, named CAPE, represents global shape and fine local structure, effectively extending the SMPL body model to clothing. To our knowledge, this is the first generative model that directly dresses 3D human body meshes and generalizes to different poses. The model, code and data are available for research purposes at https://cape.is.tue.mpg.de.