Source author record

Tian Lan

Tian Lan appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Catalog footprint

What is connected

36works

29topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

Metric-Gradient Projection for Stable Multi-Agent Policy Learning

General-sum multi-agent learning is often governed by a stacked update field in which each agent's policy update changes the optimization landscape faced by the others. This coupling can entangle an integrable component of collective improvement with cyclic interaction dynamics, leading to slow or unstable multi-agent learning. Existing approaches, such as regularization, credit assignment, and consensus methods, stabilize MARL through local or algorithmic modifications; HPML complements them by projecting the joint update field onto a metric-gradient component. We introduce \textbf{HPML} (\textbf{H}odge-\textbf{P}rojected \textbf{M}ulti-agent \textbf{L}earning), which views the joint update field of a multi-agent system as an element of an $L^2$ space of vector fields and computes a Hodge-type projection onto the closest metric-gradient potential flow. HPML follows the projected component as the update direction, yielding the closest metric-gradient field under the chosen metric and sampling measure. The projection is defined variationally, characterized by a Poisson-type equation, and implemented through graph-based and amortized neural realizations that recover projected directions from samples. We show that the projected dynamics admit a Lyapunov potential and yield equilibrium-gap bounds with an explicit additive non-potentiality term. Controlled experiments validate the geometric mechanism, and CTDE benchmarks show improved stability and normalized return when HPML is used as a plug-in projection layer in MARL pipelines.

preprint2026arXiv

NonZero: Interaction-Guided Exploration for Multi-Agent Monte Carlo Tree Search

Monte Carlo Tree Search (MCTS) scales poorly in cooperative multi-agent domains because expansion must consider an exponentially large set of joint actions, severely limiting exploration under realistic search budgets. We propose NonZero, which keeps multi-agent MCTS tractable by running surrogate-guided selection over a low-dimensional nonlinear representation using an interaction-guided proposal rule, instead of directly exploring the full joint-action space. Our exploration uses an interaction score: single-agent deviations are ranked by predicted gain, while two-agent deviations are scored by a mixed-difference measure that reveals coordination benefits even when no single agent can improve alone. We formalize candidate proposal as a bandit problem over local deviations and derive a proposal rule, NonZero, with a sublinear local-regret guarantee for reaching approximate graph-local optima without enumerating the joint-action space. Empirically, NonZero improves sample efficiency and final performance on MatGame, SMAC, and SMACv2 relative to strong model-based and model-free baselines under matched search budgets.

preprint2025arXiv

Ultrafast Exciton-Polariton Transport and Relaxation in Halide Perovskite

Halide perovskites offer a great platform for room-temperature exciton-polaritons (EPs) due to their strong oscillator strength and large exciton binding energy, promising applications in next-generation photonic and polaritonic devices. Efficient manipulation of EP transport and relaxation is critical for device performance, yet their spatiotemporal dynamics across different in-plane momenta (k//) remain poorly understood due to limitations in experimental access. In this work, we employ energy-resolved transient reflectance microscopy (TRM) combined with the dispersion relation of EPs to achieve high-resolution imaging of EP transport at specific k//. This approach directly reveals the quasi-ballistic transport and ultrafast relaxation of EPs in different k// regions, showcasing diffusion as fast as ~490 cm2/s and a relaxation time of ~95.1 fs. Furthermore, by tuning the detuning parameter, we manipulate the ballistic transport group velocity and relaxation time of EPs across varying k//. Our results reveal key insights into the dynamics of EP transport and relaxation, providing valuable guidance for the design and optimization of polaritonic devices.

preprint2023arXiv

TWR-MCAE: A Data Augmentation Method for Through-the-Wall Radar Human Motion Recognition

To solve the problems of reduced accuracy and prolonging convergence time of through-the-wall radar (TWR) human motion due to wall attenuation, multipath effect, and system interference, we propose a multilink auto-encoding neural network (TWR-MCAE) data augmentation method. Specifically, the TWR-MCAE algorithm is jointly constructed by a singular value decomposition (SVD)-based data preprocessing module, an improved coordinate attention module, a compressed sensing learnable iterative shrinkage threshold reconstruction algorithm (LISTA) module, and an adaptive weight module. The data preprocessing module achieves wall clutter, human motion features, and noise subspaces separation. The improved coordinate attention module achieves clutter and noise suppression. The LISTA module achieves human motion feature enhancement. The adaptive weight module learns the weights and fuses the three subspaces. The TWR-MCAE can suppress the low-rank characteristics of wall clutter and enhance the sparsity characteristics in human motion at the same time. It can be linked before the classification step to improve the feature extraction capability without adding other prior knowledge or recollecting more data. Experiments show that the proposed algorithm gets a better peak signal-to-noise ratio (PSNR), which increases the recognition accuracy and speeds up the training process of the back-end classifiers.

preprint2022arXiv

A Framework for Server Authentication using Communication Protocol Dialects

In today's world, computer networks have become vulnerable to numerous attacks. In both wireless and wired networks, one of the most common attacks is man-in-the-middle attacks, within which session hijacking, context confusion attacks have been the most attempted. A potential attacker may have enough time to launch an attack targeting these vulnerabilities (such as rerouting the target request to a malicious server or hijacking the traffic). A viable strategy to solve this problem is, by dynamically changing the system properties, configurations and create unique fingerprints to identify the source. However, the existing work of fingerprinting mainly focuses on lower-level properties (e.g IP address), and only these types of properties are restricted for mutation. We develop a novel system, called Verify-Pro, to provide server authentication using communication protocol dialects, that uses a client-server architecture based on network protocols for customizing the communication transactions. For each session, a particular sequence of handshakes will be used as dialects. So, given the context, with the establishment of a one-time username and password, we use the dialects as an authentication mechanism for each request (e.g get filename in FTP) throughout the session, which enforces continuous authentication. Specifically, we leverage a machine learning approach on both client and server machines to trigger a specific dialect that dynamically changes for each request. We implement a prototype of Verify-Pro and evaluate its practicality on standard communication protocols FTP, HTTP & internet of things protocol MQTT. Our experimental results show that by sending misleading information through message packets from an attacker at the application layer, it is possible for the recipient to identify if the sender is genuine or a spoofed one, with a negligible overhead of 0.536%.

preprint2022arXiv

Cross-Lingual Phrase Retrieval

Cross-lingual retrieval aims to retrieve relevant text across languages. Current methods typically achieve cross-lingual retrieval by learning language-agnostic text representations in word or sentence level. However, how to learn phrase representations for cross-lingual phrase retrieval is still an open problem. In this paper, we propose XPR, a cross-lingual phrase retriever that extracts phrase representations from unlabeled example sentences. Moreover, we create a large-scale cross-lingual phrase retrieval dataset, which contains 65K bilingual phrase pairs and 4.2M example sentences in 8 English-centric language pairs. Experimental results show that XPR outperforms state-of-the-art baselines which utilize word-level or sentence-level representations. XPR also shows impressive zero-shot transferability that enables the model to perform retrieval in an unseen language pair during training. Our dataset, code, and trained models are publicly available at www.github.com/cwszz/XPR/.

preprint2022arXiv

Efficient Video Instance Segmentation via Tracklet Query and Proposal

Video Instance Segmentation (VIS) aims to simultaneously classify, segment, and track multiple object instances in videos. Recent clip-level VIS takes a short video clip as input each time showing stronger performance than frame-level VIS (tracking-by-segmentation), as more temporal context from multiple frames is utilized. Yet, most clip-level methods are neither end-to-end learnable nor real-time. These limitations are addressed by the recent VIS transformer (VisTR) which performs VIS end-to-end within a clip. However, VisTR suffers from long training time due to its frame-wise dense attention. In addition, VisTR is not fully end-to-end learnable in multiple video clips as it requires a hand-crafted data association to link instance tracklets between successive clips. This paper proposes EfficientVIS, a fully end-to-end framework with efficient training and inference. At the core are tracklet query and tracklet proposal that associate and segment regions-of-interest (RoIs) across space and time by an iterative query-video interaction. We further propose a correspondence learning that makes tracklets linking between clips end-to-end learnable. Compared to VisTR, EfficientVIS requires 15x fewer training epochs while achieving state-of-the-art accuracy on the YouTube-VIS benchmark. Meanwhile, our method enables whole video instance segmentation in a single end-to-end pass without data association at all.

preprint2022arXiv

Exploring Dense Retrieval for Dialogue Response Selection

Recent progress in deep learning has continuously improved the accuracy of dialogue response selection. In particular, sophisticated neural network architectures are leveraged to capture the rich interactions between dialogue context and response candidates. While remarkably effective, these models also bring in a steep increase in computational cost. Consequently, such models can only be used as a re-rank module in practice. In this study, we present a solution to directly select proper responses from a large corpus or even a nonparallel corpus that only consists of unpaired sentences, using a dense retrieval model. To push the limits of dense retrieval, we design an interaction layer upon the dense retrieval models and apply a set of tailor-designed learning strategies. Our model shows superiority over strong baselines on the conventional re-rank evaluation setting, which is remarkable given its efficiency. To verify the effectiveness of our approach in realistic scenarios, we also conduct full-rank evaluation, where the target is to select proper responses from a full candidate pool that may contain millions of candidates and evaluate them fairly through human annotations. Our proposed model notably outperforms pipeline baselines that integrate fast recall and expressive re-rank modules. Human evaluation results show that enlarging the candidate pool with nonparallel corpora improves response quality further.

preprint2022arXiv

Language Models Can See: Plugging Visual Controls in Text Generation

Generative language models (LMs) such as GPT-2/3 can be prompted to generate text with remarkable quality. While they are designed for text-prompted generation, it remains an open question how the generation process could be guided by modalities beyond text such as images. In this work, we propose a training-free framework, called MAGIC (iMAge-Guided text generatIon with CLIP), for plugging in visual controls in the generation process and enabling LMs to perform multimodal tasks (e.g., image captioning) in a zero-shot manner. MAGIC is a simple yet efficient plug-and-play framework, which directly combines an off-the-shelf LM (i.e., GPT-2) and an image-text matching model (i.e., CLIP) for image-grounded text generation. During decoding, MAGIC influences the generation of the LM by introducing a CLIP-induced score, called magic score, which regularizes the generated result to be semantically related to a given image while being coherent to the previously generated context. Notably, the proposed decoding scheme does not involve any gradient update operation, therefore being computationally efficient. On the challenging task of zero-shot image captioning, MAGIC outperforms the state-of-the-art method by notable margins with a nearly 27 times decoding speedup. MAGIC is a flexible framework and is theoretically compatible with any text generation tasks that incorporate image grounding. In the experiments, we showcase that it is also capable of performing visually grounded story generation given both an image and a text prompt.

preprint2022arXiv

On the Convergence of Heterogeneous Federated Learning with Arbitrary Adaptive Online Model Pruning

One of the biggest challenges in Federated Learning (FL) is that client devices often have drastically different computation and communication resources for local updates. To this end, recent research efforts have focused on training heterogeneous local models obtained by pruning a shared global model. Despite empirical success, theoretical guarantees on convergence remain an open question. In this paper, we present a unifying framework for heterogeneous FL algorithms with {\em arbitrary} adaptive online model pruning and provide a general convergence analysis. In particular, we prove that under certain sufficient conditions and on both IID and non-IID data, these algorithms converges to a stationary point of standard FL for general smooth cost functions, with a convergence rate of $O(\frac{1}{\sqrt{Q}})$. Moreover, we illuminate two key factors impacting convergence: pruning-induced noise and minimum coverage index, advocating a joint design of local pruning masks for efficient training.

preprint2022arXiv

SAFARI: Sparsity enabled Federated Learning with Limited and Unreliable Communications

Federated learning (FL) enables edge devices to collaboratively learn a model in a distributed fashion. Many existing researches have focused on improving communication efficiency of high-dimensional models and addressing bias caused by local updates. However, most of FL algorithms are either based on reliable communications or assume fixed and known unreliability characteristics. In practice, networks could suffer from dynamic channel conditions and non-deterministic disruptions, with time-varying and unknown characteristics. To this end, in this paper we propose a sparsity enabled FL framework with both communication efficiency and bias reduction, termed as SAFARI. It makes novel use of a similarity among client models to rectify and compensate for bias that is resulted from unreliable communications. More precisely, sparse learning is implemented on local clients to mitigate communication overhead, while to cope with unreliable communications, a similarity-based compensation method is proposed to provide surrogates for missing model updates. We analyze SAFARI under bounded dissimilarity and with respect to sparse models. It is demonstrated that SAFARI under unreliable communications is guaranteed to converge at the same rate as the standard FedAvg with perfect communications. Implementations and evaluations on CIFAR-10 dataset validate the effectiveness of SAFARI by showing that it can achieve the same convergence speed and accuracy as FedAvg with perfect communications, with up to 80% of the model weights being pruned and a high percentage of client updates missing in each round.

preprint2022arXiv

TaCL: Improving BERT Pre-training with Token-aware Contrastive Learning

Masked language models (MLMs) such as BERT and RoBERTa have revolutionized the field of Natural Language Understanding in the past few years. However, existing pre-trained MLMs often output an anisotropic distribution of token representations that occupies a narrow subset of the entire representation space. Such token representations are not ideal, especially for tasks that demand discriminative semantic meanings of distinct tokens. In this work, we propose TaCL (Token-aware Contrastive Learning), a novel continual pre-training approach that encourages BERT to learn an isotropic and discriminative distribution of token representations. TaCL is fully unsupervised and requires no additional data. We extensively test our approach on a wide range of English and Chinese benchmarks. The results show that TaCL brings consistent and notable improvements over the original BERT model. Furthermore, we conduct detailed analysis to reveal the merits and inner-workings of our approach.

preprint2021arXiv

Sobolev Orthogonal Polynomials on the Sierpinski Gasket

We develop a theory of Sobolev orthogonal polynomials on the Sierpiński gasket ($SG$). These orthogonal polynomials arise through the Gram-Schmidt orthogonalisation process applied on the set of monomials on $SG$ using several notions of a Sobolev inner products. After establishing some recurrence relations for these orthogonal polynomials, we give estimates for their $L^2$, $L^\infty$ and Sobolev norms, and study their asymptotic behaviour. Finally, we study the properties of zero sets of polynomials and develop fast computational tools to explore applications to quadrature and interpolation.

preprint2020arXiv

Classification of topological phases with finite internal symmetries in all dimensions

We develop a mathematical theory of symmetry protected trivial (SPT) orders and anomaly-free symmetry enriched topological (SET) orders in all dimensions via two different approaches with an emphasis on the second approach. The first approach is to gauge the symmetry in the same dimension by adding topological excitations as it was done in the 2d case, in which the gauging process is mathematically described by the minimal modular extensions of unitary braided fusion 1-categories. This 2d result immediately generalizes to all dimensions except in 1d, which is treated with special care. The second approach is to use the 1-dimensional higher bulk of the SPT/SET order and the boundary-bulk relation. This approach also leads us to a precise mathematical description and a classification of SPT/SET orders in all dimensions. The equivalence of these two approaches, together with known physical results, provides us with many precise mathematical predictions.

preprint2020arXiv

Modeling and Optimization of Latency in Erasure-coded Storage Systems

As consumers are increasingly engaged in social networking and E-commerce activities, businesses grow to rely on Big Data analytics for intelligence, and traditional IT infrastructures continue to migrate to the cloud and edge, these trends cause distributed data storage demand to rise at an unprecedented speed. Erasure coding has seen itself quickly emerged as a promising technique to reduce storage cost while providing similar reliability as replicated systems, widely adopted by companies like Facebook, Microsoft and Google. However, it also brings new challenges in characterizing and optimizing the access latency when erasure codes are used in distributed storage. The aim of this monograph is to provide a review of recent progress (both theoretical and practical) on systems that employ erasure codes for distributed storage. In this monograph, we will first identify the key challenges and taxonomy of the research problems and then give an overview of different approaches that have been developed to quantify and model latency of erasure-coded storage. This includes recent work leveraging MDS-Reservation, Fork-Join, Probabilistic, and Delayed-Relaunch scheduling policies, as well as their applications to characterize access latency (e.g., mean, tail, asymptotic latency) of erasure-coded distributed storage systems. We will also extend the problem to the case when users are streaming videos from erasure-coded distributed storage systems. Next, we bridge the gap between theory and practice, and discuss lessons learned from prototype implementation. In particular, we will discuss exemplary implementations of erasure-coded storage, illuminate key design degrees of freedom and tradeoffs, and summarize remaining challenges in real-world storage systems such as in content delivery and caching. Open problems for future research are discussed at the end of each chapter.

preprint2020arXiv

Multi-task Learning for Low-resource Second Language Acquisition Modeling

Second language acquisition (SLA) modeling is to predict whether second language learners could correctly answer the questions according to what they have learned. It is a fundamental building block of the personalized learning system and has attracted more and more attention recently. However, as far as we know, almost all existing methods cannot work well in low-resource scenarios due to lacking of training data. Fortunately, there are some latent common patterns among different language-learning tasks, which gives us an opportunity to solve the low-resource SLA modeling problem. Inspired by this idea, in this paper, we propose a novel SLA modeling method, which learns the latent common patterns among different language-learning datasets by multi-task learning and are further applied to improving the prediction performance in low-resource scenarios. Extensive experiments show that the proposed method performs much better than the state-of-the-art baselines in the low-resource scenario. Meanwhile, it also obtains improvement slightly in the non-low-resource scenario.

preprint2020arXiv

PONE: A Novel Automatic Evaluation Metric for Open-Domain Generative Dialogue Systems

Open-domain generative dialogue systems have attracted considerable attention over the past few years. Currently, how to automatically evaluate them, is still a big challenge problem. As far as we know, there are three kinds of automatic methods to evaluate the open-domain generative dialogue systems: (1) Word-overlap-based metrics; (2) Embedding-based metrics; (3) Learning-based metrics. Due to the lack of systematic comparison, it is not clear which kind of metrics are more effective. In this paper, we will first measure systematically all kinds of automatic evaluation metrics over the same experimental setting to check which kind is best. Through extensive experiments, the learning-based metrics are demonstrated that they are the most effective evaluation metrics for open-domain generative dialogue systems. Moreover, we observe that nearly all learning-based metrics depend on the negative sampling mechanism, which obtains an extremely imbalanced and low-quality dataset to train a score model. In order to address this issue, we propose a novel and feasible learning-based metric that can significantly improve the correlation with human judgments by using augmented POsitive samples and valuable NEgative samples, called PONE. Extensive experiments demonstrate that our proposed evaluation method significantly outperforms the state-of-the-art learning-based evaluation methods, with an average correlation improvement of 13.18%. In addition, we have publicly released the codes of our proposed method and state-of-the-art baselines.

preprint2020arXiv

Real Entropy Can Also Predict Daily Voice Traffic for Wireless Network Users

Voice traffic prediction is significant for network deployment optimization thus to improve the network efficiency. The real entropy based theorectical bound and corresponding prediction models have demonstrated their success in mobility prediction. In this paper, the real entropy based predictability analysis and prediction models are introduced into voice traffic prediction. For this adoption, the traffic quantification methods is proposed and discussed. Based on the real world voice traffic data, the prediction accuracy of N-order Markov models, diffusion based model and MF model are presented, among which, 25-order Markov models performs best and approach close to the maximum predictability. This work demonstrates that, the real entropy can also predict voice traffic well which broaden the understanding on the real entropy based prediction theory.

preprint2020arXiv

TNT: Target-driveN Trajectory Prediction

Predicting the future behavior of moving agents is essential for real world applications. It is challenging as the intent of the agent and the corresponding behavior is unknown and intrinsically multimodal. Our key insight is that for prediction within a moderate time horizon, the future modes can be effectively captured by a set of target states. This leads to our target-driven trajectory prediction (TNT) framework. TNT has three stages which are trained end-to-end. It first predicts an agent's potential target states $T$ steps into the future, by encoding its interactions with the environment and the other agents. TNT then generates trajectory state sequences conditioned on targets. A final stage estimates trajectory likelihoods and a final compact set of trajectory predictions is selected. This is in contrast to previous work which models agent intents as latent variables, and relies on test-time sampling to generate diverse trajectories. We benchmark TNT on trajectory prediction of vehicles and pedestrians, where we outperform state-of-the-art on Argoverse Forecasting, INTERACTION, Stanford Drone and an in-house Pedestrian-at-Intersection dataset.

preprint2020arXiv

Twin-Finder: Integrated Reasoning Engine for Pointer-related Code Clone Detection

Detecting code clones is crucial in various software engineering tasks. In particular, code clone detection can have significant uses in the context of analyzing and fixing bugs in large scale applications. However, prior works, such as machine learning-based clone detection, may cause a considerable amount of false positives. In this paper, we propose Twin-Finder, a novel, closed-loop approach for pointer-related code clone detection that integrates machine learning and symbolic execution techniques to achieve precision. Twin-Finder introduces a clone verification mechanism to formally verify if two clone samples are indeed clones and a feedback loop to automatically generated formal rules to tune machine learning algorithm and further reduce the false positives. Our experimental results show that Twin-Finder can swiftly identify up 9X more code clones comparing to a tree-based clone detector, Deckard and remove an average 91.69% false positives.

preprint2020arXiv

Which Kind Is Better in Open-domain Multi-turn Dialog,Hierarchical or Non-hierarchical Models? An Empirical Study

Currently, open-domain generative dialog systems have attracted considerable attention in academia and industry. Despite the success of single-turn dialog generation, multi-turn dialog generation is still a big challenge. So far, there are two kinds of models for open-domain multi-turn dialog generation: hierarchical and non-hierarchical models. Recently, some works have shown that the hierarchical models are better than non-hierarchical models under their experimental settings; meanwhile, some works also demonstrate the opposite conclusion. Due to the lack of adequate comparisons, it's not clear which kind of models are better in open-domain multi-turn dialog generation. Thus, in this paper, we will measure systematically nearly all representative hierarchical and non-hierarchical models over the same experimental settings to check which kind is better. Through extensive experiments, we have the following three important conclusions: (1) Nearly all hierarchical models are worse than non-hierarchical models in open-domain multi-turn dialog generation, except for the HRAN model. Through further analysis, the excellent performance of HRAN mainly depends on its word-level attention mechanism; (2) The performance of other hierarchical models will also obtain a great improvement if integrating the word-level attention mechanism into these models. The modified hierarchical models even significantly outperform the non-hierarchical models; (3) The reason why the word-level attention mechanism is so powerful for hierarchical models is because it can leverage context information more effectively, especially the fine-grained information. Besides, we have implemented all of the models and already released the codes.

preprint2019arXiv

Gapped domain walls between 2+1D topologically ordered states

The 2+1D topological order can be characterized by the mapping-class-group representations for Riemann surfaces of genus-1, genus-2, etc. In this paper, we use those representations to determine the possible gapped boundaries of a 2+1D topological order, as well as the domain walls between two topological orders. We find that mapping-class-group representations for both genus-1 and genus-2 surfaces are needed to determine the gapped domain walls and boundaries. Our systematic theory is based on the fixed-point partition functions for the walls (or the boundaries), which completely characterize the gapped domain walls (or the boundaries). The mapping-class-group representations give rise to conditions that must be satisfied by the fixed-point partition functions, which leads to a systematic theory. Such conditions can be viewed as bulk topological order determining the (non-invertible) gravitational anomaly at the domain wall, and our theory can be viewed as finding all types of the gapped domain wall given a (non-invertible) gravitational anomaly. We also developed a systematic theory of gapped domain walls (boundaries) based on the structure coefficients of condensable algebras.

preprint2018arXiv

Fermion decoration construction of symmetry protected trivial orders for fermion systems with any symmetries $G_f$ and in any dimensions

We use higher dimensional bosonization and fermion decoration to construct exactly soluble interacting fermion models to realize fermionic symmetry protected trivial (SPT) orders (which are also known as symmetry protected topological orders) in any dimensions and for generic fermion symmetries $G_f$, which can be a non-trivial $Z_2^f$ extension (where $Z_2^f$ is the fermion-number-parity symmetry). This generalizes the previous results from group superconhomology of Gu and Wen (arXiv:1201.2648), where $G_f$ is assumed to be a trivial $Z_2^f$ extension. We find that the SPT phases from fermion decoration construction can be described in a compact way using higher groups.

preprint2016arXiv

Spatio-temporal Aware Non-negative Component Representation for Action Recognition

This paper presents a novel mid-level representation for action recognition, named spatio-temporal aware non-negative component representation (STANNCR). The proposed STANNCR is based on action component and incorporates the spatial-temporal information. We first introduce a spatial-temporal distribution vector (STDV) to model the distributions of local feature locations in a compact and discriminative manner. Then we employ non-negative matrix factorization (NMF) to learn the action components and encode the video samples. The action component considers the correlations of visual words, which effectively bridge the sematic gap in action recognition. To incorporate the spatial-temporal cues for final representation, the STDV is used as the part of graph regularization for NMF. The fusion of spatial-temporal information makes the STANNCR more discriminative, and our fusion manner is more compact than traditional method of concatenating vectors. The proposed approach is extensively evaluated on three public datasets. The experimental results demonstrate the effectiveness of STANNCR for action recognition.

preprint2015arXiv

A theory of 2+1D fermionic topological orders and fermionic/bosonic topological orders with symmetries

We propose that, up to invertible topological orders, 2+1D fermionic topological orders without symmetry and 2+1D fermionic/bosonic topological orders with symmetry $G$ are classified by non-degenerate unitary braided fusion categories (UBFC) over a symmetric fusion category (SFC); the SFC describes a fermionic product state without symmetry or a fermionic/bosonic product state with symmetry $G$, and the UBFC has a modular extension. We developed a simplified theory of non-degenerate UBFC over a SFC based on the fusion coefficients $N^{ij}_k$ and spins $s_i$. This allows us to obtain a list that contains all 2+1D fermionic topological orders (without symmetry). We find explicit realizations for all the fermionic topological orders in the table. For example, we find that, up to invertible $p+\hspace{1pt}\mathrm{i}\hspace{1pt} p$ fermionic topological orders, there are only four fermionic topological orders with one non-trivial topological excitation: (1) the $K={\scriptsize \begin{pmatrix} -1&0\\0&2\end{pmatrix}}$ fractional quantum Hall state, (2) a Fibonacci bosonic topological order $2^B_{14/5}$ stacking with a fermionic product state, (3) the time-reversal conjugate of the previous one, (4) a primitive fermionic topological order that has a chiral central charge $c=\frac14$, whose only topological excitation has a non-abelian statistics with a spin $s=\frac14$ and a quantum dimension $d=1+\sqrt{2}$. We also proposed a categorical way to classify 2+1D invertible fermionic topological orders using modular extensions.

preprint2015arXiv

Action Recognition by Hierarchical Mid-level Action Elements

Realistic videos of human actions exhibit rich spatiotemporal structures at multiple levels of granularity: an action can always be decomposed into multiple finer-grained elements in both space and time. To capture this intuition, we propose to represent videos by a hierarchy of mid-level action elements (MAEs), where each MAE corresponds to an action-related spatiotemporal segment in the video. We introduce an unsupervised method to generate this representation from videos. Our method is capable of distinguishing action-related segments from background segments and representing actions at multiple spatiotemporal resolutions. Given a set of spatiotemporal segments generated from the training data, we introduce a discriminative clustering algorithm that automatically discovers MAEs at multiple levels of granularity. We develop structured models that capture a rich set of spatial, temporal and hierarchical relations among the segments, where the action label and multiple levels of MAE labels are jointly inferred. The proposed model achieves state-of-the-art performance in multiple action recognition benchmarks. Moreover, we demonstrate the effectiveness of our model in real-world applications such as action recognition in large-scale untrimmed videos and action parsing.

preprint2015arXiv

Advanced Post-Processing Techniques of Molecular Dynamics Simulations in Studying Strong Anharmonic Thermodynamics of Solids

While the vibrational thermodynamics of materials with small anharmonicity at low temperatures has been understood well based on the harmonic phonons approximation; at high temperatures, this understanding must accommodate how phonons interact with other phonons or with other excitations. We shall see that the phonon-phonon interactions give rise to interesting coupling problems, and essentially modify the equilibrium and non-equilibrium properties of materials, e.g., thermal expansion, thermodynamic stability, heat capacity, optical properties, thermal transport and other nonlinear properties of materials. To date the anharmonic lattice dynamics is poorly understood despite its great importance, and most studies on lattice dynamics still rely on the harmonic or quasiharmonic models. With recent developement of computational models, the anharmonic information can be extracted from the atomic trajectories of molecular dynamics simulations. For example, the vibrational energy spectra, the effective potential energy surface and the phonon-phonon interaction channels can be derived from these trajectories which appear stochastic. These inter-dependent methods are adopted to successfully uncover the strong anharmonic phenomena while the traditional harmonic models fail dramatically, e.g., the negative thermal expansion of cuprite and the high temperature thermal stability of rutile.

preprint2015arXiv

Phonon quarticity induced by changes in phonon-tracked hybridization during lattice expansion and its stabilization of rutile TiO$_2$

Although the rutile structure of TiO$_2$ is stable at high temperatures, the conventional quasiharmonic approximation predicts that several acoustic phonons decrease anomalously to zero frequency with thermal expansion, incorrectly predicting a structural collapse at temperatures well below 1000\,K. Inelastic neutron scattering was used to measure the temperature dependence of the phonon density of states (DOS) of rutile TiO$_2$ from 300 to 1373\,K. Surprisingly, these anomalous acoustic phonons were found to increase in frequency with temperature. First-principles calculations showed that with lattice expansion, the potentials for the anomalous acoustic phonons transform from quadratic to quartic, stabilizing the rutile phase at high temperatures. In these modes, the vibrational displacements of adjacent Ti and O atoms cause variations in hybridization of $3d$ electrons of Ti and $2p$ electrons of O atoms. With thermal expansion, the energy variation in this "phonon-tracked hybridization" flattens the bottom of the interatomic potential well between Ti and O atoms, and induces a quarticity in the phonon potential.

preprint2014arXiv

Anharmonic lattice dynamics of Ag$_2$O studied by inelastic neutron scattering and first principles molecular dynamics simulations

Inelastic neutron scattering measurements on silver oxide (Ag$_2$O) with the cuprite structure were performed at temperatures from 40 to 400\,K, and Fourier transform far-infrared spectra were measured from 100 to 300\,K. The measured phonon densities of states and the infrared spectra showed unusually large energy shifts with temperature, and large linewidth broadenings. First principles molecular dynamics (MD) calculations were performed at various temperatures, successfully accounting for the negative thermal expansion (NTE) and local dynamics. Using the Fourier-transformed velocity autocorrelation method, the MD calculations reproduced the large anharmonic effects of Ag$_2$O, and were in excellent agreement with the neutron scattering data. The quasiharmonic approximation (QHA) was less successful in accounting for much of the phonon behavior. The QHA could account for some of the NTE below 250 K, although not at higher temperatures. Strong anharmonic effects were found for both phonons and for the NTE. The lifetime broadenings of Ag$_2$O were explained by anharmonic perturbation theory, which showed rich interactions between the Ag-dominated modes and the O-dominated modes in both up- and down-conversion processes.

preprint2014arXiv

Gapped Domain Walls, Gapped Boundaries and Topological Degeneracy

Gapped domain walls, as topological line defects between 2+1D topologically ordered states, are examined. We provide simple criteria to determine the existence of gapped domain walls, which apply to both Abelian and non-Abelian topological orders. Our criteria also determine which 2+1D topological orders must have gapless edge modes, namely which 1+1D global gravitational anomalies ensure gaplessness. Furthermore, we introduce a new mathematical object, the tunneling matrix $\mathcal W$, whose entries are the fusion-space dimensions $\mathcal W_{ia}$, to label different types of gapped domain walls. By studying many examples, we find evidence that the tunneling matrices are powerful quantities to classify different types of gapped domain walls. Since a gapped boundary is a gapped domain wall between a bulk topological order and the vacuum, regarded as the trivial topological order, our theory of gapped domain walls inclusively contains the theory of gapped boundaries. In addition, we derive a topological ground state degeneracy formula, applied to arbitrary orientable spatial 2-manifolds with gapped domain walls, including closed 2-manifolds and open 2-manifolds with gapped boundaries.

preprint2014arXiv

Topological quasiparticles and the holographic bulk-edge relation in 2+1D string-net models

String-net models allow us to systematically construct and classify 2+1D topologically ordered states which can have gapped boundaries. We can use a simple ideal string-net wavefunction, which is described by a set of F-matrices [or more precisely, a unitary fusion category (UFC)], to study all the universal properties of such a topological order. In this paper, we describe a finite computational method -- Q-algebra approach, that allows us to compute the non-Abelian statistics of the topological excitations [or more precisely, the unitary modular tensor category (UMTC)], from the string-net wavefunction (or the UFC). We discuss several examples, including the topological phases described by twisted gauge theory (i.e., twisted quantum double $D^α(G)$). Our result can also be viewed from an angle of holographic bulk-boundary relation. The 1+1D anomalous topological orders, that can appear as edges of 2+1D topological states, are classified by UFCs which describe the fusion of quasiparticles in 1+1D. The 1+1D anomalous edge topological order uniquely determines the 2+1D bulk topological order (which are classified by UMTC). Our method allows us to compute this bulk topological order (i.e., the UMTC) from the anomalous edge topological order (i.e., the UFC).

preprint2012arXiv

Cosmic Duality and Statefinder Diagnosis of Spinor Quintom

In this paper, we study the possible connections among different Spinor Quintom Dark Energy (DE) models by the aid of duality. Then we apply the statefinder diagnostic to these models. By this diagnostic pair {$\{r,s\}$}, we differentiate one Quintom DE model from the others in a model independent manner. A class of evolutionary trajectories of these Spinor Quintom models are presented in the statefinder parameter planes. We also obtain the current locations of the parameters $r$ and $s$, and these locations correspond to different models in statefinder parameter planes theoretically.

preprint2011arXiv

Constraints on the Dark Side of the Universe and Observational Hubble Parameter Data

This paper is a review on the observational Hubble parameter data that have gained increasing attention in recent years for their illuminating power on the dark side of the universe --- the dark matter, dark energy, and the dark age. Currently, there are two major methods of independent observational H(z) measurement, which we summarize as the "differential age method" and the "radial BAO size method". Starting with fundamental cosmological notions such as the spacetime coordinates in an expanding universe, we present the basic principles behind the two methods. We further review the two methods in greater detail, including the source of errors. We show how the observational H(z) data presents itself as a useful tool in the study of cosmological models and parameter constraint, and we also discuss several issues associated with their applications. Finally, we point the reader to a future prospect of upcoming observation programs that will lead to some major improvements in the quality of observational H(z) data.

preprint2010arXiv

Constraints on smoothness parameter and dark energy using observational $H(z)$ data

The universe, with large-scale homogeneity, is locally inhomogeneous, clustering into stars, galaxies and larger structures. Such property is described by the smoothness parameter $α$ which is defined as the proportion of matter in the form of intergalactic medium. If we take consideration of the inhomogeneities in small scale, there should be modifications of the cosmological distances compared to a homogenous model. Dyer and Roeder developed a second-order ordinary differential equation (D-R equation) that describes the angular diameter distance-redshift relation for inhomogeneous cosmological models. Furthermore, we may obtain the D-R equation for observational $H(z)$ data (OHD). The density-parameter $Ω_{\rm M}$, the state of dark energy $ω$, and the smoothness-parameter $α$ are constrained by a set of OHD in a spatially flat $Λ$CDM universe as well as a spatially flat XCDM universe. By using of $χ^2$ minimization method we get $α=0.81^{+0.19}_{-0.20}$ and $Ω_{\rm M}=0.32^{+0.12}_{-0.06}$ at $1σ$ confidence level. If we assume a Gaussian prior of $Ω_{\rm M}=0.26\pm0.1$, we get $α=0.93^{+0.07}_{-0.19}$ and $Ω_{\rm M}=0.31^{+0.06}_{-0.05}$. For XCDM model, $α$ is constrained to $α\geq0.80$ but $ω$ is weakly constrained around -1, where $ω$ describes the equation of the state of the dark energy ($p_{\rm X}=ωρ_{\rm X}$). We conclude that OHD constrains the smoothness parameter more effectively than the data of SNe Ia and compact radio sources.

preprint2010arXiv

Cosmological Constraints on the Undulant Universe

We use the redshift Hubble parameter $H(z)$ data derived from relative galaxy ages, distant type Ia supernovae (SNe Ia), the Baryonic Acoustic Oscillation (BAO) peak, and the Cosmic Microwave Background (CMB) shift parameter data, to constrain cosmological parameters in the Undulant Universe. We marginalize the likelihood functions over $h$ by integrating the probability density $P\propto e^{-χ^2/2}$. By using the Markov Chain Monte Carlo (MCMC) technique, we obtain the best fitting results and give the confidence regions on the $b-Ω_{\rm m0}$ plane. Then we compare their constraints. Our results show that the $H(z)$ data play a similar role with the SNe Ia data in cosmological study. By presenting the independent and joint constraints, we find that the BAO and CMB data play very important roles in breaking the degeneracy compared with the $H(z)$ and SNe Ia data alone. Combined with the BAO or CMB data, one can improve the constraints remarkably. The SNe Ia data sets constrain $Ω_{\rm m0}$ much tighter than the $H(z)$ data sets, but the $H(z)$ data sets constrain $b$ much tighter than the SNe Ia data sets. All these results show that the Undulant Universe approaches the $Λ\rm$CDM model. We expect more $H(z)$ data to constrain cosmological parameters in future.

preprint2010arXiv

Thermodynamical properties of the Undulant Universe

Recent observations show that our universe is accelerating by dark energy, so it is important to investigate the thermodynamical properties of it. The Undulant Universe is a model with equation of state $ω(a)=-\cos(b\ln a)$ for dark energy, where we show that there neither the event horizon nor the particle horizon exists. However, as a boundary of keeping thermodynamical properties, the apparent horizon is a good holographic screen. The Universe has a thermal equilibrium inside the apparent horizon, so the Unified First Law and the Generalized Second Law of thermodynamics are satisfied. As a thermodynamical whole, the evolution of the Undulant Universe behaves very well in the current phase. However, when considering the unification theory, the failure of conversation law at the epoch of the matter dominated or near singularity need some more consideration for the form of the Undulant Universe.

Tian Lan

What is connected

Connect this record

See the researcher in context

Building this map preview

36 published item(s)

Metric-Gradient Projection for Stable Multi-Agent Policy Learning

NonZero: Interaction-Guided Exploration for Multi-Agent Monte Carlo Tree Search

Ultrafast Exciton-Polariton Transport and Relaxation in Halide Perovskite

TWR-MCAE: A Data Augmentation Method for Through-the-Wall Radar Human Motion Recognition

A Framework for Server Authentication using Communication Protocol Dialects

Cross-Lingual Phrase Retrieval

Efficient Video Instance Segmentation via Tracklet Query and Proposal

Exploring Dense Retrieval for Dialogue Response Selection

Language Models Can See: Plugging Visual Controls in Text Generation

On the Convergence of Heterogeneous Federated Learning with Arbitrary Adaptive Online Model Pruning

SAFARI: Sparsity enabled Federated Learning with Limited and Unreliable Communications

TaCL: Improving BERT Pre-training with Token-aware Contrastive Learning

Sobolev Orthogonal Polynomials on the Sierpinski Gasket

Classification of topological phases with finite internal symmetries in all dimensions

Modeling and Optimization of Latency in Erasure-coded Storage Systems

Multi-task Learning for Low-resource Second Language Acquisition Modeling

PONE: A Novel Automatic Evaluation Metric for Open-Domain Generative Dialogue Systems

Real Entropy Can Also Predict Daily Voice Traffic for Wireless Network Users

TNT: Target-driveN Trajectory Prediction

Twin-Finder: Integrated Reasoning Engine for Pointer-related Code Clone Detection

Which Kind Is Better in Open-domain Multi-turn Dialog,Hierarchical or Non-hierarchical Models? An Empirical Study

Gapped domain walls between 2+1D topologically ordered states

Fermion decoration construction of symmetry protected trivial orders for fermion systems with any symmetries $G_f$ and in any dimensions

Spatio-temporal Aware Non-negative Component Representation for Action Recognition

A theory of 2+1D fermionic topological orders and fermionic/bosonic topological orders with symmetries

Action Recognition by Hierarchical Mid-level Action Elements

Advanced Post-Processing Techniques of Molecular Dynamics Simulations in Studying Strong Anharmonic Thermodynamics of Solids

Phonon quarticity induced by changes in phonon-tracked hybridization during lattice expansion and its stabilization of rutile TiO$_2$

Anharmonic lattice dynamics of Ag$_2$O studied by inelastic neutron scattering and first principles molecular dynamics simulations

Gapped Domain Walls, Gapped Boundaries and Topological Degeneracy

Topological quasiparticles and the holographic bulk-edge relation in 2+1D string-net models

Cosmic Duality and Statefinder Diagnosis of Spinor Quintom

Constraints on the Dark Side of the Universe and Observational Hubble Parameter Data

Constraints on smoothness parameter and dark energy using observational $H(z)$ data

Cosmological Constraints on the Undulant Universe

Thermodynamical properties of the Undulant Universe