Source author record

Li Tao

Li Tao appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Computer Vision cond-mat.mtrl-sci cond-mat.str-el Machine Learning Multimedia physics.app-ph physics.chem-ph physics.optics

Catalog footprint

What is connected

8works

8topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2022arXiv

Transmission-matrix Quantitative Phase Profilometry for Accurate and Fast Thickness Mapping of 2D Materials

The physical properties of two-dimensional (2D) materials may drastically vary with their thickness profiles. Current thickness profiling methods for 2D material (e.g., atomic force microscopy and ellipsometry) are limited in measurement throughput and accuracy. Here we present a novel high-speed and high-precision thickness profiling method, termed Transmission-Matrix Quantitative Phase Profilometry (TM-QPP). In TM-QPP, picometer-level optical pathlength sensitivity is enabled by extending the photon shot-noise limit of a high sensitivity common-path interferometric microscopy technique, while accurate thickness determination is realized by developing a transmission-matrix model that accounts for multiple refractions and reflections of light at sample interfaces. Using TM-QPP, the exact thickness profiles of monolayer and few-layered 2D materials (e.g., MoS2, MoSe2 and WSe2) are mapped over a wide field of view within seconds in a contact-free manner. Notably, TM-QPP is also capable of spatially resolving the number of layers of few-layered 2D materials.

preprint2020arXiv

Motion Representation Using Residual Frames with 3D CNN

Recently, 3D convolutional networks (3D ConvNets) yield good performance in action recognition. However, optical flow stream is still needed to ensure better performance, the cost of which is very high. In this paper, we propose a fast but effective way to extract motion features from videos utilizing residual frames as the input data in 3D ConvNets. By replacing traditional stacked RGB frames with residual ones, 35.6% and 26.6% points improvements over top-1 accuracy can be obtained on the UCF101 and HMDB51 datasets when ResNet-18 models are trained from scratch. And we achieved the state-of-the-art results in this training mode. Analysis shows that better motion features can be extracted using residual frames compared to RGB counterpart. By combining with a simple appearance path, our proposal can be even better than some methods using optical flow streams.

preprint2020arXiv

Rethinking Motion Representation: Residual Frames with 3D ConvNets for Better Action Recognition

Recently, 3D convolutional networks yield good performance in action recognition. However, optical flow stream is still needed to ensure better performance, the cost of which is very high. In this paper, we propose a fast but effective way to extract motion features from videos utilizing residual frames as the input data in 3D ConvNets. By replacing traditional stacked RGB frames with residual ones, 20.5% and 12.5% points improvements over top-1 accuracy can be achieved on the UCF101 and HMDB51 datasets when trained from scratch. Because residual frames contain little information of object appearance, we further use a 2D convolutional network to extract appearance features and combine them with the results from residual frames to form a two-path solution. In three benchmark datasets, our two-path solution achieved better or comparable performances than those using additional optical flow methods, especially outperformed the state-of-the-art models on Mini-kinetics dataset. Further analysis indicates that better motion features can be extracted using residual frames with 3D ConvNets, and our residual-frame-input path is a good supplement for existing RGB-frame-input models.

preprint2020arXiv

Self-supervised Video Representation Learning Using Inter-intra Contrastive Framework

We propose a self-supervised method to learn feature representations from videos. A standard approach in traditional self-supervised methods uses positive-negative data pairs to train with contrastive learning strategy. In such a case, different modalities of the same video are treated as positives and video clips from a different video are treated as negatives. Because the spatio-temporal information is important for video representation, we extend the negative samples by introducing intra-negative samples, which are transformed from the same anchor video by breaking temporal relations in video clips. With the proposed Inter-Intra Contrastive (IIC) framework, we can train spatio-temporal convolutional networks to learn video representations. There are many flexible options in our IIC framework and we conduct experiments by using several different configurations. Evaluations are conducted on video retrieval and video recognition tasks using the learned video representation. Our proposed IIC outperforms current state-of-the-art results by a large margin, such as 16.7% and 9.5% points improvements in top-1 accuracy on UCF101 and HMDB51 datasets for video retrieval, respectively. For video recognition, improvements can also be obtained on these two benchmark datasets. Code is available at https://github.com/BestJuly/Inter-intra-video-contrastive-learning.

preprint2020arXiv

Weakly Supervised Video Summarization by Hierarchical Reinforcement Learning

Conventional video summarization approaches based on reinforcement learning have the problem that the reward can only be received after the whole summary is generated. Such kind of reward is sparse and it makes reinforcement learning hard to converge. Another problem is that labelling each frame is tedious and costly, which usually prohibits the construction of large-scale datasets. To solve these problems, we propose a weakly supervised hierarchical reinforcement learning framework, which decomposes the whole task into several subtasks to enhance the summarization quality. This framework consists of a manager network and a worker network. For each subtask, the manager is trained to set a subgoal only by a task-level binary label, which requires much fewer labels than conventional approaches. With the guide of the subgoal, the worker predicts the importance scores for video frames in the subtask by policy gradient according to both global reward and innovative defined sub-rewards to overcome the sparse problem. Experiments on two benchmark datasets show that our proposal has achieved the best performance, even better than supervised approaches.

preprint2015arXiv

Mott-Kondo Insulator Behavior in the Iron Oxychalcogenides

We perform a combined experimental-theoretical study of the Fe-oxychalcogenides (FeO$\emph{Ch}$) series La$_{2}$O$_{2}$Fe$_{2}$O\emph{M}$_{2}$ (\emph{M}=S, Se), which is the latest among the Fe-based materials with the potential \ to show unconventional high-T$_{c}$ superconductivity (HTSC). A combination of incoherent Hubbard features in X-ray absorption (XAS) and resonant inelastic X-ray scattering (RIXS) spectra, as well as resitivity data, reveal that the parent FeO$\emph{Ch}$ are correlation-driven insulators. To uncover microscopics underlying these findings, we perform local density approximation-plus-dynamical mean field theory (LDA+DMFT) calculations that unravel a Mott-Kondo insulating state. Based upon good agreement between theory and a range of data, we propose that FeO$\emph{Ch}$ may constitute a new, ideal testing ground to explore HTSC arising from a strange metal proximate to a novel selective-Mott quantum criticality.

preprint2014arXiv

Toward Air-Stable Multilayer Phosphorene Thin-Films and Transistors

Few-layer black phosphorus (BP), also known as phosphorene, is poised to be the most attractive graphene analogue owing to its high mobility approaching that of graphene, and its thickness- tunable band gap that can be as large as that of molybdenum disulfide. In essence, phosphorene represents the much sought after high-mobility, large direct band gap two-dimensional layered crystal that is ideal for optoelectronics and flexible devices. However, its instability in air is of paramount concern for practical applications. Here, we demonstrate air-stable BP devices with dielectric and hydrophobic encapsulation. Microscopy, spectroscopy, and transport techniques were employed to elucidate the aging mechanism, which can initiate from the BP surface for bare samples, or edges for samples with thin dielectric coating highlighting the ineffectiveness of conventional scaled dielectrics. Our pioneering months-long studies indicate that a double layer of Al2O3 and hydrophobic fluoropolymer affords BP devices and transistors with indefinite air-stability for the first time, overcoming a critical material challenge for applied research and development.

preprint2012arXiv

Uniform wafer-scale synthesis of graphene on evaporated Cu (111) film with quality comparable to exfoliated monolayer

Monolayer graphene has been grown on crystallized Cu (111) films on standard oxidized Si 100 mm wafers. The monolayer graphene demonstrates high uniformity (>97% coverage), with immeasurable defects (>95% defect-negligible) across the entire wafer. Key to these results is the phase transition of evaporated copper films from amorphous to crystalline at the growth temperature as corroborated by X-ray diffraction and electron backscatter diffraction. Noticeably, phase transition of copper film is observed on technologically ubiquitous oxidized Si wafer where the oxide is a standard amorphous thermal oxide. Ion mass spectroscopy indicates that the copper films can be purposely hydrogen-enriched during a hydrogen anneal which subsequently affords graphene growth with a sole carbonaceous precursor for low defect densities. Owing to the strong hexagonal lattice match, the graphene domains align to the Cu (111) domains, suggesting a pathway for increasing the graphene grains by maximizing the copper grain sizes. Fabricated graphene transistors on a flexible polyimide film yield a peak carrier mobility ~4,930 cm2/Vs.

Li Tao

What is connected

Connect this record

See the researcher in context

Building this map preview

8 published item(s)

Transmission-matrix Quantitative Phase Profilometry for Accurate and Fast Thickness Mapping of 2D Materials

Motion Representation Using Residual Frames with 3D CNN

Rethinking Motion Representation: Residual Frames with 3D ConvNets for Better Action Recognition

Self-supervised Video Representation Learning Using Inter-intra Contrastive Framework

Weakly Supervised Video Summarization by Hierarchical Reinforcement Learning

Mott-Kondo Insulator Behavior in the Iron Oxychalcogenides

Toward Air-Stable Multilayer Phosphorene Thin-Films and Transistors

Uniform wafer-scale synthesis of graphene on evaporated Cu (111) film with quality comparable to exfoliated monolayer