Researcher profile

Long Qian

Long Qian contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 17 - UnverifiedVerification L1Unclaimed author
4works
0followers
4topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

4 published item(s)

preprint2026arXiv

Microwave vortex beam lasing via photonic time crystals

Microwave lasing carrying orbital angular momentum (OAM) holds significant potential for advanced applications in fields such as high-capacity communications, precision sensing, and radar imaging. However, conventional approaches to masers fail to produce emission with embedded OAM. The recent emergence of photonic time crystals (PTCs)-artificially structured media with periodically varying electromagnetic properties in time-offers a paradigm shift toward resonance-free lasing without the need for gain media. Yet, pioneering PTC designs have been based on three-dimensional bulk structures, which lack a surface-emitting configuration, and do not possess the capability to modulate OAM, thus hindering the realization of surface-emitted PTC masing that carries OAM. Here, we report the first experimental demonstration of non-resonant, gain medium-free, and surface-emitted microwave vortex beam lasing OAM using ring-shaped PTCs. By developing a multiplier-driven time-varying metamaterial that achieves over 100% equivalent permittivity modulation depth, we establish momentum bandgaps (k gaps) with sufficient bandwidth to overcome intrinsic losses and enable self-sustained coherent microwave amplification. Furthermore, space-time modulation induces non-reciprocity between clockwise and counterclockwise k gap modes within the circularly symmetric PTC structure, facilitating the selective generation of microwave lasing carrying OAM-a capability beyond the reach of conventional maser technologies. Our work bridges PTC physics with coherent OAM-carrying microwave emission, establishing a transformative platform for next-generation wireless communications, advanced sensing systems, and OAM-based technologies.

preprint2022arXiv

Compositional Temporal Grounding with Structured Variational Cross-Graph Correspondence Learning

Temporal grounding in videos aims to localize one target video segment that semantically corresponds to a given query sentence. Thanks to the semantic diversity of natural language descriptions, temporal grounding allows activity grounding beyond pre-defined classes and has received increasing attention in recent years. The semantic diversity is rooted in the principle of compositionality in linguistics, where novel semantics can be systematically described by combining known words in novel ways (compositional generalization). However, current temporal grounding datasets do not specifically test for the compositional generalizability. To systematically measure the compositional generalizability of temporal grounding models, we introduce a new Compositional Temporal Grounding task and construct two new dataset splits, i.e., Charades-CG and ActivityNet-CG. Evaluating the state-of-the-art methods on our new dataset splits, we empirically find that they fail to generalize to queries with novel combinations of seen words. To tackle this challenge, we propose a variational cross-graph reasoning framework that explicitly decomposes video and language into multiple structured hierarchies and learns fine-grained semantic correspondence among them. Experiments illustrate the superior compositional generalizability of our approach. The repository of this work is at https://github.com/YYJMJC/ Compositional-Temporal-Grounding.

preprint2022arXiv

Descriptive Combinatorics, Computable Combinatorics, and ASI Algorithms

We introduce new types of local algorithms, which we call "ASI Algorithms", and use them to demonstrate a link between descriptive and computable combinatorics. This allows us to unify arguments from the two fields, and also sometimes to port arguments from one field to the other. As an example, we generalize a computable combinatorics result of Kierstead and use it to get within one color of the Baire measurable analogue of Vizing's Theorem. We also improve Kierstead's result for multigraphs along the way.

preprint2022arXiv

Dilated Context Integrated Network with Cross-Modal Consensus for Temporal Emotion Localization in Videos

Understanding human emotions is a crucial ability for intelligent robots to provide better human-robot interactions. The existing works are limited to trimmed video-level emotion classification, failing to locate the temporal window corresponding to the emotion. In this paper, we introduce a new task, named Temporal Emotion Localization in videos~(TEL), which aims to detect human emotions and localize their corresponding temporal boundaries in untrimmed videos with aligned subtitles. TEL presents three unique challenges compared to temporal action localization: 1) The emotions have extremely varied temporal dynamics; 2) The emotion cues are embedded in both appearances and complex plots; 3) The fine-grained temporal annotations are complicated and labor-intensive. To address the first two challenges, we propose a novel dilated context integrated network with a coarse-fine two-stream architecture. The coarse stream captures varied temporal dynamics by modeling multi-granularity temporal contexts. The fine stream achieves complex plots understanding by reasoning the dependency between the multi-granularity temporal contexts from the coarse stream and adaptively integrates them into fine-grained video segment features. To address the third challenge, we introduce a cross-modal consensus learning paradigm, which leverages the inherent semantic consensus between the aligned video and subtitle to achieve weakly-supervised learning. We contribute a new testing set with 3,000 manually-annotated temporal boundaries so that future research on the TEL problem can be quantitatively evaluated. Extensive experiments show the effectiveness of our approach on temporal emotion localization. The repository of this work is at https://github.com/YYJMJC/Temporal-Emotion-Localization-in-Videos.