Researcher profile

Jiaojiao Zhao

Jiaojiao Zhao contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 15 - UnverifiedVerification L1Unclaimed author
3works
0followers
2topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

3 published item(s)

preprint2026arXiv

Fine-Grained Generalization via Structuralizing Concept and Feature Space into Commonality, Specificity and Confounding

Fine-Grained Domain Generalization (FGDG) presents greater challenges than conventional domain generalization due to the subtle inter-class differences and relatively pronounced intra-class variations inherent in fine-grained recognition tasks. Under domain shifts, the model becomes overly sensitive to fine-grained cues, leading to the suppression of critical features and a significant drop in performance. Cognitive studies suggest that humans classify objects by leveraging both common and specific attributes, enabling accurate differentiation between fine-grained categories. However, current deep learning models have yet to incorporate this mechanism effectively. Inspired by this mechanism, we propose Concept-Feature Structuralized Generalization (CFSG). This model explicitly disentangles both the concept and feature spaces into three structured components: common, specific, and confounding segments. To mitigate the adverse effects of varying degrees of distribution shift, we introduce an adaptive mechanism that dynamically adjusts the proportions of common, specific, and confounding components. In the final prediction, explicit weights are assigned to each pair of components. Extensive experiments on three single-source benchmark datasets demonstrate that CFSG achieves an average performance improvement of 9.87% over baseline models and outperforms existing state-of-the-art methods by an average of 3.08%. Additionally, explainability analysis validates that CFSG effectively integrates multi-granularity structured knowledge and confirms that feature structuralization facilitates the emergence of concept structuralization.

preprint2022arXiv

TubeR: Tubelet Transformer for Video Action Detection

We propose TubeR: a simple solution for spatio-temporal video action detection. Different from existing methods that depend on either an off-line actor detector or hand-designed actor-positional hypotheses like proposals or anchors, we propose to directly detect an action tubelet in a video by simultaneously performing action localization and recognition from a single representation. TubeR learns a set of tubelet-queries and utilizes a tubelet-attention module to model the dynamic spatio-temporal nature of a video clip, which effectively reinforces the model capacity compared to using actor-positional hypotheses in the spatio-temporal space. For videos containing transitional states or scene changes, we propose a context aware classification head to utilize short-term and long-term context to strengthen action classification, and an action switch regression head for detecting the precise temporal action extent. TubeR directly produces action tubelets with variable lengths and even maintains good results for long video clips. TubeR outperforms the previous state-of-the-art on commonly used action detection datasets AVA, UCF101-24 and JHMDB51-21.

preprint2021arXiv

Spatially indirect intervalley excitons in bilayer WSe2

Spatially indirect excitons with displaced wavefunctions of electrons and holes play a pivotal role in a large portfolio of fascinating physical phenomena and emerging optoelectronic applications, such as valleytronics, exciton spin Hall effect, excitonic integrated circuit and high-temperature superfluidity. Here, we uncover three types of spatially indirect excitons (including their phonon replicas) and their quantum-confined Stark effects in hexagonal boron nitride encapsulated bilayer WSe2, by performing electric field-tunable photoluminescence measurements. Because of different out-of-plane electric dipole moments, the energy order between the three types of spatially indirect excitons can be switched by a vertical electric field. Remarkably, we demonstrate, assisted by first-principles calculations, that the observed spatially indirect excitons in bilayer WSe2 are also momentum-indirect, involving electrons and holes from Q and K/Γ valleys in the Brillouin zone, respectively. This is in contrast to the previously reported spatially indirect excitons with electrons and holes localized in the same valley. Furthermore, we find that the spatially indirect intervalley excitons in bilayer WSe2 can exhibit considerable, doping-sensitive circular polarization. The spatially indirect excitons with momentum-dark nature and highly tunable circular polarization open new avenues for exotic valley physics and technological innovations in photonics and optoelectronics.