Source author record

Tao Gong

Tao Gong appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Computer Vision physics.optics Artificial Intelligence Computer Science and Game Theory cond-mat.stat-mech eess.SP Machine Learning Multiagent Systems physics.app-ph physics.soc-ph Populations and Evolution quant-ph

Catalog footprint

What is connected

8works

12topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

SAPL: Semantic-Agnostic Prompt Learning in CLIP for Weakly Supervised Image Manipulation Localization

Malicious image manipulation threatens public safety and requires efficient localization methods. Existing approaches depend on costly pixel-level annotations which make training expensive. Existing weakly supervised methods rely only on image-level binary labels and focus on global classification, often overlooking local edge cues that are critical for precise localization. We observe that feature variations at manipulated boundaries are substantially larger than in interior regions. To address this gap, we propose Semantic-Agnostic Prompt Learning (SAPL) in CLIP, which learns text prompts that intentionally encode non-semantic, boundary-centric cues so that CLIPs multimodal similarity highlights manipulation edges rather than high-level object semantics. SAPL combines two complementary modules Edge-aware Contextual Prompt Learning (ECPL) and Hierarchical Edge Contrastive Learning (HECL) to exploit edge information in both textual and visual spaces. The proposed ECPL leverages edge-enhanced image features to generate learnable textual prompts via an attention mechanism, embedding semantic-irrelevant information into text features, to guide CLIP focusing on manipulation edges. The proposed HECL extract genuine and manipulated edge patches, and utilize contrastive learning to boost the discrimination between genuine edge patches and manipulated edge patches. Finally, we predict the manipulated regions from the similarity map after processing. Extensive experiments on multiple public benchmarks demonstrate that SAPL significantly outperforms existing approaches, achieving state-of-the-art localization performance.

preprint2022arXiv

Radiative energy bandgap of nanostructures coupled with quantum emitters around the epsilon-near-zero (ENZ) frequency

Epsilon-near-zero (ENZ) materials have been demonstrated to exhibit unique electromagnetic properties. Here we propose the concept of radiative energy bandgap for an ENZ nanoparticle coupled with a quantum emitter (QE). The radiative emission of the coupled QE-nanoparticle can be significantly suppressed around the ENZ frequency and substantially enhanced otherwise, yielding an effective energy bandgap for radiation. This suppression is effectively invariant with respect to the particle size and is therefore an intrinsic property of the ENZ material. Our concept also heralds an alternative pathway to quench the emission from a QE, which may find potential application in quantum information storage.

preprint2021arXiv

Engineering Casimir interactions with epsilon-near-zero materials

In this paper we theoretically demonstrate the tunability of the Casimir force both in sign and magnitude between parallel plates coated with dispersive materials. We show that this force, existing between uncharged plates, can be tuned by carefully choosing the value of the plasma frequency (i.e., the epsilon-near-zero frequency) of the coating in the neighborhood of the resonance frequency of the cavity. The coating layer enables a continuous variation of the force between four limiting values when a coating is placed on each plate. We explore the consequences of such variation when pairs of electric and magnetic conductors (i.e. low and high impedance surfaces) are used as substrates on either side, showing that this continuous variation results in changes in the sign of the force, leading to both stable and unstable conditions, which could find interesting potential applications in nanomechanics including nanoparticle tweezing.

preprint2020arXiv

A Large Scale Urban Surveillance Video Dataset for Multiple-Object Tracking and Behavior Analysis

Multiple-object tracking and behavior analysis have been the essential parts of surveillance video analysis for public security and urban management. With billions of surveillance video captured all over the world, multiple-object tracking and behavior analysis by manual labor are cumbersome and cost expensive. Due to the rapid development of deep learning algorithms in recent years, automatic object tracking and behavior analysis put forward an urgent demand on a large scale well-annotated surveillance video dataset that can reflect the diverse, congested, and complicated scenarios in real applications. This paper introduces an urban surveillance video dataset (USVD) which is by far the largest and most comprehensive. The dataset consists of 16 scenes captured in 7 typical outdoor scenarios: street, crossroads, hospital entrance, school gate, park, pedestrian mall, and public square. Over 200k video frames are annotated carefully, resulting in more than 3:7 million object bounding boxes and about 7:1 thousand trajectories. We further use this dataset to evaluate the performance of typical algorithms for multiple-object tracking and anomaly behavior analysis and explore the robustness of these methods in urban congested scenarios.

preprint2020arXiv

A study of resting-state EEG biomarkers for depression recognition

Background: Depression has become a major health burden worldwide, and effective detection depression is a great public-health challenge. This Electroencephalography (EEG)-based research is to explore the effective biomarkers for depression recognition. Methods: Resting state EEG data was collected from 24 major depressive patients (MDD) and 29 normal controls using 128 channel HydroCel Geodesic Sensor Net (HCGSN). To better identify depression, we extracted different types of EEG features including linear features, nonlinear features and functional connectivity features phase lagging index (PLI) to comprehensively analyze the EEG signals in patients with MDD. And using different feature selection methods and classifiers to evaluate the optimal feature sets. Results: Functional connectivity feature PLI is superior to the linear features and nonlinear features. And when combining all the types of features to classify MDD patients, we can obtain the highest classification accuracy 82.31% using ReliefF feature selection method and logistic regression (LR) classifier. Analyzing the distribution of optimal feature set, it was found that intrahemispheric connection edges of PLI were much more than the interhemispheric connection edges, and the intrahemispheric connection edges had a significant differences between two groups. Conclusion: Functional connectivity feature PLI plays an important role in depression recognition. Especially, intrahemispheric connection edges of PLI might be an effective biomarker to identify depression. And statistic results suggested that MDD patients might exist functional dysfunction in left hemisphere.

preprint2020arXiv

Side-Aware Boundary Localization for More Precise Object Detection

Current object detection frameworks mainly rely on bounding box regression to localize objects. Despite the remarkable progress in recent years, the precision of bounding box regression remains unsatisfactory, hence limiting performance in object detection. We observe that precise localization requires careful placement of each side of the bounding box. However, the mainstream approach, which focuses on predicting centers and sizes, is not the most effective way to accomplish this task, especially when there exists displacements with large variance between the anchors and the targets. In this paper, we propose an alternative approach, named as Side-Aware Boundary Localization (SABL), where each side of the bounding box is respectively localized with a dedicated network branch. To tackle the difficulty of precise localization in the presence of displacements with large variance, we further propose a two-step localization scheme, which first predicts a range of movement through bucket prediction and then pinpoints the precise position within the predicted bucket. We test the proposed method on both two-stage and single-stage detection frameworks. Replacing the standard bounding box regression branch with the proposed design leads to significant improvements on Faster R-CNN, RetinaNet, and Cascade R-CNN, by 3.0%, 1.7%, and 0.9%, respectively. Code is available at https://github.com/open-mmlab/mmdetection.

preprint2014arXiv

The Shockley-Queisser limit for nanostructured solar cells

The Shockley-Queisser limit describes the maximum solar energy conversion efficiency achievable for a particular material and is the standard by which new photovoltaic technologies are compared. This limit is based on the principle of detailed balance, which equates the photon flux into a device to the particle flux (photons or electrons) out of that device. Nanostructured solar cells represent a new class of photovoltaic devices, and questions have been raised about whether or not they can exceed the Shockley-Queisser limit. Here we show that single-junction nanostructured solar cells have a theoretical maximum efficiency of 42% under AM 1.5 solar illumination. While this exceeds the efficiency of a non- concentrating planar device, it does not exceed the Shockley-Queisser limit for a planar device with optical concentration. We conclude that nanostructured solar cells offer an important route towards higher efficiency photovoltaic devices through a built-in optical concentration.

preprint2010arXiv

Modeling the emergence of universality in color naming patterns

The empirical evidence that human color categorization exhibits some universal patterns beyond superficial discrepancies across different cultures is a major breakthrough in cognitive science. As observed in the World Color Survey (WCS), indeed, any two groups of individuals develop quite different categorization patterns, but some universal properties can be identified by a statistical analysis over a large number of populations. Here, we reproduce the WCS in a numerical model in which different populations develop independently their own categorization systems by playing elementary language games. We find that a simple perceptual constraint shared by all humans, namely the human Just Noticeable Difference (JND), is sufficient to trigger the emergence of universal patterns that unconstrained cultural interaction fails to produce. We test the results of our experiment against real data by performing the same statistical analysis proposed to quantify the universal tendencies shown in the WCS [Kay P and Regier T. (2003) Proc. Natl. Acad. Sci. USA 100: 9085-9089], and obtain an excellent quantitative agreement. This work confirms that synthetic modeling has nowadays reached the maturity to contribute significantly to the ongoing debate in cognitive science.

Tao Gong

What is connected

Connect this record

See the researcher in context

Building this map preview

8 published item(s)

SAPL: Semantic-Agnostic Prompt Learning in CLIP for Weakly Supervised Image Manipulation Localization

Radiative energy bandgap of nanostructures coupled with quantum emitters around the epsilon-near-zero (ENZ) frequency

Engineering Casimir interactions with epsilon-near-zero materials

A Large Scale Urban Surveillance Video Dataset for Multiple-Object Tracking and Behavior Analysis

A study of resting-state EEG biomarkers for depression recognition

Side-Aware Boundary Localization for More Precise Object Detection

The Shockley-Queisser limit for nanostructured solar cells

Modeling the emergence of universality in color naming patterns