Researcher profile

Bingqing Zhang

Bingqing Zhang contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 17 - UnverifiedVerification L1Unclaimed author
4works
0followers
6topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

4 published item(s)

preprint2026arXiv

ReCoVR: Closing the Loop in Interactive Composed Video Retrieval

Composed video retrieval (CoVR) searches for target videos using a reference video and a modification text, but existing methods are restricted to a single interaction round and cannot support the progressive nature of real-world visual search. To bridge this gap, we first formalize interactive composed video retrieval, a multi-turn extension of CoVR, where users progressively refine their search intent through natural-language feedback across turns. Adapting existing interactive retrieval methods to this setting reveals two structural weaknesses: reliance on a single retrieval channel and an open-loop retrieval design that consumes user feedback but does not diagnose whether its own retrieval trajectory is drifting or stagnating. To address these limitations, we propose ReCoVR (Reflexive Composed Video Retrieval), a dual-pathway architecture built on reflexive perception, where the system treats its retrieval history as diagnostic evidence alongside user feedback. Specifically, an Intent Pathway routes heterogeneous feedback to complementary retrieval channels, while a Reflection Pathway performs trajectory-level reflection to monitor result evolution and correct retrieval errors across turns. Experiments on multiple benchmarks show that ReCoVR consistently outperforms interactive baselines, notably achieving 74.30% R@1 after just one interactive round on the WebVid-CoVR-Test dataset.

preprint2022arXiv

InvisibiliTee: Angle-agnostic Cloaking from Person-Tracking Systems with a Tee

After a survey for person-tracking system-induced privacy concerns, we propose a black-box adversarial attack method on state-of-the-art human detection models called InvisibiliTee. The method learns printable adversarial patterns for T-shirts that cloak wearers in the physical world in front of person-tracking systems. We design an angle-agnostic learning scheme which utilizes segmentation of the fashion dataset and a geometric warping process so the adversarial patterns generated are effective in fooling person detectors from all camera angles and for unseen black-box detection models. Empirical results in both digital and physical environments show that with the InvisibiliTee on, person-tracking systems' ability to detect the wearer drops significantly.

preprint2022arXiv

Optical Observations of the Nearby Type Ia Supernova 2021hpr

We present the optical photometric and spectroscopic observations of the nearby Type Ia supernova (SN) 2021hpr. The observations covered the phase of $-$14.37 to +63.68 days relative to its maximum luminosity in the $B$ band. The evolution of multiband light/color curves of SN 2021hpr is similar to that of normal Type Ia supernovae (SNe Ia) with the exception of some phases, especially a plateau phase that appeared in the $V-R$ color curve before peak luminosity, which resembles that of SN 2017cbv. The first spectrum we observed at t $\sim -$14.4 days shows a higher velocity for the Si II $λ$6355 feature ($\sim$ 21,000 km s$^{-1}$) than that of other normal Velocity (NV) SNe Ia at the same phase. Based on the Si II $λ$6355 velocity of $\sim$ 12,420 km s$^{-1}$ around the maximum light, we deduce that SN 2021hpr is a transitional object between high velocity (HV) and NV SNe Ia. Meanwhile, the Si II $λ$6355 feature shows a high velocity gradient (HVG) of about 800 km s$^{-1}$ day$^{-1}$ from roughly $-$14.37 to $-$4.31 days relative to the $B$-band maximum, which indicates that SN 2021hpr can also be classified as an HVG SN Ia. The evolution of SN 2021hpr is similar to that of SN 2011fe. Including SN 2021hpr, there have been six supernovae observed in the host galaxy NGC 3147, and the supernovae explosion rate in the last 50 yr is slightly higher for SNe Ia, while lower for SNe Ibc and SNe II it is lower than expected rate from the radio data. Inspecting the spectra, we find that SN 2021hpr has a metal-rich (12 + log(O/H) $\approx$ 8.648) circumstellar environment, where HV SNe tend to reside. Based on the decline rate of SN 2021hpr in the $B$ band, we determine the distance modulus of the host galaxy NGC 3147 using the Phillips relation to be 33.46 $\pm$ 0.21 mag, which is close to that found by previous works.

preprint2022arXiv

STAR-GNN: Spatial-Temporal Video Representation for Content-based Retrieval

We propose a video feature representation learning framework called STAR-GNN, which applies a pluggable graph neural network component on a multi-scale lattice feature graph. The essence of STAR-GNN is to exploit both the temporal dynamics and spatial contents as well as visual connections between regions at different scales in the frames. It models a video with a lattice feature graph in which the nodes represent regions of different granularity, with weighted edges that represent the spatial and temporal links. The contextual nodes are aggregated simultaneously by graph neural networks with parameters trained with retrieval triplet loss. In the experiments, we show that STAR-GNN effectively implements a dynamic attention mechanism on video frame sequences, resulting in the emphasis for dynamic and semantically rich content in the video, and is robust to noise and redundancies. Empirical results show that STAR-GNN achieves state-of-the-art performance for Content-Based Video Retrieval.