Researcher profile

Qi Feng

Qi Feng contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
17works
0followers
14topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

17 published item(s)

preprint2026arXiv

An Analytic Solution to the Optimal Spherical Dubins Path Problem with Geodesic Curvature Constraints

Computing shortest paths for curvature-constrained Dubins vehicles on the unit sphere is fundamental to many engineering applications, including long-range flight planning, persistent surveillance patterns, and global routing problems where great circles are natural routes. Numerical optimization methods on $\SO(3)$ suffer from sensitivity to initialization, may converge to local minima, and often miss feasible solution branches. This paper proposes a unified analytic computational approach for spherical Dubins CGC and CCC paths that overcomes these limitations. By exploiting the axis-fixing property of rotations and developing a closed-form back-substitution method using geometric projection, the three-dimensional boundary value problem is reduced to solving a quadratic polynomial equation. The proposed analytic solver achieves machine precision accuracy with errors on the order of $10^{-16}$, is approximately $717$ times faster than numerical methods under the same computational environment, and systematically enumerates all feasible solution branches without requiring exhaustive multi-start initialization. The method provides closed-form solutions for optimal path computation in the regime where turning radius $\Rturn \in (0, 1/2]$, corresponding to $U_{\max} \geq \sqrt{3}$.

preprint2026arXiv

Noise estimation of SDE from a single data trajectory

In this paper, we propose a data-driven framework for model discovery of stochastic differential equations (SDEs) from a single trajectory, without requiring the ergodicity or stationary assumption on the underlying continuous process. By combining (stochastic) Taylor expansions with Girsanov transformations, and using the drift function's initial value as input, we construct drift estimators while simultaneously recovering the model noise. This allows us to recover the underlying $\mathbb P$ Brownian motion increments. Building on these estimators, we introduce the first stochastic Sparse Identification of Stochastic Differential Equation (SSISDE) algorithm, capable of identifying the governing SDE dynamics from a single observed trajectory without requiring ergodicity or stationarity. To validate the proposed approach, we conduct numerical experiments with both linear and quadratic drift-diffusion functions. Among these, the Black-Scholes SDE is included as a representative case of a system that does not satisfy ergodicity or stationarity.

preprint2024arXiv

Reflected Schrödinger Bridge for Constrained Generative Modeling

Diffusion models have become the go-to method for large-scale generative models in real-world applications. These applications often involve data distributions confined within bounded domains, typically requiring ad-hoc thresholding techniques for boundary enforcement. Reflected diffusion models (Lou23) aim to enhance generalizability by generating the data distribution through a backward process governed by reflected Brownian motion. However, reflected diffusion models may not easily adapt to diverse domains without the derivation of proper diffeomorphic mappings and do not guarantee optimal transport properties. To overcome these limitations, we introduce the Reflected Schrodinger Bridge algorithm: an entropy-regularized optimal transport approach tailored for generating data within diverse bounded domains. We derive elegant reflected forward-backward stochastic differential equations with Neumann and Robin boundary conditions, extend divergence-based likelihood training to bounded domains, and explore natural connections to entropic optimal transport for the study of approximate linear convergence - a valuable insight for practical training. Our algorithm yields robust generative modeling in diverse domains, and its scalability is demonstrated in real-world constrained generative modeling through standard image benchmarks.

preprint2023arXiv

Rethinking the Data Annotation Process for Multi-view 3D Pose Estimation with Active Learning and Self-Training

Pose estimation of the human body and hands is a fundamental problem in computer vision, and learning-based solutions require a large amount of annotated data. In this work, we improve the efficiency of the data annotation process for 3D pose estimation problems with Active Learning (AL) in a multi-view setting. AL selects examples with the highest value to annotate under limited annotation budgets (time and cost), but choosing the selection strategy is often nontrivial. We present a framework to efficiently extend existing single-view AL strategies. We then propose two novel AL strategies that make full use of multi-view geometry. Moreover, we demonstrate additional performance gains by incorporating pseudo-labels computed during the AL process, which is a form of self-training. Our system significantly outperforms simulated annotation baselines in 3D body and hand pose estimation on two large-scale benchmarks: CMU Panoptic Studio and InterHand2.6M. Notably, on CMU Panoptic Studio, we are able to reduce the turn-around time by 60% and annotation cost by 80% when compared to the conventional annotation process.

preprint2022arXiv

360 Depth Estimation in the Wild -- The Depth360 Dataset and the SegFuse Network

Single-view depth estimation from omnidirectional images has gained popularity with its wide range of applications such as autonomous driving and scene reconstruction. Although data-driven learning-based methods demonstrate significant potential in this field, scarce training data and ineffective 360 estimation algorithms are still two key limitations hindering accurate estimation across diverse domains. In this work, we first establish a large-scale dataset with varied settings called Depth360 to tackle the training data problem. This is achieved by exploring the use of a plenteous source of data, 360 videos from the internet, using a test-time training method that leverages unique information in each omnidirectional sequence. With novel geometric and temporal constraints, our method generates consistent and convincing depth samples to facilitate single-view estimation. We then propose an end-to-end two-branch multi-task learning network, SegFuse, that mimics the human eye to effectively learn from the dataset and estimate high-quality depth maps from diverse monocular RGB images. With a peripheral branch that uses equirectangular projection for depth estimation and a foveal branch that uses cubemap projection for semantic segmentation, our method predicts consistent global depth while maintaining sharp details at local regions. Experimental results show favorable performance against the state-of-the-art methods.

preprint2022arXiv

Deep Signature FBSDE Algorithm

We propose a deep signature/log-signature FBSDE algorithm to solve forward-backward stochastic differential equations (FBSDEs) with state and path dependent features. By incorporating the deep signature/log-signature transformation into the recurrent neural network (RNN) model, our algorithm shortens the training time, improves the accuracy, and extends the time horizon comparing to methods in the existing literature. Moreover, our algorithms can be applied to a wide range of applications such as state and path dependent option pricing involving high-frequency data, model ambiguity, and stochastic games, which are linked to parabolic partial differential equations (PDEs), and path-dependent PDEs (PPDEs). Lastly, we also derive the convergence analysis of the deep signature/log-signature FBSDE algorithm.

preprint2022arXiv

Design and Performance of the Prototype Schwarzschild-Couder Telescope Camera

The prototype Schwarzschild-Couder Telescope (pSCT) is a candidate for a medium-sized telescope in the Cherenkov Telescope Array. The pSCT is based on a novel dual mirror optics design which reduces the plate scale and allows for the use of silicon photomultipliers as photodetectors. The prototype pSCT camera currently has only the central sector instrumented with 25 camera modules (1600 pixels), providing a 2.68$^{\circ}$ field of view (FoV). The camera electronics are based on custom TARGET (TeV array readout with GSa/s sampling and event trigger) application specific integrated circuits. Field programmable gate arrays sample incoming signals at a gigasample per second. A single backplane provides camera-wide triggers. An upgrade of the pSCT camera is in progress, which will fully populate the focal plane. This will increase the number of pixels to 11,328, the number of backplanes to 9, and the FoV to 8.04$^{\circ}$. Here we give a detailed description of the pSCT camera, including the basic concept, mechanical design, detectors, electronics, current status and first light.

preprint2022arXiv

Efficient CNN Architecture Design Guided by Visualization

Modern efficient Convolutional Neural Networks(CNNs) always use Depthwise Separable Convolutions(DSCs) and Neural Architecture Search(NAS) to reduce the number of parameters and the computational complexity. But some inherent characteristics of networks are overlooked. Inspired by visualizing feature maps and N$\times$N(N$>$1) convolution kernels, several guidelines are introduced in this paper to further improve parameter efficiency and inference speed. Based on these guidelines, our parameter-efficient CNN architecture, called \textit{VGNetG}, achieves better accuracy and lower latency than previous networks with about 30%$\thicksim$50% parameters reduction. Our VGNetG-1.0MP achieves 67.7% top-1 accuracy with 0.99M parameters and 69.2% top-1 accuracy with 1.14M parameters on ImageNet classification dataset. Furthermore, we demonstrate that edge detectors can replace learnable depthwise convolution layers to mix features by replacing the N$\times$N kernels with fixed edge detection kernels. And our VGNetF-1.5MP archives 64.4%(-3.2%) top-1 accuracy and 66.2%(-1.4%) top-1 accuracy with additional Gaussian kernels.

preprint2022arXiv

Giant second-order nonlinearity in twisted bilayer graphene

In the second-order response regime, the Hall voltage can be nonzero without breaking the time-reversal symmetry, as long as the system is noncentrosymmetric. There are multiple mechanisms with different scaling rules that contribute to the nonlinear Hall effect (NLHE). The intrinsic contribution is closely related to the Berry curvature dipole and has been extensively investigated recently. The study of the extrinsic contribution, however, is scarce, although it can enter the NLHE even in the leading order. Here, we report a giant nonlinear transport response in TBG, in which the intrinsic mechanism is forbidden. The magnitude and direction of the second-order nonlinearity can be effectively tuned by the gate voltage. The peak value of the second-order Hall conductivity close to the full filling of the moiré band reaches 8.76 $μmSV^{-1}$, four-order larger than those detected in $WTe_2$. The observed giant second-order nonlinearity can be understood from the collaboration of the asymmetric scattering of electrons off the static (Coulomb impurities) and dynamic disorders (phonons) in noncentrosymmetric crystals. It is mainly determined by the skew-scattering contribution from impurities at 1.7 K. The skew-scattering from phonons has a much larger coupling coefficient as suggested by the scaling results, and becomes as important as the impurity contribution as the temperature rises. Our observations demonstrate the potential of TBG in studying nonlinear response and possible rectification applications.

preprint2022arXiv

Industrial Scene Text Detection with Refined Feature-attentive Network

Detecting the marking characters of industrial metal parts remains challenging due to low visual contrast, uneven illumination, corroded character structures, and cluttered background of metal part images. Affected by these factors, bounding boxes generated by most existing methods locate low-contrast text areas inaccurately. In this paper, we propose a refined feature-attentive network (RFN) to solve the inaccurate localization problem. Specifically, we design a parallel feature integration mechanism to construct an adaptive feature representation from multi-resolution features, which enhances the perception of multi-scale texts at each scale-specific level to generate a high-quality attention map. Then, an attentive refinement network is developed by the attention map to rectify the location deviation of candidate boxes. In addition, a re-scoring mechanism is designed to select text boxes with the best rectified location. Moreover, we construct two industrial scene text datasets, including a total of 102156 images and 1948809 text instances with various character structures and metal parts. Extensive experiments on our dataset and four public datasets demonstrate that our proposed method achieves the state-of-the-art performance.

preprint2022arXiv

The 6th AI City Challenge

The 6th edition of the AI City Challenge specifically focuses on problems in two domains where there is tremendous unlocked potential at the intersection of computer vision and artificial intelligence: Intelligent Traffic Systems (ITS), and brick and mortar retail businesses. The four challenge tracks of the 2022 AI City Challenge received participation requests from 254 teams across 27 countries. Track 1 addressed city-scale multi-target multi-camera (MTMC) vehicle tracking. Track 2 addressed natural-language-based vehicle track retrieval. Track 3 was a brand new track for naturalistic driving analysis, where the data were captured by several cameras mounted inside the vehicle focusing on driver safety, and the task was to classify driver actions. Track 4 was another new track aiming to achieve retail store automated checkout using only a single view camera. We released two leader boards for submissions based on different methods, including a public leader board for the contest, where no use of external data is allowed, and a general leader board for all submitted results. The top performance of participating teams established strong baselines and even outperformed the state-of-the-art in the proposed challenge tracks.

preprint2022arXiv

Tuning the competition between superconductivity and charge order in kagome superconductor Cs(V1-xNbx)3Sb5

The recently discovered coexistence of superconductivity and charge density wave order in the kagome systems AV3Sb5 (A = K, Rb, Cs) has stimulated enormous interest. According to theory, a vanadium-based kagome system may host a flat band, nontrivial linear dispersive Dirac surface states and electronic correlation. Despite intensive investigations, it remains controversial about the origin of the charge density wave (CDW) order, how does the superconductivity relate to the CDW, and whether the anomalous Hall effect (AHE) arises primarily from the kagome lattice or the CDW order. We report an extensive investigation on Cs(V1-xNbx)3Sb5 samples with systematic Nb doping. Our results show that the Nb doping induces apparent suppression of CDW order and promotes superconductivity; meanwhile, the AHE and magnetoresistance (MR) will be significantly weakened together with the CDW order. Combining with our density functional calculations, we interpret these effects by an antiphase shift of the Fermi energy with respect to the saddle points near M and the Fermi surface centered around Γ. It is found that the former depletes the filled states for the CDW instability and worsens the nesting condition for CDW order; while the latter lifts the Fermi level upward and enlarges the Fermi surface surrounding the Γ point, and thus promotes superconductivity. Our results uncover a delicate but unusual competition between the CDW order and superconductivity.

preprint2021arXiv

Dynamic Curves for Decentralized Autonomous Cryptocurrency Exchanges

One of the exciting recent developments in decentralized finance (DeFi) has been the development of decentralized cryptocurrency exchanges that can autonomously handle conversion between different cryptocurrencies. Decentralized exchange protocols such as Uniswap, Curve and other types of Automated Market Makers (AMMs) maintain a liquidity pool (LP) of two or more assets constrained to maintain at all times a mathematical relation to each other, defined by a given function or curve. Examples of such functions are the constant-sum and constant-product AMMs. Existing systems however suffer from several challenges. They require external arbitrageurs to restore the price of tokens in the pool to match the market price. Such activities can potentially drain resources from the liquidity pool. In particular, dramatic market price changes can result in low liquidity with respect to one or more of the assets and reduce the total value of the LP. We propose in this work a new approach to constructing the AMM by proposing the idea of dynamic curves. It utilizes input from a market price oracle to modify the mathematical relationship between the assets so that the pool price continuously and automatically adjusts to be identical to the market price. This approach eliminates arbitrage opportunities and, as we show through simulations, maintains liquidity in the LP for all assets and the total value of the LP over a wide range of market prices.

preprint2020arXiv

Investigating a Deep Learning Method to Analyze Images from Multiple Gamma-ray Telescopes

Imaging atmospheric Cherenkov telescope (IACT) arrays record images from air showers initiated by gamma rays entering the atmosphere, allowing astrophysical sources to be observed at very high energies. To maximize IACT sensitivity, gamma-ray showers must be efficiently distinguished from the dominant background of cosmic-ray showers using images from multiple telescopes. A combination of convolutional neural networks (CNNs) with a recurrent neural network (RNN) has been proposed to perform this task. Using CTLearn, an open source Python package using deep learning to analyze data from IACTs, with simulated data from the upcoming Cherenkov Telescope Array (CTA), we implement a CNN-RNN network and find no evidence that sorting telescope images by total amplitude improves background rejection performance.

preprint2020arXiv

Learning to Separate: Detecting Heavily-Occluded Objects in Urban Scenes

While visual object detection with deep learning has received much attention in the past decade, cases when heavy intra-class occlusions occur have not been studied thoroughly. In this work, we propose a Non-Maximum-Suppression (NMS) algorithm that dramatically improves the detection recall while maintaining high precision in scenes with heavy occlusions. Our NMS algorithm is derived from a novel embedding mechanism, in which the semantic and geometric features of the detected boxes are jointly exploited. The embedding makes it possible to determine whether two heavily-overlapping boxes belong to the same object in the physical world. Our approach is particularly useful for car detection and pedestrian detection in urban scenes where occlusions often happen. We show the effectiveness of our approach by creating a model called SG-Det (short for Semantics and Geometry Detection) and testing SG-Det on two widely-adopted datasets, KITTI and CityPersons for which it achieves state-of-the-art performance.

preprint2020arXiv

Multi-wavelength observations of 2HWC J1928+177: dark accelerator or new TeV gamma-ray binary?

2HWC J1928+177 is a Galactic TeV gamma-ray source detected by the High Altitude Water Cherenkov (HAWC) Observatory up to ~ 56 TeV. The HAWC source, later confirmed by H.E.S.S., still remains unidentified as a dark accelerator since there is no apparent supernova remnant or pulsar wind nebula detected in the lower energy bands. The radio pulsar PSR J1928+1746, coinciding with the HAWC source position, has no X-ray counterpart. Our SED modeling shows that inverse Compton scattering in the putative pulsar wind nebula can account for the TeV emission only if the unseen nebula is extended beyond r ~ 4 [arcmin]. Alternatively, TeV gamma rays may be produced by hadronic interactions between relativistic protons from an undetected supernova remnant associated with the radio pulsar and a nearby molecular cloud G52.9+0.1. NuSTAR and Chandra observations detected a variable X-ray point source within the HAWC error circle, potentially associated with a bright IR source. The X-ray spectra can be fitted with an absorbed power-law model with $N_{\rm H} = (9\pm3)\times10^{22}$ cm$^{-2}$ and $Γ_X = 1.6\pm0.3$ and exhibit long-term X-ray flux variability over the last decade. If the X-ray source, possibly associated with the IR source (likely an O star), is the counterpart of the HAWC source, it may be a new TeV gamma-ray binary powered by collisions between the pulsar wind and stellar wind. Follow-up X-ray observations are warranted to search for diffuse X-ray emission and determine the nature of the HAWC source.

preprint2020arXiv

Taylor expansions and Castell estimates for solutions of stochastic differential equations driven by rough paths

We study the Taylor expansion for the solutions of differential equations driven by $p$-rough paths with $p>2$. We prove a general theorem concerning the convergence of the Taylor expansion on a nonempty interval provided that the vector fields are analytic on a ball centered at the initial point. We also derive criteria that enable us to study the rate of convergence of the Taylor expansion. Finally and this is also the main and the most original part of this paper, we prove Castell expansions and tail estimates with exponential decays for the remainder terms of the solutions of the stochastic differential equations driven by continuous centered Gaussian process with finite $2D~ρ-$variation and fractional Brownian motion with Hurst parameter $H>1/4$.