Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
19works
0followers
16topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

19 published item(s)

preprint2026arXiv

SVII-3D: Advancing Roadside Infrastructure Inventory with Decimeter-level 3D Localization and Comprehension from Sparse Street Imagery

The automated creation of digital twins and precise asset inventories is a critical task in smart city construction and facility lifecycle management. However, utilizing cost-effective sparse imagery remains challenging due to limited robustness, inaccurate localization, and a lack of fine-grained state understanding. To address these limitations, SVII-3D, a unified framework for holistic asset digitization, is proposed. First, LoRA fine-tuned open-set detection is fused with a spatial-attention matching network to robustly associate observations across sparse views. Second, a geometry-guided refinement mechanism is introduced to resolve structural errors, achieving precise decimeter-level 3D localization. Third, transcending static geometric mapping, a Vision-Language Model agent leveraging multi-modal prompting is incorporated to automatically diagnose fine-grained operational states. Experiments demonstrate that SVII-3D significantly improves identification accuracy and minimizes localization errors. Consequently, this framework offers a scalable, cost-effective solution for high-fidelity infrastructure digitization, effectively bridging the gap between sparse perception and automated intelligent maintenance.

preprint2026arXiv

Unleashing the Capabilities of Large Vision-Language Models for Intelligent Perception of Roadside Infrastructure

Automated perception of urban roadside infrastructure is crucial for smart city management, yet general-purpose models often struggle to capture the necessary fine-grained attributes and domain rules. While Large Vision Language Models (VLMs) excel at open-world recognition, they often struggle to accurately interpret complex facility states in compliance with engineering standards, leading to unreliable performance in real-world applications. To address this, we propose a domain-adapted framework that transforms VLMs into specialized agents for intelligent infrastructure analysis. Our approach integrates a data-efficient fine-tuning strategy with a knowledge-grounded reasoning mechanism. Specifically, we leverage open-vocabulary fine-tuning on Grounding DINO to robustly localize diverse assets with minimal supervision, followed by LoRA-based adaptation on Qwen-VL for deep semantic attribute reasoning. To mitigate hallucinations and enforce professional compliance, we introduce a dual-modality Retrieval-Augmented Generation (RAG) module that dynamically retrieves authoritative industry standards and visual exemplars during inference. Evaluated on a comprehensive new dataset of urban roadside scenes, our framework achieves a detection performance of 58.9 mAP and an attribute recognition accuracy of 95.5%, demonstrating a robust solution for intelligent infrastructure monitoring.

preprint2023arXiv

A Càdlàg Rough Path Foundation for Robust Finance

Using rough path theory, we provide a pathwise foundation for stochastic Itô integration, which covers most commonly applied trading strategies and mathematical models of financial markets, including those under Knightian uncertainty. To this end, we introduce the so-called Property (RIE) for càdlàg paths, which is shown to imply the existence of a càdlàg rough path and of quadratic variation in the sense of Föllmer. We prove that the corresponding rough integrals exist as limits of left-point Riemann sums along a suitable sequence of partitions. This allows one to treat integrands of non-gradient type, and gives access to the powerful stability estimates of rough path theory. Additionally, we verify that (path-dependent) functionally generated trading strategies and Cover's universal portfolio are admissible integrands, and that Property (RIE) is satisfied by both (Young) semimartingales and typical price paths.

preprint2023arXiv

Summative Student Course Review Tool Based on Machine Learning Sentiment Analysis to Enhance Life Science Feedback Efficacy

Machine learning enables the development of new, supplemental, and empowering tools that can either expand existing technologies or invent new ones. In education, space exists for a tool that supports generic student course review formats to organize and recapitulate students' views on the pedagogical practices to which they are exposed. Often, student opinions are gathered with a general comment section that solicits their feelings towards their courses without polling specifics about course contents. Herein, we show a novel approach to summarizing and organizing students' opinions via analyzing their sentiment towards a course as a function of the language/vocabulary used to convey their opinions about a class and its contents. This analysis is derived from their responses to a general comment section encountered at the end of post-course review surveys. This analysis, accomplished with Python, LaTeX, and Google's Natural Language API, allows for the conversion of unstructured text data into both general and topic-specific sub-reports that convey students' views in a unique, novel way.

preprint2022arXiv

Doubly Robust Crowdsourcing

Large-scale labeled dataset is the indispensable fuel that ignites the AI revolution as we see today. Most such datasets are constructed using crowdsourcing services such as Amazon Mechanical Turk which provides noisy labels from non-experts at a fair price. The sheer size of such datasets mandates that it is only feasible to collect a few labels per data point. We formulate the problem of test-time label aggregation as a statistical estimation problem of inferring the expected voting score. By imitating workers with supervised learners and using them in a doubly robust estimation framework, we prove that the variance of estimation can be substantially reduced, even if the learner is a poor approximation. Synthetic and real-world experiments show that by combining the doubly robust approach with adaptive worker/item selection rules, we often need much lower label cost to achieve nearly the same accuracy as in the ideal world where all workers label all data points.

preprint2022arXiv

Graph Convolution for Re-ranking in Person Re-identification

Nowadays, deep learning is widely applied to extract features for similarity computation in person re-identification (re-ID) and have achieved great success. However, due to the non-overlapping between training and testing IDs, the difference between the data used for model training and the testing data makes the performance of learned feature degraded during testing. Hence, re-ranking is proposed to mitigate this issue and various algorithms have been developed. However, most of existing re-ranking methods focus on replacing the Euclidean distance with sophisticated distance metrics, which are not friendly to downstream tasks and hard to be used for fast retrieval of massive data in real applications. In this work, we propose a graph-based re-ranking method to improve learned features while still keeping Euclidean distance as the similarity metric. Inspired by graph convolution networks, we develop an operator to propagate features over an appropriate graph. Since graph is the essential key for the propagation, two important criteria are considered for designing the graph, and three different graphs are explored accordingly. Furthermore, a simple yet effective method is proposed to generate a profile vector for each tracklet in videos, which helps extend our method to video re-ID. Extensive experiments on three benchmark data sets, e.g., Market-1501, Duke, and MARS, demonstrate the effectiveness of our proposed approach.

preprint2022arXiv

Modulation instability and non-degenerate Akhmediev breathers of Manakov equations

We reveal a new class of \textit{non-degenerate} Akhmediev breather (AB) solutions of Manakov equations that only exist in the focusing case. Based on exact solutions, we present the existence diagram of such excitations on the frequency-wavenumber plane. Conventional single-frequency modulation instability leads to simultaneous excitation of three ABs with two of them being non-degenerate.

preprint2022arXiv

Non-degenerate Kuznetsov-Ma solitons of Manakov equations and their physical spectra

We study the dynamics of Kuznetsov-Ma solitons (KMS) in the framework of vector nonlinear Schrödinger (Manakov) equations. Exact multi-parameter family of solutions for such KMSs is derived. This family of solutions includes the known results as well as the previously unknown solutions in the form of the non-degenerate KMSs. We present the existence diagram of such KMSs that follows from the exact solutions. These non-degenerate KMSs are formed by nonlinear superposition of two fundamental KMSs that have the same propagation period but different eigenvalues. We present the amplitude profiles of new solutions, their exact physical spectra, their link to ordinary vector solitons and offer easy ways of their excitation using numerical simulations.

preprint2022arXiv

Non-degenerate multi-rogue waves and easy ways of their excitation

In multi-component systems, several rogue waves can be simultaneously excited using simple initial conditions in the form of a plane wave with a small amplitude single-peak perturbation. This is in drastic contrast with the case of multi-rogue waves of a single nonlinear Schrödinger equation (or other evolution equations) that require highly specific initial conditions to be used. This possibility arises due to the higher variety of rogue waves in multi-components systems each with individual eigenvalue of the inverse scattering technique. In theory, we expand the limited class of Peregrine-type solutions to a much larger family of non-degenerate rogue waves. The results of our work may explain the increased chances of appearance of rogue waves in crossing sea states (wind generated ocean gravity waves that form nonparallel wave systems along the water surface) as well as provide new possibilities of rogue wave observation in a wide range of multi-component physical systems such as multi-component Bose-Einstein condensates, multi-component plasmas and in birefringent optical fibres.

preprint2022arXiv

Realistic simulation of reflection high-energy electron diffraction patterns for two-dimensional lattices using Ewald construction

Reflection high-energy electron diffraction (RHEED) is a powerful tool for characterizing crystal surface structures. However, the setup geometry leads to distorted and complicated patterns, which are not straightforward to link to the real-space structures. A program with a graphical user interface is provided here to simulate the RHEED patterns. Following the Ewald construction in the kinematic theory, we find out the exact geometric transformation in this model that determines the positions of diffraction spots. The program can deal with many forms of surface structures, including surface reconstructions or domains. The simulations exhibit great agreement with the experimental results in various cases. This program will benefit the structure analysis in thin film growth and surface science studies.

preprint2022arXiv

Revisiting Model-Agnostic Private Learning: Faster Rates and Active Learning

The Private Aggregation of Teacher Ensembles (PATE) framework is one of the most promising recent approaches in differentially private learning. Existing theoretical analysis shows that PATE consistently learns any VC-classes in the realizable setting, but falls short in explaining its success in more general cases where the error rate of the optimal classifier is bounded away from zero. We fill in this gap by introducing the Tsybakov Noise Condition (TNC) and establish stronger and more interpretable learning bounds. These bounds provide new insights into when PATE works and improve over existing results even in the narrower realizable setting. We also investigate the compelling idea of using active learning for saving privacy budget, and empirical studies show the effectiveness of this new idea. The novel components in the proofs include a more refined analysis of the majority voting classifier - which could be of independent interest - and an observation that the synthetic "student" learning problem is nearly realizable by construction under the Tsybakov noise condition.

preprint2022arXiv

UFNRec: Utilizing False Negative Samples for Sequential Recommendation

Sequential recommendation models are primarily optimized to distinguish positive samples from negative ones during training in which negative sampling serves as an essential component in learning the evolving user preferences through historical records. Except for randomly sampling negative samples from a uniformly distributed subset, many delicate methods have been proposed to mine negative samples with high quality. However, due to the inherent randomness of negative sampling, false negative samples are inevitably collected in model training. Current strategies mainly focus on removing such false negative samples, which leads to overlooking potential user interests, lack of recommendation diversity, less model robustness, and suffering from exposure bias. To this end, we propose a novel method that can Utilize False Negative samples for sequential Recommendation (UFNRec) to improve model performance. We first devise a simple strategy to extract false negative samples and then transfer these samples to positive samples in the following training process. Furthermore, we construct a teacher model to provide soft labels for false negative samples and design a consistency loss to regularize the predictions of these samples from the student model and the teacher model. To the best of our knowledge, this is the first work to utilize false negative samples instead of simply removing them for the sequential recommendation. Experiments on three benchmark public datasets are conducted using three widely applied SOTA models. The experiment results demonstrate that our proposed UFNRec can effectively draw information from false negative samples and further improve the performance of SOTA models. The code is available at https://github.com/UFNRec-code/UFNRec.

preprint2021arXiv

The topological phase of bright solitons

We study the topological phase of bright soliton with arbitrary velocity under the self-steepening effect. Such topological phase can be described by the topological vector potential and effective magnetic field. We find that the point-like magnetic fields corresponds to the density peak of such bright solitons, where each elementary magnetic flux is π. Remarkably, we show that two bright solitons can generate an additional topological field due to the phase jump between them. Our research provided the possibility to use bright solitons to explore topological properties.

preprint2020arXiv

Controlling the electrical and magnetic ground states by doping in the complete phase diagram of titanate Eu1-xLaxTiO3 thin films

EuTiO3, a band insulator, and LaTiO3, a Mott insulator, are both antiferromagnetic with transition temperatures ~ 5.5 K and ~ 160 K, respectively. Here, we report the synthesis of Eu1-xLaxTiO3 thin films with x = 0 to 1 by oxide molecular beam epitaxy. The films in the full range have high crystalline quality and show no phase segregation, allowing us carry out transport measurements to study their electrical and magnetic properties. From x = 0.03 to 0.95, Eu1-xLaxTiO3 films show conduction by electrons as charge carriers, with differences in carrier densities and mobilities, contrary to the insulating nature of pure EuTiO3 and LaTiO3. Following a rich phase diagram, the magnetic ground states of the films vary with increasing La-doping level, changing Eu1-xLaxTiO3 from an antiferromagnetic insulator to an antiferromagnetic metal, a ferromagnetic metal, a paramagnetic metal, and back to an antiferromagnetic insulator. These emergent properties reflect the evolutions of the band structure, mainly at the Ti t2g bands near the Fermi level, when Eu2+ are gradually replaced by La3+. This work sheds light on this method for designing the electrical and magnetic properties in strongly-correlated oxides and completes the phase diagram of the titanate Eu1-xLaxTiO3.

preprint2020arXiv

Reconstruction Regularized Deep Metric Learning for Multi-label Image Classification

In this paper, we present a novel deep metric learning method to tackle the multi-label image classification problem. In order to better learn the correlations among images features, as well as labels, we attempt to explore a latent space, where images and labels are embedded via two unique deep neural networks, respectively. To capture the relationships between image features and labels, we aim to learn a \emph{two-way} deep distance metric over the embedding space from two different views, i.e., the distance between one image and its labels is not only smaller than those distances between the image and its labels' nearest neighbors, but also smaller than the distances between the labels and other images corresponding to the labels' nearest neighbors. Moreover, a reconstruction module for recovering correct labels is incorporated into the whole framework as a regularization term, such that the label embedding space is more representative. Our model can be trained in an end-to-end manner. Experimental results on publicly available image datasets corroborate the efficacy of our method compared with the state-of-the-arts.

preprint2020arXiv

The three-level coupled Maxwell-Bloch equations: rogue waves, semirational rogue waves and W-shaped solitons

In this paper the coupled Maxwell-Bloch equations which describe the propagation of two optical pulses in an optical medium with coherent three-level atoms are studied by Darboux transformation. The general nth-order rogue wave solution involving two different choices of multiple roots for the spectral characteristic equation and the multiparametric nth-order semirational solution are both obtained in terms of Schur polynomials. The explicit rogue wave solutions and semirational solutions from first to second order are provided. In contrast to the known Peregrine soliton, dark and four-petaled structures, some unusual patterns such as triple-hole, twisted-pair, composite four-petaled and composite dark rogue waves are put forward. Moreover, the interaction between dark-bright soliton and dark rogue wave and interaction between breather and dark rogue wave are shown. Further, the higher-order nonlinear superposition modes which feature triple and quadruple temporal-spatial distributions are presented. Finally, the state transition between rogue wave and W-shaped soliton is found where the modulation instability growth rate tends to zero under the low perturbation frequency. Particularly, the dark and double-peak W-shaped solitons are examined.

preprint2020arXiv

Tuning stoichiometry and its impact on superconductivity of monolayer and multilayer FeSe on SrTiO3

Synthesis of monolayer FeSe on SrTiO3, with greatly enhanced superconductivity compared to bulk FeSe, remains difficult. Lengthy annealing within a certain temperature window is always required to achieve superconducting samples as reported by different groups around the world, but the mechanism of annealing in inducing superconductivity has not been elucidated. We grow FeSe films on SrTiO3 by molecular beam epitaxy and adjust the stoichiometry by depositing additional small amounts of Fe atoms. The monolayer films become superconducting after the Fe deposition without any annealing, and show similar superconducting transition temperatures as those of the annealed films in transport measurements. We also demonstrate on the 5-unit-cell films that the FeSe multilayer can be reversibly tuned between the non-superconducting $\sqrt{5} \times \sqrt{5}$ phase with Fe-vacancies and superconducting $1 \times 1$ phase. Our results reveal that the traditional anneal process in essence removes Fe vacancies and the additional Fe deposition serves as a more efficient way to achieve superconductivity. This work highlights the significance of stoichiometry in the superconductivity of FeSe thin films and provides an easy path for superconducting samples.

preprint2020arXiv

Type-II Ising superconductivity and anomalous metallic state in macro-size ambient-stable ultrathin crystalline films

Recent emergence of two-dimensional (2D) crystalline superconductors has provided a promising platform to investigate novel quantum physics and potential applications. To reveal essential quantum phenomena therein, ultralow temperature transport investigation on high quality ultrathin superconducting films is critically required, although it has been quite challenging experimentally. Here we report a systematic transport study on the ultrathin crystalline PdTe2 films grown by molecular beam epitaxy (MBE). Interestingly, a new type of Ising superconductivity in 2D centrosymmetric materials is revealed by the detection of large in-plane critical field more than 7 times Pauli limit. Remarkably, in perpendicular magnetic field, we provide solid evidence of anomalous metallic state characterized by the resistance saturation at low temperatures with high quality filters. The robust superconductivity with intriguing quantum phenomena in the macro-size ambient-stable ultrathin PdTe2 films remains almost the same for 20 months, showing great potentials in electronic and spintronic applications.

preprint2020arXiv

Unity Style Transfer for Person Re-Identification

Style variation has been a major challenge for person re-identification, which aims to match the same pedestrians across different cameras. Existing works attempted to address this problem with camera-invariant descriptor subspace learning. However, there will be more image artifacts when the difference between the images taken by different cameras is larger. To solve this problem, we propose a UnityStyle adaption method, which can smooth the style disparities within the same camera and across different cameras. Specifically, we firstly create UnityGAN to learn the style changes between cameras, producing shape-stable style-unity images for each camera, which is called UnityStyle images. Meanwhile, we use UnityStyle images to eliminate style differences between different images, which makes a better match between query and gallery. Then, we apply the proposed method to Re-ID models, expecting to obtain more style-robust depth features for querying. We conduct extensive experiments on widely used benchmark datasets to evaluate the performance of the proposed framework, the results of which confirm the superiority of the proposed model.