Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
16works
0followers
15topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

16 published item(s)

preprint2026arXiv

RoLID-11K: A Dashcam Dataset for Small-Object Roadside Litter Detection

Roadside litter poses environmental, safety and economic challenges, yet current monitoring relies on labour-intensive surveys and public reporting, providing limited spatial coverage. Existing vision datasets for litter detection focus on street-level still images, aerial scenes or aquatic environments, and do not reflect the unique characteristics of dashcam footage, where litter appears extremely small, sparse and embedded in cluttered road-verge backgrounds. We introduce RoLID-11K, the first large-scale dataset for roadside litter detection from dashcams, comprising over 11k annotated images spanning diverse UK driving conditions and exhibiting pronounced long-tail and small-object distributions. We benchmark a broad spectrum of modern detectors, from accuracy-oriented transformer architectures to real-time YOLO models, and analyse their strengths and limitations on this challenging task. Our results show that while CO-DETR and related transformers achieve the best localisation accuracy, real-time models remain constrained by coarse feature hierarchies. RoLID-11K establishes a challenging benchmark for extreme small-object detection in dynamic driving scenes and aims to support the development of scalable, low-cost systems for roadside-litter monitoring. The dataset is available at https://github.com/xq141839/RoLID-11K.

preprint2022arXiv

A new class of bilayer kagome lattice compounds with Dirac nodal lines and pressure-induced superconductivity

Kagome lattice composed of transition-metal ions provides a great opportunity to explore the intertwining between geometry, electronic orders and band topology. The discovery of multiple competing orders that connect intimately with the underlying topological band structure in nonmagnetic kagome metals $A$V$_3$Sb$_5$ ($A$ = K, Rb, Cs) further pushes this topic to the quantum frontier. Here we report the discovery and characterization of a new class of vanadium-based compounds with kagome bilayers, namely $A$V$_6$Sb$_6$ ($A$ = K, Rb, Cs) and V$_6$Sb$_4$, which, together with $A$V$_3$Sb$_5$, compose a series of kagome compounds with a generic chemical formula ($A_{m-1}$Sb$_{2m}$)(V$_3$Sb)$_n$ (m = 1, 2; n = 1, 2). Theoretical calculations combined with angle-resolved photoemission measurements reveal that these compounds feature Dirac nodal lines in close vicinity to the Fermi level. Pressure-induced superconductivity in $A$V$_6$Sb$_6$ further suggests promising emergent phenomena in these materials. The establishment of a new family of layered kagome materials paves the way for designer of fascinating kagome systems with diverse topological nontrivialities and collective ground states.

preprint2022arXiv

EaaS: A Service-Oriented Edge Computing Framework Towards Distributed Intelligence

Edge computing has become a popular paradigm where services and applications are deployed at the network edge closer to the data sources. It provides applications with outstanding benefits, including reduced response latency and enhanced privacy protection. For emerging advanced applications, such as autonomous vehicles, industrial IoT, and metaverse, further research is needed. This is because such applications demand ultra-low latency, hyper-connectivity, and dynamic and reliable service provision, while existing approaches are inadequate to address the new challenges. Hence, we envision that the future edge computing is moving towards distributed intelligence, where heterogeneous edge nodes collaborate to provide services in large-scale and geo-distributed edge infrastructure. We thereby propose Edge-as-a-Service (EaaS) to enable distributed intelligence. EaaS jointly manages large-scale cross-node edge resources and facilitates edge autonomy, edge-to-edge collaboration, and resource elasticity. These features enable flexible deployment of services and ubiquitous computation and intelligence. We first give an overview of existing edge computing studies and discuss their limitations to articulate the motivation for proposing EaaS. Then, we describe the details of EaaS, including the physical architecture, proposed software framework, and benefits of EaaS. Various application scenarios, such as real-time video surveillance, smart building, and metaverse, are presented to illustrate the significance and potential of EaaS. Finally, we discuss several challenging issues of EaaS to inspire more research towards this new edge computing framework.

preprint2022arXiv

Generative Graph Neural Networks for Link Prediction

Inferring missing links or detecting spurious ones based on observed graphs, known as link prediction, is a long-standing challenge in graph data analysis. With the recent advances in deep learning, graph neural networks have been used for link prediction and have achieved state-of-the-art performance. Nevertheless, existing methods developed for this purpose are typically discriminative, computing features of local subgraphs around two neighboring nodes and predicting potential links between them from the perspective of subgraph classification. In this formalism, the selection of enclosing subgraphs and heuristic structural features for subgraph classification significantly affects the performance of the methods. To overcome this limitation, this paper proposes a novel and radically different link prediction algorithm based on the network reconstruction theory, called GraphLP. Instead of sampling positive and negative links and heuristically computing the features of their enclosing subgraphs, GraphLP utilizes the feature learning ability of deep-learning models to automatically extract the structural patterns of graphs for link prediction under the assumption that real-world graphs are not locally isolated. Moreover, GraphLP explores high-order connectivity patterns to utilize the hierarchical organizational structures of graphs for link prediction. Our experimental results on all common benchmark datasets from different applications demonstrate that the proposed method consistently outperforms other state-of-the-art methods. Unlike the discriminative neural network models used for link prediction, GraphLP is generative, which provides a new paradigm for neural-network-based link prediction.

preprint2022arXiv

Interactive Image Synthesis with Panoptic Layout Generation

Interactive image synthesis from user-guided input is a challenging task when users wish to control the scene structure of a generated image with ease.Although remarkable progress has been made on layout-based image synthesis approaches, in order to get realistic fake image in interactive scene, existing methods require high-precision inputs, which probably need adjustment several times and are unfriendly to novice users. When placement of bounding boxes is subject to perturbation, layout-based models suffer from "missing regions" in the constructed semantic layouts and hence undesirable artifacts in the generated images. In this work, we propose Panoptic Layout Generative Adversarial Networks (PLGAN) to address this challenge. The PLGAN employs panoptic theory which distinguishes object categories between "stuff" with amorphous boundaries and "things" with well-defined shapes, such that stuff and instance layouts are constructed through separate branches and later fused into panoptic layouts. In particular, the stuff layouts can take amorphous shapes and fill up the missing regions left out by the instance layouts. We experimentally compare our PLGAN with state-of-the-art layout-based models on the COCO-Stuff, Visual Genome, and Landscape datasets. The advantages of PLGAN are not only visually demonstrated but quantitatively verified in terms of inception score, Fréchet inception distance, classification accuracy score, and coverage.

preprint2022arXiv

Photoacoustic Imaging Based on AlN MF-PMUT with Broadened Bandwidth

This paper reports an aluminum nitride (AlN) multi-frequency piezoelectric micromachined ultrasound transducers (MF-PMUT) array for photoacoustic (PA) imaging, where the broadened bandwidth is beneficial to improve imaging resolution. Specifically, PMUT based on micro-electromechanical systems (MEMS) technology is suitable for PA endoscopic imaging of blood vessels and bronchi due to its miniature size. More importantly, AlN is a non-toxic material, which makes it harmless for biomedical applications. In this work, a MF-PMUT array are designed and fabricated for PAI. The device's vibration mode impedance and bandwidth are analyzed. The MF-PMUT sensor provides a wider bandwidth (65%) signal detection, which increases the resolution of PAI compared with traditional PMUT. We conduct an experiment on agar sample to present sensor's performance in images' axial resolution.

preprint2022arXiv

Pressure-induced dimensional crossover in a kagome superconductor

The recently discovered kagome superconductors AV3Sb5 exhibit tantalizing high-pressure phase diagrams, in which a new dome-like superconducting phase emerges under moderate pressure. However, its origin is as yet unknown. Here, we carried out the high-pressure electrical measurements up to 150 GPa, together with the high-pressure X-ray diffraction measurements and first-principles calculations on CsV3Sb5. We find the new superconducting phase to be rather robust and inherently linked to the interlayer Sb2-Sb2 interactions. The formation of Sb2-Sb2 bonds at high pressure tunes the system from two-dimensional to three-dimensional and pushes the Pz orbital of Sb2 upward across the Fermi level, resulting in enhanced density of states and increase of TC. Our work demonstrates that the dimensional crossover at high pressure can induce a topological phase transition and is related to the abnormal high-pressure TC evolution. Our findings should apply for other layered materials.

preprint2021arXiv

A 20-Second Cadence View of Solar-Type Stars and Their Planets with TESS: Asteroseismology of Solar Analogs and a Re-characterization of pi Men c

We present an analysis of the first 20-second cadence light curves obtained by the TESS space telescope during its extended mission. We find a precision improvement of 20-second data compared to 2-minute data for bright stars when binned to the same cadence (~10-25% better for T<~8 mag, reaching equal precision at T~13 mag), consistent with pre-flight expectations based on differences in cosmic ray mitigation algorithms. We present two results enabled by this improvement. First, we use 20-second data to detect oscillations in three solar analogs (gamma Pav, zeta Tuc and pi Men) and use asteroseismology to measure their radii, masses, densities and ages to ~1%, ~3%, ~1% and ~20% respectively, including systematic errors. Combining our asteroseismic ages with chromospheric activity measurements we find evidence that the spread in the activity-age relation is linked to stellar mass and thus convection-zone depth. Second, we combine 20-second data and published radial velocities to re-characterize pi Men c, which is now the closest transiting exoplanet for which detailed asteroseismology of the host star is possible. We show that pi Men c is located at the upper edge of the planet radius valley for its orbital period, confirming that it has likely retained a volatile atmosphere and that the &#34;asteroseismic radius valley&#34; remains devoid of planets. Our analysis favors a low eccentricity for pi Men c (<0.1 at 68% confidence), suggesting efficient tidal dissipation (Q/k <~ 2400) if it formed via high-eccentricity migration. Combined, these early results demonstrate the strong potential of TESS 20-second cadence data for stellar astrophysics and exoplanet science.

preprint2020arXiv

Detection and characterisation of oscillating red giants: first results from the TESS satellite

Since the onset of the `space revolution&#39; of high-precision high-cadence photometry, asteroseismology has been demonstrated as a powerful tool for informing Galactic archaeology investigations. The launch of the NASA TESS mission has enabled seismic-based inferences to go full sky -- providing a clear advantage for large ensemble studies of the different Milky Way components. Here we demonstrate its potential for investigating the Galaxy by carrying out the first asteroseismic ensemble study of red giant stars observed by TESS. We use a sample of 25 stars for which we measure their global asteroseimic observables and estimate their fundamental stellar properties, such as radius, mass, and age. Significant improvements are seen in the uncertainties of our estimates when combining seismic observables from TESS with astrometric measurements from the Gaia mission compared to when the seismology and astrometry are applied separately. Specifically, when combined we show that stellar radii can be determined to a precision of a few percent, masses to 5-10% and ages to the 20% level. This is comparable to the precision typically obtained using end-of-mission Kepler data

preprint2020arXiv

Drosophila-Inspired 3D Moving Object Detection Based on Point Clouds

3D moving object detection is one of the most critical tasks in dynamic scene analysis. In this paper, we propose a novel Drosophila-inspired 3D moving object detection method using Lidar sensors. According to the theory of elementary motion detector, we have developed a motion detector based on the shallow visual neural pathway of Drosophila. This detector is sensitive to the movement of objects and can well suppress background noise. Designing neural circuits with different connection modes, the approach searches for motion areas in a coarse-to-fine fashion and extracts point clouds of each motion area to form moving object proposals. An improved 3D object detection network is then used to estimate the point clouds of each proposal and efficiently generates the 3D bounding boxes and the object categories. We evaluate the proposed approach on the widely-used KITTI benchmark, and state-of-the-art performance was obtained by using the proposed approach on the task of motion detection.

preprint2020arXiv

Microsoft Recommenders: Tools to Accelerate Developing Recommender Systems

The purpose of this work is to highlight the content of the Microsoft Recommenders repository and show how it can be used to reduce the time involved in developing recommender systems. The open source repository provides python utilities to simplify common recommender-related data science work as well as example Jupyter notebooks that demonstrate use of the algorithms and tools under various environments.

preprint2020arXiv

Modeling Information Need of Users in Search Sessions

Users issue queries to Search Engines, and try to find the desired information in the results produced. They repeat this process if their information need is not met at the first place. It is crucial to identify the important words in a query that depict the actual information need of the user and will determine the course of a search session. To this end, we propose a sequence-to-sequence based neural architecture that leverages the set of past queries issued by users, and results that were explored by them. Firstly, we employ our model for predicting the words in the current query that are important and would be retained in the next query. Additionally, as a downstream application of our model, we evaluate it on the widely popular task of next query suggestion. We show that our intuitive strategy of capturing information need can yield superior performance at these tasks on two large real-world search log datasets.

preprint2020arXiv

Optimization of Graph Total Variation via Active-Set-based Combinatorial Reconditioning

Structured convex optimization on weighted graphs finds numerous applications in machine learning and computer vision. In this work, we propose a novel adaptive preconditioning strategy for proximal algorithms on this problem class. Our preconditioner is driven by a sharp analysis of the local linear convergence rate depending on the &#34;active set&#34; at the current iterate. We show that nested-forest decomposition of the inactive edges yields a guaranteed local linear convergence rate. Further, we propose a practical greedy heuristic which realizes such nested decompositions and show in several numerical experiments that our reconditioning strategy, when applied to proximal gradient or primal-dual hybrid gradient algorithm, achieves competitive performances. Our results suggest that local convergence analysis can serve as a guideline for selecting variable metrics in proximal algorithms.

preprint2020arXiv

TESS Asteroseismic Analysis of the Known Exoplanet Host Star HD 222076

The Transiting Exoplanet Survey Satellite (TESS) is an all-sky survey mission aiming to search for exoplanets that transit bright stars. The high-quality photometric data of TESS are excellent for the asteroseismic study of solar-like stars. In this work, we present an asteroseismic analysis of the red-giant star HD~222076 hosting a long-period (2.4 yr) giant planet discovered through radial velocities. Solar-like oscillations of HD~222076 are detected around $203 \, μ$Hz by TESS for the first time. Asteroseismic modeling, using global asteroseismic parameters as input, yields a determination of the stellar mass ($M_\star = 1.12 \pm 0.12\, M_\odot$), radius ($R_\star = 4.34 \pm 0.21\,R_\odot$), and age ($7.4 \pm 2.7\,$Gyr), with precisions greatly improved from previous studies. The period spacing of the dipolar mixed modes extracted from the observed power spectrum reveals that the star is on the red-giant branch burning hydrogen in a shell surrounding the core. We find that the planet will not escape the tidal pull of the star and be engulfed into it within about $800\,$Myr, before the tip of the red-giant branch is reached.

preprint2020arXiv

Zero-Shot Heterogeneous Transfer Learning from Recommender Systems to Cold-Start Search Retrieval

Many recent advances in neural information retrieval models, which predict top-K items given a query, learn directly from a large training set of (query, item) pairs. However, they are often insufficient when there are many previously unseen (query, item) combinations, often referred to as the cold start problem. Furthermore, the search system can be biased towards items that are frequently shown to a query previously, also known as the &#39;rich get richer&#39; (a.k.a. feedback loop) problem. In light of these problems, we observed that most online content platforms have both a search and a recommender system that, while having heterogeneous input spaces, can be connected through their common output item space and a shared semantic representation. In this paper, we propose a new Zero-Shot Heterogeneous Transfer Learning framework that transfers learned knowledge from the recommender system component to improve the search component of a content platform. First, it learns representations of items and their natural-language features by predicting (item, item) correlation graphs derived from the recommender system as an auxiliary task. Then, the learned representations are transferred to solve the target search retrieval task, performing query-to-item prediction without having seen any (query, item) pairs in training. We conduct online and offline experiments on one of the world&#39;s largest search and recommender systems from Google, and present the results and lessons learned. We demonstrate that the proposed approach can achieve high performance on offline search retrieval tasks, and more importantly, achieved significant improvements on relevance and user interactions over the highly-optimized production system in online experiments.

preprint2014arXiv

Limiting aspects of non-convex ${TV}^ϕ$ models

Recently, non-convex regularisation models have been introduced in order to provide a better prior for gradient distributions in real images. They are based on using concave energies $ϕ$ in the total variation type functional ${TV}^ϕ(u) := \int ϕ(|\nabla u(x)|) d x$. In this paper, it is demonstrated that for typical choices of $ϕ$, functionals of this type pose several difficulties when extended to the entire space of functions of bounded variation, ${BV}(Ω)$. In particular, if $ϕ(t)=t^q$ for $q \in (0, 1)$ and ${TV}^ϕ$ is defined directly for piecewise constant functions and extended via weak* lower semicontinuous envelopes to ${BV}(Ω)$, then still ${TV}^ϕ(u)=\infty$ for $u$ not piecewise constant. If, on the other hand, ${TV}^ϕ$ is defined analogously via continuously differentiable functions, then ${TV}^ϕ\equiv 0$, (!). We study a way to remedy the models through additional multiscale regularisation and area strict convergence, provided that the energy $ϕ(t)=t^q$ is linearised for high values. The fact, that this kind of energies actually better matches reality and improves reconstructions, is demonstrated by statistics and numerical experiments.