Source author record

Lin Zhu

Lin Zhu appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher · Unclaimed source record

Computer Vision · physics.class-ph · Artificial Intelligence · math.RT · Networking and Internet Architecture · Neural and Evolutionary Computing · physics.geo-ph · physics.optics · cond-mat.mtrl-sci · eess.SP · eess.SY · hep-ex · hep-ph · Machine Learning · physics.ao-ph · Systems and Control

Catalog footprint

What is connected

21 works

16 topics

4 close collaborators

preprint · 2026 · arXiv

Dynamic Pondering Sparsity-aware Mixture-of-Experts Transformer for Event Stream based Visual Object Tracking

Despite significant progress, RGB-based trackers remain vulnerable to challenging imaging conditions, such as low illumination and fast motion. Event cameras offer a promising alternative by asynchronously capturing pixel-wise brightness changes, providing high dynamic range and high temporal resolution. However, existing event-based trackers often neglect the intrinsic spatial sparsity and temporal density of event data, while relying on a single fixed temporal-window sampling strategy that is suboptimal under varying motion dynamics. In this paper, we propose an event sparsity-aware tracking framework that explicitly models event-density variations across multiple temporal scales. Specifically, the proposed framework progressively injects sparse, medium-density, and dense event search regions into a three-stage Vision Transformer backbone, enabling hierarchical multi-density feature learning. Furthermore, we introduce a sparsity-aware Mixture-of-Experts module to encourage expert specialization under different sparsity patterns, and design a dynamic pondering strategy to adaptively adjust the inference depth according to tracking difficulty. Extensive experiments on FE240hz, COESOT, and EventVOT demonstrate that the proposed approach achieves a favorable trade-off between tracking accuracy and computational efficiency. The source code will be released on https://github.com/Event-AHU/OpenEvTracking.

preprint · 2024 · arXiv

CRSOT: Cross-Resolution Object Tracking using Unaligned Frame and Event Cameras

Existing datasets for RGB-DVS tracking are collected with the DVS346 camera, and their resolution ($346 \times 260$) is too low for practical applications. In practice, many deployed systems use only visible cameras, and newly designed neuromorphic cameras may have different resolutions. The latest neuromorphic sensors can output high-definition event streams, but it is very difficult to achieve strict spatial and temporal alignment between events and frames. Therefore, how to achieve accurate tracking with unaligned neuromorphic and visible sensors is a valuable but unexplored problem. In this work, we formally propose the task of object tracking using unaligned neuromorphic and visible cameras. We build the first unaligned frame-event dataset, CRSOT, collected with a specially built data acquisition system; it contains 1,030 high-definition RGB-Event video pairs with 304,974 video frames. In addition, we propose a novel unaligned object tracking framework that achieves robust tracking even with loosely aligned RGB-Event data. Specifically, we extract the template and search regions of the RGB and Event data and feed them into a unified ViT backbone for feature embedding. We then propose uncertainty perception modules to encode the RGB and Event features, respectively, and a modality uncertainty fusion module to aggregate the two modalities. These three branches are jointly optimized in the training phase. Extensive experiments demonstrate that our tracker can exploit the two modalities together for high-performance tracking even without strict temporal and spatial alignment. The source code, dataset, and pre-trained models will be released at https://github.com/Event-AHU/Cross_Resolution_SOT.

preprint · 2024 · arXiv

Revisiting Color-Event based Tracking: A Unified Network, Dataset, and Metric

Combining Color and Event cameras (also called Dynamic Vision Sensors, DVS) for robust object tracking is an emerging research topic. Existing color-event tracking frameworks usually contain multiple scattered modules, including feature extraction, fusion, matching, and interactive learning, which may lead to low efficiency and high computational complexity. In this paper, we propose a single-stage backbone network for Color-Event Unified Tracking (CEUTrack), which achieves the above functions simultaneously. Given the event points and RGB frames, we first transform the points into voxels and crop the template and search regions for both modalities. Then, these regions are projected into tokens and fed in parallel into the unified Transformer backbone network. The output features are fed into a tracking head for target object localization. Our proposed CEUTrack is simple, effective, and efficient, achieving over 75 FPS and new SOTA performance. To better validate the effectiveness of our model and address the data deficiency of this task, we also propose a generic and large-scale benchmark dataset for color-event tracking, termed COESOT, which contains 90 categories and 1354 video sequences. Additionally, a new evaluation metric named BOC is proposed in our evaluation toolkit to evaluate the prominence with respect to the baseline methods. We hope the newly proposed method, dataset, and evaluation metric provide a better platform for color-event-based tracking. The dataset, toolkit, and source code will be released at https://github.com/Event-AHU/COESOT.
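The abstract above mentions transforming event points into voxels but does not say how. As a rough illustration only, the following sketch accumulates signed polarities into a fixed number of temporal bins. The function name `events_to_voxel_grid`, the `(x, y, t, p)` tuple layout, and the uniform binning are assumptions made for this example, not the CEUTrack implementation.

```python
from typing import List, Tuple

# Hypothetical event layout: (x, y, timestamp, polarity), polarity in {-1, +1}.
Event = Tuple[int, int, float, int]


def events_to_voxel_grid(
    events: List[Event], width: int, height: int, num_bins: int
) -> List[List[List[float]]]:
    """Accumulate event polarities into a grid indexed as [bin][y][x]."""
    grid = [[[0.0] * width for _ in range(height)] for _ in range(num_bins)]
    if not events:
        return grid
    t_min = min(e[2] for e in events)
    t_max = max(e[2] for e in events)
    span = (t_max - t_min) or 1.0  # avoid division by zero for a single timestamp
    for x, y, t, p in events:
        if not (0 <= x < width and 0 <= y < height):
            continue  # ignore events outside the sensor area
        # Uniform temporal binning; the latest event is clamped into the last bin.
        b = min(int((t - t_min) / span * num_bins), num_bins - 1)
        grid[b][y][x] += p
    return grid


# Toy input (made-up values): four events on a 2x2 sensor, split into two bins.
events = [(0, 0, 0.00, 1), (1, 0, 0.01, -1), (1, 1, 0.02, 1), (1, 1, 0.03, 1)]
grid = events_to_voxel_grid(events, width=2, height=2, num_bins=2)
```

Each temporal slice of such a grid can be treated like an image channel before cropping template and search regions, which is one common way to pair sparse events with dense RGB frames.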

preprint · 2022 · arXiv

Data-Driven Fast Frequency Control using Inverter-Based Resources

We develop and test a data-driven and area-based fast frequency control scheme, which rapidly redispatches inverter-based resources to compensate for local power imbalances within the bulk power system. The approach requires no explicit system model information, relying only on historical measurement sequences for the computation of control actions. Our technical approach fuses developments in low-gain estimator design and data-driven control to provide a model-free and practical solution for fast frequency control. Theoretical results and extensive simulation scenarios on a three-area system are provided to support the approach.

preprint · 2022 · arXiv

Event-based Video Reconstruction via Potential-assisted Spiking Neural Network

The neuromorphic vision sensor is a new bio-inspired imaging paradigm that reports asynchronous, continuous per-pixel brightness changes called 'events' with high temporal resolution and high dynamic range. So far, event-based image reconstruction methods have been based on artificial neural networks (ANN) or hand-crafted spatiotemporal smoothing techniques. In this paper, we present the first implementation of image reconstruction with a fully spiking neural network (SNN) architecture. As bio-inspired neural networks, SNNs operate with asynchronous binary spikes distributed over time and can potentially achieve greater computational efficiency on event-driven hardware. We propose a novel Event-based Video reconstruction framework based on a fully Spiking Neural Network (EVSNN), which utilizes Leaky-Integrate-and-Fire (LIF) neurons and Membrane Potential (MP) neurons. We find that spiking neurons have the potential to store useful temporal information (memory) to complete such time-dependent tasks. Furthermore, to better utilize the temporal information, we propose a hybrid potential-assisted framework (PA-EVSNN) using the membrane potential of the spiking neuron. The proposed neuron is referred to as the Adaptive Membrane Potential (AMP) neuron, which adaptively updates the membrane potential according to the input spikes. The experimental results demonstrate that our models achieve comparable performance to ANN-based models on the IJRR, MVSEC, and HQF datasets. In terms of energy consumption, EVSNN and PA-EVSNN are 19.36$\times$ and 7.75$\times$ more computationally efficient than their ANN counterparts, respectively.
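To make the Leaky-Integrate-and-Fire idea mentioned above concrete, here is a minimal discrete-time sketch of a generic textbook LIF neuron. It is not the paper's MP or AMP neuron; the function name `simulate_lif` and the `leak`, `threshold`, and `v_reset` values are assumptions chosen for illustration.

```python
from typing import List, Tuple


def simulate_lif(
    input_current: List[float],
    threshold: float = 1.0,
    leak: float = 0.5,
    v_reset: float = 0.0,
) -> Tuple[List[int], List[float]]:
    """Simulate one LIF neuron; return its spike train and membrane potentials."""
    v = 0.0
    spikes: List[int] = []
    potentials: List[float] = []
    for current in input_current:
        # Leaky integration: the previous potential decays, then input is added.
        v = leak * v + current
        if v >= threshold:
            spikes.append(1)
            v = v_reset  # hard reset after firing
        else:
            spikes.append(0)
        potentials.append(v)
    return spikes, potentials


# Toy input (made-up values): constant drive for three steps, then silence.
spikes, potentials = simulate_lif([0.6, 0.6, 0.6, 0.0, 0.0])
```

The membrane potential `v` carries information from one time step to the next, which is the kind of temporal memory the abstract refers to; with the toy input the neuron needs three steps of drive before it crosses the threshold.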

preprint · 2022 · arXiv

GenAD: General Representations of Multivariate Time Series for Anomaly Detection

The reliability of wireless base stations in China Mobile is of vital importance, because cell phone users are connected to the stations and the behaviors of the stations are directly related to user experience. Although the monitoring of station behaviors can be realized by anomaly detection on multivariate time series, building a general unsupervised anomaly detection model with a higher F1-score remains a challenging task due to the complex correlations and various temporal patterns of multivariate series in large-scale stations. In this paper, we propose a General representation of multivariate time series for Anomaly Detection (GenAD). First, we pre-train a general model on large-scale wireless base stations with self-supervision, which can be easily transferred to anomaly detection for a specific station with a small amount of training data. Second, we employ Multi-Correlation Attention and Time-Series Attention to represent the correlations and temporal patterns of the stations. With the above innovations, GenAD increases F1-score by a total of 9% on real-world datasets in China Mobile, while the performance does not significantly degrade on public datasets with only 10% of the training data.

preprint · 2022 · arXiv

Mirror Complementary Transformer Network for RGB-thermal Salient Object Detection

RGB-thermal salient object detection (RGB-T SOD) aims to locate the common prominent objects of an aligned visible and thermal infrared image pair and accurately segment all the pixels belonging to those objects. It is promising in challenging scenes such as nighttime and complex backgrounds because thermal images are insensitive to lighting conditions. Thus, the key problem of RGB-T SOD is to make the features from the two modalities complement and adjust each other flexibly, since either modality of an RGB-T image pair will inevitably fail in challenging scenes such as extreme light conditions and thermal crossover. In this paper, we propose a novel mirror complementary Transformer network (MCNet) for RGB-T SOD. Specifically, we introduce a Transformer-based feature extraction module to effectively extract hierarchical features of RGB and thermal images. Then, through the attention-based feature interaction and serial multiscale dilated convolution (SDC) based feature fusion modules, the proposed model achieves the complementary interaction of low-level features and the semantic fusion of deep features. Finally, based on the mirror complementary structure, the salient regions of the two modalities can be accurately extracted even when one modality is invalid. To demonstrate the robustness of the proposed model under challenging scenes in the real world, we build a novel RGB-T SOD dataset, VT723, based on a large public semantic segmentation RGB-T dataset used in the autonomous driving domain. Extensive experiments on benchmark and VT723 datasets show that the proposed method outperforms state-of-the-art approaches, including CNN-based and Transformer-based methods. The code and dataset will be released later at https://github.com/jxr326/SwinMCNet.

preprint · 2022 · arXiv

Noise and Edge Based Dual Branch Image Manipulation Detection

Unlike ordinary computer vision tasks that focus more on the semantic content of images, the image manipulation detection task pays more attention to the subtle information of image manipulation. In this paper, the noise image extracted by the improved constrained convolution is used as the input of the model instead of the original image to obtain more subtle traces of manipulation. Meanwhile, the dual-branch network, consisting of a high-resolution branch and a context branch, is used to capture as many traces of artifacts as possible. In general, most manipulations leave artifacts on the manipulation edge. A specially designed manipulation edge detection module is constructed based on the dual-branch network to identify these artifacts better. The correlation between pixels in an image is closely related to their distance: the farther apart two pixels are, the weaker the correlation. We add a distance factor to the self-attention module to better describe the correlation between pixels. Experimental results on four publicly available image manipulation datasets demonstrate the effectiveness of our model.
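The abstract above does not give the form of the distance factor. One plausible way to weaken attention between far-apart pixels, shown here purely as an assumption, is to subtract a penalty proportional to Euclidean distance from the attention logits before the softmax. The function name `distance_aware_attention` and the `decay` parameter are hypothetical.

```python
import math
from typing import List, Sequence, Tuple


def distance_aware_attention(
    scores: Sequence[Sequence[float]],
    positions: Sequence[Tuple[float, float]],
    decay: float = 0.5,
) -> List[List[float]]:
    """Row-wise softmax over scores[i][j] - decay * distance(i, j)."""
    n = len(scores)
    weights: List[List[float]] = []
    for i in range(n):
        # Assumed penalty: a linear decay with pixel distance, not the paper's formula.
        logits = [
            scores[i][j] - decay * math.dist(positions[i], positions[j])
            for j in range(n)
        ]
        peak = max(logits)  # subtract the max for numerical stability
        exps = [math.exp(value - peak) for value in logits]
        total = sum(exps)
        weights.append([e / total for e in exps])
    return weights


# Toy input (made-up values): equal content scores, so only distance matters.
positions = [(0, 0), (0, 1), (0, 5)]
scores = [[0.0] * 3 for _ in range(3)]
weights = distance_aware_attention(scores, positions)
```

With identical content scores, the first pixel attends most to itself, less to its neighbor one pixel away, and least to the pixel five pixels away, which matches the stated intuition that correlation weakens with distance.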

preprint · 2022 · arXiv

Proposal for the search for exotic spin-spin interactions at the micrometer scale using functionalized cantilever force sensors

Spin-dependent exotic interactions can be generated by exchanging hypothetical bosons, which were introduced to solve some puzzles in physics. Many precision experiments have been performed to search for such interactions, but no confirmed observation has been made. Here, we propose new experiments to search for the exotic spin-spin interactions that can be mediated by axions or Z$^\prime$ bosons. A sensitive functionalized cantilever is utilized as a force sensor to measure the interactions between the spin-polarized electrons in a periodic magnetic source structure and a closed-loop magnetic structure integrated on the cantilever. The source is set to oscillate during data acquisition to modulate the exotic force signal to high harmonics of the oscillating frequency. This helps to suppress the spurious signals at the signal frequency. Different magnetic source structures are designed for detecting different interactions. A magnetic stripe structure is designed for Z$^\prime$-mediated interaction, which is insensitive to the detection of axion-mediated interaction. This allows us to measure the coupling constants of both if we assume both exist. With the force sensitivity achievable at low temperature, the proposed experiments are expected to probe parameter space with much smaller coupling constants than the current stringent constraints in the micrometer to millimeter range. Specifically, the lower bound of the parameter space will be seven orders of magnitude lower than the stringent constraints for Z$^\prime$-mediated interaction, and an order of magnitude lower for axion-mediated interaction, at the interaction range of $10\,\mu$m.

preprint · 2022 · arXiv

Temporal Up-Sampling for Asynchronous Events

The event camera is a novel bio-inspired vision sensor. When the brightness change exceeds the preset threshold, the sensor generates events asynchronously. The number of valid events directly affects the performance of event-based tasks, such as reconstruction, detection, and recognition. However, in low-brightness or slow-moving scenes, events are often sparse and accompanied by noise, which poses challenges for event-based tasks. To solve these challenges, we propose an event temporal up-sampling algorithm to generate more effective and reliable events. The main idea of our algorithm is to generate up-sampling events on the event motion trajectory. First, we estimate the event motion trajectory with a contrast maximization algorithm, and then we up-sample the events using temporal point processes. Experimental results show that up-sampling events can provide more effective information and improve the performance of downstream tasks, such as improving the quality of reconstructed images and increasing the accuracy of object detection.

preprint · 2020 · arXiv

Smart Prediction of the Complaint Hotspot Problem in Mobile Network

In mobile networks, a complaint hotspot problem can affect service for thousands of users and lead to significant economic losses and bulk complaints. In this paper, we propose an approach to predict customer complaints based on real-time user signalling data. By analyzing the network and user service procedure, 30 key data fields related to user experience have been extracted from XDR data collected from the S1 interface. Furthermore, we augment these basic features with derived features for user experience evaluation, such as one-hot features, statistical features and differential features. Considering the problem of unbalanced data, we use LightGBM as our prediction model. LightGBM has strong generalization ability and was designed to handle unbalanced data. The experiments we conducted demonstrate the effectiveness and efficiency of this proposal. This approach has been deployed in daily operation to locate the scope of hot complaint problems as well as to report affected users and areas.

preprint · 2016 · arXiv

A Graph-Based Semi-Supervised k Nearest-Neighbor Method for Nonlinear Manifold Distributed Data Classification

$k$ Nearest Neighbors ($k$NN) is one of the most widely used supervised learning algorithms to classify Gaussian distributed data, but it does not achieve good results when it is applied to nonlinear manifold distributed data, especially when only a very limited number of labeled samples is available. In this paper, we propose a new graph-based $k$NN algorithm which can effectively handle both Gaussian distributed data and nonlinear manifold distributed data. To achieve this goal, we first propose a constrained Tired Random Walk (TRW) by constructing an $R$-level nearest-neighbor strengthened tree over the graph, and then compute a TRW matrix for similarity measurement purposes. After this, the nearest neighbors are identified according to the TRW matrix and the class label of a query point is determined by the sum of all the TRW weights of its nearest neighbors. To deal with online situations, we also propose a new algorithm to handle sequential samples based on local neighborhood reconstruction. Comparison experiments are conducted on both synthetic data sets and real-world data sets to demonstrate the validity of the proposed new $k$NN algorithm and its improvements over other versions of $k$NN algorithms. Given the widespread appearance of manifold structures in real-world problems and the popularity of the traditional $k$NN algorithm, the proposed manifold version of $k$NN shows promising potential for classifying manifold-distributed data.

preprint · 2016 · arXiv

An analytical study of mismatched complementary media

Complementary media (CM) interacting with arbitrarily situated obstacles are rarely discussed. In this paper, an analytical framework based on multiple scattering theory is established for analyzing such a mismatched case. As examples, CM-based devices, i.e., a superlens and a superscatterer, are discussed. From the analysis, the cancellation mechanism of the mismatched CM is studied. In addition, numerical results are provided for illustration. Moreover, further study shows that such cancellation effects might rely on specific conditions. The conclusions are not restricted to any specific frequencies; they could be extended to many other areas including applications to active cloaking, antennas, and wireless power transfer.

preprint · 2016 · arXiv

Unbounded ladders induced by Gorenstein algebras

The derived category of a Gorenstein triangular matrix algebra $A$ admits an unbounded ladder, which is of period $3$ if $A = T_2(B)$. Also, a left recollement of triangulated categories with Serre functors sits in a ladder of period $1$; as an application, the singularity category of $A$ admits a ladder of period $1$.

preprint · 2016 · arXiv

Unbounded ladders induced by Gorenstein algebras

The derived category $D({\rm Mod}A)$ of a Gorenstein triangular matrix algebra $A$ admits an unbounded ladder; and this ladder restricts to $D^-({\rm Mod})$ (resp. $D^b({\rm Mod})$, $D^b({\rm mod})$, $K^b({\rm proj})$). A left recollement of triangulated categories with Serre functors sits in a ladder of period $1$; as an application, the singularity category of $A$ admits a ladder of period $1$.

preprint · 2015 · arXiv

Extend the explanation of transformation optics in metamaterial-modified wireless power transfer system

Based on rigorous scattering theory, we establish a systematic methodology for studying metamaterial-modified current-carrying conductors. With it, we mathematically demonstrate that the transformation optics (TO) explanation can be extended to metamaterial-modified wireless power transfer systems, and on that basis we establish an equivalent model. More importantly, our demonstration reveals that the equivalent model remains applicable even when TO cannot give a direct explanation because the requirements of complementary media are not satisfied. Numerical results from our methodology and from COMSOL verify our findings. The demonstration is not tied to a specific frequency, so the conclusion can be extended to a broad range of wavelengths and is expected to apply to active cloaks, among other applications.

preprint · 2015 · arXiv

Statistic inversion of multi-zone transition probability models for aquifer characterization in alluvial fans

Understanding the heterogeneity arising from the complex architecture of sedimentary sequences in alluvial fans is challenging. This paper develops a statistical inverse framework in a multi-zone transition probability approach for characterizing the heterogeneity in alluvial fans. An analytical solution of the transition probability matrix is used to define the statistical relationships among different hydrofacies and their mean lengths, integral scales, and volumetric proportions. A statistical inversion is conducted to identify the multi-zone transition probability models and estimate the optimal statistical parameters using the modified Gauss-Newton-Levenberg-Marquardt method. The Jacobian matrix is computed by the sensitivity equation method, which results in an accurate inverse solution with quantification of parameter uncertainty. We use the Chaobai River alluvial fan in the Beijing Plain, China, as an example for elucidating the methodology of alluvial fan characterization. The alluvial fan is divided into three sediment zones. In each zone, the explicit mathematical formulations of the transition probability models are constructed with optimized different integral scales and volumetric proportions. The hydrofacies distributions in the three zones are simulated sequentially by the multi-zone transition probability-based indicator simulations. The result of this study provides the heterogeneous structure of the alluvial fan for further study of flow and transport simulations.

preprint · 2014 · arXiv

An integrated assessment of the impact of precipitation and groundwater on vegetation growth in arid and semiarid areas

Increased demand for water resources together with the influence of climate change has degraded water conditions which support vegetation in many parts of the world, especially in arid and semiarid areas. This study develops an integrated framework to assess the impact of precipitation and groundwater on vegetation growth in the Xiliao River Plain of northern China. The integrated framework systematically combines remote sensing technology with water flow modeling in the vadose zone and field data analysis. The vegetation growth is quantitatively evaluated with the remote sensing data by the Normalized Difference Vegetation Index (NDVI) and the simulated plant water uptake rates. The correlations among precipitation, groundwater depth and NDVI are investigated by using Pearson correlation equations. The results provide insights for understanding interactions between precipitation and groundwater and their contributions to vegetation growth. Strong correlations between groundwater depth, plant water uptake and NDVI are found in parts of the study area during a ten-year drought period. The numerical modeling results indicate that there is an increased correlation between the groundwater depth and vegetation growth and that groundwater significantly contributes to sustaining effective soil moisture for vegetation growth during the long drought period. Therefore, a decreasing groundwater table might pose a great threat to the survival of vegetation during a long drought period.
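As a small illustration of the two quantities named above, the sketch below computes NDVI from near-infrared and red reflectance using the standard definition (NIR - Red) / (NIR + Red), and a Pearson correlation coefficient between groundwater depth and NDVI. All numbers are made-up toy values, not data from the study, and the helper names `ndvi` and `pearson` are introduced here for the example.

```python
import math
from typing import Sequence


def ndvi(nir: float, red: float) -> float:
    """Normalized Difference Vegetation Index: (NIR - Red) / (NIR + Red)."""
    denom = nir + red
    return (nir - red) / denom if denom else 0.0


def pearson(xs: Sequence[float], ys: Sequence[float]) -> float:
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    std_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs))
    std_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (std_x * std_y)


# Toy values (illustrative only): deeper groundwater paired with weaker vegetation signal.
groundwater_depth_m = [2.0, 3.0, 4.0, 5.0]
nir_reflectance = [0.50, 0.45, 0.40, 0.35]
red_reflectance = [0.10, 0.15, 0.20, 0.25]

ndvi_values = [ndvi(n, r) for n, r in zip(nir_reflectance, red_reflectance)]
r_depth_ndvi = pearson(groundwater_depth_m, ndvi_values)
```

In this toy series NDVI falls as the water table deepens, so the coefficient is strongly negative; real analyses would compute it per pixel or per zone over the study period.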

preprint · 2014 · arXiv

Robust and Efficient Subspace Segmentation via Least Squares Regression

This paper studies the subspace segmentation problem, which aims to segment data drawn from a union of multiple linear subspaces. Recent works using sparse representation, low-rank representation, and their extensions have attracted much attention. If the subspaces from which the data are drawn are independent or orthogonal, these methods are able to obtain a block diagonal affinity matrix, which usually leads to a correct segmentation. The main differences among them are their objective functions. We theoretically show that if the objective function satisfies some conditions, and the data are sufficiently drawn from independent subspaces, the obtained affinity matrix is always block diagonal. Furthermore, the data sampling can be insufficient if the subspaces are orthogonal. Several existing methods are special cases. Then we present the Least Squares Regression (LSR) method for subspace segmentation. It takes advantage of data correlation, which is common in real data. LSR encourages a grouping effect which tends to group highly correlated data together. Experimental results on the Hopkins 155 database and Extended Yale Database B show that our method significantly outperforms state-of-the-art methods. Beyond segmentation accuracy, all experiments demonstrate that LSR is much more efficient.
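The abstract above does not write out the LSR objective. A commonly used formulation, stated here as an assumption, is a ridge-regularized self-representation of the data matrix $X$ (one sample per column) with regularization weight $\lambda > 0$. Under that assumption the minimizer has a closed form, which would be consistent with the efficiency claim:

```latex
% Assumed objective: self-representation with a Frobenius-norm penalty.
\min_{Z} \; \|X - XZ\|_F^2 + \lambda \|Z\|_F^2

% Setting the gradient with respect to Z to zero:
-2X^\top (X - XZ) + 2\lambda Z = 0
\;\Longrightarrow\;
(X^\top X + \lambda I)\, Z = X^\top X

% Closed-form solution and a symmetric affinity matrix built from it:
Z^\ast = (X^\top X + \lambda I)^{-1} X^\top X,
\qquad
W = \tfrac{1}{2}\left(|Z^\ast| + |Z^\ast|^\top\right)
```

The affinity matrix $W$ would then typically be passed to a spectral clustering step to obtain the segmentation; that last step is likewise an assumption about the pipeline, since the abstract does not name it.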

preprint · 2013 · arXiv

Optomechanical Transductions in Single and Coupled Wheel Resonators

In this report, the optomechanical transductions in both single and two side-coupled wheel resonators are investigated. In the single resonator, the optomechanical transduction sensitivity is determined by the optical and mechanical quality factors of the resonator. In the coupled resonators, the optomechanical transduction is related to the energy distribution in the two resonators, which is strongly dependent on the input detuning. Compared to a single resonator, the coupled resonators can still provide very sensitive optomechanical transduction even if the optical and mechanical quality factors of one resonator are degraded.

preprint · 2010 · arXiv

An invisibility cloak using silver nanowires

In this paper, we use the parameter retrieval method together with an analytical effective medium approach to design a well-performing invisibility cloak, which is based on an empirically revised version of the reduced cloak. The designed cloak can be implemented by silver nanowires with elliptical cross-sections embedded in a polymethyl methacrylate host. This cloak is numerically shown to be robust with respect to both the inner hidden object and the incoming detecting waves, and it is much simpler, and thus easier to manufacture, than the earlier proposed one [Nat. Photon. 1, 224 (2007)].
