Source author record

Lijun Wang

Lijun Wang appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Computer Vision Networking and Internet Architecture physics.ins-det Machine Learning Artificial Intelligence eess.IV eess.SP Information Theory math.IT Methodology Multimedia physics.atom-ph physics.chem-ph physics.optics

Catalog footprint

What is connected

16works

14topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

AR-MOT: Autoregressive Multi-object Tracking

As multi-object tracking (MOT) tasks continue to evolve toward more general and multi-modal scenarios, the rigid and task-specific architectures of existing MOT methods increasingly hinder their applicability across diverse tasks and limit flexibility in adapting to new tracking formulations. Most approaches rely on fixed output heads and bespoke tracking pipelines, making them difficult to extend to more complex or instruction-driven tasks. To address these limitations, we propose AR-MOT, a novel autoregressive paradigm that formulates MOT as a sequence generation task within a large language model (LLM) framework. This design enables the model to output structured results through flexible sequence construction, without requiring any task-specific heads. To enhance region-level visual perception, we introduce an Object Tokenizer based on a pretrained detector. To mitigate the misalignment between global and regional features, we propose a Region-Aware Alignment (RAA) module, and to support long-term tracking, we design a Temporal Memory Fusion (TMF) module that caches historical object tokens. AR-MOT offers strong potential for extensibility, as new modalities or instructions can be integrated by simply modifying the output sequence format without altering the model architecture. Extensive experiments on MOT17 and DanceTrack validate the feasibility of our approach, achieving performance comparable to state-of-the-art methods while laying the foundation for more general and flexible MOT systems.

preprint2026arXiv

Improve Power of Knockoffs with Annotation Information of Covariates

Genome-wide association studies (GWAS) often find association signals between many genetic variants and traits of interest in a genomic region. Functional annotations of these variants provide valuable prior information that helps prioritize biologically relevant variants and enhances the power to detect causal variants. However, due to substantial correlations among these variants, a critical question is how to rigorously control the false discovery rate while effectively leveraging prior knowledge. We introduce annotation-informed knockoffs (AnnoKn), a knockoff-based method that performs annotation-informed variable selection with strict control of the false discovery rate. AnnoKn integrates the knockoff procedure with adaptive Lasso regression to evaluate the importance of multiple covariates while incorporating functional annotation information within a unified Bayesian framework. To facilitate real-world applications where individual-level data are not accessible, we further extend AnnoKn to operate on summary statistics. Through simulations and real-world applications to GTEx and GWAS datasets, we show that AnnoKn achieves superior power in detecting causal genetic variants compared with existing annotation-informed variable selection methods, while maintaining valid control over false discoveries.

preprint2024arXiv

Sparse Full Configuration Interaction

We propose an efficient deterministic method to calculate the full configuration interaction (FCI) ground state energy. This method leverages the sparseness of the Lanczos basis vectors that span the Krylov subspace associated with the Hamiltonian to rapidly calculate the lowest eigenvalue of the effective Hamiltonian formed in this basis. By iteratively performing the spanning and diagonalization steps, this method is capable of rapidly reaching chemical accuracy for a variety of strongly correlated molecules, including stretched N$_2$ and C$_2$, in merely several tens of iterations. To accelerate our algorithm, we implement parallelized sparse matrix-sparse vector multiplication, which features a novel hashing method that establishes a one-to-one correspondence between the set of full configuration Slater determinants and a set of consecutive integers. A near-linear acceleration was observed.

preprint2022arXiv

Time shifting deviation method enhanced laser interferometry: ultrahigh precision localizing of traffic vibration using urban fiber link

Using fiber network as a huge sensing system will enrich monitoring methods of public infrastructures and geological disasters. With traditional cross-correlation method, laser interferometer has been used to detect and localize the vibration event. However, the random error induced by cross-correlation method limits the localization accuracy, and makes it not suitable for ultrahigh precision localizing applications. We propose a novel time shifting deviation (TSDEV) method, which has advantages over cross-correlation method in practicability and localization accuracy. Three experiments are carried out to demonstrate the novelty of the TSDEV method. In lab test, vibration localization accuracy of ~2.5 m is realized. In field tests, TSDEV method enhanced interferometry is applied to monitor the urban fiber link. Traffic vibration events on the campus road and Beijing ring road have been precisely localized and analyzed, respectively. The proposed technique will extend the function of existing urban fiber network, and better serve the future smart city.

preprint2021arXiv

Comparison between Time Shifting Deviation and Cross-correlation Methods

Time delay estimation (TDE) is an important step to identify and locate vibration source. The TDE result can be obtained by cross-correlation method through seeking the maximum correlation peak of two signals. However, the cross-correlation method will induce random error when dealing with the nonstationary signal. We propose a novel time shifting deviation (TSDEV) method to solve this problem, which has been proved to achieve ultrahigh precision localization result in the fiber vibration monitoring system. This paper compares TSDEV method with cross-correlation in detail by simulating TDE process in different conditions, such as signals with arbitrary intercepted length, nonstationary drift and correlated noise. Besides, experimental demonstration has been carried out on 60 km fiber to localize a wide band vibration signal. The typical localization error is 2 m with standard deviation of 21.4 m using TSDEV method. It stands in clear contrast to the result of cross-correlation method, whose localization error is 70 m and the standard deviation is 208.4 m. Compared with cross-correlation method, TSDEV has the same resistance to white noise, but has fewer boundary conditions and better suppression on linear drift or common noise, which leads to more precise TDE results.

preprint2020arXiv

A Single Frame and Multi-Frame Joint Network for 360-degree Panorama Video Super-Resolution

Spherical videos, also known as \ang{360} (panorama) videos, can be viewed with various virtual reality devices such as computers and head-mounted displays. They attract large amount of interest since awesome immersion can be experienced when watching spherical videos. However, capturing, storing and transmitting high-resolution spherical videos are extremely expensive. In this paper, we propose a novel single frame and multi-frame joint network (SMFN) for recovering high-resolution spherical videos from low-resolution inputs. To take advantage of pixel-level inter-frame consistency, deformable convolutions are used to eliminate the motion difference between feature maps of the target frame and its neighboring frames. A mixed attention mechanism is devised to enhance the feature representation capability. The dual learning strategy is exerted to constrain the space of solution so that a better solution can be found. A novel loss function based on the weighted mean square error is proposed to emphasize on the super-resolution of the equatorial regions. This is the first attempt to settle the super-resolution of spherical videos, and we collect a novel dataset from the Internet, MiG Panorama Video, which includes 204 videos. Experimental results on 4 representative video clips demonstrate the efficacy of the proposed method. The dataset and code are available at https://github.com/lovepiano/SMFN_For_360VSR.

preprint2020arXiv

When Relation Networks meet GANs: Relation GANs with Triplet Loss

Though recent research has achieved remarkable progress in generating realistic images with generative adversarial networks (GANs), the lack of training stability is still a lingering concern of most GANs, especially on high-resolution inputs and complex datasets. Since the randomly generated distribution can hardly overlap with the real distribution, training GANs often suffers from the gradient vanishing problem. A number of approaches have been proposed to address this issue by constraining the discriminator's capabilities using empirical techniques, like weight clipping, gradient penalty, spectral normalization etc. In this paper, we provide a more principled approach as an alternative solution to this issue. Instead of training the discriminator to distinguish real and fake input samples, we investigate the relationship between paired samples by training the discriminator to separate paired samples from the same distribution and those from different distributions. To this end, we explore a relation network architecture for the discriminator and design a triplet loss which performs better generalization and stability. Extensive experiments on benchmark datasets show that the proposed relation discriminator and new loss can provide significant improvement on variable vision tasks including unconditional and conditional image generation and image translation.

preprint2019arXiv

Theoretical Investigation of the Black-body Zeeman Shift for Microwave Atomic Clocks

With the development of microwave atomic clocks, the Zeeman shifts for the spectral lines of black-body radiation need to be investigated carefully. In this Letter, the frequency shifts of hyperfine splittings of atomic ground states due to the magnetic field of black-body radiation are reported. The relative frequency shifts of different alkali atoms and alkali-like ions, which could be candidates of microwave atomic clocks, were calculated. The results vary from $-0.977\times10^{-17}[T(K)/300]^{2}$ to $-1.947\times10^{-17}[T(K)/300]^{2}$ for different atoms considered. These results are consistent with previous work but with greater precision, detailed derivations, and a clear physical picture.

preprint2017arXiv

Interference Minimization in 5G Heterogeneous Networks

In this paper, we focus on one of the representative 5G network scenarios, namely multi-tier heterogeneous cellular networks. User association is investigated in order to reduce the down-link co-channel interference. Firstly, in order to analyze the multi-tier heterogeneous cellular networks where the base stations in different tiers usually adopt different transmission powers, we propose a Transmission Power Normalization Model (TPNM), which is able to convert a multi-tier cellular network into a single-tier network, such that all base stations have the same normalized transmission power. Then using TPNM, the signal and interference received at any point in the complex multi-tier environment can be analyzed by considering the same point in the equivalent single-tier cellular network model, thus significantly simplifying the analysis. On this basis, we propose a new user association scheme in heterogeneous cellular networks, where the base station that leads to the smallest interference to other co-channel mobile stations is chosen from a set of candidate base stations that satisfy the quality-of-service (QoS) constraint for an intended mobile station. Numerical results show that the proposed user association scheme is able to significantly reduce the down-link interference compared with existing schemes while maintaining a reasonably good QoS.

preprint2016arXiv

End-to-End Image Super-Resolution via Deep and Shallow Convolutional Networks

One impressive advantage of convolutional neural networks (CNNs) is their ability to automatically learn feature representation from raw pixels, eliminating the need for hand-designed procedures. However, recent methods for single image super-resolution (SR) fail to maintain this advantage. They utilize CNNs in two decoupled steps, i.e., first upsampling the low resolution (LR) image to the high resolution (HR) size with hand-designed techniques (e.g., bicubic interpolation), and then applying CNNs on the upsampled LR image to reconstruct HR results. In this paper, we seek an alternative and propose a new image SR method, which jointly learns the feature extraction, upsampling and HR reconstruction modules, yielding a completely end-to-end trainable deep CNN. As opposed to existing approaches, the proposed method conducts upsampling in the latent feature space with filters that are optimized for the task of image SR. In addition, the HR reconstruction is performed in a multi-scale manner to simultaneously incorporate both short- and long-range contextual information, ensuring more accurate restoration of HR images. To facilitate network training, a new training approach is designed, which jointly trains the proposed deep network with a relatively shallow network, leading to faster convergence and more superior performance. The proposed method is extensively evaluated on widely adopted data sets and improves the performance of state-of-the-art methods with a considerable margin. Moreover, in-depth ablation studies are conducted to verify the contribution of different network designs to image SR, providing additional insights for future research.

preprint2016arXiv

Energy Saving of Base Stations Sleep Scheduling for Multi-Hop Vehicular Networks

This paper investigates the energy saving of base station (BS) deployed in a 1-D multi-hop vehicular network with sleep scheduling strategy. We consider cooperative BS scheduling strategy where BSs can switch between sleep and active modes to reduce the average energy consumption utilizing the information of vehicular speeds and locations. Assuming a Poisson distribution of vehicles, we derive an appropriate probability distribution function of distance between two adjacent cluster heads, where a cluster is a maximal set of vehicles in which every two adjacent vehicles can communicate directly when their Euclidean distance is less than or equal to a threshold, known as the communication range of vehicles. Furthermore, the expected value of the sojourn time in the sleep mode and energy saving are obtained. The numerical results show that the sleep scheduling strategy significantly reduces the energy consumption of the base stations.

preprint2016arXiv

On Energy Efficiency of the Nearest-Neighbor Cooperative Communication in Heterogeneous Networks

In this paper, we consider a two-dimensional heterogeneous cellular network scenario consisting of one base station (BS) and some mobile stations (MSs) whose locations follow a Poisson point process (PPP). The MSs are equipped with multiple radio access interfaces including a cellular access interface and at least one short-range communication interface. We propose a nearest-neighbor cooperation communication (NNCC) scheme by exploiting the short-range communication between a MS and its nearest neighbor to collaborate on their uplink transmissions. In the proposed cooperation scheme, a MS and its nearest neighbor first exchange data by the short-range communication. Upon successful decoding of the data from each other, they proceed to send their own data, as well as the data received from the other to the BS respectively in orthogonal time slots. The energy efficiency analysis for the proposed scheme is presented based on the characteristics of the PPP and the Rayleigh fading channel. Numerical results show that the NNCC scheme significantly improves the energy efficiency compared to the conventional non-cooperative uplink transmissions.

preprint2016arXiv

Visual Tracking via Shallow and Deep Collaborative Model

In this paper, we propose a robust tracking method based on the collaboration of a generative model and a discriminative classifier, where features are learned by shallow and deep architectures, respectively. For the generative model, we introduce a block-based incremental learning scheme, in which a local binary mask is constructed to deal with occlusion. The similarity degrees between the local patches and their corresponding subspace are integrated to formulate a more accurate global appearance model. In the discriminative model, we exploit the advances of deep learning architectures to learn generic features which are robust to both background clutters and foreground appearance variations. To this end, we first construct a discriminative training set from auxiliary video sequences. A deep classification neural network is then trained offline on this training set. Through online fine-tuning, both the hierarchical feature extractor and the classifier can be adapted to the appearance change of the target for effective online tracking. The collaboration of these two models achieves a good balance in handling occlusion and target appearance change, which are two contradictory challenging factors in visual tracking. Both quantitative and qualitative evaluations against several state-of-the-art algorithms on challenging image sequences demonstrate the accuracy and the robustness of the proposed tracker.

preprint2015arXiv

Portable Microwave Frequency Dissemination in Free Space and Implications on Ground-Satellite Synchronization

Frequency dissemination and synchronization in free space plays an important role in global navigation satellite system, radio astronomy and synthetic aperture radar. In this paper, we demonstrate a portable radio frequency dissemination scheme via free space using microwave antennas. The setup has a good environment adaptability and high dissemination stability. The frequency signal is disseminated at different distances ranging from 10 to 640 m with a fixed 10 Hz locking bandwidth, and the scaling law of dissemination stability on distance and averaging time is discussed. The preliminary extrapolation shows that the dissemination stability may reach $1\times10^{-12}/s$ in ground-to-satellite synchronization, which far exceeds all present methods, and is worthy for further study.

preprint2014arXiv

Fiber-based ultra-stable frequency synchronization using client-side, 1f-2f active compensation method

We demonstrate a frequency synchronization scheme with the phase noise compensation function placed at the client site. One transmitting module hence can be linked with multiple client sites. As a performance test, using two separate 50 km fiber spools, we recover the 100 MHz disseminated reference frequencies at two remote sites, separately. Relative frequency stabilities between two recovered frequency signals of 2.8E-14/s and 2.5E-17/day are obtained. This scalable scheme is suitable for the applications of frequency dissemination with a star-topology, such as SKA and DSN.

preprint2011arXiv

An Algorithm for Repairing Low-Quality Video Enhancement Techniques Based on Trained Filter

Multifarious image enhancement algorithms have been used in different applications. Still, some algorithms or modules are imperfect for practical use. When the image enhancement modules have been fixed or combined by a series of algorithms, we need to repair them as a whole part without changing the inside. This report aims to find an algorithm based on trained filters to repair low-quality image enhancement modules. A brief review on basic image enhancement techniques and pixel classification methods will be presented, and the procedure of trained filters will be described step by step. The experiments and result comparisons for this algorithm will be described in detail.

Lijun Wang

What is connected

Connect this record

See the researcher in context

Building this map preview

16 published item(s)

AR-MOT: Autoregressive Multi-object Tracking

Improve Power of Knockoffs with Annotation Information of Covariates

Sparse Full Configuration Interaction

Time shifting deviation method enhanced laser interferometry: ultrahigh precision localizing of traffic vibration using urban fiber link

Comparison between Time Shifting Deviation and Cross-correlation Methods

A Single Frame and Multi-Frame Joint Network for 360-degree Panorama Video Super-Resolution

When Relation Networks meet GANs: Relation GANs with Triplet Loss

Theoretical Investigation of the Black-body Zeeman Shift for Microwave Atomic Clocks

Interference Minimization in 5G Heterogeneous Networks

End-to-End Image Super-Resolution via Deep and Shallow Convolutional Networks

Energy Saving of Base Stations Sleep Scheduling for Multi-Hop Vehicular Networks

On Energy Efficiency of the Nearest-Neighbor Cooperative Communication in Heterogeneous Networks

Visual Tracking via Shallow and Deep Collaborative Model

Portable Microwave Frequency Dissemination in Free Space and Implications on Ground-Satellite Synchronization

Fiber-based ultra-stable frequency synchronization using client-side, 1f-2f active compensation method

An Algorithm for Repairing Low-Quality Video Enhancement Techniques Based on Trained Filter