Source author record

Jian Shi

Jian Shi appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Computer Vision math.FA cond-mat.mtrl-sci Artificial Intelligence eess.SY Machine Learning math.OA physics.acc-ph cond-mat.mes-hall cond-mat.str-el Cryptography and Security cs.CY Emerging Technologies math.CA Neurons and Cognition Systems and Control

Catalog footprint

What is connected

19works

16topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

Exploring Reliable Spatiotemporal Dependencies for Efficient Visual Tracking

Recent advances in transformer-based lightweight object tracking have established new standards across benchmarks, leveraging the global receptive field and powerful feature extraction capabilities of attention mechanisms. Despite these achievements, existing methods universally employ sparse sampling during training--utilizing only one template and one search image per sequence--which fails to comprehensively explore spatiotemporal information in videos. This limitation constrains performance and cause the gap between lightweight and high-performance trackers. To bridge this divide while maintaining real-time efficiency, we propose STDTrack, a framework that pioneers the integration of reliable spatiotemporal dependencies into lightweight trackers. Our approach implements dense video sampling to maximize spatiotemporal information utilization. We introduce a temporally propagating spatiotemporal token to guide per-frame feature extraction. To ensure comprehensive target state representation, we disign the Multi-frame Information Fusion Module (MFIFM), which augments current dependencies using historical context. The MFIFM operates on features stored in our constructed Spatiotemporal Token Maintainer (STM), where a quality-based update mechanism ensures information reliability. Considering the scale variation among tracking targets, we develop a multi-scale prediction head to dynamically adapt to objects of different sizes. Extensive experiments demonstrate state-of-the-art results across six benchmarks. Notably, on GOT-10k, STDTrack rivals certain high-performance non-real-time trackers (e.g., MixFormer) while operating at 192 FPS(GPU) and 41 FPS(CPU).

preprint2026arXiv

Will the Carbon Border Adjustment Mechanism Impact European Electricity Prices? A GNN-Based Network Analysis

The European Union's Carbon Border Adjustment Mechanism (CBAM) creates a complex challenge for the interconnected European electricity market. Traditional static analyses often miss the cross-border spillover effects that are vital for understanding this policy. This paper addresses this gap by developing a spatio-temporal Graph Neural Network (GNN) framework. It quantifies how CBAM affects electricity prices and carbon intensity (CI) at the same time. We modeled a subgraph of eight European countries. Our results suggest that CBAM is not just a uniform tax. Instead, it acts as a tool that transforms the market and creates structural differences. In our simulated scenarios, we observe that low-carbon countries like France and Switzerland can gain a competitive advantage. This suggests a potential decrease in their domestic electricity prices. Meanwhile, high-carbon countries like Poland face a double burden of rising costs. We identify the primary driver as a fundamental shift in the market's merit order.

preprint2022arXiv

DeepCatra: Learning Flow- and Graph-based Behaviors for Android Malware Detection

As Android malware is growing and evolving, deep learning has been introduced into malware detection, resulting in great effectiveness. Recent work is considering hybrid models and multi-view learning. However, they use only simple features, limiting the accuracy of these approaches in practice. In this paper, we propose DeepCatra, a multi-view learning approach for Android malware detection, whose model consists of a bidirectional LSTM (BiLSTM) and a graph neural network (GNN) as subnets. The two subnets rely on features extracted from statically computed call traces leading to critical APIs derived from public vulnerabilities. For each Android app, DeepCatra first constructs its call graph and computes call traces reaching critical APIs. Then, temporal opcode features used by the BiLSTM subnet are extracted from the call traces, while flow graph features used by the GNN subnet are constructed from all the call traces and inter-component communications. We evaluate the effectiveness of DeepCatra by comparing it with several state-of-the-art detection approaches. Experimental results on over 18,000 real-world apps and prevalent malware show that DeepCatra achieves considerable improvement, e.g., 2.7% to 14.6% on F1-measure, which demonstrates the feasibility of DeepCatra in practice.

preprint2022arXiv

Filtering electrons by mode coupling in finite semiconductor superlattices

Electron transmission through semiconductor superlattices is studied with transfer matrix method and resonance theory. The formation of electron band-pass transmission is ascribed to the coupling of different modes in those semiconductor superlattices with the symmetric unit cell. Upon Fabry-Pérot resonance condition, Bloch modes and two other resonant modes are identified to be related to the nature of the superlattice and its unit cell, respectively. The bands related to the unit cell and the superlattice overlap spontaneously in the tunneling region due to the shared wells, and the coupling of perfectly resonances results in the band-pass tunneling. Our findings provide a promising way to study electronic systems with more complicated superlattices or even optical systems with photonic crystals.

preprint2022arXiv

Semantic decomposition Network with Contrastive and Structural Constraints for Dental Plaque Segmentation

Segmenting dental plaque from images of medical reagent staining provides valuable information for diagnosis and the determination of follow-up treatment plan. However, accurate dental plaque segmentation is a challenging task that requires identifying teeth and dental plaque subjected to semantic-blur regions (i.e., confused boundaries in border regions between teeth and dental plaque) and complex variations of instance shapes, which are not fully addressed by existing methods. Therefore, we propose a semantic decomposition network (SDNet) that introduces two single-task branches to separately address the segmentation of teeth and dental plaque and designs additional constraints to learn category-specific features for each branch, thus facilitating the semantic decomposition and improving the performance of dental plaque segmentation. Specifically, SDNet learns two separate segmentation branches for teeth and dental plaque in a divide-and-conquer manner to decouple the entangled relation between them. Each branch that specifies a category tends to yield accurate segmentation. To help these two branches better focus on category-specific features, two constraint modules are further proposed: 1) contrastive constraint module (CCM) to learn discriminative feature representations by maximizing the distance between different category representations, so as to reduce the negative impact of semantic-blur regions on feature extraction; 2) structural constraint module (SCM) to provide complete structural information for dental plaque of various shapes by the supervision of an boundary-aware geometric constraint. Besides, we construct a large-scale open-source Stained Dental Plaque Segmentation dataset (SDPSeg), which provides high-quality annotations for teeth and dental plaque. Experimental results on SDPSeg datasets show SDNet achieves state-of-the-art performance.

preprint2022arXiv

TorMentor: Deterministic dynamic-path, data augmentations with fractals

We propose the use of fractals as a means of efficient data augmentation. Specifically, we employ plasma fractals for adapting global image augmentation transformations into continuous local transforms. We formulate the diamond square algorithm as a cascade of simple convolution operations allowing efficient computation of plasma fractals on the GPU. We present the TorMentor image augmentation framework that is totally modular and deterministic across images and point-clouds. All image augmentation operations can be combined through pipelining and random branching to form flow networks of arbitrary width and depth. We demonstrate the efficiency of the proposed approach with experiments on document image segmentation (binarization) with the DIBCO datasets. The proposed approach demonstrates superior performance to traditional image augmentation techniques. Finally, we use extended synthetic binary text images in a self-supervision regiment and outperform the same model when trained with limited data and simple extensions.

preprint2022arXiv

Upsampling Autoencoder for Self-Supervised Point Cloud Learning

In computer-aided design (CAD) community, the point cloud data is pervasively applied in reverse engineering, where the point cloud analysis plays an important role. While a large number of supervised learning methods have been proposed to handle the unordered point clouds and demonstrated their remarkable success, their performance and applicability are limited to the costly data annotation. In this work, we propose a novel self-supervised pretraining model for point cloud learning without human annotations, which relies solely on upsampling operation to perform feature learning of point cloud in an effective manner. The key premise of our approach is that upsampling operation encourages the network to capture both high-level semantic information and low-level geometric information of the point cloud, thus the downstream tasks such as classification and segmentation will benefit from the pre-trained model. Specifically, our method first conducts the random subsampling from the input point cloud at a low proportion e.g., 12.5%. Then, we feed them into an encoder-decoder architecture, where an encoder is devised to operate only on the subsampled points, along with a upsampling decoder is adopted to reconstruct the original point cloud based on the learned features. Finally, we design a novel joint loss function which enforces the upsampled points to be similar with the original point cloud and uniformly distributed on the underlying shape surface. By adopting the pre-trained encoder weights as initialisation of models for downstream tasks, we find that our UAE outperforms previous state-of-the-art methods in shape classification, part segmentation and point cloud upsampling tasks. Code will be made publicly available upon acceptance.

preprint2020arXiv

Multi-year Long-term Load Forecast for Area Distribution Feeders based on Selective Sequence Learning

Long-term load forecast (LTLF) for area distribution feeders is one of the most critical tasks frequently performed in electric distribution utility companies. For a specific planning area, cost-effective system upgrades can only be planned out based on accurate feeder LTLF results. In our previous research, we established a unique sequence prediction method which has the tremendous advantage of combining area top-down, feeder bottom-up and multi-year historical data all together for forecast and achieved a superior performance over various traditional methods by real-world tests. However, the previous method only focused on the forecast of the next one-year. In our current work, we significantly improved this method: the forecast can now be extended to a multi-year forecast window in the future; unsupervised learning techniques are used to group feeders by their load composition features to improve accuracy; we also propose a novel selective sequence learning mechanism which uses Gated Recurrent Unit network to not only learn how to predict sequence values but also learn to select the best-performing sequential configuration for each individual feeder. The proposed method was tested on an actual urban distribution system in West Canada. It was compared with traditional methods and our previous sequence prediction method. It demonstrates the best forecasting performance as well as the possibility of using sequence prediction models for multi-year component-level load forecast.

preprint2016arXiv

Injection method of barrier bucket supported by off-aligned electron cooling for CRing of HIAF

A new accelerator complex, HIAF (the High Intensity Heavy Ion Accelerator Facility), has been approved in China. It is designed to provide intense primary and radioactive ion beams for research in high energy density physics, nuclear physics, atomic physics as well as other applications. In order to achieve a high intensity of up to 5e11 ppp 238U34+, the Compression Ring (CRing) needs to stack more than 5 bunches transferred from the Booster Ring (BRing). However, the normal bucket to bucket injection scheme can only achieve an intensity gain of 2, so an injection method, fixed barrier bucket (BB) supported by electron cooling, is proposed. To suppress the severe space charge effect during the stacking process, off-alignment is adopted in the cooler to control the transverse emittance. In this paper, simulation and optimization with the BETACOOL program are presented.

preprint2016arXiv

Learning Non-Lambertian Object Intrinsics across ShapeNet Categories

We consider the non-Lambertian object intrinsic problem of recovering diffuse albedo, shading, and specular highlights from a single image of an object. We build a large-scale object intrinsics database based on existing 3D models in the ShapeNet database. Rendered with realistic environment maps, millions of synthetic images of objects and their corresponding albedo, shading, and specular ground-truth images are used to train an encoder-decoder CNN. Once trained, the network can decompose an image into the product of albedo and shading components, along with an additive specular component. Our CNN delivers accurate and sharp results in this classical inverse problem of computer vision, sharp details attributed to skip layer connections at corresponding resolutions from the encoder to the decoder. Benchmarked on our ShapeNet and MIT intrinsics datasets, our model consistently outperforms the state-of-the-art by a large margin. We train and test our CNN on different object categories. Perhaps surprising especially from the CNN classification perspective, our intrinsics CNN generalizes very well across categories. Our analysis shows that feature learning at the encoder stage is more crucial for developing a universal representation across categories. We apply our synthetic data trained model to images and videos downloaded from the internet, and observe robust and realistic intrinsics results. Quality non-Lambertian intrinsics could open up many interesting applications such as image-based albedo and specular editing.

preprint2016arXiv

Spinodal Decomposition-Enabled Halide Perovskite Double Heterostructure with Reduced Fröhlich Electron-Phonon Coupling

Epitaxial III-V semiconductor heterostructures are key components in modern microelectronics, electro-optics and optoelectronics. With superior semiconducting properties, halide perovskite materials are rising as promising candidates for coherent heterostructure devices. In this report, spinodal decomposition is proposed and experimentally implemented to produce epitaxial double heterostructures in halide perovskite system. Pristine epitaxial mixed halide perovskites rods and films were synthesized via Van der Waals epitaxy by chemical vapor deposition method. At room temperature, photon was applied as a knob to regulate the kinetics of spinodal decomposition and classic coarsening. By this approach, halide perovskite double heterostructures were created carrying epitaxial interfaces and outstanding optical properties. Reduced Fröhlich electron-phonon coupling was discovered in coherent halide double heterostructure, which is hypothetically attributed to the classic phonon confinement effect widely existing in III-V double heterostructures. The ability to develop coherent double heterostructures in halide perovskites paves an avenue to exploring halide perovskite-based quantum wells and superlattices for high-performance and low-cost optoelectronics, electro-optics and microelectronics.

preprint2015arXiv

Closed orbit correction of HIMM synchrotron

The correction of closed orbit has great influence on the operation of synchrotron. The design of correction system is one significant component of lattice design. It is suggested to set BPMs at the peaks of betatron oscillation. The correctors need to be located at the positions where \b{eta} function is large or the sources of large errors. In the simulation of the closed orbit correction of HIMM synchrotron, one important reason affecting the result of horizontal correction is the longitudinal alignment error of dipole magnet. It is advisable to decrease this kind of alignment error while the deflection angle of dipole magnet is large.

preprint2015arXiv

Two-Dimensional Van der Waals Epitaxy Kinetics in a Three-Dimensional Perovskite Halide

The exploration of emerging materials physics and prospective applications of two-dimensional materials greatly relies on the growth control of their thickness, phases, morphologies and film-substrate interactions. Though substantial progresses have been made for the development of two-dimensional films from conventional layered bulky materials, particular challenges remain on obtaining ultrathin, single crystalline, dislocation-free films from intrinsically non-Van der Waals-type three-dimensional materials. In this report, with the successful demonstration of single crystalline ultrathin large scale perovskite halide material, we reveal and identify the favorable role of weak Van der Waals film-substrate interaction on the nucleation and growth of the two-dimensional morphology out of non-layered materials compared to conventional epitaxy. We also show how the bonding nature of the three-dimensional material itself affects the kinetic energy landscape of ultrathin films growth. By studying the formation of fractal perovskites assisted with Monte Carlo simulations, we demonstrate that the competition between the Van der Waals diffusion and surface free energy of the perovskite leads to film thickening, suggesting extra strategies such as surface passivation may be needed for the growth of monolayer and a few layers films.

preprint2014arXiv

Neuromimetic Circuits with Synaptic Devices based on Strongly Correlated Electron Systems

Strongly correlated electron systems such as the rare-earth nickelates (RNiO3, R = rare-earth element) can exhibit synapse-like continuous long term potentiation and depression when gated with ionic liquids; exploiting the extreme sensitivity of coupled charge, spin, orbital, and lattice degrees of freedom to stoichiometry. We present experimental real-time, device-level classical conditioning and unlearning using nickelate-based synaptic devices in an electronic circuit compatible with both excitatory and inhibitory neurons. We establish a physical model for the device behavior based on electric-field driven coupled ionic-electronic diffusion that can be utilized for design of more complex systems. We use the model to simulate a variety of associate and non-associative learning mechanisms, as well as a feedforward recurrent network for storing memory. Our circuit intuitively parallels biological neural architectures, and it can be readily generalized to other forms of cellular learning and extinction. The simulation of neural function with electronic device analogues may provide insight into biological processes such as decision making, learning and adaptation, while facilitating advanced parallel information processing in hardware.

preprint2013arXiv

A sufficient condition on operator order for strictly positive operators

Let $A_{1}$, $A_{2}$, $...$, $A_{k}$ be strictly positive operators on a Hilbert space. This note is to show a sufficient condition of $A_{k}\geq A_{k-1}\geq\geq A_{3}\geq A_{2}\geq A_{1}$, which extends the related result before.

preprint2013arXiv

Electrostatic gating of metallic and insulating phases in SmNiO3 ultrathin films

The correlated electron system SmNiO3 exhibits a metal-insulator phase transition at 130 °C. Using an ionic liquid as an electric double layer (EDL) gate on three-terminal ultrathin SmNiO3 devices, we investigate gate control of the channel resistance and transition temperature. Resistance reduction is observed across both insulating and metallic phases with ~25% modulation at room temperature. We show that resistance modulation is predominantly due to electrostatic charge accumulation and not electrochemical doping by control experiments in inert and air en-vironments. We model the resistance behavior and estimate the accumulated sheet density (~1-2 x 10^14 cm^-2) and EDL capacitance (~12 μF/cm^2).

preprint2011arXiv

Characterizations of operator order for k strictly positive operators

Let $A_{i}\ (i=1, 2, ..., k)$ be bounded linear operators on a Hilbert space. This paper aims to show characterizations of operator order $A_{k}\geq A_{k-1}\geq...\geq A_{2}\geq A_{1}>0$ in terms of operator inequalities. Afterwards, an application of the characterizations is given to operator equalities due to Douglas's majorization and factorization theorem.

preprint2011arXiv

Further development of positive semidefinite solutions of the operator equation $\sum_{j=1}^{n}A^{n-j}XA^{j-1}=B$

In \cite{Positive semidefinite solutions}, T. Furuta discusses the existence of positive semidefinite solutions of the operator equation $\sum_{j=1}^{n}A^{n-j}XA^{j-1}=B$. In this paper, we shall apply Grand Furuta inequality to study the operator equation. A generalized special type of $B$ is obtained due to \cite{Positive semidefinite solutions}.

preprint2011arXiv

Operator inequalities dealing with operator equations

In this paper, we study the existence of solutions of some kinds of operator equations via operator inequalities. First, we investigate characterizations of operator order $A\geqslant B >0$ and chaotic operator order log $A \geqslant$ log $B$ for positive definite operators $A$, $B$ in terms of operator equations, and generalize the results in \cite{CSLin}. Then, we introduce applications of complete form of Furuta inequality in operator equations. Some kinds of operator equations are researched and related characterizations of solutions are proved.

Jian Shi

What is connected

Connect this record

See the researcher in context

Building this map preview

19 published item(s)

Exploring Reliable Spatiotemporal Dependencies for Efficient Visual Tracking

Will the Carbon Border Adjustment Mechanism Impact European Electricity Prices? A GNN-Based Network Analysis

DeepCatra: Learning Flow- and Graph-based Behaviors for Android Malware Detection

Filtering electrons by mode coupling in finite semiconductor superlattices

Semantic decomposition Network with Contrastive and Structural Constraints for Dental Plaque Segmentation

TorMentor: Deterministic dynamic-path, data augmentations with fractals

Upsampling Autoencoder for Self-Supervised Point Cloud Learning

Multi-year Long-term Load Forecast for Area Distribution Feeders based on Selective Sequence Learning

Injection method of barrier bucket supported by off-aligned electron cooling for CRing of HIAF

Learning Non-Lambertian Object Intrinsics across ShapeNet Categories

Spinodal Decomposition-Enabled Halide Perovskite Double Heterostructure with Reduced Fröhlich Electron-Phonon Coupling

Closed orbit correction of HIMM synchrotron

Two-Dimensional Van der Waals Epitaxy Kinetics in a Three-Dimensional Perovskite Halide

Neuromimetic Circuits with Synaptic Devices based on Strongly Correlated Electron Systems

A sufficient condition on operator order for strictly positive operators

Electrostatic gating of metallic and insulating phases in SmNiO3 ultrathin films

Characterizations of operator order for k strictly positive operators

Further development of positive semidefinite solutions of the operator equation $\sum_{j=1}^{n}A^{n-j}XA^{j-1}=B$

Operator inequalities dealing with operator equations