Source author record

Song Chen

Song Chen appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher (unclaimed source record)

astro-ph.CO, Computer Vision, Hardware Architecture, astro-ph.GA, cond-mat.soft, gr-qc, Machine Learning, math.OC

Catalog footprint

What is connected

12 works

8 topics

4 close collaborators

Preprint, 2026, arXiv

DORA: Dynamic Online Reinforcement Agent for Token Merging in Vision Transformers

Vision Transformers (ViTs) incur significant computational overhead due to the quadratic complexity of self-attention relative to the token sequence length. While existing token reduction methods mitigate this issue, they predominantly rely on fixed heuristic metrics, predefined ratios, or static offline masks, which lack the adaptability to capture input-dependent redundancy during inference. In this paper, we propose DORA (Dynamic Online Reinforcement Agent), the first reinforcement learning (RL)-driven online inference framework for dynamic token merging in ViTs. We formulate the merging process as a sequential Markov Decision Process (MDP), where a lightweight RL agent determines the merging strategy for each Transformer block based on the current feature state and layer-specific context. To balance computational efficiency and feature fidelity, the agent is optimized via a dense reward function incorporating a non-linear distillation-based penalty. We implement an asymmetric Actor-Critic architecture that utilizes a high-capacity Critic for stable offline training while retaining a minimal Actor head for low-computation online inference. Evaluations across multiple ViT scales (Tiny to Large) demonstrate that DORA improves the accuracy-efficiency Pareto front compared to current baselines. Under strict negligible accuracy-drop constraints (<= 0.05%), DORA achieves up to a 12.66% token merging rate, and delivers up to a 569.7% relative improvement over the most efficient baseline. On ImageNet-1K, under aligned accuracy constraints, DORA achieves up to a 76% relative improvement in computational savings compared to state-of-the-art methods. Furthermore, on out-of-distribution (OOD) benchmarks such as ImageNet-A and ImageNet-C, DORA attains a relative efficiency advantage of over 430%.
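To make the token-merging step concrete, here is a minimal sketch in plain Python of similarity-based merging with a fixed merge count. It is not DORA's method: the abstract describes an RL agent that chooses the merging strategy per Transformer block, whereas this sketch simply takes the number of merges as an argument. The function names `cosine_similarity` and `merge_most_similar_tokens` are hypothetical and do not come from the paper.

```python
import math


def cosine_similarity(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def merge_most_similar_tokens(tokens, num_merges):
    """Greedily average the most similar pair of tokens, num_merges times.

    Assumption: a fixed num_merges stands in for the per-block decision
    that a learned policy would make; this is an illustrative heuristic only.
    """
    tokens = [[float(x) for x in t] for t in tokens]  # copy, leave input untouched
    for _ in range(num_merges):
        if len(tokens) < 2:
            break  # nothing left to merge
        best = None  # (similarity, i, j) of the best pair found so far
        for i in range(len(tokens)):
            for j in range(i + 1, len(tokens)):
                s = cosine_similarity(tokens[i], tokens[j])
                if best is None or s > best[0]:
                    best = (s, i, j)
        _, i, j = best
        merged = [(a + b) / 2.0 for a, b in zip(tokens[i], tokens[j])]
        # Drop the two source tokens and append their average.
        tokens = [t for k, t in enumerate(tokens) if k not in (i, j)] + [merged]
    return tokens


if __name__ == "__main__":
    example = [[1, 0], [1, 0.01], [0, 1], [-1, 0]]
    reduced = merge_most_similar_tokens(example, 1)
    print(len(reduced))  # 3 tokens remain after one merge
```

Each merge shortens the sequence by one token, which is where the savings in self-attention cost come from. The pairwise search here is quadratic in the number of tokens and is written for readability rather than speed.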

Preprint, 2023, arXiv

Neural Observer with Lyapunov Stability Guarantee for Uncertain Nonlinear Systems

In this paper, we propose a novel nonlinear observer based on neural networks, called the neural observer, for observation tasks of linear time-invariant (LTI) systems and uncertain nonlinear systems. In particular, the neural observer designed for uncertain systems is inspired by active disturbance rejection control, which can measure the uncertainty in real time. Stability analyses (e.g., exponential convergence rate) of LTI and uncertain nonlinear systems with neural observers are presented, and it is shown that the observation problems can be solved using only linear matrix inequalities (LMIs). It is also shown that observability and controllability of the system matrices are required to demonstrate the existence of solutions of the LMIs. Finally, the effectiveness of neural observers is verified on three simulation cases: the X-29A aircraft model, the nonlinear pendulum, and the four-wheel steering vehicle.
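As an illustration of how an observer design can reduce to an LMI, the following is the textbook derivation for a linear Luenberger observer with a prescribed exponential rate. It is a sketch under stated assumptions, not the paper's neural observer: the gain $L$, the Lyapunov matrix $P$, the rate $\alpha$, and the substitution $Y = PL$ are standard notation introduced here and are not taken from the abstract.

```latex
% Plant and observer (assumed standard LTI form)
\dot{x} = A x + B u, \qquad y = C x, \qquad
\dot{\hat{x}} = A \hat{x} + B u + L\,(y - C \hat{x})

% Estimation error e = x - \hat{x}
\dot{e} = (A - L C)\, e

% Lyapunov candidate V(e) = e^{\top} P e with P = P^{\top} \succ 0
\dot{V} = e^{\top}\!\left[ (A - L C)^{\top} P + P (A - L C) \right] e

% Requiring \dot{V} \le -2\alpha V and substituting Y = P L gives an LMI in (P, Y)
A^{\top} P + P A - C^{\top} Y^{\top} - Y C + 2 \alpha P \preceq 0, \qquad P \succ 0

% Any feasible pair yields the gain and an exponential error bound
L = P^{-1} Y, \qquad
\| e(t) \| \le \sqrt{\frac{\lambda_{\max}(P)}{\lambda_{\min}(P)}}\; e^{-\alpha t}\, \| e(0) \|
```

The product $PL$ is bilinear in the unknowns, so the change of variables $Y = PL$ is what makes the condition linear and solvable with a standard semidefinite programming solver.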

Preprint, 2022, arXiv

Semantic decoupled representation learning for remote sensing image change detection

Contemporary transfer learning-based methods for alleviating data insufficiency in change detection (CD) are mainly based on ImageNet pre-training. Self-supervised learning (SSL) has recently been introduced to remote sensing (RS) for learning in-domain representations. Here, we propose a semantic decoupled representation learning method for RS image CD. Typically, the object of interest (e.g., a building) is relatively small compared to the vast background. Unlike existing methods that express an image as a single representation vector, which may be dominated by irrelevant land covers, we disentangle the representations of different semantic regions by leveraging the semantic mask. We additionally force the model to distinguish different semantic representations, which benefits the recognition of objects of interest in the downstream CD task. We construct a dataset of bitemporal images with semantic masks in an effortless manner for pre-training. Experiments on two CD datasets show that our model outperforms ImageNet pre-training, in-domain supervised pre-training, and several recent SSL methods.

Preprint, 2022, arXiv

Semantic-aware Dense Representation Learning for Remote Sensing Image Change Detection

Supervised deep learning models depend on massive labeled data. Unfortunately, it is time-consuming and labor-intensive to collect and annotate bitemporal samples containing the desired changes. Transfer learning from pre-trained models is effective in alleviating label insufficiency in remote sensing (RS) change detection (CD). We explore the use of semantic information during pre-training. Unlike traditional supervised pre-training, which learns the mapping from image to label, we incorporate semantic supervision into the self-supervised learning (SSL) framework. Typically, multiple objects of interest (e.g., buildings) are distributed across various locations in an uncurated RS image. Instead of manipulating image-level representations via global pooling, we introduce point-level supervision on per-pixel embeddings to learn spatially sensitive features, thus benefiting downstream dense CD. To achieve this, we obtain multiple points via class-balanced sampling on the overlapping area between views using the semantic mask. We learn an embedding space where background and foreground points are pushed apart, and spatially aligned points across views are pulled together. Our intuition is that the resulting semantically discriminative representations, which are invariant to irrelevant changes (illumination and land covers that are not of interest), may help change recognition. We collect large-scale image-mask pairs freely available in the RS community for pre-training. Extensive experiments on three CD datasets verify the effectiveness of our method, which significantly outperforms ImageNet pre-training, in-domain supervision, and several SSL methods. Empirical results indicate that our pre-training improves the generalization and data efficiency of the CD model. Notably, using 20% of the training data, we achieve results competitive with the baseline (random initialization) trained on 100% of the data. Our code is available.

Preprint, 2018, arXiv

Cosmology with Phase 1 of the Square Kilometre Array; Red Book 2018: Technical specifications and performance forecasts

We present a detailed overview of the cosmological surveys that will be carried out with Phase 1 of the Square Kilometre Array (SKA1), and the science that they will enable. We highlight three main surveys: a medium-deep continuum weak lensing and low-redshift spectroscopic HI galaxy survey over 5,000 sqdeg; a wide and deep continuum galaxy and HI intensity mapping survey over 20,000 sqdeg from z = 0.35 - 3; and a deep, high-redshift HI intensity mapping survey over 100 sqdeg from z = 3 - 6. Taken together, these surveys will achieve an array of important scientific goals: measuring the equation of state of dark energy out to z ~ 3 with percent-level precision measurements of the cosmic expansion rate; constraining possible deviations from General Relativity on cosmological scales by measuring the growth rate of structure through multiple independent methods; mapping the structure of the Universe on the largest accessible scales, thus constraining fundamental properties such as isotropy, homogeneity, and non-Gaussianity; and measuring the HI density and bias out to z = 6. These surveys will also provide highly complementary clustering and weak lensing measurements that have independent systematic uncertainties to those of optical surveys like LSST and Euclid, leading to a multitude of synergies that can improve constraints significantly beyond what optical or radio surveys can achieve on their own. This document, the 2018 Red Book, provides reference technical specifications, cosmological parameter forecasts, and an overview of relevant systematic effects for the three key surveys, and will be regularly updated by the Cosmology Science Working Group in the run-up to the start of operations and the Key Science Programme of SKA1.

Preprint, 2016, arXiv

The angular two-point correlation of NVSS galaxies revisited

We measure the angular two-point correlation and angular power spectrum from the NRAO VLA Sky Survey (NVSS) of radio galaxies. They are found to be consistent with the best-fit cosmological model from the Planck analysis, and with the redshift distribution obtained from the Combined EIS-NVSS Survey Of Radio Sources (CENSORS). Our analysis is based on an optimal estimation of the two-point correlation function and makes use of a new mask that takes into account direction-dependent effects of the observations, sidelobe effects of bright sources, and the Galactic foreground. We also set a flux threshold and take the cosmic radio dipole into account. The latter turns out to be an essential step in the analysis. This improved cosmological analysis of the NVSS emphasizes the importance of a flux calibration that is robust and stable on large angular scales for future radio continuum surveys.

Preprint, 2015, arXiv

Fluctuations of differential number counts of radio continuum sources

We investigate the differential number counts of sources in radio continuum surveys, including all terms at linear order in cosmological perturbations. Our framework does not assume a specific gauge condition. This general approach allows us to recover gauge invariance explicitly. With the complete derivation of the covariant volume integral on the past light cone, we identify several contributions to the number counts. To clarify their underlying physics, we present each contribution in terms of scalar, vector and tensor modes. This theoretical framework promises to be widely applicable to continuum radio galaxy surveys for modeling the expected angular power spectrum and two-point correlation.

Preprint, 2015, arXiv

Testing foundations of modern cosmology with SKA all-sky surveys

Continuum and HI surveys with the Square Kilometre Array (SKA) will allow us to probe some of the most fundamental assumptions of modern cosmology, including the Cosmological Principle. SKA all-sky surveys will map an enormous slice of space-time and reveal cosmology at superhorizon scales and redshifts of order unity. We illustrate the potential of these surveys and discuss the prospects to measure the cosmic radio dipole at high fidelity. We outline several potentially transformational tests of cosmology to be carried out by means of SKA all-sky surveys.

Preprint, 2014, arXiv

Floorplanning and Topology Generation for Application-Specific Network-on-Chip

Network-on-chip (NoC) architectures have been proposed as a promising alternative to classical bus-based communication architectures. In this paper, we propose a two-phase framework to solve the application-specific NoC topology generation problem. In the floorplanning phase, we carry out partition-driven floorplanning. In the post-floorplanning phase, a heuristic method and a min-cost max-flow algorithm are used to insert switches and network interfaces. Finally, we allocate paths to minimize power consumption. The experimental results show that our algorithm is effective for power saving.

Preprint, 2014, arXiv

Network flow-based simultaneous retiming and slack budgeting for low power design

Low power design has become one of the most significant requirements since CMOS technology entered the nanometer era. Timing budgeting is therefore often performed to slow down as many components as possible, so that timing slacks can be applied to reduce power consumption while maintaining the performance of the whole design. Retiming is a procedure that relocates flip-flops (FFs) across logic gates to achieve a faster clock speed. In this paper we show that the retiming and slack budgeting problem can be formulated as a convex cost dual network flow problem. Both the theoretical analysis and the experimental results show the efficiency of our approach, which not only reduces power consumption by 8.9% but also achieves a 500-fold speedup over previous work.

Preprint, 2014, arXiv

Voltage and Level-Shifter Assignment Driven Floorplanning

Low power design has become a significant requirement since CMOS technology entered the nanometer era. Multiple-Supply Voltage (MSV) is a popular and effective method for both dynamic and static power reduction while maintaining performance. Level shifters may cause area and Interconnect Length Overhead (ILO), and should be considered at both the floorplanning and post-floorplanning stages. In this paper, we propose a two-phase algorithm framework, called VLSAF, to solve the voltage and level-shifter assignment problem. In the floorplanning phase, we use a convex cost network flow algorithm to assign voltages and a minimum cost flow algorithm to handle level-shifter assignment. In the post-floorplanning phase, a heuristic method is adopted to redistribute white space and calculate the positions and shapes of level shifters. The experimental results show that VLSAF is effective.

Preprint, 2013, arXiv

Correlated diffusion of colloidal particles near a liquid-liquid interface

Optical microscopy and multi-particle tracking are used to investigate the cross-correlated diffusion of quasi-two-dimensional (2D) colloidal particles near an oil-water interface. It is shown that the effect of the interface on correlated diffusion is asymmetric. Along the line joining the centers of the particles, the amplitude of the correlated diffusion coefficient $D_{\|}(r)$ is enhanced by the interface, while the decay rate of $D_{\|}(r)$ is hardly affected. In the direction perpendicular to this line, the decay rate of $D_{\bot}(r)$ is enhanced at short inter-particle separation $r$. This enhancement fades at long $r$. In addition, both $D_{\|}(r)$ and $D_{\bot}(r)$ are independent of the colloidal area fraction $n$ at long $r$, which indicates that the hydrodynamic interactions (HIs) among the particles are dominated by those transmitted through the surrounding fluid in this regime. However, at short $r$, $D_{\bot}(r)$ depends on $n$, which suggests that the HIs receive a larger contribution from the 2D particle monolayer itself.
