Source author record

Liang Feng

Liang Feng appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

physics.optics Neural and Evolutionary Computing Artificial Intelligence Hardware Architecture Machine Learning Computer Vision cond-mat.mes-hall quant-ph Robotics cond-mat.other Information Retrieval math.CA Networking and Internet Architecture physics.class-ph

Catalog footprint

What is connected

19works

14topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

Alpamayo-R1: Bridging Reasoning and Action Prediction for Generalizable Autonomous Driving in the Long Tail

End-to-end architectures trained via imitation learning have advanced autonomous driving by scaling model size and data, yet performance remains brittle in safety-critical long-tail scenarios where supervision is sparse and causal understanding is limited. We introduce Alpamayo-R1 (AR1), a vision-language-action model (VLA) that integrates Chain of Causation reasoning with trajectory planning for complex driving scenarios. Our approach features three key innovations: (1) the Chain of Causation (CoC) dataset, built through a hybrid auto-labeling and human-in-the-loop pipeline producing decision-grounded, causally linked reasoning traces aligned with driving behaviors; (2) a modular VLA architecture combining Cosmos-Reason, a vision-language model pre-trained for Physical AI, with a diffusion-based trajectory decoder that generates dynamically feasible trajectories in real time; (3) a multi-stage training strategy using supervised fine-tuning to elicit reasoning and reinforcement learning (RL) to enforce reasoning-action consistency and optimize reasoning quality. AR1 achieves up to a 12% improvement in planning accuracy on challenging cases compared to a trajectory-only baseline, with a 35% reduction in close encounter rate in closed-loop simulation. RL post-training improves reasoning quality by 45% and reasoning-action consistency by 37%. Model scaling from 0.5B to 7B parameters shows consistent improvements. On-vehicle road tests confirm real-time performance (99 ms latency) and successful urban deployment. By bridging interpretable reasoning with precise control, AR1 demonstrates a practical path towards Level 4 autonomous driving. Model weights are available at https://huggingface.co/nvidia/Alpamayo-R1-10B with inference code at https://github.com/NVlabs/alpamayo.

preprint2026arXiv

Beyond Fixed Formulas: Data-Driven Linear Predictor for Efficient Diffusion Models

To address the high sampling cost of Diffusion Transformers (DiTs), feature caching offers a training-free acceleration method. However, existing methods rely on hand-crafted forecasting formulas that fail under aggressive skipping. We propose L2P (Learnable Linear Predictor), a simple data-driven caching framework that replaces fixed coefficients with learnable per-timestep weights. Rapidly trained in ~20 seconds on a single GPU, L2P accurately reconstructs current features from past trajectories. L2P significantly outperforms existing baselines: it achieves a 4.55x FLOPs reduction and 4.15x latency speedup on FLUX.1-dev, and maintains high visual fidelity under up to 7.18x acceleration on Qwen-Image models, where prior methods show noticeable quality degradation. Our results show learning linear predictors is highly effective for efficient DiT inference. Code is available at https://github.com/Aredstone/L2P-Cache.

preprint2024arXiv

Towards Multi-Objective High-Dimensional Feature Selection via Evolutionary Multitasking

Evolutionary Multitasking (EMT) paradigm, an emerging research topic in evolutionary computation, has been successfully applied in solving high-dimensional feature selection (FS) problems recently. However, existing EMT-based FS methods suffer from several limitations, such as a single mode of multitask generation, conducting the same generic evolutionary search for all tasks, relying on implicit transfer mechanisms through sole solution encodings, and employing single-objective transformation, which result in inadequate knowledge acquisition, exploitation, and transfer. To this end, this paper develops a novel EMT framework for multiobjective high-dimensional feature selection problems, namely MO-FSEMT. In particular, multiple auxiliary tasks are constructed by distinct formulation methods to provide diverse search spaces and information representations and then simultaneously addressed with the original task through a multi-slover-based multitask optimization scheme. Each task has an independent population with task-specific representations and is solved using separate evolutionary solvers with different biases and search preferences. A task-specific knowledge transfer mechanism is designed to leverage the advantage information of each task, enabling the discovery and effective transmission of high-quality solutions during the search process. Comprehensive experimental results demonstrate that our MO-FSEMT framework can achieve overall superior performance compared to the state-of-the-art FS methods on 26 datasets. Moreover, the ablation studies verify the contributions of different components of the proposed MO-FSEMT.

preprint2022arXiv

The missing link between standing- and traveling-wave resonators

Optical resonators are structures that utilize wave interference and feedback to confine light in all three dimensions. Depending on the feedback mechanism, resonators can support either standing- or traveling-wave modes. Over the years, the distinction between these two different types of modes has become so prevalent that nowadays it is one of the main characteristics for classifying optical resonators. Here, we show that an intermediate link between these two rather different groups exists. In particular, we introduce a new class of photonic resonators that supports a hybrid optical mode, i.e. at one location along the resonator the electromagnetic fields associated with the mode feature a purely standing-wave pattern, while at a different location, the fields of the same mode represent a pure traveling wave. The proposed concept is general and can be implemented using chip-scale photonics as well as free-space optics. Moreover, it can be extended to other wave phenomena such as microwaves and acoustics.

preprint2021arXiv

MicroRec: Efficient Recommendation Inference by Hardware and Data Structure Solutions

Deep neural networks are widely used in personalized recommendation systems. Unlike regular DNN inference workloads, recommendation inference is memory-bound due to the many random memory accesses needed to lookup the embedding tables. The inference is also heavily constrained in terms of latency because producing a recommendation for a user must be done in about tens of milliseconds. In this paper, we propose MicroRec, a high-performance inference engine for recommendation systems. MicroRec accelerates recommendation inference by (1) redesigning the data structures involved in the embeddings to reduce the number of lookups needed and (2) taking advantage of the availability of High-Bandwidth Memory (HBM) in FPGA accelerators to tackle the latency by enabling parallel lookups. We have implemented the resulting design on an FPGA board including the embedding lookup step as well as the complete inference process. Compared to the optimized CPU baseline (16 vCPU, AVX2-enabled), MicroRec achieves 13.8~14.7x speedup on embedding lookup alone and 2.5$~5.4x speedup for the entire recommendation inference in terms of throughput. As for latency, CPU-based engines needs milliseconds for inferring a recommendation while MicroRec only takes microseconds, a significant advantage in real-time recommendation systems.

preprint2021arXiv

Multi-Space Evolutionary Search for Large-Scale Optimization

In recent years, to improve the evolutionary algorithms used to solve optimization problems involving a large number of decision variables, many attempts have been made to simplify the problem solution space of a given problem for the evolutionary search. In the literature, the existing approaches can generally be categorized as decomposition-based methods and dimension-reduction-based methods. The former decomposes a large-scale problem into several smaller subproblems, while the latter transforms the original high-dimensional solution space into a low-dimensional space. However, it is worth noting that a given large-scale optimization problem may not always be decomposable, and it is also difficult to guarantee that the global optimum of the original problem is preserved in the reduced low-dimensional problem space. This paper thus proposes a new search paradigm, namely the multi-space evolutionary search, to enhance the existing evolutionary search methods for solving large-scale optimization problems. In contrast to existing approaches that perform an evolutionary search in a single search space, the proposed paradigm is designed to conduct a search in multiple solution spaces that are derived from the given problem, each possessing a unique landscape. The proposed paradigm makes no assumptions about the large-scale optimization problem of interest, such as that the problem is decomposable or that a certain relationship exists among the decision variables. To verify the efficacy of the proposed paradigm, comprehensive empirical studies in comparison to four state-of-the-art algorithms were conducted using the CEC2013 large-scale benchmark problems.

preprint2020arXiv

FP-Stereo: Hardware-Efficient Stereo Vision for Embedded Applications

Fast and accurate depth estimation, or stereo matching, is essential in embedded stereo vision systems, requiring substantial design effort to achieve an appropriate balance among accuracy, speed and hardware cost. To reduce the design effort and achieve the right balance, we propose FP-Stereo for building high-performance stereo matching pipelines on FPGAs automatically. FP-Stereo consists of an open-source hardware-efficient library, allowing designers to obtain the desired implementation instantly. Diverse methods are supported in our library for each stage of the stereo matching pipeline and a series of techniques are developed to exploit the parallelism and reduce the resource overhead. To improve the usability, FP-Stereo can generate synthesizable C code of the FPGA accelerator with our optimized HLS templates automatically. To guide users for the right design choice meeting specific application requirements, detailed comparisons are performed on various configurations of our library to investigate the accuracy/speed/cost trade-off. Experimental results also show that FP-Stereo outperforms the state-of-the-art FPGA design from all aspects, including 6.08% lower error, 2x faster speed, 30% less resource usage and 40% less energy consumption. Compared to GPU designs, FP-Stereo achieves the same accuracy at a competitive speed while consuming much less energy.

preprint2020arXiv

Multi-factorial Optimization for Large-scale Virtual Machine Placement in Cloud Computing

The placement scheme of virtual machines (VMs) to physical servers (PSs) is crucial to lowering operational cost for cloud providers. Evolutionary algorithms (EAs) have been performed promising-solving on virtual machine placement (VMP) problems in the past. However, as growing demand for cloud services, the existing EAs fail to implement in large-scale virtual machine placement (LVMP) problem due to the high time complexity and poor scalability. Recently, the multi-factorial optimization (MFO) technology has surfaced as a new search paradigm in evolutionary computing. It offers the ability to evolve multiple optimization tasks simultaneously during the evolutionary process. This paper aims to apply the MFO technology to the LVMP problem in heterogeneous environment. Firstly, we formulate a deployment cost based VMP problem in the form of the MFO problem. Then, a multi-factorial evolutionary algorithm (MFEA) embedded with greedy-based allocation operator is developed to address the established MFO problem. After that, a re-migration and merge operator is designed to offer the integrated solution of the LVMP problem from the solutions of MFO problem. To assess the effectiveness of our proposed method, the simulation experiments are carried on large-scale and extra large-scale VMs test data sets. The results show that compared with various heuristic methods, our method could shorten optimization time significantly and offer a competitive placement solution for the LVMP problem in heterogeneous environment.

preprint2020arXiv

Scalable Light-Weight Integration of FPGA Based Accelerators with Chip Multi-Processors

Modern multicore systems are migrating from homogeneous systems to heterogeneous systems with accelerator-based computing in order to overcome the barriers of performance and power walls. In this trend, FPGA-based accelerators are becoming increasingly attractive, due to their excellent flexibility and low design cost. In this paper, we propose the architectural support for efficient interfacing between FPGA-based multi-accelerators and chip-multiprocessors (CMPs) connected through the network-on-chip (NoC). Distributed packet receivers and hierarchical packet senders are designed to maintain scalability and reduce the critical path delay under a heavy task load. A dedicated accelerator chaining mechanism is also proposed to facilitate intra-FPGA data reuse among accelerators to circumvent prohibitive communication overhead between the FPGA and processors. In order to evaluate the proposed architecture, a complete system emulation with programmability support is performed using FPGA prototyping. Experimental results demonstrate that the proposed architecture has high-performance, and is light-weight and scalable in characteristics.

preprint2020arXiv

Vehicular Fog Computing Enabled Real-time Collision Warning via Trajectory Calibration

Vehicular fog computing (VFC) has been envisioned as a promising paradigm for enabling a variety of emerging intelligent transportation systems (ITS). However, due to inevitable as well as non-negligible issues in wireless communication, including transmission latency and packet loss, it is still challenging in implementing safety-critical applications, such as real-time collision warning in vehicular networks. In this paper, we present a vehicular fog computing architecture, aiming at supporting effective and real-time collision warning by offloading computation and communication overheads to distributed fog nodes. With the system architecture, we further propose a trajectory calibration based collision warning (TCCW) algorithm along with tailored communication protocols. Specifically, an application-layer vehicular-to-infrastructure (V2I) communication delay is fitted by the Stable distribution with real-world field testing data. Then, a packet loss detection mechanism is designed. Finally, TCCW calibrates real-time vehicle trajectories based on received vehicle status including GPS coordinates, velocity, acceleration, heading direction, as well as the estimation of communication delay and the detection of packet loss. For performance evaluation, we build the simulation model and implement conventional solutions including cloud-based warning and fog-based warning without calibration for comparison. Real-vehicle trajectories are extracted as the input, and the simulation results demonstrate that the effectiveness of TCCW in terms of the highest precision and recall in a wide range of scenarios.

preprint2016arXiv

Optical Reciprocity Induced Symmetry of the Scattering Eigenstates in Non-$\cal PT$-Symmetric Heterostructures

The scattering matrix $S$ obeys the unitary relation $S^\dagger S=1$ in a Hermitian system and the symmetry property ${\cal PT}S{\cal PT}=S^{-1}$ in a Parity-Time (${\cal PT}$) symmetric system. Here we report a different symmetry relation of the $S$ matrix in a one-dimensional heterostructure, which is given by the amplitude ratio of the incident waves in the scattering eigenstates. It originates from the optical reciprocity and holds independent of the Hermiticity or $\cal PT$ symmetry of the system. Using this symmetry relation, we probe a quasi-transition that is reminiscent of the spontaneous symmetry breaking of a $\cal PT$-symmetric $S$ matrix, now with unbalanced gain and loss and even in the absence of gain. We show that the additional symmetry relation provides a clear evidence of an exceptional point, even when all other signatures of the $\cal PT$ symmetry breaking are completely erased. We also discuss the existence of a final exceptional point in this correspondence, which is attributed to asymmetric reflections from the two sides of the heterostructure.

preprint2015arXiv

Acoustic topological insulator and robust one-way sound transport

Discovery of novel topological orders of condensed matters is of a significant interest in both fundamental and applied physics due to the associated quantum conductance behaviors and unique symmetry-protected backscattering-immune propagation against defects, which inspired similar fantastic effects in classical waves system, leading to the revolution of the manipulation of wave propagation. To date, however, only few theoretical models were proposed to realize acoustic topological states. Here, we theoretically and experimentally demonstrate a two dimensional acoustic topological insulators with acoustic analogue of quantum spin Hall Effect. Due to the band inversion mechanism near the double Dirac cones, acoustic one-way pseudospin dependent propagating edge states, corresponding to spin-plus and spin-minus, can be observed at the interface between two graphene-like acoustic crystals. We have also experimentally verified the associated topological immunity of such one-way edge states against the different lattice defects and disorders, which can always lead to inherent propagation loss and noise. We show that this unique acoustic topological phenomenon can offer a new promising application platform for the design of novel acoustic devices, such as one-way sound isolators, acoustic mode switchers, splitters, filters etc.

preprint2015arXiv

Active coupling control in densely packed subwavelength waveguides via dark mode

The ever growing need for energy-efficient and fast communications is driving the development of highly integrated photonic circuits where controlling light at the nanoscale becomes the most critical aspect of information transfer . Directional couplers, two interacting optical waveguides placed in close proximity, are important building blocks in these integrated photonics circuits and have been employed as optical modulators and switches for high speed communication, data processing and integrated quantum operations. However, active control over the coupling between closely packed waveguides is highly desirable and yet remains a critical barrier towards ultra small footprint devices. A general approach to achieve active control in waveguide systems is to exploit optical nonlinearities enabled by a strong control pulse. However these devices suffer from the nonlinear absorption induced by the intense control pulse as the signal and its control propagate in the same waveguide. Here we experimentally demonstrate a unique scheme based on adiabatic elimination (AE) concept that effectively manipulates the coupling between densely packed waveguides. We demonstrate active coupling control between two closely packed waveguides by tuning the mode index of an in-between decoupled waveguide. This is achieved via a dark mode and thus leaves the signal unaffected by the induced losses. Such a scheme is a promising candidate for ultra-dense integrated nano-photonics such as on-chip ultrafast modulators and tunable filters for optical communication and quantum computing.

preprint2015arXiv

Scattering in $\cal PT$ and $\cal RT$ Symmetric Multimode Waveguides: Generalized Conservation Laws and Spontaneous Symmetry Breaking beyond One Dimension

We extend the generalize conservation law of light propagating in a one-dimensional $\cal PT$-symmetric system, i.e., $|T-1|=\sqrt{R_LR_R}$ for the transmittance $T$ and the reflectance $R_{L,R}$ from the left and right, to a multimode waveguide with either $\cal PT$ or $\cal RT$ symmetry, in which higher dimensional investigations are necessary. These conservation laws exist not only in a matrix form for the transmission and reflection matrices; they also exist in a scalar form for real-valued quantities by defining generalized transmittance and reflectance. We then discuss, for the first time, how a multimode $\cal PT$-symmetric waveguide can be used to observe spontaneous symmetry breaking of the scattering matrix, which typically requires tuning the non-hermiticity of the system (i.e. the strength of gain and loss). Here the advantage of using a multimode waveguide is the elimination of tuning any system parameters: the transverse mode order $m$ plays the role of the symmetry breaking parameter, and one observes the symmetry breaking by simply performing scattering experiment in each waveguide channel at a single frequency and fixed strength of gain and loss.

preprint2014arXiv

$\mathcal{PT}$-symmetric microring laser-absorber

The lasing and coherent perfect absorption (CPA) properties of $\mathcal{PT}$-symmetric microrings with mixed index and gain gratings, externally coupled to a bus waveguide, are theoretically investigated. For a complex grating at the $\mathcal{PT}$ symmetry breaking point, perfect unidirectional (either clockwise or counterclockwise) laser emission can be realized, however the grating does not discriminate longitudinal modes and CPA can not be simultaneously achieved. Above the grating $\mathcal{PT}$ symmetry breaking point, single mode emission and simultaneous CPA can be obtained, with unbalanced and controllable excitation of clockwise and counterclockwise modes in the ring.

preprint2014arXiv

Parity-Time Synthetic Laser

Parity-time (PT) symmetry is a fundamental notion in quantum field theories. It has opened a new paradigm for non-Hermitian Hamiltonians ranging from quantum mechanics, electronics, to optics. In the realm of optics, optical loss is responsible for power dissipation, therefore typically degrading device performance such as attenuation of a laser beam. By carefully exploiting optical loss in the complex dielectric permittivity, however, recent exploration of PT symmetry revolutionizes our understandings in fundamental physics and intriguing optical phenomena such as exceptional points and phase transition that are critical for high-speed optical modulators. The interplay between optical gain and loss in photonic PT synthetic matters offers a new criterion of positively utilizing loss to efficiently manipulate gain and its associated optical properties. Instead of simply compensating optical loss in conventional lasers, for example, it is theoretically proposed that judiciously designed delicate modulation of optical loss and gain can lead to PT synthetic lasing that fundamentally broadens laser physics. Here, we report the first experimental demonstration of PT synthetic lasers. By carefully exploiting the interplay between gain and loss, we achieve degenerate eigen modes at the same frequency but with complex conjugate gain and loss coefficients. In contrast to conventional ring cavity lasers with multiple modes, the PT synthetic micro-ring laser exhibits an intrinsic single mode lasing: the non-threshold PT broken phase inherently associated in such a photonic system squeezes broadband optical gain into a single lasing mode regardless of the gain spectral bandwidth. This chip-scale semiconductor platform provides a unique route towards fundamental explorations of PT physics and next generation of optoelectronic devices for optical communications and computing.

preprint2014arXiv

Photonic analogue of quantum spin Hall effect

Symmetry-protected photonic topological insulator exhibiting robust pseudo-spin-dependent transportation, analogous to quantum spin Hall (QSH) phases and topological insulators, are of great importance in fundamental physics. Such transportation robustness is protected by time-reversal symmetry. Since electrons (fermion) and photons (boson) obey different statistics rules and associate with different time-reversal operators (i.e., Tf and Tb, respectively), whether photonic counterpart of Kramers degeneracy is topologically protected by bosonic Tb remains unidentified. Here, we construct the degenerate gapless edge states of two photonic pseudo-spins (left/right circular polarizations) in the band gap of a two-dimensional photonic crystal with strong magneto-electric coupling. We further demonstrated that the topological edge states are in fact protected by Tf rather than commonly believed Tb and their pseudo-spin dependent transportation is robust against Tf invariant impurities, discovering for the first time the topological nature of photons. Our results will pave a way towards novel photonic topological insulators and revolutionize our understandings in topological physics of fundamental particles.

preprint2012arXiv

Meme as Building Block for Evolutionary Optimization of Problem Instances

A significantly under-explored area of evolutionary optimization in the literature is the study of optimization methodologies that can evolve along with the problems solved. Particularly, present evolutionary optimization approaches generally start their search from scratch or the ground-zero state of knowledge, independent of how similar the given new problem of interest is to those optimized previously. There has thus been the apparent lack of automated knowledge transfers and reuse across problems. Taking the cue, this paper introduces a novel Memetic Computational Paradigm for search, one that models after how human solves problems, and embarks on a study towards intelligent evolutionary optimization of problems through the transfers of structured knowledge in the form of memes learned from previous problem-solving experiences, to enhance future evolutionary searches. In particular, the proposed memetic search paradigm is composed of four culture-inspired operators, namely, Meme Learning, Meme Selection, Meme Variation and Meme Imitation. The learning operator mines for memes in the form of latent structures derived from past experiences of problem-solving. The selection operator identifies the fit memes that replicate and transmit across problems, while the variation operator introduces innovations into the memes. The imitation operator, on the other hand, defines how fit memes assimilate into the search process of newly encountered problems, thus gearing towards efficient and effective evolutionary optimization. Finally, comprehensive studies on two widely studied challenging well established NP-hard routing problem domains, particularly, the capacitated vehicle routing (CVR) and capacitated arc routing (CAR), confirm the high efficacy of the proposed memetic computational search paradigm for intelligent evolutionary optimization of problems.

preprint2011arXiv

Bifurcation of limit cycles by perturbing a piecewise linear Hamiltonian system with a homoclinic loop

In this paper, we study limit cycle bifurcations for a kind of non-smooth polynomial differential systems by perturbing a piecewise linear Hamiltonian system with a center at the origin and a homoclinic loop around the origin. By using the first Melnikov function of piecewise near-Hamiltonian systems, we give lower bounds of the maximal number of limit cycles in Hopf and homoclnic bifurcations, and derive an upper bound of the number of limit cycles that bifurcate from the periodic annulus between the center and the homoclinic loop up to the first order in. In the case when the degree of perturbing terms is low, we obtain a precise result on the number of zeros of the first Melnikov function.

Liang Feng

What is connected

Connect this record

See the researcher in context

Building this map preview

19 published item(s)

Alpamayo-R1: Bridging Reasoning and Action Prediction for Generalizable Autonomous Driving in the Long Tail

Beyond Fixed Formulas: Data-Driven Linear Predictor for Efficient Diffusion Models

Towards Multi-Objective High-Dimensional Feature Selection via Evolutionary Multitasking

The missing link between standing- and traveling-wave resonators

MicroRec: Efficient Recommendation Inference by Hardware and Data Structure Solutions

Multi-Space Evolutionary Search for Large-Scale Optimization

FP-Stereo: Hardware-Efficient Stereo Vision for Embedded Applications

Multi-factorial Optimization for Large-scale Virtual Machine Placement in Cloud Computing

Scalable Light-Weight Integration of FPGA Based Accelerators with Chip Multi-Processors

Vehicular Fog Computing Enabled Real-time Collision Warning via Trajectory Calibration

Optical Reciprocity Induced Symmetry of the Scattering Eigenstates in Non-$\cal PT$-Symmetric Heterostructures

Acoustic topological insulator and robust one-way sound transport

Active coupling control in densely packed subwavelength waveguides via dark mode

Scattering in $\cal PT$ and $\cal RT$ Symmetric Multimode Waveguides: Generalized Conservation Laws and Spontaneous Symmetry Breaking beyond One Dimension

$\mathcal{PT}$-symmetric microring laser-absorber

Parity-Time Synthetic Laser

Photonic analogue of quantum spin Hall effect

Meme as Building Block for Evolutionary Optimization of Problem Instances

Bifurcation of limit cycles by perturbing a piecewise linear Hamiltonian system with a homoclinic loop