Source author record

Yixing Li

Yixing Li appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Information Theory math.IT Machine Learning Computer Vision physics.comp-ph physics.flu-dyn

Catalog footprint

What is connected

6works

6topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

TransMamba: A Sequence-Level Hybrid Transformer-Mamba Language Model

Transformers are the cornerstone of modern large language models, but their quadratic computational complexity limits efficiency in long-sequence processing. Recent advancements in Mamba, a state space model (SSM) with linear complexity, offer promising efficiency gains but suffer from unstable contextual learning and multitask generalization. Some works conduct layer-level hybrid structures that combine Transformer and Mamba layers, aiming to make full use of both advantages. This paper proposes TransMamba, a novel sequence-level hybrid framework that unifies Transformer and Mamba through shared parameter matrices (QKV and CBx), and thus could dynamically switch between attention and SSM mechanisms at different token lengths and layers. We design the Memory Converter to bridge Transformer and Mamba by converting attention outputs into SSM-compatible states, ensuring seamless information flow at TransPoints where the transformation happens. The TransPoint scheduling is also thoroughly explored for balancing effectiveness and efficiency. We conducted extensive experiments demonstrating that TransMamba achieves superior training efficiency and performance compared to single and hybrid baselines, and validated the deeper consistency between Transformer and Mamba paradigms at sequence level, offering a scalable solution for next-generation language modeling. Code and data are available at https://github.com/Yixing-Li/TransMamba

preprint2021arXiv

Common kernel-smoothed proper orthogonal decomposition (CKSPOD): An efficient reduced-order model for emulation of spatiotemporally evolving flow dynamics

In the present study, we propose a new surrogate model, called common kernel-smoothed proper orthogonal decomposition (CKSPOD), to efficiently emulate the spatiotemporal evolution of fluid flow dynamics. The proposed surrogate model integrates and extends recent developments in Gaussian process learning, high-fidelity simulations, projection-based model reduction, uncertainty quantification, and experimental design, rendering a systematic, multidisciplinary framework. The novelty of the CKSPOD emulation lies in the construction of a common Gram matrix, which results from the Hadamard product of Gram matrices of all observed design settings. The Gram matrix is a spatially averaged temporal correlation matrix and contains the temporal dynamics of the corresponding sampling point. The common Gram matrix synthesizes the temporal dynamics by transferring POD modes into spatial functions at each observed design setting, which remedies the phase-difference issue encountered in the kernel-smoothed POD (KSPOD) emulation, a recent fluid flow emulator proposed in Chang et al. (2020). The CKSPOD methodology is demonstrated through a model study of flow dynamics of swirl injectors with three design parameters. A total of 30 training design settings and 8 validation design settings are included. Both qualitative and quantitative results show that the CKSPOD emulation outperforms the KSPOD emulation for all validation cases, and is capable of capturing small-scale wave structures on the liquid-film surface faithfully. The turbulent kinetic energy prediction using CKSPOD reveals lower predictive uncertainty than KSPOD, thereby allowing for more accurate and precise flow predictions. The turnaround time of the CKSPOD emulation is about 5 orders of magnitude faster than the corresponding high-fidelity simulation, which enables an efficient and scalable framework for design exploration and optimization.

preprint2016arXiv

A Binary Convolutional Encoder-decoder Network for Real-time Natural Scene Text Processing

In this paper, we develop a binary convolutional encoder-decoder network (B-CEDNet) for natural scene text processing (NSTP). It converts a text image to a class-distinguished salience map that reveals the categorical, spatial and morphological information of characters. The existing solutions are either memory consuming or run-time consuming that cannot be applied to real-time applications on resource-constrained devices such as advanced driver assistance systems. The developed network can process multiple regions containing characters by one-off forward operation, and is trained to have binary weights and binary feature maps, which lead to both remarkable inference run-time speedup and memory usage reduction. By training with over 200, 000 synthesis scene text images (size of $32\times128$), it can achieve $90\%$ and $91\%$ pixel-wise accuracy on ICDAR-03 and ICDAR-13 datasets. It only consumes $4.59\ ms$ inference run-time realized on GPU with a small network size of 2.14 MB, which is up to $8\times$ faster and $96\%$ smaller than it full-precision version.

preprint2016arXiv

A Data-Driven Compressive Sensing Framework for Long-Term Health Monitoring

Compressive sensing (CS) is a promising technology for realizing energy-efficient wireless sensors for long-term health monitoring. In this paper, we propose a data-driven CS framework that learns signal characteristics and individual variability from patients' data to significantly enhance CS performance and noise resilience. This is accomplished by a co-training approach that optimizes both the sensing matrix and dictionary towards improved restricted isometry property (RIP) and signal sparsity, respectively. Experimental results upon ECG signals show that our framework is able to achieve better reconstruction quality with up to 80% higher compression ratio (CP) than conventional frameworks based on random sensing matrices and overcomplete bases. In addition, our framework shows great noise resilience capability, which tolerates up to 40dB higher noise energy at a CP of 9 times.

preprint2016arXiv

A Data-Driven Compressive Sensing Framework Tailored For Energy-Efficient Wearable Sensing

Compressive sensing (CS) is a promising technology for realizing energy-efficient wireless sensors for long-term health monitoring. However, conventional model-driven CS frameworks suffer from limited compression ratio and reconstruction quality when dealing with physiological signals due to inaccurate models and the overlook of individual variability. In this paper, we propose a data-driven CS framework that can learn signal characteristics and personalized features from any individual recording of physiologic signals to enhance CS performance with a minimized number of measurements. Such improvements are accomplished by a co-training approach that optimizes the sensing matrix and the dictionary towards improved restricted isometry property and signal sparsity, respectively. Experimental results upon ECG signals show that the proposed method, at a compression ratio of 10x, successfully reduces the isometry constant of the trained sensing matrices by 86% against random matrices and improves the overall reconstructed signal-to-noise ratio by 15dB over conventional model-driven approaches.

preprint2016arXiv

An Energy-Efficient Compressive Sensing Framework Incorporating Online Dictionary Learning for Long-term Wireless Health Monitoring

Wireless body area network (WBAN) is emerging in the mobile healthcare area to replace the traditional wire-connected monitoring devices. As wireless data transmission dominates power cost of sensor nodes, it is beneficial to reduce the data size without much information loss. Compressive sensing (CS) is a perfect candidate to achieve this goal compared to existing compression techniques. In this paper, we proposed a general framework that utilize CS and online dictionary learning (ODL) together. The learned dictionary carries individual characteristics of the original signal, under which the signal has an even sparser representation compared to pre-determined dictionaries. As a consequence, the compression ratio is effectively improved by 2-4x comparing to prior works. Besides, the proposed framework offloads pre-processing from sensor nodes to the server node prior to dictionary learning, providing further reduction in hardware costs. As it is data driven, the proposed framework has the potential to be used with a wide range of physiological signals.

Yixing Li

What is connected

Connect this record

See the researcher in context

Building this map preview

6 published item(s)

TransMamba: A Sequence-Level Hybrid Transformer-Mamba Language Model

Common kernel-smoothed proper orthogonal decomposition (CKSPOD): An efficient reduced-order model for emulation of spatiotemporally evolving flow dynamics

A Binary Convolutional Encoder-decoder Network for Real-time Natural Scene Text Processing

A Data-Driven Compressive Sensing Framework for Long-Term Health Monitoring

A Data-Driven Compressive Sensing Framework Tailored For Energy-Efficient Wearable Sensing

An Energy-Efficient Compressive Sensing Framework Incorporating Online Dictionary Learning for Long-term Wireless Health Monitoring