Source author record

Chenchen Yang

Chenchen Yang appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.


Information Theory, math.IT, Networking and Internet Architecture, Computation and Language, Artificial Intelligence, Computer Vision, cond-mat.mtrl-sci, physics.app-ph, Robotics, Sound

Catalog footprint

What is connected

10 works

10 topics

4 close collaborators


preprint, 2026, arXiv

WESR: Scaling and Evaluating Word-level Event-Speech Recognition

Speech conveys not only linguistic information but also rich non-verbal vocal events such as laughing and crying. While semantic transcription is well-studied, the precise localization of non-verbal events remains a critical yet under-explored challenge. Current methods suffer from insufficient task definitions with limited category coverage and ambiguous temporal granularity. They also lack standardized evaluation frameworks, hindering the development of downstream applications. To bridge this gap, we first develop a refined taxonomy of 21 vocal events, with a new categorization into discrete (standalone) versus continuous (mixed with speech) types. Based on the refined taxonomy, we introduce WESR-Bench, an expert-annotated evaluation set (900+ utterances) with a novel position-aware protocol that disentangles ASR errors from event detection, enabling precise localization measurement for both discrete and continuous events. We also build a strong baseline by constructing a 1,700+ hour corpus, and train specialized models, surpassing both open-source audio-language models and commercial APIs while preserving ASR quality. We anticipate that WESR will serve as a foundational resource for future research in modeling rich, real-world auditory scenes.

preprint, 2026, arXiv

World Action Models: The Next Frontier in Embodied AI

Vision-Language-Action (VLA) models have achieved strong semantic generalization for embodied policy learning, yet they learn reactive observation-to-action mappings without explicitly modeling how the physical world evolves under intervention. A growing body of work addresses this limitation by integrating world models, predictive models of environment dynamics, into the action generation pipeline. We term this emerging paradigm World Action Models (WAMs): embodied foundation models that unify predictive state modeling with action generation, targeting a joint distribution over future states and actions rather than actions alone. However, the literature remains fragmented across architectures, learning objectives, and application scenarios, lacking a unified conceptual framework. We formally define WAMs and disambiguate them from related concepts, and trace the foundations and early integration of VLA and world model research that gave rise to this paradigm. We organize existing methods into a structured taxonomy of Cascaded and Joint WAMs, with further subdivision by generation modality, conditioning mechanism, and action decoding strategy. We systematically analyze the data ecosystem fueling WAMs development, spanning robot teleoperation, portable human demonstrations, simulation, and internet-scale egocentric video, and synthesize emerging evaluation protocols organized around visual fidelity, physical commonsense, and action plausibility. Overall, this survey provides the first systematic account of the WAMs landscape, clarifies key architectural paradigms and their trade-offs, and identifies open challenges and future opportunities for this rapidly evolving field.
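As a hedged illustration of the distinction drawn in this abstract, the two modeling targets can be written side by side. The notation is an assumption introduced here, not taken from the survey: $o_{\le t}$ denotes past observations, $\ell$ a language instruction, $a_{t:t+H}$ a chunk of future actions, and $s_{t+1:t+H}$ the corresponding future states.

```latex
\begin{align*}
\text{VLA (actions alone):} \quad
  & \pi_\theta\left(a_{t:t+H} \mid o_{\le t}, \ell\right) \\
\text{WAM (future states and actions):} \quad
  & p_\theta\left(s_{t+1:t+H},\, a_{t:t+H} \mid o_{\le t}, \ell\right)
\end{align*}
```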

preprint, 2020, arXiv

Ultraviolet and Near-Infrared Dual Band Selective-Harvesting Transparent Luminescent Solar Concentrators

Visibly transparent luminescent solar concentrators (TLSCs) can optimize both power production and visible transparency by selectively harvesting the invisible portion of the solar spectrum. Since the primary applications of TLSCs include building envelopes, greenhouses, automobiles, signage, and mobile electronics, maintaining aesthetics and functionalities is as important as achieving high power conversion efficiencies (PCEs) in practical deployment. In this work, we combine massive-downshifting phosphorescent nanoclusters and fluorescent organic molecules into a TLSC system as ultraviolet (UV) and near-infrared (NIR) selective-harvesting luminophores, respectively, demonstrating UV and NIR dual-band selective-harvesting TLSCs with PCE over 3%, average visible transmittance (AVT) exceeding 75% and color metrics suitable for the window industry. With distinct wavelength-selectivity and effective utilization of the invisible portion of the solar spectrum, this work reports the highest light utilization efficiency (PCE x AVT) of 2.6 for a TLSC system, the highest PCE of any transparent photovoltaic device with AVT greater than 70%, and outperforms the practical limit for non-wavelength-selective transparent photovoltaics.
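The light utilization efficiency quoted in this abstract is the product of PCE and AVT. A short consistency check, assuming PCE is expressed in percent and AVT as a fraction (the convention under which the reported value of 2.6 matches the stated bounds):

```latex
\mathrm{LUE} = \mathrm{PCE} \times \mathrm{AVT}, \qquad
\mathrm{PCE} > 3\ (\%),\quad \mathrm{AVT} > 0.75
\;\Longrightarrow\;
\mathrm{LUE} > 3 \times 0.75 = 2.25
```

The reported LUE of 2.6 lies above this lower bound, as expected when both quantities exceed their stated thresholds.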

preprint, 2016, arXiv

Interference Cancellation at Receivers in Cache-Enabled Wireless Networks

In this paper, we propose to exploit the limited cached packets as side information to cancel incoming interference at the receiver side. We consider a stochastic network where the random locations of base stations and users are modeled using Poisson point processes. Caching schemes that reap both the local caching gain and the interference cancellation gain for the users are developed based on two factors: the density of different user subsets and the packets cached in the corresponding subsets. The packet loss rate (PLR) is analyzed, which depends on both the cached packets and the channel state information (CSI) available at the receiver. Theoretical results reveal the tradeoff between caching resources and wireless resources. The performance of different caching schemes is analyzed, and the minimum achievable PLR for distributed caching is derived.

preprint, 2016, arXiv

Modeling and Analysis for Cache-enabled Cognitive D2D Communications in Cellular Networks

This paper focuses on applying cognition to cache-enabled device-to-device (D2D) communication underlaying a multi-channel cellular network. D2D pairs communicate directly by sensing the available cellular channels, bypassing the base station (BS). Dynamic service is considered, and network performance is evaluated using stochastic geometry. Node locations are first modeled as mutually independent Poisson point processes, and the service queueing process is formulated. The corresponding tier association and cognitive access protocol are then developed. The delay and queue length at the BS and the D2D transmitter are further analyzed by modeling the traffic dynamics of request arrivals and departures as a discrete-time multiserver queue with priorities. Moreover, the impacts of physical-layer and content-centric features on system performance are jointly investigated to provide insight.

preprint, 2016, arXiv

Modeling and Analysis for Cache-Enabled Networks with Dynamic Traffic

Instead of assuming fully loaded cells when analyzing cache-enabled networks with the tools of stochastic geometry, we focus on dynamic traffic in this letter. By modeling the traffic dynamics of request arrivals and departures, the probabilities of full-, free-, and modest-load cells in the large-scale cache-enabled network are derived from the traffic queue state. Moreover, we propose to exploit the packets cached at cache-enabled users as side information to cancel the incoming interference. The packet loss rates for both the cache-enabled and cache-untenable users are then investigated. The simulation results verify our analysis.

preprint, 2016, arXiv

Opportunistic Channel Sharing in Stochastic Networks with Dynamic Traffic

In this paper, we consider a stochastic network with dynamic traffic. The spatial distributions of access points (APs) and users are first modeled as mutually independent Poisson point processes (PPPs). Unlike most prior work, which assumes that all APs are fully loaded, we account for the fact that APs with no data to transmit do not generate interference to users. The APs opportunistically share the channel according to whether they have a packet to transmit and according to the proposed interference suppression strategy. Within an interference suppression region, only one AP can be active at a time to transmit a packet on the channel, and the other adjacent APs keep silent to reduce serious interference. The idle probability of any AP, influenced by the traffic load and the availability of the channels, is analyzed. The density of simultaneously active APs in the network is obtained, and the packet loss rate is further derived. We reveal the impacts of network features (e.g., AP density, user density and channel state) and service features (e.g., user request, packet size) on the network performance. Simulation results validate our proposed model.
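The modeling step described in this abstract can be sketched in a few lines of Python. This is a minimal illustration, not the paper's model: the function names, the square window, and the use of independent thinning for idle APs are all assumptions made here. In particular, the paper's interference suppression region makes AP activity spatially dependent, which independent thinning does not capture.

```python
import random


def sample_poisson(mean, rng):
    """Draw a Poisson(mean) count by counting unit-rate exponential
    arrivals that fall strictly before time `mean`."""
    count = 0
    t = rng.expovariate(1.0)
    while t < mean:
        count += 1
        t += rng.expovariate(1.0)
    return count


def sample_ppp(density, side, rng):
    """Homogeneous PPP on a side x side square (hypothetical window):
    Poisson number of points, then independent uniform locations."""
    n = sample_poisson(density * side * side, rng)
    return [(rng.uniform(0, side), rng.uniform(0, side)) for _ in range(n)]


def active_aps(aps, idle_prob, rng):
    """Simplifying assumption: each AP is idle independently with
    probability idle_prob, so only the remaining APs interfere."""
    return [p for p in aps if rng.random() >= idle_prob]


if __name__ == "__main__":
    rng = random.Random(0)
    aps = sample_ppp(density=1.0, side=10.0, rng=rng)
    users = sample_ppp(density=2.0, side=10.0, rng=rng)  # independent of APs
    active = active_aps(aps, idle_prob=0.3, rng=rng)
    print(len(aps), len(users), len(active))
```

Under independent thinning, the active APs again form a PPP with density scaled by one minus the idle probability, which is why this simplification is common as a first approximation.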

preprint, 2015, arXiv

Analysis on Cache-enabled Wireless Heterogeneous Networks

Caching popular multimedia content is a promising way to unleash the full potential of wireless networks. In this paper, we propose and analyze cache-based content delivery in a three-tier heterogeneous network (HetNet) that includes base stations (BSs), relays and device-to-device (D2D) pairs. We advocate proactively caching popular contents in the relays and in a subset of users with caching ability when the network is off-peak. The cached contents can be reused for frequent access to offload the cellular network traffic. The node locations are first modeled as mutually independent Poisson point processes (PPPs), and the corresponding content access protocol is developed. The average ergodic rate and outage probability in the downlink are then analyzed theoretically. We further derive the throughput and the delay based on the multiclass processor-sharing queue model and the continuous-time Markov process. According to the critical condition of the steady state in the HetNet, the maximum traffic load and the global throughput gain are investigated. Moreover, the impacts of key network characteristics, e.g., the heterogeneity of multimedia contents, node densities and the limited caching capacities, on the system performance are analyzed to provide insight.

preprint, 2015, arXiv

Optimal Caching Placement for D2D Assisted Wireless Caching Networks

In this paper, we devise the optimal caching placement to maximize the offloading probability for a two-tier wireless caching system, where the helpers and a subset of users have caching ability. The offloading comes from local caching, D2D sharing and helper transmission. In particular, to maximize the offloading probability, we reformulate the caching placement problem for users and helpers as a difference of convex (DC) problem, which can be effectively solved by DC programming. Moreover, we analyze the two extreme cases where there is only a helper-tier caching network or only a user-tier one. Specifically, the placement problem for the helper-tier caching network reduces to a convex problem and can be effectively solved by the classical water-filling method. We observe that users and helpers prefer to cache popular contents under low node density and prefer to cache different contents evenly under high node density. Simulation results indicate a large performance gain of the proposed caching placement over existing approaches.
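For readers unfamiliar with the water-filling method mentioned in this abstract, the following sketch shows the textbook form of the technique on a standard power-allocation problem. It is not the paper's caching placement formulation, whose objective is not given here; the function name and the `noise` and `budget` parameters are hypothetical.

```python
def water_filling(noise, budget):
    """Textbook water-filling: maximize sum(log(1 + p_i / noise_i))
    subject to sum(p_i) == budget and p_i >= 0.

    The optimum has the form p_i = max(0, level - noise_i), where the
    water level is chosen so the allocations sum to the budget."""
    if not noise:
        raise ValueError("noise must contain at least one channel")
    order = sorted(noise)
    # Try filling the k lowest-noise channels, starting with all of them,
    # and stop at the first k whose water level covers every filled channel.
    for k in range(len(order), 0, -1):
        level = (budget + sum(order[:k])) / k
        if level > order[k - 1]:
            break
    return [max(0.0, level - n) for n in noise]


if __name__ == "__main__":
    # Water level is 3.0: channels with noise 1 and 2 are filled,
    # the channel with noise 5 stays above the water and gets nothing.
    print(water_filling([1.0, 2.0, 5.0], 3.0))  # [2.0, 1.0, 0.0]
```

The same structure (a common level, with allocations clipped at zero) is what makes water-filling attractive for convex resource-allocation problems with a single sum constraint.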

preprint, 2015, arXiv

When ICN Meets C-RAN for HetNets: An SDN Approach

In this paper, we propose and elaborate on the integration of ICN, C-RAN and SDN for the HetNet to achieve a win-win situation. The vision of the proposed system is presented, followed by its advantages and challenges. We further present the hybrid system with a large-scale wireless heterogeneous campus network.
