Source author record

Miao Wang

Miao Wang appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

cond-mat.mes-hall cond-mat.mtrl-sci eess.SP eess.SY physics.atom-ph Systems and Control Artificial Intelligence astro-ph.IM Computer Vision Machine Learning Multiagent Systems physics.chem-ph quant-ph

Catalog footprint

What is connected

11works

13topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

Reward-Decomposed Reinforcement Learning for Immersive Video Role-Playing

Text-based role-playing models can imitate character styles, yet they often fail to reflect a scene's atmosphere and evolving tension, both essential for immersive applications such as Virtual Reality (VR) games and interactive narratives. We study video-grounded role-playing dialogue and introduce EBM-RL (Eye-Brain-Mouth Reinforcement Learning), a decoupled GRPO-based framework that explicitly separates observation ([perception]), reasoning ([think]), and utterance ([answer]). This structure promotes human-like sensory grounding by compelling the model to first attend to visual cues, then form internal interpretations, and finally generate context-appropriate dialogue. EBM-RL integrates four complementary rewards: (i) CLIP-based scene-text alignment to improve ambiance and emotion; (ii) a Perceptual-Cognitive reward that encourages [perception] and [think] processes that increase the likelihood of the reference response; (iii) answer accuracy to ensure faithfulness; and (iv) a dense format reward to enforce the desired structured output. Extensive experiments demonstrate that EBM-RL substantially outperforms text-only role-playing baselines and larger-scale vision-language models on our immersive role-playing benchmark, delivering simultaneous gains in visual-atmosphere consistency and character authenticity. Beyond the role-playing domain, EBM-RL also exhibits strong zero-shot generalization: without any additional fine-tuning, it consistently improves performance on out-of-domain VideoQA benchmarks. We additionally release an open-source dataset for video-grounded role-playing dialogue.

preprint2026arXiv

SC-MAS: Constructing Cost-Efficient Multi-Agent Systems with Edge-Level Heterogeneous Collaboration

Large Language Model (LLM)-based Multi-Agent Systems (MAS) enhance complex problem solving through multi-agent collaboration, but often incur substantially higher costs than single-agent systems. Recent MAS routing methods aim to balance performance and overhead by dynamically selecting agent roles and language models. However, these approaches typically rely on a homogeneous collaboration mode, where all agents follow the same interaction pattern, limiting collaboration flexibility across different roles. Motivated by Social Capital Theory, which emphasizes that different roles benefit from distinct forms of collaboration, we propose SC-MAS, a framework for constructing heterogeneous and cost-efficient multi-agent systems. SC-MAS models MAS as directed graphs, where edges explicitly represent pairwise collaboration strategies, allowing different agent pairs to interact through tailored communication patterns. Given an input query, a unified controller progressively constructs an executable MAS by selecting task-relevant agent roles, assigning edge-level collaboration strategies, and allocating appropriate LLM backbones to individual agents. Experiments on multiple benchmarks demonstrate the effectiveness of SC-MAS. In particular, SC-MAS improves accuracy by 3.35% on MMLU while reducing inference cost by 15.38%, and achieves a 3.53% accuracy gain with a 12.13% cost reduction on MBPP. These results validate the feasibility of SC-MAS and highlight the effectiveness of heterogeneous collaboration in multi-agent systems.

preprint2022arXiv

C3-STISR: Scene Text Image Super-resolution with Triple Clues

Scene text image super-resolution (STISR) has been regarded as an important pre-processing task for text recognition from low-resolution scene text images. Most recent approaches use the recognizer's feedback as clues to guide super-resolution. However, directly using recognition clue has two problems: 1) Compatibility. It is in the form of probability distribution, has an obvious modal gap with STISR - a pixel-level task; 2) Inaccuracy. it usually contains wrong information, thus will mislead the main task and degrade super-resolution performance. In this paper, we present a novel method C3-STISR that jointly exploits the recognizer's feedback, visual and linguistical information as clues to guide super-resolution. Here, visual clue is from the images of texts predicted by the recognizer, which is informative and more compatible with the STISR task; while linguistical clue is generated by a pre-trained character-level language model, which is able to correct the predicted texts. We design effective extraction and fusion mechanisms for the triple cross-modal clues to generate a comprehensive and unified guidance for super-resolution. Extensive experiments on TextZoom show that C3-STISR outperforms the SOTA methods in fidelity and recognition performance. Code is available in https://github.com/zhaominyiz/C3-STISR.

preprint2022arXiv

Measurement of infrared magic wavelength for an all-optical trapping of $^{40}$Ca$^{+}$ ion clock

For the first time, we experimentally determine the infrared magic wavelength for the $^{40}$Ca$^{+}$ $4s\, ^{2}\!S_{1/2} \rightarrow 3d\,^{2}\!D_{5/2}$ electric quadrupole transition by observation of the light shift canceling in $^{40}$Ca$^{+}$ optical clock. A "magic" magnetic field direction is chosen to make the magic wavelength insensitive to both the linear polarization purity and the polarization direction of the laser. The determined magic wavelength for this transition is 1056.37(9)~nm, which is not only in good agreement with theoretical predictions but also more precise by a factor of about 300. Using this measured magic wavelength we also derive the differential static polarizability to be $-44.32(32)$~a.u., which will be an important input for the evaluation of the blackbody radiation shift at room temperatures. Our work paves a way for all-optical-trapping of $^{40}$Ca$^{+}$ optical clock.

preprint2020arXiv

An ACE/CRIS-observation-based Galactic Cosmic Rays heavy nuclei spectra model II

An observation-based Galactic Cosmic Ray (GCR) spectral model for heavy nuclei is developed. Zhao and Qin (J. Geophys. Res. Space Phys.118, 1837(2013)) proposed an empirical elemental GCR spectra model for nuclear charge 5-28 over the energy range from 30 to 500 MeV/nuc, which is proved to be successful in predicting yearly averaged GCR heavy nuclei spectra.Based on the latest highly statistically precise measurements from ACE/CRIS,a further elemental GCR model with monthly averaged spectra is presented. The model can reproduce the past and predict the futureGCR intensity monthly by correlating model parameters with thecontinuous sunspot number (SSN) record. The effects of solar activity on GCR modulation are considered separately for odd and even solar cycles. Compared with other comprehensive GCR models, our modeling results are satisfyingly consistent with the GCR spectral measurements from ACE/SIS and IMP-8, and have comparable prediction accuracy as the Badhwar & O'Neill 2014 model.A detailed error analysis is also provided.Finally, the GCR carbon and iron nuclei fluxes for the subsequent two solar cycles (SC 25 and 26) are predicted and they show a potential trend in reduced flux amplitude, which is suspected to be relevant to possible weak solar cycles.

preprint2020arXiv

Energy and Information Management of Electric Vehicular Network: A Survey

The connected vehicle paradigm empowers vehicles with the capability to communicate with neighboring vehicles and infrastructure, shifting the role of vehicles from a transportation tool to an intelligent service platform. Meanwhile, the transportation electrification pushes forward the electric vehicle (EV) commercialization to reduce the greenhouse gas emission by petroleum combustion. The unstoppable trends of connected vehicle and EVs transform the traditional vehicular system to an electric vehicular network (EVN), a clean, mobile, and safe system. However, due to the mobility and heterogeneity of the EVN, improper management of the network could result in charging overload and data congestion. Thus, energy and information management of the EVN should be carefully studied. In this paper, we provide a comprehensive survey on the deployment and management of EVN considering all three aspects of energy flow, data communication, and computation. We first introduce the management framework of EVN. Then, research works on the EV aggregator (AG) deployment are reviewed to provide energy and information infrastructure for the EVN. Based on the deployed AGs, we present the research work review on EV scheduling that includes both charging and vehicle-to-grid (V2G) scheduling. Moreover, related works on information communication and computing are surveyed under each scenario. Finally, we discuss open research issues in the EVN.

preprint2020arXiv

SREC: Proactive Self-Remedy of Energy-Constrained UAV-Based Networks via Deep Reinforcement Learning

Energy-aware control for multiple unmanned aerial vehicles (UAVs) is one of the major research interests in UAV based networking. Yet few existing works have focused on how the network should react around the timing when the UAV lineup is changed. In this work, we study proactive self-remedy of energy-constrained UAV networks when one or more UAVs are short of energy and about to quit for charging. We target at an energy-aware optimal UAV control policy which proactively relocates the UAVs when any UAV is about to quit the network, rather than passively dispatches the remaining UAVs after the quit. Specifically, a deep reinforcement learning (DRL)-based self remedy approach, named SREC-DRL, is proposed to maximize the accumulated user satisfaction scores for a certain period within which at least one UAV will quit the network. To handle the continuous state and action space in the problem, the state-of-the-art algorithm of the actor-critic DRL, i.e., deep deterministic policy gradient (DDPG), is applied with better convergence stability. Numerical results demonstrate that compared with the passive reaction method, the proposed SREC-DRL approach shows a $12.12\%$ gain in accumulative user satisfaction score during the remedy period.

preprint2019arXiv

Perturbed Field Ionization for Improved State Selectivity

Selective field ionization is used to determine the state or distribution of states to which a Rydberg atom is excited. By evolving a small perturbation to the ramped electric field using a genetic algorithm, the shape of the time-resolved ionization signal can be controlled. This allows for separation of signals from pairs of states that would be indistinguishable with unperturbed selective field ionization. Measurements and calculations are presented that demonstrate this technique and shed light on how the perturbation directs the pathway of the electron to ionization. Pseudocode for the genetic algorithm is provided. Using the improved resolution afforded by this technique, quantitative measurements of the $36p_{3/2}+36p_{3/2}\rightarrow 36s_{1/2}+37s_{1/2}$ dipole-dipole interaction are made.

preprint2016arXiv

Dendrite Suppression by Shock Electrodeposition in Charged Porous Media

It is shown that surface conduction can stabilize electrodeposition in random, charged porous media at high rates, above the diffusion-limited current. After linear sweep voltammetry and impedance spectroscopy, copper electrodeposits are visualized by scanning electron microscopy and energy dispersive spectroscopy in two different porous separators (cellulose nitrate, polyethylene), whose surfaces are modified by layer-by-layer deposition of positive or negative charged polyelectrolytes. Above the limiting current, surface conduction inhibits growth in the positive separators and produces irregular dendrites, while it enhances growth and suppresses dendrites behind a deionization shock in the negative separators, also leading to improved cycle life. The discovery of stable uniform growth in the random media differs from the non-uniform growth observed in parallel nanopores and cannot be explained by classical quasi-steady leaky membrane models, which always predict instability and dendritic growth. Instead, the experimental results suggest that transient electro-diffusion in random porous media imparts the stability of a deionization shock to the growing metal interface behind it. Shock electrodeposition could be exploited to enhance the cycle life and recharging rate of metal batteries or to accelerate the fabrication of metal matrix composite coatings.

preprint2016arXiv

Gate-Tunable Negative Longitudinal Magnetoresistance in the Predicted Type-II Weyl Semimetal WTe2

The progress in exploiting new electronic materials and devices has been a major driving force in solid-state physics. As a new state of matter, a Weyl semimetal (WSM), particularly a type-II WSM, hosts Weyl fermions as emergent quasiparticles and may harbor novel electrical transport properties because of the exotic Fermi surface. Nevertheless, such a type-II WSM material has not been experimentally observed in nature. In this work, by performing systematic magneto-transport studies on thin films of a predicted material candidate WTe2, we observe notable angle-sensitive (between the electric and magnetic fields) negative longitudinal magnetoresistance (MR), which can likely be attributed to the chiral anomaly in WSM. This phenomenon also exhibits strong planar orientation dependence with the absence of negative longitudinal MR along the tungsten chains (a axis), which is consistent with the distinctive feature of a type-II WSM. By applying a gate voltage, we demonstrate that the Fermi energy can be tuned through the Weyl points via the electric field effect; this is the first report of controlling the unique transport properties in situ in a WSM system. Our results have important implications for investigating simulated quantum field theory in solid-state systems and may open opportunities for implementing new types of electronic applications, such as field-effect chiral electronic devices.

preprint2015arXiv

The positive piezoconductive effect in graphene

As the thinnest conductive and elastic material, graphene is expected to play a crucial role in post-Moore era. Besides applications on electronic devices, graphene has shown great potential for nano-electromechanical systems. While interlayer interactions play a key role in modifying the electronic structures of layered materials, no attention has been given to their impact on electromechanical properties. Here we report the positive piezoconductive effect observed in suspended bi- and multi-layer graphene. The effect is highly layer number dependent and shows the most pronounced response for tri-layer graphene. The effect, and its dependence on the layer number, can be understood as resulting from the strain-induced competition between interlayer coupling and intralayer transport, as confirmed by the numerical calculations based on the non-equilibrium Green's function method. Our results enrich the understanding of graphene and point to layer number as a powerful tool for tuning the electromechanical properties of graphene for future applications.

Miao Wang

What is connected

Connect this record

See the researcher in context

Building this map preview

11 published item(s)

Reward-Decomposed Reinforcement Learning for Immersive Video Role-Playing

SC-MAS: Constructing Cost-Efficient Multi-Agent Systems with Edge-Level Heterogeneous Collaboration

C3-STISR: Scene Text Image Super-resolution with Triple Clues

Measurement of infrared magic wavelength for an all-optical trapping of $^{40}$Ca$^{+}$ ion clock

An ACE/CRIS-observation-based Galactic Cosmic Rays heavy nuclei spectra model II

Energy and Information Management of Electric Vehicular Network: A Survey

SREC: Proactive Self-Remedy of Energy-Constrained UAV-Based Networks via Deep Reinforcement Learning

Perturbed Field Ionization for Improved State Selectivity

Dendrite Suppression by Shock Electrodeposition in Charged Porous Media

Gate-Tunable Negative Longitudinal Magnetoresistance in the Predicted Type-II Weyl Semimetal WTe2

The positive piezoconductive effect in graphene