Source author record

Xihua Wang

Xihua Wang appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher, unclaimed source record

Computer Vision, Biological Physics, cond-mat.mtrl-sci, physics.chem-ph, Quantitative Methods

Catalog footprint

What is connected

5 works

5 topics

4 close collaborators


Preprint, 2026, arXiv

Qwen-Image-2.0 Technical Report

We present Qwen-Image-2.0, an omni-capable image generation foundation model that unifies high-fidelity generation and precise image editing within a single framework. Despite recent progress, existing models still struggle with ultra-long text rendering, multilingual typography, high-resolution photorealism, robust instruction following, and efficient deployment, especially in text-rich and compositionally complex scenarios. Qwen-Image-2.0 addresses these challenges by coupling Qwen3-VL as the condition encoder with a Multimodal Diffusion Transformer for joint condition-target modeling, supported by large-scale data curation and a customized multi-stage training pipeline. This enables strong multimodal understanding while preserving flexible generation and editing capabilities. The model supports instructions of up to 1K tokens for generating text-rich content such as slides, posters, infographics, and comics, while significantly improving multilingual text fidelity and typography. It also enhances photorealistic generation with richer details, more realistic textures, and coherent lighting, and follows complex prompts more reliably across diverse styles. Extensive human evaluations show that Qwen-Image-2.0 substantially outperforms previous Qwen-Image models in both generation and editing, marking a step toward more general, reliable, and practical image generation foundation models.

Preprint, 2026, arXiv

SyncDPO: Enhancing Temporal Synchronization in Video-Audio Joint Generation via Preference Learning

Recent advancements in video-audio joint generation have achieved remarkable success in semantic correspondence. However, achieving precise temporal synchronization, which requires fine-grained alignment between audio events and their visual triggers, remains a challenging problem. Post-training for joint generation is largely dominated by Supervised Fine-Tuning, but the commonly used Mean Squared Error loss provides insufficient penalties for subtle temporal misalignments. Direct Preference Optimization (DPO) offers an alternative by introducing explicit misaligned counterparts to improve temporal sensitivity. In this paper, we propose SyncDPO, a post-training framework that leverages DPO to improve the temporal sensitivity of video-audio (V-A) joint generation. Conventional DPO pipelines typically depend on costly sampling-and-ranking procedures to construct preference pairs, resulting in substantial computational cost. To improve efficiency, we introduce a suite of on-the-fly rule-based negative construction strategies that distort temporal structures without incurring additional annotation or sampling. We demonstrate that the temporal alignment capability can be effectively reinforced by providing explicit negative supervision through temporally distorted V-A pairs. Accordingly, we implement a curriculum learning strategy that progressively increases the difficulty of negative samples, transitioning from coarse misalignment to subtle inconsistencies. Extensive objective and subjective experiments across four diverse benchmarks, ranging from ambient sound videos to human speech videos, demonstrate that SyncDPO significantly outperforms other methods in improving the model's temporal alignment capability. It also demonstrates superior generalization on an out-of-distribution benchmark by capturing intrinsic motion-sound dynamics. Demo and code are available at https://syncdpo.github.io/syncdpo/.
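To make the idea in this abstract concrete, here is a minimal sketch of one possible rule-based negative (a circular time shift of the audio), a linear coarse-to-subtle curriculum, and the standard log-likelihood form of the DPO loss. All function names, the choice of a circular shift, the linear schedule, and beta = 0.1 are illustrative assumptions; the abstract does not specify the actual distortion rules, schedule, or loss form used by SyncDPO.

```python
import math

def shift_audio(audio_frames, offset):
    """Hypothetical negative: circularly shift audio by `offset` frames
    so it no longer lines up with the (unchanged) video frames."""
    n = len(audio_frames)
    k = offset % n
    # A shift by a multiple of n leaves the clip unchanged (not a useful negative).
    return audio_frames[-k:] + audio_frames[:-k] if k else list(audio_frames)

def curriculum_offset(progress, max_offset, min_offset=1):
    """Assumed schedule: shrink the misalignment linearly from coarse
    (max_offset) to subtle (min_offset) as progress goes from 0 to 1."""
    progress = min(max(progress, 0.0), 1.0)
    return max(min_offset, round(max_offset - progress * (max_offset - min_offset)))

def dpo_loss(logp_w, logp_l, ref_logp_w, ref_logp_l, beta=0.1):
    """Standard DPO loss for one (preferred, rejected) pair:
    -log sigmoid(beta * ((logp_w - ref_w) - (logp_l - ref_l)))."""
    margin = beta * ((logp_w - ref_logp_w) - (logp_l - ref_logp_l))
    if margin < -30.0:
        # log(1 + exp(-margin)) is approximately -margin here; avoids overflow in exp.
        return -margin
    return math.log1p(math.exp(-margin))

# Usage: build a negative for a synchronized pair early in training.
audio = [0, 1, 2, 3, 4, 5, 6, 7]
negative_audio = shift_audio(audio, curriculum_offset(0.0, max_offset=4))
```

In this sketch the original synchronized pair plays the role of the preferred sample and the shifted pair the rejected one, so no sampling or ranking step is needed to form a preference pair.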

Preprint, 2022, arXiv

Surface microlenses for much more efficient photodegradation in water treatment

The global need for clean water requires sustainable technology for purifying contaminated water. Highly efficient solar-driven photodegradation is a sustainable strategy for wastewater treatment. In this work, we demonstrate that the photodegradation efficiency of micropollutants in water can be improved by roughly 2 to 24 times by leveraging polymeric microlenses (MLs). These MLs are fabricated by in-situ polymerization of surface nanodroplets. We found that photodegradation efficiency (η) in water correlates approximately linearly with the sum of the intensity from all focal points of MLs, although no difference in the photodegradation pathway is detected from the chemical analysis of the byproducts. With the same overall power over a given surface area, η is doubled by using ordered arrays, compared to heterogeneous MLs on an unpatterned substrate. Higher η from ML arrays may be attributed to a coupled effect from the focal points on the same plane that creates high local concentrations of active species to further speed up the rate of photodegradation. As a proof of concept for ML-enhanced water treatment, MLs were formed on the inner wall of glass bottles that were used as containers for water to be treated. Three representative micropollutants (norfloxacin, sulfadiazine, and sulfamethoxazole) in the bottles functionalized by MLs were photodegraded 30% to 170% faster than in normal bottles. Our findings suggest that ML-enhanced photodegradation may lead to a highly efficient solar water purification approach without a large solar collector size. Such an approach may be particularly suitable for portable transparent bottles in remote regions.

Preprint, 2014, arXiv

Field Effect Transistor Nanosensor for Breast Cancer Diagnostics

Silicon nanochannel field effect transistor (FET) biosensors are one of the most promising technologies in the development of highly sensitive and label-free analyte detection for cancer diagnostics. With their exceptional electrical properties and small dimensions, silicon nanochannels are ideally suited for extraordinarily high sensitivity. In fact, the high surface-to-volume ratios of these systems make single-molecule detection possible. Further, FET biosensors offer the benefits of high-speed, low-cost, and high-yield manufacturing, without sacrificing the sensitivity typical of traditional optical methods in diagnostics. Top-down manufacturing methods leverage advantages in Complementary Metal Oxide Semiconductor (CMOS) technologies, making richly multiplexed sensor arrays a reality. Here, we discuss the fabrication and use of silicon nanochannel FET devices as biosensors for breast cancer diagnosis and monitoring.

Preprint, 2008, arXiv

Surface modified silicon nanochannel for urea sensing

Silicon nanowires have been surface-functionalized with the enzyme urease for biosensor applications to detect and quantify urea concentration. The device is nanofabricated from a silicon-on-insulator (SOI) wafer with a top-down lithography approach. The differential conductance of silicon nanowires can be tuned for optimum performance using the source-drain bias voltage, and is sensitive to urea at low concentration. The experimental results show a linear relationship between surface potential change and urea concentration in the range of 0.1 to 0.68 mM. The sensitivity of our devices shows high reproducibility over time and under different measurement conditions. The nanowire urea biosensor offers the possibility of high-quality, reusable enzyme sensor array integration with silicon-based circuits.

Institution

Affiliation not imported yet

This author record came from a source that does not expose affiliation metadata. Once the author claims the profile or we enrich the record from another provider, this section will link to the concrete institution.

Topic footprint