Researcher profile

Chao Wu

Chao Wu contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
20works
0followers
11topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

20 published item(s)

preprint2026arXiv

Detection of Oscillations in a Type I X-Ray Burst of 4U 0614+091 with SVOM/ECLAIRs

On 2025 January 10, a thermonuclear (Type I) X-ray burst from the neutron star low-mass X-ray binary \textit{4U~0614+091} was detected with the ECLAIRs instrument on board the \textit{SVOM} mission. We present here a time-resolved spectroscopic analysis of the burst, along with the detection of burst oscillations within a 51-second interval during the decay phase. The oscillation frequency is measured to be $ν= 413.674 \pm 0.002\,\mathrm{Hz}$, consistent with previous reports. However, we detect a significant downward frequency drift over the burst duration, characterized by $\dotν = (-4.7 \pm 0.3) \times 10^{-3}\,\mathrm{Hz\,s^{-1}}$. This frequency evolution is atypical compared to those observed in similar burst oscillation sources. We tentatively attribute the observed drift to a Doppler shift induced by orbital motion. Under this interpretation, the inferred orbital period must be shorter than 20 minutes, placing \textit{4U~0614+091} among the most compact known low-mass X-ray binaries.

preprint2026arXiv

Interference-governed electromagnetic-thermal coupling and heat transport in pulse EUV-irradiated multilayer nanofilms

Mo-Si multilayer mirrors are central to extreme ultraviolet lithography, where nanoscale optical interference and heat accumulation together constrain reflectivity and operational stability. Here we develop an analytical electromagnetic-thermal coupling model that directly links transfer-matrix-based interference-controlled energy deposition with transient heat conduction in EUV-irradiated multilayers. The model reveals a fundamental trade-off whereby increasing the multilayer period number enhances reflectivity but simultaneously elevates temperature by impeding heat dissipation. Interference-driven volumetric absorption further gives rise to pronounced axial temperature gradients and a post-pulse downward migration of the heat-flux maximum, a delayed-heating effect inaccessible to conventional surface-flux-based models. Systematic analysis establishes scaling laws connecting interfacial thermal resistance, beam size, and incident energy density to thermal confinement and temperature rise. By incorporating interfacial compaction kinetics, the model enables a quantitative assessment of mirror lifetime. This work offers a theoretical tool for thermal-optical co-design of multilayer nanostructures including EUV mirrors under pulsed irradiation across a wide spectral range.

preprint2026arXiv

Massless-Massive Amplitude Correspondence I: Helicity-chirality Matching and On-shell Higgsing

In this work, the massless-massive correspondence for the on-shell scattering amplitudes is constructed so the massive amplitudes could inherit advantageous techniques developed in the massless calculation. This correspondence is established by matching massless amplitudes to Minimal Helicity-Chirality (MHC) amplitudes, which arise from an expansion of massive spin-spinor amplitudes in terms of the chirality-flip $mη$ order by order. The primary MHC amplitude deforms into a massless amplitude of the same helicity; if a vector boson is involved, it may instead vanish due to the associated conserved current. In cases where the primary amplitude vanishes, the leading contributions originate from descendant MHC amplitudes, each corresponding to a distinct massless amplitude in the ultraviolet theory containing either a transverse gauge boson or a Goldstone boson. We propose a systematic amplitude deformation procedure for three-point massless-massive matching based on helicity-chirality unification and the scaling properties of $mη$. Sub-leading MHC amplitudes are matched to massless amplitudes with additional on-shell Higgs splitting, a process known as on-shell Higgsing. In this work, we extend and reinterpret on-shell Higgsing as a transversality flip between different MHC states, and obtain all the 3-point massless-massive matching results in the spontaneous broken standard model.

preprint2026arXiv

Massless-Massive Amplitude Correspondence II: Constructive Massive Amplitudes in Standard Model

In the minimal helicity-chirality formalism, we systematically construct higher-point massive amplitudes from the fundamental building blocks: the contact three-point and four-point massive amplitudes. The inclusion of four-point contact amplitudes is essential to maintain gauge invariance in the spontaneously broken Standard Model. We construct all the standard model massive contact amplitudes and identify the physical light-cone gauge nature of massive amplitudes. Then only using the contact minimal helicity-chirality amplitudes at the leading order, we show both bootstrap techniques and on-shell recursion relations can be utilized to compute higher-point massive amplitudes. This provides a systematic framework for constructing various higher-point electroweak amplitudes, analogous to established on-shell methods for massless theories. Finally by deforming the gauge-invariant $n$-point amplitudes, we extend the massless-massive correspondence from three-and-four point contact amplitudes to general $n$-point factorized amplitudes.

preprint2026arXiv

Watch Wider and Think Deeper: Collaborative Cross-modal Chain-of-Thought for Complex Visual Reasoning

Multi-modal reasoning requires the seamless integration of visual and linguistic cues, yet existing Chain-of-Thought methods suffer from two critical limitations in cross-modal scenarios: (1) over-reliance on single coarse-grained image regions, and (2) semantic fragmentation between successive reasoning steps. To address these issues, we propose the CoCoT (Collaborative Coross-modal Thought) framework, built upon two key innovations: a) Dynamic Multi-Region Grounding to adaptively detect the most relevant image regions based on the question, and b) Relation-Aware Reasoning to enable multi-region collaboration by iteratively aligning visual cues to form a coherent and logical chain of thought. Through this approach, we construct the CoCoT-70K dataset, comprising 74,691 high-quality samples with multi-region annotations and structured reasoning chains. Extensive experiments demonstrate that CoCoT significantly enhances complex visual reasoning, achieving an average accuracy improvement of 15.4% on LLaVA-1.5 and 4.0% on Qwen2-VL across six challenging benchmarks. The data and code are available at: https://github.com/deer-echo/CoCoT.

preprint2022arXiv

Adversarial Examples for Good: Adversarial Examples Guided Imbalanced Learning

Adversarial examples are inputs for machine learning models that have been designed by attackers to cause the model to make mistakes. In this paper, we demonstrate that adversarial examples can also be utilized for good to improve the performance of imbalanced learning. We provide a new perspective on how to deal with imbalanced data: adjust the biased decision boundary by training with Guiding Adversarial Examples (GAEs). Our method can effectively increase the accuracy of minority classes while sacrificing little accuracy on majority classes. We empirically show, on several benchmark datasets, our proposed method is comparable to the state-of-the-art method. To our best knowledge, we are the first to deal with imbalanced learning with adversarial examples.

preprint2022arXiv

Camera-Conditioned Stable Feature Generation for Isolated Camera Supervised Person Re-IDentification

To learn camera-view invariant features for person Re-IDentification (Re-ID), the cross-camera image pairs of each person play an important role. However, such cross-view training samples could be unavailable under the ISolated Camera Supervised (ISCS) setting, e.g., a surveillance system deployed across distant scenes. To handle this challenging problem, a new pipeline is introduced by synthesizing the cross-camera samples in the feature space for model training. Specifically, the feature encoder and generator are end-to-end optimized under a novel method, Camera-Conditioned Stable Feature Generation (CCSFG). Its joint learning procedure raises concern on the stability of generative model training. Therefore, a new feature generator, $σ$-Regularized Conditional Variational Autoencoder ($σ$-Reg.~CVAE), is proposed with theoretical and experimental analysis on its robustness. Extensive experiments on two ISCS person Re-ID datasets demonstrate the superiority of our CCSFG to the competitors.

preprint2022arXiv

Compactified AdS black holes, Chamblin-Reall background, and their dual non-conformal relativistic fluids

The Chamblin-Reall background is a static solution of Einstein gravity coupled with a background scalar field and a dynamical domain wall, with the potential of the scalar field being of Liouville type. It can be got by dimensionally reducing a higher dimensional background with a constant potential. Compactified AdS black holes are black hole backgrounds constructed by wrapping one or more spatial directions of a higher dimensional AdS black hole on a torus and then integrating them out. The compactified AdS black hole background is asymptotically flat, non-conformal, and of Chamblin-Reall type. In this work, we derive all the 7 dynamical second-order transport coefficients for the relativistic fluids dual to compactified AdS black holes of various dimensions via fluid/gravity correspondence. Through this work, we achieve three main goals: (1) We prove that all the gravitational backgrounds that can be used to extract analytical results for second-order transport coefficients hitherto are all Chamblin-Reall type backgrounds. (2) We generalize the results in previous studies on the second-order transport coefficients of the relativistic fluids dual to 5-dimensional Chamblin-Reall model into general dimensions. (3) We offer a thorough study on the Kanitscheider-Skenderis proposal and find its physical accounts.

preprint2022arXiv

Edge-Cloud Polarization and Collaboration: A Comprehensive Survey for AI

Influenced by the great success of deep learning via cloud computing and the rapid development of edge chips, research in artificial intelligence (AI) has shifted to both of the computing paradigms, i.e., cloud computing and edge computing. In recent years, we have witnessed significant progress in developing more advanced AI models on cloud servers that surpass traditional deep learning models owing to model innovations (e.g., Transformers, Pretrained families), explosion of training data and soaring computing capabilities. However, edge computing, especially edge and cloud collaborative computing, are still in its infancy to announce their success due to the resource-constrained IoT scenarios with very limited algorithms deployed. In this survey, we conduct a systematic review for both cloud and edge AI. Specifically, we are the first to set up the collaborative learning mechanism for cloud and edge modeling with a thorough review of the architectures that enable such mechanism. We also discuss potentials and practical experiences of some on-going advanced edge AI topics including pretraining models, graph neural networks and reinforcement learning. Finally, we discuss the promising directions and challenges in this field.

preprint2022arXiv

Massive On-shell Recursion Relations for n-point Amplitudes

We construct two and three-line shifts for tree-level amplitude with massless and/or massive particles, and provide a method to construct general multi-line shifts for all masses. We choose the massless-massive BCFW shift from these shifts and examine its validity in renormalizable theories. Using such a shift, we find that amplitudes with at least one massless vector boson are constructible. This reveals the importance of gauge theory in the construction of amplitudes with massive particles. We also find that this kind of amplitudes have a cancellation related to group structure among different channels, which is essential for constructibility. Furthermore, we show that in the limit of large shift parameter $z$, the amplitude with four massive vector bosons, which can include transverse massive vector particles, have structures proportional to the amplitude with shifted vector particles replaced by Goldstone bosons in the leading order. This is responsible for the failure of massive-massive BCFW recursion relations in the amplitudes with four massive vector bosons.

preprint2022arXiv

S2RL: Do We Really Need to Perceive All States in Deep Multi-Agent Reinforcement Learning?

Collaborative multi-agent reinforcement learning (MARL) has been widely used in many practical applications, where each agent makes a decision based on its own observation. Most mainstream methods treat each local observation as an entirety when modeling the decentralized local utility functions. However, they ignore the fact that local observation information can be further divided into several entities, and only part of the entities is helpful to model inference. Moreover, the importance of different entities may change over time. To improve the performance of decentralized policies, the attention mechanism is used to capture features of local information. Nevertheless, existing attention models rely on dense fully connected graphs and cannot better perceive important states. To this end, we propose a sparse state based MARL (S2RL) framework, which utilizes a sparse attention mechanism to discard irrelevant information in local observations. The local utility functions are estimated through the self-attention and sparse attention mechanisms separately, then are combined into a standard joint value function and auxiliary joint value function in the central critic. We design the S2RL framework as a plug-and-play module, making it general enough to be applied to various methods. Extensive experiments on StarCraft II show that S2RL can significantly improve the performance of many state-of-the-art methods.

preprint2022arXiv

Software Engineers Response to Public Crisis: Lessons Learnt from Spontaneously Building an Informative COVID-19 Dashboard

The Coronavirus disease 2019 (COVID-19) outbreak quickly spread around the world, resulting in over 240 million infections and 4 million deaths by Oct 2021. While the virus is spreading from person to person silently, fear has also been spreading around the globe. The COVID-19 information from the Australian Government is convincing but not timely or detailed, and there is much information on social networks with both facts and rumors. As software engineers, we have spontaneously and rapidly constructed a COVID-19 information dashboard aggregating reliable information semi-automatically checked from different sources for providing one-stop information sharing site about the latest status in Australia. Inspired by the John Hopkins University COVID-19 Map, our dashboard contains the case statistics, case distribution, government policy, latest news, with interactive visualization. In this paper, we present a participant's in-person observations in which the authors acted as founders of https://covid-19-au.com/ serving more than 830K users with 14M page views since March 2020. According to our first-hand experience, we summarize 9 lessons for developers, researchers and instructors. These lessons may inspire the development, research and teaching in software engineer aspects for coping with similar public crises in the future.

preprint2022arXiv

Unified Group Fairness on Federated Learning

Federated learning (FL) has emerged as an important machine learning paradigm where a global model is trained based on the private data from distributed clients. However, most of existing FL algorithms cannot guarantee the performance fairness towards different groups because of data distribution shift over groups. In this paper, we formulate the problem of unified group fairness on FL, where the groups can be formed by clients (including existing clients and newly added clients) and sensitive attribute(s). To solve this problem, we first propose a general fair federated framework. Then we construct a unified group fairness risk from the view of federated uncertainty set with theoretical analyses to guarantee unified group fairness on FL. We also develop an efficient federated optimization algorithm named Federated Mirror Descent Ascent with Momentum Acceleration (FMDA-M) with convergence guarantee. We validate the advantages of the FMDA-M algorithm with various kinds of distribution shift settings in experiments, and the results show that FMDA-M algorithm outperforms the existing fair FL algorithms on unified group fairness.

preprint2021arXiv

Applications of Artificial Intelligence in Particle Radiotherapy

Radiotherapy, due to its technology-intensive nature and reliance on digital data and human-machine interactions, is particularly suited to benefit from artificial intelligence (AI) to improve the accuracy and efficiency of its clinical workflow. Recently, various artificial intelligence (AI) methods have been successfully developed to exploit the benefit of the inherent physical properties of particle therapy. Many reviews about AI applications in radiotherapy have already been published, but none were specifically dedicated to particle therapy. In this article, we present a comprehensive review of the recent published works on AI applications in particle therapy, which can be classified into particle therapy treatment planning, adaptive particle therapy, range and dose verification and other applications in particle therapy. Although promising results reported in these works demonstrate how AI-based methods can help exploit the intrinsic physic advantages of particle therapy, challenges remained to be address before AI applications in particle therapy enjoy widespread implementation in clinical practice.

preprint2020arXiv

Evaluation Framework For Large-scale Federated Learning

Federated learning is proposed as a machine learning setting to enable distributed edge devices, such as mobile phones, to collaboratively learn a shared prediction model while keeping all the training data on device, which can not only take full advantage of data distributed across millions of nodes to train a good model but also protect data privacy. However, learning in scenario above poses new challenges. In fact, data across a massive number of unreliable devices is likely to be non-IID (identically and independently distributed), which may make the performance of models trained by federated learning unstable. In this paper, we introduce a framework designed for large-scale federated learning which consists of approaches to generating dataset and modular evaluation framework. Firstly, we construct a suite of open-source non-IID datasets by providing three respects including covariate shift, prior probability shift, and concept shift, which are grounded in real-world assumptions. In addition, we design several rigorous evaluation metrics including the number of network nodes, the size of datasets, the number of communication rounds and communication resources etc. Finally, we present an open-source benchmark for large-scale federated learning research.

preprint2020arXiv

Federated Mutual Learning

Federated learning (FL) enables collaboratively training deep learning models on decentralized data. However, there are three types of heterogeneities in FL setting bringing about distinctive challenges to the canonical federated learning algorithm (FedAvg). First, due to the Non-IIDness of data, the global shared model may perform worse than local models that solely trained on their private data; Second, the objective of center server and clients may be different, where center server seeks for a generalized model whereas client pursue a personalized model, and clients may run different tasks; Third, clients may need to design their customized model for various scenes and tasks; In this work, we present a novel federated learning paradigm, named Federated Mutual Leaning (FML), dealing with the three heterogeneities. FML allows clients training a generalized model collaboratively and a personalized model independently, and designing their private customized models. Thus, the Non-IIDness of data is no longer a bug but a feature that clients can be personally served better. The experiments show that FML can achieve better performance than alternatives in typical FL setting, and clients can be benefited from FML with different models and tasks.

preprint2020arXiv

Improving Proton Dose Calculation Accuracy by Using Deep Learning

Accurate dose calculation is vitally important for proton therapy. Pencil beam (PB) model-based dose calculation is fast but inaccurate due to the approximation when dealing with inhomogeneities. Monte Carlo (MC) dose calculation is the most accurate method, but it is time consuming. We hypothesize that deep learning methods can boost the accuracy of PB dose calculation to the level of MC. In this work, we developed a deep learning model that converts PB to MC doses for different tumor sites. The proposed model is based on our newly developed hierarchically densely connected U-Net (HD U-Net) network, and it uses the PB dose and patient CT image as inputs to generate the MC dose. We used 290 patients (90 with head and neck, 93 with liver, 75 with prostate, and 32 with lung cancer) to train, validate, and test the model. For each tumor site, we performed four numerical experiments to explore various combinations of training datasets. Training the model on data from all tumor sites together and using the dose distribution of each individual beam as input yielded the best performance for all four tumor sites. The average gamma index (1mm/1% criteria) between the converted dose and the MC dose was 92.8%, 92.7%, 89.7% and 99.6% for head and neck, liver, lung, and prostate test patients, respectively. The average time for dose conversion for a single field was less than 4 seconds. In conclusion, our deep learning-based approach can quickly boost the accuracy of proton PB dose distributions to that of MC dose distributions. The trained model can be readily adapted to new datasets for different tumor sites and from different hospitals through transfer learning. This model can be added as a plug-in to the clinical workflow of proton therapy treatment planning to improve the accuracy of proton dose calculation.

preprint2020arXiv

Second order transport coefficients of nonconformal relativistic fluids in various dimensions from Dp-brane

We derive all the dynamical second order transport coefficients for Dp-brane with $p$ from 1 to 6 within the framework of fluid/gravity correspondence in this paper. The D5 and D6-brane do not have dual relativistic fluids; D3-brane corresponds to 4-dimensional conformal relativistic fluid; D1, D2 and D4-brane separately correspond to nonconformal relativistic fluids of dimensions 2, 3 and 5. The Haack-Yarom relation only exists for Dp-branes with $p$ larger than 2 and is also satisfied by them. We also find that the Romatschke and Kleinert-Probst relations need to be generalized in order to be valid for relativistic fluids of dimensions other than 4.

preprint2020arXiv

Transfer Heterogeneous Knowledge Among Peer-to-Peer Teammates: A Model Distillation Approach

Peer-to-peer knowledge transfer in distributed environments has emerged as a promising method since it could accelerate learning and improve team-wide performance without relying on pre-trained teachers in deep reinforcement learning. However, for traditional peer-to-peer methods such as action advising, they have encountered difficulties in how to efficiently expressed knowledge and advice. As a result, we propose a brand new solution to reuse experiences and transfer value functions among multiple students via model distillation. But it is still challenging to transfer Q-function directly since it is unstable and not bounded. To address this issue confronted with existing works, we adopt Categorical Deep Q-Network. We also describe how to design an efficient communication protocol to exploit heterogeneous knowledge among multiple distributed agents. Our proposed framework, namely Learning and Teaching Categorical Reinforcement (LTCR), shows promising performance on stabilizing and accelerating learning progress with improved team-wide reward in four typical experimental environments.

preprint2019arXiv

Charged black holes in the Einstein-Maxwell-Weyl gravity

We construct charged asymptotically flat black hole solutions in Einstein-Maxwell-Weyl(EMW) gravity. These solutions can be interpreted as generalizations of two different groups: Schwarzschild black hole (SBH) and non-Schwarzschild black hole (NSBH) solutions. In addition, we discuss the thermodynamic properties of two groups of numerical solutions in detail, and show that they obey the first law of thermodynamics.