Researcher profile

Da Xu

Da Xu contributes to research discovery and scholarly infrastructure.

Researcher
Affiliation not imported
Open to collaborate

Trust snapshot

Quick read

Trust 21 - Emerging
Verification L1
Unclaimed author
15 works
0 followers
12 topics
4 close collaborators

Published work

15 published items

preprint, 2026, arXiv

The Angular Momentum Penrose Inequality

We prove the Penrose inequality with angular momentum for asymptotically flat, axisymmetric vacuum initial data sets containing a stable marginally outer trapped surface. This inequality provides a lower bound for the ADM mass in terms of the area and angular momentum of the black hole horizon, with equality holding if and only if the initial data set corresponds to a slice of the Kerr spacetime. Our proof combines the Jang equation approach with the p-harmonic level set method. A key component of the analysis is a modified Hawking mass functional that incorporates angular momentum and exhibits monotonicity along the flow. We also establish the rigidity of the inequality using the Mars-Simon tensor.
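For orientation, the angular-momentum Penrose inequality is usually quoted in the literature in the form below (geometric units, G = c = 1). The abstract does not write the formula out, so treating this as the exact statement proved in the paper is an assumption; here A is the horizon area, J the angular momentum, and m_ADM the ADM mass.

```latex
m_{\mathrm{ADM}} \;\ge\; \sqrt{\frac{A}{16\pi} + \frac{4\pi J^{2}}{A}}
```

Equality in this form holds for Kerr, where the mass M satisfies M^2 = A/(16 pi) + 4 pi J^2 / A.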

preprint, 2025, arXiv

The Spacetime Penrose Inequality: Conditional Results for Stable MOTS and General Trapped Surfaces

We present a rigorous proof of the Spacetime Penrose Inequality relating the ADM mass to the area of trapped surfaces in asymptotically flat initial data sets satisfying the dominant energy condition. The main theorem establishes that the ADM mass is bounded below by the square root of the area divided by 16 pi for an area-maximizing marginally outer trapped surface (MOTS), subject to a distributional favorable jump condition which we prove is structurally guaranteed by KKT optimality. The extension to the outermost MOTS remains conditional on the hypothesis that the area maximizer coincides with the outermost MOTS, or equivalently on Weak Cosmic Censorship. We explicitly flag that without this condition, the proof for general trapped surfaces does not go through, as evidenced by binary merger counterexamples. We provide a complete double-limit analysis of the Agostiniani-Mazzieri-Oronzio level-set flow on the singular Jang space, resolving regularity and boundary-term obstructions. In the equality case, the initial data embed isometrically into the Schwarzschild spacetime.
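The bound described in words above can be written as follows (geometric units, G = c = 1), together with a quick consistency check on Schwarzschild, whose horizon sits at r = 2m. The symbols m_ADM and A are notation chosen here, not taken from the abstract.

```latex
m_{\mathrm{ADM}} \;\ge\; \sqrt{\frac{A}{16\pi}},
\qquad
\text{Schwarzschild: } A = 4\pi (2m)^{2} = 16\pi m^{2}
\;\Longrightarrow\; \sqrt{\frac{A}{16\pi}} = m .
```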

preprint, 2022, arXiv

A three-dimensional dynamic mode decomposition analysis of wind farm flow aerodynamics

High-fidelity large-eddy simulations (LES) are well suited to providing insight into the complex flow dynamics in extended wind farms. To better understand these flow dynamics, we use dynamic mode decomposition (DMD) to analyze and reconstruct the flow field in large-scale wind farms simulated with LES. Different wind farm layouts are considered, and we find that a combination of horizontal and vertical staggering leads to improved wind farm performance compared to traditional horizontal staggering. We analyze the wind farm flows using the amplitude selection (AP) and sparsity-promoting (SP) DMD approaches. We find that the AP method tends to select modes with a small length scale and a high frequency, while the SP method selects large coherent structures with low frequency. The latter are somewhat reminiscent of modes obtained using proper orthogonal decomposition (POD). We find that a relatively limited number of SP-DMD modes is sufficient to accurately reconstruct the flow field in the entire wind farm, whereas the AP-DMD method requires more modes to achieve an accurate reconstruction. Thus, the SP-DMD method has a smaller performance loss compared to the AP-DMD method in terms of the reconstruction of the flow field.
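As a rough illustration of the DMD step, here is a minimal sketch of standard "exact DMD" on a toy two-dimensional system. It is not the authors' code: the function names, the toy rotation-decay system, and the least-squares amplitude fit are all assumptions made for illustration, and the sparsity-promoting variant is not shown.

```python
import numpy as np


def exact_dmd(snapshots, rank):
    """Exact DMD of a snapshot matrix whose columns are states at equal time steps."""
    X = snapshots[:, :-1]  # states at steps 0 .. m-2
    Y = snapshots[:, 1:]   # states at steps 1 .. m-1
    U, s, Vh = np.linalg.svd(X, full_matrices=False)
    U_r = U[:, :rank]
    s_r = s[:rank]
    V_r = Vh[:rank, :].conj().T
    # Reduced linear operator: U_r^H Y V_r S_r^{-1} (division scales the columns)
    A_tilde = (U_r.conj().T @ Y @ V_r) / s_r
    eigvals, W = np.linalg.eig(A_tilde)
    # Exact DMD modes: Y V_r S_r^{-1} W
    modes = ((Y @ V_r) / s_r) @ W
    # Mode amplitudes: least-squares fit of the first snapshot in the mode basis
    first = snapshots[:, 0].astype(complex)
    amplitudes = np.linalg.lstsq(modes, first, rcond=None)[0]
    return eigvals, modes, amplitudes


def reconstruct(eigvals, modes, amplitudes, step):
    """Reconstruct the state at a given time step from the DMD modes."""
    return (modes @ (amplitudes * eigvals ** step)).real


def toy_snapshots(n_steps=20):
    """Hypothetical test data: a decaying rotation in the plane."""
    theta, decay = 0.3, 0.9
    A = decay * np.array([[np.cos(theta), -np.sin(theta)],
                          [np.sin(theta), np.cos(theta)]])
    states = [np.array([1.0, 0.0])]
    for _ in range(n_steps - 1):
        states.append(A @ states[-1])
    return np.stack(states, axis=1)
```

Ranking the returned modes by `abs(amplitudes)` gives a simplified version of amplitude-based selection. A sparsity-promoting selection would instead solve a regularized optimization over the amplitudes.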

preprint, 2022, arXiv

Clutter Edges Detection Algorithms for Structured Clutter Covariance Matrices

This letter deals with the problem of clutter edge detection and localization in training data. To this end, the problem is formulated as a binary hypothesis test assuming that the ranks of the clutter covariance matrix are known, and adaptive architectures are designed based on the generalized likelihood ratio test to decide whether the training data within a sliding window contains a homogeneous set or two heterogeneous subsets. In the design stage, we utilize four different covariance matrix structures (i.e., Hermitian, persymmetric, symmetric, and centrosymmetric) to exploit the a priori information. Then, for the case of unknown ranks, the architectures are extended by devising a preliminary estimation stage resorting to the model order selection rules. Numerical examples based on both synthetic and real data highlight that the proposed solutions possess superior detection and localization performance with respect to the competitors that do not use any a priori information.

preprint, 2022, arXiv

From Intervention to Domain Transportation: A Novel Perspective to Optimize Recommendation

The interventional nature of recommendation has attracted increasing attention in recent years. It particularly motivates researchers to formulate learning and evaluating recommendation as causal inference and data missing-not-at-random problems. However, few take seriously the consequence of violating the critical overlap assumption, which we prove can significantly threaten the validity and interpretation of the outcome. We find a critical piece missing in the current understanding of information retrieval (IR) systems: as an intervention, recommendation not only affects the already observed data, but also interferes with the target domain (distribution) of interest. We then rephrase optimizing recommendation as finding an intervention that best transports the patterns it learns from the observed domain to its intervention domain. Towards this end, we use domain transportation to characterize the learning-intervention mechanism of recommendation. We design a principled transportation-constraint risk minimization objective and convert it to a two-player minimax game. We prove the consistency, generalization, and excess risk bounds for the proposed objective, and elaborate on how they compare to the current results. Finally, we carry out extensive real-data and semi-synthetic experiments to demonstrate the advantage of our approach, and launch online testing with a real-world IR system.

preprint, 2022, arXiv

On the Advances and Challenges of Adaptive Online Testing

In recent years, industry interest in developing adaptive solutions for online testing has grown significantly. While advances related to this relatively new technology have been made in multiple domains, the literature lacks a systematic and complete treatment of the procedure, which involves exploration, inference, and analysis. This short paper aims to develop a comprehensive understanding of adaptive online testing, including various building blocks and analytical results. We also address the latest developments, research directions, and challenges that have received less attention in the literature.

preprint, 2022, arXiv

Pointwise error estimates of compact difference scheme for mixed-type time-fractional Burgers' equation

In this paper, based on a newly developed nonlinear fourth-order operator and the method of order reduction, a novel fourth-order compact difference scheme is constructed for the mixed-type time-fractional Burgers' equation. The $L_1$ discretization formula is employed to handle the fractional derivative terms, and the nonlinear convection term is discretized by the nonlinear compact difference operator. A fully discrete compact difference scheme is then established by approximating the spatial second-order derivative with the classic compact difference formula. The convergence and stability are rigorously proved in the $L^{\infty}$-norm by the energy argument and mathematical induction. Finally, several numerical experiments are provided to verify the theoretical analysis.
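To make the $L_1$ discretization concrete, the sketch below implements the textbook L1 formula for a Caputo derivative of order 0 < alpha < 1 on a uniform time grid. The abstract does not give the paper's exact setup, so the Caputo definition, the order range, and the function name `l1_caputo` are assumptions; the compact spatial operators are not shown.

```python
import math


def l1_caputo(u_values, tau, alpha):
    """L1 approximation of the Caputo derivative of order alpha (0 < alpha < 1).

    u_values holds u(t_0), ..., u(t_n) on a uniform grid with step tau;
    the derivative is approximated at the last grid point t_n.
    """
    n = len(u_values) - 1
    coeff = tau ** (-alpha) / math.gamma(2.0 - alpha)
    total = 0.0
    for k in range(n):
        # L1 weights: b_k = (k + 1)^(1 - alpha) - k^(1 - alpha)
        b_k = (k + 1) ** (1.0 - alpha) - k ** (1.0 - alpha)
        total += b_k * (u_values[n - k] - u_values[n - k - 1])
    return coeff * total
```

The formula integrates piecewise-linear interpolants exactly, so for u(t) = t it reproduces the exact Caputo derivative t^(1 - alpha) / Gamma(2 - alpha), which makes a convenient sanity check.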

preprint, 2022, arXiv

Reinforcement-learning-based control of convectively-unstable flows

This work reports the application of a model-free deep-reinforcement-learning-based (DRL) flow control strategy to suppress perturbations evolving in the 1-D linearised Kuramoto-Sivashinsky (KS) equation and 2-D boundary layer flows. The former is commonly used to model the disturbance developing in flat-plate boundary layer flows. These flow systems are convectively unstable, being able to amplify the upstream disturbance, and are thus difficult to control. The control action is implemented through a volumetric force at a fixed position and the control performance is evaluated by the reduction of perturbation amplitude downstream. We first demonstrate the effectiveness of the DRL-based control in the KS system subjected to a random upstream noise. The amplitude of perturbation monitored downstream is significantly reduced and the learnt policy is shown to be robust to both measurement and external noise. One focus of this work is the optimal placement of sensors in the DRL control, using the gradient-free particle swarm optimisation algorithm. After the optimisation process for different numbers of sensors, a specific eight-sensor placement is found to yield the best control performance. The optimised sensor placement in the KS equation is applied directly to control 2-D Blasius boundary layer flows and can efficiently reduce the downstream perturbation energy. Flow analyses show that the control mechanism found by DRL is opposition control. Moreover, when flow instability information is embedded in the DRL reward function to penalise the instability, the control performance improves further in this convectively unstable flow.

preprint, 2022, arXiv

Towards Robust Off-policy Learning for Runtime Uncertainty

Off-policy learning plays a pivotal role in optimizing and evaluating policies prior to online deployment. However, during real-time serving, we observe a variety of interventions and constraints that cause inconsistency between the online and offline settings, which we summarize under the term runtime uncertainty. Such uncertainty cannot be learned from the logged data because it is abnormal and rare. To ensure a certain level of robustness, we perturb the off-policy estimators along an adversarial direction in view of the runtime uncertainty. This allows the resulting estimators to be robust not only to observed but also to unexpected runtime uncertainties. Leveraging this idea, we bring runtime-uncertainty robustness to three major off-policy learning methods: the inverse propensity score method, the reward-model method, and the doubly robust method. We theoretically justify the robustness of our methods to runtime uncertainty, and demonstrate their effectiveness using both simulation and real-world online experiments.
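For readers unfamiliar with the three base estimators named above, here is a minimal sketch of their standard, non-robust forms for a discrete action set. The function names and array layouts are assumptions for illustration, and the adversarial perturbation that the paper adds on top is not shown.

```python
import numpy as np


def ips_estimate(rewards, actions, logging_probs, target_policy):
    """Inverse propensity score estimate of the target policy's value.

    target_policy[i, a] is the target probability of action a for sample i;
    logging_probs[i] is the logging probability of the action actually taken.
    """
    idx = np.arange(len(rewards))
    weights = target_policy[idx, actions] / logging_probs
    return float(np.mean(weights * rewards))


def reward_model_estimate(q_values, target_policy):
    """Reward-model (direct) estimate: average predicted reward under the target."""
    return float(np.mean(np.sum(target_policy * q_values, axis=1)))


def doubly_robust_estimate(rewards, actions, logging_probs, target_policy, q_values):
    """Doubly robust estimate: reward model plus an IPS-weighted residual correction."""
    idx = np.arange(len(rewards))
    weights = target_policy[idx, actions] / logging_probs
    model_term = np.sum(target_policy * q_values, axis=1)
    correction = weights * (rewards - q_values[idx, actions])
    return float(np.mean(model_term + correction))
```

With an all-zero reward model the doubly robust estimate reduces to IPS, and with a perfect reward model the correction term vanishes, which is the usual intuition for its name.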

preprint, 2022, arXiv

Towards the D-Optimal Online Experiment Design for Recommender Selection

Selecting the optimal recommender via online exploration-exploitation is attracting increasing attention, since traditional A/B testing can be slow and costly, and offline evaluations are prone to the bias of historical data. Finding the optimal online experiment is nontrivial since both the users and the displayed recommendations carry contextual features that are informative to the reward. While the problem can be formalized via the lens of multi-armed bandits, the existing solutions are less than satisfactory because the general methodologies do not account for the case-specific structures, particularly for the e-commerce recommendation we study. To fill the gap, we leverage the D-optimal design from the classical statistics literature to achieve the maximum information gain during exploration, and reveal how it fits seamlessly with the modern infrastructure of online inference. To demonstrate the effectiveness of the optimal designs, we provide semi-synthetic simulation studies with published code and data for reproducibility purposes. We then use our deployment example on Walmart.com to fully illustrate the practical insights and effectiveness of the proposed methods.
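The D-optimal criterion picks design points that maximize the determinant of the information matrix (the sum of x x^T over the chosen feature vectors x). The sketch below is a generic greedy approximation, not the paper's procedure: the function name, the ridge term, and selection without replacement are assumptions, and `n_select` is assumed not to exceed the number of candidates.

```python
import numpy as np


def greedy_d_optimal(candidates, n_select, ridge=1e-3):
    """Greedily choose rows of `candidates` to maximize log det of the information matrix."""
    d = candidates.shape[1]
    info = ridge * np.eye(d)  # small ridge keeps the matrix invertible at the start
    remaining = list(range(len(candidates)))
    chosen = []
    for _ in range(n_select):
        inv_info = np.linalg.inv(info)
        # Matrix determinant lemma: det(M + x x^T) = det(M) * (1 + x^T M^{-1} x),
        # so the best next point maximizes x^T M^{-1} x.
        gains = [candidates[i] @ inv_info @ candidates[i] for i in remaining]
        best = remaining[int(np.argmax(gains))]
        chosen.append(best)
        remaining.remove(best)
        info = info + np.outer(candidates[best], candidates[best])
    log_det = float(np.linalg.slogdet(info)[1])
    return chosen, log_det
```

The greedy rule favors directions of feature space that the current design has explored least, which is the sense in which the design maximizes information gain.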

preprint, 2022, arXiv

Tutorial: Modern Theoretical Tools for Understanding and Designing Next-generation Information Retrieval System

In the relatively short history of machine learning, the subtle balance between engineering and theoretical progress has proved critical at various stages. The most recent wave of AI has brought the IR community powerful techniques, particularly for pattern recognition. While many benefit from the burst of ideas as numerous tasks become algorithmically feasible, the balance is tilting toward the application side. The existing theoretical tools in IR can no longer explain, guide, and justify the newly established methodologies. The consequences can be painful: in stark contrast to how the IR industry has envisioned modern AI making life easier, many are experiencing increased confusion and costs in data manipulation, model selection, monitoring, censoring, and decision making. This reality is not surprising: without handy theoretical tools, we often lack principled knowledge of the pattern recognition model's expressivity, optimization property, and generalization guarantee, and our decision-making process has to rely on over-simplified assumptions and human judgments from time to time. The time has come to bring the community a systematic tutorial on how we successfully adapt those tools and make significant progress in understanding, designing, and eventually productionizing impactful IR systems. We emphasize systematicity because IR is a comprehensive discipline that touches upon particular aspects of learning, causal inference analysis, interactive (online) decision-making, etc. It thus requires systematic calibration to make the imported theoretical tools genuinely useful for IR problems, as they usually exhibit unique structures and definitions. Therefore, we plan this tutorial to systematically demonstrate what we have learned and our successful experience of using advanced theoretical tools for understanding and designing IR systems.

preprint, 2021, arXiv

6 nm super-resolution optical transmission and scattering spectroscopic imaging of carbon nanotubes using a nanometer-scale white light source

Optical hyperspectral imaging based on absorption and scattering of photons at visible and adjacent frequencies is one of the most informative and inclusive characterization methods in materials research. Unfortunately, restricted by the diffraction limit of light, it is unable to resolve the nanoscale inhomogeneity in light-matter interactions, which is diagnostic of the local modulation in material structure and properties. Moreover, many nanomaterials have highly anisotropic optical properties that are highly appealing yet hard to characterize through conventional optical methods. Therefore, there has been a pressing demand in diverse fields including electronics, photonics, physics, and materials science to extend optical hyperspectral imaging to the nanometer length scale. In this work, we report a super-resolution hyperspectral imaging technique that simultaneously measures optical absorption and scattering spectra with illumination from a tungsten-halogen lamp. We demonstrate sub-5 nm spatial resolution in both visible and near-infrared wavelengths (415 to 980 nm) for the hyperspectral imaging of strained single-walled carbon nanotubes (SWNTs) and reconstruct true-color images to reveal the longitudinal and transverse optical transition-induced light absorption and scattering in the SWNTs. This is the first time transverse optical absorption in SWNTs has been clearly observed experimentally. The new technique provides rich near-field spectroscopic information that has made it possible to analyze the spatial modulation of band structure along a single SWNT induced through strain engineering.

preprint, 2021, arXiv

Ground-state cooling of multiple near-degenerate mechanical modes

We propose a general and experimentally feasible approach to realize simultaneous ground-state cooling of an arbitrary number of near-degenerate, or even fully degenerate, mechanical modes, overcoming the limit imposed by the formation of mechanical dark modes. Multiple optical modes are employed to provide different dissipation channels that prevent complete destructive interference of the cooling pathway, thus eliminating the dark modes. The cooling rate and limit are explicitly specified, and the distinguishability of the optical modes with respect to the mechanical modes is found to be critical for an efficient cooling process. In a realistic multi-mode optomechanical system, ground-state cooling of all mechanical modes is demonstrated by sequentially introducing optical drives, proving the feasibility and scalability of the proposed scheme. The work may provide new insights into preparing and manipulating multiple quantum states in macroscopic systems.

preprint, 2021, arXiv

Theoretical Understandings of Product Embedding for E-commerce Machine Learning

Product embeddings have been heavily investigated in the past few years, serving as the cornerstone for a broad range of machine learning applications in e-commerce. Despite the empirical success of product embeddings, little is known about how and why they work from a theoretical standpoint. Analogous results from natural language processing (NLP) often rely on domain-specific properties that are not transferable to the e-commerce setting, and the downstream tasks often focus on different aspects of the embeddings. We take an e-commerce-oriented view of product embeddings and present a complete theoretical view from both the representation learning and the learning theory perspectives. We prove that product embeddings trained by the widely adopted skip-gram negative sampling algorithm and its variants achieve sufficient dimension reduction with respect to a critical product relatedness measure. The generalization performance in the downstream machine learning task is controlled by the alignment between the embeddings and the product relatedness measure. Following the theoretical discoveries, we conduct exploratory experiments that support our theoretical insights for product embeddings.

preprint, 2020, arXiv

Inductive Representation Learning on Temporal Graphs

Inductive representation learning on temporal graphs is an important step toward scalable machine learning on real-world dynamic networks. The evolving nature of temporal dynamic graphs requires handling new nodes as well as capturing temporal patterns. The node embeddings, which are now functions of time, should represent both the static node features and the evolving topological structures. Moreover, node and topological features can be temporal as well, and the node embeddings should also capture their patterns. We propose the temporal graph attention (TGAT) layer to efficiently aggregate temporal-topological neighborhood features as well as to learn the time-feature interactions. For TGAT, we use the self-attention mechanism as a building block and develop a novel functional time encoding technique based on the classical Bochner's theorem from harmonic analysis. By stacking TGAT layers, the network recognizes the node embeddings as functions of time and is able to inductively infer embeddings for both new and observed nodes as the graph evolves. The proposed approach handles both node classification and link prediction tasks, and can be naturally extended to include the temporal edge features. We evaluate our method with transductive and inductive tasks under temporal settings with two benchmark datasets and one industrial dataset. Our TGAT model compares favorably to state-of-the-art baselines as well as previous temporal graph embedding approaches.
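The idea behind a Bochner-style time encoding can be sketched as follows: map a timestamp t to cosine and sine features so that the inner product of two encodings depends only on the time difference, which yields a translation-invariant kernel. This is an illustrative sketch, not the TGAT implementation: the function names are hypothetical, and the frequencies are fixed random draws here, whereas a trained model would be expected to learn them.

```python
import numpy as np


def time_encoding(t, freqs):
    """Map timestamp(s) t to [cos(w_1 t), ..., cos(w_d t), sin(w_1 t), ..., sin(w_d t)] / sqrt(d)."""
    t = np.asarray(t, dtype=float)
    angles = t[..., None] * freqs
    features = np.concatenate([np.cos(angles), np.sin(angles)], axis=-1)
    return features / np.sqrt(len(freqs))


def sample_frequencies(dim, seed=0):
    """Hypothetical stand-in for learned frequencies: draws from a standard normal."""
    rng = np.random.default_rng(seed)
    return rng.normal(size=dim)
```

Because cos(a)cos(b) + sin(a)sin(b) = cos(a - b), the inner product of `time_encoding(t1, freqs)` and `time_encoding(t2, freqs)` equals the average of cos(w_i (t1 - t2)) over the frequencies, so shifting both timestamps by the same amount leaves it unchanged.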