Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
13works
0followers
19topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

13 published item(s)

preprint2026arXiv

Beyond GSD-as-Token: Continuous Scale Conditioning for Remote Sensing VLMs

Remote sensing vision-language models (RS-VLMs) face a fundamental mismatch with natural-image counterparts: the same geographic object exhibits radically different visual evidence across ground sampling distances (GSDs) spanning multiple orders of magnitude. Yet existing RS-VLMs often discard GSD or inject it as a discrete text token, forcing a single static parameter set to absorb the entire scale spectrum. We introduce ScaleEarth, a parameter-efficient fine-tuning framework built on Qwen3-VL that treats GSD as a continuous conditioning variable governing the model's computation path. At its core, CS-HLoRA (Continuous Scale-Conditioned Hyper-LoRA) modulates the LoRA low-rank subspace through a GSD-driven gate, enabling the model to dynamically route computation by physical scale. To remove reliance on sensor metadata at deployment, we pair CS-HLoRA with SSE-U, a lightweight heteroscedastic sub-head that predicts GSD and its uncertainty from visual features. To provide matching supervision, we construct GeoScale-VQA, a 1.5M-sample scale-layered RS-VQA corpus whose question-answer generation is conditioned on the same physical scalar that drives CS-HLoRA, forming a closed method-data loop. Trained with QLoRA on an 8B backbone, ScaleEarth achieves state-of-the-art results on remote-sensing benchmarks covering diverse Earth-system tasks, including XLRS-Bench and OmniEarth-Bench.

preprint2026arXiv

SPARKLE: A Nonparametric Approach for Online Decision-Making with High-Dimensional Covariates

Personalized services are central to today's digital economy, and their sequential decisions are often modeled as contextual bandits. Modern applications pose two main challenges: high-dimensional covariates and the need for nonparametric models to capture complex reward-covariate relationships. We propose SPARKLE, a novel contextual bandit algorithm based on a sparse additive reward model that addresses both challenges through (i) a doubly penalized estimator for nonparametric reward estimation and (ii) an epoch-based design with adaptive screening to balance exploration and exploitation. We prove a sublinear regret bound that grows only logarithmically in the covariate dimensionality; to our knowledge, this is the first such result for nonparametric contextual bandits with high-dimensional covariates. We also derive an information-theoretic lower bound, and the gap to the upper bound vanishes as the reward smoothness increases. Extensive experiments on synthetic data and real data from video recommendation and personalized medicine show strong performance in high-dimensional settings.

preprint2026arXiv

Uncertainty-Adjusted Sorting for Asset Pricing with Machine Learning

Machine learning is central to empirical asset pricing, but portfolio construction still relies on point predictions and largely ignores asset-specific estimation uncertainty. We propose a simple change: sort assets using uncertainty-adjusted prediction bounds instead of point predictions alone. Across a broad set of ML models and a U.S. equity panel, this approach improves portfolio performance relative to point-prediction sorting. These gains persist even when bounds are built from partial or misspecified uncertainty information. They arise mainly from reduced volatility and are strongest for flexible machine learning models. Identification and robustness exercises show that these improvements are driven by asset-level rather than time or aggregate predictive uncertainty.

preprint2024arXiv

A First-Principle Approach to X-ray Active Optics: Design and Verification

This paper presents the first-principle design approach for X-ray active optics, using the simulation-modulation cycle in place of the measurement-modulation feedback loops used in traditional active optics. Hence, the new active optics have the potential to outperform the accuracy of surface-shape metrology instruments. We apply an X-ray mirror with localized thermal elastic deformation to validate the idea. Both the finite element simulations and surface shape measurements have demonstrated that the active optics modulation accuracy limit can be achieved at the atomic layer level. It is believed that the implementation of the first-principle design strategy has the capacity to revolutionize both the manufacturing processes of X-ray mirrors and the beamline engineering of synchrotron radiation.

preprint2022arXiv

Ab initio calculations of spin-nonconserving exciton-phonon scattering in monolayer transition metal dichalcogenides

We investigate the spin-nonconserving relaxation channel of excitons by their couplings with phonons in two-dimensional transition metal dichalcogenides using $\textit{ab initio}$ approaches. Combining $\text{GW}$-Bethe-Salpeter equation method and density functional perturbation theory, we calculate the electron-phonon and exciton-phonon coupling matrix elements for the spin-flip scattering in monolayer WSe$_{\text{2}}$, and further analyze the microscopic mechanisms influencing these scattering strengths. We find that phonons could produce effective in-plane magnetic fields which flip spin of excitons, giving rise to relaxation channels complimentary to the spin-conserving relaxation. Finally, we calculate temperature-dependent spin-flip exciton-phonon relaxation times. Our method and analysis can be generalized to study other two-dimensional materials and would stimulate experimental measurements of spin-flip exciton relaxation dynamics.

preprint2022arXiv

Knowledge Gradient for Selection with Covariates: Consistency and Computation

Knowledge gradient is a design principle for developing Bayesian sequential sampling policies to solve optimization problems. In this paper we consider the ranking and selection problem in the presence of covariates, where the best alternative is not universal but depends on the covariates. In this context, we prove that under minimal assumptions, the sampling policy based on knowledge gradient is consistent, in the sense that following the policy the best alternative as a function of the covariates will be identified almost surely as the number of samples grows. We also propose a stochastic gradient ascent algorithm for computing the sampling policy and demonstrate its performance via numerical experiments.

preprint2022arXiv

Pre-training Tasks for User Intent Detection and Embedding Retrieval in E-commerce Search

BERT-style models pre-trained on the general corpus (e.g., Wikipedia) and fine-tuned on specific task corpus, have recently emerged as breakthrough techniques in many NLP tasks: question answering, text classification, sequence labeling and so on. However, this technique may not always work, especially for two scenarios: a corpus that contains very different text from the general corpus Wikipedia, or a task that learns embedding spacial distribution for a specific purpose (e.g., approximate nearest neighbor search). In this paper, to tackle the above two scenarios that we have encountered in an industrial e-commerce search system, we propose customized and novel pre-training tasks for two critical modules: user intent detection and semantic embedding retrieval. The customized pre-trained models after fine-tuning, being less than 10% of BERT-base's size in order to be feasible for cost-efficient CPU serving, significantly improve the other baseline models: 1) no pre-training model and 2) fine-tuned model from the official pre-trained BERT using general corpus, on both offline datasets and online system. We have open sourced our datasets for the sake of reproducibility and future works.

preprint2022arXiv

Sample and Computationally Efficient Stochastic Kriging in High Dimensions

Stochastic kriging has been widely employed for simulation metamodeling to predict the response surface of complex simulation models. However, its use is limited to cases where the design space is low-dimensional because, in general, the sample complexity (i.e., the number of design points required for stochastic kriging to produce an accurate prediction) grows exponentially in the dimensionality of the design space. The large sample size results in both a prohibitive sample cost for running the simulation model and a severe computational challenge due to the need to invert large covariance matrices. Based on tensor Markov kernels and sparse grid experimental designs, we develop a novel methodology that dramatically alleviates the curse of dimensionality. We show that the sample complexity of the proposed methodology grows only slightly in the dimensionality, even under model misspecification. We also develop fast algorithms that compute stochastic kriging in its exact form without any approximation schemes. We demonstrate via extensive numerical experiments that our methodology can handle problems with a design space of more than 10,000 dimensions, improving both prediction accuracy and computational efficiency by orders of magnitude relative to typical alternative methods in practice.

preprint2021arXiv

Engineering of Atomic-Scale Flexoelectricity at Grain Boundaries

Flexoelectricity is a type of ubiquitous and prominent electromechanical coupling, pertaining to the response of electrical polarization to mechanical strain gradients while not restricted to the symmetry of materials. However, large elastic deformation in most solids is usually difficult to achieve and the strain gradient at minuscule is challenging to control. Here we exploit the exotic structural inhomogeneity of grain boundary to achieve a huge strain gradient (~ 1.2 nm-1) within 3 ~ 4 unit-cells, and thus obtain atomic-scale flexoelectric polarization up to ~ 38 μC/cm2 at a 24 LaAlO3 grain boundary. The nanoscale flexoelectricity also modifies the electrical activity of grain boundaries. Moreover, we prove that it is a general and feasible way to form large strain gradients at atomic scale by altering the misorientation angles of grain boundaries in different dielectric materials. Thus, engineering of grain boundaries provides an effective pathway to achieve tunable flexoelectricity and broadens the electromechanical functionalities of non-piezoelectric materials.

preprint2020arXiv

Ranking and Selection with Covariates for Personalized Decision Making

We consider a problem of ranking and selection via simulation in the context of personalized decision making, where the best alternative is not universal but varies as a function of some observable covariates. The goal of ranking and selection with covariates (R&S-C) is to use simulation samples to obtain a selection policy that specifies the best alternative with certain statistical guarantee for subsequent individuals upon observing their covariates. A linear model is proposed to capture the relationship between the mean performance of an alternative and the covariates. Under the indifference-zone formulation, we develop two-stage procedures for both homoscedastic and heteroscedastic simulation errors, respectively, and prove their statistical validity in terms of average probability of correct selection. We also generalize the well-known slippage configuration, and prove that the generalized slippage configuration is the least favorable configuration for our procedures. Extensive numerical experiments are conducted to investigate the performance of the proposed procedures, the experimental design issue, and the robustness to the linearity assumption. Finally, we demonstrate the usefulness of R&S-C via a case study of selecting the best treatment regimen in the prevention of esophageal cancer. We find that by leveraging disease-related personal information, R&S-C can substantially improve patients' expected quality-adjusted life years by providing patient-specific treatment regimen.

preprint2020arXiv

The Unusual Eruption of the Extragalactic Classical Nova M31N 2017-09a

M31N 2017-09a is a classical nova and was observed for some 160 days following its initial eruption, during which time it underwent a number of bright secondary outbursts. The light-curve is characterized by continual variation with excursions of at least 0.5 magnitudes on a daily time-scale. The lower envelope of the eruption suggests that a single power-law can describe the decline rate. The eruption is relatively long with $t_2 = 111$, and $t_3 = 153$ days.

preprint2019arXiv

Acoustic Multifunctional Logic Gates and Amplifier based on Passive Parity-Time Symmetry

Acoustic analogue computation and signal processing are of great significance, however, it's challenging to realize the acoustic computing devices because of their limitations of single function and complex structure. In this paper, an acoustic multifunctional device, which can gate or amplify acoustic waves without resorting to altering the frequency and structure using a passive acoustic parity-time (PT)-symmetric metamaterial, is realized theoretically and experimentally. The metamaterial is constructed by five lossless-loss periodically distributed media which are modulated to achieve the passive PT symmetry. At the coherent perfect absorber (CPA)-emitter point in the broken PT-symmetric phase, the logic gates (AND, OR, XOR and NOT) and small signal amplifier are realized in a single system by adjusting the phase and amplitude differences between two incoming beams, respectively. This work provides a new route for the connection between the PT symmetry and the acoustic metamaterial, which has great potential applications in acoustic modulation and acoustic multifunctional device.

preprint2019arXiv

Rule-Guided Compositional Representation Learning on Knowledge Graphs

Representation learning on a knowledge graph (KG) is to embed entities and relations of a KG into low-dimensional continuous vector spaces. Early KG embedding methods only pay attention to structured information encoded in triples, which would cause limited performance due to the structure sparseness of KGs. Some recent attempts consider paths information to expand the structure of KGs but lack explainability in the process of obtaining the path representations. In this paper, we propose a novel Rule and Path-based Joint Embedding (RPJE) scheme, which takes full advantage of the explainability and accuracy of logic rules, the generalization of KG embedding as well as the supplementary semantic structure of paths. Specifically, logic rules of different lengths (the number of relations in rule body) in the form of Horn clauses are first mined from the KG and elaborately encoded for representation learning. Then, the rules of length 2 are applied to compose paths accurately while the rules of length 1 are explicitly employed to create semantic associations among relations and constrain relation embeddings. Besides, the confidence level of each rule is also considered in optimization to guarantee the availability of applying the rule to representation learning. Extensive experimental results illustrate that RPJE outperforms other state-of-the-art baselines on KG completion task, which also demonstrate the superiority of utilizing logic rules as well as paths for improving the accuracy and explainability of representation learning.