Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
18works
0followers
14topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

18 published item(s)

preprint2026arXiv

Single-Sample Black-Box Membership Inference Attack against Vision-Language Models via Cross-modal Semantic Alignment

Vision-Language Models (VLMs) have achieved remarkable success, yet their reliance on massive datasets and unintended memorization of training data raise significant data security risk. Membership Inference Attacks (MIAs) aim to assess these risks by determining whether a data sample was included in a model's training set. However, existing MIA methods against VLMs face critical bottlenecks: gray-box method relies on internal logits that are typically restricted in real-world Application Programming Interfaces (APIs), while black-box method depends on large-scale statistical distributions, which struggle in single-sample scenarios. To this end, we investigate MIAs from the perspective of cross-modal semantic alignment, and observe that member images exhibit significantly stronger image-caption alignment due to training memorization, whereas generated captions for non-members may deviate from the original visual content. Leveraging this insight, we propose a novel MIA framework designed for strict black-box and single-sample setting that quantifies such alignment within a joint embedding space, thereby bypassing these unrealistic assumptions. We conducted extensive experiments on three open-source and two closed-source VLMs. On the VL-MIA/Flicker dataset, our method achieves an AUC of 0.821 against LLaVA-1.5, significantly outperforming existing baselines. Furthermore, it remains robust under diverse image perturbations, highlighting its practicality.

preprint2025arXiv

Beyond Degradation Redundancy: Contrastive Prompt Learning for All-in-One Image Restoration

All-in-One Image Restoration (AiOIR), which addresses diverse degradation types with a unified model, presents significant challenges in designing task-aware prompts that effectively guide restoration across multiple degradation scenarios. While adaptive prompt learning enables end-to-end optimization, it often yields overlapping or redundant task representations. Conversely, explicit prompts derived from pretrained classifiers enhance discriminability but discard critical visual information needed for reconstruction. To address these limitations, we introduce Contrastive Prompt Learning (CPL), a framework that aims to improve prompt-task alignment through two complementary components: a Sparse Prompt Module (SPM) that efficiently captures degradation-aware representations while reducing redundancy, and a Contrastive Prompt Regularization (CPR) that explicitly strengthens task boundaries by incorporating negative prompt samples across different degradation types. Unlike previous approaches that focus primarily on degradation classification, CPL directly optimizes the interaction between prompts and the restoration model. Extensive experiments across five benchmarks show that CPL consistently boosts the performance of strong AiOIR baselines across diverse scenarios. Our approach achieves state-of-the-art average performance on these benchmarks, providing a general and robust solution for AiOIR. The code is available at https://github.com/Aitical/CPLIR

preprint2022arXiv

Converse: A Tree-Based Modular Task-Oriented Dialogue System

Creating a system that can have meaningful conversations with humans to help accomplish tasks is one of the ultimate goals of Artificial Intelligence (AI). It has defined the meaning of AI since the beginning. A lot has been accomplished in this area recently, with voice assistant products entering our daily lives and chat bot systems becoming commonplace in customer service. At first glance there seems to be no shortage of options for dialogue systems. However, the frequently deployed dialogue systems today seem to all struggle with a critical weakness - they are hard to build and harder to maintain. At the core of the struggle is the need to script every single turn of interactions between the bot and the human user. This makes the dialogue systems more difficult to maintain as the tasks become more complex and more tasks are added to the system. In this paper, we propose Converse, a flexible tree-based modular task-oriented dialogue system. Converse uses an and-or tree structure to represent tasks and offers powerful multi-task dialogue management. Converse supports task dependency and task switching, which are unique features compared to other open-source dialogue frameworks. At the same time, Converse aims to make the bot building process easy and simple, for both professional and non-professional software developers. The code is available at https://github.com/salesforce/Converse.

preprint2022arXiv

Effectively Using Long and Short Sessions for Multi-Session-based Recommendations

It is not accurate to make recommendations only based one single current session. Therefore, multi-session-based recommendation(MSBR) is a solution for the problem. Compared with the previous MSBR models, we have made three improvements in this paper. First, the previous work choose to use all the history sessions of the user and/or of his similar users. When the user's current interest changes greatly from the past, most of these sessions can only have negative impacts. Therefore, we select a large number of randomly chosen sessions from the dataset as candidate sessions to avoid over depending on history data. Then we only choose to use the most similar sessions to get the most useful information while reduce the noise caused by dissimilar sessions. Second, in real-world datasets, short sessions account for a large proportion. The RNN often used in previous work is not suitable to process short sessions, because RNN only focuses on the sequential relationship, which we find is not the only relationship between items in short sessions. So, we designed a more suitable method named GAFE based on attention to process short sessions. Third, Although there are few long sessions, they can not be ignored. Not like previous models, which simply process long sessions in the same way as short sessions, we propose LSIS, which can split the interest of long sessions, to make better use of long sessions. Finally, to help recommendations, we also have considered users' long-term interests captured by a multi-layer GRU. Considering the four points above, we built the model ENIREC. Experiments on two real-world datasets show that the comprehensive performance of ENIREC is better than other existing models.

preprint2022arXiv

Fractal dimension of potential singular points set in the Navier-Stokes equations under supercritical regularity

The main objective of this paper is to answer the questions posed by Robinson and Sadowski [21, p. 505, Comm. Math. Phys., 2010]{[RS3]} for the Navier-Stokes equations. Firstly, we prove that the upper box dimension of the potential singular points set $\mathcal{S}$ of suitable weak solution $u$ belonging in $ L^{q}(0,T;L^{p}(\mathbb{R}^{3}))$ for $1\leq\frac{2}{q}+\frac{ 3}{p}\leq\frac32$ with $2\leq q<\infty$ and $2<p<\infty$ is at most $\max\{p,q\}(\frac{2}{q}+\frac{ 3}{p}-1)$ in this system. Secondly, it is shown that $1-2 s$ dimension Hausdorff measure of potential singular points set of suitable weak solutions satisfying $ u\in L^{2}(0,T;\dot{H}^{s+1}(\mathbb{R}^{3}))$ for $0\leq s\leq\frac12$ is zero, whose proof relies on Caffarelli-Silvestre&#39;s extension. Inspired by Baker-Wang&#39;s recent work [1], this further allows us to discuss the Hausdorff dimension of potential singular points set of suitable weak solutions if the gradient of the velocity under some supercritical regularity.

preprint2022arXiv

GLaMa: Joint Spatial and Frequency Loss for General Image Inpainting

The purpose of image inpainting is to recover scratches and damaged areas using context information from remaining parts. In recent years, thanks to the resurgence of convolutional neural networks (CNNs), image inpainting task has made great breakthroughs. However, most of the work consider insufficient types of mask, and their performance will drop dramatically when encountering unseen masks. To combat these challenges, we propose a simple yet general method to solve this problem based on the LaMa image inpainting framework, dubbed GLaMa. Our proposed GLaMa can better capture different types of missing information by using more types of masks. By incorporating more degraded images in the training phase, we can expect to enhance the robustness of the model with respect to various masks. In order to yield more reasonable results, we further introduce a frequency-based loss in addition to the traditional spatial reconstruction loss and adversarial loss. In particular, we introduce an effective reconstruction loss both in the spatial and frequency domain to reduce the chessboard effect and ripples in the reconstructed image. Extensive experiments demonstrate that our method can boost the performance over the original LaMa method for each type of mask on FFHQ, ImageNet, Places2 and WikiArt dataset. The proposed GLaMa was ranked first in terms of PSNR, LPIPS and SSIM in the NTIRE 2022 Image Inpainting Challenge Track 1 Unsupervised.

preprint2022arXiv

Partial regularity of suitable weak solutions of the model arising in amorphous molecular beam epitaxy

In this paper, we are concerned with the precise relationship between the Hausdorff dimension of possible singular point set $\mathcal{S}$ of suitable weak solutions and the parameter $α$ in the nonlinear term in the following parabolic equation $$h_t+h_{xxxx}+\partial_{xx}|h_x|^α=f.$$ It is shown that when $5/3\leqα<7/3$, the $\frac{3α-5}{α-1}$-dimensional parabolic Hausdorff measure of $\mathcal{S}$ is zero, which generalizes the recent corresponding work of Ozánski and Robinson in [31,SIAM J. Math. Anal. 51: 228--255, 2019] for $α=2$ and $f=0$. The same result is valid for a 3D modified Navier-Stokes system.

preprint2022arXiv

Robust topology optimization of structures under uncertain propagation of imprecise stochastic-based uncertain field

This study introduces a novel computational framework for Robust Topology Optimization (RTO) considering imprecise random field parameters. Unlike the worst-case approach, the present method provides upper and lower bounds for the mean and standard deviation of compliance as well as the optimized topological layouts of a structure for various scenarios. In the proposed approach, the imprecise random field variables are determined utilizing parameterized p-boxes with different confidence intervals. The Karhunen-Loève (K-L) expansion is extended to provide a spectral description of the imprecise random field. The linear superposition method in conjunction with a linear combination of orthogonal functions is employed to obtain explicit mathematical expressions for the first and second order statistical moments of the structural compliance. Then, an interval sensitivity analysis is carried out, applying the Orthogonal Similarity Transformation (OST) method with the boundaries of each of the intermediate variable searched efficiently at every iteration using a Combinatorial Approach (CA). Finally, the validity, accuracy, and applicability of the work are rigorously checked by comparing the outputs of the proposed approach with those obtained using the particle swarm optimization (PSO) and Quasi-Monte-Carlo Simulation (QMCS) methods. Three different numerical examples with imprecise random field loads are presented to show the effectiveness and feasibility of the study.

preprint2021arXiv

Electronic transport descriptors for the rapid screening of thermoelectric materials

The discovery of novel materials for thermoelectric energy conversion has potential to be accelerated by data-driven screening combined with high-throughput calculations. One way to increase the efficacy of successfully choosing a candidate material is through its evaluation using transport descriptors. Using a data-driven screening, we selected 12 potential candidates in the trigonal ABX2 family, followed by charge transport property simulations from first principles. The results suggest that carrier scattering processes in these materials are dominated by ionised impurities and polar optical phonons, contrary to the oft-assumed acoustic-phonon-dominated scattering. Combined with calculations of thermal conductivity based on three-phonon scattering, we predict p-type AgBiS2 and TlBiTe2 as potential high-performance thermoelectrics in the intermediate temperature range for low grade waste heat harvesting, with a predicted zT above 1 at 500 K. Using these data, we further derive ground-state transport descriptors for the carrier mobility and the thermoelectric power factor. In addition to low carrier mass, high dielectric constant was found to be an important factor towards high carrier mobility. A quadratic correlation between dielectric constant and transport performance was established and further validated with literature. Looking ahead, dielectric constant can potentially be exploited as an independent tuning knob for improving the thermoelectric performance.

preprint2021arXiv

HI mapping of the Leo Triplet: Morphologies and kinematics of tails and bridges

A fully-sampled and hitherto highest resolution and sensitivity observation of neutral hydrogen (HI) in the Leo Triplet (NGC 3628, M 65/NGC 3623, and M 66/NGC 3627) reveals six HI structures beyond the three galaxies. We present detailed results of the morphologies and kinematics of these structures, which can be used for future simulations. In particular, we detect a two-arm structure in the plume of NGC 3628 for the first time, which can be explained by a tidal interaction model. The optical counterpart of the plume is mainly associated with the southern arm. The connecting part (base) of the plume (directed eastwards) with NGC 3628 is located at the blueshifted (western) side of NGC 3628. Two bases appear to be associated with the two arms of the plume. A clump with reversed velocity gradient (relative to the velocity gradient of M 66) and a newly detected tail, i.e. M 66SE, is found in the southeast of M 66. We suspect that M 66SE represents gas from NGC 3628 which was captured by M 66 in the recent interaction between the two galaxies. Meanwhile gas is falling toward M 66, resulting in features already previously observed in the southeastern part of M 66, e.g. large line widths and double peaks. An upside-down `Y&#39;-shaped HI gas component (M 65S) is detected in the south of M 65 which suggests that M 65 may also have been involved in the interaction. We strongly encourage modern hydrodynamical simulations of this interacting group of galaxies to reveal the origin of the gaseous debris surrounding all three galaxies.

preprint2020arXiv

Ab initio dipolar electron-phonon interactions in two-dimensional materials

We develop an ab initio formalism for dipolar electron-phonon interactions (EPI) in two-dimensional (2D) materials. Unlike purely longitudinal Fröhlich model, we show that the out-of-plane dipoles also contribute to the long-wavelength non-analytical behavior of EPI. And the 2D dipolar EPI plays an important role not only in the typical polar material MoS$_2$, but also in graphane and fluorinated graphene. By incorporating this formalism into Wannier-Fourier interpolation, we enable accurate EPI calculations for 2D materials and subsequent intrinsic carrier mobility prediction. The results show that Fröhlich model is inadequate for 2D materials and correct long-wavelength interaction must be included for the reliable prediction.

preprint2020arXiv

Extended HNCO, SiO, and HC$_{3}$N emission in 43 southern star-forming regions

We have selected 43 southern massive star-forming regions to study the spatial distribution of HNCO 4$_{04}$-3$_{03}$, SiO 2-1 and HC$_{3}$N 10-9 line emission and to investigate their spatial association with the dust emission. The morphology of HNCO 4$_{04}$-3$_{03}$ and HC$_{3}$N 10-9 agrees well with the dust emission. HC$_{3}$N 10-9 tends to originate from more compact regions than HNCO 4$_{04}$-3$_{03}$ and SiO 2-1. We divided our sources into three groups: those in the Central Molecular Zone (CMZ), those associated with bubbles (Bubble), and the remaining sources, which are termed &#39;normal star forming regions&#39; (NMSFR). These three groups, subdivided into three different categories with respect to line widths, integrated intensities, and column densities, hint at the presence of different physical and chemical processes. We find that the dust temperature $T_{\rm d}$, and the abundance ratios of $N_{\rm HNCO}/N_{\rm SiO}$ and $N_{\rm HNCO}/N_{\rm HC3N}$ show a decreasing trend towards the central dense regions of CMZ sources, while $N_{\rm HC3N}/N_{\rm SiO}$ moves into the opposite direction. Moreover, a better agreement is found between $T_{\rm d}$ and $N_{\rm HC3N}/N_{\rm SiO}$ in Bubble and NMSFR category sources. Both outflow and inflow activities have been found in eight of the sixteen bubble and NMSFR sources. The low outflow detection rate indicates that in these sources the SiO 2-1 line wing emission is either below our sensitivity limit or that the bulk of the SiO emission may be produced by the expansion of an H{\sc\,ii} region or supernova remnant, which has pushed molecular gas away forming a shock and yielding SiO.

preprint2020arXiv

Load Balanced Dynamic Resource Allocation for MTC Relay

A Load Balancing Relay Algorithm (LBRA) was proposed to solve the unfair spectrum resource allocation in the traditional mobile MTC relay. In order to obtain reasonable use of spectrum resources, and a balanced MTC devices (MTCDs) distribution, spectrum resources are dynamically allocated by MTCDs regrouped on the MTCD to MTC gateway link. Moreover, the system outage probability and transmission capacity are derived when using LBRA. The numerical results show that the proposed algorithm has better performance in transmission capacity and outage probability than the traditional method. LBRA had an increase in transmission capacity of about 0.7dB, and an improvement in outage probability of about 0.8dB with a high MTCD density.

preprint2020arXiv

Modeling, Analysis, and Optimization of Grant-Free NOMA in Massive MTC via Stochastic Geometry

Massive machine-type communications (mMTC) is a crucial scenario to support booming Internet of Things (IoTs) applications. In mMTC, although a large number of devices are registered to an access point (AP), very few of them are active with uplink short packet transmission at the same time, which requires novel design of protocols and receivers to enable efficient data transmission and accurate multi-user detection (MUD). Aiming at this problem, grant-free non-orthogonal multiple access (GF-NOMA) protocol is proposed. In GF-NOMA, active devices can directly transmit their preambles and data symbols altogether within one time frame, without grant from the AP. Compressive sensing (CS)-based receivers are adopted for non-orthogonal preambles (NOP)-based MUD, and successive interference cancellation is exploited to decode the superimposed data signals. In this paper, we model, analyze, and optimize the CS-based GF-MONA mMTC system via stochastic geometry (SG), from an aspect of network deployment. Based on the SG network model, we first analyze the success probability as well as the channel estimation error of the CS-based MUD in the preamble phase and then analyze the average aggregate data rate in the data phase. As IoT applications highly demands low energy consumption, low infrastructure cost, and flexible deployment, we optimize the energy efficiency and AP coverage efficiency of GF-NOMA via numerical methods. The validity of our analysis is verified via Monte Carlo simulations. Simulation results also show that CS-based GF-NOMA with NOP yields better MUD and data rate performances than contention-based GF-NOMA with orthogonal preambles and CS-based grant-free orthogonal multiple access.

preprint2020arXiv

NH$_{3}$ (1,1) hyperfine intensity anomalies in the Orion A molecular cloud

In LTE, the two inner satellite lines (ISLs) and the two outer satellite lines (OSLs) of the NH$_{3}$ (1,1) transition are each predicted to have equal intensities. However, hyperfine intensity anomalies (HIAs) are observed to be omnipresent in star formation regions, which is still not fully understood. In addressing this issue, we find that the computation method of the HIA by the ratio of the peak intensities may have defects, especially when being used to process the spectra with low velocity dispersions. Therefore we define the integrated HIAs of the ISLs (HIA$_{\rm IS}$) and OSLs (HIA$_{\rm OS}$) by the ratio of their redshifted to blueshifted integrated intensities and develop a procedure to calculate them. Based on this procedure, we present a systematic study of the integrated HIAs in the northern part of the Orion A MC. We find that integrated HIA$_{\rm IS}$ and HIA$_{\rm OS}$ are commonly present in the Orion A MC and no clear distinction is found at different locations of the MC. The medians of the integrated HIA$_{\rm IS}$ and HIA$_{\rm OS}$ are 0.921$\pm$0.003 and 1.422$\pm$0.009, respectively, which is consistent with the HIA core model and inconsistent with the CE model. Selecting those 170 positions where both integrated HIAs deviate by more than 3-$σ$ from unity, most (166) are characterized by HIA$_{\rm IS}$<1 and HIA$_{\rm OS}$>1, which suggests that the HIA core model plays a more significant role than the CE model. The remaining four positions are consistent with the CE model. We compare the integrated HIAs with the para-NH$_{3}$ column density ($N$(para-NH$_{3}$)), kinetic temperature ($T_{\rm K}$), total velocity dispersion ($σ_{\rm v}$), non-thermal velocity dispersion ($σ_{\rm NT}$), and the total opacity of the NH$_{3}$ (1,1) line ($τ_{0}$). Their correlations can not be fully explained by neither the HIA core nor the CE model.

preprint2020arXiv

Scalable Bid Landscape Forecasting in Real-time Bidding

In programmatic advertising, ad slots are usually sold using second-price (SP) auctions in real-time. The highest bidding advertiser wins but pays only the second-highest bid (known as the winning price). In SP, for a single item, the dominant strategy of each bidder is to bid the true value from the bidder&#39;s perspective. However, in a practical setting, with budget constraints, bidding the true value is a sub-optimal strategy. Hence, to devise an optimal bidding strategy, it is of utmost importance to learn the winning price distribution accurately. Moreover, a demand-side platform (DSP), which bids on behalf of advertisers, observes the winning price if it wins the auction. For losing auctions, DSPs can only treat its bidding price as the lower bound for the unknown winning price. In literature, typically censored regression is used to model such partially observed data. A common assumption in censored regression is that the winning price is drawn from a fixed variance (homoscedastic) uni-modal distribution (most often Gaussian). However, in reality, these assumptions are often violated. We relax these assumptions and propose a heteroscedastic fully parametric censored regression approach, as well as a mixture density censored network. Our approach not only generalizes censored regression but also provides flexibility to model arbitrarily distributed real-world data. Experimental evaluation on the publicly available dataset for winning price estimation demonstrates the effectiveness of our method. Furthermore, we evaluate our algorithm on one of the largest demand-side platforms and significant improvement has been achieved in comparison with the baseline solutions.

preprint2020arXiv

Structured Policy Iteration for Linear Quadratic Regulator

Linear quadratic regulator (LQR) is one of the most popular frameworks to tackle continuous Markov decision process tasks. With its fundamental theory and tractable optimal policy, LQR has been revisited and analyzed in recent years, in terms of reinforcement learning scenarios such as the model-free or model-based setting. In this paper, we introduce the \textit{Structured Policy Iteration} (S-PI) for LQR, a method capable of deriving a structured linear policy. Such a structured policy with (block) sparsity or low-rank can have significant advantages over the standard LQR policy: more interpretable, memory-efficient, and well-suited for the distributed setting. In order to derive such a policy, we first cast a regularized LQR problem when the model is known. Then, our Structured Policy Iteration (S-PI) algorithm, which takes a policy evaluation step and a policy improvement step in an iterative manner, can solve this regularized LQR efficiently. We further extend the S-PI algorithm to the model-free setting where a smoothing procedure is adopted to estimate the gradient. In both the known-model and model-free setting, we prove convergence analysis under the proper choice of parameters. Finally, the experiments demonstrate the advantages of S-PI in terms of balancing the LQR performance and level of structure by varying the weight parameter.

preprint2020arXiv

Studies of the distinct regions due to CO selective dissociation in the Aquila molecular cloud

Aims. We investigate the role of selective dissociation in the process of star formation by comparing the physical parameters of protostellar-prestellar cores and the distinct regions with the CO isotope distributions in photodissociation regions. We seek to understand whether there is a better connection between the evolutionary age of star forming regions and the effect of selective dissociation Methods. Wide-field observations of the $\rm ^{12}CO$, $\rm ^{13}CO$, and $\rm C^{18}O$ ( J = 1 - 0) emission lines are used to study the ongoing star formation activity in the Aquila molecular region, and the 70 $μ$m and 250 $μ$m data are used to describe the heating of the surrounding material and as an indicator of the evolutionary age of the core. Results. The protostellar-prestellar cores are found at locations with the highest $\rm C^{18}O$ column densities and their increasing evolutionary age would relate to an increasing 70$μ$m/250$μ$m emission ratio at their location. An evolutionary age of the cores may also follow from the $\rm ^{13}CO$ versus $\rm C^{18}O$ abundance ratio, which decreases with increasing $\rm C^{18}O$ column densities. The original mass has been estimated for nine representative star formation regions and the original mass of the region correlated well with the integrated 70 $μ$m flux density. Similarly, the $ X_{\rm ^{13}CO}$/$X_{\rm C^{18}O}$ implying the dissociation rate for these regions correlates with the 70$μ$m/250$μ$m flux density ratio and reflects the evolutionary age of the star formation activity.