Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
15works
0followers
20topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

15 published item(s)

preprint2026arXiv

Can Large Language Models Understand, Reason About, and Generate Code-Switched Text?

Code-switching is a pervasive phenomenon in multilingual communication, yet the robustness of large language models (LLMs) in mixed-language settings remains insufficiently understood. In this work, we present a comprehensive evaluation of LLM capabilities in understanding, reasoning over, and generating code-switched text. We introduce CodeMixQA a novel benchmark with high-quality human annotations, comprising 16 diverse parallel code-switched language-pair variants that span multiple geographic regions and code-switching patterns, and include both original scripts and their transliterated forms. Using this benchmark, we analyze the reasoning behavior of LLMs on code-switched question-answering tasks, shedding light on how models process and reason over mixed-language inputs. We further conduct a systematic evaluation of LLM-generated synthetic code-switched text, focusing on both naturalness and semantic fidelity, and uncover key limitations in current generation capabilities. Our findings reveal persistent challenges in both reasoning and generation under code-switching conditions and provide actionable insights for building more robust multilingual LLMs. We release the dataset and code as open source.

preprint2023arXiv

Neutrino secret self-interactions: a booster shot for the cosmic neutrino background

Neutrinos might interact among themselves through forces that have so far remained hidden. Throughout the history of the Universe, such \emph{secret} interactions could lead to scatterings between the neutrinos from supernova explosions and the non-relativistic relic neutrinos left over from the Big Bang. Such scatterings can boost the cosmic neutrino background (C$ν$B) to energies of ${\cal O}$(MeV), making it, in principle, observable in experiments searching for the diffuse supernova neutrino background. Assuming a model-independent, but flavor universal, four-Fermi interaction, we determine the upscattered cosmic neutrino flux, and derive constraints on such secret interactions from the latest results from Super-Kamiokande. Furthermore, we also study prospects for detection of the boosted flux in future lead-based coherent elastic neutrino-nucleus scattering experiments. Nevertheless, given current constraints on flavor universal self-interactions, we find that the upscattered C$ν$B~contribution to the total DSNB flux is negligible, making a possible measurement of the boosted C$ν$B insurmountable.

preprint2022arXiv

Boosted dark matter from diffuse supernova neutrinos

The XENON collaboration recently reported an excess of electron recoil events in the low energy region with a significance of around $3.3σ$. An explanation of this excess in terms of thermal dark matter seems challenging. We propose a scenario where dark matter in the Milky Way halo gets boosted as a result of scattering with the diffuse supernova neutrino background. This interaction can accelerate the dark-matter to semi-relativistic velocities, and this flux, in turn, can scatter with the electrons in the detector, thereby providing a much better fit to the data. We identify regions in the parameter space of dark-matter mass and interaction cross-section which satisfy the excess. Furthermore, considering the data only hypothesis, we also impose bounds on the dark-matter scattering cross-section, which are competitive with bounds from other experiments.

preprint2022arXiv

Multi-Band Superconductivity in Strongly Hybridized 1T'-WTe$_2$/NbSe$_2$ Heterostructures

The interplay of topology and superconductivity has become a subject of intense research in condensed matter physics for the pursuit of topologically non-trivial forms of superconducting pairing. An intrinsically normal-conducting material can inherit superconductivity via electrical contact to a parent superconductor via the proximity effect, usually understood as Andreev reflection at the interface between the distinct electronic structures of two separate conductors. However, at high interface transparency, strong coupling inevitably leads to changes in the band structure, locally, owing to hybridization of electronic states. Here, we investigate such strongly proximity-coupled heterostructures of monolayer 1T'-WTe$_2$, grown on NbSe$_2$ by van-der-Waals epitaxy. The superconducting local density of states (LDOS), resolved in scanning tunneling spectroscopy down to 500~mK, reflects a hybrid electronic structure, well-described by a multi-band framework based on the McMillan equations which captures the multi-band superconductivity inherent to the NbSe$_2$ substrate and that induced by proximity in WTe$_2$, self-consistently. Our material-specific tight-binding model captures the hybridized heterostructure quantitatively, and confirms that strong inter-layer hopping gives rise to a semi-metallic density of states in the 2D WTe$_2$ bulk, even for nominally band-insulating crystals. The model further accurately predicts the measured order parameter $Δ\simeq 0.6$~meV induced in the WTe$_2$ monolayer bulk, stable beyond a 2~T magnetic field. We believe that our detailed multi-band analysis of the hybrid electronic structure provides a useful tool for sensitive spatial mapping of induced order parameters in proximitized atomically thin topological materials.

preprint2022arXiv

Multi-Level Local SGD for Heterogeneous Hierarchical Networks

We propose Multi-Level Local SGD, a distributed gradient method for learning a smooth, non-convex objective in a heterogeneous multi-level network. Our network model consists of a set of disjoint sub-networks, with a single hub and multiple worker nodes; further, worker nodes may have different operating rates. The hubs exchange information with one another via a connected, but not necessarily complete communication network. In our algorithm, sub-networks execute a distributed SGD algorithm, using a hub-and-spoke paradigm, and the hubs periodically average their models with neighboring hubs. We first provide a unified mathematical framework that describes the Multi-Level Local SGD algorithm. We then present a theoretical analysis of the algorithm; our analysis shows the dependence of the convergence error on the worker node heterogeneity, hub network topology, and the number of local, sub-network, and global iterations. We back up our theoretical results via simulation-based experiments using both convex and non-convex objectives.

preprint2022arXiv

Stellar Shocks From Dark Matter Asteroid Impacts

Macroscopic dark matter is almost unconstrained over a wide "asteroid-like" mass range, where it could scatter on baryonic matter with geometric cross section. We show that when such an object travels through a star, it produces shock waves which reach the stellar surface, leading to a distinctive transient optical, UV and X-ray emission. This signature can be searched for on a variety of stellar types and locations. In a dense globular cluster, such events occur far more often than flare backgrounds, and an existing UV telescope could probe orders of magnitude in dark matter mass in one week of dedicated observation.

preprint2022arXiv

UniPreCIS : A data pre-processing solution for collocated services on shared IoT

Next-generation smart city applications, attributed by the power of Internet of Things (IoT) and Cyber-Physical Systems (CPS), significantly rely on the quality of sensing data. With an exponential increase in intelligent applications for urban development and enterprises offering sensing-as-aservice these days, it is imperative to provision for a shared sensing infrastructure for better utilization of resources. However, a shared sensing infrastructure that leverages low-cost sensing devices for a cost effective solution, still remains an unexplored territory. A significant research effort is still needed to make edge based data shaping solutions, more reliable, feature-rich and costeffective while addressing the associated challenges in sharing the sensing infrastructure among multiple collocated services with diverse Quality of Service (QoS) requirements. Towards this, we propose a novel edge based data pre-processing solution, named UniPreCIS that accounts for the inherent characteristics of lowcost ambient sensors and the exhibited measurement dynamics with respect to application-specific QoS. UniPreCIS aims to identify and select quality data sources by performing sensor ranking and selection followed by multimodal data pre-processing in order to meet heterogeneous application QoS and at the same time reducing the resource consumption footprint for the resource constrained network edge. As observed, the processing time and memory utilization has been reduced in the proposed approach while achieving upto 90% accuracy which is arguably significant as compared to state-of-the-art techniques for sensing. The effectiveness of UniPreCIS has been evaluated on a testbed for a specific use case of indoor occupancy estimation that proves its effectiveness.

preprint2021arXiv

Multi-Tier Federated Learning for Vertically Partitioned Data

We consider decentralized model training in tiered communication networks. Our network model consists of a set of silos, each holding a vertical partition of the data. Each silo contains a hub and a set of clients, with the silo's vertical data shard partitioned horizontally across its clients. We propose Tiered Decentralized Coordinate Descent (TDCD), a communication-efficient decentralized training algorithm for such two-tiered networks. To reduce communication overhead, the clients in each silo perform multiple local gradient steps before sharing updates with their hub. Each hub adjusts its coordinates by averaging its workers' updates, and then hubs exchange intermediate updates with one another. We present a theoretical analysis of our algorithm and show the dependence of the convergence rate on the number of vertical partitions, the number of local updates, and the number of clients in each hub. We further validate our approach empirically via simulation-based experiments using a variety of datasets and both convex and non-convex objectives.

preprint2021arXiv

Thermal effects on collective modes in disordered $s$-wave superconductors

We investigate the effect of thermal fluctuations on the two-particle spectral function for a disordered $s$-wave superconductor in two dimensions, focusing on the evolution of the collective amplitude and phase modes. We find three main effects of thermal fluctuations: (a) the phase mode is softened with increasing temperature reflecting the decrease of superfluid stiffness; (b) remarkably, the non-dispersive collective amplitude modes at finite energy near ${\bf q}=[0,0]$ and ${\bf q}=[π,π]$ survive even in presence of thermal fluctuations in the disordered superconductor; and (c) the scattering of the thermally excited fermionic quasiparticles leads to low energy incoherent spectral weight that forms a strongly momentum-dependent background halo around the phase and amplitude collective modes and broadens them. Due to momentum and energy conservation constraints, this halo has a boundary which disperses linearly at low momenta and shows a strong dip near the $[π,π]$ point in the Brillouin zone.

preprint2020arXiv

Diffusion and Consensus in a Weakly Coupled Network of Networks

We study diffusion and consensus dynamics in a Network of Networks model. In this model, there is a collection of sub-networks, connected to one another using a small number of links. We consider a setting where the links between networks have small weights, or are used less frequently than links within each sub-network. Using spectral perturbation theory, we analyze the diffusion rate and convergence rate of the investigated systems. Our analysis shows that the first order approximation of the diffusion and convergence rates is independent of the topologies of the individual graphs; the rates depend only on the number of nodes in each graph and the topology of the connecting edges. The second order analysis shows a relationship between the diffusion and convergence rates and the information centrality of the connecting nodes within each sub-network. We further highlight these theoretical results through numerical examples.

preprint2020arXiv

Galactic Positron Excess from Selectively Enhanced Dark Matter Annihilation

Precision measurements of the positron flux in cosmic ray have revealed an unexplained bump in the spectrum around $E\simeq 300\,\mathrm{GeV}$, not clearly attributable to known astrophysical processes. We propose annihilation of dark matter of mass $m_χ= 780\,\mathrm{GeV}$ with a late-time cross section $σv = 4.63\times 10^{-24}\,\mathrm{cm^3\,s^{-1}}$ as a possible source. The nonmonotonic dependence of the annihilation rate on dark matter velocity, owing to a selective $p$-wave Sommerfeld enhancement, allows such a large signal from the Milky Way without violating corresponding constraints from CMB and dwarf galaxy observations. We briefly explore other signatures of this scenario, and outline avenues to test it in future experiments.

preprint2020arXiv

Mixed WIMP-axion dark matter

We study the experimental constraints on a model of a two-component dark matter, consisting of the QCD axion, and a scalar particle, both contributing to the dark matter relic abundance of the Universe. The global Peccei-Quinn symmetry of the theory can be spontaneously broken down to a residual $\mathbb{Z}_2$ symmetry, thereby identifying this scalar as a stable weakly interacting massive particle, i.e., a dark matter candidate, in addition to the axion. We perform a comprehensive study of the model using the latest data from dark matter direct and indirect detection experiments, as well as new physics searches at the Large Hadron Collider. We find that although the model is mostly constrained by the dark matter detection experiments, it is still viable around a small region of the parameter space where the scalar dark matter is half as heavy as the Standard Model Higgs. In this allowed region, the bounds from these experiments are evaded due to a cancellation mechanism in the dark matter--Higgs coupling. The collider search results, however, are shown to impose weak bounds on the model.

preprint2020arXiv

Performance Optimization for Edge-Cloud Serverless Platforms via Dynamic Task Placement

We present a framework for performance optimization in serverless edge-cloud platforms using dynamic task placement. We focus on applications for smart edge devices, for example, smart cameras or speakers, that need to perform processing tasks on input data in real to near-real time. Our framework allows the user to specify cost and latency requirements for each application task, and for each input, it determines whether to execute the task on the edge device or in the cloud. Further, for cloud executions, the framework identifies the container resource configuration needed to satisfy the performance goals. We have evaluated our framework in simulation using measurements collected from serverless applications in AWS Lambda and AWS Greengrass. In addition, we have implemented a prototype of our framework that runs in these same platforms. In experiments with our prototype, our models can predict average end-to-end latency with less than 6% error, and we obtain almost three orders of magnitude reduction in end-to-end latency compared to edge-only execution.

preprint2020arXiv

Skedulix: Hybrid Cloud Scheduling for Cost-Efficient Execution of Serverless Applications

We present a framework for scheduling multifunction serverless applications over a hybrid public-private cloud. A set of serverless jobs is input as a batch, and the objective is to schedule function executions over the hybrid platform to minimize the cost of public cloud use, while completing all jobs by a specified deadline. As this scheduling problem is NP-Hard, we propose a greedy algorithm that dynamically determines both the order and placement of each function execution using predictive models of function execution time and network latencies. We present a prototype implementation of our framework that uses AWS Lambda and OpenFaaS, for the public and private cloud, respectively. We evaluate our prototype in live experiments using a mixture of compute and I/O heavy serverless applications. Our results show that our framework can achieve a speedup in batch processing of up to 1.92 times that of an approach that uses only the private cloud, at 40.5% the cost of an approach that uses only the public cloud.