Researcher profile

Arun Kumar

Arun Kumar contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
15works
0followers
13topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

15 published item(s)

preprint2026arXiv

Multi-RADS Synthetic Radiology Report Dataset and Head-to-Head Benchmarking of 41 Open-Weight and Proprietary Language Models

Background: Reporting and Data Systems (RADS) standardize radiology risk communication but automated RADS assignment from narrative reports is challenging because of guideline complexity, output-format constraints, and limited benchmarking across RADS frameworks and model sizes. Purpose: To create RXL-RADSet, a radiologist-verified synthetic multi-RADS benchmark, and compare validity and accuracy of open-weight small language models (SLMs) with a proprietary model for RADS assignment. Materials and Methods: RXL-RADSet contains 1,600 synthetic radiology reports across 10 RADS (BI-RADS, CAD-RADS, GB-RADS, LI-RADS, Lung-RADS, NI-RADS, O-RADS, PI-RADS, TI-RADS, VI-RADS) and multiple modalities. Reports were generated by LLMs using scenario plans and simulated radiologist styles and underwent two-stage radiologist verification. We evaluated 41 quantized SLMs (12 families, 0.135-32B parameters) and GPT-5.2 under a fixed guided prompt. Primary endpoints were validity and accuracy; a secondary analysis compared guided versus zero-shot prompting. Results: Under guided prompting GPT-5.2 achieved 99.8% validity and 81.1% accuracy (1,600 predictions). Pooled SLMs (65,600 predictions) achieved 96.8% validity and 61.1% accuracy; top SLMs in the 20-32B range reached ~99% validity and mid-to-high 70% accuracy. Performance scaled with model size (inflection between <1B and >=10B) and declined with RADS complexity primarily due to classification difficulty rather than invalid outputs. Guided prompting improved validity (99.2% vs 96.7%) and accuracy (78.5% vs 69.6%) compared with zero-shot. Conclusion: RXL-RADSet provides a radiologist-verified multi-RADS benchmark; large SLMs (20-32B) can approach proprietary-model performance under guided prompting, but gaps remain for higher-complexity schemes.

preprint2022arXiv

AWESOME: Empowering Scalable Data Science on Social Media Data with an Optimized Tri-Store Data System

Modern data science applications increasingly use heterogeneous data sources and analytics. This has led to growing interest in polystore systems, especially analytical polystores. In this work, we focus on emerging multi-data model analytics workloads over social media data that fluidly straddle relational, graph, and text analytics. Instead of a generic polystore, we build a &#34;tri-store&#34; system that is more aware of the underlying data models to better optimize execution to improve scalability and runtime efficiency. We name our system AWESOME (Analytics WorkbEnch for SOcial MEdia). It features a powerful domain-specific language named ADIL. ADIL builds on top of underlying query engines (e.g., SQL and Cypher) and features native data types for succinctly specifying cross-engine queries and NLP operations, as well as automatic in-memory and query optimizations. Using real-world tri-model analytical workloads and datasets, we empirically demonstrate the functionalities of AWESOME for scalable data science over social media data and evaluate its efficiency.

preprint2022arXiv

Bardeen Black Holes in the Regularized $4D$ Einstein--Gauss--Bonnet Gravity

We obtain exact Bardeen black holes to the regularized $4D$ Einstein--Gauss--Bonnet (EGB) gravity minimally coupled with the nonlinear electrodynamics (NED). In turn, we analyze the horizon structure to determine the effect of GB parameter $α$ on the minimum cutoff values of mass, $M_0$, and magnetic monopole charge, $g_0$, for the existence of a black hole horizon. We obtain an exact expression for thermodynamic quantities, namely, Hawking temperature $T_+$, entropy $S_+$, Helmholtz free energy $F_+$, and specific heat $C_+$ associated with the black hole horizon, and they show significant deviations from the $4D$ EGB case owing to NED. Interestingly, there exists a critical value of horizon radius, $r_+^{c}$, corresponding to the local maximum of Hawking temperature, at which heat capacity diverges, confirming the second-order phase transition. A discussion on the black holes of alternate regularized $4D$ EGB gravity belonging to the scalar-tensor theory is appended.

preprint2022arXiv

Figure-of-merit for Semi-transparent Solar Cells

Semi-transparent Solar Cells (ST-SCs) has emerged as one of the most prominent energy harvesting technology that combines the benefits of light transparency and light-to-electricity conversion. The biggest opportunities for such technologies lie in their integration as windows and skylights within energy-sustainable buildings or combining them with other solar cell technologies in tandem configuration. The performance of ST-SCs is mainly determined by the trade-off between the competing parameters of the capability to convert the incident light into electricity while allowing some parts to transmit providing transparency through the device. Depending on the target application, the selection of ST-SCs is a tricky affair as some devices might offer high efficiency but compromises transparency and vice-versa. On the other way around, this is again not helped by the fact that due to advancements in materials engineering, processing, and characterization, a vastly different combination of efficiency and transparency has been reported by research groups. So, in order to quantify the performance of ST-SCs, we proposed, a figure-of-merit (FoM) which can be used as a tool that can help in analysing and comparing the performance among various ST-SCs. The defined FoM focuses on the power conversion efficiency of the device, bifaciality factor, transmittance in the desired region, and that corresponding to 550 nm wavelength. Additionally, in this work, we have been shown how the proposed FoM can be correlated for tandem and building-integrated photovoltaics applications. Based on these resultant parameters, FoM is calculated and compared for different device architectures available in the literature. The proposed FoM shall serve as a meaningful guiding path to the researchers for the development of advanced ST-SCs.

preprint2022arXiv

GARTFIMA Process and its Empirical Spectral Density Based Estimation

In this article, we introduce a Gegenbauer autoregressive tempered fractionally integrated moving average (GARTFIMA) process. We work on the spectral density and autocovariance function for the introduced process. The parameter estimation is done using the empirical spectral density with the help of the nonlinear least square technique and the Whittle likelihood estimation technique. The performance of the proposed estimation techniques is assessed on simulated data. Further, the introduced process is shown to better model the real-world data in comparison to other time series models.

preprint2022arXiv

Hydra: A System for Large Multi-Model Deep Learning

Scaling up model depth and size is now a common approach to raise accuracy in many deep learning (DL) applications, as evidenced by the widespread success of multi-billion or even trillion parameter models in natural language processing (NLP) research. Despite success in DL research and at major technology companies, broader practical adoption of such large models among domain scientists and businesses is still bottlenecked by GPU memory limits, high training costs, and low GPU availability, even on public clouds. Model selection needs further compound these resource challenges: users often need to compare dozens of models with different hyper-parameters or neural architectures to suit their specific task and dataset. In this paper, we present Hydra, a system designed to tackle such challenges by enabling out-of-the-box scaling for multi-large-model DL workloads on even commodity GPUs in a resource-efficient manner. Hydra is the first approach to holistically optimize the execution of multi-model workloads for large DL models. We do this by adapting prior &#34;model-parallel&#34; execution schemes to work with scalable parameter offloading across the memory hierarchy and further hybridizing this approach with task-parallel job scheduling techniques. Hydra decouples scalability of model parameters from parallelism of execution, thus enabling DL users to train even a 6-billion parameter model on a single commodity GPU. It also fully exploits the speedup potential of task parallelism in multi-GPU setups, yielding near-linear strong scaling and making rigorous model selection perhaps more practical for such models. We evaluate end-to-end performance by fine-tuning GPT-2 for language modeling. We find that Hydra offers between 50% and 100% higher training throughput than even the best settings of state-of-the-art industrial frameworks such as DeepSpeed and GPipe for multi-large-model training.

preprint2022arXiv

Nonsingular Black Holes in $4D$ Einstein--Gauss--Bonnet Gravity

Recently, several methods have been proposed to regularize a $D \to 4$ limit of Einstein-Gauss-Bonnet (EGB), leading to nontrivial gravitational dynamics in $4D$. We present an exact nonsingular black hole solution in the $4D$ EGB gravity coupled to non-linear electrodynamics and analyze their thermodynamic properties to calculate precise expressions for the black hole mass, temperature, and entropy. Because of the magnetic charge, the thermodynamic quantities are corrected, and the Hawking--Page phase transition is achievable with diverges of the heat capacity at a larger critical radius $r=r_{+}^{C}$ in comparison to the $5D$ counterpart where the temperature is maximum. Thus, we have a black hole with Cauchy and event horizons, and its evaporation leads to a thermodynamically stable extremal black hole remnant with vanishing temperature, and its size is larger than the $5D$ counterpart. The entropy does not satisfy the usual exact horizon Bekenstein--Hawking area law of general relativity with a logarithmic area correction term.

preprint2022arXiv

Thermodynamic stability and P-V criticality of nonsingular-AdS black holes endowed with clouds of strings

We investigate the extended phase space thermodynamics of nonsingular-AdS black holes minimally coupled to clouds of strings in which we consider the cosmological constant ($Λ$) as the pressure ($P$) of the black holes and its conjugate variable thermodynamical volume ($V$) of the black holes. Owing to the background clouds of strings parameter ($a$), we analyse the Hawking temperature, entropy and specific heat on horizon radius for fixed-parameter $k$. We find that the strings clouds background does not alter small/large black hole (SBH/LBH) phase transition but occurs at a larger horizon radius, and two second-order phase transitions occur at a smaller horizon radius. Indeed, the $G$--$T$ plots exhibit a swallowtail below the critical pressure, implying that the first-order phase transition is analogous to the liquid-gas phase transition at a lower temperature and lower critical pressure. To further examine the analogy between nonsingular-AdS black holes and a liquid-gas system, we derive the exact critical points and probe the effects of a cloud of strings on $P-V$ criticality to find that the isotherms undergo liquid-gas like phase transition for $\tilde{T}\,<\,\tilde{T}_c$ at lower $\tilde{T}_c$. We have also calculated the critical exponents identical with Van der Walls fluid, i.e., same as those obtained before for arbitrary other AdS black holes, which implies that the background clouds of strings do not change the critical exponents.

preprint2021arXiv

Morphological Segmentation Inside-Out

Morphological segmentation has traditionally been modeled with non-hierarchical models, which yield flat segmentations as output. In many cases, however, proper morphological analysis requires hierarchical structure -- especially in the case of derivational morphology. In this work, we introduce a discriminative, joint model of morphological segmentation along with the orthographic changes that occur during word formation. To the best of our knowledge, this is the first attempt to approach discriminative segmentation with a context-free model. Additionally, we release an annotated treebank of 7454 English words with constituency parses, encouraging future research in this area.

preprint2020arXiv

A Methodology to Assess the Human Factors Associated with Lunar Teleoperated Assembly Tasks

Low-latency telerobotics can enable more intricate surface tasks on extraterrestrial planetary bodies than has ever been attempted. For humanity to create a sustainable lunar presence, well-developed collaboration between humans and robots is necessary to perform complex tasks. This paper presents a methodology to assess the human factors, situational awareness (SA) and cognitive load (CL), associated with teleoperated assembly tasks. Currently, telerobotic assembly on an extraterrestrial body has never been attempted, and a valid methodology to assess the associated human factors has not been developed. The Telerobotics Laboratory at the University of Colorado-Boulder created the Telerobotic Simulation System (TSS) which enables remote operation of a rover and a robotic arm. The TSS was used in a laboratory experiment designed as an analog to a lunar mission. The operator&#39;s task was to assemble a radio interferometer. Each participant completed this task under two conditions, remote teleoperation (limited SA) and local operation (optimal SA). The goal of the experiment was to establish a methodology to accurately measure the operator&#39;s SA and CL while performing teleoperated assembly tasks. A successful methodology would yield results showing greater SA and lower CL while operating locally. Performance metrics showed greater SA and lower CL in the local environment, supported by a 27% increase in the mean time to completion of the assembly task when operating remotely. Subjective measurements of SA and CL did not align with the performance metrics. Results from this experiment will guide future work attempting to accurately quantify the human factors associated with telerobotic assembly. Once an accurate methodology has been developed, we will be able to measure how new variables affect an operator&#39;s SA and CL to optimize the efficiency and effectiveness of telerobotic assembly tasks.

preprint2020arXiv

Hayward black holes in Einstein-Gauss-Bonnet gravity

The Hayward metric is a spherically symmetric charged regular black holes, a modification of the Reisnner-Nordstr$\ddot{o}$m black holes of Einstein&#39;s equations coupled to nonlinear electrodynamics. We consider Einstein-Gauss-Bonnet gravity (EGB) coupled to nonlinear electrodynamics to present an exact five dimension ($5D$) Hayward black holes with a regular center, having inner (Cauchy) and outer (event) horizons which go over to Boulware-Desser black holes when the charge is switched off ($e=0$). The presence of charge $e$ leads the modification in thermodynamical quantities, and it has also been shown that the Hawking-Page like phase transition can be achieved. The specific heat shows divergence at the horizon radius $r=r_C$ (critical radius), where the temperature has a maximum. Our result in the limit, $e\to0$, reduces vis-a-vis to the $5D$ Boulware-Desser solutions.

preprint2020arXiv

Potential Theory of Normal Tempered Stable Process

In this article, we study the potential theory of normal tempered stable process which is obtained by time-changing the Brownian motion with a tempered stable subordinator. Precisely, we study the asymptotic behavior of potential density and Levy density associated with tempered stable subordinator and the Green function and the Levy density associated with the normal tempered stable process. We also provide the corresponding results for normal inverse Gaussian process which is a well studied process in literature.

preprint2020arXiv

Understanding and Benchmarking the Impact of GDPR on Database Systems

The General Data Protection Regulation (GDPR) provides new rights and protections to European people concerning their personal data. We analyze GDPR from a systems perspective, translating its legal articles into a set of capabilities and characteristics that compliant systems must support. Our analysis reveals the phenomenon of metadata explosion, wherein large quantities of metadata needs to be stored along with the personal data to satisfy the GDPR requirements. Our analysis also helps us identify new workloads that must be supported under GDPR. We design and implement an open-source benchmark called GDPRbench that consists of workloads and metrics needed to understand and assess personal-data processing database systems. To gauge the readiness of modern database systems for GDPR, we follow best practices and developer recommendations to modify Redis, PostgreSQL, and a commercial database system to be GDPR compliant. Our experiments demonstrate that the resulting GDPR compliant systems achieve poor performance on GPDR workloads, and that performance scales poorly as the volume of personal data increases. We discuss the real-world implications of these findings, and identify research challenges towards making GDPR compliance efficient in production environments. We release all of our software artifacts and datasets at http://www.gdprbench.org

preprint2019arXiv

A novel Algorithm for Optimal Placement of Multiple Inertial Sensors to Improve the Sensing Accuracy

This paper proposes a novel algorithm to determine the optimal placement of redundant inertial sensors such as accelerometers and gyroscopes (gyros) for increasing the sensing accuracy. In this paper, we have proposed a novel iterative algorithm to find the optimal sensor configuration. The proposed algorithm utilizes the majorization-minimization (MM) algorithm and the duality principle to find the optimal configuration. Unlike the state-of-the-art which are mainly geometrical in nature and restricted to certain noise statistics, the proposed algorithm gives the exact positions of the sensors, and moreover, the proposed algorithm is independent of the nature of the noise at different sensors. The proposed alogrithm has been implemented and tested via numerical simulation in the MATLAB. The simulation results show that the algorithm converges to the optimal configurations and show the effectiveness of the proposed algorithm.

preprint2015arXiv

Kleene Algebras and Logic: Boolean and Rough Set Representations, 3-valued, Rough and Perp Semantics

A structural theorem for Kleene algebras is proved, showing that an element of a Kleene algebra can be looked upon as an ordered pair of sets. Further, we show that negation with the Kleene property (called the `Kleene negation&#39;) always arises from the set theoretic complement. The corresponding propositional logic is then studied through a 3-valued and rough set semantics. It is also established that Kleene negation can be considered as a modal operator, and enables giving a perp semantics to the logic. One concludes with the observation that all the semantics for this logic are equivalent.