Source author record

Mohit Kumar

Mohit Kumar appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Artificial Intelligence Machine Learning eess.SP Cryptography and Security math.FA Information Retrieval Networking and Internet Architecture physics.optics Social and Information Networks

Catalog footprint

What is connected

15works

9topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

Kernel Affine Hull Machines for Compute-Efficient Query-Side Semantic Encoding

Transformer-based semantic retrieval is highly effective, yet in many deployments the dominant cost lies in online query encoding rather than corpus indexing. We study the fixed-teacher query-adaptation problem and ask whether repeated neural inference can be replaced by a lightweight, analytically explicit estimator without degrading decision-relevant retrieval quality. We propose Kernel Affine Hull Machines (KAHMs), which map inexpensive lexical features into a frozen semantic embedding space by estimating prototype-mixture weights in a rigorously specified RKHS and refining prototypes via normalized least-mean-squares, yielding a transparent decomposition of encoding error into posterior-approximation, generalization, and teacher-noise components. On a controlled Austrian-law benchmark (5,000 queries; 84 laws; 10,762 units), KAHM attains the strongest teacher-space reconstruction among matched learned adapters (MSE 0.000091, R^2 0.9071, cosine 0.9536) and consistently leads rank-sensitive metrics, including mean reciprocal rank at 20 (MRR@20, the average inverse rank of the first relevant result within the top 20), Hit rate at 20 (Hit@20, the fraction of queries with at least one relevant result in the top 20), and Top-1 accuracy (the fraction of queries whose correct item is ranked first), with scores of 0.504, 0.694, and 0.411, respectively. It also reduces per-query latency by a factor of 8.5 relative to direct transformer encoding. These results demonstrate that, in fixed-teacher regimes, lightweight geometric estimators can substitute for online neural encoding, preserving retrieval performance while substantially improving efficiency and interpretability.

preprint2022arXiv

An Energy Aware Clustering Scheme for 5G-enabled Edge Computing based IoMT Framework

In recent years, 5G network systems start to offer communication infrastructure for Internet of Things (IoT) applications, especially for health care service pro-viders. In smart health care systems, edge computing enabled Internet of Medical Things (IoMT) is an innovative technology to provide online health care monitor-ing facility to patients. Here, energy consumption, along with extending the lifespan of biosensor network, is a key concern. In this contribution, a Chicken Swarm Optimization algorithm, based on Energy Efficient Multi-objective clus-tering scheme is applied in the context of IoMT system. An effective fitness func-tion is designed for cluster head selection., using multiple objectives, such as re-sidual energy, queuing delay, communication cost, link quality and node centrali-ty. Simulated outcomes of the proposed scheme are compared with the existing schemes in terms of parameters such as cluster formation time, energy consump-tion, network lifetime, throughput and propagation delay.

preprint2022arXiv

Differentially Private Transferrable Deep Learning with Membership-Mappings

This paper considers the problem of differentially private semi-supervised transfer and multi-task learning. The notion of \emph{membership-mapping} has been developed using measure theory basis to learn data representation via a fuzzy membership function. An alternative conception of deep autoencoder, referred to as \emph{Conditionally Deep Membership-Mapping Autoencoder (CDMMA)}, is considered for transferrable deep learning. Under practice-oriented settings, an analytical solution for the learning of CDMMA can be derived by means of variational optimization. The paper proposes a transfer and multi-task learning approach that combines CDMMA with a tailored noise adding mechanism to achieve a given level of privacy-loss bound with the minimum perturbation of the data. Numerous experiments were carried out using MNIST, USPS, Office, and Caltech256 datasets to verify the competitive robust performance of the proposed methodology.

preprint2022arXiv

Information Theoretic Evaluation of Privacy-Leakage, Interpretability, and Transferability for Trustworthy AI

In order to develop machine learning and deep learning models that take into account the guidelines and principles of trustworthy AI, a novel information theoretic trustworthy AI framework is introduced. A unified approach to "privacy-preserving interpretable and transferable learning" is considered for studying and optimizing the tradeoffs between privacy, interpretability, and transferability aspects. A variational membership-mapping Bayesian model is used for the analytical approximations of the defined information theoretic measures for privacy-leakage, interpretability, and transferability. The approach consists of approximating the information theoretic measures via maximizing a lower-bound using variational optimization. The study presents a unified information theoretic approach to study different aspects of trustworthy AI in a rigorous analytical manner. The approach is demonstrated through numerous experiments on benchmark datasets and a real-world biomedical application concerned with the detection of mental stress on individuals using heart rate variability analysis.

preprint2022arXiv

Learning MAX-SAT from Contextual Examples for Combinatorial Optimisation

Combinatorial optimisation problems are ubiquitous in artificial intelligence. Designing the underlying models, however, requires substantial expertise, which is a limiting factor in practice. The models typically consist of hard and soft constraints, or combine hard constraints with an objective function. We introduce a novel setting for learning combinatorial optimisation problems from contextual examples. These positive and negative examples show - in a particular context - whether the solutions are good enough or not. We develop our framework using the MAX-SAT formalism as it is simple yet powerful setting having these features. We study the learnability of MAX-SAT models. Our theoretical results show that high-quality MAX-SAT models can be learned from contextual examples in the realisable and agnostic settings, as long as the data satisfies an intuitive "representativeness" condition. We also contribute two implementations based on our theoretical results: one leverages ideas from syntax-guided synthesis while the other makes use of stochastic local search techniques. The two implementations are evaluated by recovering synthetic and benchmark models from contextual examples. The experimental results support our theoretical analysis, showing that MAX-SAT models can be learned from contextual examples. Among the two implementations, the stochastic local search learner scales much better than the syntax-guided implementation while providing comparable or better models.

preprint2022arXiv

Membership-Mappings for Data Representation Learning: Measure Theoretic Conceptualization

A fuzzy theoretic analytical approach was recently introduced that leads to efficient and robust models while addressing automatically the typical issues associated to parametric deep models. However, a formal conceptualization of the fuzzy theoretic analytical deep models is still not available. This paper introduces using measure theoretic basis the notion of \emph{membership-mapping} for representing data points through attribute values (motivated by fuzzy theory). A property of the membership-mapping, that can be exploited for data representation learning, is of providing an interpolation on the given data points in the data space. An analytical approach to the variational learning of a membership-mappings based data representation model is considered.

preprint2022arXiv

Membership-Mappings for Practical Secure Distributed Deep Learning

This study leverages the data representation capability of fuzzy based membership-mappings for practical secure distributed deep learning using fully homomorphic encryption. The impracticality issue of secure machine (deep) learning with fully homomorphic encrypted data, arising from large computational overhead, is addressed via applying fuzzy attributes. Fuzzy attributes are induced by globally convergent and robust variational membership-mappings based local deep models. Fuzzy attributes combine the local deep models in a robust and flexible manner such that the global model can be evaluated homomorphically in an efficient manner using a boolean circuit composed of bootstrapped binary gates. The proposed method, while preserving privacy in a distributed learning scenario, remains accurate, practical, and scalable. The method is evaluated through numerous experiments including demonstrations through MNIST dataset and Freiburg Groceries Dataset. Further, a biomedical application related to mental stress detection on individuals is considered.

preprint2021arXiv

Mid-Infrared Photon-Pair Generation in AgGaS$_2$

We demonstrate non-degenerate photon-pair generation by spontaneous parametric down conversion in a silver gallium sulfide AgGaS$_2$ crystal. By tuning the pump wavelength, we achieve phase matching over a large spectral range. This allows to generate idler photons in the mid-infrared spectral range above 6 $μm$ wavelength with corresponding signal photons in the visible. Also, we show photon pair generation with broad spectral bandwidth. These results are a valuable step towards the development of quantum imaging and sensing techniques in the mid-infrared.

preprint2020arXiv

Coding schemes and Applications for Weather Radars

In this paper, we describe the evolution of a pair of polyphase coded waveform for use in second trip suppression in weather radar. The polyphase codes were designed and tested on NASA weather radar. The NASA dual-frequency, dual-polarization Doppler radar (D3R) was developed primarily as a ground validation tool for the GPM satellite dual-frequency radar. Recently, the D3R radar was upgraded with new versions of digital receiver hardware and firmware, which supports larger filter lengths and multiple phase coded waveforms, and also newer IF sub-systems. This has enhanced the capabilities of radar manifolds.

preprint2020arXiv

Intermediate frequency Upgrade design features of NASA D3R Weather Radar System

The NASA dual-frequency, dual-polarization, Doppler radar (D3R) is an important ground validation tool for the global precipitation measurement (GPM) mission dual-frequency precipitation radar (DPR). The D3R has undergone extensive field trials starting in 2011 and continues to provide observations that enhance our scientific knowledge. To further enhance its capabilities, the Intermediate frequency (IF) electronics, digital receiver and waveform generation subsystems have been upgraded. Due to the new, more flexible architecture, this upgrade enables more research frontiers to be explored with better performance. One of the primary motivations for this upgrade is to enable enhanced radar sensitivity and increase range resolution to 30 meters. In this work, the D3R system upgrade will be discussed with a focus on the key upgrade design features to obtain better sensitivity and a flexible waveform capability.

preprint2020arXiv

Machine Guides, Human Supervises: Interactive Learning with Global Explanations

We introduce explanatory guided learning (XGL), a novel interactive learning strategy in which a machine guides a human supervisor toward selecting informative examples for a classifier. The guidance is provided by means of global explanations, which summarize the classifier's behavior on different regions of the instance space and expose its flaws. Compared to other explanatory interactive learning strategies, which are machine-initiated and rely on local explanations, XGL is designed to be robust against cases in which the explanations supplied by the machine oversell the classifier's quality. Moreover, XGL leverages global explanations to open up the black-box of human-initiated interaction, enabling supervisors to select informative examples that challenge the learned model. By drawing a link to interactive machine teaching, we show theoretically that global explanations are a viable approach for guiding supervisors. Our simulations show that explanatory guided learning avoids overselling the model's quality and performs comparably or better than machine- and human-initiated interactive learning strategies in terms of model quality.

preprint2020arXiv

Toward Machine-Guided, Human-Initiated Explanatory Interactive Learning

Recent work has demonstrated the promise of combining local explanations with active learning for understanding and supervising black-box models. Here we show that, under specific conditions, these algorithms may misrepresent the quality of the model being learned. The reason is that the machine illustrates its beliefs by predicting and explaining the labels of the query instances: if the machine is unaware of its own mistakes, it may end up choosing queries on which it performs artificially well. This biases the "narrative" presented by the machine to the user.We address this narrative bias by introducing explanatory guided learning, a novel interactive learning strategy in which: i) the supervisor is in charge of choosing the query instances, while ii) the machine uses global explanations to illustrate its overall behavior and to guide the supervisor toward choosing challenging, informative instances. This strategy retains the key advantages of explanatory interaction while avoiding narrative bias and compares favorably to active learning in terms of sample complexity. An initial empirical evaluation with a clustering-based prototype highlights the promise of our approach.

preprint2020arXiv

Use of adaptive filtering techniques and deconvolution to obtain low range sidelobe samples

In this paper the use of adaptive filtering techniques to obtain better peak sidelobe suppression and integrated sidelobe energy will be discussed with regard to weather radars and obtaining better sensitivity with this technique. The performance of these new coefficient sets obtained with adaptive filter (using RLS optimization) will be discussed and presented. They will also be compared with the existing techniques and their peak sidelobe levels.

preprint2019arXiv

Intra-Pulse Polyphase Coding System for Second Trip Suppression in a Weather Radar

This paper describes the design and implementation of intra-pulse polyphase codes for a weather radar system. Algorithms to generate codes with good correlation properties are discussed. Thereafter, a new design framework is described, which optimizes the polyphase code and corresponding mismatched filter, using a cost/error function, especially for weather radars. It establishes the performance of these intra-pulse techniques with specific application towards second trip removal. The developed code is implemented on NASA D3R, which is a dual-frequency, dual-polarization, Doppler weather radar system.

preprint2015arXiv

EdgeCentric: Anomaly Detection in Edge-Attributed Networks

Given a network with attributed edges, how can we identify anomalous behavior? Networks with edge attributes are commonplace in the real world. For example, edges in e-commerce networks often indicate how users rated products and services in terms of number of stars, and edges in online social and phonecall networks contain temporal information about when friendships were formed and when users communicated with each other -- in such cases, edge attributes capture information about how the adjacent nodes interact with other entities in the network. In this paper, we aim to utilize exactly this information to discern suspicious from typical node behavior. Our work has a number of notable contributions, including (a) formulation: while most other graph-based anomaly detection works use structural graph connectivity or node information, we focus on the new problem of leveraging edge information, (b) methodology: we introduce EdgeCentric, an intuitive and scalable compression-based approach for detecting edge-attributed graph anomalies, and (c) practicality: we show that EdgeCentric successfully spots numerous such anomalies in several large, edge-attributed real-world graphs, including the Flipkart e-commerce graph with over 3 million product reviews between 1.1 million users and 545 thousand products, where it achieved 0.87 precision over the top 100 results.

Mohit Kumar

What is connected

Connect this record

See the researcher in context

Building this map preview

15 published item(s)

Kernel Affine Hull Machines for Compute-Efficient Query-Side Semantic Encoding

An Energy Aware Clustering Scheme for 5G-enabled Edge Computing based IoMT Framework

Differentially Private Transferrable Deep Learning with Membership-Mappings

Information Theoretic Evaluation of Privacy-Leakage, Interpretability, and Transferability for Trustworthy AI

Learning MAX-SAT from Contextual Examples for Combinatorial Optimisation

Membership-Mappings for Data Representation Learning: Measure Theoretic Conceptualization

Membership-Mappings for Practical Secure Distributed Deep Learning

Mid-Infrared Photon-Pair Generation in AgGaS$_2$

Coding schemes and Applications for Weather Radars

Intermediate frequency Upgrade design features of NASA D3R Weather Radar System

Machine Guides, Human Supervises: Interactive Learning with Global Explanations

Toward Machine-Guided, Human-Initiated Explanatory Interactive Learning

Use of adaptive filtering techniques and deconvolution to obtain low range sidelobe samples

Intra-Pulse Polyphase Coding System for Second Trip Suppression in a Weather Radar

EdgeCentric: Anomaly Detection in Edge-Attributed Networks