Source author record

Sourav Dutta

Sourav Dutta appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Catalog footprint

What is connected

34works

20topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2022arXiv

ACO based Adaptive RBFN Control for Robot Manipulators

This paper describes a new approach for approximating the inverse kinematics of a manipulator using an Ant Colony Optimization (ACO) based RBFN (Radial Basis Function Network). In this paper, a training solution using the ACO and the LMS (Least Mean Square) algorithm is presented in a two-phase training procedure. To settle the problem that the cluster results of k-mean clustering Radial Basis Function (RBF) are easy to be influenced by the selection of initial characters and converge to a local minimum, Ant Colony Optimization (ACO) for the RBF neural networks which will optimize the center of RBF neural networks and reduce the number of the hidden layer neurons nodes is presented. The result demonstrates that the accuracy of Ant Colony Optimization for the Radial Basis Function (RBF) neural networks is higher, and the extent of fitting has been improved.

preprint2022arXiv

Aligned Weight Regularizers for Pruning Pretrained Neural Networks

While various avenues of research have been explored for iterative pruning, little is known what effect pruning has on zero-shot test performance and its potential implications on the choice of pruning criteria. This pruning setup is particularly important for cross-lingual models that implicitly learn alignment between language representations during pretraining, which if distorted via pruning, not only leads to poorer performance on language data used for retraining but also on zero-shot languages that are evaluated. In this work, we show that there is a clear performance discrepancy in magnitude-based pruning when comparing standard supervised learning to the zero-shot setting. From this finding, we propose two weight regularizers that aim to maximize the alignment between units of pruned and unpruned networks to mitigate alignment distortion in pruned cross-lingual models and perform well for both non zero-shot and zero-shot settings. We provide experimental results on cross-lingual tasks for the zero-shot setting using XLM-RoBERTa$_{\mathrm{Base}}$, where we also find that pruning has varying degrees of representational degradation depending on the language corresponding to the zero-shot test set. This is also the first study that focuses on cross-lingual language model compression.

preprint2021arXiv

An Ising Hamiltonian Solver using Stochastic Phase-Transition Nano- Oscillators

Computationally hard problems, including combinatorial optimization, can be mapped into the problem of finding the ground-state of an Ising Hamiltonian. Building physical systems with collective computational ability and distributed parallel processing capability can accelerate the ground-state search. Here, we present a continuous-time dynamical system (CTDS) approach where the ground-state solution appears as stable points or attractor states of the CTDS. We harness the emergent dynamics of a network of phase-transition nano-oscillators (PTNO) to build an Ising Hamiltonian solver. The hardware fabric comprises of electrically coupled injection-locked stochastic PTNOs with bi-stable phases emulating artificial Ising spins. We demonstrate the ability of the stochastic PTNO-CTDS to progressively find more optimal solution by increasing the strength of the injection-locking signal - akin to performing classical annealing. We demonstrate in silico that the PTNO-CTDS prototype solves a benchmark non-deterministic polynomial time (NP)-hard Max-Cut problem with high probability of success. Using experimentally calibrated numerical simulations and incorporating non-idealities, we investigate the performance of our Ising Hamiltonian solver on dense Max-Cut problems with increasing graph size. We report a high energy-efficiency of 1.3x10^7 solutions/sec/Watt for 100-node dense Max-cut problems which translates to a 5x improvement over the recently demonstrated memristor-based Hopfield network and several orders of magnitude improvement over other candidates such as CPU and GPU, quantum annealer and photonic Ising solver approaches. Such an energy efficient hardware exhibiting high solution-throughput/Watt can find applications in industrial planning and manufacturing, defense and cyber-security, bioinformatics and drug discovery.

preprint2021arXiv

Logic Compatible High-Performance Ferroelectric Transistor Memory

Silicon ferroelectric field-effect transistors (FeFETs) with low-k interfacial layer (IL) between ferroelectric gate stack and silicon channel suffers from high write voltage, limited write endurance and large read-after-write latency due to early IL breakdown and charge trapping and detrapping at the interface. We demonstrate low voltage, high speed memory operation with high write endurance using an IL-free back-end-of-line (BEOL) compatible FeFET. We fabricate IL-free FeFETs with 28nm channel length and 126nm width under a thermal budget <400C by integrating 5nm thick Hf0.5Zr0.5O2 gate stack with amorphous Indium Tungsten Oxide (IWO) semiconductor channel. We report 1.2V memory window and read current window of 10^5 for program and erase, write latency of 20ns with +/-2V write pulses, read-after-write latency <200ns, write endurance cycles exceeding 5x10^10 and 2-bit/cell programming capability. Array-level analysis establishes IL-free BEOL FeFET as a promising candidate for logic-compatible high-performance on-chip buffer memory and multi-bit weight cell for compute-in-memory accelerators.

preprint2021arXiv

Neural Sampling Machine with Stochastic Synapse allows Brain-like Learning and Inference

Many real-world mission-critical applications require continual online learning from noisy data and real-time decision making with a defined confidence level. Probabilistic models and stochastic neural networks can explicitly handle uncertainty in data and allow adaptive learning-on-the-fly, but their implementation in a low-power substrate remains a challenge. Here, we introduce a novel hardware fabric that implements a new class of stochastic NN called Neural-Sampling-Machine that exploits stochasticity in synaptic connections for approximate Bayesian inference. Harnessing the inherent non-linearities and stochasticity occurring at the atomic level in emerging materials and devices allows us to capture the synaptic stochasticity occurring at the molecular level in biological synapses. We experimentally demonstrate in-silico hybrid stochastic synapse by pairing a ferroelectric field-effect transistor -based analog weight cell with a two-terminal stochastic selector element. Such a stochastic synapse can be integrated within the well-established crossbar array architecture for compute-in-memory. We experimentally show that the inherent stochastic switching of the selector element between the insulator and metallic state introduces a multiplicative stochastic noise within the synapses of NSM that samples the conductance states of the FeFET, both during learning and inference. We perform network-level simulations to highlight the salient automatic weight normalization feature introduced by the stochastic synapses of the NSM that paves the way for continual online learning without any offline Batch Normalization. We also showcase the Bayesian inferencing capability introduced by the stochastic synapse during inference mode, thus accounting for uncertainty in data. We report 98.25%accuracy on standard image classification task as well as estimation of data uncertainty in rotated samples.

preprint2020arXiv

A micromagnetic study of the switching dynamics of the BiFeO$_3$/CoFe heterojunction

The switching dynamics of a single-domain BiFeO3/CoFe heterojunction is modeled and key parameters such as interface exchange coupling coefficient are extracted from experimental results. The lower limit of the magnetic order response time of CoFe in the BiFeO3/CoFe heterojunction is theoretically quantified to be on to the order of 100 ps. Our results indicate that the switching behavior of CoFe in the BiFeO3/CoFe heterojunction is dominated by the rotation of the Neel vector in BiFeO3 rather than the unidirectional exchange bias at the interface. We also quantify the magnitude of the interface exchange coupling coefficient J_int to be 0.32 pJ/m by comparing our simulation results with the giant magnetoresistance (GMR) curves and the magnetic hysteresis loop in the experiments. To the best of our knowledge, this is the first time that J_int is extracted quantitatively from experiments. Furthermore, we demonstrate that the switching success rate and the thermal stability of the BiFeO3/CoFe heterojunction can be improved by reducing the thickness of CoFe and increasing the length to width aspect ratio of the BiFeO3/CoFe heterojunction. Our theoretical model provides a comprehensive framework to study the magnetoelectric properties and the manipulation of the magnetic order of CoFe in the BiFeO3/CoFe heterojunction.

preprint2020arXiv

Learning fine-grained search space pruning and heuristics for combinatorial optimization

Combinatorial optimization problems arise in a wide range of applications from diverse domains. Many of these problems are NP-hard and designing efficient heuristics for them requires considerable time and experimentation. On the other hand, the number of optimization problems in the industry continues to grow. In recent years, machine learning techniques have been explored to address this gap. We propose a framework for leveraging machine learning techniques to scale-up exact combinatorial optimization algorithms. In contrast to the existing approaches based on deep-learning, reinforcement learning and restricted Boltzmann machines that attempt to directly learn the output of the optimization problem from its input (with limited success), our framework learns the relatively simpler task of pruning the elements in order to reduce the size of the problem instances. In addition, our framework uses only interpretable learning models based on intuitive features and thus the learning process provides deeper insights into the optimization problem and the instance class, that can be used for designing better heuristics. For the classical maximum clique enumeration problem, we show that our framework can prune a large fraction of the input graph (around 99 % of nodes in case of sparse graphs) and still detect almost all of the maximum cliques. This results in several fold speedups of state-of-the-art algorithms. Furthermore, the model used in our framework highlights that the chi-squared value of neighborhood degree has a statistically significant correlation with the presence of a node in a maximum clique, particularly in dense graphs which constitute a significant challenge for modern solvers. We leverage this insight to design a novel heuristic for this problem outperforming the state-of-the-art. Our heuristic is also of independent interest for maximum clique detection and enumeration.

preprint2020arXiv

Measurement of collisions between laser cooled cesium atoms and trapped cesium ions

We report the measurement of collision rate coefficient for collisions between ultracold Cs atoms and low energy Cs+ ions. The experiments are performed in a hybrid trap consisting of a magneto-optical trap (MOT) for Cs atoms and a Paul trap for Cs+ ions. The ion-atom collisions impart kinetic energy to the ultracold Cs atoms resulting in their escape from the shallow MOT and, therefore, in a reduction in the number of Cs atoms in the MOT. By monitoring, using fluorescence measurements, the Cs atom number and the MOT loading dynamics and then fitting the data to a rate equation model, the ion-atom collision rate is derived. The Cs-Cs+ collision rate coefficient $9.3(\pm0.4)(\pm1.2)(\pm3.5) \times 10^{-14}$ m$^{3}$s$^{-1}$, measured for an ion distribution with most probable collision energy of 95 meV ($\approx k_{B}.1100$ K), is in fair agreement with theoretical calculations. As an intermediate step, we also determine the photoionization cross section of Cs $6P_{3/2}$ atoms at 473 nm wavelength to be $2.28 (\pm 0.33) \times 10^{-21}$ m$^{2}$.

preprint2020arXiv

Predictive Probability Path Planning Model For Dynamic Environments

Path planning in dynamic environments is essential to high-risk applications such as unmanned aerial vehicles, self-driving cars, and autonomous underwater vehicles. In this paper, we generate collision-free trajectories for a robot within any given environment with temporal and spatial uncertainties caused due to randomly moving obstacles. We use two Poisson distributions to model the movements of obstacles across the generated trajectory of a robot in both space and time to determine the probability of collision with an obstacle. Measures are taken to avoid an obstacle by intelligently manipulating the speed of the robot at space-time intervals where a larger number of obstacles intersect the trajectory of the robot. Our method potentially reduces the use of computationally expensive collision detection libraries. Based on our experiments, there has been a significant improvement over existing methods in terms of safety, accuracy, execution time and computational cost. Our results show a high level of accuracy between the predicted and actual number of collisions with moving obstacles.

preprint2020arXiv

Towards Quantifying the Distance between Opinions

Increasingly, critical decisions in public policy, governance, and business strategy rely on a deeper understanding of the needs and opinions of constituent members (e.g. citizens, shareholders). While it has become easier to collect a large number of opinions on a topic, there is a necessity for automated tools to help navigate the space of opinions. In such contexts understanding and quantifying the similarity between opinions is key. We find that measures based solely on text similarity or on overall sentiment often fail to effectively capture the distance between opinions. Thus, we propose a new distance measure for capturing the similarity between opinions that leverages the nuanced observation -- similar opinions express similar sentiment polarity on specific relevant entities-of-interest. Specifically, in an unsupervised setting, our distance measure achieves significantly better Adjusted Rand Index scores (up to 56x) and Silhouette coefficients (up to 21x) compared to existing approaches. Similarly, in a supervised setting, our opinion distance measure achieves considerably better accuracy (up to 20% increase) compared to extant approaches that rely on text similarity, stance similarity, and sentiment similarity

preprint2019arXiv

Simulation of the Magnetization Dynamics of a Single Domain BiFeO$_3$ Thin Film

The switching dynamics of a single-domain BiFeO$_3$ thin films is investigated through combining the dynamics of polarization and Neel vector. The evolution of the ferroelectric polarization is described by the Landau-Khalatnikov (LK) equation, and the Landau-Lifshitz-Gilbert (LLG) equations for spins in two sublattices to model the time evolution of the antiferromagnetic order (Neel vector) in a G-type antiferromagnet. This work theoretically demonstrates that due to the rotation of the magnetic hard axis following the polarization reversal, the Neel vector can be switched by 180 degrees, while the weak magnetization can remain unchanged. The simulation results are consistent with the ab initio calculation, where the Neel vector rotates during polarization rotation, and also match our calculation of the dynamics of order parameter using Landau-Ginzburg theory. We also find that the switching time of the Neel vector is determined by the speed polarization switching and is predicted to be as short as 30 ps.

preprint2016arXiv

A study of phantom scalar field cosmology using Lie and Noether symmetries

The paper deals with phantom scalar field cosmology in Einstein gravity. At first using Lie symmetry, the coupling function to the kinetic term and the potential function of the scalar field and the equation of state parameter of the matter field are determined and a simple solution is obtained. Subsequently, Noether symmetry is imposed on the Lagrangian of the system. The symmetry vector is obtained and the potential takes a very general form from which potential using Lie Symmetry can be obtained as a particular case. Then we choose a point transformation $(a,ϕ)\rightarrow(u,v)$ such that one of the transformed variables (say u) is a cyclic for the Lagrangian. Using conserved charge (corresponding to the cyclic coordinate) and the constant of motion, solutions are obtained.

preprint2016arXiv

An attempt for an Emergent Scenario with Modified Chaplygin Gas

The present work is an attempt for emergent universe scenario with modified Chaplygin gas. The universe is chosen as spatially flat FRW space-time with modified Chaplygin gas as the only cosmic substratum. It is found that emergent scenario is possible for some specific (unrealistic) choice of the parameters in the equation of state for modified Chaplygin gas.

preprint2016arXiv

KOGNAC: Efficient Encoding of Large Knowledge Graphs

Many Web applications require efficient querying of large Knowledge Graphs (KGs). We propose KOGNAC, a dictionary-encoding algorithm designed to improve SPARQL querying with a judicious combination of statistical and semantic techniques. In KOGNAC, frequent terms are detected with a frequency approximation algorithm and encoded to maximise compression. Infrequent terms are semantically grouped into ontological classes and encoded to increase data locality. We evaluated KOGNAC in combination with state-of-the-art RDF engines, and observed that it significantly improves SPARQL querying on KGs with up to 1B edges.

preprint2016arXiv

Non-destructive detection of ions using atom-cavity collective strong coupling

We present a technique, based on atoms coupled to an optical cavity, for non-destructive detection of trapped ions. We demonstrate the vacuum-Rabi splitting (VRS), arising due to the collective strong coupling of ultracold Rb atoms to a cavity, to change in presence of trapped Rb+ ions. The Rb+ ions are optically dark and the Rb atoms are prepared in a dark magneto-optical trap (MOT). The VRS is measured on an optically open transition of the initially dark Rb atoms. The measurement itself is fast, non-destructive and has sufficient fidelity to permit the measurement of atomic-state selective ion-atom collision rate. This demonstration illustrates a method based on atom-cavity coupling to measure two particle interactions generically and non-destructively.

preprint2016arXiv

Photodissociation of trapped Rb$^+_2$ : Implications for simultaneous trapping of atoms and molecular ions

The direct photodissociation of trapped $^{85}$Rb$_2^+$ (rubidium) molecular ions by the cooling light for the $^{85}$Rb magneto-optical trap (MOT) is studied, both experimentally and theoretically. Vibrationally excited Rb$_{2}^{+}$ ions are created by photoionization of Rb$_{2}$ molecules formed photoassociatively in the Rb MOT and are trapped in a modified spherical Paul trap. The decay rate of the trapped Rb$_{2}^{+}$ ion signal in the presence of the MOT cooling light is measured and agreement with our calculated rates for molecular ion photodissociation is observed. The photodissociation mechanism due to the MOT light is expected to be active and therefore universal for all homonuclear diatomic alkali metal molecular ions.

preprint2016arXiv

Quintom cosmological model and some possible solutions using Lie and Noether symmetries

The present work deals with a quintom model of dark energy in the framework of a spatially flat isotropic and homogeneous Friedmann-Lemaitre-Robertson-Walker (FLRW) universe. At first, Lie point symmetry is imposed to the system and the unknown coupled potential of the model is determined. Then Noether symmetry, which is also a point like symmetry of the Lagrangian, is imposed on the physical system and the potential takes a general form. It is shown that the Lie algebra of Noether symmetry is a sub-algebra of the corresponding Lie algebra of the Lie symmetry. Finally, a point transformation in the three dimensional augmented space is performed suitably so that one of the variables become cyclic and as a result there is considerable simplification to the physical system. Hence conserved quantities (i.e, constants of motion) are expressed in a compact form and cosmological solutions are evaluated and analyzed in the present context.

preprint2016arXiv

SONIK: Efficient In-situ All Item Rank Generation using Bit Operations

Sorting, a classical combinatorial process, forms the bedrock of numerous algorithms with varied applications. A related problem involves efficiently finding the corresponding ranks of all the elements - catering to rank queries, data partitioning and allocation, etc. Although, the element ranks can be subsequently obtained by initially sorting the elements, such procedures involve O(n log n) computations and might not be suitable with large input sizes for hard real-time systems or for applications with data re-ordering constraints. This paper proposes SONIK, a non-comparison linear time and space algorithm using bit operations inspired by radix sort for computing the ranks of all input integer elements, thereby providing implicit sorting. The element ranks are generated in-situ, i.e., directly at the corresponding element position without re-ordering or recourse to any other sorting mechanism.

preprint2015arXiv

Formation of ultracold $^{7}$Li$^{85}$Rb molecules in the lowest triplet electronic state by photoassociation and their detection by ionization spectroscopy

We report the formation of ultracold $^{7}$Li$^{85}$Rb molecules in the $a^{3}Σ^{+}$ electronic state by photoassociation (PA) and their detection via resonantly enhanced multiphoton ionization (REMPI). With our dual-species Li and Rb magneto-optical trap (MOT) apparatus, we detect PA resonances with binding energies up to ~62 cm$^{-1}$ below the $^{7}$Li 2s $^{2}S_{1/2}$ $+$ $^{85}$Rb 5p $^{2}P_{1/2}$ asymptote. In addition, we use REMPI spectroscopy to probe the $a^{3}Σ^{+}$ state and excited electronic $3^{3} Π$ and $4^{3} Σ^{+}$ states, and identify $a^{3}Σ^{+} (v" = 7 - 13)$, $3^{3} Π(v'_Π = 0 - 10)$ and $4^{3} Σ^{+} (v'_Σ = 0 - 5)$ vibrational levels. Our line assignments agree well with ab initio calculations. These preliminary spectroscopic studies on previously unobserved electronic states are crucial to discovering transition pathways for transferring ultracold LiRb molecules created via PA to deeply bound rovibrational levels of the electronic ground state.

preprint2014arXiv

Extracting molecular potentials from insufficient spectroscopic information

We extend our recently developed inversion method to extract excited state potentials from fluorescence line positions and line strengths. We consider a previous limitation of the method arising due to insufficient input data in cases where the relatively weaker emission data are not experimentally available. We develop a solution to this problem by "regenerating" these weak transition lines via applying a model potential, e.g. a Morse potential. The result of this procedure, illustrated for the Q-branch emission from the lowest three vibrational levels of the B($^1 Π)$ state of LiRb, is shown to have an error of $0.29$ cm$^{-1}$ in the classically allowed region and a global error of $5.67$ cm$^{-1}$ for $V\le E(ν'=10)$. The robustness of this procedure is also demonstrated by considering the statistical error in the measured line intensities.

preprint2014arXiv

Formation of deeply bound ultracold LiRb molecules via photoassociation near the Li 2S$_{1/2}$ + Rb 5P$_{3/2}$ asymptote

We present spectra of ultracold $^7$Li$^{85}$Rb molecules in their electronic ground state formed by spontaneous decay of weakly bound photoassociated molecules. Beginning with atoms in a dual species magneto-optical trap (MOT), weakly bound molecules are formed in the 4(1) electronic state, which corresponds to the B$^1Π$ state at short range. These molecules spontaneously decay to the electronic ground state and we use resonantly enhanced multiphoton ionization (REMPI) to determine the vibrational population distribution in the electronic ground states after spontaneous emission. Many of the observed lines from the spectra are consistent with transitions from the X$^1Σ^+$ ground electronic state to either the B$^1Π$ or D$^1Π$ electronic states that have been previously observed, with levels possibly as low as X$^1Σ^+$ $(v'' = 2)$ being populated. We do not observe decay to weakly bound vibrational levels of the X$^1Σ^+$ or a$^3Σ^+$ electronic states in the spectra. We also deduce a lower bound of 3900 cm$^{-1}$ for the dissociation energy of the LiRb$^+$ molecular ion.

preprint2014arXiv

Formation of ultracold LiRb molecules by photoassociation near the Li (2s 2S1/2) + Rb (5p 2P1/2) asymptote

We report the production of ultracold 7Li85Rb molecules by photoassociation (PA) below the Li (2s 2S1/2) + Rb (5p 2P1/2) asymptote. We perform PA spectroscopy in a dual-species 7Li-85Rb magneto-optical trap (MOT) and detect the PA resonances using trap loss spectroscopy. We observe several strong PA resonances corresponding to the last few bound states, assign the lines and derive the long range C6 dispersion coefficients for the Li (2s 2S1/2) + Rb (5p 2P1/2) asymptote. We also report an excited-state molecule formation rate (P_LiRb) of ~10^7 s^-1 and a PA rate coefficient (K_PA) of ~4x10^-11 cm^3/s, which are both among the highest observed for heteronuclear bi-alkali molecules. These suggest that PA is a promising route for the creation of ultracold ground state LiRb molecules.

preprint2014arXiv

Interspecies collision-induced losses in a dual species 7Li-85Rb magneto-optical trap

In this article, we report the measurement of collision-induced loss rate coefficients β_{Li,Rb} and β_{Rb,Li}, and also discuss means to significantly suppress such collision induced losses. We first describe our dual-species magneto-optical trap (MOT) that allows us to simultaneously trap > 5x10^8 7Li atoms loaded from a Zeeman slower and > 2x10^8 85Rb atoms loaded from a dispenser. We observe strong interspecies collision-induced losses in the MOTs which dramatically reduce the maximum atom number achievable in the MOTs. We measure the trap loss rate coefficients β_{Li,Rb} and β_{Rb,Li}, and, from a study of their dependence on the MOT parameters, determine the cause for the losses observed. Our results provide valuable insights into ultracold collisions between 7Li and 85Rb, guide our efforts to suppress collision induced losses, and also pave the way for the production of ultracold 7Li85Rb molecules.

preprint2014arXiv

Photoassociation of ultracold LiRb* molecules: observation of high efficiency and unitarity-limited rate saturation

We report the production of ultracold heteronuclear 7Li85Rb molecules in excited electronic states by photoassociation (PA) of ultracold 7Li and 85Rb atoms. PA is performed in a dual-species 7Li-85Rb magneto-optical trap (MOT) and the PA resonances are detected using trap loss spectroscopy. We identify several strong PA resonances below the Li (2s 2S1/2) + Rb (5p 2P3/2) asymptote and experimentally determine the long range C6 dispersion coefficients. We find a molecule formation rate (P_LiRb) of 3.5x10^7 s^-1 and a PA rate coefficient (K_PA) of 1.3x10^-10 cm^3/s, the highest among heteronuclear bi-alkali molecules. At large PA laser intensity, we observe the saturation of the PA rate coefficient (K_PA) close to the theoretical value at the unitarity limit.

preprint2014arXiv

Quantum Defect Theory description of weakly bound levels and Feshbach resonances in LiRb

The multichannel quantum defect theory (MQDT) in combination with the frame transformation (FT) approach is applied to model the Fano-Feshbach resonances measured for $^{7}$Li$^{87}$Rb and $^{6}$Li$^{87}$Rb [Marzok {\it et al.} Phys. Rev. A {\bf 79} 012717 (2009)]. The MQDT results show a level of accuracy comparable to that of previous models based on direct, fully numerical solutions of the the coupled channel Schrödinger equations (CC). Here, energy levels deduced from 2-photon photoassociation spectra for $^{7}$Li$^{85}$Rb are assigned by applying the MQDT approach, obtaining the bound state energies for the coupled channel problem. Our results confirm that MQDT yields a compact description of photoassociation observables as well as the Fano-Feshbach resonance positions and widths.

preprint2012arXiv

Advanced Bloom Filter Based Algorithms for Efficient Approximate Data De-Duplication in Streams

Applications involving telecommunication call data records, web pages, online transactions, medical records, stock markets, climate warning systems, etc., necessitate efficient management and processing of such massively exponential amount of data from diverse sources. De-duplication or Intelligent Compression in streaming scenarios for approximate identification and elimination of duplicates from such unbounded data stream is a greater challenge given the real-time nature of data arrival. Stable Bloom Filters (SBF) addresses this problem to a certain extent. . In this work, we present several novel algorithms for the problem of approximate detection of duplicates in data streams. We propose the Reservoir Sampling based Bloom Filter (RSBF) combining the working principle of reservoir sampling and Bloom Filters. We also present variants of the novel Biased Sampling based Bloom Filter (BSBF) based on biased sampling concepts. We also propose a randomized load balanced variant of the sampling Bloom Filter approach to efficiently tackle the duplicate detection. In this work, we thus provide a generic framework for de-duplication using Bloom Filters. Using detailed theoretical analysis we prove analytical bounds on the false positive rate, false negative rate and convergence rate of the proposed structures. We exhibit that our models clearly outperform the existing methods. We also demonstrate empirical analysis of the structures using real-world datasets (3 million records) and also with synthetic datasets (1 billion records) capturing various input distributions.

preprint2012arXiv

INSTRUCT: Space-Efficient Structure for Indexing and Complete Query Management of String Databases

The tremendous expanse of search engines, dictionary and thesaurus storage, and other text mining applications, combined with the popularity of readily available scanning devices and optical character recognition tools, has necessitated efficient storage, retrieval and management of massive text databases for various modern applications. For such applications, we propose a novel data structure, INSTRUCT, for efficient storage and management of sequence databases. Our structure uses bit vectors for reusing the storage space for common triplets, and hence, has a very low memory requirement. INSTRUCT efficiently handles prefix and suffix search queries in addition to the exact string search operation by iteratively checking the presence of triplets. We also propose an extension of the structure to handle substring search efficiently, albeit with an increase in the space requirements. This extension is important in the context of trie-based solutions which are unable to handle such queries efficiently. We perform several experiments portraying that INSTRUCT outperforms the existing structures by nearly a factor of two in terms of space requirements, while the query times are better. The ability to handle insertion and deletion of strings in addition to supporting all kinds of queries including exact search, prefix/suffix search and substring search makes INSTRUCT a complete data structure.

preprint2011arXiv

Caching Stars in the Sky: A Semantic Caching Approach to Accelerate Skyline Queries

Multi-criteria decision making has been made possible with the advent of skyline queries. However, processing such queries for high dimensional datasets remains a time consuming task. Real-time applications are thus infeasible, especially for non-indexed skyline techniques where the datasets arrive online. In this paper, we propose a caching mechanism that uses the semantics of previous skyline queries to improve the processing time of a new query. In addition to exact queries, utilizing such special semantics allow accelerating related queries. We achieve this by generating partial result sets guaranteed to be in the skyline sets. We also propose an index structure for efficient organization of the cached queries. Experiments on synthetic and real datasets show the effectiveness and scalability of our proposed methods.

preprint2011arXiv

Laser spectroscopy of the X 1Σ+ and B 1Π states of the LiRb molecule

We have studied the X 1Σ+ and B 1Π states of 7Li85Rb using Laser Induced Fluorescence (LIF) spectroscopy and Fluorescence Excitation Spectroscopy (FES). We extract molecular constants for levels v" = 0-2 of the X 1Σ+ state and levels v' = 0-20 of the B 1Π state. For the B 1Π state, we have observed rotational perturbations in the e-parity component of the v' = 2 level, and determined the dissociation energy. We discuss implications of our measurements in finding efficient photoassociation pathways for production of ultra-cold ground state LiRb molecules, and their detection via state selective ionization.

preprint2011arXiv

Mode-hop-free tuning over 135 GHz of external cavity diode lasers without anti-reflection coating

We report an external cavity diode laser (ECDL), using a diode whose front facet is not antireflection (AR) coated, that has a mode-hop-free (MHF) tuning range greater than 135 GHz. We achieved this using a short external cavity and by simultaneously tuning the internal and external modes of the laser. We find that the precise location of the pivot point of the grating in our laser is less critical than commonly believed. The general applicability of the method, combined with the compact portable mechanical and electronic design, makes it well suited for both research and industrial applications.

preprint2011arXiv

Multidimensional Balanced Allocation for Multiple Choice & (1 + Beta) Processes

Allocation of balls into bins is a well studied abstraction for load balancing problems.The literature hosts numerous results for sequential(single dimensional) allocation case when m balls are thrown into n bins. In this paper we study the symmetric multiple choice process for both unweighted and weighted balls as well as for both multidimensional and scalar models.Additionally,we present the results on bounds on gap for (1+beta) choice process with multidimensional balls and bins. We show that for the symmetric d choice process and with m=O(n), the upper bound on the gap is O(lnln(n)) w.h.p.This upper bound on the gap is within D=f factor of the lower bound. This is the first such tight result.For the general case of m>>n the expected gap is bounded by O(lnln(n)).For variable f and non-uniform distribution of the populated dimensions,we obtain the upper bound on the expected gap as O(log(n)). Further,for the multiple round parallel balls and bins,we show that the gap is also bounded by O(loglog(n)) for m=O(n).The same bound holds for the expected gap when m>>n. Our analysis also has strong implications in the sequential scalar case.For the weighted balls and bins and general case m>>n,we show that the upper bound on the expected gap is O(log(n)) which improves upon the best prior bound of n^c.Moreover,we show that for the (1 + beta) choice process and m=O(n) the upper bound(assuming uniform distribution of f populated dimensions over D total dimensions) on the gap is O(log(n)/beta),which is within D=f factor of the lower bound.For fixed f with non-uniform distribution and for random f with Binomial distribution the expected gap remains O(log(n)/beta) independent of the total number of balls thrown. This is the first such tight result for (1 +beta) paradigm with multidimensional balls and bins.

preprint2011arXiv

Perfectly Balanced Allocation With Estimated Average Using Expected Constant Retries

Balanced allocation of online balls-into-bins has long been an active area of research for efficient load balancing and hashing applications.There exists a large number of results in this domain for different settings, such as parallel allocations~\cite{parallel}, multi-dimensional allocations~\cite{multi}, weighted balls~\cite{weight} etc. For sequential multi-choice allocation, where $m$ balls are thrown into $n$ bins with each ball choosing $d$ (constant) bins independently uniformly at random, the maximum load of a bin is $O(\log \log n) + m/n$ with high probability~\cite{heavily_load}. This offers the current best known allocation scheme. However, for $d = Θ(\log n)$, the gap reduces to $O(1)$~\cite{soda08}.A similar constant gap bound has been established for parallel allocations with $O(\log ^*n)$ communication rounds~\cite{lenzen}. In this paper we propose a novel multi-choice allocation algorithm, \emph{Improved D-choice with Estimated Average} ($IDEA$) achieving a constant gap with a high probability for the sequential single-dimensional online allocation problem with constant $d$. We achieve a maximum load of $\lceil m/n \rceil$ with high probability for constant $d$ choice scheme with \emph{expected} constant number of retries or rounds per ball. We also show that the bound holds even for an arbitrary large number of balls, $m>>n$. Further, we generalize this result to (i)~the weighted case, where balls have weights drawn from an arbitrary weight distribution with finite variance, (ii)~multi-dimensional setting, where balls have $D$ dimensions with $f$ randomly and uniformly chosen filled dimension for $m=n$, and (iii)~the parallel case, where $n$ balls arrive and are placed parallely in the bins. We show that the gap in these case is also a constant w.h.p. (independent of $m$) for constant value of $d$ with expected constant number of retries per ball.

preprint2011arXiv

Towards "Intelligent Compression" in Streams: A Biased Reservoir Sampling based Bloom Filter Approach

With the explosion of information stored world-wide,data intensive computing has become a central area of research.Efficient management and processing of this massively exponential amount of data from diverse sources,such as telecommunication call data records,online transaction records,etc.,has become a necessity.Removing redundancy from such huge(multi-billion records) datasets resulting in resource and compute efficiency for downstream processing constitutes an important area of study. "Intelligent compression" or deduplication in streaming scenarios,for precise identification and elimination of duplicates from the unbounded datastream is a greater challenge given the realtime nature of data arrival.Stable Bloom Filters(SBF) address this problem to a certain extent.However,SBF suffers from a high false negative rate(FNR) and slow convergence rate,thereby rendering it inefficient for applications with low FNR tolerance.In this paper, we present a novel Reservoir Sampling based Bloom Filter,(RSBF) data structure,based on the combined concepts of reservoir sampling and Bloom filters for approximate detection of duplicates in data streams.Using detailed theoretical analysis we prove analytical bounds on its false positive rate(FPR),false negative rate(FNR) and convergence rates with low memory requirements.We show that RSBF offers the currently lowest FN and convergence rates,and are better than those of SBF while using the same memory.Using empirical analysis on real-world datasets(3 million records) and synthetic datasets with around 1 billion records,we demonstrate upto 2x improvement in FNR with better convergence rates as compared to SBF,while exhibiting comparable FPR.To the best of our knowledge,this is the first attempt to integrate reservoir sampling method with Bloom filters for deduplication in streaming scenarios.

preprint2010arXiv

Mining Statistically Significant Substrings Based on the Chi-Square Measure

Given the vast reservoirs of data stored worldwide, efficient mining of data from a large information store has emerged as a great challenge. Many databases like that of intrusion detection systems, web-click records, player statistics, texts, proteins etc., store strings or sequences. Searching for an unusual pattern within such long strings of data has emerged as a requirement for diverse applications. Given a string, the problem then is to identify the substrings that differs the most from the expected or normal behavior, i.e., the substrings that are statistically significant. In other words, these substrings are less likely to occur due to chance alone and may point to some interesting information or phenomenon that warrants further exploration. To this end, we use the chi-square measure. We propose two heuristics for retrieving the top-k substrings with the largest chi-square measure. We show that the algorithms outperform other competing algorithms in the runtime, while maintaining a high approximation ratio of more than 0.96.

Sourav Dutta

What is connected

Connect this record

See the researcher in context

Building this map preview

34 published item(s)

ACO based Adaptive RBFN Control for Robot Manipulators

Aligned Weight Regularizers for Pruning Pretrained Neural Networks

An Ising Hamiltonian Solver using Stochastic Phase-Transition Nano- Oscillators

Logic Compatible High-Performance Ferroelectric Transistor Memory

Neural Sampling Machine with Stochastic Synapse allows Brain-like Learning and Inference

A micromagnetic study of the switching dynamics of the BiFeO$_3$/CoFe heterojunction

Learning fine-grained search space pruning and heuristics for combinatorial optimization

Measurement of collisions between laser cooled cesium atoms and trapped cesium ions

Predictive Probability Path Planning Model For Dynamic Environments

Towards Quantifying the Distance between Opinions

Simulation of the Magnetization Dynamics of a Single Domain BiFeO$_3$ Thin Film

A study of phantom scalar field cosmology using Lie and Noether symmetries

An attempt for an Emergent Scenario with Modified Chaplygin Gas

KOGNAC: Efficient Encoding of Large Knowledge Graphs

Non-destructive detection of ions using atom-cavity collective strong coupling

Photodissociation of trapped Rb$^+_2$ : Implications for simultaneous trapping of atoms and molecular ions

Quintom cosmological model and some possible solutions using Lie and Noether symmetries

SONIK: Efficient In-situ All Item Rank Generation using Bit Operations

Formation of ultracold $^{7}$Li$^{85}$Rb molecules in the lowest triplet electronic state by photoassociation and their detection by ionization spectroscopy

Extracting molecular potentials from insufficient spectroscopic information

Formation of deeply bound ultracold LiRb molecules via photoassociation near the Li 2S$_{1/2}$ + Rb 5P$_{3/2}$ asymptote

Formation of ultracold LiRb molecules by photoassociation near the Li (2s 2S1/2) + Rb (5p 2P1/2) asymptote

Interspecies collision-induced losses in a dual species 7Li-85Rb magneto-optical trap

Photoassociation of ultracold LiRb* molecules: observation of high efficiency and unitarity-limited rate saturation

Quantum Defect Theory description of weakly bound levels and Feshbach resonances in LiRb

Advanced Bloom Filter Based Algorithms for Efficient Approximate Data De-Duplication in Streams

INSTRUCT: Space-Efficient Structure for Indexing and Complete Query Management of String Databases

Caching Stars in the Sky: A Semantic Caching Approach to Accelerate Skyline Queries

Laser spectroscopy of the X 1Σ+ and B 1Π states of the LiRb molecule

Mode-hop-free tuning over 135 GHz of external cavity diode lasers without anti-reflection coating

Multidimensional Balanced Allocation for Multiple Choice & (1 + Beta) Processes

Perfectly Balanced Allocation With Estimated Average Using Expected Constant Retries

Towards "Intelligent Compression" in Streams: A Biased Reservoir Sampling based Bloom Filter Approach

Mining Statistically Significant Substrings Based on the Chi-Square Measure