Source author record

Sourav Ghosh

Sourav Ghosh appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher: unclaimed source record

Computation and Language, Computer Vision, cond-mat.mtrl-sci, Cryptography and Security, Databases, Human-Computer Interaction, Machine Learning, physics.app-ph

Catalog footprint

What is connected

7 works

8 topics

4 close collaborators


Preprint, 2022, arXiv

eDWaaS: A Scalable Educational Data Warehouse as a Service

University management is perpetually innovating policies to improve the quality of service. The intellectual growth of students and the popularity of the university are some of the major areas that management strives to improve. Relevant historical data is needed to support any decision. Furthermore, providing data to various university ranking frameworks has become a frequent activity in recent years. The required format changes frequently, which demands manual effort. Maintaining a data warehouse can be a solution to this problem. However, neither an in-house nor an outsourced implementation of a dedicated data warehouse may be cost-effective. This work proposes an educational data warehouse as a service (eDWaaS) model to store historical data for multiple universities. The proposed multi-tenant schema lets universities maintain their data warehouses cost-effectively. It also addresses the scalability issues in implementing such a data-warehouse-as-a-service model.
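The abstract does not describe the multi-tenant schema itself. As a minimal sketch only, one common way to share a warehouse among tenants is a `tenant_id` column on every table. The table names, columns, and sample rows below are hypothetical and are not taken from the paper.

```python
import sqlite3

# Hypothetical shared-schema layout: every fact row carries a tenant_id,
# so several universities can live in one warehouse instance.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE dim_university (
    tenant_id INTEGER PRIMARY KEY,
    name      TEXT NOT NULL
);
CREATE TABLE fact_enrollment (
    tenant_id     INTEGER NOT NULL REFERENCES dim_university(tenant_id),
    academic_year INTEGER NOT NULL,
    department    TEXT    NOT NULL,
    students      INTEGER NOT NULL,
    PRIMARY KEY (tenant_id, academic_year, department)
);
""")

# Illustrative sample data (invented for this sketch).
conn.executemany(
    "INSERT INTO dim_university VALUES (?, ?)",
    [(1, "University A"), (2, "University B")],
)
conn.executemany(
    "INSERT INTO fact_enrollment VALUES (?, ?, ?, ?)",
    [(1, 2021, "CS", 120), (1, 2022, "CS", 150), (2, 2022, "CS", 90)],
)


def enrollment_by_year(connection, tenant_id):
    """Return (academic_year, total_students) rows for one tenant only."""
    cursor = connection.execute(
        "SELECT academic_year, SUM(students) FROM fact_enrollment "
        "WHERE tenant_id = ? GROUP BY academic_year ORDER BY academic_year",
        (tenant_id,),
    )
    return cursor.fetchall()
```

Filtering every query by `tenant_id` keeps each university's history separate while the tables, indexes, and maintenance cost are shared.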

Preprint, 2022, arXiv

LIP: Lightweight Intelligent Preprocessor for meaningful text-to-speech

Existing Text-to-Speech (TTS) systems need to read messages ranging from emails, which may contain Personally Identifiable Information (PII), to text messages, which can contain streaks of emojis and punctuation. 92% of the world's online population use emoji, with more than 10 billion emojis sent every day. Without a preprocessor, messages are read as-is, including punctuation and infographics such as emoticons. The problem worsens when there is a continuous sequence of punctuation or emojis, which is common in real-world communication such as messaging and Social Networking Site (SNS) interactions. In this work, we introduce a lightweight intelligent preprocessor (LIP) that enhances the readability of a message before it is passed downstream to existing TTS systems. We propose multiple sub-modules, including expanding contractions, censoring swear words, and masking PII, as part of our preprocessor to enhance the readability of text. With a memory footprint of only 3.55 MB and an inference time of 4 ms for text of up to 50 characters, our solution is suitable for real-time deployment. As this work is the first of its kind, we benchmark it with an open, independent survey, which shows a 76.5% preference for the LIP-enabled TTS engine over standard TTS.
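The sub-modules named above can be illustrated with a small rule-based sketch. This is not the LIP implementation, whose internals the abstract does not give. The regular expressions, the tiny contraction table, the placeholder blocklist, and the replacement phrases are all assumptions made for illustration.

```python
import re

# Hypothetical lookup tables; a real system would use much larger resources.
CONTRACTIONS = {"can't": "cannot", "won't": "will not", "i'm": "I am", "it's": "it is"}
BLOCKLIST = {"darn"}  # placeholder standing in for a swear-word list

EMAIL = re.compile(r"[\w.+-]+@[\w-]+(?:\.[\w-]+)+")
PHONE = re.compile(r"\+?\d[\d\s-]{7,}\d")
# A run of the same symbol (punctuation or a single-codepoint emoji).
REPEATED_SYMBOL = re.compile(r"([^\w\s])\1+")
CONTRACTION = re.compile(
    r"\b(" + "|".join(re.escape(c) for c in CONTRACTIONS) + r")\b", re.IGNORECASE
)
SWEAR = re.compile(
    r"\b(" + "|".join(re.escape(w) for w in BLOCKLIST) + r")\b", re.IGNORECASE
)


def preprocess(text):
    """Return a version of `text` that is friendlier to read aloud."""
    text = EMAIL.sub("email address", text)      # mask PII: email addresses
    text = PHONE.sub("phone number", text)       # mask PII: phone-like digit runs
    text = REPEATED_SYMBOL.sub(r"\1", text)      # collapse "!!!" to "!"
    text = CONTRACTION.sub(lambda m: CONTRACTIONS[m.group(0).lower()], text)
    text = SWEAR.sub("beep", text)               # censor blocklisted words
    return text
```

For example, `preprocess("I can't wait!!! Mail me at bob@example.com")` yields `"I cannot wait! Mail me at email address"`. PII is masked first so that later rules never rewrite parts of an address or number.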

Preprint, 2022, arXiv

PrivPAS: A real time Privacy-Preserving AI System and applied ethics

With 3.78 billion social media users worldwide in 2021 (48% of the human population), almost 3 billion images are shared daily. At the same time, the steady evolution of smartphone cameras has led to a photography explosion, with 85% of all new pictures being captured using smartphones. Lately, however, there has been increased discussion of privacy concerns when a person being photographed is unaware of the picture being taken or has reservations about it being shared. These privacy violations are amplified for people with disabilities, who may find it challenging to raise dissent even if they are aware. Such unauthorized image captures may also be misused by third-party organizations to gain sympathy, leading to a privacy breach. Privacy for people with disabilities has so far received comparatively less attention from the AI community. This motivates us to work towards a solution that generates privacy-conscious cues to make smartphone users aware of any sensitivity in their viewfinder content. To this end, we introduce PrivPAS (A real time Privacy-Preserving AI System), a novel framework to identify sensitive content. Additionally, we curate and annotate a dataset to identify and localize accessibility markers and to classify whether an image is sensitive to a featured subject with a disability. We demonstrate that the proposed lightweight architecture, with a memory footprint of a mere 8.49 MB, achieves a high mAP of 89.52% on resource-constrained devices. Furthermore, our pipeline, trained on face-anonymized data, achieves an F1-score of 73.1%.

Preprint, 2021, arXiv

Comparison of the electrochemical performance of CeO2 and rare earth-based mixed metallic oxide (Ce0.9Zr0.1O2) for supercapacitor applications

CeO2 and Ce0.9Zr0.1O2 are prepared by the sol-gel method to investigate and compare their electrochemical properties for supercapacitor applications. Structural, morphological, and elemental studies of CeO2 and Ce0.9Zr0.1O2 are carried out by XRD, SEM, and EDX. Cyclic voltammetry, galvanostatic charge-discharge, and electrochemical impedance spectroscopy are used to study the electrochemical performance of these materials. Doping enhances the electrochemical performance of the electrode by improving the specific capacitance (~150%, to 243 F g-1 from 96 F g-1) for the doped system at 2 mV s-1 vs. an Ag/AgCl reference electrode in 2 mol L-1 KOH electrolyte solution. Ce0.9Zr0.1O2 shows only ~30% capacitance degradation for a tenfold increase in current density. Ce0.9Zr0.1O2 also shows 16% capacitance degradation after 800 cycles, with excellent Coulombic efficiency (~100%), at a current density of 2 A g-1. Partial replacement of the Ce4+ ion (0.97 Å) with the Zr4+ ion (0.84 Å) results in a decrease in lattice parameter, as confirmed by Rietveld refinement. Ce0.9Zr0.1O2 provides good energy and power densities of 1.128 Wh kg-1 and 112.5 W kg-1, respectively. Furthermore, better diffusivity of Ce0.9Zr0.1O2 in KOH electrolyte (indicated by Randles-Sevcik equation-based analysis) is correlated with better electrochemical performance. The insights presented here indicate that Zr doping of CeO2 yields a promising candidate material for electrochemical and supercapacitive applications.
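The headline figures can be cross-checked with simple arithmetic. The calculation below is an illustration and not part of the original abstract. The discharge time t in the last line is inferred under the assumption that average power equals energy divided by discharge time (P = E/t).

```latex
\begin{align*}
\frac{C_{\mathrm{Ce_{0.9}Zr_{0.1}O_2}} - C_{\mathrm{CeO_2}}}{C_{\mathrm{CeO_2}}}
  &= \frac{243 - 96}{96} \approx 1.53
  \quad \text{(about 153\%, consistent with the quoted } {\sim}150\%\text{)} \\
\text{capacitance retained after 800 cycles}
  &= 100\% - 16\% = 84\% \\
t = \frac{E}{P}
  &= \frac{1.128\ \mathrm{Wh\,kg^{-1}}}{112.5\ \mathrm{W\,kg^{-1}}}
  \approx 0.0100\ \mathrm{h} \approx 36\ \mathrm{s}
\end{align*}
```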

Preprint, 2021, arXiv

Language Detection Engine for Multilingual Texting on Mobile Devices

More than 2 billion mobile users worldwide type in multiple languages on the soft keyboard. On a monolingual keyboard, 38% of falsely auto-corrected words are valid in another language. This can be avoided by detecting the language of typed words and then validating them in the respective language. Language detection is a well-known problem in natural language processing. In this paper, we present a fast, light-weight and accurate Language Detection Engine (LDE) for multilingual typing that dynamically adapts to the user-intended language in real time. We propose a novel approach in which a character N-gram model is fused with a logistic regression based selector model to identify the language. Additionally, we present a method that significantly reduces inference time through a parameter reduction technique. We also discuss various optimizations applied across LDE to resolve ambiguity in input text among languages with the same character patterns. Our method demonstrates an average accuracy of 94.5% for Indian languages in Latin script and 98% for European languages on code-switched data. The model outperforms fastText by 60.39% and ML-Kit by 23.67% in F1 score for European languages. LDE is fast on mobile devices, with an average inference time of 25.91 microseconds.
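To make the character N-gram half of this approach concrete, here is a minimal sketch of per-language trigram scoring with add-one smoothing. It is not the LDE implementation: the logistic regression selector and the parameter reduction step are omitted, and the class name, function names, and toy word lists are all hypothetical.

```python
import math
from collections import Counter


def char_ngrams(word, n=3):
    """Split a word into character n-grams, with ^ and $ marking its edges."""
    padded = f"^{word.lower()}$"
    return [padded[i:i + n] for i in range(len(padded) - n + 1)]


class CharNgramModel:
    """Character n-gram counts for one language, scored with add-one smoothing."""

    def __init__(self, words, n=3):
        self.n = n
        self.counts = Counter(g for w in words for g in char_ngrams(w, n))
        self.total = sum(self.counts.values())
        self.vocab = len(self.counts) + 1  # one extra slot for unseen n-grams

    def log_score(self, word):
        return sum(
            math.log((self.counts[g] + 1) / (self.total + self.vocab))
            for g in char_ngrams(word, self.n)
        )


def detect(word, models):
    """Return the language whose model gives the word the highest log score."""
    return max(models, key=lambda lang: models[lang].log_score(word))


# Toy training data, invented for this sketch.
models = {
    "en": CharNgramModel(["the", "this", "that", "think", "there", "with", "which", "would"]),
    "de": CharNgramModel(["der", "die", "das", "nicht", "schon", "und", "ich", "auch", "sich"]),
}
```

With these toy lists, `detect("thing", models)` picks `"en"` and `detect("nichts", models)` picks `"de"`, because each word shares more trigrams with one language's training words. In a fused design such as the one the abstract describes, scores like these would feed a separate selector rather than being compared directly.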

Preprint, 2021, arXiv

LIDSNet: A Lightweight on-device Intent Detection model using Deep Siamese Network

Intent detection is a crucial task in any Natural Language Understanding (NLU) system and forms the foundation of a task-oriented dialogue system. To build high-quality real-world conversational solutions for edge devices, the intent detection model needs to be deployed on-device. This necessitates a light-weight, fast, and accurate model that can perform efficiently in a resource-constrained environment. To this end, we propose LIDSNet, a novel lightweight on-device intent detection model, which accurately predicts the message intent by utilizing a Deep Siamese Network for learning better sentence representations. We use character-level features to enrich the sentence-level representations and empirically demonstrate the advantage of transfer learning by utilizing pre-trained embeddings. Furthermore, to investigate the efficacy of the modules in our architecture, we conduct an ablation study and arrive at our optimal model. Experimental results show that LIDSNet achieves state-of-the-art competitive accuracy of 98.00% and 95.97% on the SNIPS and ATIS public datasets respectively, with under 0.59M parameters. We further benchmark LIDSNet against fine-tuned BERTs and show that our model is at least 41x lighter and 30x faster during inference than MobileBERT on a Samsung Galaxy S20 device, justifying its efficiency on resource-constrained edge devices.

Preprint, 2021, arXiv

Real-Time Optimized N-gram For Mobile Devices

With the increasing number of mobile devices, there has been continuous research on generating optimized Language Models (LMs) for soft keyboards. In spite of advances in this domain, building a single LM for low-end feature phones as well as high-end smartphones is still a pressing need. Hence, we propose a novel technique, Optimized N-gram (Op-Ngram), an end-to-end N-gram pipeline that utilises mobile resources efficiently for faster Word Completion (WC) and Next Word Prediction (NWP). Op-Ngram applies Stupid Backoff and pruning strategies to generate a light-weight model. The LM loading time on mobile is linear with respect to model size. We observed that Op-Ngram gives a 37% improvement in Language Model (LM)-ROM size, 76% in LM-RAM size, 88% in loading time and 89% in average suggestion time compared to the SORTED array variant of BerkeleyLM. Moreover, our method shows significant performance improvement over KenLM as well.
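Stupid Backoff, which the abstract names, scores a word by its relative frequency after the longest matching context and multiplies by a fixed factor each time it falls back to a shorter context. The sketch below shows the generic scheme only, not the Op-Ngram pipeline: pruning and the on-device storage format are not covered, the function names are hypothetical, and the factor 0.4 is the value commonly used in the literature rather than one stated in this abstract.

```python
from collections import Counter

ALPHA = 0.4  # assumed backoff factor; the abstract does not state Op-Ngram's value


def count_ngrams(tokens, max_order):
    """Count every n-gram of order 1..max_order, keyed by a tuple of tokens."""
    counts = Counter()
    for n in range(1, max_order + 1):
        for i in range(len(tokens) - n + 1):
            counts[tuple(tokens[i:i + n])] += 1
    return counts


def stupid_backoff(word, context, counts, total_tokens, alpha=ALPHA):
    """Return the Stupid Backoff score S(word | context).

    Scores are relative frequencies scaled by alpha per backoff step;
    they are not normalized probabilities.
    """
    context = tuple(context)
    penalty = 1.0
    while context:
        ngram = context + (word,)
        if counts[ngram] > 0:
            return penalty * counts[ngram] / counts[context]
        context = context[1:]  # drop the oldest context word
        penalty *= alpha
    return penalty * counts[(word,)] / total_tokens


def predict_next(context, counts, total_tokens, k=3):
    """Rank all known words as candidates for the next word."""
    vocab = [gram[0] for gram in counts if len(gram) == 1]
    ranked = sorted(
        vocab,
        key=lambda w: stupid_backoff(w, context, counts, total_tokens),
        reverse=True,
    )
    return ranked[:k]


tokens = "the cat sat on the mat".split()
counts = count_ngrams(tokens, max_order=3)
```

With this toy corpus, `stupid_backoff("cat", ["the"], counts, len(tokens))` uses the seen bigram and gives 1/2, while `stupid_backoff("mat", ["sat"], counts, len(tokens))` backs off to the unigram and gives 0.4 × 1/6. Because no normalization is needed, the scheme requires only count lookups at inference time, which suits constrained devices.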
