Researcher profile

Ravi Shankar

Ravi Shankar contributes to research discovery and scholarly infrastructure.

Researcher
Affiliation not imported
Open to collaborate

Trust snapshot

Quick read

Trust 19 - Unverified
Verification L1
Unclaimed author
5 works
0 followers
9 topics
4 close collaborators

Published work

5 published items

Preprint, 2022, arXiv

Gradient estimates for the Lagrangian mean curvature equation with critical and supercritical phase

In this paper, we prove interior gradient estimates for the Lagrangian mean curvature equation, if the Lagrangian phase is critical and supercritical and $C^{2}$. Combined with the a priori interior Hessian estimates proved in [Bha21, Bha22], this solves the Dirichlet boundary value problem for the critical and supercritical Lagrangian mean curvature equation with $C^0$ boundary data. We also provide a uniform gradient estimate for lower regularity phases that satisfy certain additional hypotheses.
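
For orientation, the equation named in this abstract is usually written as follows. This is a sketch of the standard form; the symbols $u$, $\lambda_i$, $\psi$, and $n$ are notation assumed here and do not appear in the abstract itself.

$$
\sum_{i=1}^{n} \arctan \lambda_i\left(D^{2}u\right) = \psi(x),
$$

where $\lambda_1, \dots, \lambda_n$ are the eigenvalues of the Hessian $D^{2}u$ and $\psi$ is the Lagrangian phase. The phase is commonly called critical and supercritical when $|\psi| \geq (n-2)\frac{\pi}{2}$.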

Preprint, 2022, arXiv

Knowledge Graph - Deep Learning: A Case Study in Question Answering in Aviation Safety Domain

In the commercial aviation domain, there are a large number of documents, such as accident reports (NTSB, ASRS) and regulatory directives (ADs). There is a need for a system to access these diverse repositories efficiently in order to service needs in the aviation industry, such as maintenance, compliance, and safety. In this paper, we propose a Knowledge Graph (KG) guided Deep Learning (DL) based Question Answering (QA) system for aviation safety. We construct a Knowledge Graph from aircraft accident reports and contribute this resource to the community of researchers. The efficacy of this resource is tested and demonstrated by the aforesaid QA system. Natural language queries constructed from the documents mentioned above are converted into SPARQL (the interface language of the RDF graph database) queries and answered. On the DL side, we have two different QA models: (i) BERT QA, which is a pipeline of Passage Retrieval (Sentence-BERT based) and Question Answering (BERT based), and (ii) the recently released GPT-3. We evaluate our system on a set of queries created from the accident reports. Our combined QA system achieves a 9.3% increase in accuracy over GPT-3 and a 40.3% increase over BERT QA. Thus, we infer that KG-DL performs better than either alone.

Preprint, 2020, arXiv

Analysis of Selective-Decode and Forward Relaying Protocol Over kappa-mu Fading Channel Distribution

In this work, we examine the performance of selective-decode and forward (S-DF) relay systems over kappa-mu fading channel conditions. We discuss the system model, the probability density function (PDF) and cumulative distribution function (CDF) of the kappa-mu distributed envelope and signal-to-noise ratio (SNR), and techniques to generate samples that follow the kappa-mu distribution. Specifically, we consider the case where the source-to-relay (SR), relay-to-destination (RD) and source-to-destination (SD) links are subject to independent and identically distributed (i.i.d.) kappa-mu fading. From the simulation results, an improvement in the symbol error rate (SER) with a stronger line of sight (LOS) component is observed. This shows that S-DF relaying systems can perform well even in the non-fading or LOS conditions. Monte Carlo simulations are conducted for various values of the fading parameters, and the outcomes closely match the theoretical results, which validates the derivations.
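
The abstract mentions generating samples that follow the kappa-mu distribution but does not say which technique the paper uses. As an illustration only, the sketch below uses the well-known cluster model of kappa-mu fading: the instantaneous power is a sum over mu multipath clusters of squared Gaussian in-phase and quadrature components, with a non-zero mean for the dominant part. It assumes an integer mu and an equal split of the dominant power across clusters. The function name `kappa_mu_power_samples` and its parameters are hypothetical, not taken from the paper.

```python
import math
import random


def kappa_mu_power_samples(kappa, mu, n, omega=1.0, seed=0):
    """Draw n samples of kappa-mu instantaneous power (squared envelope).

    Assumptions (illustrative, not from the paper):
    - mu is a positive integer (number of multipath clusters);
    - the dominant power is split equally across clusters and placed
      on the in-phase component;
    - omega is the mean power E[R^2].
    """
    if mu < 1 or int(mu) != mu:
        raise ValueError("this sketch requires a positive integer mu")
    if kappa < 0:
        raise ValueError("kappa must be non-negative")
    mu = int(mu)
    rng = random.Random(seed)

    # Scattered power per Gaussian component: 2 * mu * sigma^2 = omega / (1 + kappa).
    sigma = math.sqrt(omega / (2.0 * mu * (1.0 + kappa)))
    # Total dominant power is kappa * omega / (1 + kappa); each cluster gets 1/mu of it.
    p = math.sqrt(kappa * omega / ((1.0 + kappa) * mu))

    samples = []
    for _ in range(n):
        power = 0.0
        for _ in range(mu):
            x = rng.gauss(p, sigma)    # in-phase: dominant mean plus scatter
            y = rng.gauss(0.0, sigma)  # quadrature: scatter only
            power += x * x + y * y
        samples.append(power)
    return samples


if __name__ == "__main__":
    draws = kappa_mu_power_samples(kappa=2.0, mu=2, n=20000, seed=1)
    print("sample mean power:", sum(draws) / len(draws))
```

Taking the square root of each sample gives the envelope. With `omega=1.0` the sample mean of the power should be close to 1, which is a quick sanity check before using the draws in a Monte Carlo SER simulation.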

Preprint, 2020, arXiv

Multi-speaker Emotion Conversion via Latent Variable Regularization and a Chained Encoder-Decoder-Predictor Network

We propose a novel method for emotion conversion in speech based on a chained encoder-decoder-predictor neural network architecture. The encoder constructs a latent embedding of the fundamental frequency (F0) contour and the spectrum, which we regularize using the Large Diffeomorphic Metric Mapping (LDDMM) registration framework. The decoder uses this embedding to predict the modified F0 contour in a target emotional class. Finally, the predictor uses the original spectrum and the modified F0 contour to generate a corresponding target spectrum. Our joint objective function simultaneously optimizes the parameters of the three model blocks. We show that our method outperforms the existing state-of-the-art approaches on both the saliency of emotion conversion and the quality of resynthesized speech. In addition, the LDDMM regularization allows our model to convert phrases that were not present in training, thus providing evidence for out-of-sample generalization.

Preprint, 2020, arXiv

Non-parallel Emotion Conversion using a Deep-Generative Hybrid Network and an Adversarial Pair Discriminator

We introduce a novel method for emotion conversion in speech that does not require parallel training data. Our approach loosely relies on a cycle-GAN schema to minimize the reconstruction error from converting back and forth between emotion pairs. However, unlike the conventional cycle-GAN, our discriminator classifies whether a pair of input real and generated samples corresponds to the desired emotion conversion (e.g., A to B) or to its inverse (B to A). We show that this setup, which we refer to as a variational cycle-GAN (VC-GAN), is equivalent to minimizing the empirical KL divergence between the source features and their cyclic counterpart. In addition, our generator combines a trainable deep network with a fixed generative block to implement a smooth and invertible transformation on the input features, in our case, the fundamental frequency (F0) contour. This hybrid architecture regularizes our adversarial training procedure. We use crowdsourcing to evaluate both the emotional saliency and the quality of synthesized speech. Finally, we show that our model generalizes to new speakers by modifying speech produced by WaveNet.