Researcher profile

Ritesh Kumar

Ritesh Kumar contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
12works
0followers
9topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

12 published item(s)

preprint2026arXiv

From Knowledge to Action: Outcomes of the 2025 Large Language Model (LLM) Hackathon for Applications in Materials Science and Chemistry

Large language models (LLMs) are rapidly changing how researchers in materials science and chemistry discover, organize, and act on scientific knowledge. This paper analyzes a broad set of community-developed LLM applications in an effort to identify emerging patterns in how these systems can be used across the scientific research lifecycle. We organize the projects into two complementary categories: Knowledge Infrastructure, systems that structure, retrieve, synthesize, and validate scientific information; and Action Systems, systems that execute, coordinate, or automate scientific work across computational and experimental environments. The submissions reveal a shift from single-purpose LLM tools toward integrated, multi-agent workflows that combine retrieval, reasoning, tool use, and domain-specific validation. Prominent themes include retrieval-augmented generation as grounding infrastructure, persistent structured knowledge representations, multimodal and multilingual scientific inputs, and early progress toward laboratory-integrated closed-loop systems. Together, these results suggest that LLMs are evolving from general-purpose assistants into composable infrastructure for scientific reasoning and action. This work provides a community snapshot of that transition and a practical taxonomy for understanding emerging LLM-enabled workflows in materials science and chemistry.

preprint2025arXiv

Selective Amplification of the Topological Hall Signal in Cr$_2$Te$_3$: The Role of Molecular Exchange Coupling

Layered magnetic transition-metal chalcogenides (TMCs) are a focal point of research, revealing a variety of intriguing magnetic and topological ground states. Within this family of TMCs, chromium telluride has garnered significant attention because of its excellent tunability in magnetic response, owing to the presence of competing magnetic exchange interactions. We here demonstrate the manipulation of magnetic anisotropy in ultra-thin Cr$_2$Te$_3$ films through growth engineering leading to a controlled transition from in-plane to out-of-plane orientation with an intermediate non-coplanar magnetic ground phase characterized by a topological Hall effect. Moreover, interfacing these films with Vanadyl phthalocyanine (VOPc) molecules prominently enhances the non-coplanar magnetic phase, attributing its presence to the competing interfacial magnetic exchange interactions over the spin-orbit-driven interfacial effects. These findings pave the way for the realization of novel topological spintronic devices through interface-modulated exchange coupling.

preprint2022arXiv

Aggression in Hindi and English Speech: Acoustic Correlates and Automatic Identification

In the present paper, we will present the results of an acoustic analysis of political discourse in Hindi and discuss some of the conventionalised acoustic features of aggressive speech regularly employed by the speakers of Hindi and English. The study is based on a corpus of slightly over 10 hours of political discourse and includes debates on news channel and political speeches. Using this study, we develop two automatic classification systems for identifying aggression in English and Hindi speech, based solely on an acoustic model. The Hindi classifier, trained using 50 hours of annotated speech, and English classifier, trained using 40 hours of annotated speech, achieve a respectable accuracy of over 73% and 66% respectively. In this paper, we discuss the development of this annotated dataset, the experiments for developing the classifier and discuss the errors that it makes.

preprint2022arXiv

Annotated Speech Corpus for Low Resource Indian Languages: Awadhi, Bhojpuri, Braj and Magahi

In this paper we discuss an in-progress work on the development of a speech corpus for four low-resource Indo-Aryan languages -- Awadhi, Bhojpuri, Braj and Magahi using the field methods of linguistic data collection. The total size of the corpus currently stands at approximately 18 hours (approx. 4-5 hours each language) and it is transcribed and annotated with grammatical information such as part-of-speech tags, morphological features and Universal dependency relationships. We discuss our methodology for data collection in these languages, most of which was done in the middle of the COVID-19 pandemic, with one of the aims being to generate some additional income for low-income groups speaking these languages. In the paper, we also discuss the results of the baseline experiments for automatic speech recognition system in these languages.

preprint2022arXiv

Developing Universal Dependency Treebanks for Magahi and Braj

In this paper, we discuss the development of treebanks for two low-resourced Indian languages - Magahi and Braj based on the Universal Dependencies framework. The Magahi treebank contains 945 sentences and Braj treebank around 500 sentences marked with their lemmas, part-of-speech, morphological features and universal dependencies. This paper gives a description of the different dependency relationship found in the two languages and give some statistics of the two treebanks. The dataset will be made publicly available on Universal Dependency (UD) repository (https://github.com/UniversalDependencies/UD_Magahi-MGTB/tree/master) in the next(v2.10) release.

preprint2022arXiv

Diagnosing Data from ICTs to Provide Focused Assistance in Agricultural Adoptions

In the last two decades, ICTs have played a pivotal role in empowering rural populations in India by making knowledge more accessible. Digital Green (DG) is one such ICT that employs a participatory approach with smallholder farmers to produce instructional videos that encompass content specific to them. With help of human mediators, they disseminate these videos using projectors to improve the adoption of agricultural practices. DG's web-based data tracker stores attendance and adoption logs of millions of farmers, videos screened and their demographic information. We leverage this data for a period of ten years between 2010-2020 across five states in India and use it to conduct a holistic evaluation of the ICT. First, we find disparities in adoption rates of farmers, following which we use statistical tests to identify different factors that lead to these disparities and gender-based inequalities. Second, to provide assistance to farmers facing challenges, we model the adoption of practices from a video as a prediction problem and experiment with different model architectures. Our classifier achieves accuracies ranging from 79% to 90% across the five states, demonstrating its potential for assisting future ethnographic investigations. Third, we use SHAP values in conjunction with our model for explaining the impact of various network, content and demographic features on adoption. Our research finds that farmers greatly benefit from past adopters of a video from their group and village. We also discover that videos with a low content-specificity benefit some farmers more than others. Next, we highlight the implications of our findings by translating them into recommendations for community building, revisiting participatory approach and mitigating inequalities. We conclude with a discussion on how our work can assist future investigations into the lived experiences of farmers.

preprint2022arXiv

Language Resources and Technologies for Non-Scheduled and Endangered Indian Languages

In the present paper, we will present a survey of the language resources and technologies available for the non-scheduled and endangered languages of India. While there have been different estimates from different sources about the number of languages in India, it could be assumed that there are more than 1,000 languages currently being spoken in India. However barring some of the 22 languages included in the 8th Schedule of the Indian Constitution (called the scheduled languages), there is hardly any substantial resource or technology available for the rest of the languages. Nonetheless there have been some individual attempts at developing resources and technologies for the different languages across the country. Of late, some financial support has also become available for the endangered languages. In this paper, we give a summary of the resources and technologies for those Indian languages which are not included in the 8th schedule of the Indian Constitution and/or which are endangered.

preprint2022arXiv

UniMorph 4.0: Universal Morphology

The Universal Morphology (UniMorph) project is a collaborative effort providing broad-coverage instantiated normalized morphological inflection tables for hundreds of diverse world languages. The project comprises two major thrusts: a language-independent feature schema for rich morphological annotation and a type-level resource of annotated data in diverse languages realizing that schema. This paper presents the expansions and improvements made on several fronts over the last couple of years (since McCarthy et al. (2020)). Collaborative efforts by numerous linguists have added 67 new languages, including 30 endangered languages. We have implemented several improvements to the extraction pipeline to tackle some issues, e.g. missing gender and macron information. We have also amended the schema to use a hierarchical structure that is needed for morphological phenomena like multiple-argument agreement and case stacking, while adding some missing morphological features to make the schema more inclusive. In light of the last UniMorph release, we also augmented the database with morpheme segmentation for 16 languages. Lastly, this new release makes a push towards inclusion of derivational morphology in UniMorph by enriching the data and annotation schema with instances representing derivational processes from MorphyNet.

preprint2020arXiv

Developing a Multilingual Annotated Corpus of Misogyny and Aggression

In this paper, we discuss the development of a multilingual annotated corpus of misogyny and aggression in Indian English, Hindi, and Indian Bangla as part of a project on studying and automatically identifying misogyny and communalism on social media (the ComMA Project). The dataset is collected from comments on YouTube videos and currently contains a total of over 20,000 comments. The comments are annotated at two levels - aggression (overtly aggressive, covertly aggressive, and non-aggressive) and misogyny (gendered and non-gendered). We describe the process of data collection, the tagset used for annotation, and issues and challenges faced during the process of annotation. Finally, we discuss the results of the baseline experiments conducted to develop a classifier for misogyny in the three languages.

preprint2020arXiv

The pressure-enhanced superconducting phase of Sr$_x$-Bi$_2$Se$_3$ probed by hard point contact spectroscopy

The superconducting systems emerging from topological insulators upon metal ion intercalation or application of high pressure are ideal for investigation of possible topological superconductivity. In this context, Sr-intercalated Bi$_2$Se$_3$ is specially interesting because it displays pressure induced re-entrant superconductivity where the high pressure phase shows almost two times higher $T_c$ than the ambient superconducting phase ( $T_C\sim$ 2.9 K). Interestingly, unlike the ambient phase, the pressure-induced superconducting phase shows strong indication of unconventional superconductivity. However, since the pressure-induced phase remains inaccessible to spectroscopic techniques, the detailed study of the phase remained an unattained goal. Here we show that the high-pressure phase can be realized under a mesoscopic point contact, where transport spectroscopy can be used to probe the spectroscopic properties of the pressure-induced phase. We find that the point contact junctions on the high-pressure phase show unusual response to magnetic field supporting the possibility of unconventional superconductivity.

preprint2016arXiv

Evidence of a pseudogap driven by competing orders of multi-band origin in the ferromagnetic superconductor Sr$_{0.5}$Ce$_{0.5}$FBiS$_2$

From temperature and magnetic field dependent point-contact spectroscopy on the ferromagnetic superconductor Sr$_{0.5}$Ce$_{0.5}$FBiS$_2$ (bulk superconducting $T_c$ = 2.5 K) we observe (a) a pseudogap in the normal state that sustains to a remarkably high temperature of 40 K and (b) two-fold enhancement of $T_c$ upto 5 K in the point-contact geometry. In addition, Andreev reflection spectroscopy reveals a superconducting gap of 6 meV for certain point-contacts suggesting that the mean field $T_c$ of this system could be approximately 40 K, the onset temperature of pseudo-gap. Our results suggest that quantum fluctuations originating from other competing orders in Sr$_{0.5}$Ce$_{0.5}$FBiS$_2$ forbid a global phase coherence at high temperatures thereby suppressing $T_c$. Apart from the known ordering to a ferromagnetic state, our first-principles calculations reveal nesting of a multi-band Fermi surface and a significant electron-phonon coupling that could result in charge density wave-like instabilities.

preprint2012arXiv

Nanostructured antimony tin oxide synthesized via chemical precipitation method: its characterization and application in humidity sensing

In present investigation we report the synthesis of antimony tin oxide nanoparticles via chemical precipitation method. The synthesized material was characterized using X-ray diffractometer, Scanning Electron Microscope, UV-visible absorption spectroscopy. XRD shows the crystalline nature of the synthesized material and the crystallite size was estimated by using Debye-Scherer equation and its minimum value was 3 nm. Pelletization of synthesized material was done using hydraulic press machine under uniform pressure of 616 MPa. Then the pellets were annealed at 200, 400 and 600°C. Further each pellet was put in humidity sensing chamber and corresponding variations in resistance with relative humidity (%RH) were measured. The average sensitivity was calculated by taking the average of all sensitivities ranging from 10 to 90% RH. The average sensitivity of the pellet annealed at 600°C was best among all the sensing pellets and was 2.18 KΩ/%RH. Results were reproducible {\pm}84% after 2 months.