Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
30works
0followers
27topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

30 published item(s)

preprint2026arXiv

Feasibility study of the positronium lifetime imaging with the Biograph Vision Quadra and J-PET tomographs

Background: After its first ex-vivo and in-vivo demonstration, Positronium Lifetime Imaging (PLI) has received considerable interest as a potential new diagnostic biomarker. High sensitivity Positron Emission Tomography (PET) systems are needed for PLI since it requires simultaneous registration of annihilation photons and prompt gamma. In this simulation-based study, a~feasibility of PLI with the long axial field-of-view Biograph Vision Quadra (Quadra) and the Total Body J-PET scanner was investigated. Methods: The study was performed using the GATE software. Background radiation, present within the Quadra tomograph, was added to the simulation. First, the optimal placement of the energy window for the registration of the prompt gamma was investigated. Next, the organ-wise sensitivity of Quadra was calculated for the $^{68}$Ga, $^{44}$Sc, $^{22}$Na and $^{124}$I radioisotopes. Finally, the sensitivity for the scandium isotope was compared to the sensitivities obtainable with the Total Body J-PET scanner, as well as with the modular J-PET prototype. Results: The PLI sensitivities for the Quadra with the background radiation are estimated to 9.22(3), 10.46(4), 5.91(3), and 15.39(4) cps/kBq for the $^{44}$Sc, $^{68}$Ga, $^{22}$Na and $^{124}$I radioisotopes, respectively. The highest sensitivity was obtained when the energy window for the deexcitation photon is adjacent to the energy window for the annihilation photons. The determined PLI sensitivities with Quadra and the Total Body J-PET are in the order of sensitivities of standard PET imaging with the short axial field-of-view ($\sim$20 cm) PET scanners. Conclusion: The organ-wise PLI sensitivity of Quadra has been computed for the $^{68}$Ga, $^{44}$Sc, $^{22}$Na and $^{124}$I radioisotopes. A sensitivity gain by a factor of 150 was estimated relative to the modular J-PET system previously used for the first in-vivo PLI.

preprint2026arXiv

Locale-Conditioned Few-Shot Prompting Mitigates Demonstration Regurgitation in On-Device PII Substitution with Small Language Models

Personally Identifiable Information (PII) redaction usually replaces detected entities with placeholder tokens such as [PERSON], destroying the downstream utility of the redacted text for retrieval and Named Entity Recognition (NER) training. We propose a fully on-device pipeline that substitutes PII with consistent, type-preserving fake values: a 1.5 B mixture-of-experts token classifier (openai/privacy-filter) detects spans, a 1-bit Bonsai-1.7B Small Language Model (SLM) proposes contextual surrogates for names, addresses, and dates, and a rule-based generator (faker) handles patterned fields. We report a prompting finding more important than the quantization choice: with naive fixed three-shot demonstrations, the 1-bit SLM regurgitates demonstration outputs verbatim regardless of input; 1.58-bit Ternary-Bonsai-1.7B reproduces byte-identical failures, ruling out quantization as the cause. We fix this with locale-conditioned rotating few-shot demonstrations: a character-range heuristic picks a locale-pure pool and a per-input MD5 hash samples three demonstrations. With the fix, 482/482 unique Bonsai-1.7B calls succeed (no echoes) and produce locale-correct surrogates, although the SLM still copies from a small same-locale demonstration pool - a residual narrowness we quantify. On a 2000-document multilingual corpus, hybrid perplexity (PPL) beats faker in all six locales under a multilingual evaluator (XGLM-564M); length preservation is best-of-three in 4 of 6 locales. On downstream NER (400 train / 100 test, English), redact yields F1=0.000, faker 0.656, original 0.960; on a matched 160/40 subset including hybrid, faker (0.506) outperforms hybrid (0.346) at p < 0.001. We report this as an honest negative finding: SLM surrogates produce more natural text but a less varied training distribution, and downstream NER benefits more from variety than from naturalness.

preprint2026arXiv

Mind the Pause: Disfluency-Aware Objective Tuning for Multilingual Speech Correction with LLMs

Automatic Speech Recognition (ASR) transcripts often contain disfluencies, such as fillers, repetitions, and false starts, which reduce readability and hinder downstream applications like chatbots and voice assistants. If left unaddressed, such disfluencies can significantly degrade the reliability of downstream systems. Most existing approaches rely on classical models that focus on identifying disfluent tokens for removal. While this strategy is effective to some extent, it often disrupts grammatical structure and semantic coherence, leading to incomplete or unnatural sentences. Recent literature explored the use of large language models (LLMs); however, these efforts have primarily focused on disfluency detection or data augmentation, rather than performing comprehensive correction. We propose a multilingual correction pipeline where a sequence tagger first marks disfluent tokens, and these signals guide instruction fine-tuning of an LLM to rewrite transcripts into fluent text. To further improve reliability, we add a contrastive learning objective that penalizes the reproduction of disfluent tokens, encouraging the model to preserve grammar and meaning while removing disfluent artifacts. Our experiments across three Indian languages, namely Hindi, Bengali, and Marathi show consistent improvements over strong baselines, including multilingual sequence-to-sequence models. These results highlight that detection-only strategies are insufficient. Combining token-level cues with instruction tuning and contrastive learning provides a practical and scalable solution for multilingual disfluency correction in speech-driven NLP systems. We make the codes publicly available at https://github.com/deepak-kumar-98/Mind-the-Pause.

preprint2023arXiv

Hate Raids on Twitch: Echoes of the Past, New Modalities, and Implications for Platform Governance

In the summer of 2021, users on the livestreaming platform Twitch were targeted by a wave of &#34;hate raids,&#34; a form of attack that overwhelms a streamer&#39;s chatroom with hateful messages, often through the use of bots and automation. Using a mixed-methods approach, we combine a quantitative measurement of attacks across the platform with interviews of streamers and third-party bot developers. We present evidence that confirms that some hate raids were highly-targeted, hate-driven attacks, but we also observe another mode of hate raid similar to networked harassment and specific forms of subcultural trolling. We show that the streamers who self-identify as LGBTQ+ and/or Black were disproportionately targeted and that hate raid messages were most commonly rooted in anti-Black racism and antisemitism. We also document how these attacks elicited rapid community responses in both bolstering reactive moderation and developing proactive mitigations for future attacks. We conclude by discussing how platforms can better prepare for attacks and protect at-risk communities while considering the division of labor between community moderators, tool-builders, and platforms.

preprint2022arXiv

Dict-NMT: Bilingual Dictionary based NMT for Extremely Low Resource Languages

Neural Machine Translation (NMT) models have been effective on large bilingual datasets. However, the existing methods and techniques show that the model&#39;s performance is highly dependent on the number of examples in training data. For many languages, having such an amount of corpora is a far-fetched dream. Taking inspiration from monolingual speakers exploring new languages using bilingual dictionaries, we investigate the applicability of bilingual dictionaries for languages with extremely low, or no bilingual corpus. In this paper, we explore methods using bilingual dictionaries with an NMT model to improve translations for extremely low resource languages. We extend this work to multilingual systems, exhibiting zero-shot properties. We present a detailed analysis of the effects of the quality of dictionaries, training dataset size, language family, etc., on the translation quality. Results on multiple low-resource test languages show a clear advantage of our bilingual dictionary-based method over the baselines.

preprint2022arXiv

Linear programming word problems formulation using EnsembleCRF NER labeler and T5 text generator with data augmentations

We propose an ensemble approach to predict the labels in linear programming word problems. The entity identification and the meaning representation are two types of tasks to be solved in the NL4Opt competition. We propose the ensembleCRF method to identify the named entities for the first task. We found that single models didn&#39;t improve for the given task in our analysis. A set of prediction models predict the entities. The generated results are combined to form a consensus result in the ensembleCRF method. We present an ensemble text generator to produce the representation sentences for the second task. We thought of dividing the problem into multiple small tasks due to the overflow in the output. A single model generates different representations based on the prompt. All the generated text is combined to form an ensemble and produce a mathematical meaning of a linear programming problem.

preprint2022arXiv

No Calm in The Storm: Investigating QAnon Website Relationships

QAnon is a far-right conspiracy theory whose followers largely organize online. In this work, we use web crawls seeded from two of the largest QAnon hotbeds on the Internet, Voat and 8kun, to build a QAnon-centered domain-based hyperlink graph. We use this graph to identify, understand, and learn about the set of websites that spread QAnon content online. Specifically, we curate the largest list of QAnon centered websites to date, from which we document the types of QAnon sites, their hosting providers, as well as their popularity. We further analyze QAnon websites&#39; connection to mainstream news and misinformation online, highlighting the outsized role misinformation websites play in spreading the conspiracy. Finally, we leverage the observed relationship between QAnon and misinformation sites to build a highly accurate random forest classifier that distinguishes between misinformation and authentic news sites. Our results demonstrate new and effective ways to study the growing presence of conspiracy theories and misinformation on the Internet.

preprint2022arXiv

OCR Synthetic Benchmark Dataset for Indic Languages

We present the largest publicly available synthetic OCR benchmark dataset for Indic languages. The collection contains a total of 90k images and their ground truth for 23 Indic languages. OCR model validation in Indic languages require a good amount of diverse data to be processed in order to create a robust and reliable model. Generating such a huge amount of data would be difficult otherwise but with synthetic data, it becomes far easier. It can be of great importance to fields like Computer Vision or Image Processing where once an initial synthetic data is developed, model creation becomes easier. Generating synthetic data comes with the flexibility to adjust its nature and environment as and when required in order to improve the performance of the model. Accuracy for labeled real-time data is sometimes quite expensive while accuracy for synthetic data can be easily achieved with a good score.

preprint2022arXiv

On centrally extended Jordan derivations and related maps in rings

Let $R$ be a ring and $Z(R)$ be the center of $R.$ The aim of this paper is to define the notions of centrally extended Jordan derivations and centrally extended Jordan $\ast$-derivations, and to prove some results involving these mappings. Precisely, we prove that if a $2$-torsion free noncommutative prime ring $R$ admits a centrally extended Jordan derivation (resp. centrally extended Jordan $\ast$-derivation) $δ:R\to R$ such that \[ [δ(x),x]\in Z(R)~~(resp.~~[δ(x),x^{\ast}]\in Z(R))\text{~for~all~}x\in R, \] where $&#39;\ast&#39;$ is an involution on $R,$ then $R$ is an order in a central simple algebra of dimension at most 4 over its center.

preprint2022arXiv

On periodic and compactly supported least energy solutions to semilinear elliptic equations with non-Lipschitz nonlinearity

We discuss the existence and non-existence of periodic in one variable and compactly supported in the other variables least energy solutions for equations with non-Lipschitz nonlinearity of the form: $-Δu=λu^p - u^q$ in $\mathbb{R}^{N+1}$, where $ 0< q < p \leq 1$, $λ\in \mathbb{R}$. The approach is based on the Nehari manifold method supplemented by a one-sided constraint given through the functional of the suitable Pohozaev identity. The limit value of the parameter $λ$, where the approach is applicable, corresponds to the existence of periodic in one variable and compactly supported in the other variables least energy solutions. This value is found through the extrem values of nonlinear generalized Rayleigh quotients and the so-called curve of the critical exponents of $p,q$. Important properties of the solutions are derived, such as that they are not trivial with respect to the periodic variable and do not coincide with compactly supported solutions on the entire space $\mathbb{R}^{N+1}$.

preprint2021arXiv

Dynamics of QCD Matter -- current status

In this article, there are 18 sections discussing various current topics in the field of relativistic heavy-ion collisions and related phenomena, which will serve as a snapshot of the current state of the art. Section 1 reviews experimental results of some recent light-flavored particle production data from ALICE collaboration. Other sections are mostly theoretical in nature. Very strong but transient magnetic field created in relativistic heavy-ion collisions could have important observational consequences. This has generated a lot of theoretical activity in the last decade. Sections 2, 7, 9, 10 and 11 deal with the effects of the magnetic field on the properties of the QCD matter. There are several unanswered questions about the QCD phase diagram. Sections 3, 11 and 18 discuss various aspects of the QCD phase diagram and phase transitions. Recent years have witnessed interesting developments in foundational aspects of hydrodynamics and their application to heavy-ion collisions. Sections 12, 15, 16 and 17 of this article probe some aspects of this exciting field. Transport coefficients together with their temperature- and density-dependence, are essential inputs in hydrodynamical calculations. Sections 5, 8 and 14 deal with calculation/estimation of various transport coefficients (shear and bulk viscosity, thermal conductivity, relaxation times, etc.) of quark matter and hadronic matter. Sections 4, 6 and 13 deals with interesting new developments in the field. Section 4 discusses color dipole gluon distribution function at small transverse momentum in the form of a series of Bells polynomials. Section 6 discusses the properties of Higgs boson in the quark gluon plasma using Higgs-quark interaction. Section 13 discusses modification of coalescence model to incorporate viscous corrections and application of this model.

preprint2020arXiv

A qualitative study of (p,q) Singular parabolic equations: local existence, Sobolev regularity and asymptotic behaviour

The purpose of the article is to study the existence, regularity, stabilization and blow up results of weak solution to the following parabolic $(p,q)$-singular equation: \begin{equation*} (P_t)\; \left\{\begin{array}{rllll} u_t-Δ_{p}u -Δ_{q}u & = \vth \; u^{-\de}+ f(x,u), \; u>0 \text{ in } \Om\times (0,T), \\ u&=0 \quad \text{ on } \pa\Om\times (0,T), u(x,0)&= u_0(x) \; \text{ in }\Om, \end{array} \right. \end{equation*} where $\Om$ is a bounded domain in $\mathbb{R}^N$ with $C^2$ boundary $\pa\Om$, $1<q<p< \infty$, $0<\de, T>0$, $N\ge 2$ and $\vth>0$ is a parameter. Moreover, we assume that $f:\Om\times [0,\infty) \to \mb R$ is a bounded below Carathéodory function, locally Lipschitz with respect to the second variable uniformly in $x\in\Om$ and $u_0\in L^\infty(\Om)\cap W^{1,p}_0(\Om)$. We distinguish the cases as $q$-subhomogeneous and $q$-superhomogeneous depending on the growth of $f$ (hereafter we will drop the term $q$). In the subhomogeneous case, we prove the existence and uniqueness of the weak solution to problem $(P_t)$ for $\de<2+1/(p-1)$. For this, we first study the stationary problems corresponding to $(P_t)$ by using the method of sub and super solutions and subsequently employing implicit Euler method, we obtain the existence of a solution to $(P_t)$. Furthermore, in this case, we prove the stabilization result, that is, the solution $u(t)$ of $(P_t)$ converges to $u_\infty$, the unique solution to the stationary problem, in $L^\infty(\Om)$ as $t\ra\infty$. For the superhomogeneous case, we prove the local existence theorem by taking help of nonlinear semigroup theory. Subsequently, we prove finite time blow up of solution to problem $(P_t)$ for small parameter $\vartheta>0$ in the case $\de\leq 1$ and for all $\vth>0$ in the case $\de>1$.

preprint2020arXiv

An exact analytical scheme using a new potential to solve one-dimensional quantum systems

We propose an exact method for solving a one-dimensional Schrödinger equation. An arbitrary potential is represented by the collection of short-width potentials. For building the collection scheme, a new solvable potential is introduced. It is based on the simple expansion of the wavefunction of the introduced potential. The illustration of the scheme is done by reproducing the results of the rectangular potential. The scheme has computational advantages and the transmission properties, eigenenergies can be calculated efficiently. The presented scheme is compared with the other similar schemes in terms of computational complexity, analytical solubility, etc.. A \textit{Mathematica} code is provided in the supplementary file that solves the Schrödinger equation with arbitrary potential function $V(x)$ and effective mass $m(x)$.

preprint2020arXiv

Bremsstrahlung emission and plasma characterization driven by moderately relativistic laser-plasma interactions

Relativistic electrons generated by the interaction of petawatt-class short laser pulses with solid targets can be used to generate bright X-rays via bremsstrahlung. The efficiency of laser energy transfer into these electrons depends on multiple parameters including the focused intensity and pre-plasma level. This paper reports experimental results from the interaction of a high intensity petawatt-class glass laser pulses with solid targets at a maximum intensity of $10^{19}$ W/cm$^2$. In-situ measurements of specularly reflected light are used to provide an upper bound of laser absorption and to characterize focused laser intensity, the pre-plasma level and the generation mechanism of second harmonic light. The measured spectrum of electrons and bremsstrahlung radiation provide information about the efficiency of laser energy transfer.

preprint2020arXiv

Singular elliptic problems with unbalanced growth and critical exponent

In this article, we study the existence and multiplicity of solutions of the following $(p,q)$-Laplace equation with singular nonlinearity: \begin{equation*} \left\{\begin{array}{rllll} -Δ_{p}u-\baΔ_{q}u & = \la u^{-\de}+ u^{r-1}, \ u>0, \ \text{ in } \Om \\ u&=0 \quad \text{ on } \pa\Om, \end{array} \right. \end{equation*} where $\Om$ is a bounded domain in $\mathbb{R}^n$ with smooth boundary, $1< q< p<r \leq p^{*}$, where $p^{*}=\ds \frac{np}{n-p}$, $0<\de< 1$, $n> p$ and $\la,\, \ba>0$ are parameters. We prove existence, multiplicity and regularity of weak solutions of $(P_\la)$ for suitable range of $\la$. We also prove the global existence result for problem $(P_\la)$.

preprint2020arXiv

Thermoelectric transport coefficients of quark matter

A thermal gradient and/or a chemical potential gradient in a conducting medium can lead to an electric field, an effect known as thermoelectric effect or Seebeck effect. In the context of heavy-ion collisions, we estimate the thermoelectric transport coefficients for quark matter within the ambit of the Nambu-Jona Lasinio (NJL) model. We estimate the thermal conductivity, electrical conductivity, and the Seebeck coefficient of hot and dense quark matter. These coefficients are calculated using the relativistic Boltzmann transport equation within relaxation time approximation. The relaxation times for the quarks are estimated from the quark-quark and quark-antiquark scattering through in-medium meson exchange within the NJL model.

preprint2020arXiv

Unbalanced $(p,2)$-fractional problems with critical growth

We study the existence, multiplicity and regularity results of non-negative solutions of following doubly nonlocal problem: $$ (P_\la) \left\{ \begin{array}{lr}\ds \quad (-Δ)^{s_1}u+\ba (-Δ)^{s_2}_{p}u = \la a(x)|u|^{q-2}u+ \left(\int_{\Om}\frac{|u(y)|^r}{|x-y|^μ}~dy\right)|u|^{r-2} u \quad \text{in}\; \Om, \quad \quad\quad \quad u =0\quad \text{in} \quad \mb R^n\setminus \Om, \end{array} \right. $$ where $\Om\subset\mb R^n$ is a bounded domain with $C^2$ boundary $\pa\Om$, $0<s_2 < s_1<1$, $n> 2 s_1$, $1< q<p< 2$, $1<r \leq 2^{*}_μ$ with $2^{*}_μ=\frac{2n-μ}{n-2s_1}$, $\la,\ba>0$ and $a\in L^{\frac{d}{d-q}}(\Om)$, for some $q<d<2^{*}_{s_1}:=\frac{2n}{n-2s_1}$, is a sign changing function. We prove that each nonnegative weak solution of $(P_\la)$ is bounded. Furthermore, we obtain some existence and multiplicity results using Nehari manifold method.

preprint2019arXiv

Investigation of the phase separation property in La$_{0.2}$Pr$_{0.4}$Ca$_{0.4}$MnO$_3$ manganite

We report a comprehensive investigation of La0.2Pr0.4Ca0.4MnO3 to clarify the micrometre scale phase separation phenomenon in the mixed valent manganite (La,Pr,Ca)MnO3. The compound shows multiple magnetic transitions, in which the charge-ordered state is converted into a ferromagnetic state in steps with the application of a magnetic field. The ac susceptibility measurements show that the glassy transition at low temperatures does not depend on the frequency, thus indicating the absence of any spin glass behaviour. Magnetization as well as heat capacity measurements indicate that this low temperature transition is magnetic field dependent. The field dependent resistivity at 2K shows a sharp drop indicating that the sample behaviour changes from a high resistive state to a low resistive state, corroborating the conversion of charge-ordered insulating (COI) phase to a ferromagnetic metallic (FMM) phase. Our results point towards the existence of phase separation, rigidity of the low temperature glassy-like phase as well as the conversion of COI phase to FMM phase by the application of magnetic fields.

preprint2015arXiv

A controllable single photon beam-splitter as a node of a quantum network

A model for a controlled single-photon beam-splitter is proposed and analysed. It consists of two crossed optical-cavities with overlapping waists, dynamically coupled to a single flying atom. The system is shown to route a single photon with near-unity efficiency in an effective &#34;weak-coupling&#34; regime. Furthermore, two such nodes, forming a segment of a quantum network, are shown to perform several controlled quantum operations. All one-qubit operations involve a transfer of a photon from one cavity to another in a single node, while two-qubit operations involve transfer from one node to a next one, coupled via an optical fiber. Novel timing protocols for classical optical fields are found to simplify possible experimental realizations along with achievable effective parameter regime. Though our analysis here is restricted to a cavity-QED scenario, basic features of the model can be extended to various other physical systems including gated quantum dots, circuit-QED or opto-mechanical elements.

preprint2014arXiv

Quadratically constrained quadratic programming for classification using particle swarms and applications

Particle swarm optimization is used in several combinatorial optimization problems. In this work, particle swarms are used to solve quadratic programming problems with quadratic constraints. The approach of particle swarms is an example for interior point methods in optimization as an iterative technique. This approach is novel and deals with classification problems without the use of a traditional classifier. Our method determines the optimal hyperplane or classification boundary for a data set. In a binary classification problem, we constrain each class as a cluster, which is enclosed by an ellipsoid. The estimation of the optimal hyperplane between the two clusters is posed as a quadratically constrained quadratic problem. The optimization problem is solved in distributed format using modified particle swarms. Our method has the advantage of using the direction towards optimal solution rather than searching the entire feasible region. Our results on the Iris, Pima, Wine, and Thyroid datasets show that the proposed method works better than a neural network and the performance is close to that of SVM.

preprint2014arXiv

Spreading of triboelectrically charged granular matter

We report on the spreading of triboelectrically charged glass particles on an oppositely charged surface of a plastic cylindrical container in the presence of a constant mechanical agitation. The particles spread via sticking, as a monolayer on the cylinder&#39;s surface. Continued agitation initiates a sequence of instabilities of this monolayer, which first forms periodic wavy-stripe-shaped transverse density modulation in the monolayer and then ejects narrow and long particle-jets from the tips of these stripes. These jets finally coalesce laterally to form a homogeneous spreading front that is layered along the spreading direction. These remarkable growth patterns are related to a time evolving frictional drag between the moving charged glass particles and the countercharges on the plastic container. The results provide insight into the multiscale time-dependent tribolelectric processes and motivates further investigation into the microscopic causes of these macroscopic dynamical instabilities and spatial structures.

preprint2013arXiv

Data Hiding in Binary Image using Block Parity

Secret data hiding in binary images is more difficult than other formats since binary images require only one bit representation to indicate black and white. This study proposes a new method for data hiding in binary images using optimized bit position to replace a secret bit. This method manipulates blocks, which are sub-divided. The parity bit for a specified block decides whether to change or not, to embed a secret bit. By finding the best position to insert a secret bit for each divided block, the image quality of the resulting stego-image can be improved, while maintaining low computational complexity.The experimental results show that the proposed method has an improvement with respect to a previous work.

preprint2013arXiv

Transition of a particle between adjacent optical traps: A study using catastrophe theory

In spite of the widespread use of optical tweezers as a quantitative tool to measure small forces, there exists no unambiguous and simple experimental method for either validating its theoretically predicted form or empirically parameterizing it over the entire range. This problem is addressed by studying the transition of a colloidal particle between two spatially separated optical traps. The transition as a function of the relative intensity of the traps and the separation between them reveals a formal resemblance to the `butterfly catastrophe&#39; which also maps onto to phase transitions observed, for example in ferroelectrics, on a phenomenological level. The method has been used to experimentally determine the force-displacement curve for an optical trap over its entire range.

preprint2012arXiv

Benchmarking recognition results on word image datasets

We have benchmarked the maximum obtainable recognition accuracy on various word image datasets using manual segmentation and a currently available commercial OCR. We have developed a Matlab program, with graphical user interface, for semi-automated pixel level segmentation of word images. We discuss the advantages of pixel level annotation. We have covered five databases adding up to over 3600 word images. These word images have been cropped from camera captured scene, born-digital and street view images. We recognize the segmented word image using the trial version of Nuance Omnipage OCR. We also discuss, how the degradations introduced during acquisition or inaccuracies introduced during creation of word images affect the recognition of the word present in the image. Word images for different kinds of degradations and correction for slant and curvy nature of words are also discussed. The word recognition rates obtained on ICDAR 2003, Sign evaluation, Street view, Born-digital and ICDAR 2011 datasets are 83.9%, 89.3%, 79.6%, 88.5% and 86.7% respectively.

preprint2012arXiv

FST Based Morphological Analyzer for Hindi Language

Hindi being a highly inflectional language, FST (Finite State Transducer) based approach is most efficient for developing a morphological analyzer for this language. The work presented in this paper uses the SFST (Stuttgart Finite State Transducer) tool for generating the FST. A lexicon of root words is created. Rules are then added for generating inflectional and derivational words from these root words. The Morph Analyzer developed was used in a Part Of Speech (POS) Tagger based on Stanford POS Tagger. The system was first trained using a manually tagged corpus and MAXENT (Maximum Entropy) approach of Stanford POS tagger was then used for tagging input sentences. The morphological analyzer gives approximately 97% correct results. POS tagger gives an accuracy of approximately 87% for the sentences that have the words known to the trained model file, and 80% accuracy for the sentences that have the words unknown to the trained model file.

preprint2012arXiv

Quantum processing by adiabatic transfer through a manifold of dark states

We consider a network whose nodes are electromagnetic cavities, each coupled to a single three-level atom. The nodes are connected by optical fibers. Each atom is addressed by a control laser, which along with the cavity field drives atomic transitions. The network can be in the form of chain or two and three dimensional arrays of $N$-cavities connected by $N_B$ fibers. Following the work on two-cavity system by Pellizzari, we find that under certain conditions, the system possesses two kinds of dark states. The first kind are $N$ states corresponding to atomic excitations at each node and these are our logical states for quantum processing. The second kind are $N_B$ degenerate dark states on pairs of sites connected by a fibre. By manipulating intensities and phases of control lasers on the cavities, one can pass adiabatically among these dark states due to their degeneracy. This network operates as a $N$-level quantum system in which one can generate computationally useful states by protocols of external controls. We obtain numerical results for small chains and lattices to demonstrate some quantum operations like the transport of states across the array, generation of W-states and Fourier-like states. We also discuss effects of dissipation and limitations of the model.

preprint2010arXiv

Analysis of adiabatic transfer in cavity QED

A three-level atom in a $Λ$ configuration trapped in an optical cavity forms a basic unit in a number of proposed protocols for quantum information processing. Through control with an appropriate laser, this system allows for efficient storage of cavity photons into long-lived atomic excitations, and their retrieval with high fidelity. This process presumes an adiabatic transfer through the `dark state&#39;, a coherent superposition of the two lower levels of the $Λ$ system, by a slow variation of the intensity of the control laser. We study the full quantum mechanics of this transfer process with a view to examine the nonadiabatic effects, as the control laser is varied in time. The nonadiabatic effects arise due to inevitable excitations of the system to states involving the highest level of the $Λ$ configuration, which is radiative. We formulate the problem in terms of the instantaneous eigenstates and solve it numerically, allowing for spontaneous decays from the excited level. We present detailed results for both storage and retrieval for a suitable range of variation rates and intensity of the control laser, and examine optimum conditions for the transfer process.

preprint2010arXiv

Anomalous transport and phonon renormalization in a chain with transverse and longitudinal vibrations

We study thermal transport in a chain of coupled atoms, which can vibrate in longitudinal as well as transverse directions. The particles interact through anharmonic potentials upto cubic order. The problem is treated quantum mechanically. We first calculate the phonon frequencies self-consistently taking into account the anharmonic interactions. We show that for all the modes, frequencies must have linear dispersion with wave-vector $q$ for small $q$ irrespective of their bare dispersions. We then calculate the phonon relaxation rates $Γ_i(q)$, where $i$ is the polarization index of the mode, in a self-consistent approximation based on second order perturbation diagrams. We find that the relaxation rate for the longitudinal phonon, $Γ_x(q) \propto q^{3/2}$, while that for the transverse phonon $Γ_y(q) \propto q^2$. The consequence of these results on the thermal conductivity $κ(N)$ of a chain of $N$ particles is that $κ(N) \propto N^{1/2}$.

preprint2009arXiv

The Murchison Widefield Array: Design Overview

The Murchison Widefield Array (MWA) is a dipole-based aperture array synthesis telescope designed to operate in the 80-300 MHz frequency range. It is capable of a wide range of science investigations, but is initially focused on three key science projects. These are detection and characterization of 3-dimensional brightness temperature fluctuations in the 21cm line of neutral hydrogen during the Epoch of Reionization (EoR) at redshifts from 6 to 10, solar imaging and remote sensing of the inner heliosphere via propagation effects on signals from distant background sources,and high-sensitivity exploration of the variable radio sky. The array design features 8192 dual-polarization broad-band active dipoles, arranged into 512 tiles comprising 16 dipoles each. The tiles are quasi-randomly distributed over an aperture 1.5km in diameter, with a small number of outliers extending to 3km. All tile-tile baselines are correlated in custom FPGA-based hardware, yielding a Nyquist-sampled instantaneous monochromatic uv coverage and unprecedented point spread function (PSF) quality. The correlated data are calibrated in real time using novel position-dependent self-calibration algorithms. The array is located in the Murchison region of outback Western Australia. This region is characterized by extremely low population density and a superbly radio-quiet environment,allowing full exploitation of the instrumental capabilities.