Researcher profile

Arjun Mani

Arjun Mani contributes to research discovery and scholarly infrastructure.

Researcher · Affiliation not imported · Open to collaborate

Trust snapshot

Quick read

Trust 13 - Unverified
Verification L1
Unclaimed author
2 works
0 followers
2 topics
4 close collaborators

Published work

2 published items

Preprint · 2022 · arXiv

Point and Ask: Incorporating Pointing into Visual Question Answering

Visual Question Answering (VQA) has become one of the key benchmarks of visual recognition progress. Multiple VQA extensions have been explored to better simulate real-world settings: different question formulations, changing training and test distributions, conversational consistency in dialogues, and explanation-based answering. In this work, we further expand this space by considering visual questions that include a spatial point of reference. Pointing is a nearly universal gesture among humans, and real-world VQA is likely to involve a gesture towards the target region. Concretely, we (1) introduce and motivate point-input questions as an extension of VQA, (2) define three novel classes of questions within this space, and (3) for each class, introduce both a benchmark dataset and a series of baseline models to handle its unique challenges. There are two key distinctions from prior work. First, we explicitly design the benchmarks to require the point input, i.e., we ensure that the visual question cannot be answered accurately without the spatial reference. Second, we explicitly explore the more realistic point spatial input rather than the standard but unnatural bounding box input. Through our exploration we uncover and address several visual recognition challenges, including the ability to infer human intent, reason both locally and globally about the image, and effectively combine visual, language and spatial inputs. Code is available at: https://github.com/princetonvisualai/pointingqa .

Preprint · 2019 · arXiv

Disordered contacts can localize chiral edge electrons

Chiral integer quantum Hall (QH) edge modes are immune to backscattering and therefore are non-localized and show a vanishing longitudinal as well as non-local resistance along with quantized 2-terminal and Hall resistance even in the presence of sample disorder. However, this is not the case for contact disorder, which refers to the possibility that a contact can reflect edge modes either partially or fully. This paper shows that when all contacts are disordered in an N-terminal quantum Hall bar, then transport via chiral QH edge modes can have a significant localization correction. The Hall and 2-terminal resistance in an N-terminal quantum Hall sample deviate from their values derived while neglecting the phase acquired at disordered contacts, and this deviation is called the quantum localization correction. This correction term increases with the increase of disorderedness of contacts but decreases with the increase in the number of contacts in an N-terminal Hall bar. The presence of inelastic scattering, however, can completely destroy the quantum localization correction.