Researcher profile

Mohammad Ridwan Kabir

Mohammad Ridwan Kabir contributes to research discovery and scholarly infrastructure.

Published work

4 published items

Preprint, 2026, arXiv

Auxilio and Beyond: Comparative Evaluation, Usability, and Design Guidelines for Head Movement-based Assistive Mouse Controllers

Upper limb disability due to neurological disorders or other factors restricts computer interaction for affected individuals using a generic optical mouse. This work reports the findings of a comparative evaluation of Auxilio, a sensor-based wireless head-mounted Assistive Mouse Controller (AMC), that facilitates computer interaction for such individuals. Combining commercially available, low-cost motion and infrared sensors, Auxilio utilizes head movements and cheek muscle twitches for mouse control. Its performance in pointing tasks with subjects without motor impairments has been juxtaposed against a commercially available and patented vision-based head-tracking AMC developed for similar stakeholders. Furthermore, our study evaluates the usability of Auxilio using the System Usability Scale, supplemented by a qualitative analysis of participant interview transcripts to identify the strengths and weaknesses of both AMCs. Experimental results demonstrate the feasibility and effectiveness of Auxilio, and we summarize our key findings into design guidelines for the development of similar future AMCs.
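The System Usability Scale mentioned in this abstract has a standard scoring rule: ten items rated 1 to 5, where odd-numbered items contribute (rating - 1), even-numbered items contribute (5 - rating), and the sum is multiplied by 2.5 to give a score from 0 to 100. A minimal sketch of that rule follows. The function name `sus_score` is hypothetical, and the sample ratings are made up for illustration; they are not data from the study.

```python
def sus_score(responses):
    """Return the 0-100 SUS score for ten 1-5 ratings, item 1 first.

    Standard SUS scoring: odd items are positively worded (rating - 1),
    even items are negatively worded (5 - rating); the sum is scaled by 2.5.
    """
    if len(responses) != 10:
        raise ValueError("SUS needs exactly ten item responses")
    if any(r not in (1, 2, 3, 4, 5) for r in responses):
        raise ValueError("each response must be an integer from 1 to 5")

    total = 0
    for item_number, rating in enumerate(responses, start=1):
        if item_number % 2 == 1:
            total += rating - 1   # odd item: higher rating is better
        else:
            total += 5 - rating   # even item: lower rating is better
    return total * 2.5


# Illustrative ratings only (not taken from the paper).
example = [4, 2, 4, 1, 5, 2, 4, 2, 4, 2]
print(sus_score(example))
```

Per-participant scores computed this way are usually averaged across participants before comparing systems.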

Preprint, 2022, arXiv

A Case Study on the Independence of Speech Emotion Recognition in Bangla and English Languages using Language-Independent Prosodic Features

A language-agnostic approach to recognizing emotions from speech remains an incomplete and challenging task. In this paper, we performed a step-by-step comparative analysis of Speech Emotion Recognition (SER) using Bangla and English languages to assess whether distinguishing emotions from speech is independent of language. Six emotions were categorized for this study: happy, angry, neutral, sad, disgust, and fear. We employed three Emotional Speech Sets (ESS), of which the first two were developed by native Bengali speakers in Bangla and English languages separately. The third was a subset of the Toronto Emotional Speech Set (TESS), which was developed by native English speakers from Canada. We carefully selected language-independent prosodic features, adopted a Support Vector Machine (SVM) model, and conducted three experiments to test our proposition. In the first experiment, we measured the performance of the three speech sets individually. In the second experiment, different ESS pairs were integrated to analyze the impact on SER. Finally, in the third experiment, we measured the recognition rate by training and testing the model with different speech sets. Although this study reveals that SER in Bangla and English languages is mostly language-independent, some disparities were observed while recognizing emotional states like disgust and fear in these two languages. Moreover, our investigations revealed that non-native speakers convey emotions through speech much as they do in their native tongue.

Preprint, 2022, arXiv

VIS-iTrack: Visual Intention through Gaze Tracking using Low-Cost Webcam

Human intention is an internal, mental characterization for acquiring desired information. From interactive interfaces containing either textual or graphical information, intention to perceive desired information is subjective and strongly connected with eye gaze. In this work, we determine such intention by analyzing real-time eye gaze data with a low-cost regular webcam. We extracted unique features (e.g., Fixation Count, Eye Movement Ratio) from the eye gaze data of 31 participants to generate a dataset containing 124 samples of visual intention for perceiving textual or graphical information, labeled as either TEXT or IMAGE, having 48.39% and 51.61% distribution, respectively. Using this dataset, we analyzed 5 classifiers, including Support Vector Machine (SVM) (Accuracy: 92.19%). Using the trained SVM, we investigated the variation of visual intention among 30 participants, distributed in 3 age groups, and found that young users leaned more towards graphical content, whereas older adults were more interested in textual content. This finding suggests that real-time eye gaze data can be a potential source for identifying visual intention, based on which intention-aware interactive interfaces can be designed and developed to facilitate human cognition.

Preprint, 2021, arXiv

ANTASID: A Novel Temporal Adjustment to Shannon's Index of Difficulty for Quantifying the Perceived Difficulty of Uncontrolled Pointing Tasks

Shannon's Index of Difficulty ($ID$), reputable for quantifying the perceived difficulty of pointing tasks as a logarithmic relationship between movement-amplitude ($A$) and target-width ($W$), is used for modelling the corresponding observed movement-times ($MT_O$) in such tasks in controlled experimental setups. However, real-life pointing tasks are both spatially and temporally uncontrolled, being influenced by factors such as human aspects, subjective behavior, the context of interaction, and the inherent speed-accuracy trade-off, where emphasizing accuracy compromises speed of interaction and vice versa. Effective target-width ($W_e$) is considered a spatial adjustment for compensating accuracy. However, no significant adjustment exists in the literature for compensating speed in different contexts of interaction in these tasks. As a result, without any temporal adjustment, the true difficulty of an uncontrolled pointing task may be inaccurately quantified using Shannon's $ID$. To verify this, we propose the ANTASID (A Novel Temporal Adjustment to Shannon's ID) formulation with detailed performance analysis. We hypothesized a temporal adjustment factor ($t$) as a binary logarithm of $MT_O$, compensating for speed due to contextual differences and minimizing the non-linearity between movement-amplitude and target-width. Considering spatial and/or temporal adjustments to $ID$, we conducted regression analysis using our own and benchmark datasets in both controlled and uncontrolled scenarios of pointing tasks with a generic mouse. The ANTASID formulation showed significantly superior fitness values and throughput in all the scenarios while reducing the standard error. Furthermore, the quantification of $ID$ with ANTASID varied significantly compared to the classical formulations of Shannon's $ID$, validating the purpose of this study.
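The quantities named in this abstract can be sketched numerically. The block below computes the classical Shannon formulation $ID = \log_2(A/W + 1)$, the conventional effective width $W_e = 4.133 \times SD$ of selection endpoints, and the temporal factor $t = \log_2(MT_O)$ as the abstract describes it. The abstract does not state how $t$ is combined with $ID$ in the ANTASID formulation, nor the unit of $MT_O$, so the sketch stops short of that step. All function names are hypothetical and the sample values are illustrative, not data from the paper.

```python
import math


def shannon_id(amplitude, width):
    """Shannon's Index of Difficulty in bits: log2(A / W + 1)."""
    if amplitude < 0 or width <= 0:
        raise ValueError("amplitude must be >= 0 and width must be > 0")
    return math.log2(amplitude / width + 1)


def effective_width(endpoint_sd):
    """Conventional effective target width: W_e = 4.133 * SD of endpoints."""
    if endpoint_sd <= 0:
        raise ValueError("endpoint standard deviation must be > 0")
    return 4.133 * endpoint_sd


def temporal_factor(mt_observed):
    """Temporal factor t = log2(MT_O), as described in the abstract.

    Assumption: the unit of MT_O is not given in the abstract; the caller
    must pick one and use it consistently.
    """
    if mt_observed <= 0:
        raise ValueError("observed movement time must be > 0")
    return math.log2(mt_observed)


def throughput(index_of_difficulty, movement_time):
    """Classical throughput: ID divided by movement time."""
    if movement_time <= 0:
        raise ValueError("movement time must be > 0")
    return index_of_difficulty / movement_time


# Illustrative values only: a 700 px movement to a 100 px wide target.
id_bits = shannon_id(700, 100)
print(id_bits)                   # difficulty in bits
print(throughput(id_bits, 1.5))  # bits per second if MT is in seconds
```

To apply the spatial adjustment, pass `effective_width(sd)` in place of the nominal width when calling `shannon_id`.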