Source author record

Mohammad Khalil

Mohammad Khalil appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

cs.CY Computational Engineering, Finance, and Science cond-mat.mtrl-sci Cryptography and Security Machine Learning physics.app-ph

Catalog footprint

What is connected

8works

6topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

Brief but Impactful: How Human Tutoring Interactions Shape Engagement in Online Learning

Learning analytics can guide human tutors to efficiently address motivational barriers to learning that AI systems struggle to support. Students become more engaged when they receive human attention. However, what occurs during short interventions, and when are they most effective? We align student-tutor dialogue transcripts with MATHia tutoring system log data to study brief human-tutor interactions on Zoom drawn from 2,075 hours of 191 middle school students' classroom math practice. Mixed-effect models reveal that engagement, measured as successful solution steps per minute, is higher during a human-tutor visit and remains elevated afterward. Visit length exhibits diminishing returns: engagement rises during and shortly after visits, irrespective of visit length. Timing also matters: later visits yield larger immediate lifts than earlier ones, though an early visit remains important to counteract engagement decline. We create analytics that identify which tutor-student dialogues raise engagement the most. Qualitative analysis reveals that interactions with concrete, stepwise scaffolding with explicit work organization elevate engagement most strongly. We discuss implications for resource-constrained tutoring, prioritizing several brief, well-timed check-ins by a human tutor while ensuring at least one early contact. Our analytics can guide the prioritization of students for support and surface effective tutor moves in real-time.

preprint2026arXiv

Measuring the Impact of Student Gaming Behaviors on Learner Modeling

The expansion of large-scale online education platforms has made vast amounts of student interaction data available for knowledge tracing (KT). KT models estimate students' concept mastery from interaction data, but their performance is sensitive to input data quality. Gaming behaviors, such as excessive hint use, may misrepresent students' knowledge and undermine model reliability. However, systematic investigations of how different types of gaming behaviors affect KT remain scarce, and existing studies rely on costly manual analysis that does not capture behavioral diversity. In this study, we conceptualize gaming behaviors as a form of data poisoning, defined as the deliberate submission of incorrect or misleading interaction data to corrupt a model's learning process. We design Data Poisoning Attacks (DPAs) to simulate diverse gaming patterns and systematically evaluate their impact on KT model performance. Moreover, drawing on advances in DPA detection, we explore unsupervised approaches to enhance the generalizability of gaming behavior detection. We find that KT models' performance tends to decrease especially in response to random guess behaviors. Our findings provide insights into the vulnerabilities of KT models and highlight the potential of adversarial methods for improving the robustness of learning analytics systems.

preprint2026arXiv

Quality Degradation Attack in Synthetic Data

Synthetic Data Generation (SDG) can be used to facilitate privacy-preserving data sharing. However, most existing research focuses on privacy attacks where the adversary is the recipient of the released synthetic data and attempts to infer sensitive information from it. This study investigates quality degradation attacks initiated by adversaries who possess access to the real dataset or control over the generation process, such as the data owner, the synthetic data provider, or potential intruders. We formalize a corresponding threat model and empirically evaluate the effectiveness of targeted manipulations of real data (e.g., label flipping and feature-importance-based interventions) on the quality of generated synthetic data. The results show that even small perturbations can substantially reduce downstream predictive performance and increase statistical divergence, exposing vulnerabilities within SDG pipelines. This study highlights the need to integrate integrity verification and robustness mechanisms, alongside privacy protection, to ensure the reliability and trustworthiness of synthetic data sharing frameworks.

preprint2022arXiv

A heteroencoder architecture for prediction of failure locations in porous metals using variational inference

In this work we employ an encoder-decoder convolutional neural network to predict the failure locations of porous metal tension specimens based only on their initial porosities. The process we model is complex, with a progression from initial void nucleation, to saturation, and ultimately failure. The objective of predicting failure locations presents an extreme case of class imbalance since most of the material in the specimens do not fail. In response to this challenge, we develop and demonstrate the effectiveness of data- and loss-based regularization methods. Since there is considerable sensitivity of the failure location to the particular configuration of voids, we also use variational inference to provide uncertainties for the neural network predictions. We connect the deterministic and Bayesian convolutional neural networks at a theoretical level to explain how variational inference regularizes the training and predictions. We demonstrate that the resulting predicted variances are effective in ranking the locations that are most likely to fail in any given specimen.

preprint2022arXiv

Domain Decomposition of Stochastic PDEs: Development of Probabilistic Wirebasket-based Two-level Preconditioners

Realistic physical phenomena exhibit random fluctuations across many scales in the input and output processes. Models of these phenomena require stochastic PDEs. For three-dimensional coupled (vector-valued) stochastic PDEs (SPDEs), for instance, arising in linear elasticity, the existing two-level domain decomposition solvers with the vertex-based coarse grid show poor numerical and parallel scalabilities. Therefore, new algorithms with a better resolved coarse grid are needed. The probabilistic wirebasket-based coarse grid for a two-level solver is devised in three dimensions. This enriched coarse grid provides an efficient mechanism for global error propagation and thus improves the convergence. This development enhances the scalability of the two-level solver in handling stochastic PDEs in three dimensions. Numerical and parallel scalabilities of this algorithm are studied using MPI and PETSc libraries on high-performance computing (HPC) systems. Implementational challenges of the intrusive spectral stochastic finite element methods (SSFEM) are addressed by coupling domain decomposition solvers with FEniCS general purpose finite element package. This work generalizes the applications of intrusive SSFEM to tackle a variety of stochastic PDEs and emphasize the usefulness of the domain decomposition-based solvers and HPC for uncertainty quantification.

preprint2016arXiv

Potential of EPUB3 for Digital Textbooks in Higher Education

The e-book market is currently in a strong upswing. This research study deals with the question which practical uses the e-book format EPUB3 offers for (higher) education. By means of a didactic content analysis, a range of interactive exercise types were developed as a result of conversations with teachers. For this purpose, a didactic and technical concept has been developed. Different kinds of exercises were prototypically implemented in an e-book. Finally, a brief overview reflects the present state of the current e-book readers. A subsequent discussion illuminates the strengths and weaknesses of the format. In summary, it can be remarked that EPUB3 is suitable for a variety of different exercises and that it is able to serve as a basic format for forthcoming digital textbooks. Furthermore the openness of EPUB3 will assist Open Learning and Teaching in a meaningful way.

preprint2016arXiv

What is Learning Analytics about? A Survey of Different Methods Used in 2013-2015

The area of Learning Analytics has developed enormously since the first International Conference on Learning Analytics and Knowledge (LAK) in 2011. It is a field that combines different disciplines such as computer science, statistics, psychology and pedagogy to achieve its intended objectives. The main goals illustrate in creating convenient interventions on learning as well as its environment and the final optimization about learning domain stakeholders. Because the field matures and is now adapted in diverse educational settings, we believe there is a pressing need to list its own research methods and specify its objectives and dilemmas. This paper surveys publications from Learning Analytics and Knowledge conference from 2013 to 2015 and lists the significant research areas in this sphere. We consider the method profile and classify them into seven different categories with a brief description on each. Furthermore, we show the most cited method categories using Google scholar. Finally, the authors raise the challenges and constraints that affect its ethical approach through the meta-analysis study. It is believed that this paper will help researchers to identify the common methods used in Learning Analytics, and it will assist by establishing a future forecast towards new research work taking into account the privacy and ethical issues of this strongly emerged field.

preprint2016arXiv

What Massive Open Online Course (MOOC) Stakeholders Can Learn From Learning Analytics?

Massive Open Online Courses (MOOCs) are the road that led to a revolution and a new era of learning environments. Educational institutions have come under pressure to adopt new models that assure openness in their education distribution. Nonetheless, there is still altercation about the pedagogical approach and the absolute information delivery to the students. On the other side with the use of Learning Analytics, powerful tools become available which mainly aim to enhance learning and improve learners performance. In this chapter, the development phases of a Learning Analytics prototype and the experiment of integrating it into a MOOC platform, called iMooX will be presented. This chapter explores how MOOC Stakeholders may benefit from Learning Analytics as well as it reports an exploratory analysis of some of the offered courses and demonstrate use cases as a typical evaluation of this prototype in order to discover hidden patterns, overture future proper decisions and to optimize learning with applicable and convenient interventions.

Mohammad Khalil

What is connected

Connect this record

See the researcher in context

Building this map preview

8 published item(s)

Brief but Impactful: How Human Tutoring Interactions Shape Engagement in Online Learning

Measuring the Impact of Student Gaming Behaviors on Learner Modeling

Quality Degradation Attack in Synthetic Data

A heteroencoder architecture for prediction of failure locations in porous metals using variational inference

Domain Decomposition of Stochastic PDEs: Development of Probabilistic Wirebasket-based Two-level Preconditioners

Potential of EPUB3 for Digital Textbooks in Higher Education

What is Learning Analytics about? A Survey of Different Methods Used in 2013-2015

What Massive Open Online Course (MOOC) Stakeholders Can Learn From Learning Analytics?