Source author record

Khashayar Filom

Khashayar Filom appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher, unclaimed source record

math.AG, math.DS, Artificial Intelligence, Computation and Language, Machine Learning, math.AT

Catalog footprint

What is connected

5 works

6 topics

4 close collaborators

Preprint, 2026, arXiv

Rate or Fate? RLV$^\varepsilon$R: Reinforcement Learning with Verifiable Noisy Rewards

Reinforcement learning with verifiable rewards (RLVR) is a simple but powerful paradigm for training LLMs: sample a completion, verify it, and update. In practice, however, the verifier is almost never clean: unit tests probe only limited corner cases, human and synthetic labels are imperfect, and LLM judges (e.g., RLAIF) are noisy and can be exploited. The problem worsens on harder domains (especially coding), where tests are sparse and increasingly model-generated. We ask a pragmatic question: does verification noise merely slow down learning (rate), or can it flip the outcome (fate)? To address this, we develop an analytically tractable multi-armed bandit view of RLVR dynamics, instantiated with GRPO and validated in controlled experiments. Modeling false positives and false negatives and grouping completions into recurring reasoning modes yields a replicator-style (natural-selection) flow on the probability simplex. The dynamics decouple into within-correct-mode competition and a one-dimensional evolution for the mass on incorrect modes, whose drift is determined solely by Youden's index $J = \mathrm{TPR} - \mathrm{FPR}$ (true-positive rate minus false-positive rate). This yields a sharp phase transition: when $J>0$, the incorrect mass is driven toward extinction (learning); when $J=0$, the process is neutral; and when $J<0$, incorrect modes amplify until they dominate (anti-learning and collapse). In the learning regime $J>0$, noise primarily rescales convergence time ("rate, not fate"). Experiments on verifiable programming tasks under synthetic noise reproduce the predicted $J=0$ boundary. Beyond noise, the framework offers a general lens for analyzing RLVR stability, convergence, and algorithmic interventions.
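
The phase transition described in this abstract can be illustrated with a toy model. The sketch below is not the paper's derivation: it assumes a plain two-type replicator equation in which a correct completion is rewarded with probability TPR and an incorrect one with probability FPR, so the incorrect mass p follows dp/dt = -J p (1 - p) with J = TPR - FPR. GRPO's group normalization is deliberately left out, and the function names `incorrect_mass_trajectory` and `time_to_threshold` are hypothetical.

```python
import math


def incorrect_mass_trajectory(p0, tpr, fpr, steps=2000, dt=0.01):
    """Euler-integrate dp/dt = -J * p * (1 - p) for the incorrect-mode mass p.

    Assumed toy replicator model, not the paper's exact GRPO dynamics.
    """
    j = tpr - fpr  # Youden's index J = TPR - FPR
    p = p0
    trajectory = [p]
    for _ in range(steps):
        p += dt * (-j * p * (1.0 - p))
        trajectory.append(p)
    return trajectory


def time_to_threshold(p0, tpr, fpr, eps=0.01):
    """Closed-form time for p to fall from p0 to eps under the toy model.

    Uses logit(p(t)) = logit(p0) - J * t. Returns None when J <= 0,
    because the incorrect mass never shrinks in that regime.
    """
    j = tpr - fpr
    if j <= 0 or not (0 < eps < p0 < 1):
        return None
    return math.log(p0 * (1 - eps) / (eps * (1 - p0))) / j


if __name__ == "__main__":
    for tpr, fpr in [(0.9, 0.2), (0.5, 0.5), (0.3, 0.6)]:
        final = incorrect_mass_trajectory(0.5, tpr, fpr)[-1]
        print(f"J={tpr - fpr:+.2f}  final incorrect mass={final:.4f}")
```

In this toy model the sign of J alone decides whether the incorrect mass shrinks, stays put, or grows, while its magnitude only changes how long that takes: halving J doubles the value returned by `time_to_threshold`, which is one way to read "rate, not fate".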

Preprint, 2021, arXiv

Topological aspects of the dynamical moduli space of rational maps

We investigate the topology of the space of Möbius conjugacy classes of degree $d$ rational maps on the Riemann sphere. We show that it is rationally acyclic and we compute its fundamental group. As a byproduct, we also obtain the ranks of some higher homotopy groups of the parameter space of degree $d$ rational maps allowing us to extend the previously known range. Moreover, we show that this parameter space is not nilpotent.

Preprint, 2020, arXiv

On the non-monotonicity of entropy for a class of real quadratic rational maps

We prove that the entropy function on the moduli space of real quadratic rational maps is not monotonic by exhibiting a continuum of disconnected level sets. This entropy behavior is in stark contrast with the case of polynomial maps, and establishes a conjecture on the failure of monotonicity for bimodal real quadratic rational maps of shape $(+-+)$ which was posed in arXiv:1901.03458 based on experimental evidence.

Preprint, 2016, arXiv

Dessins on Modular Curves

Given a finite index subgroup $\Gamma$ of ${\rm{PSL}}_2(\Bbb{Z})$, we investigate Belyi functions on the corresponding modular curve $X(\Gamma)$ by introducing two methods for constructing such functions. Numerous examples have been worked out completely and, as an application, we have derived modular equations for $\Gamma_0(2)$, $\Gamma_0(3)$ and several special values of the $j$-function by a new method based on the theory of Belyi functions and dessins d'enfants.

Preprint, 2015, arXiv

On the $j$-invariant and the Legendre Representation of Elliptic Curves with Complex Multiplication

By introducing a class of meromorphic functions with certain ramification structures on $\Bbb{CP}^1$, a new method for the determination of the Legendre representation of elliptic curves with complex multiplication is introduced. These functions reduce the desired representation to the solution of an explicitly given system of polynomial equations and make no use of knowledge of the Hilbert class polynomial, on which standard computations depend. As a byproduct, an algorithm for computing $j(k\tau)$ in terms of $j(\tau)$ is obtained and implemented for $k=2,3$. Solving the system of equations provides a method for computing certain special values of the modular function $j:\Bbb{H}\rightarrow\Bbb{C}$.
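
For orientation, the standard relation between the Legendre parameter and the $j$-invariant is recalled here. It is a textbook identity, not the paper's new method. For the Legendre curve $E_\lambda : y^2 = x(x-1)(x-\lambda)$ with $\lambda \neq 0, 1$,

$$j(E_\lambda) = 256\,\frac{(\lambda^2 - \lambda + 1)^3}{\lambda^2(\lambda - 1)^2}.$$

As a worked instance, $\lambda = -1$ gives the curve $y^2 = x^3 - x$, which has complex multiplication by $\Bbb{Z}[i]$, and

$$j(E_{-1}) = 256\,\frac{(1 + 1 + 1)^3}{(-1)^2(-2)^2} = 256 \cdot \frac{27}{4} = 1728.$$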