Paper detail

State-of-the-art Small Language Coder Model: Mify-Coder

We present Mify-Coder, a 2.5B-parameter code model trained on 4.2T tokens using a compute-optimal strategy built on the Mify-2.5B foundation model. Mify-Coder achieves comparable accuracy and safety while significantly outperforming much larger baseline models on standard coding and function-calling benchmarks, demonstrating that compact models can match frontier-grade models in code generation and agent-driven workflows. Our training pipeline combines high-quality curated sources with synthetic data generated through agentically designed prompts, refined iteratively using enterprise-grade evaluation datasets. LLM-based quality filtering further enhances data density, enabling frugal yet effective training. Through disciplined exploration of CPT-SFT objectives, data mixtures, and sampling dynamics, we deliver frontier-grade code intelligence within a single continuous training trajectory. Empirical evidence shows that principled data and compute discipline allow smaller models to achieve competitive accuracy, efficiency, and safety compliance. Quantized variants of Mify-Coder enable deployment on standard desktop environments without requiring specialized hardware.

preprint2025arXivOpen access
Abhinav ParmarAbhisek PanigrahiAbhishek Kumar DwivediAbhishek BhattacharyaAdarsh RamachandraAditya ChoudharyAditya GargAditya RajAlankrit BhattAlpesh YadavAnant VishnuAnanthu PillaiAnkush KumarAryan PatnaikAswatha Narayanan SAvanish Raj SinghBhavya Shree GaddaBrijesh Pankajbhai KachhadiyaBuggala JahnaviChidurala Nithin KrishnaChintan ShahChunduru AkshayaDebarshi BanerjeeDebrup DeyDeepa R.Deepika B GFaiz ur RahmanGagan GayariGudhi Jagadeesh Kumar NaiduGursimar SinghHarshal TyagiHarshini KJames Mani VathalloorJayarama NettarJayashree GajjamJoe Walter Sugil GeorgeKamalakara Sri Krishna TadepalliKamalkumar RathinasamyKaran ChaurasiaKarthikeyan SKashish AroraKaushal DesaiKhushboo BuwadeKiran ManjrekarMalikireddy Venkata Sai LikhithaManjunath AMitali Mahavir BedmuthaMohammed Rafee TarafdarNikhil TiwariNikitha K GigiPavan RavikumarPendyala SwarnanjaliPiyush AnandPrakash ChandrasekarPrasanna Bhalchandra GawadePrasanth SivanPreeti KhuranaPriyanshi BabbarRajab Ali MondalRajesh Kumar VissapragadaRajeshwari GanesanRajeswari KoppisettiRamjee R.Ramkumar ThiruppathisamyRani G. S.S RekaSamarth GuptaSandeep Reddy KothakotaSarathy KSathyanarayana Sampath KumarSaurabh KumarShashank KhasareShenbaga Devi Venkatesh KumarShiva Rama Krishna ParvathamShoeb ShaikhShrishanmathi AShubham PathakSree Samhita KoppakaSreenivasa Raghavan K SSreeram VenkatasubramanianSuprabha Desai BojjaSwetha RSyed AhmedChinmai Harshitha ThotaTushar YadavVeeravelly KusumithaV V S S Prasanth PatnaikVidya Sri SesettiVijayakeerthi KVikram Raj BakshiVinay K KVinoth Kumar LoganathanVipin TiwariVivek Kumar ShrivastavV Venkata Sri Datta CharanWasim Akhtar Khan
0citations
0reviews
0saves
Nocode
Nodataset
0institutions

Next steps

Decide what to do with this paper

Use like or dislike for the fast social read. The more specific scholarly feedback stays available below when needed.

Log in to curate

Reading frame

Keep the important context close to the paper

Keep the important signals around this paper in one place: votes, save state, collection context, reviews and the metadata you need before deciding what to do next.

Authors

Abhinav ParmarAbhisek PanigrahiAbhishek Kumar DwivediAbhishek BhattacharyaAdarsh RamachandraAditya ChoudharyAditya GargAditya RajAlankrit BhattAlpesh YadavAnant VishnuAnanthu PillaiAnkush KumarAryan PatnaikAswatha Narayanan SAvanish Raj SinghBhavya Shree GaddaBrijesh Pankajbhai KachhadiyaBuggala JahnaviChidurala Nithin KrishnaChintan ShahChunduru AkshayaDebarshi BanerjeeDebrup DeyDeepa R.Deepika B GFaiz ur RahmanGagan GayariGudhi Jagadeesh Kumar NaiduGursimar SinghHarshal TyagiHarshini KJames Mani VathalloorJayarama NettarJayashree GajjamJoe Walter Sugil GeorgeKamalakara Sri Krishna TadepalliKamalkumar RathinasamyKaran ChaurasiaKarthikeyan SKashish AroraKaushal DesaiKhushboo BuwadeKiran ManjrekarMalikireddy Venkata Sai LikhithaManjunath AMitali Mahavir BedmuthaMohammed Rafee TarafdarNikhil TiwariNikitha K GigiPavan RavikumarPendyala SwarnanjaliPiyush AnandPrakash ChandrasekarPrasanna Bhalchandra GawadePrasanth SivanPreeti KhuranaPriyanshi BabbarRajab Ali MondalRajesh Kumar VissapragadaRajeshwari GanesanRajeswari KoppisettiRamjee R.Ramkumar ThiruppathisamyRani G. S.S RekaSamarth GuptaSandeep Reddy KothakotaSarathy KSathyanarayana Sampath KumarSaurabh KumarShashank KhasareShenbaga Devi Venkatesh KumarShiva Rama Krishna ParvathamShoeb ShaikhShrishanmathi AShubham PathakSree Samhita KoppakaSreenivasa Raghavan K SSreeram VenkatasubramanianSuprabha Desai BojjaSwetha RSyed AhmedChinmai Harshitha ThotaTushar YadavVeeravelly KusumithaV V S S Prasanth PatnaikVidya Sri SesettiVijayakeerthi KVikram Raj BakshiVinay K KVinoth Kumar LoganathanVipin TiwariVivek Kumar ShrivastavV Venkata Sri Datta CharanWasim Akhtar Khan

Institutions

Add specific reaction

Move through the context

Research map

Open full explorer

Move through nearby people, institutions, topics and adjacent work without leaving the paper page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Structured reviews

0 review(s)

ContributeLeave structured feedbackUse the review template when you have a concrete strength, concern or method question.Open review form

No structured reviews yet. High-signal critique starts here.

Work discussion

0 comment(s)

DiscussAdd a high-signal commentKeep quick notes, caveats and replication pointers separate from formal reviews.Open comment form

No discussion yet. The first strong comment sets the tone.