Multimodal Benchmark Curation for Biology LLMs
We curate paper, code and dataset benchmarks for biology-focused language models under provenance and reproducibility constraints.
Researcher profile
Works on review quality, moderation design and institutional trust systems for online science.
Trust snapshot
Actions
Identity and collaboration
Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.
Log in to claimDirect collaboration
Claim this author entity first to unlock direct invitations.
Research graph
Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.
Senior Lecturer
Published work
We curate paper, code and dataset benchmarks for biology-focused language models under provenance and reproducibility constraints.
This paper combines institution verification, topical expertise and trust snapshots to route methodological questions to the right specialists.
We propose ledger-based moderation records that improve accountability, appeals and policy learning in research products.
We evaluate rubric design for structured reviews, moderation queues and reviewer calibration in technical communities.