Researcher profile

Shi Qiu

Shi Qiu contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 19 - UnverifiedVerification L1Unclaimed author
5works
0followers
7topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

5 published item(s)

preprint2026arXiv

AutoResearchClaw: Self-Reinforcing Autonomous Research with Human-AI Collaboration

Automating scientific discovery requires more than generating papers from ideas. Real research is iterative: hypotheses are challenged from multiple perspectives, experiments fail and inform the next attempt, and lessons accumulate across cycles. Existing autonomous research systems often model this process as a linear pipeline: they rely on single-agent reasoning, stop when execution fails, and do not carry experience across runs. We present AutoResearchClaw, a multi-agent autonomous research pipeline built on five mechanisms: structured multi-agent debate for hypothesis generation and result analysis, a self-healing executor with a \textsc{Pivot}/\textsc{Refine} decision loop that transforms failures into information, verifiable result reporting that prevents fabricated numbers and hallucinated citations, human-in-the-loop collaboration with seven intervention modes spanning full autonomy to step-by-step oversight, and cross-run evolution that converts past mistakes into future safeguards. On ARC-Bench, a 25-topic experiment-stage benchmark, AutoResearchClaw outperforms AI Scientist v2 by 54.7%. A human-in-the-loop ablation across seven intervention modes reveals that precise, targeted collaboration at high-leverage decision points consistently outperforms both full autonomy and exhaustive step-by-step oversight. We position AutoResearchClaw as a research amplifier that augments rather than replaces human scientific judgment. Code is available at https://github.com/aiming-lab/AutoResearchClaw.

preprint2025arXiv

How and Why LLMs Generalize: A Fine-Grained Analysis of LLM Reasoning from Cognitive Behaviors to Low-Level Patterns

Large Language Models (LLMs) display strikingly different generalization behaviors: supervised fine-tuning (SFT) often narrows capability, whereas reinforcement-learning (RL) tuning tends to preserve it. The reasons behind this divergence remain unclear, as prior studies have largely relied on coarse accuracy metrics. We address this gap by introducing a novel benchmark that decomposes reasoning into atomic core skills such as calculation, fact retrieval, simulation, enumeration, and diagnostic, providing a concrete framework for addressing the fundamental question of what constitutes reasoning in LLMs. By isolating and measuring these core skills, the benchmark offers a more granular view of how specific cognitive abilities emerge, transfer, and sometimes collapse during post-training. Combined with analyses of low-level statistical patterns such as distributional divergence and parameter statistics, it enables a fine-grained study of how generalization evolves under SFT and RL across mathematical, scientific reasoning, and non-reasoning tasks. Our meta-probing framework tracks model behavior at different training stages and reveals that RL-tuned models maintain more stable behavioral profiles and resist collapse in reasoning skills, whereas SFT models exhibit sharper drift and overfit to surface patterns. This work provides new insights into the nature of reasoning in LLMs and points toward principles for designing training strategies that foster broad, robust generalization.

preprint2020arXiv

Graphene nanopipette enabled liquid delivery at zeptoliter precision

Accurate extraction of liquid is the first step towards low-volume liquid delivery and nanocharacterization, which plays a significant role in biomedical research. In this study, a tip-shaped graphene nanopipette (GNP) is proposed by encapsulating the biomolecule solution on the prefabricated metal tip with graphene. The volume of the encapsulated liquid is highly controllable at zeptoliter precision by tuning the encapsulating speed and the number of graphene encapsulation rounds. Using protein (ferritin) solution as an example, it has been confirmed by finite element analysis and the controlled experiments that the GNP allows the delivery of ferritin solution at the zeptoliter-scale. Furthermore, GNP is demonstrated as a new type of tip-shaped liquid cell, which is suitable for multiple nanocharacterization techniques. In particular, due to the ultra-sharp tip shape, isotope (13C)-labelled glucose solution encapsulated in GNP has been characterized by atom probe tomography (APT) in the laser-pulsed mode. Analysis of the mass spectrum and the reconstructed three-dimensional chemical maps reveals the quantitative distribution and the compositions of individual glucose molecules. The GNP is expected to be introduced to deliver liquid in the range of zeptoliters to attoliters, and brings a new capability for characterization of biological specimens in their near-native state.

preprint2020arXiv

Integer Valued Definable Functions in $\mathbb{R}_{an,\exp}$

We give two variations on a result of Wilkie's on unary functions defianble in $\mathbb{R}_{an,\exp}$ that take integer values at positive integers. Provided that the functions grows slower than the function $2^x$, Wilkie showed that is must be eventually equal to a polynomial. We show the same conclusion under a stronger growth condition but only assuming that the function takes values sufficiently close to a integers at positive integers. In a different variation we show that it suffices to assume that the function takes integer values on a sufficiently dense subset of the positive integers(for instance primes), again under a stronger growth bound than that in Wilkie's result.