Researcher profile

Kaiyuan Chen

Kaiyuan Chen contributes to research discovery and scholarly infrastructure.

Researcher · Affiliation not imported · Open to collaborate

Trust snapshot

Quick read

Trust 13 - Unverified
Verification L1
Unclaimed author
2 works
0 followers
3 topics
4 close collaborators

Published work

2 published items

Preprint · 2026 · arXiv

BabyVision: Visual Reasoning Beyond Language

While humans develop core visual skills long before acquiring language, contemporary Multimodal LLMs (MLLMs) still rely heavily on linguistic priors to compensate for their fragile visual understanding. We uncovered a crucial fact: state-of-the-art MLLMs consistently fail on basic visual tasks that humans, even 3-year-olds, can solve effortlessly. To systematically investigate this gap, we introduce BabyVision, a benchmark designed to assess the core visual abilities of MLLMs independent of linguistic knowledge. BabyVision spans a wide range of tasks, with 388 items divided into 22 subclasses across four key categories. Empirical results and human evaluation reveal that leading MLLMs perform significantly below human baselines. Gemini3-Pro-Preview scores 49.7, lagging behind 6-year-old humans and falling well behind the average adult score of 94.1. These results show that, despite excelling in knowledge-heavy evaluations, current MLLMs still lack fundamental visual primitives. Progress on BabyVision represents a step toward human-level visual perception and reasoning capabilities. We also explore solving visual reasoning with generation models by proposing BabyVision-Gen and an automatic evaluation toolkit. Our code and benchmark data are released at https://github.com/UniPat-AI/BabyVision for reproduction.

Preprint · 2019 · arXiv

Phase evolution and thermal stability of high Curie temperature BiScO$_3$-PbTiO$_3$-Pb(Cd$_{1/3}$Nb$_{2/3}$)O$_3$ ceramics near MPB

Piezoelectric and ferroelectric ceramics with a high Curie temperature (Tc) have attracted growing attention owing to their applications in severe environments. In this work, the phase structure and the dielectric, ferroelectric and piezoelectric properties of (0.975-x)BiScO3-xPbTiO3-0.025Pb(Cd1/3Nb2/3)O3 ceramics (x = 0.58-0.64) were studied. A composition-induced structural transformation occurs from the rhombohedral phase to the tetragonal phase through an intermediate monoclinic phase with increasing PT concentration. The relationship between the structure and the electrical properties of the system was discussed. The BS-xPT-PCN system near the morphotropic phase boundary (MPB, x = 0.62) exhibits excellent piezoelectric and ferroelectric performance, with d33 = 508 pC/N, kp = 56%, and Pr = 40 μC/cm2. The high-temperature piezoelectricity of the MPB sample (x = 0.62) was characterized by in situ XRD. The excellent thermal stability of the crystal structure and the piezoelectric property indicates that the BS-xPT-PCN system is a promising candidate for high-temperature piezoelectric applications.