Source author record

Kexin Yi

Kexin Yi appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher (unclaimed source record)

Machine Learning, Artificial Intelligence, Computation and Language, Computer Vision, cond-mat.mes-hall, cond-mat.quant-gas

Catalog footprint

What is connected

4 works

6 topics

4 close collaborators


Preprint, 2022, arXiv

Immersive Text Game and Personality Classification

We designed and built a game called Immersive Text Game, which lets the player choose a story and a character and interact with the other characters in the story through immersive dialogue. The game is built on several recent models, including a text generation language model, an information extraction model, a commonsense reasoning model, and a psychology evaluation model. Earlier text games of this kind usually let players choose from a limited set of actions rather than answer in their own words, and what the characters said was not always determined by the player. Through the combination of these models with elaborate game mechanics and modes, the player encounters novel experiences while being driven through the storyline.

Preprint, 2020, arXiv

CLEVRER: CoLlision Events for Video REpresentation and Reasoning

The ability to reason about temporal and causal events from videos lies at the core of human intelligence. Most video reasoning benchmarks, however, focus on pattern recognition from complex visual and language input, instead of on causal structure. We study the complementary problem, exploring the temporal and causal structures behind videos of objects with simple visual appearance. To this end, we introduce the CoLlision Events for Video REpresentation and Reasoning (CLEVRER), a diagnostic video dataset for systematic evaluation of computational models on a wide range of reasoning tasks. Motivated by the theory of human causal judgment, CLEVRER includes four types of questions: descriptive (e.g., "what color"), explanatory ("what is responsible for"), predictive ("what will happen next"), and counterfactual ("what if"). We evaluate various state-of-the-art models for visual reasoning on our benchmark. While these models thrive on the perception-based task (descriptive), they perform poorly on the causal tasks (explanatory, predictive and counterfactual), suggesting that a principled approach for causal reasoning should incorporate the capability of both perceiving complex visual and language inputs, and understanding the underlying dynamics and causal relations. We also study an oracle model that explicitly combines these components via symbolic representations.

Preprint, 2020, arXiv

Visual Grounding of Learned Physical Models

Humans intuitively recognize objects' physical properties and predict their motion, even when the objects are engaged in complicated interactions. The abilities to perform physical reasoning and to adapt to new environments, while intrinsic to humans, remain challenging to state-of-the-art computational models. In this work, we present a neural model that simultaneously reasons about physics and makes future predictions based on visual and dynamics priors. The visual prior predicts a particle-based representation of the system from visual observations. An inference module operates on those particles, predicting and refining estimates of particle locations, object states, and physical parameters, subject to the constraints imposed by the dynamics prior, which we refer to as visual grounding. We demonstrate the effectiveness of our method in environments involving rigid objects, deformable materials, and fluids. Experiments show that our model can infer the physical properties within a few observations, which allows the model to quickly adapt to unseen scenarios and make accurate predictions into the future.

Preprint, 2016, arXiv

Topological polaritons from photonic Dirac cones coupled to excitons in a magnetic field

We introduce an alternative scheme for creating topological polaritons (topolaritons) by exploiting the presence of photonic Dirac cones in photonic crystals with triangular lattice symmetry. As recently proposed, topolariton states can emerge from a coupling between photons and excitons combined with a periodic exciton potential and a magnetic field to open up a topological gap. We show that in photonic crystals the opening of the gap can be substantially simplified close to photonic Dirac points. Coupling to Zeeman-split excitons breaks time-reversal symmetry and allows the Dirac cones to be gapped out in a nontrivial way, leading to a topological gap comparable to the strength of the periodic exciton potential. Compared to the original topolariton proposal [Karzig et al., PRX 5, 031001 (2015)], this scheme significantly increases the size of the topological gap over a wide range of parameters. Moreover, the gap opening mechanism highlights an interesting connection between topolaritons and the Haldane and Raghu scheme [Haldane and Raghu, PRL 100, 013904 (2008)] to create topological photons in magneto-optically active materials.