Source author record

SunYoung Park

SunYoung Park appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Artificial Intelligence Computer Vision Information Retrieval Multiagent Systems Neurons and Cognition

Catalog footprint

What is connected

2works

5topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

V-Agent: An Interactive Video Search System Using Vision-Language Models

We introduce V-Agent, a novel multi-agent platform designed for advanced video search and interactive user-system conversations. By fine-tuning a vision-language model (VLM) with a small video preference dataset and enhancing it with a retrieval vector from an image-text retrieval model, we overcome the limitations of traditional text-based retrieval systems in multimodal scenarios. The VLM-based retrieval model independently embeds video frames and audio transcriptions from an automatic speech recognition (ASR) module into a shared multimodal representation space, enabling V-Agent to interpret both visual and spoken content for context-aware video search. This system consists of three agents-a routing agent, a search agent, and a chat agent-that work collaboratively to address user intents by refining search outputs and communicating with users. The search agent utilizes the VLM-based retrieval model together with an additional re-ranking module to further enhance video retrieval quality. Our proposed framework demonstrates state-of-the-art zero-shot performance on the MultiVENT 2.0 benchmark, highlighting its potential for both academic research and real-world applications. The retrieval model and demo videos are available at https://huggingface.co/NCSOFT/multimodal-embedding.

preprint2014arXiv

Unresolvable human mental states based on a parallel universe theory

We show that human mental states are unresolvable by suggesting a mathematical function that describes human mental states in relation to parallel universe theory. The function is a solution to a multi-dimensional advection equation; representing a situation a person is faced with, and its time-derivative showing the mental state in that situation. This function has interesting characteristics that explain why each person has different thoughts in a particular situation. Because the multi-dimensional advection equation has an infinite number of solutions, we can use them to represent an infinite number of mental states. We focus on the basic concepts of the model and explain the function using extremely simple cases. We also use the functions to explain remembering and forgetting.