Researcher profile

Yi-Ling Chen

Yi-Ling Chen contributes to research discovery and scholarly infrastructure.

Researcher | Affiliation not imported | Open to collaborate

Trust snapshot

Quick read

Trust 13 - Unverified | Verification L1 | Unclaimed author
2 works
0 followers
6 topics
4 close collaborators

Published work

2 published items

Preprint | 2022 | arXiv

i-Code: An Integrative and Composable Multimodal Learning Framework

Human intelligence is multimodal; we integrate visual, linguistic, and acoustic signals to maintain a holistic worldview. Most current pretraining methods, however, are limited to one or two modalities. We present i-Code, a self-supervised pretraining framework where users may flexibly combine the modalities of vision, speech, and language into unified and general-purpose vector representations. In this framework, data from each modality are first given to pretrained single-modality encoders. The encoder outputs are then integrated with a multimodal fusion network, which uses novel attention mechanisms and other architectural innovations to effectively combine information from the different modalities. The entire system is pretrained end-to-end with new objectives including masked modality unit modeling and cross-modality contrastive learning. Unlike previous research using only video for pretraining, the i-Code framework can dynamically process single, dual, and triple-modality data during training and inference, flexibly projecting different combinations of modalities into a single representation space. Experimental results demonstrate how i-Code can outperform state-of-the-art techniques on five video understanding tasks and the GLUE NLP benchmark, improving by as much as 11% and demonstrating the power of integrative multimodal pretraining.

Preprint | 2021 | arXiv

Ubiquitous proximity to a critical state for collective neural activity in the CA1 region of freely moving mice

Using miniscope recordings of calcium fluorescence signals in the CA1 region of the hippocampus of mice, we monitor the neural activity of hippocampal regions while the animals are freely moving in an open chamber. Using a data-driven statistical modeling approach, the statistical properties of the recorded data are mapped to spin-glass models with pairwise interactions. Considering the parameter space of the model, the observed system is generally near a critical state between two distinct phases. The close proximity to criticality is found to be robust against different ways of sampling and segmenting the measured data. By independently altering the coupling distribution and the network structure of the statistical model, the network structures are found to be vital to maintaining the proximity to the critical state. We further find that the observed assignment of the coupling strengths makes the net coupling at each site more balanced with slight variation, which likely helps maintain the critical state. Network analysis of the connectivity obtained by thresholding the coupling strengths finds the connectivity of the networks to be well described by a random network model. These results are consistent across different experiments, sampling and segmentation choices in our analysis.