Researcher profile

Kai Nakamura

Kai Nakamura contributes to research discovery and scholarly infrastructure.

Researcher | Affiliation not imported | Open to collaborate

Trust snapshot

Quick read

Trust 15 - Unverified | Verification L1 | Unclaimed author
3 works
0 followers
4 topics
4 close collaborators

Published work

3 published items

Preprint, 2022, arXiv

HybriDialogue: An Information-Seeking Dialogue Dataset Grounded on Tabular and Textual Data

A pressing challenge in current dialogue systems is to successfully converse with users on topics with information distributed across different modalities. Previous work in multiturn dialogue systems has primarily focused on either text or table information. In more realistic scenarios, having a joint understanding of both is critical as knowledge is typically distributed over both unstructured and structured forms. We present a new dialogue dataset, HybriDialogue, which consists of crowdsourced natural conversations grounded on both Wikipedia text and tables. The conversations are created through the decomposition of complex multihop questions into simple, realistic multiturn dialogue interactions. We propose retrieval, system state tracking, and dialogue response generation tasks for our dataset and conduct baseline experiments for each. Our results show that there is still ample opportunity for improvement, demonstrating the importance of building stronger dialogue systems that can reason over the complex setting of information-seeking dialogue grounded on tables and text.

Preprint, 2022, arXiv

Trace Embeddings from Zero Surgery Homeomorphisms

Manolescu and Piccirillo recently initiated a program to construct an exotic $S^4$ or $\# n \mathbb{CP}^2$ by using zero surgery homeomorphisms and Rasmussen's $s$-invariant. They find five knots that if any were slice, one could construct an exotic $S^4$ and disprove the Smooth $4$-dimensional Poincaré conjecture. We rule out this exciting possibility and show that these knots are not slice. To do this, we use a zero surgery homeomorphism to relate slice properties of two knots \textit{stably} after a connected sum with some $4$-manifold. Furthermore, we show that our techniques will extend to the entire infinite family of zero surgery homeomorphisms constructed by Manolescu and Piccirillo. However, our methods do not completely rule out the possibility of constructing an exotic $S^4$ or $\# n \mathbb{CP}^2$ as Manolescu and Piccirillo proposed. We explain the limits of these methods hoping this will inform and invite new attempts to construct an exotic $S^4$ or $\# n \mathbb{CP}^2$. We also show a family of homotopy spheres constructed by Manolescu and Piccirillo using annulus twists of a ribbon knot are all standard.

Preprint, 2020, arXiv

r/Fakeddit: A New Multimodal Benchmark Dataset for Fine-grained Fake News Detection

Fake news has altered society in negative ways in politics and culture. It has adversely affected both online social network systems as well as offline communities and conversations. Using automatic machine learning classification models is an efficient way to combat the widespread dissemination of fake news. However, a lack of effective, comprehensive datasets has been a problem for fake news research and detection model development. Prior fake news datasets do not provide multimodal text and image data, metadata, comment data, and fine-grained fake news categorization at the scale and breadth of our dataset. We present Fakeddit, a novel multimodal dataset consisting of over 1 million samples from multiple categories of fake news. After being processed through several stages of review, the samples are labeled according to 2-way, 3-way, and 6-way classification categories through distant supervision. We construct hybrid text+image models and perform extensive experiments for multiple variations of classification, demonstrating the importance of the novel aspect of multimodality and fine-grained classification unique to Fakeddit.