Source author record

Kai Shen

Kai Shen appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Artificial Intelligence Computation and Language cs.CY eess.AS math.OC Multiagent Systems nlin.AO Populations and Evolution Social and Information Networks Sound

Catalog footprint

What is connected

4works

10topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

FlashLabs Chroma 1.0: A Real-Time End-to-End Spoken Dialogue Model with Personalized Voice Cloning

Recent end-to-end spoken dialogue systems leverage speech tokenizers and neural audio codecs to enable LLMs to operate directly on discrete speech representations. However, these models often exhibit limited speaker identity preservation, hindering personalized voice interaction. In this work, we present Chroma 1.0, the first open-source, real-time, end-to-end spoken dialogue model that achieves both low-latency interaction and high-fidelity personalized voice cloning. Chroma achieves sub-second end-to-end latency through an interleaved text-audio token schedule (1:2) that supports streaming generation, while maintaining high-quality personalized voice synthesis across multi-turn conversations. Our experimental results demonstrate that Chroma achieves a 10.96% relative improvement in speaker similarity over the human baseline, with a Real-Time Factor (RTF) of 0.43, while maintaining strong reasoning and dialogue capabilities. Our code and models are publicly available at https://github.com/FlashLabs-AI-Corp/FlashLabs-Chroma and https://huggingface.co/FlashLabs/Chroma-4B .

preprint2022arXiv

Scenario-based Multi-product Advertising Copywriting Generation for E-Commerce

In this paper, we proposed an automatic Scenario-based Multi-product Advertising Copywriting Generation system (SMPACG) for E-Commerce, which has been deployed on a leading Chinese e-commerce platform. The proposed SMPACG consists of two main components: 1) an automatic multi-product combination selection module, which itself is consisted of a topic prediction model, a pattern and attribute-based selection model and an arbitrator model; and 2) an automatic multi-product advertising copywriting generation module, which combines our proposed domain-specific pretrained language model and knowledge-based data enhancement model. The SMPACG is the first system that realizes automatic scenario-based multi-product advertising contents generation, which achieves significant improvements over other state-of-the-art methods. The SMPACG has been not only developed for directly serving for our e-commerce recommendation system, but also used as a real-time writing assistant tool for merchants.

preprint2020arXiv

CovidNet: To Bring Data Transparency in the Era of COVID-19

Timely, creditable, and fine-granular case information is vital for local communities and individual citizens to make rational and data-driven responses to the COVID-19 pandemic. This paper presents CovidNet, a COVID-19 tracking project associated with a large scale epidemic dataset, which was initiated by 1Point3Acres. To the best of our knowledge, the project is the only platform providing real-time global case information of more than 4,124 sub-divisions from over 27 countries worldwide with multi-language supports. The platform also offers interactive visualization tools to analyze the full historical case curves in each region. Initially launched as a voluntary project to bridge the data transparency gap in North America in January 2020, this project by far has become one of the major independent sources worldwide and has been consumed by many other tracking platforms. The accuracy and freshness of the dataset is a result of the painstaking efforts from our voluntary teamwork, crowd-sourcing channels, and automated data pipelines. As of May 18, 2020, the project website has been visited more than 200 million times and the CovidNet dataset has empowered over 522 institutions and organizations worldwide in policy-making and academic researches. All datasets are openly accessible for non-commercial purposes at https://coronavirus.1point3acres.com via a formal request through our APIs.

preprint2020arXiv

Quasi-synchronization of bounded confidence opinion dynamics with stochastic asynchronous rule

Recently the theory of noise-induced synchronization of Hegselmann-Krause (HK) dynamics has been well developed. As a typical opinion dynamics of bounded confidence, the HK model obeys a synchronous updating rule, i.e., \emph{all} agents check and update their opinions at each time point. However, whether asynchronous bounded confidence models, including the famous Deffuant-Weisbuch (DW) model, can be synchronized by noise have not been theoretically proved. In this paper, we propose a generalized bounded confidence model which possesses a stochastic asynchronous rule. The model takes the DW model and the HK model as special cases and can significantly generalize the bounded confidence models to practical application. We discover that the asynchronous model possesses a different noise-based synchronization behavior compared to the synchronous HK model. Generally, the HK dynamics can achieve quasi-synchronization \emph{almost surely} under the drive of noise. For the asynchronous dynamics, we prove that the model can achieve quasi-synchronization \emph{in mean}, which is a new type of quasi-synchronization weaker than the "almost surely" sense. The results unify the theory of noise-induced synchronization of bounded confidence opinion dynamics and hence proves the noise-induced synchronization of DW model theoretically for the first time. Moreover, the results provide a theoretical foundation for developing noise-based control strategy of more complex social opinion systems with stochastic asynchronous rules.

Kai Shen

What is connected

Connect this record

See the researcher in context

Building this map preview

4 published item(s)

FlashLabs Chroma 1.0: A Real-Time End-to-End Spoken Dialogue Model with Personalized Voice Cloning

Scenario-based Multi-product Advertising Copywriting Generation for E-Commerce

CovidNet: To Bring Data Transparency in the Era of COVID-19

Quasi-synchronization of bounded confidence opinion dynamics with stochastic asynchronous rule