Researcher profile

Siyi Zhou

Siyi Zhou contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
7works
0followers
6topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

7 published item(s)

preprint2026arXiv

IndexTTS 2.5 Technical Report

In prior work, we introduced IndexTTS 2, a zero-shot neural text-to-speech foundation model comprising two core components: a transformer-based Text-to-Semantic (T2S) module and a non-autoregressive Semantic-to-Mel (S2M) module, which together enable faithful emotion replication and establish the first autoregressive duration-controllable generative paradigm. Building upon this, we present IndexTTS 2.5, which significantly enhances multilingual coverage, inference speed, and overall synthesis quality through four key improvements: 1) Semantic Codec Compression: we reduce the semantic codec frame rate from 50 Hz to 25 Hz, halving sequence length and substantially lowering both training and inference costs; 2) Architectural Upgrade: we replace the U-DiT-based backbone of the S2M module with a more efficient Zipformer-based modeling architecture, achieving notable parameter reduction and faster mel-spectrogram generation; 3) Multilingual Extension: We propose three explicit cross-lingual modeling strategies, boundary-aware alignment, token-level concatenation, and instruction-guided generation, establishing practical design principles for zero-shot multilingual emotional TTS that supports Chinese, English, Japanese, and Spanish, and enables robust emotion transfer even without target-language emotional training data; 4) Reinforcement Learning Optimization: we apply GRPO in post-training of the T2S module, improving pronunciation accuracy and natrualness. Experiments show that IndexTTS 2.5 not only supports broader language coverage but also replicates emotional prosody in unseen languages under the same zero-shot setting. IndexTTS 2.5 achieves a 2.28 times improvement in RTF while maintaining comparable WER and speaker similarity to IndexTTS 2.

preprint2022arXiv

Gravitational Waves from an Inflation Triggered First-Order Phase Transition

Large excursion of the inflaton field can trigger interesting dynamics. One important example is a first-order phase transition in a spectator sector which couples to the inflaton. Gravitational waves (GWs) from such a first-order phase transition during inflation, an example of an instantaneous source, have an oscillatory feature. In this work, we show that this feature is generic for a source in an era of accelerated expansion. We also demonstrate that the shape of the GW signal contains information about the evolution of the early universe following the phase transition. In particular, the slope of the infrared part of the GW spectrum is sensitive to the evolution of the Hubble parameter when the GW modes reenter the horizon after inflation. The slope of the profile of the intermediate oscillatory part and the ultraviolet part of the GW spectrum depend on the evolution of the Hubble parameter when the modes exit horizon during the inflation and when they reenter the horizon during the reheating. The ultraviolet spectrum also depends on the details of the dynamics of the phase transition. We consider the GW signal in several models of evolution during and after inflation, and compare them with the minimal scenario of quasi-de Sitter inflation followed by radiation domination after a fast reheating, and demonstrate that the shape of the GW can be used to distinguish them. In this way, the GW signal considered in this paper offers a powerful probe to the dynamics of the early universe which is otherwise difficult to explore directly through CMB, large scale structure, big bang nucleosynthesis (BBN), and other well-studied cosmological observables.

preprint2022arXiv

Spiky strings in de Sitter space

We study semiclassical spiky strings in de Sitter space and the corresponding Regge trajectories, generalizing the analysis in anti-de Sitter space. In particular we demonstrate that each Regge trajectory has a maximum spin due to de Sitter acceleration, similarly to the folded string studied earlier. While this property is useful for the spectrum to satisfy the Higuchi bound, it makes a nontrivial question how to maintain mildness of high-energy string scattering which we are familiar with in flat space and anti-de Sitter space. Our analysis implies that in order to have infinitely many higher spin states, one needs to consider infinitely many Regge trajectories with an increasing folding number.

preprint2022arXiv

String Regge trajectory in de Sitter space and implications for inflation

We study the spectrum of semiclassical rotating strings in de Sitter space and its consistency. Even though a naive extrapolation of the linear Regge trajectory on flat space implies a violation of the Higuchi bound (a unitarity bound on the mass of higher-spin particles in de Sitter space), the curved space effects turn out to modify the trajectory to respect the bound. Interestingly, as a consequence of accelerated expansion, there exists a maximum spin for each Regge trajectory, which is helpful to make the spectrum consistent with the Higuchi bound, but at the same time, it could be an obstruction to stringy UV completion based on an infinite higher-spin tower. By pushing further this observation, we demonstrate that the vacuum energy $V$ inflating the universe has to be bounded by the string scale $M_s$ as $V\lesssim M_s^4$, if UV completion is achieved with the leading Regge trajectory of higher spin states up to the 4D Planck scale. Its application to inflation in the early universe implies an upper bound on the tensor-to-scalar ratio, $r\lesssim 0.01\times(M_s/10^{16} \text{GeV})^{4}$, which is within the scope of the near future CMB experiments. We also discuss another possibility that UV completion is achieved by multiple Regge trajectories.

preprint2022arXiv

Superheavy Dark Matter Production from Symmetry Restoration First-Order Phase Transition During Inflation

We propose a scenario where superheavy dark matter (DM) can be produced via symmetry restoration first-order phase transition during inflation triggered by the evolution of the inflaton field. The phase transition happens in a spectator sector coupled to the inflaton field. During the phase transition, the spectator field tunnels from a symmetry-broken vacuum to a symmetry-restored vacuum. The massive particles produced after bubble collisions are protected against decaying by the restored symmetry and may serve as a DM candidate in the later evolution of the Universe. We show that the latent heat released during the phase transition can be sufficient to produce the DM relic abundance observed today. In addition, accompanied with the super heavy DM, this first-order phase transition also produces gravitational waves detectable via future gravitational wave detectors.

preprint2020arXiv

Cosmological Signatures of Superheavy Dark Matter

We discuss two possible scenarios, namely the curvaton mechanism and the dark matter density modulation, where non-Gaussianity signals of superheavy dark matter produced by gravity can be enhanced and observed. In both scenarios, superheavy dark matter couples to an additional light field as a mediator. In the case of derivative coupling, the resulting non-Gaussianities induced by the light field can be large, which can provide inflationary evidences for these superheavy dark matter scenarios.

preprint2020arXiv

Heavy Spinning Particles from Signs of Primordial Non-Gaussianities: Beyond the Positivity Bounds

Within the so-called cosmological collider program, imprints of new particles on primordial non-Gaussianities have been studied intensively. In particular, their non-analytic features in the soft limit provide a smoking gun for new particles at the inflation scale. While this approach is very powerful to probe particles of the mass near the Hubble scale, the signal is exponentially suppressed for heavy particles. In this paper, to enlarge the scope of the cosmological collider, we explore a new approach to probing spins of heavy particles from signs of Wilson coefficients of the inflaton effective action and the corresponding primordial non-Gaussianities. As a first step, we focus on the regime where the de Sitter conformal symmetry is weakly broken. It is well known that the leading order effective operator $(\partial_μϕ\partial^μϕ)^2$ is universally positive as a consequence of unitarity. In contrast, we find that the sign of the six derivative operator $(\nabla_μ\partial_νϕ)^2(\partial_ρϕ)^2$ is positive for intermediate heavy scalars, whereas it is negative for intermediate heavy spinning states. Therefore, under the assumption of tree-level UV completion, the sign can be used to probe spins of heavy particles generating the effective interaction. We also study phenomenology of primordial non-Gaussianities thereof.