Source author record

Cong Zhang

Cong Zhang appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Catalog footprint

What is connected

39works

30topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

CAGS: Color-Adaptive Volumetric Video Streaming with Dynamic 3D Gaussian Splatting

Volumetric video (VV) streaming enables real-time, immersive access to remote 3D environments, powering telepresence, ecological monitoring, and robotic teleoperation. These applications turn VV streaming into a real-time interface to remote physical environments, imposing new system-level demands for photorealistic scene representation, low-latency interaction, and robust performance under heterogeneous networks. 3D Gaussian Splatting (3DGS) has been widely used for real-time photorealistic rendering, offering superior visual quality and rendering performance, but it faces challenges due to bandwidth consumption. Furthermore, as the foundation of adaptive VV streaming, existing Levels of Detail (LoD) methods based on density are not well-suited to Gaussian representations, leading to visible gaps and severe quality degradation. Recent studies have also explored attribute compression techniques to reduce bandwidth consumption. Our preliminary studies reveal that aggressive attribute compression primarily causes color distortion, which can be effectively corrected in the rendered image using a reference image. Motivated by these findings, we propose a novel Color-Adaptive scheme for adaptive VV streaming that uses vector quantization (VQ) to establish LoDs and correct color distortions with low-resolution reference images. We further present CAGS, an adaptive VV streaming system compatible with diverse Gaussian representations, which integrates the Color-Adaptive scheme by rendering reference images on the streaming server and performing color restoration on the client. Extensive experiments on our prototype system demonstrate that CAGS outperforms the existing adaptive streaming systems in PSNR by 5$\sim$20 dB under fluctuating bandwidth, operates significantly faster than existing scalable Gaussian compression methods, and generalizes across different Gaussian representations.

preprint2026arXiv

MesonGS++: Post-training Compression of 3D Gaussian Splatting with Hyperparameter Searching

3D Gaussian Splatting (3DGS) achieves high-quality novel view synthesis with real-time rendering, but its storage cost remains prohibitive for practical deployment. Existing post-training compression methods still rely on many coupled hyperparameters across pruning, transformation, quantization, and entropy coding, making it difficult to control the final compressed size and fully exploit the rate-distortion trade-off. We propose MesonGS++, a size-aware post-training codec for 3D Gaussian compression. On the codec side, MesonGS++ combines joint importance-based pruning, octree geometry coding, attribute transformation, selective vector quantization for higher-degree spherical harmonics, and group-wise mixed-precision quantization with entropy coding. On the configuration side, it treats the reserve ratio and bit-width allocation as the dominant rate-distortion knobs and jointly optimizes them under a target storage budget via discrete sampling and 0--1 integer linear programming. We further propose a linear size estimator and a CUDA parallel quantization operator to accelerate the hyperparameter searching process. Extensive experiments show that MesonGS++ achieves over 34$\times$ compression while preserving rendering fidelity, outperforming state-of-the-art post-training methods and accurately meeting target size budgets. Remarkably, without any training, MesonGS++ can even surpass the PSNR of vanilla 3DGS at a 20$\times$ compression rate on the Stump scene. Our code is available at https://github.com/mmlab-sigs/mesongs_plus

preprint2023arXiv

Weighted EF1 Allocations for Indivisible Chores

We study how to fairly allocate a set of indivisible chores to a group of agents, where each agent $i$ has a non-negative weight $w_i$ that represents its obligation for undertaking the chores. We consider the fairness notion of weighted envy-freeness up to one item (WEF1) and propose an efficient picking sequence algorithm for computing WEF1 allocations. Our analysis is based on a natural and powerful continuous interpretation for the picking sequence algorithms in the weighted setting, which might be of independent interest. Using this interpretation, we establish the necessary and sufficient conditions under which picking sequence algorithms can guarantee other fairness notions in the weighted setting. We also study the existence of fair and efficient allocations and propose efficient algorithms for the computation of WEF1 and PO allocations for the bi-valued instances. Our result generalizes that of Garg et al. (AAAI 2022) and Ebadian et al. (AAMAS 2022) to the weighted setting. Our work also studies the price of fairness for WEF1, and the implications of WEF1 to other fairness notions.

preprint2022arXiv

Applying Feature Underspecified Lexicon Phonological Features in Multilingual Text-to-Speech

This study investigates whether the phonological features derived from the Featurally Underspecified Lexicon model can be applied in text-to-speech systems to generate native and non-native speech in English and Mandarin. We present a mapping of ARPABET/pinyin to SAMPA/SAMPA-SC and then to phonological features. This mapping was tested for whether it could lead to the successful generation of native, non-native, and code-switched speech in the two languages. We ran two experiments, one with a small dataset and one with a larger dataset. The results supported that phonological features could be used as a feasible input system for languages in or not in the train data, although further investigation is needed to improve model performance. The results lend support to FUL by presenting successfully synthesised output, and by having the output carrying a source-language accent when synthesising a language not in the training data. The TTS process stimulated human second language acquisition process and thus also confirm FUL's ability to account for acquisition.

preprint2022arXiv

ByT5 model for massively multilingual grapheme-to-phoneme conversion

In this study, we tackle massively multilingual grapheme-to-phoneme conversion through implementing G2P models based on ByT5. We have curated a G2P dataset from various sources that covers around 100 languages and trained large-scale multilingual G2P models based on ByT5. We found that ByT5 operating on byte-level inputs significantly outperformed the token-based mT5 model in terms of multilingual G2P. Pairwise comparison with monolingual models in these languages suggests that multilingual ByT5 models generally lower the phone error rate by jointly learning from a variety of languages. The pretrained model can further benefit low resource G2P through zero-shot prediction on unseen languages or provides pretrained weights for finetuning, which helps the model converge to a lower phone error rate than randomly initialized weights. To facilitate future research on multilingual G2P, we make available our code and pretrained multilingual G2P models at: https://github.com/lingjzhu/CharsiuG2P.

preprint2022arXiv

Fermion coupling to loop quantum gravity: canonical formulation

In the model of a fermion field coupled to loop quantum gravity, we consider the Gauss and the Hamiltonian constraints. According to the explicit solutions to the Gauss constraint, the fermion spins and the gravitational spin networks intertwine with each other so that the fermion spins contribute to the volume of the spin network vertices. For the Hamiltonian constraint, the regularization and quantization procedures are presented in detail. By introducing an adapted vertex Hilbert space to remove the regulator, we propose a diffeomorphism covariant graph-changing Hamiltonian constraint operator of the fermion field. This operator shows how fermions move in the loop quantum gravity spacetime and simultaneously influences the background quantum geometry.

preprint2022arXiv

Fermions on Quantum Geometry and Resolution of Doubling Problem

The fermion doubling problem has an important impact on quantum gravity, by revealing the tension between fermion and the fundamental discreteness of quantum spacetime. In this work, we discover that in Loop Quantum Gravity, the quantum geometry involving superposition of states associated with lattice refinements provides a resolution to the fermion doubling problem. We construct and analyze the fermion propagator on the quantum geometry, and we show that all fermion doubler modes are suppressed in the propagator. Our result suggests that the superposition nature of quantum geometry should resolve the tension between fermion and the fundamental discreteness, and relate to the continuum limit of quantum gravity.

preprint2022arXiv

First-Order Quantum Correction in Coherent State Expectation Value of Loop-Quantum-Gravity Hamiltonian

Given the non-graph-changing Hamiltonian $\widehat{H[N]}$ in Loop Quantum Gravity (LQG), $\langle\widehat{H[N]}\rangle$, the coherent state expectation value of $\widehat{H[N]}$, admits an semiclassical expansion in $\ell^2_{\rm p}$. In this paper, as presenting the detailed derivations of our previous work arXiv:2012.14242, we explicitly compute the expansion of $\langle\widehat{H[N]}\rangle$ to the linear order in $\ell^2_{\rm p}$ on the cubic graph with respect to the coherent state peaked at the homogeneous and isotropic data of cosmology. In our computation, a powerful algorithm is developed, supported by rigorous proofs and several theorems, to overcome the complexity in the computation of $\langle \widehat{H[N]} \rangle$. Particularly, some key innovations in our algorithm substantially reduce the complexity in computing the Lorentzian part of $\langle\widehat{H[N]}\rangle$. Additionally, some quantum correction effects resulting from $\langle\widehat{H[N]}\rangle$ in cosmology are discussed at the end of this paper.

preprint2022arXiv

First-Order Quantum Correction in Coherent State Expectation Value of Loop-Quantum-Gravity Hamiltonian: Overview and Results

Given the Loop-Quantum-Gravity (LQG) non-graph-changing Hamiltonian $\widehat{H[N]}$, the coherent state expectation value $\langle\widehat{H[N]}\rangle$ admits an semiclassical expansion in $\ell^2_{\rm p}$. In this paper, we compute explicitly the expansion of $\langle\widehat{H[N]}\rangle$ on the cubic graph to the linear order in $\ell^2_{\rm p}$, when the coherent state is peaked at the homogeneous and isotropic data of cosmology. In our computation, a powerful algorithm is developed to overcome the complexity in computing $\langle \widehat{H[N]} \rangle$. In particular, some key innovations in our algorithm substantially reduce the computational complexity in the Lorentzian part of $\langle\widehat{H[N]}\rangle$. Moreover, the algorithm developed in the present work makes it possible to compute the expectation value of arbitrary monomial of holonomies and fluxes on one edge up to arbitrary order of $\ell_{\rm p}^2$.

preprint2022arXiv

Learning to Solve Multiple-TSP with Time Window and Rejections via Deep Reinforcement Learning

We propose a manager-worker framework based on deep reinforcement learning to tackle a hard yet nontrivial variant of Travelling Salesman Problem (TSP), \ie~multiple-vehicle TSP with time window and rejections (mTSPTWR), where customers who cannot be served before the deadline are subject to rejections. Particularly, in the proposed framework, a manager agent learns to divide mTSPTWR into sub-routing tasks by assigning customers to each vehicle via a Graph Isomorphism Network (GIN) based policy network. A worker agent learns to solve sub-routing tasks by minimizing the cost in terms of both tour length and rejection rate for each vehicle, the maximum of which is then fed back to the manager agent to learn better assignments. Experimental results demonstrate that the proposed framework outperforms strong baselines in terms of higher solution quality and shorter computation time. More importantly, the trained agents also achieve competitive performance for solving unseen larger instances.

preprint2022arXiv

Phone-to-audio alignment without text: A Semi-supervised Approach

The task of phone-to-audio alignment has many applications in speech research. Here we introduce two Wav2Vec2-based models for both text-dependent and text-independent phone-to-audio alignment. The proposed Wav2Vec2-FS, a semi-supervised model, directly learns phone-to-audio alignment through contrastive learning and a forward sum loss, and can be coupled with a pretrained phone recognizer to achieve text-independent alignment. The other model, Wav2Vec2-FC, is a frame classification model trained on forced aligned labels that can both perform forced alignment and text-independent segmentation. Evaluation results suggest that both proposed methods, even when transcriptions are not available, generate highly close results to existing forced alignment tools. Our work presents a neural pipeline of fully automated phone-to-audio alignment. Code and pretrained models are available at https://github.com/lingjzhu/charsiu.

preprint2022arXiv

Polarization measurement for the dileptonic channel of $W^+ W^-$ scattering using generative adversarial network

Measuring the polarization fractions of the $W^+W^-$ scattering reveals the interactions of the Higgs boson as well as new neutral states that are related to the standard model electroweak symmetry breaking. The dileptonic channel has a relatively lower background rate, but the kinematics of its final states can not be fully reconstructed due to the presence of two neutrinos. We propose neural networks to establish maps between the distributions of measurable quantities and the distributions of the lepton angles in $W$ boson rest frames. New physics contributions and collision energy can largely affect the kinematic properties of the $W^+W^-$ scattering beside the lepton angles. To make the network in ignorance of that information, the loss function is modified in two different ways. We show that the networks are promising in reproducing the lepton angle distributions, and the precision of the fitted polarization fractions obtained from network predictions is comparable to that obtained with the truth lepton angle. Although the best-fit values of polarization fractions do not change much after including the background uncertainty, the precisions is substantially reduced. Our trained models are available at GitHub.

preprint2022arXiv

Reduced Phase Space Quantization of Black Holes: Path Integrals, and Effective Dynamics

We consider the loop quantum theory of the spherically symmetric model of gravity coupled to Gaussian dust fields, where the Gaussian dust fields provide a material reference frame of the space and time to deparameterize gravity. This theory, used to study the quantum features of the spherically symmetric black hole, is constructed based on a 1-dimensional lattice $γ\subset\mathbb R$. Taking advantage of the path integral formulation, we investigate the quantum dynamics and obtain an effective action. With this action, we get an effective continuous description of this quantum lattice system which is not the same as the one described by the effective Hamiltonian used in arXiv:2012.05729, i.e. the classical Hamiltonian with the holonomy correction. It turns out that the Hamiltonian derived in this paper returns that used in arXiv:2012.05729 only for macro black holes since the lattice $γ$ is required to be sufficiently fine. Indeed, it is necessary to propose this fine-grained lattice structure in order to well describe the underlying lattice theory by the continuous description.

preprint2022arXiv

Sampling Efficient Deep Reinforcement Learning through Preference-Guided Stochastic Exploration

Massive practical works addressed by Deep Q-network (DQN) algorithm have indicated that stochastic policy, despite its simplicity, is the most frequently used exploration approach. However, most existing stochastic exploration approaches either explore new actions heuristically regardless of Q-values or inevitably introduce bias into the learning process to couple the sampling with Q-values. In this paper, we propose a novel preference-guided $ε$-greedy exploration algorithm that can efficiently learn the action distribution in line with the landscape of Q-values for DQN without introducing additional bias. Specifically, we design a dual architecture consisting of two branches, one of which is a copy of DQN, namely the Q-branch. The other branch, which we call the preference branch, learns the action preference that the DQN implicit follows. We theoretically prove that the policy improvement theorem holds for the preference-guided $ε$-greedy policy and experimentally show that the inferred action preference distribution aligns with the landscape of corresponding Q-values. Consequently, preference-guided $ε$-greedy exploration motivates the DQN agent to take diverse actions, i.e., actions with larger Q-values can be sampled more frequently whereas actions with smaller Q-values still have a chance to be explored, thus encouraging the exploration. We assess the proposed method with four well-known DQN variants in nine different environments. Extensive results confirm the superiority of our proposed method in terms of performance and convergence speed. Index Terms- Preference-guided exploration, stochastic policy, data efficiency, deep reinforcement learning, deep Q-learning.

preprint2022arXiv

Spherical Convolution empowered FoV Prediction in 360-degree Video Multicast with Limited FoV Feedback

Field of view (FoV) prediction is critical in 360-degree video multicast, which is a key component of the emerging Virtual Reality (VR) and Augmented Reality (AR) applications. Most of the current prediction methods combining saliency detection and FoV information neither take into account that the distortion of projected 360-degree videos can invalidate the weight sharing of traditional convolutional networks, nor do they adequately consider the difficulty of obtaining complete multi-user FoV information, which degrades the prediction performance. This paper proposes a spherical convolution-empowered FoV prediction method, which is a multi-source prediction framework combining salient features extracted from 360-degree video with limited FoV feedback information. A spherical convolution neural network (CNN) is used instead of a traditional two-dimensional CNN to eliminate the problem of weight sharing failure caused by video projection distortion. Specifically, salient spatial-temporal features are extracted through a spherical convolution-based saliency detection model, after which the limited feedback FoV information is represented as a time-series model based on a spherical convolution-empowered gated recurrent unit network. Finally, the extracted salient video features are combined to predict future user FoVs. The experimental results show that the performance of the proposed method is better than other prediction methods.

preprint2022arXiv

VSEGAN: Visual Speech Enhancement Generative Adversarial Network

Speech enhancement is an essential task of improving speech quality in noise scenario. Several state-of-the-art approaches have introduced visual information for speech enhancement,since the visual aspect of speech is essentially unaffected by acoustic environment. This paper proposes a novel frameworkthat involves visual information for speech enhancement, by in-corporating a Generative Adversarial Network (GAN). In par-ticular, the proposed visual speech enhancement GAN consistof two networks trained in adversarial manner, i) a generator that adopts multi-layer feature fusion convolution network to enhance input noisy speech, and ii) a discriminator that attemptsto minimize the discrepancy between the distributions of the clean speech signal and enhanced speech signal. Experiment re-sults demonstrated superior performance of the proposed modelagainst several state-of-the-art

preprint2021arXiv

DeepFake-o-meter: An Open Platform for DeepFake Detection

In recent years, the advent of deep learning-based techniques and the significant reduction in the cost of computation resulted in the feasibility of creating realistic videos of human faces, commonly known as DeepFakes. The availability of open-source tools to create DeepFakes poses as a threat to the trustworthiness of the online media. In this work, we develop an open-source online platform, known as DeepFake-o-meter, that integrates state-of-the-art DeepFake detection methods and provide a convenient interface for the users. We describe the design and function of DeepFake-o-meter in this work.

preprint2021arXiv

Loop quantum deparametrized Schwarzschild interior and discrete black hole mass

We present the detailed analyses of a model of loop quantum Schwarzschild interior coupled to a massless scalar field and extend the results in our previous rapid communication arXiv:2006.08313 to more general schemes. It is shown that the spectrum of the black hole mass is discrete and does not contain zero. This indicates the existence of a black hole remnant after Hawking evaporation due to loop quantum gravity effects. Besides to show the existence of a stable black hole remnant in the vacuum case, the quantum dynamics for the non-vacuum case is also solved and compared with the effective one.

preprint2021arXiv

Twisted geometry coherent states in all dimensional loop quantum gravity: I. Construction and Peakedness properties

A new family of coherent states for all dimensional loop quantum gravity are proposed, which is based on the generalized twisted geometry parametrization of the phase space of $SO(D+1)$ connection theory. We prove that this family of coherent states provide an over-complete basis of the Hilbert space in which edge simplicity constraint is solved. Moreover, according to our explicit calculation, the expectation values of holonomy and flux operators with respect to this family of coherent states coincide with the corresponding classical values given by the labels of the coherent states, up to some gauge degrees of freedom. Besides, we study the peakedness properties of this family of coherent states, including the peakedness of the wave functions of this family of coherent states in holonomy, momentum and phase space representations. It turns out that the peakedness in these various representations and the (relative) uncertainty of the expectation values of the operators are well controlled by the semi-classical parameter $t$. Therefore, this family of coherent states provide a candidate for the semi-classical analysis of all dimensional loop quantum gravity.

preprint2020arXiv

Alternative dynamics in loop quantum Brans-Dicke cosmology

To inherit more features of full loop quantum Brans-Dicke theory, the Euclidean and Lorentzian terms of the Hamiltonian constraint are quantized independently in loop quantum Brans-Dicke cosmology. An alternative Hamiltonian constraint operator and its effective expression are obtained in the cosmological model. A residual quantum correction term is found in the effective Hamiltonian constraint, which has no analog in the effective Hamiltonian of the loop quantum cosmology from general relativity. The dynamics driven by this effective Hamiltonian constraint is analyzed in detail. For the physically interesting case of $ω\gg 1$, this effective Hamiltonian drives a bouncing evolution which evolves from a de Sitter universe to a classical Brans-Dicke solution.

preprint2020arXiv

DSP: A Differential Spatial Prediction Scheme for Comprehensive real industrial datasets

Inverse Distance Weighted models (IDW) have been widely used for predicting and modeling multidimensional space in multimodal industrial processes. However, the more complex the structure of multidimensional space, the lower the performance of IDW models, and real industrial datasets tend to have more complex spatial structure. To solve this problem, a new framework for spatial prediction and modeling based on deep reinforcement learning network is proposed. In the proposed framework, the internal relationship between state and action is enhanced by reusing the state values in the Q network, and the convergence rate and stability of the deep reinforcement learning network are improved. The improved deep reinforcement learning network is then used to search for and learn the hyperparameters of each sample point in the inverse distance weighted model. These hyperparameters can reflect the spatial structure of the current industrial dataset to some extent. Then a spatial distribution of hyperparameters is constructed based on the learned hyperparameters. Each interpolation point obtains corresponding hyperparameters from the hyperparametric spatial distribution and brings them into the classical IDW models for prediction, thus achieving differential spatial prediction and modeling. The simulation results show that the proposed framework is suitable for real industrial datasets with complex spatial structure characteristics and is more accurate than current IDW models in spatial prediction.

preprint2020arXiv

Joint Communication and Computational Resource Allocation for QoE-driven Point Cloud Video Streaming

Point cloud video is the most popular representation of hologram, which is the medium to precedent natural content in VR/AR/MR and is expected to be the next generation video. Point cloud video system provides users immersive viewing experience with six degrees of freedom and has wide applications in many fields such as online education, entertainment. To further enhance these applications, point cloud video streaming is in critical demand. The inherent challenges lie in the large size by the necessity of recording the three-dimensional coordinates besides color information, and the associated high computation complexity of encoding. To this end, this paper proposes a communication and computation resource allocation scheme for QoE-driven point cloud video streaming. In particular, we maximize system resource utilization by selecting different quantities, transmission forms and quality level tiles to maximize the quality of experience. Extensive simulations are conducted and the simulation results show the superior performance over the existing schemes

preprint2020arXiv

Koopman Operator and Phase Space Partition of Chaotic Maps

Koopman operator describes evolution of observables in the phase space, which could be used to extract characteristic dynamical features of a nonlinear system. Here, we show that it is possible to carry out interesting symbolic partitions based on properly constructed eigenfunctions of the operator for chaotic maps. The partition boundaries are the extrema of these eigenfunctions, the accuracy of which is improved by including more basis functions in the numerical computation. The validity of this scheme is demonstrated in well-known 1-d and 2-d maps. It seems no obstacle to extend the computation to nonlinear systems of high dimensions, which provides a possible way of dissecting complex dynamics.

preprint2020arXiv

Quantum geometry and effective dynamics of Janis-Newman-Winicour singularities

Inspired by the recent proposal for the quantum effective dynamics of the Schwarzschild spacetime given in \cite{AOS1}, we investigate the effective dynamics of the loop quantized Janis-Newman-Winicour (JNW) spacetime which is an extension of the Schwarzschild spacetime with an extra minimally coupled massless scalar field. Two parameters are introduced in order to regularize the Hamiltonian constraint in the quantum effective dynamics. These two parameters are assumed to be Dirac observables when the effective dynamics is solved. By carefully choosing appropriate conditions for these two parameters, we completely determine them, and the resulted new effective description of the JNW spacetime leads to a well behaved quantum dynamics which on one hand resolves the classical singularities, and on the other hand, agrees with the classical dynamics in the low curvature region.

preprint2016arXiv

Factors in Finetuning Deep Model for object detection

Finetuning from a pretrained deep model is found to yield state-of-the-art performance for many vision tasks. This paper investigates many factors that influence the performance in finetuning for object detection. There is a long-tailed distribution of sample numbers for classes in object detection. Our analysis and empirical results show that classes with more samples have higher impact on the feature learning. And it is better to make the sample number more uniform across classes. Generic object detection can be considered as multiple equally important tasks. Detection of each class is a task. These classes/tasks have their individuality in discriminative visual appearance representation. Taking this individuality into account, we cluster objects into visually similar class groups and learn deep representations for these groups separately. A hierarchical feature learning scheme is proposed. In this scheme, the knowledge from the group with large number of classes is transferred for learning features in its sub-groups. Finetuned on the GoogLeNet model, experimental results show 4.7% absolute mAP improvement of our approach on the ImageNet object detection dataset without increasing much computational cost at the testing stage.

preprint2016arXiv

Towards Hybrid Cloud-assisted Crowdsourced Live Streaming: Measurement and Analysis

Crowdsourced Live Streaming (CLS), most notably Twitch.tv, has seen explosive growth in its popularity in the past few years. In such systems, any user can lively broadcast video content of interest to others, e.g., from a game player to many online viewers. To fulfill the demands from both massive and heterogeneous broadcasters and viewers, expensive server clusters have been deployed to provide video ingesting and transcoding services. Despite the existence of highly popular channels, a significant portion of the channels is indeed unpopular. Yet as our measurement shows, these broadcasters are consuming considerable system resources; in particular, 25% (resp. 30%) of bandwidth (resp. computation) resources are used by the broadcasters who do not have any viewers at all. In this paper, we closely examine the challenge of handling unpopular live-broadcasting channels in CLS systems and present a comprehensive solution for service partitioning on hybrid cloud. The trace-driven evaluation shows that our hybrid cloud-assisted design can smartly assign ingesting and transcoding tasks to the elastic cloud virtual machines, providing flexible system deployment cost-effectively.

preprint2015arXiv

Crowdsourced Live Streaming over the Cloud

Empowered by today's rich tools for media generation and distribution, and the convenient Internet access, crowdsourced streaming generalizes the single-source streaming paradigm by including massive contributors for a video channel. It calls a joint optimization along the path from crowdsourcers, through streaming servers, to the end-users to minimize the overall latency. The dynamics of the video sources, together with the globalized request demands and the high computation demand from each sourcer, make crowdsourced live streaming challenging even with powerful support from modern cloud computing. In this paper, we present a generic framework that facilitates a cost-effective cloud service for crowdsourced live streaming. Through adaptively leasing, the cloud servers can be provisioned in a fine granularity to accommodate geo-distributed video crowdsourcers. We present an optimal solution to deal with service migration among cloud instances of diverse lease prices. It also addresses the location impact to the streaming quality. To understand the performance of the proposed strategies in the realworld, we have built a prototype system running over the planetlab and the Amazon/Microsoft Cloud. Our extensive experiments demonstrate that the effectiveness of our solution in terms of deployment cost and streaming quality.

preprint2015arXiv

Distinct itinerant spin-density waves and local-moment antiferromagnetism in an intermetallic ErPd$_2$Si$_2$ single crystal

Identifying the nature of magnetism, itinerant or localized, remains a major challenge in condensed-matter science. Purely localized moments appear only in magnetic insulators, whereas itinerant moments more or less co-exist with localized moments in metallic compounds such as the doped-cuprate or the iron-based superconductors, hampering a thorough understanding of the role of magnetism in phenomena like superconductivity or magnetoresistance. Here we distinguish two antiferromagnetic modulations with respective propagation wave vectors of $Q_{\pm}$ = ($H \pm 0.557(1)$, 0, $L \pm 0.150(1)$) and $Q_\text{C}$ = ($H \pm 0.564(1)$, 0, $L$), where $\left(H, L\right)$ are allowed Miller indices, in an ErPd$_2$Si$_2$ single crystal by neutron scattering and establish their respective temperature- and field-dependent phase diagrams. The modulations can co-exist but also compete depending on temperature or applied field strength. They couple differently with the underlying lattice albeit with associated moments in a common direction. The $Q_{\pm}$ modulation may be attributed to localized 4\emph{f} moments while the $Q_\text{C}$ correlates well with itinerant conduction bands, supported by our transport studies. Hence, ErPd$_2$Si$_2$ represents a new model compound that displays clearly-separated itinerant and localized moments, substantiating early theoretical predictions and providing a unique platform allowing the study of itinerant electron behavior in a localized antiferromagnetic matrix.

preprint2015arXiv

On Crowdsourced Interactive Live Streaming: A Twitch.TV-Based Measurement Study

Empowered by today's rich tools for media generation and collaborative production, the multimedia service paradigm is shifting from the conventional single source, to multi-source, to many sources, and now toward {\em crowdsource}. Such crowdsourced live streaming platforms as Twitch.tv allow general users to broadcast their content to massive viewers, thereby greatly expanding the content and user bases. The resources available for these non-professional broadcasters however are limited and unstable, which potentially impair the streaming quality and viewers' experience. The diverse live interactions among the broadcasters and viewers can further aggravate the problem. In this paper, we present an initial investigation on the modern crowdsourced live streaming systems. Taking Twitch as a representative, we outline their inside architecture using both crawled data and captured traffic of local broadcasters/viewers. Closely examining the access data collected in a two-month period, we reveal that the view patterns are determined by both events and broadcasters' sources. Our measurements explore the unique source- and event-driven views, showing that the current delay strategy on the viewer's side substantially impacts the viewers' interactive experience, and there is significant disparity between the long broadcast latency and the short live messaging latency. On the broadcaster's side, the dynamic uploading capacity is a critical challenge, which noticeably affects the smoothness of live streaming for viewers.

preprint2014arXiv

Four New Observational $H(z)$ Data From Luminous Red Galaxies of Sloan Digital Sky Survey Data Release Seven

By adopting the differential age method, we utilize selected 17832 luminous red galaxies (LRGs) from Sloan Digital Sky Survey Data Release Seven (SDSS DR7) covering redshift $0<z<0.4$ to measure Hubble parameters. Using a full spectrum fitting package UlySS, these spectra are reduced with single stellar population (SSP) models and optimal age information of our selected sample are derived. With the decreasing age-redshift relation, four new observational $H(z)$ data (OHD) points are obtained, which are $H(z)=69.0\pm19.6$ km s$^{-1}$ Mpc$^{-1}$ at $z=0.07$, $H(z)=68.6\pm26.2$ km s$^{-1}$ Mpc$^{-1}$ at $z=0.12$, $H(z)$=$72.9\pm29.6$ km s$^{-1}$ Mpc$^{-1}$ at $z=0.2$ and $H(z)$=$88.8\pm36.6$ km s$^{-1}$ Mpc$^{-1}$ at $z=0.28$, respectively. Combined with other 21 available OHD data points, a performance of constraint on both flat and non-flat $Λ$CDM model is presented.

preprint2014arXiv

Incommensurate antiferromagnetic order in the manifoldly-frustrated SrTb$_2$O$_4$ with transition temperature up to 4.28 K

The N$\acute{\rm e}$el temperature of the new frustrated family of Sr\emph{RE}$_2$O$_4$ (\emph{RE} = rare earth) compounds is yet limited to $\sim$ 0.9 K, which more or less hampers a complete understanding of the relevant magnetic frustrations and spin interactions. Here we report on a new frustrated member to the family, SrTb$_2$O$_4$ with a record $T_{\rm N}$ = 4.28(2) K, and an experimental study of the magnetic interacting and frustrating mechanisms by polarized and unpolarized neutron scattering. The compound SrTb$_2$O$_4$ displays an incommensurate antiferromagnetic (AFM) order with a transverse wave vector \textbf{Q}$^{\rm 0.5 K}_{\rm AFM}$ = (0.5924(1), 0.0059(1), 0) albeit with partially-ordered moments, 1.92(6) $μ_{\rm B}$ at 0.5 K, stemming from only one of the two inequivalent Tb sites mainly by virtue of their different octahedral distortions. The localized moments are confined to the \emph{bc} plane, 11.9(66)$^\circ$ away from the \emph{b} axis probably by single-ion anisotropy. We reveal that this AFM order is dominated mainly by dipole-dipole interactions and disclose that the octahedral distortion, nearest-neighbour (NN) ferromagnetic (FM) arrangement, different next NN FM and AFM configurations, and in-plane anisotropic spin correlations are vital to the magnetic structure and associated multiple frustrations. The discovery of the thus far highest AFM transition temperature renders SrTb$_2$O$_4$ a new friendly frustrated platform in the family for exploring the nature of magnetic interactions and frustrations.

preprint2014arXiv

Magnetization, crystal structure and anisotropic thermal expansion of single-crystal SrEr2O4

The magnetization, crystal structure, and thermal expansion of a nearly stoichiometric Sr$_{1.04(3)}$Er$_{2.09(6)}$O$_{4.00(1)}$ single crystal have been studied by PPMS measurements and in-house and high-resolution synchrotron X-ray powder diffraction. No evidence was detected for any structural phase transitions even up to 500 K. The average thermal expansions of lattice constants and unit-cell volume are consistent with the first-order Grüneisen approximations taking into account only the phonon contributions for an insulator, displaying an anisotropic character along the crystallographic \emph{a}, \emph{b}, and \emph{c} axes. Our magnetization measurements indicate that obvious magnetic frustration appears below $\sim$15 K, and antiferromagnetic correlations may persist up to 300 K.

preprint2014arXiv

Nonmagnetic ordering state of single-crystal SrTm$_2$O$_4$: A polarized and unpolarized neutron-scattering study

Our single-crystal polarized neutron scattering at 65 mK and powder unpolarized neutron diffraction at 0.5 K show no evidence for a long-range magnetic order and even detect no sign of diffuse magnetic neutron scattering in single-crystal SrTm2O4. The data refinements reveal that the two TmO6 octahedral distortion modes are the same as those of the TbO6 octahedra in SrTb2O4, i.e., one distortion is stronger than the other one especially at low temperatures, which is attributed probably to different crystal electric fields for the two inequivalent octahedra. Consequently, we conclude that SrTm2O4 has no magnetic order, neither long-ranged nor short-ranged, even down to 65 mK. Therefore, SrTm2O4 is a different compound from its brethren in the new family of frustrated SrRE2O4 (RE = Gd, Tb, Dy, Ho, Er, and Yb) magnets. We propose that crystal field anisotropy may dominate over weak dipolar spin interactions in SrTm2O4, leading to a virtually nonmagnetic ordering state.

preprint2013arXiv

Exact temporal evolution of the two-species Bose-Einstein condensates

We construct exact stationary solutions to the one-dimensional coupled Gross-Pitaevskii equations for the two-species Bose-Einstein condensates with equal intraspecies and interspecies interaction constants. Three types of complex solutions as well as their soliton limits are derived. By making use of the SU(2) unitary symmetry, we further obtain analytical time-evolving solutions. These solutions exhibit spatiotemporal periodicity.

preprint2013arXiv

Mechanical design and analysis for a low beta squeezed half-wave resonator

A superconducting half-wave resonator (HWR) of frequency=162.5 MHz and β=0.09 has been developed at Institute of Modern Physics. Mechanical stability of the low beta HWR cavity is a big challenge in cavity design and optimization. The mechanical deformations of a radio frequency superconducting cavity could be a source of instability, both in continues wave(CW) operation or in pulsed mode. Generally, the lower beta cavities have stronger Lorentz force detuning than that of the higher beta cavities. In this paper, a basic design consideration in the stiffening structure for the detuning effect caused by helium pressure and Lorentz force has been presented. The mechanical modal analysis has been investigated with finite element method(FEM). Based on these considerations, a new stiffening structure has been promoted for the HWR cavity. The computation results concerning the frequency shift show that the low beta HWR cavity with new stiffening structure has low frequency sensitivity coefficient, Lorentz force detuning coefficient KL and stable mechanical property.

preprint2013arXiv

Skyrmion crystals in the pseudo-spin-1/2 Bose-Einstein condensates

Exact two-dimensional solutions are constructed for the pseudo-spin-1/2 Bose-Einstein condensates which are described by the coupled nonlinear Gross-Pitaevskii equations where the intraspecies and interspecies coupling constants are assumed to be equal. The equations are decoupled by means of re-combinations of the nonlinear terms of the hyperfine states according to the spatial dimensions. These stationary solutions form various spin textures which are identified as skyrmion crystals. In a special case, the crystal of skyrmion-antiskyrmion pairs is formed in the soliton limit.

preprint2013arXiv

Study on the frequency tuning of half-wave resonator at IMP

A 162.5 MHz superconducting half-wave resonator (HWR) with geometry beta of 0.09 is being developed for Injector II of China Accelerator Driven Sub-critical System (CADS) Project at the Institute of Modern Physics (IMP). The HWR section composed of 16 HWR cavities will accelerate the proton beam from 2.1 MeV to 10 MeV. The RF and mechanical coupled analysis are essential in geometry design in order to predict the deformation of the cavity walls and the frequency shift caused by the deformation. In this paper, the detuning caused by both bath helium pressure and Lorentz force is analysed and a tuning system has been investigated and designed to compensate the detuning by deforming the cavity along the beam axis. The simulations performed with ANSYS code show that the tuning system can adjust and compensate the frequency drift due to external vibrations and helium pressure fluctuation during operation.

preprint2012arXiv

Analytical solutions to the spin-1 Bose-Einstein condensates

We analytically solve the one-dimensional coupled Gross-Pitaevskii equations which govern the motion of F=1 spinor Bose-Einstein condensates. The nonlinear density-density interactions are decoupled by making use of the unique properties of the Jacobian elliptical functions. Several types of complex stationary solutions are deduced. Furthermore, exact non-stationary solutions to the time-dependent Gross-Pitaevskii equations are constructed by making use of the spin-rotational symmetry of the Hamiltonian. The spin-polarizations exhibit kinked configurations. Our method is applicable to other coupled nonlinear systems.

preprint2012arXiv

Improving imaging resolution of shaking targets by Fourier-transform ghost diffraction

For conventional imaging, shaking of the imaging system or the target leads to the degradation of imaging resolution. In this work, the influence of the target's shaking to fourier-transform ghost diffraction (FGD) is investigated. The analytical results, which are backed up by numerical simulation and experiments, demonstrate that the quiver of target has no effect on the resolution of FGD, thus the target's imaging with high spatial resolution can be always achieved by phase-retrieval method from the FGD patterns. This approach can be applied in high-precision imaging systems, to overcome the influence of the system's shaking to imaging resolution.

Cong Zhang

What is connected

Connect this record

See the researcher in context

Building this map preview

39 published item(s)

CAGS: Color-Adaptive Volumetric Video Streaming with Dynamic 3D Gaussian Splatting

MesonGS++: Post-training Compression of 3D Gaussian Splatting with Hyperparameter Searching

Weighted EF1 Allocations for Indivisible Chores

Applying Feature Underspecified Lexicon Phonological Features in Multilingual Text-to-Speech

ByT5 model for massively multilingual grapheme-to-phoneme conversion

Fermion coupling to loop quantum gravity: canonical formulation

Fermions on Quantum Geometry and Resolution of Doubling Problem

First-Order Quantum Correction in Coherent State Expectation Value of Loop-Quantum-Gravity Hamiltonian

First-Order Quantum Correction in Coherent State Expectation Value of Loop-Quantum-Gravity Hamiltonian: Overview and Results

Learning to Solve Multiple-TSP with Time Window and Rejections via Deep Reinforcement Learning

Phone-to-audio alignment without text: A Semi-supervised Approach

Polarization measurement for the dileptonic channel of $W^+ W^-$ scattering using generative adversarial network

Reduced Phase Space Quantization of Black Holes: Path Integrals, and Effective Dynamics

Sampling Efficient Deep Reinforcement Learning through Preference-Guided Stochastic Exploration

Spherical Convolution empowered FoV Prediction in 360-degree Video Multicast with Limited FoV Feedback

VSEGAN: Visual Speech Enhancement Generative Adversarial Network

DeepFake-o-meter: An Open Platform for DeepFake Detection

Loop quantum deparametrized Schwarzschild interior and discrete black hole mass

Twisted geometry coherent states in all dimensional loop quantum gravity: I. Construction and Peakedness properties

Alternative dynamics in loop quantum Brans-Dicke cosmology

DSP: A Differential Spatial Prediction Scheme for Comprehensive real industrial datasets

Joint Communication and Computational Resource Allocation for QoE-driven Point Cloud Video Streaming

Koopman Operator and Phase Space Partition of Chaotic Maps

Quantum geometry and effective dynamics of Janis-Newman-Winicour singularities

Factors in Finetuning Deep Model for object detection

Towards Hybrid Cloud-assisted Crowdsourced Live Streaming: Measurement and Analysis

Crowdsourced Live Streaming over the Cloud

Distinct itinerant spin-density waves and local-moment antiferromagnetism in an intermetallic ErPd$_2$Si$_2$ single crystal

On Crowdsourced Interactive Live Streaming: A Twitch.TV-Based Measurement Study

Four New Observational $H(z)$ Data From Luminous Red Galaxies of Sloan Digital Sky Survey Data Release Seven

Incommensurate antiferromagnetic order in the manifoldly-frustrated SrTb$_2$O$_4$ with transition temperature up to 4.28 K

Magnetization, crystal structure and anisotropic thermal expansion of single-crystal SrEr2O4

Nonmagnetic ordering state of single-crystal SrTm$_2$O$_4$: A polarized and unpolarized neutron-scattering study

Exact temporal evolution of the two-species Bose-Einstein condensates

Mechanical design and analysis for a low beta squeezed half-wave resonator

Skyrmion crystals in the pseudo-spin-1/2 Bose-Einstein condensates

Study on the frequency tuning of half-wave resonator at IMP

Analytical solutions to the spin-1 Bose-Einstein condensates

Improving imaging resolution of shaking targets by Fourier-transform ghost diffraction