Source author record

Feng Yu

Feng Yu appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Machine Learning math.PR Artificial Intelligence hep-th Information Retrieval Social and Information Networks Sound Computation Data Structures and Algorithms Databases Distributed, Parallel, and Cluster Computing eess.AS math.AP Multimedia Networking and Internet Architecture physics.optics Populations and Evolution

Catalog footprint

What is connected

20works

17topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

SymphonyGen: 3D Hierarchical Orchestral Generation with Controllable Harmony Skeleton

Generating symphonic music requires simultaneously managing high-level structural form and dense, multi-track orchestration. Existing symbolic models often struggle with a "complexity-control imbalance", in which scaling bottlenecks limit long-term granular steerability. We present SymphonyGen, a 3D hierarchical framework for contemporary cinematic orchestration. SymphonyGen employs a cascading decoder architecture that decomposes the Bar, Track, and Event axes, improving computational efficiency and scalability over conventional 1D or 2D models. We introduce "short-score" conditioning via a beat-quantized multi-voice harmony skeleton, enabling outline control while preserving textural diversity. The model is further refined using Group Relative Policy Optimization (GRPO) with a cross-modal audio-perceptual reward, aligning symbolic output with modern acoustic expectations. Additionally, we implement a dissonance-averse sampling algorithm to suppress unintended tonal clashes during inference. Objective evaluations show that both reinforcement learning and dissonance-averse sampling effectively enhance harmonic cleanliness while maintaining melodic expression. Subjective evaluations demonstrate that SymphonyGen outperforms baselines in musicality and preference for orchestral music generation. Demo page: https://symphonygen.github.io/

preprint2024arXiv

Hyperparameter Estimation for Sparse Bayesian Learning Models

Sparse Bayesian Learning (SBL) models are extensively used in signal processing and machine learning for promoting sparsity through hierarchical priors. The hyperparameters in SBL models are crucial for the model's performance, but they are often difficult to estimate due to the non-convexity and the high-dimensionality of the associated objective function. This paper presents a comprehensive framework for hyperparameter estimation in SBL models, encompassing well-known algorithms such as the expectation-maximization (EM), MacKay, and convex bounding (CB) algorithms. These algorithms are cohesively interpreted within an alternating minimization and linearization (AML) paradigm, distinguished by their unique linearized surrogate functions. Additionally, a novel algorithm within the AML framework is introduced, showing enhanced efficiency, especially under low signal noise ratios. This is further improved by a new alternating minimization and quadratic approximation (AMQ) paradigm, which includes a proximal regularization term. The paper substantiates these advancements with thorough convergence analysis and numerical experiments, demonstrating the algorithm's effectiveness in various noise conditions and signal-to-noise ratios.

preprint2024arXiv

MusicAOG: an Energy-Based Model for Learning and Sampling a Hierarchical Representation of Symbolic Music

In addressing the challenge of interpretability and generalizability of artificial music intelligence, this paper introduces a novel symbolic representation that amalgamates both explicit and implicit musical information across diverse traditions and granularities. Utilizing a hierarchical and-or graph representation, the model employs nodes and edges to encapsulate a broad spectrum of musical elements, including structures, textures, rhythms, and harmonies. This hierarchical approach expands the representability across various scales of music. This representation serves as the foundation for an energy-based model, uniquely tailored to learn musical concepts through a flexible algorithm framework relying on the minimax entropy principle. Utilizing an adapted Metropolis-Hastings sampling technique, the model enables fine-grained control over music generation. A comprehensive empirical evaluation, contrasting this novel approach with existing methodologies, manifests considerable advancements in interpretability and controllability. This study marks a substantial contribution to the fields of music analysis, composition, and computational musicology.

preprint2022arXiv

Non-Abelian braiding on photonic chips

Non-Abelian braiding has attracted significant attention because of its pivotal role in describing the exchange behaviors of anyons--a candidate for realizing quantum logics. The input and outcome of non-Abelian braiding are connected by a unitary matrix which can also physically emerge as a geometric-phase matrix in classical systems. Hence it is predicted that non-Abelian braiding should have analogues in photonics, but a feasible platform and the experimental realization remain out of reach. Here, we propose and experimentally realize an on-chip photonic system that achieves the non-Abelian braiding of up to five photonic modes. The braiding is realized by controlling the multi-mode geometric-phase matrix in judiciously designed photonic waveguide arrays. The quintessential effect of braiding--sequence-dependent swapping of photon dwell sites is observed in both classical-light and single-photon experiments. Our photonic chips are a versatile and expandable platform for studying non-Abelian physics, and we expect the results to motivate next-gen non-Abelian photonic devices.

preprint2022arXiv

Robust Regularized Low-Rank Matrix Models for Regression and Classification

While matrix variate regression models have been studied in many existing works, classical statistical and computational methods for the analysis of the regression coefficient estimation are highly affected by high dimensional and noisy matrix-valued predictors. To address these issues, this paper proposes a framework of matrix variate regression models based on a rank constraint, vector regularization (e.g., sparsity), and a general loss function with three special cases considered: ordinary matrix regression, robust matrix regression, and matrix logistic regression. We also propose an alternating projected gradient descent algorithm. Based on analyzing our objective functions on manifolds with bounded curvature, we show that the algorithm is guaranteed to converge, all accumulation points of the iterates have estimation errors in the order of $O(1/\sqrt{n})$ asymptotically and substantially attaining the minimax rate. Our theoretical analysis can be applied to general optimization problems on manifolds with bounded curvature and can be considered an important technical contribution to this work. We validate the proposed method through simulation studies and real image data examples.

preprint2022arXiv

Vertical Federated Principal Component Analysis and Its Kernel Extension on Feature-wise Distributed Data

Despite enormous research interest and rapid application of federated learning (FL) to various areas, existing studies mostly focus on supervised federated learning under the horizontally partitioned local dataset setting. This paper will study the unsupervised FL under the vertically partitioned dataset setting. Accordingly, we propose the federated principal component analysis for vertically partitioned dataset (VFedPCA) method, which reduces the dimensionality across the joint datasets over all the clients and extracts the principal component feature information for downstream data analysis. We further take advantage of the nonlinear dimensionality reduction and propose the vertical federated advanced kernel principal component analysis (VFedAKPCA) method, which can effectively and collaboratively model the nonlinear nature existing in many real datasets. In addition, we study two communication topologies. The first is a server-client topology where a semi-trusted server coordinates the federated training, while the second is the fully-decentralized topology which further eliminates the requirement of the server by allowing clients themselves to communicate with their neighbors. Extensive experiments conducted on five types of real-world datasets corroborate the efficacy of VFedPCA and VFedAKPCA under the vertically partitioned FL setting. Code is available at: https://github.com/juyongjiang/VFedPCA-VFedAKPCA

preprint2021arXiv

Graph Contrastive Learning with Adaptive Augmentation

Recently, contrastive learning (CL) has emerged as a successful method for unsupervised graph representation learning. Most graph CL methods first perform stochastic augmentation on the input graph to obtain two graph views and maximize the agreement of representations in the two views. Despite the prosperous development of graph CL methods, the design of graph augmentation schemes -- a crucial component in CL -- remains rarely explored. We argue that the data augmentation schemes should preserve intrinsic structures and attributes of graphs, which will force the model to learn representations that are insensitive to perturbation on unimportant nodes and edges. However, most existing methods adopt uniform data augmentation schemes, like uniformly dropping edges and uniformly shuffling features, leading to suboptimal performance. In this paper, we propose a novel graph contrastive representation learning method with adaptive augmentation that incorporates various priors for topological and semantic aspects of the graph. Specifically, on the topology level, we design augmentation schemes based on node centrality measures to highlight important connective structures. On the node attribute level, we corrupt node features by adding more noise to unimportant node features, to enforce the model to recognize underlying semantic information. We perform extensive experiments of node classification on a variety of real-world datasets. Experimental results demonstrate that our proposed method consistently outperforms existing state-of-the-art baselines and even surpasses some supervised counterparts, which validates the effectiveness of the proposed contrastive framework with adaptive augmentation.

preprint2020arXiv

CAGNN: Cluster-Aware Graph Neural Networks for Unsupervised Graph Representation Learning

Unsupervised graph representation learning aims to learn low-dimensional node embeddings without supervision while preserving graph topological structures and node attributive features. Previous graph neural networks (GNN) require a large number of labeled nodes, which may not be accessible in real-world graph data. In this paper, we present a novel cluster-aware graph neural network (CAGNN) model for unsupervised graph representation learning using self-supervised techniques. In CAGNN, we perform clustering on the node embeddings and update the model parameters by predicting the cluster assignments. Moreover, we observe that graphs often contain inter-class edges, which mislead the GNN model to aggregate noisy information from neighborhood nodes. We further refine the graph topology by strengthening intra-class edges and reducing node connections between different classes based on cluster labels, which better preserves cluster structures in the embedding space. We conduct comprehensive experiments on two benchmark tasks using real-world datasets. The results demonstrate the superior performance of the proposed model over existing baseline methods. Notably, our model gains over 7% improvements in terms of accuracy on node clustering over state-of-the-arts.

preprint2020arXiv

Deep Graph Contrastive Representation Learning

Graph representation learning nowadays becomes fundamental in analyzing graph-structured data. Inspired by recent success of contrastive methods, in this paper, we propose a novel framework for unsupervised graph representation learning by leveraging a contrastive objective at the node level. Specifically, we generate two graph views by corruption and learn node representations by maximizing the agreement of node representations in these two views. To provide diverse node contexts for the contrastive objective, we propose a hybrid scheme for generating graph views on both structure and attribute levels. Besides, we provide theoretical justification behind our motivation from two perspectives, mutual information and the classical triplet loss. We perform empirical experiments on both transductive and inductive learning tasks using a variety of real-world datasets. Experimental experiments demonstrate that despite its simplicity, our proposed method consistently outperforms existing state-of-the-art methods by large margins. Moreover, our unsupervised method even surpasses its supervised counterparts on transductive tasks, demonstrating its great potential in real-world applications.

preprint2020arXiv

TAGNN: Target Attentive Graph Neural Networks for Session-based Recommendation

Session-based recommendation nowadays plays a vital role in many websites, which aims to predict users' actions based on anonymous sessions. There have emerged many studies that model a session as a sequence or a graph via investigating temporal transitions of items in a session. However, these methods compress a session into one fixed representation vector without considering the target items to be predicted. The fixed vector will restrict the representation ability of the recommender model, considering the diversity of target items and users' interests. In this paper, we propose a novel target attentive graph neural network (TAGNN) model for session-based recommendation. In TAGNN, target-aware attention adaptively activates different user interests with respect to varied target items. The learned interest representation vector varies with different target items, greatly improving the expressiveness of the model. Moreover, TAGNN harnesses the power of graph neural networks to capture rich item transitions in sessions. Comprehensive experiments conducted on real-world datasets demonstrate its superiority over state-of-the-art methods.

preprint2020arXiv

TFNet: Multi-Semantic Feature Interaction for CTR Prediction

The CTR (Click-Through Rate) prediction plays a central role in the domain of computational advertising and recommender systems. There exists several kinds of methods proposed in this field, such as Logistic Regression (LR), Factorization Machines (FM) and deep learning based methods like Wide&Deep, Neural Factorization Machines (NFM) and DeepFM. However, such approaches generally use the vector-product of each pair of features, which have ignored the different semantic spaces of the feature interactions. In this paper, we propose a novel Tensor-based Feature interaction Network (TFNet) model, which introduces an operating tensor to elaborate feature interactions via multi-slice matrices in multiple semantic spaces. Extensive offline and online experiments show that TFNet: 1) outperforms the competitive compared methods on the typical Criteo and Avazu datasets; 2) achieves large improvement of revenue and click rate in online A/B tests in the largest Chinese App recommender system, Tencent MyApp.

preprint2016arXiv

A Prudent-Precedence Concurrency Control Protocol for High Data Contention Database Enviornments

In this paper, we propose a concurrency control protocol, called the Prudent-Precedence Concurrency Control (PPCC) protocol, for high data contention database environments. PPCC is prudently more aggressive in permitting more serializable schedules than two-phase locking. It maintains a restricted precedence among conflicting transactions and commits the transactions according to the serialization order established in the executions. A detailed simulation model has been constructed and extensive experiments have been conducted to evaluate the performance of the proposed approach. The results demonstrate that the proposed algorithm outperforms the two-phase locking and optimistic concurrency control in all ranges of system workload.

preprint2016arXiv

ICE: Information Credibility Evaluation on Social Media via Representation Learning

With the rapid growth of social media, rumors are also spreading widely on social media and bring harm to people's daily life. Nowadays, information credibility evaluation has drawn attention from academic and industrial communities. Current methods mainly focus on feature engineering and achieve some success. However, feature engineering based methods require a lot of labor and cannot fully reveal the underlying relations among data. In our viewpoint, the key elements of user behaviors for evaluating credibility are concluded as "who", "what", "when", and "how". These existing methods cannot model the correlation among different key elements during the spreading of microblogs. In this paper, we propose a novel representation learning method, Information Credibility Evaluation (ICE), to learn representations of information credibility on social media. In ICE, latent representations are learnt for modeling user credibility, behavior types, temporal properties, and comment attitudes. The aggregation of these factors in the microblog spreading process yields the representation of a user's behavior, and the aggregation of these dynamic representations generates the credibility representation of an event spreading on social media. Moreover, a pairwise learning method is applied to maximize the credibility difference between rumors and non-rumors. To evaluate the performance of ICE, we conduct experiments on a Sina Weibo data set, and the experimental results show that our ICE model outperforms the state-of-the-art methods.

preprint2016arXiv

Stationary and time periodic patterns of two-predator and one-prey systems with prey-taxis

This paper concerns pattern formation in a class of reaction-advection-diffusion systems modeling the population dynamics of two predators and one prey. We consider the biological situation that both predators forage along the population density gradient of the preys which can defend themselves as a group. We prove the global existence and uniform boundedness of positive classical solutions for the fully parabolic system over a bounded domain with space dimension $N=1,2$ and for the parabolic- -parabolic-elliptic system over higher space dimensions. Linearized stability analysis shows that prey-taxis stabilizes the positive constant equilibrium if there is no group defense while it destabilizes the equilibrium otherwise. Then we obtain stationary and time-periodic nontrivial solutions of the system that bifurcate from the positive constant equilibrium. Moreover, the stability of these solutions is also analyzed in detail which provides a wave mode selection mechanism of nontrivial patterns for this strongly coupled system. Finally, we perform numerical simulations to illustrate and support our theoretical results.

preprint2013arXiv

Computing Traversal Times on Dynamic Markovian Paths

In source routing, a complete path is chosen for a packet to travel from source to destination. While computing the time to traverse such a path may be straightforward in a fixed, static graph, doing so becomes much more challenging in dynamic graphs, in which the state of an edge in one time slot (i.e., its presence or absence) is random, and may depend on its state in the previous time step. The traversal time is due to both time spent waiting for edges to appear and time spent crossing them once they become available. We compute the expected traversal time (ETT) for a dynamic path in a number of special cases of stochastic edge dynamics models, and for three edge failure models, culminating in a surprisingly challenging yet realistic setting in which the initial configuration of edge states for the entire path is known. We show that the ETT for this "initial configuration" setting can be computed in quadratic time, by an algorithm based on probability generating functions. We also give several linear-time upper and lower bounds on the ETT.

preprint2013arXiv

Conditioning the logistic branching process on non-extinction

We consider a birth and death process in which death is due to both `natural death' and to competition between individuals, modelled as a quadratic function of population size. The resulting `logistic branching process' has been proposed as a model for numbers of individuals in populations competing for some resource, or for numbers of species. However, because of the quadratic death rate, even if the intrinsic growth rate is positive, the population will, with probability one, die out in finite time. There is considerable interest in understanding the process conditioned on non-extinction. In this paper, we exploit a connection with the ancestral selection graph of population genetics to find expressions for the transition rates in the logistic branching process conditioned on survival until some fixed time $T$, in terms of the distribution of a certain one-dimensional diffusion process at time $T$. We also find the probability generating function of the Yaglom distribution of the process and rather explicit expressions for the transition rates for the so-called Q-process, that is the logistic branching process conditioned to stay alive into the indefinite future. For this process, one can write down the joint generator of the (time-reversed) total population size and what in population genetics would be called the `genealogy' and in phylogenetics would be called the `reconstructed tree' of a sample from the population. We explore some ramifications of these calculations numerically.

preprint2010arXiv

Asymptotic behavior of the rate of adaptation

We consider the accumulation of beneficial and deleterious mutations in large asexual populations. The rate of adaptation is affected by the total mutation rate, proportion of beneficial mutations and population size $N$. We show that regardless of mutation rates, as long as the proportion of beneficial mutations is strictly positive, the adaptation rate is at least $\mathcal{O}(\log^{1-δ}N)$ where $δ$ can be any small positive number, if the population size is sufficiently large. This shows that if the genome is modeled as continuous, there is no limit to natural selection, that is, the rate of adaptation grows in $N$ without bound.

preprint2007arXiv

Stationary distributions of a model of sympatric speciation

This paper deals with a model of sympatric speciation, that is, speciation in the absence of geographical separation, originally proposed by U. Dieckmann and M. Doebeli in 1999. We modify their original model to obtain a Fleming--Viot type model and study its stationary distribution. We show that speciation may occur, that is, the stationary distribution puts most of the mass on a configuration that does not concentrate on the phenotype with maximum carrying capacity, if competition between phenotypes is intense enough. Conversely, if competition between phenotypes is not intense, then speciation will not occur and most of the population will have the phenotype with the highest carrying capacity. The length of time it takes speciation to occur also has a delicate dependence on the mutation parameter, and the exact shape of the carrying capacity function and the competition kernel.

preprint1993arXiv

Many Boson Realizations of Universal Nonlinear $W_{\infty}$-Algebras

An infinite number of free field realizations of the universal nonlinear $\hat{W}_{\infty}^{(N)}$ ($\hat{W}_{1+\infty}^{(N)}$) algebras, which are identical to the KP Hamiltonian structures, are obtained in terms of $p$ plus $q$ scalars of different signatures with $p-q=N$. They are generalizations of the Miura transformation, and naturally give rise to the modified KP hierarchies via corresponding realizations of the latter. Their characteristic Lie-algebraic origin is shown to be the graded $SL(p,q)$.

preprint1992arXiv

An Infinite Number of Commuting Quantum $\hat{W}_{\infty}$ Charges in the $SL(2,R)/U(1)$ Coset Model

The conformal non-compact $SL(2,R)/U(1)$ coset model in two dimensions has been recently shown to embody a nonlinear $\hat{W}_\infty$ current algebra, consisting of currents of spin $\geq 2$ including the energy-momentum tensor. In this letter we explicitly construct an infinite set of commuting quantum $\hat{W}_\infty$ charges in the model with $k=1$. These commuting quantum charges generate a set of infinitely many compatible flows (quantum KP flows), which maintain the nonlinear $\hat{W}_\infty$ current algebra invariant.

Feng Yu

What is connected

Connect this record

See the researcher in context

Building this map preview

20 published item(s)

SymphonyGen: 3D Hierarchical Orchestral Generation with Controllable Harmony Skeleton

Hyperparameter Estimation for Sparse Bayesian Learning Models

MusicAOG: an Energy-Based Model for Learning and Sampling a Hierarchical Representation of Symbolic Music

Non-Abelian braiding on photonic chips

Robust Regularized Low-Rank Matrix Models for Regression and Classification

Vertical Federated Principal Component Analysis and Its Kernel Extension on Feature-wise Distributed Data

Graph Contrastive Learning with Adaptive Augmentation

CAGNN: Cluster-Aware Graph Neural Networks for Unsupervised Graph Representation Learning

Deep Graph Contrastive Representation Learning

TAGNN: Target Attentive Graph Neural Networks for Session-based Recommendation

TFNet: Multi-Semantic Feature Interaction for CTR Prediction

A Prudent-Precedence Concurrency Control Protocol for High Data Contention Database Enviornments

ICE: Information Credibility Evaluation on Social Media via Representation Learning

Stationary and time periodic patterns of two-predator and one-prey systems with prey-taxis

Computing Traversal Times on Dynamic Markovian Paths

Conditioning the logistic branching process on non-extinction

Asymptotic behavior of the rate of adaptation

Stationary distributions of a model of sympatric speciation

Many Boson Realizations of Universal Nonlinear $W_{\infty}$-Algebras

An Infinite Number of Commuting Quantum $\hat{W}_{\infty}$ Charges in the $SL(2,R)/U(1)$ Coset Model