Source author record

Cheng Ye

Cheng Ye appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

physics.soc-ph Machine Learning Social and Information Networks Artificial Intelligence Biomolecules cond-mat.stat-mech gr-qc hep-ph q-fin.ST Quantitative Methods

Catalog footprint

What is connected

6works

10topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2022arXiv

A Knowledge Graph-Enhanced Tensor Factorisation Model for Discovering Drug Targets

The drug discovery and development process is a long and expensive one, costing over 1 billion USD on average per drug and taking 10-15 years. To reduce the high levels of attrition throughout the process, there has been a growing interest in applying machine learning methodologies to various stages of drug discovery and development in the recent decade, especially at the earliest stage identification of druggable disease genes. In this paper, we have developed a new tensor factorisation model to predict potential drug targets (genes or proteins) for treating diseases. We created a three dimensional data tensor consisting of 1,048 gene targets, 860 diseases and 230,011 evidence attributes and clinical outcomes connecting them, using data extracted from the Open Targets and PharmaProjects databases. We enriched the data with gene target representations learned from a drug discovery oriented knowledge graph and applied our proposed method to predict the clinical outcomes for unseen gene target and disease pairs. We designed three evaluation strategies to measure the prediction performance and benchmarked several commonly used machine learning classifiers together with Bayesian matrix and tensor factorisation methods. The result shows that incorporating knowledge graph embeddings significantly improves the prediction accuracy and that training tensor factorisation alongside a dense neural network outperforms all other baselines. In summary, our framework combines two actively studied machine learning approaches to disease target identification, namely tensor factorisation and knowledge graph representation learning, which could be a promising avenue for further exploration in data driven drug discovery.

preprint2022arXiv

The analogy of the Lorentz-violating fermion-gravity and fermion photon couplings

By adopting a methodology proposed by R.J. Adler \etl, we study the interesting analogy between the fermion-gravity and the fermion-electromagnetic interactions in the presence of the minimal Lorentz-violating (LV) fermion coefficients. The one-fermion matrix elements of gravitational interaction (OMEGI) are obtained with a prescribed Lense-Thirring (LT) metric assuming test particle assumption. Quite distinct from the extensively studied linear gravitational potential, the LT metric is an essentially curved metric, and thus reveals the anomalous LV matter-gravity couplings as a manifestation of the so-called gravito-magnetic effects, which go beyond the conventional equivalence principle predictions. By collecting all the spin-dependent operators from the OMEGI with some reasonable assumptions, we get a LV non-relativistic Hamiltonian, from which we derive the anomalous spin precession and gravitational acceleration due to LV. Combined these results with certain spin gravity experiments, we get some rough bounds on several LV parameters, such as $|3\vec{\tilde{H}}-2\vec{b}|\leq1.46\times10^{-5}\mathrm{eV}$, with some ad hoc assumptions.

preprint2022arXiv

Understanding the Performance of Knowledge Graph Embeddings in Drug Discovery

Knowledge Graphs (KG) and associated Knowledge Graph Embedding (KGE) models have recently begun to be explored in the context of drug discovery and have the potential to assist in key challenges such as target identification. In the drug discovery domain, KGs can be employed as part of a process which can result in lab-based experiments being performed, or impact on other decisions, incurring significant time and financial costs and most importantly, ultimately influencing patient healthcare. For KGE models to have impact in this domain, a better understanding of not only of performance, but also the various factors which determine it, is required. In this study we investigate, over the course of many thousands of experiments, the predictive performance of five KGE models on two public drug discovery-oriented KGs. Our goal is not to focus on the best overall model or configuration, instead we take a deeper look at how performance can be affected by changes in the training setup, choice of hyperparameters, model parameter initialisation seed and different splits of the datasets. Our results highlight that these factors have significant impact on performance and can even affect the ranking of models. Indeed these factors should be reported along with model architectures to ensure complete reproducibility and fair comparisons of future work, and we argue this is critical for the acceptance of use, and impact of KGEs in a biomedical setting.

preprint2015arXiv

Modular Dynamics of Financial Market Networks

The financial market is a complex dynamical system composed of a large variety of intricate relationships between several entities, such as banks, corporations and institutions. At the heart of the system lies the stock exchange mechanism, which establishes a time-evolving network of trades among companies and individuals. Such network can be inferred through correlations between time series of companies stock prices, allowing the overall system to be characterized by techniques borrowed from network science. Here we study the presence of communities in the inferred stock market network, and show that the knowledge about the communities alone can provide a nearly complete representation of the system topology. This is done by defining a simple null model, a randomized version of the studied network sharing only the sizes and interconnectivity between communities observed. We show that many topological characteristics of the inferred networks are carried over the networks generated by the null model. In particular, we find that in periods of instability, such as during a financial crisis, the network strays away from a state of well-defined community structure to a much more uniform topological organization. We show that the framework presented here provides a good null model representation of topological variations taking place in the market during crises. Also, the general approach used in this work can be extended to other systems.

preprint2015arXiv

Thermodynamic characterization of networks using graph polynomials

In this paper, we present a method for characterizing the evolution of time-varying complex networks by adopting a thermodynamic representation of network structure computed from a polynomial (or algebraic) characterization of graph structure. Commencing from a representation of graph structure based on a characteristic polynomial computed from the normalized Laplacian matrix, we show how the polynomial is linked to the Boltzmann partition function of a network. This allows us to compute a number of thermodynamic quantities for the network, including the average energy and entropy. Assuming that the system does not change volume, we can also compute the temperature, defined as the rate of change of entropy with energy. All three thermodynamic variables can be approximated using low-order Taylor series that can be computed using the traces of powers of the Laplacian matrix, avoiding explicit computation of the normalized Laplacian spectrum. These polynomial approximations allow a smoothed representation of the evolution of networks to be constructed in the thermodynamic space spanned by entropy, energy, and temperature. We show how these thermodynamic variables can be computed in terms of simple network characteristics, e.g., the total number of nodes and node degree statistics for nodes connected by edges. We apply the resulting thermodynamic characterization to real-world time-varying networks representing complex systems in the financial and biological domains. The study demonstrates that the method provides an efficient tool for detecting abrupt changes and characterizing different stages in network evolution.

preprint2014arXiv

Concentric Network Symmetry

Quantification of symmetries in complex networks is typically done globally in terms of automorphisms. Extending previous methods to locally assess the symmetry of nodes is not straightforward. Here we present a new framework to quantify the symmetries around nodes, which we call connectivity patterns. We develop two topological transformations that allow a concise characterization of the different types of symmetry appearing on networks and apply these concepts to six network models, namely the Erdős-Rényi, Barabási-Albert, random geometric graph, Waxman, Voronoi and rewired Voronoi. Real-world networks, namely the scientific areas of Wikipedia, the world-wide airport network and the street networks of Oldenburg and San Joaquin, are also analyzed in terms of the proposed symmetry measurements. Several interesting results emerge from this analysis, including the high symmetry exhibited by the Erdős-Rényi model. Additionally, we found that the proposed measurements present low correlation with other traditional metrics, such as node degree and betweenness centrality. Principal component analysis is used to combine all the results, revealing that the concepts presented here have substantial potential to also characterize networks at a global scale.

Cheng Ye

What is connected

Connect this record

See the researcher in context

Building this map preview

6 published item(s)

A Knowledge Graph-Enhanced Tensor Factorisation Model for Discovering Drug Targets

The analogy of the Lorentz-violating fermion-gravity and fermion photon couplings

Understanding the Performance of Knowledge Graph Embeddings in Drug Discovery

Modular Dynamics of Financial Market Networks

Thermodynamic characterization of networks using graph polynomials

Concentric Network Symmetry