Source author record

Ying Fan

Ying Fan appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher · Unclaimed source record

physics.soc-ph, Social and Information Networks, cond-mat.stat-mech, hep-ph, physics.data-an, hep-ex, Digital Libraries, Machine Learning, nlin.CD, physics.comp-ph, Applications, Artificial Intelligence, Computation and Language, Computer Vision, cond-mat.dis-nn, Information Retrieval

Catalog footprint

What is connected

32 works

16 topics

4 close collaborators

preprint · 2022 · arXiv

Academic mentees succeed in big groups, but thrive in small groups

Mentoring is a key component of scientific achievements, contributing to overall measures of career success for mentees and mentors. A common success metric in the scientific enterprise is acquiring a large research group, which is believed to indicate excellent mentorship and high-quality research. However, large, competitive groups might also amplify dropout rates, which are high especially among early career researchers. Here, we collect longitudinal genealogical data on mentor-mentee relations and their publication, and study the effects of a mentor's group on future academic survival and performance of their mentees. We find that mentees trained in large groups generally have better academic performance than mentees from small groups, if they continue working in academia after graduation. However, we also find two surprising results: Academic survival rate is significantly lower for (1) mentees from larger groups, and for (2) mentees with more productive mentors. These findings reveal that success of mentors has a negative effect on the academic survival rate of mentees, raising important questions about the definition of successful mentorship and providing actionable suggestions concerning career development.

preprint · 2022 · arXiv

Impactful scientists have higher tendency to involve collaborators in new topics

In scientific research, collaboration is one of the most effective ways to take advantage of new ideas, skills and resources, and to perform interdisciplinary research. Although collaboration networks have been intensively studied, the question of how individual scientists choose collaborators to study a new research topic remains almost unexplored. Here, we investigate the statistics and mechanisms of collaborations of individual scientists along their careers, revealing that, in general, collaborators are involved in significantly fewer topics than expected from a controlled surrogate. In particular, we find that highly productive scientists tend to have a higher fraction of single-topic collaborators, while highly cited, i.e., impactful, scientists have a higher fraction of multi-topic collaborators. We also suggest a plausible mechanism for this distinction. Moreover, we investigate the cases where scientists involve existing collaborators in a new topic. We find that, compared to productive scientists, impactful scientists show a strong preference for collaborating with high-impact scientists on a new topic. Finally, we validate our findings by investigating active scientists in different years and across different disciplines.

preprint · 2022 · arXiv

POEM: Out-of-Distribution Detection with Posterior Sampling

Out-of-distribution (OOD) detection is indispensable for machine learning models deployed in the open world. Recently, the use of an auxiliary outlier dataset during training (also known as outlier exposure) has shown promising performance. As the sample space for potential OOD data can be prohibitively large, sampling informative outliers is essential. In this work, we propose a novel posterior sampling-based outlier mining framework, POEM, which facilitates efficient use of outlier data and promotes learning a compact decision boundary between ID and OOD data for improved detection. We show that POEM establishes state-of-the-art performance on common benchmarks. Compared to the current best method that uses a greedy sampling strategy, POEM improves the relative performance by 42.0% and 24.2% (FPR95) on CIFAR-10 and CIFAR-100, respectively. We further provide theoretical insights on the effectiveness of POEM for OOD detection.
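
The FPR95 figure quoted above is the false-positive rate on OOD data at the score threshold that still accepts about 95% of in-distribution (ID) data. A minimal sketch of that metric follows, assuming a higher score means "more in-distribution"; the function names and the toy numbers are illustrative and do not come from the POEM paper. The helper `relative_reduction` shows one common reading of a "relative improvement" for a lower-is-better metric.

```python
def fpr_at_95_tpr(id_scores, ood_scores):
    """False-positive rate on OOD data at the threshold that keeps ~95% of ID data.

    Assumes a higher score means "more in-distribution".
    """
    ranked = sorted(id_scores, reverse=True)
    # Smallest number of top-ranked ID samples covering at least 95% of them
    # (integer ceiling, to avoid floating-point rounding surprises).
    keep = (95 * len(ranked) + 99) // 100
    threshold = ranked[keep - 1]
    # OOD samples scoring at or above the threshold are wrongly accepted as ID.
    false_positives = sum(1 for s in ood_scores if s >= threshold)
    return false_positives / len(ood_scores)


def relative_reduction(baseline, improved):
    """Relative improvement of a lower-is-better metric such as FPR95."""
    return (baseline - improved) / baseline
```

With ties in the ID scores the threshold may accept slightly more than 95% of ID samples; this sketch does not try to break ties.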

preprint · 2020 · arXiv

A hyperbolic Embedding Model for Directed Networks

Network embedding is an active topic in network science. It rests on the observation that most real complex systems can be embedded in a hidden metric space, where the geometric distance between nodes determines the likelihood that they are linked. Hyperbolic space in particular is associated with the structural organization of many real complex systems and has therefore received extensive attention. However, most recently developed methods and measures rarely take the asymmetry of links into account. Here, we discuss how to multiplex node information as an embedding foundation by identifying the bipartite structure of directed networks, and we propose a general mapping framework that combines the topological structure of complex networks, directed links and the hidden metric space. By splitting the different properties of a node, the possibility of links between different types of nodes can be modeled. We also apply this model to real systems, including international trade networks and C. elegans neural networks. The results confirm that directed networks can also be mapped into a metric space, and that network embedding information can extend the scope of application of existing models.

preprint · 2020 · arXiv

Prediction Model Based on Integrated Political Economy System: The Case of US Presidential Election

This paper studies an integrated system of political and economic systems from a systematic perspective to explore the complex interaction between them, and specifically analyzes the case of US presidential election forecasting. Based on the signed association networks of industrial structure constructed from economic data, our framework simulates the diffusion and evolution of opinions during the election through a kinetic model called the Potts Model. Remarkably, we propose a simple and efficient prediction model for the US presidential election, and meanwhile inspire a new way to model the economic structure. Findings also highlight the close relationship between economic structure and political attitude. Furthermore, the case analysis in terms of network and economy demonstrates the specific features and the interaction between political tendency and industrial structure in a particular period, which is consistent with theories in politics and economics.

preprint · 2020 · arXiv

Search-based User Interest Modeling with Lifelong Sequential Behavior Data for Click-Through Rate Prediction

Rich user behavior data has been proven to be of great value for click-through rate prediction tasks, especially in industrial applications such as recommender systems and online advertising. Both industry and academia have paid much attention to this topic and proposed different approaches to modeling long sequential user behavior data. Among them, the memory-network-based model MIMN, proposed by Alibaba, achieves SOTA with the co-design of both learning algorithm and serving system. MIMN is the first industrial solution that can model sequential user behavior data with length scaling up to 1000. However, MIMN fails to precisely capture user interests given a specific candidate item when the length of the user behavior sequence increases further, say, by 10 times or more. This challenge exists widely in previously proposed approaches. In this paper, we tackle this problem by designing a new modeling paradigm, which we name the Search-based Interest Model (SIM). SIM extracts user interests with two cascaded search units: (i) the General Search Unit acts as a general search over the raw and arbitrarily long sequential behavior data, with query information from the candidate item, and returns a Sub user Behavior Sequence (SBS) which is relevant to the candidate item; (ii) the Exact Search Unit models the precise relationship between the candidate item and the SBS. This cascaded search paradigm gives SIM a better ability to model lifelong sequential behavior data in both scalability and accuracy. Apart from the learning algorithm, we also introduce our hands-on experience on how to implement SIM in large scale industrial systems. Since 2019, SIM has been deployed in the display advertising system in Alibaba, bringing a 7.1% CTR and 4.4% RPM lift, which is significant to the business. Serving the main traffic in our real system now, SIM models user behavior data with maximum length reaching up to 54000, pushing SOTA to 54x.
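
The two cascaded search units can be pictured with a small sketch. Everything below is an assumption made for illustration: the abstract does not say how relevance is decided in the first stage or how the second stage is built, so this version uses a plain category match followed by softmax dot-product attention, and the names `general_search` and `exact_search` are hypothetical rather than the authors' API.

```python
import math


def general_search(behaviors, candidate, top_k):
    """Stage 1: cut a long history down to behaviors relevant to the candidate.

    Relevance is a plain category match here (an assumption for illustration).
    """
    if top_k <= 0:
        return []
    relevant = [b for b in behaviors if b["category"] == candidate["category"]]
    return relevant[-top_k:]  # keep the most recent matches


def exact_search(sub_sequence, candidate):
    """Stage 2: attention-weighted pooling of the short sub-sequence."""
    dim = len(candidate["embedding"])
    if not sub_sequence:
        return [0.0] * dim
    # Dot-product relevance of each retained behavior to the candidate item.
    scores = [sum(x * y for x, y in zip(b["embedding"], candidate["embedding"]))
              for b in sub_sequence]
    peak = max(scores)  # subtract the maximum for a numerically stable softmax
    weights = [math.exp(s - peak) for s in scores]
    total = sum(weights)
    return [sum(w * b["embedding"][i] for w, b in zip(weights, sub_sequence)) / total
            for i in range(dim)]
```

The point of the cascade is cost: the first stage is cheap enough to scan a very long history, so the more expensive second stage only ever sees a short list.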

preprint · 2020 · arXiv

The critical role of fresh teams in creating original and multi-disciplinary research

Teamwork is one of the most prominent features in modern science. It is now well understood that team size is an important factor that affects team creativity. However, the crucial question of how the character of research studies is influenced by the freshness of the team remains unclear. In this paper, we quantify team freshness according to the absence of prior collaboration among team members. Our results suggest that fresher teams tend to produce works of higher originality and more multi-disciplinary impact. These effects are even magnified in larger teams. Furthermore, we find that freshness defined by new team members in a paper is a more effective indicator of research originality and multi-disciplinarity compared to freshness defined by new collaboration relations among team members. Finally, we show that career freshness of members also plays an important role in increasing the originality and multi-disciplinarity of produced papers.
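
The two notions of freshness mentioned above can be made concrete with a small sketch. The exact definitions are assumptions here: a member counts as fresh if they have no prior collaboration with any other member of the team, and a collaboration relation counts as new if that pair has not coauthored before. The function name and data layout are hypothetical.

```python
from itertools import combinations


def team_freshness(team, prior_pairs):
    """Return (member_freshness, link_freshness) for one paper's team.

    team: list of distinct author ids.
    prior_pairs: set of frozensets, one per pair that coauthored earlier.
    """
    pairs = [frozenset(p) for p in combinations(team, 2)]
    if not pairs:
        return 0.0, 0.0  # a single-author paper has no collaboration to be fresh
    new_pairs = [p for p in pairs if p not in prior_pairs]
    # A member is fresh if they never worked with anyone else on this team.
    fresh_members = [a for a in team
                     if all(frozenset((a, b)) not in prior_pairs
                            for b in team if b != a)]
    return len(fresh_members) / len(team), len(new_pairs) / len(pairs)
```

For a team of three in which only two members have worked together before, one member out of three is fresh, while two of the three possible pairs are new.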

preprint · 2014 · arXiv

Characterizing and Modeling the Dynamics of Activity and Popularity

Social media, regarded as two-layer networks consisting of users and items, turn out to be the most important channels for access to massive information in the era of Web 2.0. The dynamics of human activity and item popularity is a crucial issue in social media networks. In this paper, by analyzing the growth of user activity and item popularity in four empirical social media networks, i.e., Amazon, Flickr, Delicious and Wikipedia, it is found that cross links between users and items are more likely to be created by active users and to be acquired by popular items, where user activity and item popularity are measured by the number of cross links associated with users and items. This indicates that users generally trace popular items. However, it is found that inactive users trace popular items more strongly than active users. Inspired by the empirical analysis, we propose an evolving model for such networks, in which the evolution is driven only by a two-step random walk. Numerical experiments verify that the model can qualitatively reproduce the distributions of user activity and item popularity observed in empirical networks. These results might shed light on the micro-dynamics of activity and popularity in social media networks.

preprint · 2014 · arXiv

Reconstructing propagation networks with natural diversity and identifying hidden sources

Our ability to uncover complex network structure and dynamics from data is fundamental to understanding and controlling collective dynamics in complex systems. Despite recent progress in this area, reconstructing networks with stochastic dynamical processes from limited time series remains an outstanding problem. Here we develop a framework based on compressed sensing to reconstruct complex networks on which stochastic spreading dynamics take place. We apply the methodology to a large number of model and real networks, finding that a full reconstruction of inhomogeneous interactions can be achieved from small amounts of polarized (binary) data, a virtue of compressed sensing. Further, we demonstrate that a hidden source that triggers the spreading process but is externally inaccessible can be ascertained and located with high confidence in the absence of direct routes of propagation from it. Our approach thus establishes a paradigm for tracing and controlling epidemic invasion and information diffusion in complex networked systems.

preprint · 2013 · arXiv

Do scientists trace hot topics?

Do scientists follow hot topics in their scientific investigations? In this paper, by analyzing papers published in the American Physical Society (APS) Physical Review journals, it is found that papers are more likely to be attracted by hot fields, where the hotness of a field is measured by the number of papers belonging to the field. This indicates that scientists generally do follow hot topics. However, there are qualitative differences among scientists from various countries, and among research works with different numbers of authors, affiliations and references. These observations could be valuable for policy makers when deciding research funding and also for individual researchers when searching for scientific projects.

preprint · 2013 · arXiv

Efficient learning strategy of Chinese characters based on network approach

Based on network analysis of hierarchical structural relations among Chinese characters, we develop an efficient learning strategy for Chinese characters. We regard a learning method as more efficient if one learns the same number of useful Chinese characters with less effort or time. We construct a node-weighted network of Chinese characters, where character usage frequencies are used as node weights. Using this hierarchical node-weighted network, we propose a new learning method, the distributed node weight (DNW) strategy, which is based on a new measure of nodes' importance that takes into account both the weight of the nodes and the hierarchical structure of the network. Chinese character learning strategies, particularly their learning order, are analyzed as dynamical processes over the network. We compare the efficiency of three theoretical learning methods and two commonly used methods from mainstream Chinese textbooks, one for Chinese elementary school students and the other for students learning Chinese as a second language. We find that the DNW method significantly outperforms the others, implying that the efficiency of current learning methods of major textbooks can be greatly improved.

preprint · 2013 · arXiv

Phase transitions in Ising model induced by weight redistribution on weighted regular networks

In order to investigate the role of the weight in weighted networks, the collective behavior of the Ising system on weighted regular networks is studied by numerical simulation. In our model, the coupling strength between spins is inversely proportional to the corresponding weighted shortest distance. Disordering link weights can effectively affect the process of phase transition even though the underlying binary topological structure remains unchanged. Specifically, based on regular networks with homogeneous weights initially, randomly disordering link weights will change the critical temperature of phase transition. The results suggest that the redistribution of link weights may provide an additional approach to optimize the dynamical behaviors of the system.
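
The coupling rule described above (coupling strength inversely proportional to the weighted shortest distance) can be sketched as follows. Two points are assumptions: each link weight is treated as a length, and a coupling is computed for every connected pair of spins, since the abstract does not say whether only linked pairs interact. The proportionality constant `scale` is likewise hypothetical.

```python
import heapq


def weighted_shortest_distances(adj, source):
    """Dijkstra over adj = {node: {neighbor: link_length}}."""
    dist = {source: 0.0}
    heap = [(0.0, source)]
    while heap:
        d, u = heapq.heappop(heap)
        if d > dist.get(u, float("inf")):
            continue  # stale queue entry, a shorter path was already found
        for v, w in adj[u].items():
            nd = d + w
            if nd < dist.get(v, float("inf")):
                dist[v] = nd
                heapq.heappush(heap, (nd, v))
    return dist


def couplings(adj, scale=1.0):
    """J[(i, j)] = scale / d_ij for every connected pair i != j."""
    J = {}
    for i in adj:
        for j, d in weighted_shortest_distances(adj, i).items():
            if j != i:
                J[(i, j)] = scale / d
    return J
```

Changing link weights while leaving the links themselves in place changes the distances, and therefore the couplings, which is how weight redistribution can affect the transition even though the binary topology is unchanged.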

preprint · 2012 · arXiv

B-meson Semi-inclusive Decay to $2^{-+}$ Charmonium in NRQCD and X(3872)

The semi-inclusive B-meson decay into spin-singlet D-wave $2^{-+}$ charmonium, $B\to η_{c2}+X$, is studied in nonrelativistic QCD (NRQCD). Both color-singlet and color-octet contributions are calculated at next-to-leading order (NLO) in the strong coupling constant $α_s$. The non-perturbative long-distance matrix elements are evaluated using operator evolution equations. It is found that the color-singlet $^1D_2$ contribution is tiny, while the color-octet channels make dominant contributions. The estimated branching ratio $B(B\to η_{c2}+X)$ is about $0.41\times10^{-4}$ in the Naive Dimensional Regularization (NDR) scheme and $1.24\times10^{-4}$ in the 't Hooft-Veltman (HV) scheme, with renormalization scale $μ=m_b=4.8$ GeV. The scheme-sensitivity of these numerical results is due to cancelation between ${}^1S_0^{[8]}$ and ${}^1P_1^{[8]}$ contributions. The $μ$-dependence curves of NLO branching ratios in both schemes are also shown, with $μ$ varying from $\frac{m_b}{2}$ to $2m_b$ and the NRQCD factorization or renormalization scale $μ_Λ$ taken to be $2m_c$. Comparison of the estimated branching ratio of $B\to η_{c2}+X$ with the observed branching ratio of $B \to X(3872)+K$ may lead to the conclusion that X(3872) is unlikely to be the $2^{-+}$ charmonium state $η_{c2}$.

preprint · 2012 · arXiv

Higher-order corrections to exclusive production of charmonia at B factories

As a test of the color-singlet mechanism of the nonrelativistic QCD (NRQCD) factorization approach, we consider the exclusive two-quarkonium productions in electron-positron annihilation e+ e- -> eta_c + gamma and e+ e- -> J/psi + J/psi at B factories. The cross sections are computed to next-to-leading order in alpha_s and are resummed to all orders in half the relative velocity v of the charm quark in each meson rest frame. The available theoretical prediction of the cross section for e+ e- -> J/psi + eta_c at the same level of theoretical accuracy is consistent with the available experimental data. Those for e+ e- -> eta_c + gamma and e+ e- -> J/psi + J/psi, which are newly computed in this work, can be tested against the data from future super B factories.

preprint · 2012 · arXiv

Resummation of relativistic corrections to exclusive productions of charmonia in e+ e- collisions

We investigate two exclusive processes, e+ e- -> eta_c + gamma and e+ e- -> J/psi + J/psi, at the center-of-momentum energy sqrt{s}=10.58 GeV within the framework of the nonrelativistic QCD factorization approach. A class of relativistic corrections is resummed to all orders in the heavy-quark velocity v, and the corrections are large and negative. We further improve the prediction by including available QCD next-to-leading-order corrections and the interference between the QCD and relativistic corrections. The prediction for sigma[e+ e- -> eta_c + gamma] is about 50 fb. In the case of e+ e- -> J/psi + J/psi the standard nonrelativistic QCD prediction for the cross section is negative. As an alternative, the vector-meson-dominance approach is employed to compute the photon-fragmentation contribution of the process, which gives a cross section of ~1 fb. This is an indication that the uncalculated QCD higher-order corrections may be significant. Our results can be tested against the forthcoming data from Belle II and super B factories.

preprint · 2012 · arXiv

Spectral coarse graining for random walk in bipartite networks

Many real-world networks display a natural bipartite structure, yet analyzing or visualizing large bipartite networks is one of the most challenging tasks. As a result, it is necessary to reduce the complexity of large bipartite systems while preserving their functionality. We observe, however, that the existing coarse graining methods for binary networks fail to work in bipartite networks. In this paper, we use spectral analysis to design a coarse graining scheme specifically for bipartite networks that keeps their random walk properties unchanged. Numerical analysis on artificial and real-world bipartite networks indicates that our coarse graining scheme can obtain much smaller networks from large ones while keeping most of the relevant spectral properties. Finally, we further validate the coarse graining method by directly comparing the mean first passage time between the original network and the reduced one.

preprint · 2011 · arXiv

Detecting Important Nodes to Community Structure Using the Spectrum of the Graph

Many complex systems can be represented as networks, and how a network breaks up into subnetworks or communities is of wide interest. However, the development of a method to detect nodes important to communities that is both fast and accurate is a very challenging and open problem. In this manuscript, we introduce a new approach to characterize the node importance to communities. First, a centrality metric is proposed to measure the importance of network nodes to community structure using the spectrum of the adjacency matrix. We define the node importance to communities as the relative change in the eigenvalues of the network adjacency matrix upon their removal. Second, we also propose an index to distinguish two kinds of important nodes in communities, i.e., "community core" and "bridge". Our indices rely only on the spectrum of the graph matrix. They are applied to many artificial networks as well as many real-world networks. This new methodology gives us a basic approach to solve this challenging problem and provides a realistic result.

preprint · 2011 · arXiv

Detecting the optimal number of communities in complex networks

Obtaining the optimal number of communities is an important problem in detecting community structure. In this paper, we extend the measurement of community detecting algorithms to find the optimal community number. Based on the normalized mutual information index, which has been used as a measure for similarity of communities, a statistic $Ω(c)$ is proposed to detect the optimal number of communities. In general, when $Ω(c)$ reaches its local maximum, especially the first one, the corresponding number of communities $c$ is likely to be optimal in community detection. Moreover, the statistic $Ω(c)$ can also measure the significance of community structures in complex networks, which has received more attention recently. Numerical and empirical results show that the index $Ω(c)$ is effective in both artificial and real world networks.
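
The abstract does not define $Ω(c)$ itself, so the sketch below covers only its stated building block, the normalized mutual information (NMI) between two partitions of the same node set. It uses the common normalization $2I(A;B)/(H(A)+H(B))$; whether the paper uses this exact variant is an assumption.

```python
import math
from collections import Counter


def normalized_mutual_information(labels_a, labels_b):
    """NMI = 2 I(A;B) / (H(A) + H(B)) for two partitions of the same nodes.

    labels_a[i] and labels_b[i] are the community labels of node i.
    """
    n = len(labels_a)
    count_a = Counter(labels_a)
    count_b = Counter(labels_b)
    joint = Counter(zip(labels_a, labels_b))
    # Mutual information from the joint and marginal label frequencies.
    mutual = sum((nab / n) * math.log(nab * n / (count_a[a] * count_b[b]))
                 for (a, b), nab in joint.items())
    entropy_a = -sum((c / n) * math.log(c / n) for c in count_a.values())
    entropy_b = -sum((c / n) * math.log(c / n) for c in count_b.values())
    if entropy_a + entropy_b == 0:
        return 1.0  # both partitions put every node in a single community
    return 2 * mutual / (entropy_a + entropy_b)
```

Two partitions that differ only in label names score 1, and two statistically independent partitions score 0.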

preprint · 2011 · arXiv

Navigation in non-uniform density social networks

Recent empirical investigations suggest a universal scaling law for the spatial structure of social networks. It is found that the probability density distribution of an individual to have a friend at distance $d$ scales as $P(d)\propto d^{-1}$. Since population density is non-uniform in real social networks, a scale invariant friendship network (SIFN) based on the above empirical law is introduced to capture this phenomenon. We prove the time complexity of navigation in 2-dimensional SIFN is at most $O(\log^4 n)$. In real searching experiments, individuals often resort to extra information besides geographical location. Thus, the real-world searching process may be seen as a projection of navigation in a $k$-dimensional SIFN ($k>2$). Therefore, we also discuss the relationship between high and low dimensional SIFN. In particular, we prove a 2-dimensional SIFN is the projection of a 3-dimensional SIFN. As a matter of fact, this result can also be generalized to any $k$-dimensional SIFN.
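
The scaling law $P(d)\propto d^{-1}$ can be sampled directly once it is restricted to a finite range, because its cumulative distribution is $F(d)=\ln(d/d_{\min})/\ln(d_{\max}/d_{\min})$ and inverts to $d=d_{\min}(d_{\max}/d_{\min})^{u}$ for uniform $u$. The sketch below does only that; the cutoffs `d_min` and `d_max` are assumptions needed for normalization, and this is not the SIFN construction itself.

```python
import random


def sample_friend_distance(d_min, d_max, rng=random):
    """Draw d with density P(d) proportional to 1/d on [d_min, d_max].

    Uses inverse-CDF sampling: d = d_min * (d_max / d_min) ** u, u uniform in [0, 1).
    """
    u = rng.random()
    return d_min * (d_max / d_min) ** u
```

A quick sanity check of the law: every decade of distance carries the same probability mass, so with cutoffs 1 and 100 about half of the samples fall below 10.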

preprint · 2011 · arXiv

Onset of Synchronization in Weighted Complex Networks: the Effect of Weight-Degree Correlation

By numerical simulations, we investigate the onset of synchronization of networked phase oscillators under two different weighting schemes. In scheme-I, the link weights are correlated to the product of the degrees of the connected nodes, so this kind of network is called a weight-degree correlated (WDC) network. In scheme-II, the link weights are randomly assigned to each link regardless of the node degrees, so this kind of network is called a weight-degree uncorrelated (WDU) network. Interestingly, it is found that by increasing a parameter that governs the weight distribution, the onset of synchronization in the WDC network is monotonically enhanced, while in the WDU network there is a reversal in the synchronization performance. We investigate this phenomenon from the viewpoint of gradient networks, and explain the contrary roles of the coupling gradient in network synchronization: the gradient promotes synchronization in the WDC network, while it deteriorates synchronization in the WDU network. The findings highlight the fact that, besides the link weight, the correlation between the weight and node degree is also important to the network dynamics.

preprint · 2010 · arXiv

Comment on "Dynamics and Directionality in Complex Networks"

Authors of Phys. Rev. Lett. 103, 228702 (2009) claim that "The residual degree gradient (RDG) method can enhance the synchronizability of networks by simply changing the direction of the links". In this paper, we argue that in some cases the RDG method will lead to the failure of synchronization ($R=λ^{r}_{2}/λ^{r}_{N}=0$). Additionally, we also propose a so-called residual betweenness gradient (RBG) method to solve this problem.

preprint · 2010 · arXiv

Dynamics on Spatial Networks and the Effect of Distance Coarse Graining

Very recently, a kind of spatial network constructed with a power-law distance distribution and a total energy constraint was proposed. Moreover, it has been pointed out that such spatial networks have optimal exponents $δ$ in the power-law distance distribution for the average shortest path, traffic dynamics and navigation. Because distance is only estimated approximately in the real world, we present a distance coarse graining procedure to generate binary spatial networks in this paper. We find that the distance coarse graining procedure shifts the optimal exponents $δ$. Interestingly, when the network is large enough, the effect of distance coarse graining eventually becomes negligible. Additionally, we study some main dynamic processes, including traffic dynamics, navigation, synchronization and percolation, on these spatial networks with coarse-grained distance. The results can guide the enhancement of specific functions of spatial networks.

preprint · 2010 · arXiv

Emergence of Global Preferential Attachment From Local Interaction

Global degree/strength based preferential attachment is widely used as an evolution mechanism of networks. But it is hard to believe that any individual can get global information and shape the network architecture based on it. In this paper, it is found that global preferential attachment emerges from local interaction models, including the distance-dependent preferential attachment (DDPA) evolving model of weighted networks (M. Li et al, New Journal of Physics 8 (2006) 72), the acquaintance network model (J. Davidsen et al, Phys. Rev. Lett. 88 (2002) 128701) and the connecting nearest-neighbor (CNN) model (A. Vazquez, Phys. Rev. E 67 (2003) 056104). For the DDPA model and the CNN model, the attachment rate depends linearly on the degree or strength, while for the acquaintance network model, the dependence follows a sublinear power law. This implies that for the evolution of social networks, local contact could be more fundamental than the presumed global preferential attachment. This is consistent with the result observed in the evolution of empirical email networks.

preprint · 2010 · arXiv

Enhancing synchronization by directionality in complex networks

We propose a method called residual edge-betweenness gradient (REBG) to enhance the synchronizability of networks by assigning link directions while keeping network topology and link weights unchanged. Direction assignment has been shown to improve the synchronizability of undirected networks in general, but we find that in some cases incommunicable components emerge and networks fail to synchronize. We show that the REBG method can effectively avoid the synchronization failure ($R=λ_{2}^{r}/λ_{N}^{r}=0$) which occurs in the residual degree gradient (RDG) method proposed in Phys. Rev. Lett. 103, 228702 (2009). Further experiments show that the REBG method enhances synchronizability in networks with community structure as compared with the RDG method.

preprint · 2010 · arXiv

How to Measure Significance of Community Structure in Complex Networks

Community structure analysis is a powerful tool for complex networks, which can simplify their functional analysis considerably. Recently, many approaches have been proposed for community structure detection, but few works have focused on the significance of community structure. Since real networks obtained from complex systems always contain error links, and most community detection algorithms have random factors, evaluating the significance of community structure is important and urgent. In this paper, we use the eigenvectors' stability to characterize the significance of community structures. By employing the eigenvalues of the Laplacian matrix of a given network, we can evaluate the significance of its community structure and obtain the optimal number of communities, which are always hard for community detection algorithms. We apply our method to many real networks. We find that significant community structures exist in many social networks and the C. elegans neural network, and that less significant community structures appear in protein-interaction networks and metabolic networks. Our method can be applied to broad clustering problems in data mining due to its solid mathematical basis and efficiency.

preprint · 2010 · arXiv

NRQCD Predictions of D-Wave Quarkonia $^3D_{J}(J=1,2,3)$ Decay into Light Hadrons at Order $α_{s}^{3}$

In this paper, in the framework of NRQCD we study the light hadron (LH) decays of the spin-triplet (S=1) D-wave heavy quarkonia. The short distance coefficients of all Fock states in the $^3D_J(J=1,2,3)$ quarkonia including D-wave color-singlet, P-wave color-octet and S-wave color-singlet and color-octet are calculated perturbatively at $α_{s}^3$ order. The operator evolution equations of the four-fermion operators are also derived and are used to estimate the numerical values of the long distance matrix elements. We find that for the $c\bar{c}$ system, the LH decay widths of $ψ(1^3D_J)$ predicted by NRQCD are about $2\sim3$ times larger than the phenomenological potential model results, while for the $b\bar{b}$ system the two theoretical estimates of $Γ(Υ(1^3D_J)\to LH)$ are consistent with each other. Our predictions for $ψ(1^3D_J)$ LH decay widths are $Γ(ψ(1^3D_J)\to LH)=(0.43,0.05,0.17)$ MeV for J=1,2,3; and for $Υ(1^3D_J)$, $Γ(Υ(1^3D_J)\to LH)=(6.91,0.75,2.75)$ keV for J=1,2,3.

preprint · 2010 · arXiv

Relativistic correction to $e^{+}e^{-}\to J/ψ+gg$ at $B$ factories and constraint on color-octet matrix elements

We calculate the relativistic correction to $J/ψ$ production in the color-singlet process $e^{+}e^{-}\to J/ψ+gg$ at B-factories. We employ the non-relativistic QCD (NRQCD) factorization approach, where the short-distance coefficients are calculated perturbatively and the long-distance matrix elements are extracted from the decays of $J/ψ$ into $e^{+}e^{-}$ and light hadrons. We find that the $O(v^2)$ relativistic correction can enhance the cross section by a factor of 20-30%, comparable to the enhancement due to the $O(α_s)$ radiative correction obtained earlier. Combining the relativistic correction with the QCD radiative correction, we find that the color-singlet contribution to $e^{+}e^{-}\to J/ψ+gg$ can saturate the latest observed cross section $σ(e^{+}e^{-}\to J/ψ+X_{\mathrm{non-c\bar{c}}})=0.43 \pm0.09\pm0.09$ pb by Belle, thus leaving little room to the color-octet contributions. This gives a very stringent constraint on the color-octet contribution, and may imply that the values of color-octet matrix elements are much smaller than expected earlier by using the naive velocity scaling rules or extracted from fitting experimental data with the leading-order calculations.

preprint · 2010 · arXiv

The attack tolerance of community structure in complex networks

Robustness is an important property of complex networks. Up to now, much research has focused on network robustness in terms of the error and attack tolerance of a network's connectivity and shortest paths. In this paper, the error and attack tolerance of a network's community structure are studied by randomly and purposely perturbing the interactions of networks. Two purposeful perturbation methods are designed: one is based on the clustering coefficient and the other attacks triangles. The dissimilarity function D is used to quantify the changes of community structure, and the modularity Q is used to quantify the significance of community structure. The numerical results show that after perturbation, a network's community structure is damaged and becomes less clear. It is also found that purposeful attacks damage the community structure more than random attacks.
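
The modularity Q used above to quantify the significance of community structure has a standard form, $Q=\sum_c\left[L_c/m-(d_c/2m)^2\right]$, where $m$ is the number of edges, $L_c$ the number of edges inside community $c$ and $d_c$ the total degree of its nodes. The sketch below computes it for an undirected, unweighted graph with at least one edge; the function name and input layout are illustrative, and the paper's dissimilarity function D is not reproduced because the abstract does not define it.

```python
def modularity(edges, community):
    """Newman-Girvan modularity Q of an undirected, unweighted graph.

    edges: list of (u, v) pairs; community: dict mapping node -> community label.
    """
    m = len(edges)
    internal = {}    # number of edges with both ends inside each community
    degree_sum = {}  # total degree of the nodes in each community
    for u, v in edges:
        cu, cv = community[u], community[v]
        degree_sum[cu] = degree_sum.get(cu, 0) + 1
        degree_sum[cv] = degree_sum.get(cv, 0) + 1
        if cu == cv:
            internal[cu] = internal.get(cu, 0) + 1
    return sum(internal.get(c, 0) / m - (d / (2 * m)) ** 2
               for c, d in degree_sum.items())
```

For two triangles joined by a single bridge, with each triangle as a community, this gives Q = 5/14. Rewiring one triangle edge into a second bridge lowers Q, which is the kind of damage to community structure that a triangle-targeting perturbation would produce.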

preprint2009arXiv

Measuring Significance of Community Structure in Complex Networks

Many complex systems can be represented as networks, and separating a network into communities can simplify the functional analysis considerably. Many approaches have recently been proposed for finding communities, but none of them can definitively evaluate whether the communities found are significant or trivial. In this paper, we propose an index to evaluate the significance of communities in networks. The index is based on comparing the original community structure of the network with the community structure of the network after it is perturbed, and is defined by integrating all the similarities. Many artificial networks and real-world networks are tested. The results show that the index is independent of the size of the network and the number of communities. Moreover, we find that clear communities always exist in social networks, but we do not find significant communities in protein interaction networks or metabolic networks.

preprint2009arXiv

Scaling properties in spatial networks and its effects on topology and traffic dynamics

Empirical studies on the spatial structures of several real transport networks reveal that the distance distribution in these networks obeys a power law. To discuss the influence of the power-law exponent on the network's structure and function, a spatial network model is proposed. Starting from a regular network and subject to a limited cost $C$, long-range connections are added with the power-law distance distribution $P(r)=ar^{-δ}$. Some basic topological properties of the network for different $δ$ are studied. It is found that the network has the smallest average shortest path when $δ=2$. A traffic model on this network is then investigated. It is found that the network with $δ=1.5$ is best for the traffic process. These results deepen our understanding of the relationship between spatial structure and network function.
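The construction described above can be sketched in a few lines. This is only a minimal illustration under stated assumptions, not the authors' model: the regular base network is taken to be a one-dimensional ring, the cost of a shortcut is taken to be its ring length $r$, and the function name `build_spatial_network` and all parameter values are hypothetical.

```python
import random


def build_spatial_network(n, delta, total_cost, seed=0):
    """Add shortcuts of length r ~ P(r) proportional to r**(-delta) to a ring
    of n >= 4 nodes until the length budget total_cost is (nearly) used up."""
    rng = random.Random(seed)
    lengths = list(range(2, n // 2 + 1))       # allowed shortcut lengths
    weights = [r ** -delta for r in lengths]   # unnormalized power-law weights
    # Regular base network (assumption): each node is linked to its ring neighbor.
    edges = {tuple(sorted((i, (i + 1) % n))) for i in range(n)}
    shortcuts = []
    spent = 0
    attempts = 0
    # Stop when even the shortest shortcut no longer fits, or after many retries.
    while total_cost - spent >= lengths[0] and attempts < 100 * n:
        attempts += 1
        r = rng.choices(lengths, weights=weights)[0]
        if spent + r > total_cost:
            continue                           # too expensive, draw again
        i = rng.randrange(n)
        edge = tuple(sorted((i, (i + r) % n)))
        if edge in edges:
            continue                           # avoid duplicate links
        edges.add(edge)
        shortcuts.append((edge, r))
        spent += r
    return edges, shortcuts, spent


edges, shortcuts, spent = build_spatial_network(n=100, delta=2.0, total_cost=50)
```

Larger $δ$ favors short shortcuts, so the same budget buys many short links; smaller $δ$ spends the budget on fewer, longer links. Comparing average shortest paths across $δ$ would require a separate path-length routine, which is omitted here.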

preprint2007arXiv

Community Detecting By Signaling on Complex Networks

Based on a signaling process on complex networks, a method for identifying community structure is proposed. For a network with $n$ nodes, every node is assumed to be a system that can send, receive, and record signals. Each node is taken as the initial signal source once and excites the whole network by exciting its neighbors; the source node is then assigned an $n$-dimensional vector that records the effects of the signaling process. Through this process, the topological relationship of nodes on the network is transferred into the geometrical structure of vectors in $n$-dimensional Euclidean space. The best partition into groups is then determined by the $F$-statistic, and the final community structure is given by the fuzzy $C$-means clustering method (FCM). This method can detect community structure in both unweighted and weighted networks without any extra parameters. It has been applied to ad hoc networks and some real networks, including the Zachary Karate Club network and the football team network. The results are compared with those of other approaches, and the evidence indicates that the algorithm based on the signaling process is effective.
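One plausible reading of the signaling step is sketched below. It assumes that at every step each node keeps its signals and passes a copy to each neighbor, so that after `steps` rounds the signal counts are given by powers of $I + A$, where $A$ is the adjacency matrix. The propagation rule, the number of steps, the normalization, and the function names are assumptions for illustration and are not stated in the abstract. The clustering stage ($F$-statistic and FCM) is not shown; the sketch only compares Euclidean distances between the resulting vectors.

```python
def signal_vectors(adj, steps=3):
    """Return one normalized n-dimensional vector per source node.

    adj: symmetric 0/1 adjacency matrix as a list of lists.
    Assumed rule: in each step a node keeps its signals and sends a copy to
    every neighbor, i.e. the one-step propagation matrix is I + A.
    """
    n = len(adj)
    step = [[adj[i][j] + (1 if i == j else 0) for j in range(n)] for i in range(n)]
    # counts[s][j] = number of signals at node j when node s is the only source.
    counts = [[1 if i == j else 0 for j in range(n)] for i in range(n)]
    for _ in range(steps):
        counts = [
            [sum(row[k] * step[k][j] for k in range(n)) for j in range(n)]
            for row in counts
        ]
    # Normalize each vector so that sources with many signals stay comparable.
    return [[x / sum(row) for x in row] for row in counts]


def distance(u, v):
    """Euclidean distance between two vectors of equal length."""
    return sum((a - b) ** 2 for a, b in zip(u, v)) ** 0.5


# Hypothetical toy graph: two triangles {0, 1, 2} and {3, 4, 5} joined by edge 2-3.
links = [(0, 1), (0, 2), (1, 2), (3, 4), (3, 5), (4, 5), (2, 3)]
adj = [[0] * 6 for _ in range(6)]
for u, v in links:
    adj[u][v] = adj[v][u] = 1

vectors = signal_vectors(adj)
same_group = distance(vectors[0], vectors[1])   # nodes in the same triangle
cross_group = distance(vectors[0], vectors[4])  # nodes in different triangles
```

On this toy graph nodes in the same triangle end up with closer vectors than nodes in different triangles, which is the geometric structure a clustering method could then exploit.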

preprint2006arXiv

The Role of Weight on Community Structure of Networks

The role of weight in weighted networks is investigated by studying the effect of weight on community structures. We use the weighted modularity $Q^w$ to evaluate partitions and the Weighted Extremal Optimization algorithm to detect communities. Starting from idealized and empirical weighted networks, the distribution of weights, or the matching between weights and edges, is disturbed. Using the dissimilarity function $D$ to measure the difference between community structures, it is found that the redistribution of weights strongly affects the community structure, especially in dense networks. This indicates that community structure is a suitable property for reflecting the role of weight.
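For reference, the commonly used definition of weighted modularity is given below. The abstract does not spell out the formula, so this is assumed to be the form the paper uses; here $w_{ij}$ is the weight of the edge between nodes $i$ and $j$, $s_i=\sum_j w_{ij}$ is the strength of node $i$, $W=\frac{1}{2}\sum_{ij} w_{ij}$ is the total edge weight, and $c_i$ is the community of node $i$.

$$Q^w=\frac{1}{2W}\sum_{ij}\left[w_{ij}-\frac{s_i s_j}{2W}\right]δ(c_i,c_j)$$

When every edge has weight 1, $s_i$ reduces to the degree of node $i$ and $Q^w$ reduces to the unweighted modularity $Q$.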

32 published item(s)

Academic mentees succeed in big groups, but thrive in small groups

Impactful scientists have higher tendency to involve collaborators in new topics

POEM: Out-of-Distribution Detection with Posterior Sampling

A hyperbolic Embedding Model for Directed Networks

Prediction Model Based on Integrated Political Economy System: The Case of US Presidential Election

Search-based User Interest Modeling with Lifelong Sequential Behavior Data for Click-Through Rate Prediction

The critical role of fresh teams in creating original and multi-disciplinary research

Characterizing and Modeling the Dynamics of Activity and Popularity

Reconstructing propagation networks with natural diversity and identifying hidden sources

Do scientists trace hot topics?

Efficient learning strategy of Chinese characters based on network approach

Phase transitions in Ising model induced by weight redistribution on weighted regular networks

B-meson Semi-inclusive Decay to $2^{-+}$ Charmonium in NRQCD and X(3872)

Higher-order corrections to exclusive production of charmonia at B factories

Resummation of relativistic corrections to exclusive productions of charmonia in e+ e- collisions

Spectral coarse graining for random walk in bipartite networks

Detecting Important Nodes to Community Structure Using the Spectrum of the Graph

Detecting the optimal number of communities in complex networks

Navigation in non-uniform density social networks

Onset of Synchronization in Weighted Complex Networks: the Effect of Weight-Degree Correlation

Comment on "Dynamics and Directionality in Complex Networks"

Dynamics on Spatial Networks and the Effect of Distance Coarse Graining

Emergence of Global Preferential Attachment From Local Interaction

Enhancing synchronization by directionality in complex networks

How to Measure Significance of Community Structure in Complex Networks

NRQCD Predictions of D-Wave Quarkonia $^3D_{J}(J=1,2,3)$ Decay into Light Hadrons at Order $α_{s}^{3}$

Relativistic correction to $e^{+}e^{-}\to J/ψ+gg$ at $B$ factories and constraint on color-octet matrix elements

The attack tolerance of community structure in complex networks

Measuring Significance of Community Structure in Complex Networks

Scaling properties in spatial networks and its effects on topology and traffic dynamics

Community Detecting By Signaling on Complex Networks

The Role of Weight on Community Structure of Networks