Source author record

André Fujita

André Fujita appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Methodology Quantitative Methods Applications Machine Learning Neurons and Cognition physics.soc-ph Social and Information Networks

Catalog footprint

What is connected

4works

7topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2020arXiv

Stage I non-small cell lung cancer stratification by using a model-based clustering algorithm with covariates

Lung cancer is currently the leading cause of cancer deaths. Among various subtypes, the number of patients diagnosed with stage I non-small cell lung cancer (NSCLC), particularly adenocarcinoma, has been increasing. It is estimated that 30 - 40\% of stage I patients will relapse, and 10 - 30\% will die due to recurrence, clearly suggesting the presence of a subgroup that could be benefited by additional therapy. We hypothesize that current attempts to identify stage I NSCLC subgroup failed due to covariate effects, such as the age at diagnosis and differentiation, which may be masking the results. In this context, to stratify stage I NSCLC, we propose CEM-Co, a model-based clustering algorithm that removes/minimizes the effects of undesirable covariates during the clustering process. We applied CEM-Co on a gene expression data set composed of 129 subjects diagnosed with stage I NSCLC and successfully identified a subgroup with a significantly different phenotype (poor prognosis), while standard clustering algorithms failed.

preprint2015arXiv

Correlation between graphs with an application to brain networks analysis

The global functional brain network (graph) is more suitable for characterizing brain states than local analysis of the connectivity of brain regions. Therefore, graph-theoretic approaches are the natural methods to study the brain. However, conventional graph theoretical analyses are limited due to the lack of formal statistical methods for estimation and inference for random graphs. For example, the concept of correlation between two vectors of graphs is yet not defined. The aim of this article to introduce a notion of correlation between graphs. In order to develop a framework to infer correlation between graphs, we assume that they are generated by mathematical models and that the parameters of the models are our random variables. Then, we define that two vectors of graphs are independent whether their parameters are independent. The problem is that, in real world, the model is rarely known, and consequently, the parameters cannot be estimated. By analyzing the graph spectrum, we showed that the spectral radius is highly associated with the parameters of the graph model. Based on it, we constructed a framework for correlation inference between graphs and illustrate our approach in a functional magnetic resonance imaging data composed of 814 subjects comprising 529 controls and 285 individuals diagnosed with autism spectrum disorder (ASD). Results show that correlations between default-mode and control, default-mode and somatomotor, and default-mode and visual sub-networks are higher ($p<0.05$) in ASD than in controls.

preprint2013arXiv

A statistical test to identify differences in clustering structures

Statistical inference on functional magnetic resonance imaging (fMRI) data is an important task in brain imaging. One major hypothesis is that the presence or not of a psychiatric disorder can be explained by the differential clustering of neurons in the brain. In view of this fact, it is clearly of interest to address the question of whether the properties of the clusters have changed between groups of patients and controls. The normal method of approaching group differences in brain imaging is to carry out a voxel-wise univariate analysis for a difference between the mean group responses using an appropriate test (e.g. a t-test) and to assemble the resulting "significantly different voxels" into clusters, testing again at cluster level. In this approach of course, the primary voxel-level test is blind to any cluster structure. Direct assessments of differences between groups (or reproducibility within groups) at the cluster level have been rare in brain imaging. For this reason, we introduce a novel statistical test called ANOCVA - ANalysis Of Cluster structure Variability, which statistically tests whether two or more populations are equally clustered using specific features. The proposed method allows us to compare the clustering structure of multiple groups simultaneously, and also to identify features that contribute to the differential clustering. We illustrate the performance of ANOCVA through simulations and an application to an fMRI data set composed of children with ADHD and controls. Results show that there are several differences in the brain's clustering structure between them, corroborating the hypothesis in the literature. Furthermore, we identified some brain regions previously not described, generating new hypothesis to be tested empirically.

preprint2012arXiv

Discriminating different classes of biological networks by analyzing the graphs spectra distribution

The brain's structural and functional systems, protein-protein interaction, and gene networks are examples of biological systems that share some features of complex networks, such as highly connected nodes, modularity, and small-world topology. Recent studies indicate that some pathologies present topological network alterations relative to norms seen in the general population. Therefore, methods to discriminate the processes that generate the different classes of networks (e.g., normal and disease) might be crucial for the diagnosis, prognosis, and treatment of the disease. It is known that several topological properties of a network (graph) can be described by the distribution of the spectrum of its adjacency matrix. Moreover, large networks generated by the same random process have the same spectrum distribution, allowing us to use it as a "fingerprint". Based on this relationship, we introduce and propose the entropy of a graph spectrum to measure the "uncertainty" of a random graph and the Kullback-Leibler and Jensen-Shannon divergences between graph spectra to compare networks. We also introduce general methods for model selection and network model parameter estimation, as well as a statistical procedure to test the nullity of divergence between two classes of complex networks. Finally, we demonstrate the usefulness of the proposed methods by applying them on (1) protein-protein interaction networks of different species and (2) on networks derived from children diagnosed with Attention Deficit Hyperactivity Disorder (ADHD) and typically developing children. We conclude that scale-free networks best describe all the protein-protein interactions. Also, we show that our proposed measures succeeded in the identification of topological changes in the network while other commonly used measures (number of edges, clustering coefficient, average path length) failed.