Researcher profile

André Fujita

André Fujita contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 15 - UnverifiedVerification L1Unclaimed author
3works
0followers
6topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

3 published item(s)

preprint2020arXiv

Stage I non-small cell lung cancer stratification by using a model-based clustering algorithm with covariates

Lung cancer is currently the leading cause of cancer deaths. Among various subtypes, the number of patients diagnosed with stage I non-small cell lung cancer (NSCLC), particularly adenocarcinoma, has been increasing. It is estimated that 30 - 40\% of stage I patients will relapse, and 10 - 30\% will die due to recurrence, clearly suggesting the presence of a subgroup that could be benefited by additional therapy. We hypothesize that current attempts to identify stage I NSCLC subgroup failed due to covariate effects, such as the age at diagnosis and differentiation, which may be masking the results. In this context, to stratify stage I NSCLC, we propose CEM-Co, a model-based clustering algorithm that removes/minimizes the effects of undesirable covariates during the clustering process. We applied CEM-Co on a gene expression data set composed of 129 subjects diagnosed with stage I NSCLC and successfully identified a subgroup with a significantly different phenotype (poor prognosis), while standard clustering algorithms failed.

preprint2013arXiv

A statistical test to identify differences in clustering structures

Statistical inference on functional magnetic resonance imaging (fMRI) data is an important task in brain imaging. One major hypothesis is that the presence or not of a psychiatric disorder can be explained by the differential clustering of neurons in the brain. In view of this fact, it is clearly of interest to address the question of whether the properties of the clusters have changed between groups of patients and controls. The normal method of approaching group differences in brain imaging is to carry out a voxel-wise univariate analysis for a difference between the mean group responses using an appropriate test (e.g. a t-test) and to assemble the resulting "significantly different voxels" into clusters, testing again at cluster level. In this approach of course, the primary voxel-level test is blind to any cluster structure. Direct assessments of differences between groups (or reproducibility within groups) at the cluster level have been rare in brain imaging. For this reason, we introduce a novel statistical test called ANOCVA - ANalysis Of Cluster structure Variability, which statistically tests whether two or more populations are equally clustered using specific features. The proposed method allows us to compare the clustering structure of multiple groups simultaneously, and also to identify features that contribute to the differential clustering. We illustrate the performance of ANOCVA through simulations and an application to an fMRI data set composed of children with ADHD and controls. Results show that there are several differences in the brain's clustering structure between them, corroborating the hypothesis in the literature. Furthermore, we identified some brain regions previously not described, generating new hypothesis to be tested empirically.

preprint2012arXiv

Discriminating different classes of biological networks by analyzing the graphs spectra distribution

The brain's structural and functional systems, protein-protein interaction, and gene networks are examples of biological systems that share some features of complex networks, such as highly connected nodes, modularity, and small-world topology. Recent studies indicate that some pathologies present topological network alterations relative to norms seen in the general population. Therefore, methods to discriminate the processes that generate the different classes of networks (e.g., normal and disease) might be crucial for the diagnosis, prognosis, and treatment of the disease. It is known that several topological properties of a network (graph) can be described by the distribution of the spectrum of its adjacency matrix. Moreover, large networks generated by the same random process have the same spectrum distribution, allowing us to use it as a "fingerprint". Based on this relationship, we introduce and propose the entropy of a graph spectrum to measure the "uncertainty" of a random graph and the Kullback-Leibler and Jensen-Shannon divergences between graph spectra to compare networks. We also introduce general methods for model selection and network model parameter estimation, as well as a statistical procedure to test the nullity of divergence between two classes of complex networks. Finally, we demonstrate the usefulness of the proposed methods by applying them on (1) protein-protein interaction networks of different species and (2) on networks derived from children diagnosed with Attention Deficit Hyperactivity Disorder (ADHD) and typically developing children. We conclude that scale-free networks best describe all the protein-protein interactions. Also, we show that our proposed measures succeeded in the identification of topological changes in the network while other commonly used measures (number of edges, clustering coefficient, average path length) failed.