Source author record

Kevin Hu

Kevin Hu appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher · Unclaimed source record

Catalog footprint

What is connected

2 works
4 topics
4 close collaborators

Published work

2 published items

Preprint · 2016 · arXiv

Pantheon 1.0, a manually verified dataset of globally famous biographies

We present the Pantheon 1.0 dataset: a manually verified dataset of individuals that have transcended linguistic, temporal, and geographic boundaries. The Pantheon 1.0 dataset includes the 11,341 biographies present in more than 25 languages in Wikipedia and is enriched with: (i) manually verified demographic information (place and date of birth, gender), (ii) a taxonomy of occupations classifying each biography at three levels of aggregation, and (iii) two measures of global popularity, including the number of languages in which a biography is present in Wikipedia (L) and the Historical Popularity Index (HPI), a metric that combines information on L, time since birth, and page-views (2008-2013). We compare the Pantheon 1.0 dataset to data from the 2003 book, Human Accomplishments, and also to external measures of accomplishment in individual games and sports: Tennis, Swimming, Car Racing, and Chess. In all of these cases we find that measures of popularity (L and HPI) correlate highly with individual accomplishment, suggesting that measures of global popularity proxy the historical impact of individuals.

Preprint · 2013 · arXiv

Characteristic Direction Approach to Identify Differentially Expressed Genes

Genome-wide gene expression profiles, as measured with microarrays or RNA-Seq experiments, have revolutionized biological and biomedical research by providing a quantitative measure of the entire mRNA transcriptome. Typically, researchers set up experiments where control samples are compared to a treatment condition, and, using the t-test, they identify differentially expressed genes upon which further analysis and ultimately biological discovery from such experiments is based. Here we describe an alternative geometrical approach to identify differentially expressed genes. We show that this alternative method, called the Characteristic Direction, is capable of identifying more relevant genes. We evaluate our approach in three case studies. In the first two, we match transcription factor targets determined by ChIP-seq profiling with differentially expressed genes after the same transcription factor knockdown or over-expression in mammalian cells. In the third case study, we evaluate the quality of enriched terms when comparing normal epithelial cells with cancer stem cells. In conclusion, we demonstrate that the Characteristic Direction approach is much better at calling the significantly differentially expressed genes and should replace the currently widely used t-test method for this purpose. Implementations of the method in MATLAB, Python and Mathematica are available at: http://www.maayanlab.net/CD.
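
For readers unfamiliar with the t-test baseline the abstract refers to, the following is a minimal sketch of per-gene ranking by Welch's t statistic between control and treatment samples. It illustrates only the baseline, not the Characteristic Direction method, whose details are not given in this record. The function names, the gene labels, and the toy expression values are hypothetical and chosen purely for illustration.

```python
from math import sqrt
from statistics import mean, variance


def welch_t(control, treatment):
    """Welch's t statistic for one gene (treatment minus control).

    Assumes each group has at least two samples and non-zero variance.
    """
    n_c, n_t = len(control), len(treatment)
    standard_error = sqrt(variance(control) / n_c + variance(treatment) / n_t)
    return (mean(treatment) - mean(control)) / standard_error


def rank_genes_by_t(expression):
    """Rank genes by absolute t statistic, largest first.

    `expression` maps a gene name to a (control values, treatment values) pair.
    """
    scores = {gene: welch_t(c, t) for gene, (c, t) in expression.items()}
    return sorted(scores.items(), key=lambda item: abs(item[1]), reverse=True)


# Made-up toy data: three genes, three samples per condition.
toy_expression = {
    "GENE_A": ([5.0, 5.2, 4.9], [8.1, 7.9, 8.3]),  # strongly up in treatment
    "GENE_B": ([3.0, 3.4, 2.8], [3.1, 3.3, 2.9]),  # essentially unchanged
    "GENE_C": ([6.0, 6.1, 5.9], [4.0, 4.2, 3.8]),  # strongly down in treatment
}

ranked = rank_genes_by_t(toy_expression)
for gene, t_stat in ranked:
    print(f"{gene}\t{t_stat:.2f}")
```

Each gene is scored independently here, which is the property of the t-test workflow that a geometrical, whole-profile approach would be expected to differ on.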