Source author record

César A. Hidalgo

César A. Hidalgo appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher (unclaimed source record)

Catalog footprint

What is connected

5 works
5 topics
4 close collaborators

Published work

5 published items

Preprint, 2022, arXiv

Knowledge is non-fungible

What would you do if you were asked to "add" knowledge? Would you say that "one plus one knowledge" is two "knowledges"? Less than that? More? Or something in between? Adding knowledge sounds strange, but it brings to the forefront questions that are as fundamental as they are eclectic. These are questions about the nature of knowledge and about the use of mathematics to model reality. In this chapter, I explore the mathematics of adding knowledge starting from what I believe is an overlooked but key observation: the idea that knowledge is non-fungible.

Preprint, 2016, arXiv

Deep Learning the City: Quantifying Urban Perception At A Global Scale

Computer vision methods that quantify the perception of urban environments are increasingly being used to study the relationship between a city's physical appearance and the behavior and health of its residents. Yet, the throughput of current methods is too limited to quantify the perception of cities across the world. To tackle this challenge, we introduce a new crowdsourced dataset containing 110,988 images from 56 cities, and 1,170,000 pairwise comparisons provided by 81,630 online volunteers along six perceptual attributes: safe, lively, boring, wealthy, depressing, and beautiful. Using this data, we train a Siamese-like convolutional neural architecture, which learns from a joint classification and ranking loss, to predict human judgments of pairwise image comparisons. Our results show that crowdsourcing combined with neural networks can produce urban perception data at the global scale.
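As a rough illustration of the ranking half of a "joint classification and ranking loss", the sketch below computes a pairwise hinge loss over image comparisons. It is not the authors' loss. The hinge form, the `margin` value, the function names, and the toy scores are all assumptions made for illustration, and the scores stand in for the output of a scoring network.

```python
def pairwise_hinge_loss(score_winner, score_loser, margin=1.0):
    """Penalty for one comparison: zero once the winner outscores the loser by `margin`."""
    return max(0.0, margin - (score_winner - score_loser))


def mean_ranking_loss(scores, comparisons, margin=1.0):
    """Average hinge loss over (winner, loser) pairs of image identifiers."""
    losses = [
        pairwise_hinge_loss(scores[winner], scores[loser], margin)
        for winner, loser in comparisons
    ]
    return sum(losses) / len(losses)


# Hypothetical per-image scores for one attribute (e.g. "safe").
scores = {"img_a": 2.0, "img_b": 0.5, "img_c": 1.0}

# Each tuple is (image judged higher, image judged lower) by a volunteer.
comparisons = [("img_a", "img_b"), ("img_c", "img_a"), ("img_b", "img_c")]

loss = mean_ranking_loss(scores, comparisons)
print(loss)
```

Only comparisons whose predicted order disagrees with the human judgment, or agrees by less than the margin, contribute to the loss.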

Preprint, 2016, arXiv

Pantheon 1.0, a manually verified dataset of globally famous biographies

We present the Pantheon 1.0 dataset: a manually verified dataset of individuals that have transcended linguistic, temporal, and geographic boundaries. The Pantheon 1.0 dataset includes the 11,341 biographies present in more than 25 languages in Wikipedia and is enriched with: (i) manually verified demographic information (place and date of birth, gender), (ii) a taxonomy of occupations classifying each biography at three levels of aggregation, and (iii) two measures of global popularity, including the number of languages in which a biography is present in Wikipedia (L), and the Historical Popularity Index (HPI), a metric that combines information on L, time since birth, and page-views (2008-2013). We compare the Pantheon 1.0 dataset to data from the 2003 book Human Accomplishment, and also to external measures of accomplishment in individual games and sports: Tennis, Swimming, Car Racing, and Chess. In all of these cases we find that measures of popularity (L and HPI) correlate highly with individual accomplishment, suggesting that measures of global popularity proxy the historical impact of individuals.

Preprint, 2016, arXiv

The amenity space and the evolution of neighborhoods

Neighborhoods populated by amenities, such as restaurants, cafes, and libraries, are considered to be a key property of desirable cities. Yet, despite the global enthusiasm for amenity-rich neighborhoods, little is known about the empirical laws governing the colocation of amenities at the neighborhood scale. Here, we contribute to our understanding of the naturally occurring neighborhood-scale agglomerations of amenities observed in cities by using a dataset summarizing the precise location of millions of amenities. We use this dataset to build the network of co-location of amenities, or Amenity Space, by first introducing a clustering algorithm to identify neighborhoods, and then using the identified neighborhoods to map the probability that two amenities will be co-located in one of them. Finally, we use the Amenity Space to build a recommender system that identifies the amenities that are missing in a neighborhood given its current pattern of specialization. This opens the door for the construction of amenity recommendation algorithms that can be used to evaluate neighborhoods and inform their improvement and development.

Preprint, 2016, arXiv

The Research Space: using the career paths of scholars to predict the evolution of the research output of individuals, institutions, and nations

In recent years, scholars have built maps of science by connecting the academic fields that cite each other, are cited together, or that cite a similar literature. But since scholars cannot always publish in the fields they cite, or that cite them, these science maps are only rough proxies for the potential of a scholar, organization, or country to enter a new academic field. Here we use a large dataset of scholarly publications disambiguated at the individual level to create a map of science, or research space, where links connect pairs of fields based on the probability that an individual has published in both of them. We find that the research space is a significantly more accurate predictor of the fields that individuals and organizations will enter in the future than citation-based science maps. At the country level, however, the research space and citation-based science maps are equally accurate. These findings show that data on career trajectories (the set of fields that individuals have previously published in) provide more accurate predictors of future research output for more focalized units, such as individuals or organizations, than citation-based science maps.
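The idea of linking fields by the probability that one individual has published in both can be sketched as follows. The abstract does not give the paper's exact formula. This example assumes a simple proximity measure, the smaller of the two conditional probabilities P(a|b) and P(b|a), and the function name, author identifiers, and field names are hypothetical.

```python
from collections import Counter
from itertools import combinations


def field_proximity(author_fields):
    """Map each field pair to min(P(a|b), P(b|a)), estimated from author publication sets.

    author_fields: dict mapping an author id to the set of fields they published in.
    """
    field_count = Counter()  # number of authors active in each field
    pair_count = Counter()   # number of authors active in both fields of a pair
    for fields in author_fields.values():
        field_count.update(fields)
        for a, b in combinations(sorted(fields), 2):
            pair_count[(a, b)] += 1

    proximity = {}
    for (a, b), n_ab in pair_count.items():
        # min(n_ab / n_a, n_ab / n_b) equals n_ab / max(n_a, n_b)
        proximity[(a, b)] = n_ab / max(field_count[a], field_count[b])
    return proximity


# Toy data: four hypothetical authors and the fields they have published in.
authors = {
    "author_1": {"physics", "economics"},
    "author_2": {"physics", "economics", "urban studies"},
    "author_3": {"physics"},
    "author_4": {"economics", "urban studies"},
}

prox = field_proximity(authors)
for pair, value in sorted(prox.items()):
    print(pair, round(value, 3))
```

Pair keys are stored in alphabetical order, so each link appears once. Pairs that no author shares are absent from the result rather than given a zero weight.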