Researcher profile

Alex Rodriguez

Alex Rodriguez contributes to research discovery and scholarly infrastructure.

Researcher
Affiliation not imported
Open to collaborate

Trust snapshot

Quick read

Trust 15 - Unverified
Verification L1
Unclaimed author
3 works
0 followers
5 topics
4 close collaborators

Published work

3 published items

preprint, 2022, arXiv

High Dimensional Fluctuations in Liquid Water: Combining Chemical Intuition with Unsupervised Learning

The microscopic description of the local structure of water remains an open challenge. Here, we adopt an agnostic approach to understanding water's hydrogen bond network using data harvested from molecular dynamics simulations of an empirical water model. A battery of state-of-the-art unsupervised data-science techniques is used to characterize the free energy landscape of water, starting from encoding the water environment using local-atomic descriptors, through dimensionality reduction, and finally the use of advanced clustering techniques. Analysis of the free energy at ambient conditions was found to be consistent with a rough single basin and independent of the choice of the water model. We find that the fluctuations of the water network occur in a high-dimensional space, which we characterize using a combination of both atomic descriptors and chemical-intuition based coordinates. We demonstrate that a combination of both types of variables is needed in order to adequately capture the complexity of the fluctuations in the hydrogen bond network at different length-scales, both at room temperature and close to the critical point of water. Our results provide a general framework for examining fluctuations in water under different conditions.

preprint, 2021, arXiv

Automatic topography of high-dimensional data sets by non-parametric Density Peak clustering

Data analysis in high-dimensional spaces aims at obtaining a synthetic description of a data set, revealing its main structure and its salient features. We here introduce an approach providing this description in the form of a topography of the data, namely a human-readable chart of the probability density from which the data are harvested. The approach is based on an unsupervised extension of Density Peak clustering and a non-parametric density estimator that measures the probability density in the manifold containing the data. This allows finding automatically the number and the height of the peaks of the probability density, and the depth of the "valleys" separating them. Importantly, the density estimator provides a measure of the error, which allows distinguishing genuine density peaks from density fluctuations due to finite sampling. The approach thus provides robust and visual information about the density peaks' height, their statistical reliability, and their hierarchical organization, offering a conceptually powerful extension of the standard clustering partitions. We show that this framework is particularly useful in the analysis of complex data sets.
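
For readers unfamiliar with the underlying method, the following is a minimal sketch of the basic Density Peak idea that the abstract builds on. It is not the approach of the preprint: it assumes a simple fixed-cutoff neighbor count as the density, takes the number of clusters as an input, and performs no statistical test of the peaks, whereas the abstract describes a non-parametric density estimator with error estimates and automatic peak detection. The names `density_peak_clusters`, `make_two_blobs`, `cutoff`, and `n_clusters` are hypothetical and chosen only for illustration.

```python
import math
import random


def density_peak_clusters(points, cutoff, n_clusters):
    """Basic Density Peak clustering with a fixed-cutoff density (illustrative only)."""
    n = len(points)
    dist = [[math.dist(p, q) for q in points] for p in points]

    # Local density rho_i: number of other points closer than the cutoff
    # (the -1 removes the point itself, which is at distance 0).
    rho = [sum(d < cutoff for d in row) - 1 for row in dist]

    # Points sorted by decreasing density; the sort is stable, so ties keep index order.
    order = sorted(range(n), key=lambda i: -rho[i])

    # delta_i: distance to the nearest point that comes earlier in the density order.
    delta = [0.0] * n
    parent = [None] * n
    delta[order[0]] = max(dist[order[0]])  # convention for the densest point
    for rank in range(1, n):
        i = order[rank]
        j = min(order[:rank], key=lambda k: dist[i][k])
        delta[i], parent[i] = dist[i][j], j

    # Assumption: pick centers as the points with the largest rho * delta.
    centers = sorted(order, key=lambda i: rho[i] * delta[i], reverse=True)[:n_clusters]

    labels = [None] * n
    for label, i in enumerate(centers):
        labels[i] = label
    # Every other point inherits the label of its nearest denser neighbor.
    for i in order:
        if labels[i] is None:
            labels[i] = labels[parent[i]]
    return labels


def make_two_blobs(n_per_blob, seed=0):
    """Hypothetical demo data: two well-separated 2D Gaussian blobs."""
    rng = random.Random(seed)
    blob_a = [(rng.gauss(0, 0.3), rng.gauss(0, 0.3)) for _ in range(n_per_blob)]
    blob_b = [(rng.gauss(5, 0.3), rng.gauss(5, 0.3)) for _ in range(n_per_blob)]
    return blob_a + blob_b


if __name__ == "__main__":
    labels = density_peak_clusters(make_two_blobs(40), cutoff=0.5, n_clusters=2)
    print(labels)
```

On well-separated data such as the demo blobs, each blob should receive a single label. The fixed `cutoff` and the preset `n_clusters` are exactly the kind of manual choices that the abstract says its approach replaces with automatic, error-aware estimates.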

preprint, 2021, arXiv

Unsupervised learning universal critical behavior via the intrinsic dimension

The identification of universal properties from minimally processed data sets is one goal of machine learning techniques applied to statistical physics. Here, we study how the minimum number of variables needed to accurately describe the important features of a data set, the intrinsic dimension ($I_d$), behaves in the vicinity of phase transitions. We employ state-of-the-art nearest neighbors-based $I_d$-estimators to compute the $I_d$ of raw Monte Carlo thermal configurations across different phase transitions: first-order, second-order and Berezinskii-Kosterlitz-Thouless. For all the considered cases, we find that the $I_d$ uniquely characterizes the transition regime. The finite-size analysis of the $I_d$ not only identifies critical points with an accuracy comparable to methods that rely on a priori identification of order parameters, but also determines the corresponding (critical) exponent $ν$ in the case of continuous transitions. For the case of topological transitions, this analysis overcomes the reported limitations affecting other unsupervised learning methods. Our work reveals how raw data sets display unique signatures of universal behavior in the absence of any dimensional reduction scheme, and suggests a direct parallelism between conventional order parameters in real space and the intrinsic dimension in the data space.
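
As an illustration of what a nearest-neighbor-based intrinsic dimension estimate can look like, here is a minimal sketch of a two-nearest-neighbor ratio estimator in its maximum-likelihood form. The abstract does not say which estimators were used, so this choice is an assumption; the function names and the synthetic test data are hypothetical. The idea is that, for locally uniform density, the ratio of each point's second to first neighbor distance follows a Pareto law whose exponent is the intrinsic dimension.

```python
import math
import random


def two_nn_intrinsic_dimension(points):
    """Maximum-likelihood intrinsic dimension from 2nd/1st neighbor distance ratios."""
    log_ratios = []
    for i, p in enumerate(points):
        r1 = r2 = math.inf  # distances to the nearest and second-nearest neighbors
        for j, q in enumerate(points):
            if i == j:
                continue
            d = math.dist(p, q)
            if d < r1:
                r1, r2 = d, r1
            elif d < r2:
                r2 = d
        # Skip exact duplicates (r1 == 0) and degenerate inputs with < 3 points.
        if r1 > 0 and math.isfinite(r2):
            log_ratios.append(math.log(r2 / r1))
    # MLE of the Pareto exponent: d = N / sum(log(r2 / r1)).
    return len(log_ratios) / sum(log_ratios)


def sample_embedded_plane(n, seed=0):
    """Hypothetical test data: a uniform 2D sheet linearly embedded in 5D."""
    rng = random.Random(seed)
    samples = []
    for _ in range(n):
        u, v = rng.random(), rng.random()
        samples.append((u, v, u + v, u - v, 0.5 * u))
    return samples


if __name__ == "__main__":
    estimate = two_nn_intrinsic_dimension(sample_embedded_plane(400))
    print(f"estimated intrinsic dimension: {estimate:.2f}")
```

For the synthetic sheet the estimate should come out close to 2 even though each point has five coordinates. The brute-force neighbor search is quadratic in the number of points, so this sketch is only suitable for small data sets.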