Source author record

Federico Garcia

Federico Garcia appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher · Unclaimed source record

Topics: astro-ph.HE, Machine Learning, astro-ph.GA, astro-ph.SR

Catalog footprint

What is connected

5 works

4 topics

4 close collaborators

Preprint · 2023 · arXiv

Generating Synthetic Clinical Data that Capture Class Imbalanced Distributions with Generative Adversarial Networks: Example using Antiretroviral Therapy for HIV

Clinical data usually cannot be freely distributed due to their highly confidential nature, and this hampers the development of machine learning in the healthcare domain. One way to mitigate this problem is by generating realistic synthetic datasets using generative adversarial networks (GANs). However, GANs are known to suffer from mode collapse, thus creating outputs of low diversity. This lowers the quality of the synthetic healthcare data, and may cause it to omit patients of minority demographics or neglect less common clinical practices. In this paper, we extend the classic GAN setup with an additional variational autoencoder (VAE) and include an external memory to replay latent features observed from the real samples to the GAN generator. Using antiretroviral therapy for human immunodeficiency virus (ART for HIV) as a case study, we show that our extended setup overcomes mode collapse and generates a synthetic dataset that accurately describes severely imbalanced class distributions commonly found in real-world clinical variables. In addition, we demonstrate that our synthetic dataset is associated with a very low patient disclosure risk, and that it retains a high level of utility from the ground truth dataset to support the development of downstream machine learning algorithms.

Preprint · 2022 · arXiv

Constraints to neutron-star kicks in High-Mass X-ray Binaries with Gaia EDR3

All neutron star progenitors in neutron-star High-Mass X-ray Binaries (NS HMXBs) undergo a supernova event that may lead to a significant natal kick impacting the motion of the whole binary system. The space observatory Gaia performs a deep optical survey with exquisite astrometric accuracy, for both position and proper motions, that can be used to study natal kicks in NS HMXBs. We aim to survey the observed Galactic NS HMXB population, to quantify the magnitude of the kick imparted onto their NSs, and to highlight any possible differences between the various HMXB types. We perform a census of Galactic NS HMXBs and cross-match existing detections in X-rays, optical and infrared with the Gaia Early Data Release 3 database. We retrieve their parallaxes, proper motions, and radial velocities (when available), and perform a selection based on the quality of the parallax measurement. We then compute their peculiar velocities with respect to the rotating reference frame of the Milky Way and, including their respective masses and periods, estimate their kick velocities through Markov Chain Monte Carlo simulations of the orbit undergoing a supernova event. We infer the posterior kick distributions of 35 NS HMXBs. After an inconclusive attempt at characterising the kick distributions with Maxwellian statistics, we find that the observed NS kicks are best reproduced by a Gamma distribution with a mean of $116^{+18}_{-15}$ km s$^{-1}$. We note that supergiant systems tend to have higher kick velocities than Be High-Mass X-ray Binaries. The peculiar velocity versus non-degenerate companion mass plane hints at a similar trend, with supergiant systems having a higher peculiar velocity independently of their companion mass.

Preprint · 2022 · arXiv

Coupling between the accreting corona and the relativistic jet in the micro quasar GRS 1915+105

GRS 1915+105 was the first stellar-mass black hole in our Galaxy to display a superluminal radio jet, similar to those observed in active galactic nuclei with a supermassive black hole at the centre. It has been proposed that the radio emission in GRS 1915+105 is fed by instabilities in the accretion disc by which the inner parts of the accretion flow are ejected in the jet. Here we show that there is a significant correlation between: (i) the radio flux, coming from the jet, and the flux of the iron emission line, coming from the disc, and (ii) the temperature of the corona that produces the high-energy part of the X-ray spectrum via inverse Compton scattering and the amplitude of a high-frequency variability component coming from the innermost part of the accretion flow. At the same time, the radio flux and the flux of the iron line are strongly anti-correlated with the temperature of the X-ray corona and the amplitude of the high-frequency variability component. These correlations persist over ~10 years, despite the highly variable X-ray and radio properties of the source in that period. Our findings provide, for the first time, incontrovertible evidence that the energy that powers this black-hole system can be directed either to the X-ray corona or the jet. When this energy is used to power the corona, raising its temperature, there is less energy left to fuel the jet and the radio flux drops, and vice versa. These facts, plus the modelling of the variability in this source, show conclusively that in GRS 1915+105 the X-ray corona morphs into the jet.

Preprint · 2022 · arXiv

Finding the birthplace of HMXBs in the Galaxy using Gaia EDR3: kinematical age determination through orbit integration

High-Mass X-ray Binaries (HMXBs) are produced after the first supernova event in a massive binary. These objects are intrinsically young, and can suffer from a significant natal kick. As such, the progenitors of HMXBs are likely to have formed away from the current location of the X-ray emitting systems. We aim to find the birthplace of the known HMXBs of our Milky Way. Specifically, we want to answer the question of whether the formation of HMXBs can be associated with open stellar clusters and/or Galactic spiral structures, and infer from that the time elapsed since the first supernova event. We use astrometric data from Gaia EDR3 to initialize the position and velocity of each known HMXB in the Galaxy, and integrate their motion back in time. In parallel, we perform the same calculations on a sample of 1381 open clusters detected by Gaia as well as for four Galactic spiral arms whose shape and motion have also been recently modelled using Gaia data. We report on all the encounter candidates between HMXBs and clusters or spiral arms in the past 100 Myr. In our sample of 26 HMXBs, we infer that 7 were born in clusters, 8 were born near a Galactic spiral arm, and conclude that 7 others could have formed in isolation. The birthplaces of the remaining 4 HMXBs are still inconclusive due to a combination of great distance, poor astrometric data and lack of known open clusters in the vicinity. We provide the kinematical age since supernova of 15 HMXBs. The astrometry from Gaia and the orbit integration we employ are effective at finding the birthplaces of HMXBs in the Milky Way. By considering the biases in our data and method, we find it is likely that the progenitors of HMXBs preferentially formed alongside other massive stars in open clusters.

Preprint · 2022 · arXiv

The Health Gym: Synthetic Health-Related Datasets for the Development of Reinforcement Learning Algorithms

In recent years, the machine learning research community has benefited tremendously from the availability of openly accessible benchmark datasets. Clinical data are usually not openly available due to their highly confidential nature. This has hampered the development of reproducible and generalisable machine learning applications in health care. Here we introduce the Health Gym - a growing collection of highly realistic synthetic medical datasets that can be freely accessed to prototype, evaluate, and compare machine learning algorithms, with a specific focus on reinforcement learning. The three synthetic datasets described in this paper present patient cohorts with acute hypotension and sepsis in the intensive care unit, and people with human immunodeficiency virus (HIV) receiving antiretroviral therapy in ambulatory care. The datasets were created using a novel generative adversarial network (GAN). The distributions of variables, and correlations between variables and trends over time in the synthetic datasets mirror those in the real datasets. Furthermore, the risk of sensitive information disclosure associated with the public distribution of the synthetic datasets is estimated to be very low.
