Source author record

Payel Das

Payel Das appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Machine Learning astro-ph.GA astro-ph.CO Biomolecules Artificial Intelligence Computation and Language Computer Vision Quantitative Methods astro-ph.HE astro-ph.SR cs.CY Distributed, Parallel, and Cluster Computing math.AT Molecular Networks Neural and Evolutionary Computing physics.chem-ph

Catalog footprint

What is connected

27works

16topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2023arXiv

AI Maintenance: A Robustness Perspective

With the advancements in machine learning (ML) methods and compute resources, artificial intelligence (AI) empowered systems are becoming a prevailing technology. However, current AI technology such as deep learning is not flawless. The significantly increased model complexity and data scale incur intensified challenges when lacking trustworthiness and transparency, which could create new risks and negative impacts. In this paper, we carve out AI maintenance from the robustness perspective. We start by introducing some highlighted robustness challenges in the AI lifecycle and motivating AI maintenance by making analogies to car maintenance. We then propose an AI model inspection framework to detect and mitigate robustness risks. We also draw inspiration from vehicle autonomy to define the levels of AI robustness automation. Our proposal for AI maintenance facilitates robustness assessment, status tracking, risk scanning, model hardening, and regulation throughout the AI lifecycle, which is an essential milestone toward building sustainable and trustworthy AI ecosystems.

preprint2023arXiv

Reprogramming Pretrained Language Models for Protein Sequence Representation Learning

Machine Learning-guided solutions for protein learning tasks have made significant headway in recent years. However, success in scientific discovery tasks is limited by the accessibility of well-defined and labeled in-domain data. To tackle the low-data constraint, recent adaptions of deep learning models pretrained on millions of protein sequences have shown promise; however, the construction of such domain-specific large-scale model is computationally expensive. Here, we propose Representation Learning via Dictionary Learning (R2DL), an end-to-end representation learning framework in which we reprogram deep models for alternate-domain tasks that can perform well on protein property prediction with significantly fewer training samples. R2DL reprograms a pretrained English language model to learn the embeddings of protein sequences, by learning a sparse linear mapping between English and protein sequence vocabulary embeddings. Our model can attain better accuracy and significantly improve the data efficiency by up to $10^5$ times over the baselines set by pretrained and standard supervised methods. To this end, we reprogram an off-the-shelf pre-trained English language transformer and benchmark it on a set of protein physicochemical prediction tasks (secondary structure, stability, homology, stability) as well as on a biomedically relevant set of protein function prediction tasks (antimicrobial, toxicity, antibody affinity).

preprint2022arXiv

Accurate Clinical Toxicity Prediction using Multi-task Deep Neural Nets and Contrastive Molecular Explanations

Explainable ML for molecular toxicity prediction is a promising approach for efficient drug development and chemical safety. A predictive ML model of toxicity can reduce experimental cost and time while mitigating ethical concerns by significantly reducing animal and clinical testing. Herein, we use a deep learning framework for simultaneously modeling in vitro, in vivo, and clinical toxicity data. Two different molecular input representations are used: Morgan fingerprints and pre-training SMILES embeddings. A multi-task deep learning model accurately predicts toxicity for all endpoints, including clinical, as indicated by AUROC and balanced accuracy. In particular, SMILES embeddings as input to the multi-task model improved clinical toxicity predictions compared to existing models in MoleculeNet benchmark. Additionally, our multi-task approach is comprehensive in the sense that it is comparable to state-of-the-art approaches for specific endpoints in in vitro, in vivo and clinical platforms. Through both the multi-task model and transfer learning, we were able to indicate the minimal need of in vivo data for clinical toxicity predictions. To provide confidence and explain the model's predictions, we adapt a post-hoc contrastive explanation method that returns pertinent positive and pertinent negative features, which correspond well to known mutagenic and reactive toxicophores, such as unsubstituted bonded heteroatoms, aromatic amines, and Michael receptors. Furthermore, toxicophore recovery by pertinent feature analysis captures more of the in vitro (53%) and in vivo (56%), rather than of the clinical (8%), endpoints, and indeed uncovers a preference in known toxicophore data towards in vitro and in vivo experimental data. To our knowledge, this is the first contrastive explanation, using both present and absent substructures, for predictions of clinical and in vivo molecular toxicity.

preprint2022arXiv

Augmenting Molecular Deep Generative Models with Topological Data Analysis Representations

Deep generative models have emerged as a powerful tool for learning useful molecular representations and designing novel molecules with desired properties, with applications in drug discovery and material design. However, most existing deep generative models are restricted due to lack of spatial information. Here we propose augmentation of deep generative models with topological data analysis (TDA) representations, known as persistence images, for robust encoding of 3D molecular geometry. We show that the TDA augmentation of a character-based Variational Auto-Encoder (VAE) outperforms state-of-the-art generative neural nets in accurately modeling the structural composition of the QM9 benchmark. Generated molecules are valid, novel, and diverse, while exhibiting distinct electronic property distribution, namely higher sample population with small HOMO-LUMO gap. These results demonstrate that TDA features indeed provide crucial geometric signal for learning abstract structures, which is non-trivial for existing generative models operating on string, graph, or 3D point sets to capture.

preprint2022arXiv

Cloud-Based Real-Time Molecular Screening Platform with MolFormer

With the prospect of automating a number of chemical tasks with high fidelity, chemical language processing models are emerging at a rapid speed. Here, we present a cloud-based real-time platform that allows users to virtually screen molecules of interest. For this purpose, molecular embeddings inferred from a recently proposed large chemical language model, named MolFormer, are leveraged. The platform currently supports three tasks: nearest neighbor retrieval, chemical space visualization, and property prediction. Based on the functionalities of this platform and results obtained, we believe that such a platform can play a pivotal role in automating chemistry and chemical engineering research, as well as assist in drug discovery and material design tasks. A demo of our platform is provided at \url{www.ibm.biz/molecular_demo}.

preprint2022arXiv

Data-Efficient Graph Grammar Learning for Molecular Generation

The problem of molecular generation has received significant attention recently. Existing methods are typically based on deep neural networks and require training on large datasets with tens of thousands of samples. In practice, however, the size of class-specific chemical datasets is usually limited (e.g., dozens of samples) due to labor-intensive experimentation and data collection. This presents a considerable challenge for the deep learning generative models to comprehensively describe the molecular design space. Another major challenge is to generate only physically synthesizable molecules. This is a non-trivial task for neural network-based generative models since the relevant chemical knowledge can only be extracted and generalized from the limited training data. In this work, we propose a data-efficient generative model that can be learned from datasets with orders of magnitude smaller sizes than common benchmarks. At the heart of this method is a learnable graph grammar that generates molecules from a sequence of production rules. Without any human assistance, these production rules are automatically constructed from training data. Furthermore, additional chemical knowledge can be incorporated in the model by further grammar optimization. Our learned graph grammar yields state-of-the-art results on generating high-quality molecules for three monomer datasets that contain only ${\sim}20$ samples each. Our approach also achieves remarkable performance in a challenging polymer generation task with only $117$ training samples and is competitive against existing methods using $81$k data points. Code is available at https://github.com/gmh14/data_efficient_grammar.

preprint2022arXiv

EDGE: What shapes the relationship between HI and stellar observables in faint dwarf galaxies?

We show how the interplay between feedback and mass-growth histories introduces scatter in the relationship between stellar and neutral gas properties of field faint dwarf galaxies ($M_{\star} \lessapprox 10^{6} M_{\odot}$). Across a suite of cosmological, high-resolution zoomed simulations, we find that dwarf galaxies of stellar masses $10^5 \leq M_{\star} \leq 10^{6} M_{\odot}$ are bimodal in their cold gas content, being either HI-rich or HI-deficient. This bimodality is generated through the coupling between (i) the modulation of HI contents by the background of ultraviolet radiation (UVB) at late times and (ii) the significant scatter in the stellar-mass-halo-mass relationship induced by reionization. Furthermore, our HI-rich dwarfs exhibit disturbed and time-variable neutral gas distributions primarily due to stellar feedback. Over the last four billion years, we observe order-of-magnitude changes around the median $M_{HI}$, factor-of-a-few variations in HI spatial extents, and spatial offsets between HI and stellar components regularly exceeding the galaxies' optical sizes. Time variability introduces further scatter in the $M_{\star}-M_{HI}$ relation and affects a galaxy's detectability in HI at any given time. These effects will need to be accounted for when interpreting observations of the population of faint, HI-bearing dwarfs by the combination of optical and radio wide, deep surveys.

preprint2022arXiv

Fourier Representations for Black-Box Optimization over Categorical Variables

Optimization of real-world black-box functions defined over purely categorical variables is an active area of research. In particular, optimization and design of biological sequences with specific functional or structural properties have a profound impact in medicine, materials science, and biotechnology. Standalone search algorithms, such as simulated annealing (SA) and Monte Carlo tree search (MCTS), are typically used for such optimization problems. In order to improve the performance and sample efficiency of such algorithms, we propose to use existing methods in conjunction with a surrogate model for the black-box evaluations over purely categorical variables. To this end, we present two different representations, a group-theoretic Fourier expansion and an abridged one-hot encoded Boolean Fourier expansion. To learn such representations, we consider two different settings to update our surrogate model. First, we utilize an adversarial online regression setting where Fourier characters of each representation are considered as experts and their respective coefficients are updated via an exponential weight update rule each time the black box is evaluated. Second, we consider a Bayesian setting where queries are selected via Thompson sampling and the posterior is updated via a sparse Bayesian regression model (over our proposed representation) with a regularized horseshoe prior. Numerical experiments over synthetic benchmarks as well as real-world RNA sequence optimization and design problems demonstrate the representational power of the proposed methods, which achieve competitive or superior performance compared to state-of-the-art counterparts, while improving the computation cost and/or sample efficiency, substantially.

preprint2022arXiv

Learning Geometrically Disentangled Representations of Protein Folding Simulations

Massive molecular simulations of drug-target proteins have been used as a tool to understand disease mechanism and develop therapeutics. This work focuses on learning a generative neural network on a structural ensemble of a drug-target protein, e.g. SARS-CoV-2 Spike protein, obtained from computationally expensive molecular simulations. Model tasks involve characterizing the distinct structural fluctuations of the protein bound to various drug molecules, as well as efficient generation of protein conformations that can serve as an complement of a molecular simulation engine. Specifically, we present a geometric autoencoder framework to learn separate latent space encodings of the intrinsic and extrinsic geometries of the protein structure. For this purpose, the proposed Protein Geometric AutoEncoder (ProGAE) model is trained on the protein contact map and the orientation of the backbone bonds of the protein. Using ProGAE latent embeddings, we reconstruct and generate the conformational ensemble of a protein at or near the experimental resolution, while gaining better interpretability and controllability in term of protein structure generation from the learned latent space. Additionally, ProGAE models are transferable to a different state of the same protein or to a new protein of different size, where only the dense layer decoding from the latent representation needs to be retrained. Results show that our geometric learning-based method enjoys both accuracy and efficiency for generating complex structural variations, charting the path toward scalable and improved approaches for analyzing and enhancing high-cost simulations of drug-target proteins.

preprint2022arXiv

The detailed chemical abundance patterns of accreted halo stars from the optical to infrared

Understanding the assembly of our Galaxy requires us to also characterize the systems that helped build it. In this work, we accomplish this by exploring the chemistry of accreted halo stars from the Gaia-Enceladus/Gaia-Sausage (GES) selected in the infrared from the Apache Point Observatory Galactic Evolution Experiment (APOGEE) Data Release 16. We use high resolution optical spectra for 62 GES stars to measure abundances in 20 elements spanning the $α$, Fe-peak, light, odd-Z, and notably, the neutron-capture groups of elements to understand their trends in the context of and in contrast to the Milky Way and other stellar populations. Using these derived abundances we find that the optical and the infrared abundances agree to within 0.15 dex except for O, Co, Na, Cu, and Ce. These stars have enhanced neutron-capture abundance trends compared to the Milky Way, and their [Eu/Mg] and neutron-capture abundance ratios (e.g., [Y/Eu], [Ba/Eu], [Zr/Ba], [La/Ba], and [Nd/Ba]) point to r-process enhancement and a delay in s-process enrichment. Their [$α$/Fe] trend is lower than the Milky Way trend for [Fe/H]$>$-1.5 dex, similar to previous studies of GES stars and consistent with the picture that these stars formed in a system with a lower rate of star formation. This is further supported by their depleted abundances in Ni, Na, and Cu abundances, again, similar to previous studies of low-$α$ stars with accreted origins.

preprint2022arXiv

Towards Creativity Characterization of Generative Models via Group-based Subset Scanning

Deep generative models, such as Variational Autoencoders (VAEs) and Generative Adversarial Networks (GANs), have been employed widely in computational creativity research. However, such models discourage out-of-distribution generation to avoid spurious sample generation, thereby limiting their creativity. Thus, incorporating research on human creativity into generative deep learning techniques presents an opportunity to make their outputs more compelling and human-like. As we see the emergence of generative models directed toward creativity research, a need for machine learning-based surrogate metrics to characterize creative output from these models is imperative. We propose group-based subset scanning to identify, quantify, and characterize creative processes by detecting a subset of anomalous node-activations in the hidden layers of the generative models. Our experiments on the standard image benchmarks, and their "creatively generated" variants, reveal that the proposed subset scores distribution is more useful for detecting creative processes in the activation space rather than the pixel space. Further, we found that creative samples generate larger subsets of anomalies than normal or non-creative samples across datasets. The node activations highlighted during the creative decoding process are different from those responsible for the normal sample generation. Lastly, we assess if the images from the subsets selected by our method were also found creative by human evaluators, presenting a link between creativity perception in humans and node activations within deep neural nets.

preprint2021arXiv

Optimizing Molecules using Efficient Queries from Property Evaluations

Machine learning based methods have shown potential for optimizing existing molecules with more desirable properties, a critical step towards accelerating new chemical discovery. Here we propose QMO, a generic query-based molecule optimization framework that exploits latent embeddings from a molecule autoencoder. QMO improves the desired properties of an input molecule based on efficient queries, guided by a set of molecular property predictions and evaluation metrics. We show that QMO outperforms existing methods in the benchmark tasks of optimizing small organic molecules for drug-likeness and solubility under similarity constraints. We also demonstrate significant property improvement using QMO on two new and challenging tasks that are also important in real-world discovery problems: (i) optimizing existing potential SARS-CoV-2 Main Protease inhibitors toward higher binding affinity; and (ii) improving known antimicrobial peptides towards lower toxicity. Results from QMO show high consistency with external validations, suggesting effective means to facilitate material optimization problems with design constraints.

preprint2021arXiv

Reprogramming Language Models for Molecular Representation Learning

Recent advancements in transfer learning have made it a promising approach for domain adaptation via transfer of learned representations. This is especially when relevant when alternate tasks have limited samples of well-defined and labeled data, which is common in the molecule data domain. This makes transfer learning an ideal approach to solve molecular learning tasks. While Adversarial reprogramming has proven to be a successful method to repurpose neural networks for alternate tasks, most works consider source and alternate tasks within the same domain. In this work, we propose a new algorithm, Representation Reprogramming via Dictionary Learning (R2DL), for adversarially reprogramming pretrained language models for molecular learning tasks, motivated by leveraging learned representations in massive state of the art language models. The adversarial program learns a linear transformation between a dense source model input space (language data) and a sparse target model input space (e.g., chemical and biological molecule data) using a k-SVD solver to approximate a sparse representation of the encoded data, via dictionary learning. R2DL achieves the baseline established by state of the art toxicity prediction models trained on domain-specific data and outperforms the baseline in a limited training-data setting, thereby establishing avenues for domain-agnostic transfer learning for tasks with molecule data.

preprint2020arXiv

Bridging Mode Connectivity in Loss Landscapes and Adversarial Robustness

Mode connectivity provides novel geometric insights on analyzing loss landscapes and enables building high-accuracy pathways between well-trained neural networks. In this work, we propose to employ mode connectivity in loss landscapes to study the adversarial robustness of deep neural networks, and provide novel methods for improving this robustness. Our experiments cover various types of adversarial attacks applied to different network architectures and datasets. When network models are tampered with backdoor or error-injection attacks, our results demonstrate that the path connection learned using limited amount of bonafide data can effectively mitigate adversarial effects while maintaining the original accuracy on clean data. Therefore, mode connectivity provides users with the power to repair backdoored or error-injected models. We also use mode connectivity to investigate the loss landscapes of regular and robust models against evasion attacks. Experiments show that there exists a barrier in adversarial robustness loss on the path connecting regular and adversarially-trained models. A high correlation is observed between the adversarial robustness loss and the largest eigenvalue of the input Hessian matrix, for which theoretical justifications are provided. Our results suggest that mode connectivity offers a holistic tool and practical means for evaluating and improving adversarial robustness.

preprint2020arXiv

CogMol: Target-Specific and Selective Drug Design for COVID-19 Using Deep Generative Models

The novel nature of SARS-CoV-2 calls for the development of efficient de novo drug design approaches. In this study, we propose an end-to-end framework, named CogMol (Controlled Generation of Molecules), for designing new drug-like small molecules targeting novel viral proteins with high affinity and off-target selectivity. CogMol combines adaptive pre-training of a molecular SMILES Variational Autoencoder (VAE) and an efficient multi-attribute controlled sampling scheme that uses guidance from attribute predictors trained on latent features. To generate novel and optimal drug-like molecules for unseen viral targets, CogMol leverages a protein-molecule binding affinity predictor that is trained using SMILES VAE embeddings and protein sequence embeddings learned unsupervised from a large corpus. CogMol framework is applied to three SARS-CoV-2 target proteins: main protease, receptor-binding domain of the spike protein, and non-structural protein 9 replicase. The generated candidates are novel at both molecular and chemical scaffold levels when compared to the training data. CogMol also includes insilico screening for assessing toxicity of parent molecules and their metabolites with a multi-task toxicity classifier, synthetic feasibility with a chemical retrosynthesis predictor, and target structure binding with docking simulations. Docking reveals favorable binding of generated molecules to the target protein structure, where 87-95 % of high affinity molecules showed docking free energy < -6 kcal/mol. When compared to approved drugs, the majority of designed compounds show low parent molecule and metabolite toxicity and high synthetic feasibility. In summary, CogMol handles multi-constraint design of synthesizable, low-toxic, drug-like molecules with high target specificity and selectivity, and does not need target-dependent fine-tuning of the framework or target structure information.

preprint2020arXiv

Improving Efficiency in Large-Scale Decentralized Distributed Training

Decentralized Parallel SGD (D-PSGD) and its asynchronous variant Asynchronous Parallel SGD (AD-PSGD) is a family of distributed learning algorithms that have been demonstrated to perform well for large-scale deep learning tasks. One drawback of (A)D-PSGD is that the spectral gap of the mixing matrix decreases when the number of learners in the system increases, which hampers convergence. In this paper, we investigate techniques to accelerate (A)D-PSGD based training by improving the spectral gap while minimizing the communication cost. We demonstrate the effectiveness of our proposed techniques by running experiments on the 2000-hour Switchboard speech recognition task and the ImageNet computer vision task. On an IBM P9 supercomputer, our system is able to train an LSTM acoustic model in 2.28 hours with 7.5% WER on the Hub5-2000 Switchboard (SWB) test set and 13.3% WER on the CallHome (CH) test set using 64 V100 GPUs and in 1.98 hours with 7.7% WER on SWB and 13.3% WER on CH using 128 V100 GPUs, the fastest training time reported to date.

preprint2020arXiv

Learning Implicit Text Generation via Feature Matching

Generative feature matching network (GFMN) is an approach for training implicit generative models for images by performing moment matching on features from pre-trained neural networks. In this paper, we present new GFMN formulations that are effective for sequential data. Our experimental results show the effectiveness of the proposed method, SeqGFMN, for three distinct generation tasks in English: unconditional text generation, class-conditional text generation, and unsupervised text style transfer. SeqGFMN is stable to train and outperforms various adversarial approaches for text generation and text style transfer.

preprint2020arXiv

seestar: Selection functions for spectroscopic surveys of the Milky Way

Selection functions are vital for understanding the observational biases of spectroscopic surveys. With the wide variety of multi-object spectrographs currently in operation and becoming available soon, we require easily generalisable methods for determining the selection functions of these surveys. Previous work, however, has largely been focused on generating individual, tailored selection functions for every data release of each survey. Moreover, no methods for combining these selection functions to be used for joint catalogues have been developed. We have developed a Poisson likelihood estimation method for calculating selection functions in a Bayesian framework, which can be generalised to any multi-object spectrograph. We include a robust treatment of overlapping fields within a survey as well as selection functions for combined samples with overlapping footprints. We also provide a method for transforming the selection function that depends on the sky positions, colour, and apparent magnitude of a star to one that depends on the galactic location, metallicity, mass, and age of a star. This `intrinsic' selection function is invaluable for chemodynamical models of the Milky Way. We demonstrate that our method is successful at recreating synthetic spectroscopic samples selected from a mock galaxy catalogue.

preprint2020arXiv

Toward A Neuro-inspired Creative Decoder

Creativity, a process that generates novel and meaningful ideas, involves increased association between task-positive (control) and task-negative (default) networks in the human brain. Inspired by this seminal finding, in this study we propose a creative decoder within a deep generative framework, which involves direct modulation of the neuronal activation pattern after sampling from the learned latent space. The proposed approach is fully unsupervised and can be used off-the-shelf. Several novelty metrics and human evaluation were used to evaluate the creative capacity of the deep decoder. Our experiments on different image datasets (MNIST, FMNIST, MNIST+FMNIST, WikiArt and CelebA) reveal that atypical co-activation of highly activated and weakly activated neurons in a deep decoder promotes generation of novel and meaningful artifacts.

preprint2020arXiv

Using heritability of stellar chemistry to reveal the history of the Milky Way

Since chemical abundances are inherited between generations of stars, we use them to trace the evolutionary history of our Galaxy. We present a robust methodology for creating a phylogenetic tree, a biological tool used for centuries to study heritability. Combining our phylogeny with information on stellar ages and dynamical properties, we reconstruct the shared history of 78 stars in the Solar Neighbourhood. The branching pattern in our tree supports a scenario in which the thick disk is an ancestral population of the thin disk. The transition from thick to thin disk shows an anomaly, which we attribute to a star formation burst. Our tree shows a further signature of the variability in stars similar to the Sun, perhaps linked to a minor star formation enhancement creating our Solar System. In this paper, we demonstrate the immense potential of a phylogenetic perspective and interdisciplinary collaboration, where with borrowed techniques from biology we can study key processes that have contributed to the evolution of the Milky Way.

preprint2019arXiv

Ages and kinematics of chemically selected, accreted Milky Way halo stars

We exploit the [Mg/Mn]-[Al/Fe] chemical abundance plane to help identify nearby halo stars in the 14th data release from the APOGEE survey that have been accreted on to the Milky Way. Applying a Gaussian Mixture Model, we find a `blob' of 856 likely accreted stars, with a low disc contamination rate of ~7%. Cross-matching the sample with the second data release from Gaia gives us access to parallaxes and apparent magnitudes, which place constraints on distances and intrinsic luminosities. Using a Bayesian isochrone pipeline, this enables us to estimate new ages for the accreted stars, with typical uncertainties of ~20%. Our new catalogue is further supplemented with estimates of orbital parameters. The blob stars span a metallicities between -0.5 to -2.5, and [Mg/Fe] between -0.1 to 0.5. They constitute ~30% of the metal-poor ([Fe/H] < -0.8) halo at metallicities of ~-1.4. Our new ages are mainly range between 8 to 13 Gyr, with the oldest stars the metal-poorest, and with the highest [Mg/Fe] abundance. If the blob stars are assumed to belong to a single progenitor, the ages imply that the system merged with our Milky Way around 8 Gyr ago and that star formation proceeded for ~5 Gyr. Dynamical arguments suggest that such a single progenitor would have a total mass of ~1011Msun, similar to that found by other authors using chemical evolution models and simulations. Comparing the scatter in the [Mg/Fe]-[Fe/H] plane of the blob stars to that measured for stars belonging to the Large Magellanic Cloud suggests that the blob does indeed contain stars from only one progenitor.

preprint2016arXiv

Characterising stellar halo populations I: An extended distribution function for halo K giants

We fit an Extended Distribution Function (EDF) to K giants in the Sloan Extension for Galactic Understanding and Exploration (SEGUE) survey. These stars are detected to radii ~80 kpc and span a wide range in [Fe/H]. Our EDF, which depends on [Fe/H] in addition to actions, encodes the entanglement of metallicity with dynamics within the Galaxy's stellar halo. Our maximum-likelihood fit of the EDF to the data allows us to model the survey's selection function. The density profile of the K giants steepens with radius from a slope ~-2 to ~-4 at large radii. The halo's axis ratio increases with radius from 0.7 to almost unity. The metal-rich stars are more tightly confined in action space than the metal-poor stars and form a more flattened structure. A weak metallicity gradient ~-0.001 dex/kpc, a small gradient in the dispersion in [Fe/H] of ~0.001 dex/kpc, and a higher degree of radial anistropy in metal-richer stars result. Lognormal components with peaks at ~-1.5 and ~-2.3 are required to capture the overall metallicity distribution, suggestive of the existence of two populations of K giants. The spherical anisotropy parameter varies between 0.3 in the inner halo to isotropic in the outer halo. If the Sagittarius stream is included, a very similar model is found but with a stronger degree of radial anisotropy throughout.

preprint2016arXiv

Characterizing stellar halo populations II: The age gradient in blue horizontal-branch stars

The distribution of Milky Way halo blue horizontal-branch (BHB) stars is examined using action-based extended distribution functions (EDFs) that describe the locations of stars in phase space, metallicity, and age. The parameters of the EDFs are fitted using stars observed in the Sloan Extension for Galactic Understanding and Exploration-II (SEGUE-II) survey that trace the phase-space kinematics and chemistry out to ~70 kpc. A maximum a posteriori probability (MAP) estimate method and a Markov Chain Monte Carlo method are applied, taking into account the selection function in positions, distance, and metallicity for the survey. The best-fit EDF declines with actions less steeply at actions characteristic of the inner halo than at the larger actions characteristic of the outer halo, and older ages are found at smaller actions than at larger actions. In real space, the radial density profile steepens smoothly from -2 at ~2 kpc to -4 in the outer halo, with an axis ratio ~0.7 throughout. There is no indication for rotation in the BHBs, although this is highly uncertain. A moderate level of radial anisotropy is detected, with $β_s$ varying from isotropic to between ~0.1 and ~0.3 in the outer halo depending on latitude. The BHB data are consistent with an age gradient of -0.03 Gyr kpc$^{-1}$, with some uncertainty in the distribution of the larger ages. These results are consistent with a scenario in which older, larger systems contribute to the inner halo, whilst the outer halo is primarily comprised of younger, smaller systems.

preprint2011arXiv

Using NMAGIC to probe the dark matter halo and orbital structure of the X-ray bright, massive elliptical galaxy, NGC 4649

We create dynamical models of the massive elliptical galaxy, NGC 4649, using the N-body made-to-measure code, NMAGIC, and kinematic constraints from long-slit and planetary nebula (PN) data. We explore a range of potentials based on previous determinations from X-ray observations and a dynamical model fitting globular cluster (GC) velocities and a stellar density profile. The X-ray mass distributions are similar in the central region but have varying outer slopes, while the GC mass profile is higher in the central region and on the upper end of the range further out. Our models cannot differentiate between the potentials in the central region, and therefore if non-thermal pressures or multi-phase components are present in the hot gas, they must be smaller than previously inferred. In the halo, we find that the PN velocities are sensitive tracers of the mass, preferring a less massive halo than that derived from the GC mass profile, but similar to one of the mass distributions derived from X-rays. Our results show that the GCs may form a dynamically distinct system, and that the properties of the hot gas derived from X-rays in the outer halo have considerable uncertainties that need to be better understood. Estimating the mass in stars using photometric information and a stellar population mass-to-light ratio, we infer a dark matter mass fraction in NGC 4649 of ~0.39 at 1Re (10.5 kpc) and ~0.78 at 4Re. We find that the stellar orbits are isotropic to mildly radial in the central ~6 kpc depending on the potential assumed. Further out, the orbital structure becomes slightly more radial along R and more isotropic along z, regardless of the potential assumed. In the equatorial plane, azimuthal velocity dispersions dominate over meridional velocity dispersions, implying that meridional velocity anisotropy is the mechanism for flattening the stellar system.

preprint2010arXiv

Counter-dispersed slitless-spectroscopy technique: planetary nebula velocities in the halo of NGC 1399

Using a counter-dispersed slitless spectroscopy technique, we detect and measure the line-of-sight velocities of 187 planetary nebulae (PNe) around one of the nearest cD galaxies, NGC 1399, with FORS1 on the VLT. We describe the method for identifying and classifying the emission-line sources and the procedure for computing their J2000 coordinates and velocities. The number of PN detections and the errors in the velocity measurements (37 km/s indicate that this technique is comparable to other methods, such as that described by Teodorescu et al. (2005). We present the spatial distribution of the PNe and a basic analysis of their velocities. The PN two-dimensional velocity field shows marginal rotation consistent with other studies. We also find a low-velocity substructure in the halo and a flatter velocity-dispersion profile compared to previous observations that extends to ~400 arcsec. The detection of a low-velocity subcomponent underscores the importance of discrete velocity tracers for the detection of un-mixed components. The new velocity-dispersion profile is in good agreement with revised velocity dispersions for the red globular clusters in NGC 1399, using the data of Schuberth et al. (2009). The outer parts of this profile are consistent with one of the dynamical models of Kronawitter et al. (2000), which corresponds to a circular velocity of ~340 km/s and a rescaled B-band mass-to-light ratio of ~20 at 7' radius. These measurements trace the kinematics of the outer halo and disentangle the heterogenous populations in the Fornax Cluster core. The new data set the stage for a revised dynamical model of the outer halo of NGC 1399.

preprint2010arXiv

Steepening mass profiles, dark matter and environment of X-ray bright elliptical galaxies

We use a new non-parametric Bayesian approach to obtain the most probable mass distributions and circular velocity curves along with their confidence ranges, given deprojected density and temperature profiles of the hot gas surrounding X-ray bright elliptical galaxies. For a sample of six X-ray bright ellipticals, we find that all circular velocity curves are rising in the outer parts due to a combination of a rising temperature profile and a logarithmic pressure gradient that increases in magnitude. Comparing the circular velocity curves we obtain from X-rays to those obtained from dynamical models, we find that the former are often lower in the central ~10 kpc. This is probably due to a combination of: i) Non-thermal contributions of up to ~35% in the pressure (with stronger effects in NGC 4486), ii) multiple-temperature components in the hot gas, iii) incomplete kinematic spatial coverage in the dynamical models, and iv) mass profiles that are insufficiently general in the dynamical modelling. Complementing the total mass information from the X-rays with photometry and stellar population models to infer the dark matter content, we find evidence for massive dark matter haloes with dark matter mass fractions of ~35-80% at 2Re, rising to a maximum of 80-90% at the outermost radii. We also find that the six galaxies follow a Tully-Fisher relation with slope ~4 and that their circular velocities at 1Re correlate strongly with the velocity dispersion of the local environment. As a result, the galaxy luminosity at 1Re also correlates with the velocity dispersion of the environment. These relations suggest a close link between the properties of central X-ray bright elliptical galaxies and their environments (abridged).

preprint2009arXiv

The edge of the M87 halo and the kinematics of the diffuse light in the Virgo cluster core

We present high resolution FLAMES/VLT spectroscopy of intracluster planetary nebula (ICPN) candidates, targeting three new fields in the Virgo cluster core with surface brightness down to mu_B = 28.5. Based on the projected phase space information we separate the old and 12 newly-confirmed PNs into galaxy and intracluster components. The M87 PNs are confined to the extended stellar envelope of M87, within a projected radius of ~ 160 kpc, while the ICPNs are scattered across the whole surveyed region between M87 and M86. The velocity dispersions determined from the M87 PNs at projected radii of 60 kpc and 144 kpc show that the galaxy's velocity dispersion profile decreases in the outer halo, down to 78 +/- 25 km/s. A Jeans model for the M87 halo stars in the gravitational potential traced by the X-ray emission fits the observed velocity dispersion profile only if the stellar orbits are strongly radially anisotropic (beta ~= 0.4 at r ~= 10 kpc increasing to 0.8 at the outer edge), and if additionally the stellar halo is truncated at ~= 150 kpc average elliptical radius. From the spatial and velocity distribution of the ICPNs we infer that M87 and M86 are falling towards each other and that we may be observing them just before the first close pass. The inferred luminosity-specific PN numbers for the M87 halo and the ICL are in the range of values observed for old (> 10 Gyr) stellar populations (abridged).

Payel Das

What is connected

Connect this record

See the researcher in context

Building this map preview

27 published item(s)

AI Maintenance: A Robustness Perspective

Reprogramming Pretrained Language Models for Protein Sequence Representation Learning

Accurate Clinical Toxicity Prediction using Multi-task Deep Neural Nets and Contrastive Molecular Explanations

Augmenting Molecular Deep Generative Models with Topological Data Analysis Representations

Cloud-Based Real-Time Molecular Screening Platform with MolFormer

Data-Efficient Graph Grammar Learning for Molecular Generation

EDGE: What shapes the relationship between HI and stellar observables in faint dwarf galaxies?

Fourier Representations for Black-Box Optimization over Categorical Variables

Learning Geometrically Disentangled Representations of Protein Folding Simulations

The detailed chemical abundance patterns of accreted halo stars from the optical to infrared

Towards Creativity Characterization of Generative Models via Group-based Subset Scanning

Optimizing Molecules using Efficient Queries from Property Evaluations

Reprogramming Language Models for Molecular Representation Learning

Bridging Mode Connectivity in Loss Landscapes and Adversarial Robustness

CogMol: Target-Specific and Selective Drug Design for COVID-19 Using Deep Generative Models

Improving Efficiency in Large-Scale Decentralized Distributed Training

Learning Implicit Text Generation via Feature Matching

seestar: Selection functions for spectroscopic surveys of the Milky Way

Toward A Neuro-inspired Creative Decoder

Using heritability of stellar chemistry to reveal the history of the Milky Way

Ages and kinematics of chemically selected, accreted Milky Way halo stars

Characterising stellar halo populations I: An extended distribution function for halo K giants

Characterizing stellar halo populations II: The age gradient in blue horizontal-branch stars

Using NMAGIC to probe the dark matter halo and orbital structure of the X-ray bright, massive elliptical galaxy, NGC 4649

Counter-dispersed slitless-spectroscopy technique: planetary nebula velocities in the halo of NGC 1399

Steepening mass profiles, dark matter and environment of X-ray bright elliptical galaxies

The edge of the M87 halo and the kinematics of the diffuse light in the Virgo cluster core