Source author record

David A. Winkler

David A. Winkler appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Machine Learning Biomolecules cond-mat.mtrl-sci Logic in Computer Science physics.comp-ph

Catalog footprint

What is connected

3works

5topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2022arXiv

EFI: A Toolbox for Feature Importance Fusion and Interpretation in Python

This paper presents an open-source Python toolbox called Ensemble Feature Importance (EFI) to provide machine learning (ML) researchers, domain experts, and decision makers with robust and accurate feature importance quantification and more reliable mechanistic interpretation of feature importance for prediction problems using fuzzy sets. The toolkit was developed to address uncertainties in feature importance quantification and lack of trustworthy feature importance interpretation due to the diverse availability of machine learning algorithms, feature importance calculation methods, and dataset dependencies. EFI merges results from multiple machine learning models with different feature importance calculation approaches using data bootstrapping and decision fusion techniques, such as mean, majority voting and fuzzy logic. The main attributes of the EFI toolbox are: (i) automatic optimisation of ML algorithms, (ii) automatic computation of a set of feature importance coefficients from optimised ML algorithms and feature importance calculation techniques, (iii) automatic aggregation of importance coefficients using multiple decision fusion techniques, and (iv) fuzzy membership functions that show the importance of each feature to the prediction task. The key modules and functions of the toolbox are described, and a simple example of their application is presented using the popular Iris dataset.

preprint2020arXiv

Computational screening of repurposed drugs and natural products against SARS-Cov-2 main protease (Mpro) as potential COVID-19 therapies

There remains an urgent need to identify existing drugs that might be suitable for treating patients suffering from COVID-19 infection. Drugs rarely act at a single molecular target, with off target effects often being responsible for undesirable side effects and sometimes, beneficial synergy between targets for a specific illness. Off target activities have also led to blockbuster drugs in some cases, e.g. Viagra for erectile dysfunction and Minoxidil for male pattern hair loss. Drugs already in use or in clinical trials plus approved natural products constitute a rich resource for discovery of therapeutic agents that can be repurposed for existing and new conditions, based on the rationale that they have already been assessed for safety in man. A key question then is how to rapidly and efficiently screen such compounds for activity against new pandemic pathogens such as COVID-19. Here we show how a fast and robust computational process can be used to screen large libraries of drugs and natural compounds to identify those that may inhibit the main protease of SARS-Cov-2 (3CL pro, Mpro). We show how the resulting shortlist of candidates with strongest binding affinities is highly enriched in compounds that have been independently identified as potential antivirals against COVID-19. The top candidates also include a substantial number of drugs and natural products not previously identified as having potential COVID-19 activity, thereby providing additional targets for experimental validation. This in silico screening pipeline may also be useful for repurposing of existing drugs and discovery of new drug candidates against other medically important pathogens and for use in future pandemics.

preprint2020arXiv

Impressive computational acceleration by using machine learning for 2-dimensional super-lubricant materials discovery

The screening of novel materials is an important topic in the field of materials science. Although traditional computational modeling, especially first-principles approaches, is a very useful and accurate tool to predict the properties of novel materials, it still demands extensive and expensive state-of-the-art computational resources. Additionally, they can be often extremely time consuming. We describe a time and resource-efficient machine learning approach to create a large dataset of structural properties of van der Waals layered structures. In particular, we focus on the interlayer energy and the elastic constant of layered materials composed of two different 2-dimensional (2D) structures, that are important for novel solid lubricant and super-lubricant materials. We show that machine learning models can recapitulate results of computationally expansive approaches (i.e. density functional theory) with high accuracy.

David A. Winkler

What is connected

Connect this record

See the researcher in context

Building this map preview

3 published item(s)

EFI: A Toolbox for Feature Importance Fusion and Interpretation in Python

Computational screening of repurposed drugs and natural products against SARS-Cov-2 main protease (Mpro) as potential COVID-19 therapies

Impressive computational acceleration by using machine learning for 2-dimensional super-lubricant materials discovery