Source author record

Alberto Michelini

Alberto Michelini appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

hep-th Distributed, Parallel, and Cluster Computing hep-lat hep-ph physics.geo-ph Machine Learning

Catalog footprint

What is connected

8works

6topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2022arXiv

Enabling Dynamic and Intelligent Workflows for HPC, Data Analytics, and AI Convergence

The evolution of High-Performance Computing (HPC) platforms enables the design and execution of progressively larger and more complex workflow applications in these systems. The complexity comes not only from the number of elements that compose the workflows but also from the type of computations they perform. While traditional HPC workflows target simulations and modelling of physical phenomena, current needs require in addition data analytics (DA) and artificial intelligence (AI) tasks. However, the development of these workflows is hampered by the lack of proper programming models and environments that support the integration of HPC, DA, and AI, as well as the lack of tools to easily deploy and execute the workflows in HPC systems. To progress in this direction, this paper presents use cases where complex workflows are required and investigates the main issues to be addressed for the HPC/DA/AI convergence. Based on this study, the paper identifies the challenges of a new workflow platform to manage complex workflows. Finally, it proposes a development approach for such a workflow platform addressing these challenges in two directions: first, by defining a software stack that provides the functionalities to manage these complex workflows; and second, by proposing the HPC Workflow as a Service (HPCWaaS) paradigm, which leverages the software stack to facilitate the reusability of complex workflows in federated HPC infrastructures. Proposals presented in this work are subject to study and development as part of the EuroHPC eFlows4HPC project.

preprint2022arXiv

Intra-domain and cross-domain transfer learning for time series data -- How transferable are the features?

In practice, it is very demanding and sometimes impossible to collect datasets of tagged data large enough to successfully train a machine learning model, and one possible solution to this problem is transfer learning. This study aims to assess how transferable are the features between different domains of time series data and under which conditions. The effects of transfer learning are observed in terms of predictive performance of the models and their convergence rate during training. In our experiment, we use reduced data sets of 1,500 and 9,000 data instances to mimic real world conditions. Using the same scaled-down datasets, we trained two sets of machine learning models: those that were trained with transfer learning and those that were trained from scratch. Four machine learning models were used for the experiment. Transfer of knowledge was performed within the same domain of application (seismology), as well as between mutually different domains of application (seismology, speech, medicine, finance). We observe the predictive performance of the models and the convergence rate during the training. In order to confirm the validity of the obtained results, we repeated the experiments seven times and applied statistical tests to confirm the significance of the results. The general conclusion of our study is that transfer learning is very likely to either increase or not negatively affect the predictive performance of the model or its convergence rate. The collected data is analysed in more details to determine which source and target domains are compatible for transfer of knowledge. We also analyse the effect of target dataset size and the selection of model and its hyperparameters on the effects of transfer learning.

preprint2021arXiv

Which picker fits my data? A quantitative evaluation of deep learning based seismic pickers

Seismic event detection and phase picking are the base of many seismological workflows. In recent years, several publications demonstrated that deep learning approaches significantly outperform classical approaches and even achieve human-like performance under certain circumstances. However, as most studies differ in the datasets and exact evaluation tasks studied, it is yet unclear how the different approaches compare to each other. Furthermore, there are no systematic studies how the models perform in a cross-domain scenario, i.e., when applied to data with different characteristics. Here, we address these questions by conducting a large-scale benchmark study. We compare six previously published deep learning models on eight datasets covering local to teleseismic distances and on three tasks: event detection, phase identification and onset time picking. Furthermore, we compare the results to a classical Baer-Kradolfer picker. Overall, we observe the best performance for EQTransformer, GPD and PhaseNet, with EQTransformer having a small advantage for teleseismic data. Furthermore, we conduct a cross-domain study, in which we analyze model performance on datasets they were not trained on. We show that trained models can be transferred between regions with only mild performance degradation, but not from regional to teleseismic data or vice versa. As deep learning for detection and picking is a rapidly evolving field, we ensured extensibility of our benchmark by building our code on standardized frameworks and making it openly accessible. This allows model developers to easily compare new models or evaluate performance on new datasets, beyond those presented here. Furthermore, we make all trained models available through the SeisBench framework, giving end-users an easy way to apply these models in seismological analysis.

preprint2020arXiv

Local earthquakes detection: A benchmark dataset of 3-component seismograms built on a global scale

Machine learning is becoming increasingly important in scientific and technological progress, due to its ability to create models that describe complex data and generalize well. The wealth of publicly-available seismic data nowadays requires automated, fast, and reliable tools to carry out a multitude of tasks, such as the detection of small, local earthquakes in areas characterized by sparsity of receivers. A similar application of machine learning, however, should be built on a large amount of labeled seismograms, which is neither immediate to obtain nor to compile. In this study we present a large dataset of seismograms recorded along the vertical, north, and east components of 1487 broad-band or very broad-band receivers distributed worldwide; this includes 629,095 3-component seismograms generated by 304,878 local earthquakes and labeled as EQ, and 615,847 ones labeled as noise (AN). Application of machine learning to this dataset shows that a simple Convolutional Neural Network of 67,939 parameters allows discriminating between earthquakes and noise single-station recordings, even if applied in regions not represented in the training set. Achieving an accuracy of 96.7, 95.3, and 93.2% on training, validation, and test set, respectively, we prove that the large variety of geological and tectonic settings covered by our data supports the generalization capabilities of the algorithm, and makes it applicable to real-time detection of local events. We make the database publicly available, intending to provide the seismological and broader scientific community with a benchmark for time-series to be used as a testing ground in signal processing.

preprint2015arXiv

VERCE delivers a productive e-Science environment for seismology research

The VERCE project has pioneered an e-Infrastructure to support researchers using established simulation codes on high-performance computers in conjunction with multiple sources of observational data. This is accessed and organised via the VERCE science gateway that makes it convenient for seismologists to use these resources from any location via the Internet. Their data handling is made flexible and scalable by two Python libraries, ObsPy and dispel4py and by data services delivered by ORFEUS and EUDAT. Provenance driven tools enable rapid exploration of results and of the relationships between data, which accelerates understanding and method improvement. These powerful facilities are integrated and draw on many other e-Infrastructures. This paper presents the motivation for building such systems, it reviews how solid-Earth scientists can make significant research progress using them and explains the architecture and mechanisms that make their construction and operation achievable. We conclude with a summary of the achievements to date and identify the crucial steps needed to extend the capabilities for seismologists, for solid-Earth scientists and for similar disciplines.

preprint2011arXiv

Non-Abelian monopole-vortex complex

In the context of softly broken N=2 supersymmetric quantum chromodynamics (SQCD), with a hierarchical gauge symmetry breaking SU(N+1) -> U(N) -> 1, at scales v1 and v2, respectively, where v1 >> v2, we construct monopole-vortex complex soliton-like solutions and examine their properties. They represent the minimum of the static energy under the constraint that the monopole and antimonopole positions sitting at the extremes of the vortex are kept fixed. They interpolate the 't Hooft-Polyakov-like regular monopole solution near the monopole centers and a vortex solution far from them and in between. The main result, obtained in the theory with Nf=N equal-mass flavors, is concerned with the existence of exact orientational CP(N-1) zero modes, arising from the exact color-flavor diagonal SU(N)_{C+F} global symmetry. The "unbroken" subgroup SU(N) \subset SU(N+1) with which the naïve notion of non-Abelian monopoles and the related difficulties were associated, is explicitly broken at low energies. The monopole transforms nevertheless according to the fundamental representation of a new exact, unbroken SU(N) symmetry group, as does the vortex attached to it. We argue that this explains the origin of the dual non-Abelian gauge symmetry.

preprint2011arXiv

Nonabelian Faddeev-Niemi Decomposition of the SU(3) Yang-Mills Theory

Faddeev and Niemi (FN) have introduced an abelian gauge theory which simulates dynamical abelianization in Yang-Mills theory (YM). It contains both YM instantons and Wu-Yang monopoles and appears to be able to describe the confining phase. Motivated by the meson degeneracy problem in dynamical abelianization models, in this note we present a generalization of the FN theory. We first generalize the Cho connection to dynamical symmetry breaking pattern SU(N+1) -> U(N), and subsequently try to complete the Faddeev-Niemi decomposition by keeping the missing degrees of freedom. While it is not possible to write an on-shell complete FN decomposition, in the case of SU(3) theory of physical interest we find an off-shell complete decomposition for SU(3) -> U(2) which amounts to partial gauge fixing, generalizing naturally the result found by Faddeev and Niemi for the abelian scenario SU(N+1) -> U(1)^N. We discuss general topological aspects of these breakings, demonstrating for example that the FN knot solitons never exist when the unbroken gauge symmetry is nonabelian, and recovering the usual no-go theorems for colored dyons.

preprint2010arXiv

Monopole-vortex complex in a theta vacuum

We discuss aspects of the monopole-vortex complex soliton arising in a hierarchically broken gauge system, G to H to 1, in a theta vacuum of the underlying G theory. Here we focus our attention mainly on the simplest such system with G=SU(2) and H=U(1). A consistent picture of the effect of the theta parameter is found both in a macroscopic, dual picture and in a microscopic description of the monopole-vortex complex soliton.