Source author record

Mariam Kiran

Mariam Kiran appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher · Unclaimed source record

Machine Learning; Distributed, Parallel, and Cluster Computing; Multiagent Systems; Computational Engineering, Finance, and Science; eess.SP; Networking and Internet Architecture; Neural and Evolutionary Computing; physics.atom-ph; quant-ph; Quantitative Methods

Catalog footprint

What is connected

9 works

10 topics

4 close collaborators

preprint · 2022 · arXiv

Hyperparameter Tuning for Deep Reinforcement Learning Applications

Reinforcement learning (RL), where an agent learns optimal behaviors by interacting with the environment, is quickly gaining tremendous success in a wide variety of applications, from controlling simple pendulums to complex data centers. However, setting the right hyperparameters can have a huge impact on the performance and reliability of the deployed solution, since the inference models produced via RL are used for decision-making. Hyperparameter search itself is a laborious process that requires many iterations and is computationally expensive when searching for the settings that produce the best neural network architectures. In comparison to other neural network architectures, deep RL has not witnessed much hyperparameter tuning, due to its algorithmic complexity and the simulation platforms needed. In this paper, we propose a distributed variable-length genetic algorithm framework to systematically tune hyperparameters for various RL applications, improving training time and robustness of the architecture via evolution. We demonstrate the scalability of our approach on many RL problems (from simple gyms to complex applications) and compare it with a Bayesian approach. Our results show that, with more generations, the approach finds optimal solutions that require fewer training episodes and are computationally cheap while being more robust for deployment. Our results are imperative to advance deep reinforcement learning controllers for real-world problems.
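To make the idea of a variable-length genetic algorithm concrete, the following is a minimal serial sketch, not the paper's distributed framework. The genome layout (a learning rate plus a variable-length list of layer widths), the `toy_fitness` function standing in for "train an RL agent and score it", and all constants are assumptions chosen for illustration.

```python
import math
import random

# Hypothetical search space: a genome is a learning rate plus a
# variable-length list of hidden-layer widths.
LAYER_CHOICES = [16, 32, 64, 128]
MAX_LAYERS = 4


def random_genome(rng):
    n_layers = rng.randint(1, MAX_LAYERS)
    return {
        "lr": 10 ** rng.uniform(-5, -1),
        "layers": [rng.choice(LAYER_CHOICES) for _ in range(n_layers)],
    }


def toy_fitness(genome):
    # Assumed stand-in for an RL training run; higher is better.
    # It peaks at lr = 1e-3 with two 64-unit layers.
    lr_term = -abs(math.log10(genome["lr"]) + 3)
    depth_term = -abs(len(genome["layers"]) - 2)
    width_term = -sum(abs(w - 64) for w in genome["layers"]) / 64
    return lr_term + depth_term + width_term


def crossover(a, b, rng):
    # Cut points are drawn independently for each parent, so the child's
    # layer list can be shorter or longer than either parent's.
    cut_a = rng.randint(0, len(a["layers"]))
    cut_b = rng.randint(0, len(b["layers"]))
    layers = (a["layers"][:cut_a] + b["layers"][cut_b:])[:MAX_LAYERS]
    if not layers:
        layers = [rng.choice(LAYER_CHOICES)]
    return {"lr": rng.choice([a["lr"], b["lr"]]), "layers": layers}


def mutate(genome, rng, rate=0.3):
    lr, layers = genome["lr"], list(genome["layers"])
    if rng.random() < rate:  # perturb the learning rate on a log scale
        lr = min(1e-1, max(1e-5, lr * 10 ** rng.uniform(-0.5, 0.5)))
    if rng.random() < rate and len(layers) < MAX_LAYERS:  # grow
        layers.insert(rng.randint(0, len(layers)), rng.choice(LAYER_CHOICES))
    if rng.random() < rate and len(layers) > 1:  # shrink
        layers.pop(rng.randrange(len(layers)))
    return {"lr": lr, "layers": layers}


def evolve(fitness, generations=20, pop_size=20, elite=2, seed=0):
    rng = random.Random(seed)
    population = [random_genome(rng) for _ in range(pop_size)]
    history = []  # best fitness seen at the start of each generation
    for _ in range(generations):
        population.sort(key=fitness, reverse=True)
        history.append(fitness(population[0]))
        parents = population[: pop_size // 2]
        children = [
            mutate(crossover(rng.choice(parents), rng.choice(parents), rng), rng)
            for _ in range(pop_size - elite)
        ]
        population = population[:elite] + children  # elitism keeps the best
    population.sort(key=fitness, reverse=True)
    history.append(fitness(population[0]))
    return population[0], history


if __name__ == "__main__":
    best, history = evolve(toy_fitness)
    print(best, history[0], history[-1])
```

Because the top `elite` genomes are carried over unchanged, the best fitness in `history` never decreases from one generation to the next. In a real setting each fitness evaluation would be a full training run, which is the step a distributed framework would parallelize.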

preprint · 2022 · arXiv

The LBNL Superfacility Project Report

The Superfacility model is designed to leverage HPC for experimental science. It is more than simply a model of connected experiment, network, and HPC facilities; it encompasses the full ecosystem of infrastructure, software, tools, and expertise needed to make connected facilities easy to use. The three-year Lawrence Berkeley National Laboratory (LBNL) Superfacility project was initiated in 2019 to coordinate work being performed at LBNL to support this model, and to provide a coherent and comprehensive set of science requirements to drive existing and new work. A key component of the project was in-depth engagement with eight science teams that represent challenging use cases across the DOE Office of Science. By the close of the project, we met our project goal by enabling our science application engagements to demonstrate automated pipelines that analyze data from remote facilities at large scale, without routine human intervention. In several cases, we have gone beyond demonstrations and now provide production-level services. To achieve this goal, the Superfacility team developed tools, infrastructure, and policies for near-real-time computing support, dynamic high-performance networking, data management and movement tools, API-driven automation, HPC-scale notebooks via Jupyter, authentication using Federated Identity, and container-based edge services. The lessons we learned during this project provide a valuable model for future large, complex, cross-disciplinary collaborations. There is a pressing need for a coherent computing infrastructure across national facilities, and LBNL's Superfacility project is a unique model for success in tackling the challenges that will be faced in hardware, software, policies, and services across multiple science domains.

preprint · 2021 · arXiv

HYPPO: A Surrogate-Based Multi-Level Parallelism Tool for Hyperparameter Optimization

We present new software, HYPPO, that enables the automatic tuning of hyperparameters of various deep learning (DL) models. Unlike other hyperparameter optimization (HPO) methods, HYPPO uses adaptive surrogate models and directly accounts for uncertainty in model predictions to find accurate and reliable models that make robust predictions. Using asynchronous nested parallelism, we are able to significantly alleviate the computational burden of training complex architectures and quantifying the uncertainty. HYPPO is implemented in Python and can be used with both TensorFlow and PyTorch libraries. We demonstrate various software features on time-series prediction and image classification problems as well as a scientific application in computed tomography image reconstruction. Finally, we show that (1) we can reduce by an order of magnitude the number of evaluations necessary to find the most optimal region in the hyperparameter space and (2) we can reduce by two orders of magnitude the throughput for such an HPO process to complete.

preprint · 2020 · arXiv

Do optimization methods in deep learning applications matter?

With advances in deep learning, exponential data growth and increasing model complexity, the development of efficient optimization methods is attracting much research attention. Several implementations favor the use of Conjugate Gradient (CG) and Stochastic Gradient Descent (SGD) as practical and elegant solutions to achieve quick convergence; however, these optimization processes also present many limitations in learning across deep learning applications. Recent research is exploring higher-order optimization functions as better approaches, but these present very complex computational challenges for practical use. Comparing first and higher-order optimization functions, our experiments in this paper reveal that Levenberg-Marquardt (LM) achieves significantly better convergence but suffers from very large processing time, increasing the training complexity of both classification and reinforcement learning problems. Our experiments compare off-the-shelf optimization functions (CG, SGD, LM and L-BFGS) in standard CIFAR, MNIST, CartPole and FlappyBird experiments. The paper presents arguments on which optimization functions to use and, further, which functions would benefit from parallelization efforts to improve pretraining time and learning rate convergence.

preprint · 2020 · arXiv

Dynamic Graph Neural Network for Traffic Forecasting in Wide Area Networks

Wide area networking infrastructures (WANs), particularly science and research WANs, are the backbone for moving large volumes of scientific data between experimental facilities and data centers. With demands growing at exponential rates, these networks are struggling to cope with large data volumes, real-time responses, and overall network performance. Network operators are increasingly looking for innovative ways to manage the limited underlying network resources. Forecasting network traffic is a critical capability for proactive resource management, congestion mitigation, and dedicated transfer provisioning. To this end, we propose a nonautoregressive graph-based neural network for multistep network traffic forecasting. Specifically, we develop a dynamic variant of diffusion convolutional recurrent neural networks to forecast traffic in research WANs. We evaluate the efficacy of our approach on real traffic from ESnet, the U.S. Department of Energy's dedicated science network. Our results show that compared to classical forecasting methods, our approach explicitly learns the dynamic nature of spatiotemporal traffic patterns, showing significant improvements in forecasting accuracy. Our technique can surpass existing statistical and deep learning approaches by achieving approximately 20% mean absolute percentage error for multiple hours of forecasts despite dynamic network traffic settings.
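For readers unfamiliar with the metric quoted in this abstract, mean absolute percentage error (MAPE) is conventionally the average of |actual − forecast| / |actual|, expressed as a percentage. The sketch below shows that standard definition only; the function name and the sample numbers are illustrative assumptions, not data or code from the paper.

```python
def mape(actual, forecast):
    """Mean absolute percentage error, in percent (standard definition)."""
    if len(actual) == 0 or len(actual) != len(forecast):
        raise ValueError("inputs must be non-empty and of equal length")
    if any(a == 0 for a in actual):
        # MAPE is undefined when an observed value is zero.
        raise ValueError("actual values must be non-zero")
    total = sum(abs(a - f) / abs(a) for a, f in zip(actual, forecast))
    return 100.0 * total / len(actual)


if __name__ == "__main__":
    # Made-up traffic volumes: each forecast is off by 20% of the actual value.
    print(mape([100.0, 200.0], [80.0, 240.0]))
```

Note that MAPE weights errors relative to the observed value, so it penalizes misses during low-traffic periods more heavily than the same absolute miss during peaks.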

preprint · 2020 · arXiv

Towards AI-enabled Control for Enhancing Quantum Transduction

With the advent of the quantum internet, it becomes crucial to find novel ways to connect distributed quantum testbeds and to develop novel technologies and research that extend innovations in managing qubit performance. Numerous emerging technologies are focused on quantum repeaters and specialized hardware to extend the quantum distance over special-purpose channels. However, there is little work that utilizes current network technology, invested in optic technologies, to merge with quantum technologies. In this paper we argue for an AI-enabled control that allows optimized and efficient conversion between qubit and photon energies, to enable optic and quantum devices to work together. Our approach integrates AI techniques, such as deep reinforcement learning algorithms, with a physical quantum transducer to inform real-time conversion between the two wavelengths. Learning from a simulated environment, the trained AI-enabled transducer will lead to optimal quantum transduction to maximize the qubit lifetime.

preprint · 2014 · arXiv

Converting a Systems Dynamic Model to an Agent-based model for studying the Bicoid morphogen gradient in Drosophila embryo

The concentration gradient of the Bicoid morphogen, which is established during the early stages of Drosophila melanogaster embryonic development, determines the differential spatial patterns of gene expression and subsequent cell fate determination. This is mainly achieved by diffusion elicited by the different concentrations of the Bicoid protein in the embryo. Such chemical dynamics can be simulated by stochastic models, particularly the Gillespie algorithm. However, as with various modelling approaches in biology, each technique involves drawing assumptions and reducing the model complexity, sometimes limiting the model capability. This is mainly due to the complexity of the software modelling approaches used to construct these models. Agent-based modelling is a technique which is becoming increasingly popular for modelling the behaviour of individual molecules or cells in computational biology. This paper attempts to compare these two popular modelling techniques, stochastic and agent-based modelling, to show how the model can be studied in detail using the different approaches. This paper presents how to use these techniques, with the advantages and disadvantages of each. Through various comparisons, such as computation complexity and results obtained, we show that although the same model is implemented, both approaches can give varying results. The results of the paper show that the stochastic model is able to give smoother results compared to the agent-based model, which may need further analysis at a later stage. We discuss the reasons for these results and how these could be rectified in systems biology research.
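As a reminder of what the Gillespie algorithm does, here is a minimal sketch for a single well-mixed species with constant production and first-order degradation. This is not the paper's Bicoid model, which also involves diffusion along the embryo; the function name, rate constants, and seed are assumptions for illustration.

```python
import random


def gillespie_birth_death(k_prod, k_deg, n0, t_end, seed=0):
    """Simulate one trajectory of a birth-death process with the
    Gillespie stochastic simulation algorithm (direct method)."""
    rng = random.Random(seed)
    t, n = 0.0, n0
    times, counts = [t], [n]
    while True:
        # Propensities: production is constant, degradation scales with n.
        a_prod = k_prod
        a_deg = k_deg * n
        a_total = a_prod + a_deg
        if a_total == 0:
            break  # no reaction can fire
        # The waiting time to the next reaction is exponentially distributed.
        t += rng.expovariate(a_total)
        if t > t_end:
            break
        # Choose which reaction fires, in proportion to its propensity.
        if rng.random() * a_total < a_prod:
            n += 1
        else:
            n -= 1
        times.append(t)
        counts.append(n)
    return times, counts


if __name__ == "__main__":
    times, counts = gillespie_birth_death(k_prod=10.0, k_deg=0.1, n0=0, t_end=50.0)
    print(len(times), counts[-1])
```

With these assumed rates the molecule count fluctuates around `k_prod / k_deg`. An agent-based version of the same process would instead track each molecule individually and update it at fixed time steps, which is one source of the differences the abstract mentions.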

preprint · 2014 · arXiv

Experimental Report on Setting up a Cloud Computing Environment at the University of Bradford

Cloud computing is attracting increasing attention in both academic research and industrial initiatives. It is emerging as a popular paradigm and an attractive model for providing computing, information technology (IT) infrastructure, network and storage to large and small enterprises in both the private and public sectors. This project was initiated with the aim of designing and setting up a basic cloud lab testbed, running on OpenStack under VirtualBox, for experiments and hosting cloud platforms in the networking laboratory at the University of Bradford. This report presents the methodology of setting up a cloud lab testbed for experiments running on OpenStack. Current resources in the networking lab at the university were used and turned into virtual platforms for cloud computing testing. This report serves as a practical guideline, concentrating on the practical infrastructure-related questions and issues, on setting up a cloud lab for testing and proof of concept. Finally, the report proposes an experimental validation showing the feasibility of migrating to the cloud. The primary focus of this report is to provide a brief background on different theoretical concepts of cloud computing, particularly virtualisation, and then to elaborate on the practical aspects concerning the setup and implementation of a cloud lab testbed using open source solutions. This report serves as a reference for institutions looking at the possibilities of implementing cloud solutions, in order to benefit from getting the basics and a view on the different aspects of cloud migration concepts.

preprint · 2014 · arXiv

Using FLAME Toolkit for Agent-Based Simulation: Case Study Sugarscape Model

Social scientists have used agent-based models to understand how individuals interact and behave in various political, ecological and economic scenarios. Agent-based models are ideal for understanding such systems, in which interacting individuals produce emergent phenomena. Sugarscape is one of the most famous examples of a social agent-based model, and has been used to show how societies grow in the real world. This paper builds on the Sugarscape model, using the Flexible Large scale Agent-based modelling Environment (FLAME) to simulate three different scenarios of the experiment, which are based on the Sugar and Citizen locations. FLAME is an agent-based modelling framework which has previously been used to model biological and economic models. The paper includes details on how the model was written and the various parameters set for the simulation. The results of the simulated model are processed for three scenarios and analysed to see what effect the initial starting states of the agents had on the overall result obtained through the model, and the variance in simulation time when processing the model on multicore architectures. The experiments highlight that there are limitations of the FLAME framework, and of writing simulation models in general, which are highly dependent on the initial starting states of a model. They also raise further potential work which can be built into the Sugarscape model to study other interesting phenomena in social and economic laws.
