Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
20works
0followers
20topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

20 published item(s)

preprint2026arXiv

Separation Assurance between Heterogeneous Fleets of Small Unmanned Aerial Systems via Multi-Agent Reinforcement Learning

In the envisioned future dense urban airspace, multiple companies will operate heterogeneous fleets of small unmanned aerial systems (sUASs), where each fleet includes several homogeneous aircraft with identical policies and configurations, e.g., equipage, sensing, and communication ranges, making tactical deconfliction highly complex for the aircraft. This paper aims to address two core questions: (1) Can tactical deconfliction policies converge or reach an equilibrium to ensure a conflict-free airspace when companies operate heterogeneous fleets of homogeneous aircraft? (2) If so, will the converged policies discriminate against companies operating sUASs with weaker configurations? We investigate a multi-agent reinforcement learning paradigm in which homogeneous aircraft within heterogeneous fleets operate concurrently to perform package delivery missions over Dallas, Texas, USA. An attention-enhanced Proximal Policy Optimization-based Advantage Actor-Critic (PPOA2C) framework is employed to resolve intra- and inter-fleet conflicts, with each fleet independently training its own policy while preserving privacy. Experimental results show that two fleets with distinct, shared PPOA2C policies can reach an equilibrium to maintain safe separation. While two PPOA2C policies outperform two strong rule-based baselines in terms of conflict resolution, a PPOA2C policy exhibits safer interaction with a rule-based policy, indicating adaptive capabilities of PPOA2C policies. Furthermore, we conducted extensive policy-configuration evaluations, which reveal that equilibria between similar policy types tend to favor fleets with stronger configurations. Even under similar configurations but different policy types, the equilibrium favors one of the heterogeneous policies, underscoring the need for fairness-aware conflict management in heterogeneous sUAS operations.

preprint2022arXiv

A Verification Framework for Certifying Learning-Based Safety-Critical Aviation Systems

We present a safety verification framework for design-time and run-time assurance of learning-based components in aviation systems. Our proposed framework integrates two novel methodologies. From the design-time assurance perspective, we propose offline mixed-fidelity verification tools that incorporate knowledge from different levels of granularity in simulated environments. From the run-time assurance perspective, we propose reachability- and statistics-based online monitoring and safety guards for a learning-based decision-making model to complement the offline verification methods. This framework is designed to be loosely coupled among modules, allowing the individual modules to be developed using independent methodologies and techniques, under varying circumstances and with different tool access. The proposed framework offers feasible solutions for meeting system safety requirements at different stages throughout the system development and deployment cycle, enabling the continuous learning and assessment of the system product.

preprint2022arXiv

Comparison of Effect Size Measures for Mediation Analysis of Survival Outcomes with Application to the Framingham Heart Study

There is an increasing trend of research in mediation analysis for survival outcomes. Such analyses help researchers to better understand how exposure affects disease outcomes through mediators. However, due to censored observations in survival outcomes, it is not straightforward to extend mediation analysis from linear models to survival outcomes. In this article, we extend a mediation effect size measure based on $R^2$ in linear regression to survival outcomes. Due to multiple definitions of $R^2$ for survival models, we compare and evaluate five $R^2$ measures for mediation analysis. Based on extensive simulations, we recommend two $R^2$ measures with good operating characteristics. We illustrate the utility of the $R^2$-based mediation measures by analyzing the mediation effects of multiple lifestyle risk factors on the relationship between environmental exposures and time to coronary heart disease and all-cause mortality in the Framingham Heart Study.

preprint2022arXiv

Reinforcement Learning-Empowered Mobile Edge Computing for 6G Edge Intelligence

Mobile edge computing (MEC) is considered a novel paradigm for computation-intensive and delay-sensitive tasks in fifth generation (5G) networks and beyond. However, its uncertainty, referred to as dynamic and randomness, from the mobile device, wireless channel, and edge network sides, results in high-dimensional, nonconvex, nonlinear, and NP-hard optimization problems. Thanks to the evolved reinforcement learning (RL), upon iteratively interacting with the dynamic and random environment, its trained agent can intelligently obtain the optimal policy in MEC. Furthermore, its evolved versions, such as deep RL (DRL), can achieve higher convergence speed efficiency and learning accuracy based on the parametric approximation for the large-scale state-action space. This paper provides a comprehensive research review on RL-enabled MEC and offers insight for development in this area. More importantly, associated with free mobility, dynamic channels, and distributed services, the MEC challenges that can be solved by different kinds of RL algorithms are identified, followed by how they can be solved by RL solutions in diverse mobile applications. Finally, the open challenges are discussed to provide helpful guidance for future research in RL training and learning MEC.

preprint2022arXiv

Safety Enhancement for Deep Reinforcement Learning in Autonomous Separation Assurance

The separation assurance task will be extremely challenging for air traffic controllers in a complex and high density airspace environment. Deep reinforcement learning (DRL) was used to develop an autonomous separation assurance framework in our previous work where the learned model advised speed maneuvers. In order to improve the safety of this model in unseen environments with uncertainties, in this work we propose a safety module for DRL in autonomous separation assurance applications. The proposed module directly addresses both model uncertainty and state uncertainty to improve safety. Our safety module consists of two sub-modules: (1) the state safety sub-module is based on the execution-time data augmentation method to introduce state disturbances in the model input state; (2) the model safety sub-module is a Monte-Carlo dropout extension that learns the posterior distribution of the DRL model policy. We demonstrate the effectiveness of the two sub-modules in an open-source air traffic simulator with challenging environment settings. Through extensive numerical experiments, our results show that the proposed sub-safety modules help the DRL agent significantly improve its safety performance in an autonomous separation assurance task.

preprint2021arXiv

Scaling Up Hardware Accelerator Verification using A-QED with Functional Decomposition

Hardware accelerators (HAs) are essential building blocks for fast and energy-efficient computing systems. Accelerator Quick Error Detection (A-QED) is a recent formal technique which uses Bounded Model Checking for pre-silicon verification of HAs. A-QED checks an HA for self-consistency, i.e., whether identical inputs within a sequence of operations always produce the same output. Under modest assumptions, A-QED is both sound and complete. However, as is well-known, large design sizes significantly limit the scalability of formal verification, including A-QED. We overcome this scalability challenge through a new decomposition technique for A-QED, called A-QED with Decomposition (A-QED$^2$). A-QED$^2$ systematically decomposes an HA into smaller, functional sub-modules, called sub-accelerators, which are then verified independently using A-QED. We prove completeness of A-QED$^2$; in particular, if the full HA under verification contains a bug, then A-QED$^2$ ensures detection of that bug during A-QED verification of the corresponding sub-accelerators. Results on over 100 (buggy) versions of a wide variety of HAs with millions of logic gates demonstrate the effectiveness and practicality of A-QED$^2$.

preprint2020arXiv

A Deep Multi-Agent Reinforcement Learning Approach to Autonomous Separation Assurance

A novel deep multi-agent reinforcement learning framework is proposed to identify and resolve conflicts among a variable number of aircraft in a high-density, stochastic, and dynamic sector. Currently the sector capacity is constrained by human air traffic controller's cognitive limitation. We investigate the feasibility of a new concept (autonomous separation assurance) and a new approach to push the sector capacity above human cognitive limitation. We propose the concept of using distributed vehicle autonomy to ensure separation, instead of a centralized sector air traffic controller. Our proposed framework utilizes Proximal Policy Optimization (PPO) that we modify to incorporate an attention network. This allows the agents to have access to variable aircraft information in the sector in a scalable, efficient approach to achieve high traffic throughput under uncertainty. Agents are trained using a centralized learning, decentralized execution scheme where one neural network is learned and shared by all agents. The proposed framework is validated on three challenging case studies in the BlueSky air traffic control environment. Numerical results show the proposed framework significantly reduces offline training time, increases performance, and results in a more efficient policy.

preprint2020arXiv

A Survival Mediation Model with Bayesian Model Averaging

Determining the extent to which a patient is benefiting from cancer therapy is challenging. Criteria for quantifying the extent of "tumor response" observed within a few cycles of treatment have been established for various types of solid as well as hematologic malignancies. These measures comprise the primary endpoints of phase II trials. Regulatory approvals of new cancer therapies, however, are usually contingent upon the demonstration of superior overall survival with randomized evidence acquired with a phase III trial comparing the novel therapy to an appropriate standard of care treatment. With nearly two thirds of phase III oncology trials failing to achieve statistically significant results, researchers continue to refine and propose new surrogate endpoints. This article presents a Bayesian framework for studying relationships among treatment, patient subgroups, tumor response and survival. Combining classical components of mediation analysis with Bayesian model averaging (BMA), the methodology is robust to model mis-specification among various possible relationships among the observable entities. Posterior inference is demonstrated via application to a randomized controlled phase III trial in metastatic colorectal cancer. Moreover, the article details posterior predictive distributions of survival and statistical metrics for quantifying the extent of direct and indirect, or tumor response mediated, treatment effects.

preprint2020arXiv

Continuous Control for Searching and Planning with a Learned Model

Decision-making agents with planning capabilities have achieved huge success in the challenging domain like Chess, Shogi, and Go. In an effort to generalize the planning ability to the more general tasks where the environment dynamics are not available to the agent, researchers proposed the MuZero algorithm that can learn the dynamical model through the interactions with the environment. In this paper, we provide a way and the necessary theoretical results to extend the MuZero algorithm to more generalized environments with continuous action space. Through numerical results on two relatively low-dimensional MuJoCo environments, we show the proposed algorithm outperforms the soft actor-critic (SAC) algorithm, a state-of-the-art model-free deep reinforcement learning algorithm.

preprint2020arXiv

Physical Properties of H II Regions in M51 from Spectroscopic Observations

M51 and NGC 5195 is an interacting system that can be explored in great details with ground-based telescopes. The H II regions in M51 were observed using the 2.16 m telescope of the National Astronomical Observatories of the Chinese Academy of Sciences and the 6.5 m Multiple Mirror Telescope with spatial resolution of less than $\sim100$ pc. We obtain a total of 113 spectra across the galaxy and combine the literature data of Croxall et al. to derive a series of physical properties, including the gas-phase extinction, stellar population age, star formation rate (SFR) surface density, and oxygen abundance. The spatial distributions and radial profiles of these properties are investigated in order to study the characteristics of M51 and the clues to the formation and evolution of this galaxy. M51 presents a mild radial extinction gradient. The lower gas-phase extinction in the north spiral arms compared to the south arms are possibly caused by the past encounters with the companion galaxy of NGC 5195. A number of H II regions have the stellar age between 50 and 500 Myr, consistent with the recent interaction history by simulations in the literatures. The SFR surface density presents a mild radial gradient, which is ubiquitous in spiral galaxies. There is a negative metallicity gradient of $-0.08$ dex $R_{e}^{-1}$ in the disk region, which is also commonly found in many spiral galaxies. It is supported by the "inside-out" scenario of galaxy formation. We find a positive abundance gradient of 0.26 dex $R_{e}^{-1}$ in the inner region. There are possible reasons causing the positive gradient, including the freezing of the chemical enrichment due to the star-forming quenching in the bulge and the gas infall and dilution due to the pseudobulge growth and/or galactic interaction.

preprint2020arXiv

Prioritized Sequence Experience Replay

Experience replay is widely used in deep reinforcement learning algorithms and allows agents to remember and learn from experiences from the past. In an effort to learn more efficiently, researchers proposed prioritized experience replay (PER) which samples important transitions more frequently. In this paper, we propose Prioritized Sequence Experience Replay (PSER) a framework for prioritizing sequences of experience in an attempt to both learn more efficiently and to obtain better performance. We compare the performance of PER and PSER sampling techniques in a tabular Q-learning environment and in DQN on the Atari 2600 benchmark. We prove theoretically that PSER is guaranteed to converge faster than PER and empirically show PSER substantially improves upon PER.

preprint2020arXiv

Scalable FastMDP for Pre-departure Airspace Reservation and Strategic De-conflict

Pre-departure flight plan scheduling for Urban Air Mobility (UAM) and cargo delivery drones will require on-demand scheduling of large numbers of aircraft. We examine the scalability of an algorithm known as FastMDP which was shown to perform well in deconflicting many dozens of aircraft in a dense airspace environment with terrain. We show that the algorithm can adapted to perform first-come-first-served pre-departure flight plan scheduling where conflict free flight plans are generated on demand. We demonstrate a parallelized implementation of the algorithm on a Graphics Processor Unit (GPU) which we term FastMDP-GPU and show the level of performance and scaling that can be achieved. Our results show that on commodity GPU hardware we can perform flight plan scheduling against 2000-3000 known flight plans and with server-class hardware the performance can be higher. We believe the results show promise for implementing a large scale UAM scheduler capable of performing on-demand flight scheduling that would be suitable for both a centralized or distributed flight planning system

preprint2020arXiv

Signature of a pair of Majorana zero modes in superconducting gold surface states

Under certain conditions, a fermion in a superconductor can separate in space into two parts known as Majorana zero modes, which are immune to decoherence from local noise sources and are attractive building blocks for quantum computers. Promising experimental progress has been made to demonstrate Majorana zero modes in materials with strong spin-orbit coupling proximity coupled to superconductors. Here we report signatures of Majorana zero modes in a new material platform utilizing the surface states of gold. Using scanning tunneling microscope to probe EuS islands grown on top of gold nanowires, we observe two well separated zero bias tunneling conductance peaks aligned along the direction of the applied magnetic field, as expected for a pair of Majorana zero modes. This platform has the advantage of having a robust energy scale and the possibility of realizing complex designs using lithographic methods.

preprint2020arXiv

Site testing campaign for the Large Optical/infrared Telescope of China: Overview

The Large Optical/infrared Telescope (LOT) is a ground-based 12m diameter optical/infrared telescope which is proposed to be built in the western part of China in the next decade. Based on satellite remote sensing data, along with geographical, logistical and political considerations, three candidate sites were chosen for ground-based astronomical performance monitoring. These sites include: Ali in Tibet, Daocheng in Sichuan, and Muztagh Ata in Xinjiang. Up until now, all three sites have continuously collected data for two years. In this paper, we will introduce this site testing campaign, and present its monitoring results obtained during the period between March 2017 and March 2019.

preprint2020arXiv

Site-testing at Muztagh-ata site I: Ground Meteorology and Sky Brightness

Site-testing is crucial for achieving the goal of scientific research and analysis of meteorological and optical observing conditions is one of the basic tasks of it. As one of three potential sites to host 12-meter Large Optical/infrared Telescope (LOT), Muztagh-ata site which is located on the Pamirs Plateau in west China's Xinjiang began its site-testing task in the spring of 2017. In this paper, we firstly start with an introduction to the site and then present a statistical analysis of the ground-level meteorological properties such as air temperature, barometric pressure, relative humidity, wind speed and direction, recorded by automatic weather station with standard meteorological sensors for two-year long. We also show the monitoring results of sky brightness during this period.

preprint2020arXiv

Site-testing at Muztagh-ata site II: Seeing statistics

In this article, we present a detailed analysis of the statistical properties of seeing for the Muztagh-ata site which is the candidate site for hosting future Chinese Large Optical/infrared Telescope (LOT) project. The measurement was obtained with Differential Image Motion Monitor (DIMM) from April 2017 to November 2018 at different heights during different periods. The median seeing at 11 meters and 6 meters are very close but different significantly from that on the ground. We mainly analyzed the seeing at 11 meters monthly and hourly, having found that the best season for observing was from late autumn to early winter and seeing tended to improve during the night only in autumn. The analysis of the dependence on temperature inversion, wind speed, direction also was made and the best meteorological conditions for seeing is given.

preprint2020arXiv

Spatially-resolved Stellar Population Properties of the M 51-NGC 5195 System from Multi-wavelength Photometric Data

Using multi-band photometric images of M 51 and its companion NGC 5195 from ultraviolet to optical and infrared, we investigate spatially resolved stellar population properties of this interacting system with stellar population synthesis models. The observed IRX is used to constrain dust extinction. Stellar mass is also inferred from the model fitting. By fitting observed spectral energy distributions (SEDs) with synthetical ones, we derive two-dimensional distributions of stellar age, metallicity, dust extinction, and stellar mass. In M 51, two grand-designed spiral arms extending from the bulge show young age, rich metallicity, and abundant dust. The inter-arm regions are filled with older, metal-poorer, and less dusty stellar populations. Except for the spiral arm extending from M 51 into NGC 5195, the stellar population properties of NGC 5195 are quite featureless. NGC 5195 is much older than M 51, and its core is very dusty with $A_V$ up to 1.67 mag and dense in stellar mass surface density. The close encounters might drive the dust in the spiral arm of M51 into the center of NGC 5195.

preprint2020arXiv

Strain controlled superconductivity in few-layer NbSe2

The controlled tunability of superconductivity in low-dimensional materials may enable new quantum devices. Particularly in triplet or topological superconductors, tunneling devices such as Josephson junctions etc. can demonstrate exotic functionalities. The tunnel barrier, an insulating or normal material layer separating two superconductors, is a key component for the junctions. Thin layers of NbSe2 have been shown as a superconductor with strong spin orbit coupling, which can give rise to topological superconductivity if driven by a large magnetic exchange field. Here we demonstrate the superconductor-insulator transitions in epitaxially grown few-layer NbSe2 with wafer-scale uniformity on insulating substrates. We provide the electrical transport, Raman spectroscopy, cross-sectional transmission electron microscopy, and X-ray diffraction characterizations of the insulating phase. We show that the superconductor-insulator transition is driven by strain, which also causes characteristic energy shifts of the Raman modes. Our observation paves the way for high quality hetero-junction tunnel barriers to be seamlessly built into epitaxial NbSe2 itself, thereby enabling highly scalable tunneling devices for superconductor-based quantum electronics.

preprint2020arXiv

The Third Data Release of the Beijing-Arizona Sky Survey

The Beijing-Arizona Sky Survey (BASS) is a wide and deep imaging survey to cover a 5400 deg$^2$ area in the Northern Galactic Cap with the 2.3m Bok telescope using two filters ($g$ and $r$ bands). The Mosaic $z$-band Legacy Survey (MzLS) covers the same area in $z$ band with the 4m Mayall telescope. These two surveys will be used for spectroscopic targeting of the Dark Energy Spectroscopic Instrument (DESI). The BASS survey observations were completed in 2019 March. This paper describes the third data release (DR3) of BASS, which contains the photometric data from all BASS and MzLS observations between 2015 January and 2019 March. The median astrometric precision relative to {\it Gaia} positions is about 17 mas and the median photometric offset relative to the PanSTARRS1 photometry is within 5 mmag. The median $5σ$ AB magnitude depths for point sources are 24.2, 23.6, and 23.0 mag for $g$, $r$, and $z$ bands, respectively. The photometric depth within the survey area is highly homogeneous, with the difference between the 20\% and 80\% depth less than 0.3 mag. The DR3 data, including raw data, calibrated single-epoch images, single-epoch photometric catalogs, stacked images, and co-added photometric catalogs, are publicly accessible at \url{http://batc.bao.ac.cn/BASS/doku.php?id=datarelease:home}.

preprint2019arXiv

Electroconvection of Thin Liquid Crystals: Model Reduction and Numerical Simulations

We propose a finite element method for the numerical simulation of electroconvection of thin liquid crystals. The liquid is located in between two concentric circular electrodes which are either assumed to be of infinite height or slim. Each configuration results in a different nonlocal electro-magnetic model defined on a two dimensional bounded domain. The numerical method consists in approximating the surface charge density, the liquid velocity and pressure, and the electric potential in the two dimensional liquid region. Finite elements for the space discretization coupled with standard time stepping methods are put forward. Unlike for the infinite electrodes configuration, our numerical simulations indicate that slim electrodes are favorable for electroconvection to occur and are able to sustain the phenomena over long period of time. Furthermore, we provide a numerical study on the influence of the three main parameters of the system: the Rayleigh number, the Prandtl number and the electrodes aspect ratio.