Source author record

Tao Fu

Tao Fu appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher, unclaimed source record

eess.SY, Systems and Control, Databases, Machine Learning, nucl-ex

Catalog footprint

What is connected

5 works

5 topics

4 close collaborators

Preprint, 2022, arXiv

Aggregate Queries on Knowledge Graphs: Fast Approximation with Semantic-aware Sampling

A knowledge graph (KG) manages large-scale and real-world facts as a big graph in a schema-flexible manner. Aggregate query is a fundamental query over KGs, e.g., "what is the average price of cars produced in Germany?". Despite its importance, answering aggregate queries on KGs has received little attention in the literature. Aggregate queries can be supported based on factoid queries, e.g., "find all cars produced in Germany", by applying an additional aggregate operation on factoid queries' answers. However, this straightforward method is challenging because both the accuracy and efficiency of factoid query processing will seriously impact the performance of aggregate queries. In this paper, we propose a "sampling-estimation" model to answer aggregate queries over KGs, which is the first work to provide an approximate aggregate result with an effective accuracy guarantee, and without relying on factoid queries. Specifically, we first present a semantic-aware sampling to collect a high-quality random sample through a random walk based on knowledge graph embedding. Then, we propose unbiased estimators for COUNT, SUM, and a consistent estimator for AVG to compute the approximate aggregate results based on the random sample, with an accuracy guarantee in the form of confidence interval. We extend our approach to support iterative improvement of accuracy, and more complex queries with filter, GROUP-BY, and different graph shapes, e.g., chain, cycle, star, flower. Extensive experiments over real-world KGs demonstrate the effectiveness and efficiency of our approach.
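The "sampling-estimation" idea in this abstract can be illustrated with a generic estimator. The sketch below is not the paper's method: it assumes a Hansen-Hurwitz style estimator over draws made with replacement and with known draw probabilities, plus a normal-approximation confidence interval. The function names `hh_estimate` and `aggregate` and the `(value, probability, is_match)` tuple layout are hypothetical.

```python
import math
from statistics import NormalDist, mean, stdev


def hh_estimate(terms, confidence=0.95):
    """Mean of per-draw terms with a normal-approximation confidence interval.

    Assumes at least two draws, since the sample standard deviation is used.
    """
    n = len(terms)
    est = mean(terms)
    z = NormalDist().inv_cdf(0.5 + confidence / 2)
    half_width = z * stdev(terms) / math.sqrt(n)
    return est, (est - half_width, est + half_width)


def aggregate(draws, confidence=0.95):
    """Estimate SUM, COUNT and AVG from weighted draws.

    draws: list of (value, probability, is_match) tuples, where probability
    is the (assumed known) chance of drawing that entity in a single draw.
    """
    # Each draw contributes value / probability, which makes the mean of the
    # terms an unbiased estimate of the population total.
    sum_terms = [v / p if m else 0.0 for v, p, m in draws]
    count_terms = [1.0 / p if m else 0.0 for v, p, m in draws]
    total, total_ci = hh_estimate(sum_terms, confidence)
    count, count_ci = hh_estimate(count_terms, confidence)
    # AVG is a ratio of two estimates: consistent, but not unbiased.
    avg = total / count if count else float("nan")
    return {"SUM": (total, total_ci), "COUNT": (count, count_ci), "AVG": avg}
```

In the abstract's setting the draw probabilities would come from the semantic-aware random walk; here they are simply passed in as an assumption.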

Preprint, 2022, arXiv

Efficient Topology Assessment for Integrated Transmission and Distribution Network with 10,000+ Inverter-based Resources

The proliferation of renewable energy calls on grid operators and planners to systematically evaluate the potential impacts of distributed energy resources (DERs). Considering the significant differences among inverter-based resources (IBRs), especially the different capabilities of grid-forming and grid-following inverters, it is crucial to develop an efficient and effective assessment procedure to complement the available co-simulation frameworks, which carry a high computational burden. This paper presents a streamlined graph-based topology assessment for integrated power system transmission and distribution networks. Graph analyses were performed on the integrated graph of a modified miniWECC grid model and the IEEE 8500-node test feeder model. A high-performance computing platform with 40 nodes and a total of 2400 CPUs was used to process this integrated graph, which has 100,000+ nodes and 10,000+ IBRs. The node ranking results not only verified the applicability of the proposed method but also revealed the potential for distributed grid-forming (GFM) and grid-following (GFL) inverters to interact with the centralized power plants.

Preprint, 2022, arXiv

Predicting Peak Day and Peak Hour of Electricity Demand with Ensemble Machine Learning

Battery energy storage systems can be used for peak demand reduction in power systems, leading to significant economic benefits. Two practical challenges are 1) accurately determining the peak load days and hours and 2) quantifying and reducing uncertainties associated with the forecast in probabilistic risk measures for dispatch decision-making. In this study, we develop a supervised machine learning approach to generate 1) the probability of the next operation day containing the peak hour of the month and 2) the probability of an hour to be the peak hour of the day. Guidance is provided on the preparation and augmentation of data as well as the selection of machine learning models and decision-making thresholds. The proposed approach is applied to the Duke Energy Progress system and successfully captures 69 peak days out of 72 testing months with a 3% exceedance probability threshold. On 90% of the peak days, the actual peak hour is among the 2 hours with the highest probabilities.
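The two outputs described in this abstract feed a simple dispatch decision. The sketch below shows one way that decision could look. It assumes the 3% threshold is applied directly to the predicted peak-day probability and that the two most probable hours are selected; the function names and input shapes are hypothetical and the probabilities would come from a separately trained model.

```python
def flag_peak_days(day_probs, threshold=0.03):
    """Return the days whose predicted peak-day probability meets the threshold.

    day_probs: mapping of day label -> predicted probability that the day
    contains the monthly peak hour (assumed to come from a trained model).
    """
    return [day for day, prob in day_probs.items() if prob >= threshold]


def top_hours(hour_probs, k=2):
    """Return the indices of the k hours with the highest peak-hour probability.

    hour_probs: list of per-hour probabilities for a single day.
    """
    ranked = sorted(range(len(hour_probs)), key=lambda h: hour_probs[h], reverse=True)
    return ranked[:k]
```

A low threshold trades more flagged days, and so more battery dispatches, for a smaller chance of missing the true peak day.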

Preprint, 2021, arXiv

Component Importance and Interdependence Analysis for Transmission, Distribution and Communication Systems

In critical infrastructure restoration planning, and in the real-time scheduling and coordination of system restoration efforts, the key to decision-making is prioritizing the critical components that are out of service during the restoration. This calls for component importance analysis. While such analysis has been investigated extensively for individual systems, component importance that accounts for interdependence among transmission, distribution and communication (T&D&C) systems has not been systematically analyzed or widely adopted. In this study, we propose a component importance assessment method in the context of interdependence between T&D&C networks. Analytic methods for multilayer networks and a set of metrics have been applied to assess the component importance and interdependence between T&D&C networks based on their physical characteristics. The proposed methodology is further validated with integrated synthetic Illinois regional T&D&C systems. The results reveal the unique characteristics of component/node importance, which may be strongly affected by the network topology and cross-domain node mapping.

Preprint, 2013, arXiv

Tritium and helium analyses in thin films by enhanced proton backscattering

To perform quantitative tritium and helium analysis in thin-film samples using enhanced proton backscattering (EPBS), EPBS spectra for several samples consisting of non-RBS light elements (i.e., T, 4He, 12C, 16O, natSi) and medium and heavy elements have been measured and analyzed using the analytical SIMNRA code and the Monte Carlo-based CORTEO code. The CORTEO code used in this paper is modified to incorporate non-RBS cross sections of proton scattering from T, 4He, 12C, 14N, 16O and natSi, taken from the ENDF/B-VII.1 database and from calculations of the SigmaCalc code. All cross section data needed in the CORTEO code over the entire proton incident energy-scattering angle plane are obtained by interpolation. It is quantitatively observed that multiple and plural scattering effects have little impact on energy spectra for light elements such as T, He, C, O and Si, and that the RBS cross sections of light elements, instead of the non-RBS cross sections, can be used in the SIMNRA code for dual scattering calculations in EPBS analysis. It is also observed that, in the low-energy part of the energy spectrum, the results given by the CORTEO code are higher than those of the SIMNRA code and agree better with the experimental data, especially when heavier elements are present in the samples. For tritium analysis, the tritium depth distributions should not simply be adjusted to fit the experimental spectra when the multiple and plural scattering contributions are not completely accounted for, or else inaccurate results may be obtained. For medium and heavy matrix elements, when full Monte Carlo RBS calculations are used in the CORTEO code, its results are in good agreement with the experimental results in the low-energy part of the energy spectra. In that case, quantitative tritium and helium analysis in thin-film samples by enhanced proton backscattering can be performed reliably.
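The abstract states that cross sections over the energy-angle plane are obtained by interpolation but does not name the scheme. As an illustration only, the sketch below assumes bilinear interpolation on a rectangular grid; the function name `bilinear` and the table layout are hypothetical and are not taken from CORTEO.

```python
from bisect import bisect_right


def bilinear(energies, angles, table, energy, angle):
    """Bilinearly interpolate table[i][j] defined at (energies[i], angles[j]).

    energies and angles must be strictly increasing with at least two points
    each. Queries outside the grid raise ValueError rather than extrapolate.
    """
    if not (energies[0] <= energy <= energies[-1]):
        raise ValueError("energy outside tabulated range")
    if not (angles[0] <= angle <= angles[-1]):
        raise ValueError("angle outside tabulated range")

    # Index of the grid cell containing the query point, clamped so that a
    # query on the upper edge still uses the last valid cell.
    i = min(max(bisect_right(energies, energy) - 1, 0), len(energies) - 2)
    j = min(max(bisect_right(angles, angle) - 1, 0), len(angles) - 2)

    # Fractional position of the query inside the cell along each axis.
    te = (energy - energies[i]) / (energies[i + 1] - energies[i])
    ta = (angle - angles[j]) / (angles[j + 1] - angles[j])

    # Interpolate along the angle axis first, then along the energy axis.
    low = table[i][j] * (1 - ta) + table[i][j + 1] * ta
    high = table[i + 1][j] * (1 - ta) + table[i + 1][j + 1] * ta
    return low * (1 - te) + high * te
```

Bilinear interpolation is only reasonable where the tabulated cross section varies smoothly between grid points; near sharp resonances a denser grid or a different scheme would be needed.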
