Source author record

Zhe Yu

Zhe Yu appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher · Unclaimed source record

Software Engineering, eess.SY, Systems and Control, eess.SP, astro-ph.CO, Computer Vision, gr-qc, Machine Learning, math.AP

Catalog footprint

What is connected

17 works

9 topics

4 close collaborators

preprint · 2026 · arXiv

Comparative Separation: Evaluating Separation on Comparative Judgment Test Data

This research seeks to benefit the software engineering society by proposing comparative separation, a novel group fairness notion to evaluate the fairness of machine learning software on comparative judgment test data. Fairness issues have attracted increasing attention since machine learning software is increasingly used for high-stakes and high-risk decisions. It is the responsibility of all software developers to make their software accountable by ensuring that the machine learning software does not perform differently on different sensitive groups -- satisfying the separation criterion. However, evaluation of separation requires ground truth labels for each test data point. This motivates our work on analyzing whether separation can be evaluated on comparative judgment test data. Instead of asking humans to provide the ratings or categorical labels on each test data point, comparative judgments are made between pairs of data points such as A is better than B. According to the law of comparative judgment, providing such comparative judgments yields a lower cognitive burden for humans than providing ratings or categorical labels. This work first defines the novel fairness notion comparative separation on comparative judgment test data, and the metrics to evaluate comparative separation. Then, both theoretically and empirically, we show that in binary classification problems, comparative separation is equivalent to separation. Lastly, we analyze the number of test data points and test data pairs required to achieve the same level of statistical power in the evaluation of separation and comparative separation, respectively. This work is the first to explore fairness evaluation on comparative judgment test data. It shows the feasibility and the practical benefits of using comparative judgment test data for model evaluations.
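For readers unfamiliar with the separation criterion mentioned in this abstract, the sketch below shows one common way to check it on labeled binary test data: compare the true positive rate and false positive rate between two sensitive groups. It illustrates ordinary separation only, not the paper's comparative separation metrics, which the abstract does not spell out. The function names and the toy data are assumptions made for illustration.

```python
from collections import defaultdict


def group_rates(y_true, y_pred, groups):
    """Return {group: (TPR, FPR)} for binary labels and binary predictions."""
    counts = defaultdict(lambda: {"tp": 0, "fn": 0, "fp": 0, "tn": 0})
    for t, p, g in zip(y_true, y_pred, groups):
        if t == 1:
            key = "tp" if p == 1 else "fn"
        else:
            key = "fp" if p == 1 else "tn"
        counts[g][key] += 1

    rates = {}
    for g, c in counts.items():
        pos = c["tp"] + c["fn"]
        neg = c["fp"] + c["tn"]
        tpr = c["tp"] / pos if pos else float("nan")
        fpr = c["fp"] / neg if neg else float("nan")
        rates[g] = (tpr, fpr)
    return rates


def separation_gaps(y_true, y_pred, groups, a, b):
    """Return (TPR gap, FPR gap) between groups a and b.

    Separation holds exactly when both gaps are zero.
    """
    rates = group_rates(y_true, y_pred, groups)
    return (rates[a][0] - rates[b][0], rates[a][1] - rates[b][1])


# Hypothetical toy data: 4 test points per group.
y_true = [1, 1, 0, 0, 1, 1, 0, 0]
y_pred = [1, 0, 0, 0, 1, 1, 1, 0]
groups = ["a", "a", "a", "a", "b", "b", "b", "b"]

print(separation_gaps(y_true, y_pred, groups, "a", "b"))  # (-0.5, -0.5)
```

Both gaps are nonzero here, so the toy classifier violates separation. Note that this check needs a ground truth label for every test point, which is the requirement the abstract sets out to relax.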

preprint · 2026 · arXiv

Impact of particle production during inflation on the CMB detection

This work focuses on particle production described by a nonminimally coupled model during inflation. In this model, three parameters determine the characteristic frequency and strength of the induced gravitational waves (GWs). Considering the impact of particle production on inflation, we identify the parameter values that generate the strongest GWs without violating the slow-roll mechanism at the CMB scale. However, even with such extreme parameters, the power spectrum of induced GWs is only about $0.3\%$ of that of vacuum GWs. This contribution remains insignificant when identifying the primary source of the detected CMB B-mode polarization. Furthermore, when our results are integrated with the constraints driven by P+ACT+LB+BK18, the contribution of induced GWs at CMB scales becomes negligible. In contrast, their impact on the scalar spectral index $n_s$ proves significant. For a range of parameter values, the Starobinsky inflation model yields predictions for $n_s$ that are consistent with the measurements obtained from P+ACT+LB+BK18.

preprint · 2024 · arXiv

Particle production during Inflation with a non-minimally coupled spectator scalar field

We study the inflationary model with a spectator scalar field $χ$ coupled to both the inflaton and Ricci scalar. The interaction between the $χ$ field and gravity, denoted by $ξRχ^2$, can trigger the tachyonic instability of certain modes of the $χ$ field. As a result, the $χ$ field perturbations are amplified and serve as a gravitational wave (GW) source. When considering the backreaction of the $χ$ field, an upper bound on the coupling parameter $ξ$ must be imposed to ensure that inflation does not end prematurely. In this case, we find that the inflaton's evolution experiences a sudden slowdown due to the production of $χ$ particles, resulting in a unique oscillating structure in the power spectrum of curvature perturbations at specific scales. Moreover, the GW signal induced by the $χ$ field is more significant than primordial GWs at around its peak scale, leading to a noticeable bump in the overall energy spectrum of GWs. It's worth noting that this bump predicted in the slow-roll inflationary scenario is unlikely to be detected by LISA and Taiji, but there is a slim chance it might approach the detection limits of GW experiments like BBO and SKA if we devise distinctive inflationary potentials.

preprint · 2023 · arXiv

Elliptic inequalities with nonlinear convolution and Hardy terms in cone-like domains

We study the inequality $ -Δu - \frac{μ}{|x|^2} u \geq (|x|^{-α} * u^p)u^q$ in an unbounded cone $\mathcal{C}_Ω^ρ\subset \mathbb{R}^N$ ($N\geq 2$) generated by a subdomain $Ω$ of the unit sphere $S^{N-1}\subset \mathbb{R}^N,$ $p, q, ρ>0$, $μ\in \mathbb{R}$ and $0\leq α< N$. In the above, $|x|^{-α} * u^p$ denotes the standard convolution operator in the cone $\mathcal{C}_Ω^ρ$. We discuss the existence and nonexistence of positive solutions in terms of $N, p, q, α, μ$ and $Ω$. Extensions to systems of inequalities are also investigated.

preprint · 2022 · arXiv

Assessing Expert System-Assisted Literature Reviews With a Case Study

Given the large number of publications in software engineering, frequent literature reviews are required to keep current on work in specific areas. One tedious task in literature reviews is finding relevant studies amongst thousands of non-relevant search results. In theory, expert systems can assist in finding relevant work but those systems have primarily been tested in simulations rather than in application to actual literature reviews. Hence, few researchers have faith in such expert systems. Accordingly, using a realistic case study, this paper assesses how well our state-of-the-art expert system can help with literature reviews. The assessed literature review aimed at identifying test case prioritization techniques for automated UI testing, specifically from 8,349 papers on IEEE Xplore. This corpus was studied with an expert system that incorporates an incrementally updated human-in-the-loop active learning tool. Using that expert system, in three hours, we found 242 relevant papers from which we identified 12 techniques representing the state-of-the-art in test case prioritization when source code information is not available. These results were then validated by six other graduate students manually exploring the same corpus. Without the expert system, this task would have required 53 hours and would have found 27 additional papers. That is, our expert system achieved 90% recall with 6% of the human effort cost when compared to a conventional manual method. Significantly, the same 12 state-of-the-art test case prioritization techniques were identified by both the expert system and the manual method. That is, the 27 papers missed by the expert system would not have changed the conclusion of the literature review. Hence, if this result generalizes, it endorses the use of our expert system to assist in literature reviews.
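The 90% recall and 6% effort figures in this abstract follow from the counts it reports. The short calculation below reproduces them, assuming (as the abstract implies) that the manual review found every relevant paper, so the relevant set is the 242 papers found with the tool plus the 27 it missed. The variable names are illustrative.

```python
# Figures quoted in the abstract.
found_by_tool = 242   # relevant papers found with the expert system
missed_by_tool = 27   # additional papers found only by the manual review
tool_hours = 3
manual_hours = 53

# Assumption: the manual review recovered the complete relevant set.
total_relevant = found_by_tool + missed_by_tool

recall = found_by_tool / total_relevant
effort_ratio = tool_hours / manual_hours

print(f"recall = {recall:.1%}")        # recall = 90.0%
print(f"effort = {effort_ratio:.1%}")  # effort = 5.7%
```

The effort ratio of about 5.7% is what the abstract rounds to 6%.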

preprint · 2021 · arXiv

Learning to Recognize Actionable Static Code Warnings (is Intrinsically Easy)

Static code warning tools often generate warnings that programmers ignore. Such tools can be made more useful via data mining algorithms that select the "actionable" warnings; i.e. the warnings that are usually not ignored. In this paper, we look for actionable warnings within a sample of 5,675 actionable warnings seen in 31,058 static code warnings from FindBugs. We find that data mining algorithms can find actionable warnings with remarkable ease. Specifically, a range of data mining methods (deep learners, random forests, decision tree learners, and support vector machines) all achieved very good results (recalls and AUC (TNR, TPR) measures usually over 95% and false alarms usually under 5%). Given that all these learners succeeded so easily, it is appropriate to ask if there is something about this task that is inherently easy. We report that while our data sets have up to 58 raw features, those features can be approximated by fewer than two underlying dimensions. For such intrinsically simple data, many different kinds of learners can generate useful models with similar performance. Based on the above, we conclude that learning to recognize actionable static code warnings is easy, using a wide range of learning algorithms, since the underlying data is intrinsically simple. If we had to pick one particular learner for this task, we would suggest linear SVMs (since, at least in our sample, that learner ran relatively quickly and achieved the best median performance) and we would not recommend deep learning (since this data is intrinsically very simple).

preprint · 2020 · arXiv

A General-Purpose Dehazing Algorithm based on Local Contrast Enhancement Approaches

In the image processing and computer vision communities, dehazing is the task of enhancing images taken in foggy conditions. To better understand this type of algorithm, we present in this document a dehazing method which is suitable for several local contrast adjustment algorithms. We base it on two filters. The first filter is built with a normalization step and some other statistical tricks, while the second represents the local contrast improvement algorithm. Thus, it can work on both CPU and GPU for real-time applications. We hope that our approach will open the door to new ideas in the community. Our method has several other advantages: it does not need to be trained, and it does not need additional optimization processing. Furthermore, it can be used as a pre-treatment or post-processing step in many vision tasks. In addition, it does not need to convert the problem into a physical interpretation, and finally, it is very fast. This family of defogging algorithms is fairly simple, but it shows promising results compared to state-of-the-art algorithms based not only on a visual assessment but also on objective criteria.

preprint · 2020 · arXiv

Better Data Labelling with EMBLEM (and how that Impacts Defect Prediction)

Standard automatic methods for recognizing problematic development commits can be greatly improved via the incremental application of human+artificial expertise. In this approach, called EMBLEM, an AI tool first explores the software development process to label commits that are most problematic. Humans then apply their expertise to check those labels (perhaps resulting in the AI updating the support vectors within their SVM learner). We recommend this human+AI partnership, for several reasons. When a new domain is encountered, EMBLEM can learn better ways to label which comments refer to real problems. Also, in studies with 9 open source software projects, labelling via EMBLEM's incremental application of human+AI is at least an order of magnitude cheaper than existing methods ($\approx$ eight times). Further, EMBLEM is very effective. For the data sets explored here, EMBLEM's better labelling methods significantly improved $P_{opt}20$ and G-scores performance in nearly all the projects studied here.

preprint · 2020 · arXiv

Distributed Frequency Emergency Control with Coordinated Edge Intelligence

Developing effective strategies to rapidly support grid frequency while minimizing loss in case of severe contingencies is an important requirement in power systems. While distributed responsive load demands are commonly adopted for frequency regulation, it is difficult to achieve both rapid response and global accuracy in a practical and cost-effective manner. In this paper, the cyber-physical design of an Internet-of-Things (IoT) enabled system, called Grid Sense, is presented. Grid Sense utilizes a large number of distributed appliances for frequency emergency support. It features a local power loss $ΔP$ estimation approach for frequency emergency control based on coordinated edge intelligence. The specifically designed smart outlets of Grid Sense detect the frequency disturbance event locally using the parameters sent from the control center to estimate active power loss in the system and to make rapid and accurate switching decisions soon after a severe contingency. Based on a modified IEEE 24-bus system, numerical simulations and hardware experiments are conducted to demonstrate the frequency support performance of Grid Sense in the aspects of accuracy and speed. It is shown that Grid Sense equipped with its local $ΔP$-estimation frequency control approach can accurately and rapidly prevent the drop of frequency after a major power loss.

preprint · 2020 · arXiv

Extended Prony Analysis on Power System Oscillation Under a Near-Resonance Condition

Power system oscillations under a large disturbance often exhibit distorted waveforms as captured by increasingly deployed phasor measurement units. One cause is the occurrence of a near-resonance condition among several dominant modes that are influenced by nonlinear transient dynamics of generators. This paper proposes an Extended Prony Analysis method for measurement-based modal analysis. Based on the normal form theory, it compares analyses on transient and post-transient waveforms to distinguish a resonance mode caused by a near-resonance condition from natural modes so that the method can give more accurate modal properties than a traditional Prony Analysis method, especially for large disturbances. The new method is first demonstrated in detail on Kundur's two-area system and then tested on the IEEE 39-bus system to show its performance under a near-resonance condition.

preprint · 2020 · arXiv

NTIRE 2020 Challenge on NonHomogeneous Dehazing

This paper reviews the NTIRE 2020 Challenge on NonHomogeneous Dehazing of images (restoration of rich details in hazy images). We focus on the proposed solutions and their results evaluated on NH-Haze, a novel dataset consisting of 55 pairs of real haze-free and nonhomogeneous hazy images recorded outdoors. NH-Haze is the first realistic nonhomogeneous haze dataset that provides ground truth images. The nonhomogeneous haze has been produced using a professional haze generator that imitates the real conditions of haze scenes. 168 participants registered in the challenge and 27 teams competed in the final testing phase. The proposed solutions gauge the state-of-the-art in image dehazing.

preprint · 2020 · arXiv

Online Low Frequency Oscillation Detection and Analysis System with an Ensemble Filter

The widespread deployment of phasor measurement units (PMUs) over power systems makes it possible to monitor and analyze grid dynamics in real-time. Low-frequency oscillations are harmful to power system equipment and operation, and in the worst-case scenario may lead to cascading failures. Therefore, it is critical to detect and identify them as soon as they appear. This paper presents an online low-frequency oscillation detection and analysis (LFODA) system, which has the merit of significantly reducing the chance of false alarm via a voting schema and a time-serial filter. A novel algorithm based on density-based spatial clustering of applications with noise (DBSCAN) is proposed to classify oscillation modes as well as to group their corresponding buses/monitoring sites. Performance of the LFODA system is evaluated through experiments using both simulated and real-world PMU data.

preprint · 2020 · arXiv

Time Series Classification for Locating Forced Oscillation Sources

Forced oscillations are caused by sustained cyclic disturbances. This paper presents a machine learning (ML) based time-series classification method that uses the synchrophasor measurements to locate the sources of forced oscillations for fast disturbance removal. Sequential feature selection is used to identify the most informative measurements of each power plant so that multivariate time series (MTS) can be constructed. By training the Mahalanobis matrix, we measure and compare the distance between the MTSs. Templates for representing each class are constructed to reduce the size of training datasets and improve the online matching efficiency. The dynamic time warping (DTW) algorithm is used to align the out-of-sync MTSs to account for oscillation detection errors. The algorithm is validated on two test systems: the IEEE 39-bus system and the WECC 179-bus system. When a forced oscillation occurs, MTSs will be constructed by designated PMU measurements. Then, the MTSs will be classified by the trained classifiers, the class membership of which corresponds to the location of each oscillation source. Simulation results show that the proposed method can be used online to identify the forced oscillation sources with high accuracy. The robustness of the proposed algorithm in the presence of oscillation detection errors is also quantified.
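To make the template-matching idea in this abstract concrete, here is a minimal sketch of dynamic time warping followed by nearest-template classification. It is a simplification under stated assumptions: the series are univariate and the point-wise cost is an absolute difference, whereas the abstract describes multivariate series compared with a learned Mahalanobis distance. The template names (`plant_A`, `plant_B`) and the signals are hypothetical.

```python
def dtw_distance(a, b):
    """Classic O(len(a) * len(b)) dynamic time warping distance.

    cost[i][j] is the cheapest alignment cost of a[:i] with b[:j].
    """
    n, m = len(a), len(b)
    inf = float("inf")
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = abs(a[i - 1] - b[j - 1])  # assumed point-wise cost
            cost[i][j] = d + min(
                cost[i - 1][j],      # a advances, b repeats
                cost[i][j - 1],      # b advances, a repeats
                cost[i - 1][j - 1],  # both advance
            )
    return cost[n][m]


def classify(query, templates):
    """Return the label of the template closest to the query under DTW."""
    return min(templates, key=lambda label: dtw_distance(query, templates[label]))


# Hypothetical templates, one per candidate oscillation source.
templates = {
    "plant_A": [0, 1, 2, 1, 0],
    "plant_B": [0, -1, -2, -1, 0],
}

# The query is plant_A's template delayed by one sample,
# mimicking a detection-time error.
query = [0, 0, 1, 2, 1, 0]

print(dtw_distance(query, templates["plant_A"]))  # 0.0
print(classify(query, templates))                 # plant_A
```

Because DTW lets one sample align with several, the one-sample delay costs nothing, which is the property the abstract relies on to tolerate oscillation detection errors.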

preprint · 2020 · arXiv

Wide Area Measurement System-based Low Frequency Oscillation Damping Control through Reinforcement Learning

Ensuring the stability of power systems is gaining more attention today than ever before, due to the rapid growth of uncertainties in load and renewable energy penetration. Lately, wide area measurement system-based centralized control techniques have started providing more flexible and robust control to keep the system stable. However, such a modernization of control philosophy faces pressing challenges due to the irregularities in delays of long-distance communication channels and response of equipment to control actions. Therefore, we propose an innovative approach that can revolutionize the control strategy for damping low-frequency oscillations in transmission systems. The proposed method has the potential to overcome the challenges of communication delays and other non-linearities in wide area damping control by leveraging the capability of the reinforcement learning technique. Such a technique has the unique ability to learn from diverse scenarios and operating conditions by exploring the environment and devising an optimal control action policy with a policy gradient method. Our detailed analysis and systematically designed numerical validation prove the feasibility, scalability and interpretability of the carefully modelled low-frequency oscillation damping controller, so that stability is ensured even as the uncertainties of load and generation are on the rise.

preprint · 2019 · arXiv

Better Software Analytics via "DUO": Data Mining Algorithms Using/Used-by Optimizers

This paper claims that a new field of empirical software engineering research and practice is emerging: data mining using/used-by optimizers for empirical studies or DUO. For example, data miners can generate models that are explored by optimizers. Also, optimizers can advise how to best adjust the control parameters of a data miner. This combined approach acts like an agent leaning over the shoulder of an analyst that advises "ask this question next" or "ignore that problem, it is not relevant to your goals". Further, those agents can help us build "better" predictive models, where "better" can be either greater predictive accuracy or faster modeling time (which, in turn, enables the exploration of a wider range of options). We also caution that the era of papers that just use data miners is coming to an end. Results obtained from an unoptimized data miner can be quickly refuted, just by applying an optimizer to produce a different (and better performing) model. Our conclusion, hence, is that for software analytics it is possible, useful and necessary to combine data mining and optimization using DUO.

preprint · 2018 · arXiv

Crowdtesting : When is The Party Over?

Trade-offs such as "how much testing is enough" are critical yet challenging project decisions in software engineering. Most existing approaches adopt risk-driven or value-based analysis to prioritize test cases and minimize test runs. However, none of these is applicable to the emerging crowd testing paradigm where task requesters typically have no control over online crowdworkers' dynamic behavior and uncertain performance. In current practice, deciding when to close a crowdtesting task is largely done by guesswork due to lack of decision support. This paper intends to fill this gap by introducing automated decision support for monitoring and determining the appropriate time to close the crowdtesting tasks. First, this paper investigates the necessity and feasibility of close prediction of crowdtesting tasks based on an industrial dataset. Then, it designs 8 methods for close prediction, based on various models including the bug trend, bug arrival model, and capture-recapture model. Finally, the evaluation is conducted on 218 crowdtesting tasks from one of the largest crowdtesting platforms in China, and the results show that a median of 91% bugs can be detected with 49% saved cost.
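The abstract names a capture-recapture model among its close-prediction methods without stating which estimator is used. As a rough illustration of the general idea only, the sketch below applies the textbook Lincoln-Petersen estimator: if two independent groups of testers report `n1` and `n2` distinct bugs with `m` bugs in common, the total bug count is estimated as `n1 * n2 / m`. The function names and the numbers are assumptions, not taken from the paper.

```python
def lincoln_petersen(n1, n2, m):
    """Estimate the total population from two samples with m shared items."""
    if m <= 0:
        raise ValueError("need at least one shared bug to estimate the total")
    return n1 * n2 / m


def fraction_found(n1, n2, m):
    """Share of the estimated total bugs already reported by either group."""
    unique_found = n1 + n2 - m
    return unique_found / lincoln_petersen(n1, n2, m)


# Hypothetical task: group 1 reported 40 bugs, group 2 reported 30,
# and 20 bugs appear in both reports.
print(lincoln_petersen(40, 30, 20))          # 60.0
print(round(fraction_found(40, 30, 20), 3))  # 0.833
```

A requester could close the task once the estimated fraction found passes a chosen threshold. A large overlap between groups suggests few bugs remain undiscovered.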

preprint · 2018 · arXiv

Data-Driven Search-based Software Engineering

This paper introduces Data-Driven Search-based Software Engineering (DSE), which combines insights from Mining Software Repositories (MSR) and Search-based Software Engineering (SBSE). While MSR formulates software engineering problems as data mining problems, SBSE reformulates SE problems as optimization problems and uses meta-heuristic algorithms to solve them. Both MSR and SBSE share the common goal of providing insights to improve software engineering. The algorithms used in these two areas also have intrinsic relationships. We, therefore, argue that combining these two fields is useful for situations (a) which require learning from a large data source or (b) when optimizers need to know the lay of the land to find better solutions, faster. This paper aims to answer the following three questions: (1) What are the various topics addressed by DSE? (2) What types of data are used by the researchers in this area? (3) What research approaches do researchers use? The paper briefly sets out to act as a practical guide to develop new DSE techniques and also to serve as a teaching resource. This paper also presents a resource (tiny.cc/data-se) for exploring DSE. The resource contains 89 artifacts which are related to DSE, divided into 13 groups such as requirements engineering, software product lines, and software processes. All the materials in this repository have been used in recent software engineering papers; i.e., for all this material, there exist baseline results against which researchers can comparatively assess their new ideas.