Source author record

Yongdao Zhou

Yongdao Zhou appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Methodology Applications Machine Learning math.ST Statistics Theory

Catalog footprint

What is connected

4works

5topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2022arXiv

A sampling scheme for estimating the prevalence of a pandemic

The spread of COVID-19 makes it essential to investigate its prevalence. In such investigation research, as far as we know, the widely-used sampling methods didn't use the information sufficiently about the numbers of the previously diagnosed cases, which provides a priori information about the true numbers of infections. This motivates us to develop a new, two-stage sampling method in this paper, which utilises the information about the distributions of both population and diagnosed cases, to investigate the prevalence more efficiently. The global likelihood sampling, a robust and efficient sampler to draw samples from any probability density function, is used in our sampling strategy, and thus, our new method can automatically adapt to the complicated distributions of population and cases. Moreover, the corresponding estimating method is simple, which facilitates the practical implementation. Some recommendations for practical implementation are given. Finally, several simulations and a practical example verified its efficiency.

preprint2022arXiv

Doubly Coupled Designs for Computer Experiments with both Qualitative and Quantitative Factors

Computer experiments with both qualitative and quantitative input variables occur frequently in many scientific and engineering applications. How to choose input settings for such experiments is an important issue for accurate statistical analysis, uncertainty quantification and decision making. Sliced Latin hypercube designs are the first systematic approach to address this issue. However, it comes with the increasing cost associated with an increasing large number of level combinations of the qualitative factors. For the reason of run size economy, marginally coupled designs were proposed in which the design for the quantitative factors is a sliced Latin hypercube design with respect to each qualitative factor. The drawback of such designs is that the corresponding data may not be able to capture the effects between any two (and more) qualitative factors and quantitative factors. To balance the run size and design efficiency, we propose a new type of designs, doubly coupled designs, where the design points for the quantitative factors form a sliced Latin hypercube design with respect to the levels of any qualitative factor and with respect to the level combinations of any two qualitative factors, respectively. The proposed designs have the better stratification property between the qualitative and quantitative factors compared with marginally coupled designs. The existence of the proposed designs is established. Several construction methods are introduced, and the properties of the resulting designs are also studied.

preprint2022arXiv

Model-free Subsampling Method Based on Uniform Designs

Subsampling or subdata selection is a useful approach in large-scale statistical learning. Most existing studies focus on model-based subsampling methods which significantly depend on the model assumption. In this paper, we consider the model-free subsampling strategy for generating subdata from the original full data. In order to measure the goodness of representation of a subdata with respect to the original data, we propose a criterion, generalized empirical F-discrepancy (GEFD), and study its theoretical properties in connection with the classical generalized L2-discrepancy in the theory of uniform designs. These properties allow us to develop a kind of low-GEFD data-driven subsampling method based on the existing uniform designs. By simulation examples and a real case study, we show that the proposed subsampling method is superior to the random sampling method. Moreover, our method keeps robust under diverse model specifications while other popular subsampling methods are under-performing. In practice, such a model-free property is more appealing than the model-based subsampling methods, where the latter may have poor performance when the model is misspecified, as demonstrated in our simulation studies.

preprint2021arXiv

Uniformity criterion for designs with both qualitative and quantitative factors

Experiments with both qualitative and quantitative factors occur frequently in practical applications. Many construction methods for this kind of designs, such as marginally coupled designs, were proposed to pursue some good space-filling structures. However, few criteria can be adapted to quantify the space-filling property of designs involving both qualitative and quantitative factors. As the uniformity is an important space-filling property of a design, in this paper, a new uniformity criterion, qualitative-quantitative discrepancy (QQD), is proposed for assessing the uniformity of designs with both types of factors. The closed form and lower bounds of the QQD are presented to calculate the exact QQD values of designs and recognize the uniform designs directly. In addition, a connection between the QQD and the balance pattern is derived, which not only helps to obtain a new lower bound but also provides a statistical justification of the QQD. Several examples show that the proposed criterion is reasonable and useful since it can distinguish distinct designs very well.

Yongdao Zhou

What is connected

Connect this record

See the researcher in context

Building this map preview

4 published item(s)

A sampling scheme for estimating the prevalence of a pandemic

Doubly Coupled Designs for Computer Experiments with both Qualitative and Quantitative Factors

Model-free Subsampling Method Based on Uniform Designs

Uniformity criterion for designs with both qualitative and quantitative factors