Source author record

Yujia Zhai

Yujia Zhai appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher · Unclaimed source record

Machine Learning; math.CA; Social and Information Networks; Artificial Intelligence; Computation and Language; cs.CY; Distributed, Parallel, and Cluster Computing; math.CO; math.FA; math.NT

Catalog footprint

What is connected

8 works

10 topics

4 close collaborators


preprint · 2022 · arXiv

A Unified Granular-ball Learning Model of Pawlak Rough Set and Neighborhood Rough Set

Pawlak rough sets and neighborhood rough sets are the two most common rough set theoretical models. Pawlak rough sets can use equivalence classes to represent knowledge but cannot process continuous data; neighborhood rough sets can process continuous data but lose the ability to use equivalence classes to represent knowledge. To this end, this paper presents a granular-ball rough set based on granular-ball computing. The granular-ball rough set can represent both Pawlak rough sets and neighborhood rough sets, thereby unifying the two. As a result, the granular-ball rough set can both deal with continuous data and use equivalence classes for knowledge representation. In addition, we propose an implementation algorithm for granular-ball rough sets. The experimental results on benchmark datasets demonstrate that, owing to the combined robustness and adaptability of granular-ball computing, the learning accuracy of the granular-ball rough set is greatly improved compared with the Pawlak rough set and the traditional neighborhood rough set. The granular-ball rough set also outperforms nine popular or state-of-the-art feature selection methods.

preprint · 2022 · arXiv

Sparse domination and $L^{p} \rightarrow L^{q}$ estimates for maximal functions associated with curvature

In this paper, we study maximal functions along some finite type curves and hypersurfaces. In particular, various impacts of non-isotropic dilations are considered. Firstly, we provide a generic scheme that allows us to deduce the sparse domination bounds for global maximal functions under the assumption that the corresponding localized maximal functions satisfy the $L^{p}$ improving properties. Secondly, for the localized maximal functions with non-isotropic dilations of curves and hypersurfaces whose curvatures vanish to finite order at some points, we establish the $L^{p}\rightarrow L^{q}$ bounds $(q >p)$. As a corollary, we obtain the weighted inequalities for the corresponding global maximal functions, which generalize the known unweighted estimates.

preprint · 2021 · arXiv

Five-Linear Singular Integral Estimates of Brascamp-Lieb Type

We prove the full range of estimates for a five-linear singular integral of Brascamp-Lieb type. The study is methodology-oriented, with the goal of developing a sufficiently general technique to estimate singular integral variants of Brascamp-Lieb inequalities that do not obey Hölder scaling. The methodology constructs localized analysis on the entire space from local information on its lower-dimensional subspaces and combines such tensor-type arguments with the generic localized analysis. A direct consequence of the boundedness of the five-linear singular integral is a Leibniz rule that captures nonlinear interactions of waves from transversal directions.

preprint · 2021 · arXiv

Innovation adoption: Broadcasting vs. Virality

Diffusion channels are critical in determining the scale of adoption, which in turn determines the ultimate impact of an innovation. The aim of this study is to develop an integrative understanding of the impact of two diffusion channels (i.e., broadcasting vs. virality) on innovation adoption. Using citations of a series of classic algorithms and the time series of co-authorship as the footprints of their diffusion trajectories, we propose a novel method to analyze the intertwined relationship between broadcasting and virality in the innovation diffusion process. Our findings show that broadcasting and virality have similar diffusion power but play different roles across diffusion stages. Broadcasting is more powerful in the early stages but may gradually be matched or even surpassed by virality in the later period. Meanwhile, diffusion through virality is significantly faster than diffusion through broadcasting, and members from virality channels tend to adopt the same innovation repeatedly.

preprint · 2020 · arXiv

Analysis of misinformation during the COVID-19 outbreak in China: cultural, social and political entanglements

COVID-19 resulted in an infodemic, which could erode public trust, impede virus containment, and outlive the pandemic itself. The evolving and fragmented media landscape is a key driver of the spread of misinformation. Using misinformation identified by Tencent's fact-checking platform and posts on Weibo, our results show that the evolution of misinformation follows an issue-attention cycle, pertaining to topics such as city lockdowns, cures and preventions, and school reopening. Sources of authority weigh in on these topics, but their influence is complicated by people's pre-existing beliefs and cultural practices. Finally, social media has a complicated relationship with established or legacy media systems. Sometimes they reinforce each other, but in general, social media may have a topic cycle of its own making. Our findings shed light on the distinct characteristics of misinformation during the COVID-19 outbreak and offer insights into combating misinformation in China and across the world at large.

preprint · 2020 · arXiv

FT-CNN: Algorithm-Based Fault Tolerance for Convolutional Neural Networks

Convolutional neural networks (CNNs) are becoming increasingly important for solving challenging and critical problems in many fields. CNN inference applications have been deployed in safety-critical systems, which may suffer from soft errors caused by high-energy particles, high temperature, or abnormal voltage. Of critical importance is ensuring the stability of the CNN inference process against soft errors. Traditional fault tolerance methods are not suitable for CNN inference because error-correcting code is unable to protect computational components, instruction duplication techniques incur high overhead, and existing algorithm-based fault tolerance (ABFT) techniques cannot protect all convolution implementations. In this paper, we focus on how to protect the CNN inference process against soft errors as efficiently as possible, with the following three contributions. (1) We propose several systematic ABFT schemes based on checksum techniques and analyze their fault protection ability and runtime thoroughly. Unlike traditional ABFT based on matrix-matrix multiplication, our schemes support any convolution implementation. (2) We design a novel workflow integrating all the proposed schemes to obtain a high detection/correction ability with limited total runtime overhead. (3) We perform our evaluation using ImageNet with well-known CNN models including AlexNet, VGG-19, ResNet-18, and YOLOv2. Experimental results demonstrate that our implementation can handle soft errors with very limited runtime overhead (4%~8% in both error-free and error-injected situations).
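The checksum idea mentioned in this abstract can be illustrated with a small sketch. It is not the paper's scheme: it relies only on the linearity of convolution (the sum of the per-filter outputs equals the convolution with the sum of the filters), and every name in it (`conv2d_valid`, `add_maps`, `checked_conv`, the `fault` tuple) is hypothetical. Integer data is assumed so the comparison can be exact; floating-point data would need a tolerance.

```python
def conv2d_valid(image, kernel):
    """Plain 2-D 'valid' cross-correlation on nested lists."""
    kh, kw = len(kernel), len(kernel[0])
    oh, ow = len(image) - kh + 1, len(image[0]) - kw + 1
    return [
        [
            sum(image[i + a][j + b] * kernel[a][b]
                for a in range(kh) for b in range(kw))
            for j in range(ow)
        ]
        for i in range(oh)
    ]


def add_maps(maps):
    """Element-wise sum of equally sized 2-D maps."""
    return [[sum(vals) for vals in zip(*rows)] for rows in zip(*maps)]


def checked_conv(image, kernels, fault=None):
    """Convolve with every kernel, then verify the results with a checksum.

    fault is an optional (kernel_index, row, col, delta) tuple that
    corrupts one output value to simulate a soft error (illustration only).
    Returns the outputs and the output positions where the check failed.
    """
    outputs = [conv2d_valid(image, k) for k in kernels]
    if fault is not None:
        k, i, j, delta = fault
        outputs[k][i][j] += delta  # injected error

    # Checksum: one extra convolution with the summed kernel.
    expected = conv2d_valid(image, add_maps(kernels))
    observed = add_maps(outputs)

    mismatches = [
        (i, j)
        for i, row in enumerate(expected)
        for j, value in enumerate(row)
        if observed[i][j] != value
    ]
    return outputs, mismatches


image = [[1, 2, 3], [4, 5, 6], [7, 8, 9]]
kernels = [[[1, 0], [0, 1]], [[0, 1], [1, 0]]]
_, clean = checked_conv(image, kernels)
_, faulty = checked_conv(image, kernels, fault=(1, 0, 1, 5))
```

In this sketch the check costs one additional convolution regardless of the number of filters. It reports where in the output map a mismatch occurred but not which filter's output was corrupted, so it detects errors without correcting them.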

preprint · 2020 · arXiv

Normalization of Input-output Shared Embeddings in Text Generation Models

Neural network based models have been state-of-the-art for various Natural Language Processing tasks; however, the input and output dimension problem in these networks has still not been fully resolved, especially in text generation tasks (e.g. Machine Translation, Text Summarization), in which both input and output have very large vocabularies. Input-output embedding weight sharing has therefore been introduced and widely adopted, but it still leaves room for improvement. Based on linear algebra and statistical theories, this paper locates the shortcoming of the existing input-output embedding weight sharing method, then proposes methods for improving input-output weight-shared embeddings, among which normalization of the embedding weight matrices shows the best performance. These methods are nearly free of computational cost, can be combined with other embedding techniques, and are effective when applied to state-of-the-art neural network models. For Transformer-big models, the normalization techniques yield at best a 0.6 BLEU improvement over the original model on the WMT'16 En-De dataset, and similar BLEU improvements on the IWSLT'14 datasets. For DynamicConv models, a 0.5 BLEU improvement is attained on the WMT'16 En-De dataset, and a 0.41 BLEU improvement is achieved on the IWSLT'14 De-En translation task.

preprint · 2012 · arXiv

Areas of triangles and Beck's theorem in planes over finite fields

It is shown that any subset $E$ of a plane over a finite field $\mathbb{F}_q$ of cardinality $|E|>q$ determines not less than $\frac{q-1}{2}$ distinct areas of triangles; moreover, one can find such triangles sharing a common base. It is also shown that if $|E|\geq 64q\log_2 q$, then there are more than $\frac{q}{2}$ distinct areas of triangles sharing a common vertex. The result follows from a finite field version of the Beck theorem for large subsets of $\mathbb{F}_q^2$ that we prove. If $|E|\geq 64q\log_2 q$, there exists a point $z\in E$ such that there are at least $\frac{q}{4}$ straight lines incident to $z$, each supporting a number of points of $E$ other than $z$ in the interval between $\frac{|E|}{2q}$ and $\frac{2|E|}{q}$. This is proved by combining combinatorial and Fourier analytic techniques. We also discuss higher-dimensional implications of these results in light of recent developments.
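For small cases, the quantities in this abstract can be explored by brute force. The sketch below makes assumptions that the abstract does not state: $q$ is taken to be an odd prime so that arithmetic mod $q$ models the field, the "area" of a triangle is represented by the determinant of its two edge vectors mod $q$ (the factor $\frac{1}{2}$ is dropped because it is invertible for odd $q$), and zero determinants (collinear triples) are ignored. The function names are hypothetical, and the paper's own convention for counting areas may differ.

```python
from itertools import permutations


def det_mod(a, b, c, q):
    """Determinant of the edge vectors (b - a, c - a), reduced mod q."""
    (ax, ay), (bx, by), (cx, cy) = a, b, c
    return ((bx - ax) * (cy - ay) - (by - ay) * (cx - ax)) % q


def triangle_areas(points, q):
    """Nonzero determinant values over all ordered triples of points."""
    areas = set()
    for a, b, c in permutations(points, 3):
        d = det_mod(a, b, c, q)
        if d:
            areas.add(d)
    return areas


def areas_from_vertex(points, q, vertex):
    """Nonzero determinant values of triangles that share one vertex."""
    others = [p for p in points if p != vertex]
    areas = set()
    for b, c in permutations(others, 2):
        d = det_mod(vertex, b, c, q)
        if d:
            areas.add(d)
    return areas


# The whole plane over the field with 3 elements.
plane_3 = [(x, y) for x in range(3) for y in range(3)]
all_areas = triangle_areas(plane_3, 3)
vertex_areas = areas_from_vertex(plane_3, 3, (0, 0))
```

Because ordered triples are used, the resulting sets are closed under negation mod $q$. The enumeration is cubic in the number of points, so it is only practical for small $q$.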
