Source author record

Tian Dong

Tian Dong appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher: unclaimed source record

Topics: math.AC, math.NA, Cryptography and Security, Machine Learning, math.CO

Catalog footprint

What is connected

8 works

5 topics

4 close collaborators

Preprint, 2022, arXiv

An Interpretable Federated Learning-based Network Intrusion Detection Framework

Learning-based Network Intrusion Detection Systems (NIDSs) are widely deployed to defend against various cyberattacks. Existing learning-based NIDSs mainly use a Neural Network (NN) as the classifier, which relies on the quality and quantity of cyberattack data. Such NN-based approaches are also hard to interpret, which hinders improving efficiency and scalability. In this paper, we design a new local-global computation paradigm, FEDFOREST, a novel learning-based NIDS that combines the interpretable Gradient Boosting Decision Tree (GBDT) with the Federated Learning (FL) framework. Specifically, FEDFOREST is composed of multiple clients that extract local cyberattack data features for the server to train models and detect intrusions. A privacy-enhancing technique is also proposed in FEDFOREST to further protect the privacy of the FL system. Extensive experiments on 4 cyberattack datasets of different tasks demonstrate that FEDFOREST is effective, efficient, interpretable, and extendable. FEDFOREST ranked first in the 2021 collaborative learning and cybersecurity competition for Chinese college students.

Preprint, 2022, arXiv

Privacy for Free: How does Dataset Condensation Help Privacy?

To prevent unintentional data leakage, the research community has resorted to data generators that can produce differentially private data for model training. However, for the sake of data privacy, existing solutions suffer from either expensive training cost or poor generalization performance. Therefore, we raise the question of whether training efficiency and privacy can be achieved simultaneously. In this work, we identify for the first time that dataset condensation (DC), which was originally designed to improve training efficiency, is also a better solution than traditional data generators for private data generation, thus providing privacy for free. To demonstrate the privacy benefit of DC, we build a connection between DC and differential privacy, and theoretically prove on linear feature extractors (later extended to non-linear feature extractors) that the existence of one sample has limited impact ($O(m/n)$) on the parameter distribution of networks trained on $m$ samples synthesized from $n$ ($n \gg m$) raw samples by DC. We also empirically validate the visual privacy and membership privacy of DC-synthesized data by launching both loss-based and state-of-the-art likelihood-based membership inference attacks. We envision this work as a milestone for data-efficient and privacy-preserving machine learning.

Preprint, 2016, arXiv

A New Result on Packing Unit Squares into a Large Square

In their 2009 note \emph{Packing equal squares into a large square}, Chung and Graham proved that the uncovered area of a large square of side length $x$ is $O\left(x^{(3+\sqrt{2})/7}\log x\right)$ after the maximum number of non-overlapping unit squares are packed into it, which improved the earlier results of Erdős-Graham, Roth-Vaughan, and Karabash-Soifer. Here we further improve the result to $O(x^{5/8})$, which also improves the bound for the dual problem of finding the minimum number of unit squares needed to cover the large square, from $x^2+O\left(x^{(3+\sqrt{2})/7}\log x\right)$ to $x^2+O(x^{5/8})$.
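
To make the size of the improvement concrete, the following minimal Python sketch compares the two exponents from the abstract and computes the uncovered area of the naive axis-aligned grid packing as a baseline. The function names are illustrative and not from the paper, and the constants hidden by the $O(\cdot)$ notation are ignored, so the ratio only indicates growth rates, not actual areas.

```python
import math

# Exponents of x in the two bounds quoted in the abstract.
CHUNG_GRAHAM_EXP = (3 + math.sqrt(2)) / 7  # roughly 0.6306, with an extra log x factor
NEW_EXP = 5 / 8                            # 0.625


def trivial_waste(x: float) -> float:
    """Uncovered area when floor(x)^2 unit squares are placed in an axis-aligned grid."""
    k = math.floor(x)
    return x * x - k * k


def growth_ratio(x: float) -> float:
    """Ratio of the older growth term to the newer one (hidden constants ignored)."""
    return x ** CHUNG_GRAHAM_EXP * math.log(x) / x ** NEW_EXP


if __name__ == "__main__":
    print(f"exponents: {CHUNG_GRAHAM_EXP:.4f} vs {NEW_EXP:.4f}")
    print(f"grid packing waste at x = 10.5: {trivial_waste(10.5)}")
    print(f"growth ratio at x = 1e6: {growth_ratio(1e6):.2f}")
```

The grid baseline wastes an area of order $x$ when $x$ is not close to an integer, which is why bounds with exponent below 1 require non-trivial packings.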

Preprint, 2014, arXiv

A Two-Dimensional Improvement for Farr-Gao Algorithm

The Farr-Gao algorithm is a state-of-the-art algorithm for computing reduced Gröbner bases of vanishing ideals of finite point sets, and it has been implemented in Maple$^\circledR$ as a built-in command. In this paper, we present a two-dimensional improvement that employs a preprocessing strategy for computing reduced Gröbner bases associated with tower subsets of given point sets. Experimental results show that the preprocessed Farr-Gao algorithm is more efficient than the classical one.

Preprint, 2011, arXiv

Finite Sets of Affine Points with Unique Associated Monomial Order Quotient Bases

Quotient bases for zero-dimensional ideals are often of interest in areas such as multivariate polynomial interpolation, algebraic coding theory, and computational molecular biology. In this paper, we discuss the properties of zero-dimensional ideals with unique monomial quotient bases, and verify that the vanishing ideals of Cartesian sets have unique monomial quotient bases. Furthermore, we reveal the relation between Cartesian sets and the point sets with unique associated monomial quotient bases.
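
As a small illustration of the Cartesian case, the sketch below takes a hypothetical Cartesian set $A \times B$ in the plane and checks, with exact rational arithmetic, that the "box" monomials $x^i y^j$ with $i < |A|$, $j < |B|$ have an invertible evaluation matrix on the points, so they form a basis of the quotient ring. It then shows that one other lower set of monomials of the same size does not. This is only a numerical spot check under these assumptions, not the paper's proof of uniqueness; the point set and helper names are chosen for illustration.

```python
from fractions import Fraction
from itertools import product


def determinant(matrix):
    """Exact determinant via Gaussian elimination over the rationals."""
    m = [[Fraction(v) for v in row] for row in matrix]
    n = len(m)
    det = Fraction(1)
    for col in range(n):
        # Find a row at or below the diagonal with a nonzero entry in this column.
        pivot = next((r for r in range(col, n) if m[r][col] != 0), None)
        if pivot is None:
            return Fraction(0)
        if pivot != col:
            m[col], m[pivot] = m[pivot], m[col]
            det = -det
        det *= m[col][col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n):
                m[r][c] -= factor * m[col][c]
    return det


def evaluation_matrix(points, exponents):
    """Rows are points (x, y); columns are monomials x^i * y^j."""
    return [[x ** i * y ** j for (i, j) in exponents] for (x, y) in points]


# Hypothetical Cartesian set A x B (values chosen only for illustration).
A = [0, 1, 3]
B = [2, 5]
points = list(product(A, B))

# Box-shaped monomial set: x^i y^j with i < |A| and j < |B|.
box = [(i, j) for i in range(len(A)) for j in range(len(B))]

# Another lower set with the same number of monomials: 1, x, x^2, x^3, y, xy.
other = [(0, 0), (1, 0), (2, 0), (3, 0), (0, 1), (1, 1)]

det_box = determinant(evaluation_matrix(points, box))
det_other = determinant(evaluation_matrix(points, other))

if __name__ == "__main__":
    print("box monomials form a basis:", det_box != 0)
    print("other lower set forms a basis:", det_other != 0)
```

The second set fails because $1, x, x^2, x^3$ depend only on $x$, which takes just three distinct values on the points, so their evaluation columns are linearly dependent.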

Preprint, 2011, arXiv

Ideal Projectors of Type Partial Derivative and Their Perturbations

In this paper, we verify Carl de Boor's conjecture on ideal projectors for real ideal projectors of type partial derivative by proving that there exists a positive $η\in \mathbb{R}$ such that a real ideal projector of type partial derivative $P$ is the pointwise limit of a sequence of Lagrange projectors which are perturbed from $P$ up to $η$ in magnitude. Furthermore, we present an algorithm for computing the value of such $η$ when the range of the Lagrange projectors is spanned by the Gröbner éscalier of their kernels w.r.t. lexicographic order.

Preprint, 2010, arXiv

A Bivariate Preprocessing Paradigm for Buchberger-Möller Algorithm

For almost three decades, since the famous Buchberger-Möller (BM) algorithm emerged, there has been wide interest in vanishing ideals of points and the associated interpolation polynomials. Our paradigm is based on the theory of bivariate polynomial interpolation on Cartesian point sets, which directly gives us the related degree-reducing interpolation monomial and Newton bases. Since these bases are involved in the computation as well as contained in the final output of the BM algorithm, our paradigm simplifies the computation and accelerates the BM process. The experiments show that the paradigm is best suited for computation over finite prime fields, which have many applications.

Preprint, 2010, arXiv

Bivariate Quasi-Tower Sets and Their Associated Lagrange Interpolation Bases

As is well known, solving arbitrary multivariate Lagrange interpolation in theory is still a long way off. Nevertheless, it is well accepted that theories of Lagrange interpolation on special point sets should shed important light on the general solution. In this paper, we propose a new type of bivariate point set, quasi-tower sets, whose geometry is more natural than that of some known point sets such as Cartesian sets and tower sets. For bivariate Lagrange interpolation on quasi-tower sets, we theoretically construct the associated degree-reducing interpolation monomial and Newton bases w.r.t. common monomial orderings. Moreover, by inputting these bases into the Buchberger-Möller algorithm, we obtain the reduced Gröbner bases for vanishing ideals of quasi-tower sets much more efficiently than before.
