Source author record

Mark A. Kon

Mark A. Kon appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Machine Learning math.FA math.NA

Catalog footprint

What is connected

6works

3topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2012arXiv

A complexity analysis of statistical learning algorithms

We apply information-based complexity analysis to support vector machine (SVM) algorithms, with the goal of a comprehensive continuous algorithmic analysis of such algorithms. This involves complexity measures in which some higher order operations (e.g., certain optimizations) are considered primitive for the purposes of measuring complexity. We consider classes of information operators and algorithms made up of scaled families, and investigate the utility of scaling the complexities to minimize error. We look at the division of statistical learning into information and algorithmic components, at the complexities of each, and at applications to support vector machine (SVM) and more general machine learning algorithms. We give applications to SVM algorithms graded into linear and higher order components, and give an example in biomedical informatics.

preprint2012arXiv

Empirical Normalization for Quadratic Discriminant Analysis and Classifying Cancer Subtypes

We introduce a new discriminant analysis method (Empirical Discriminant Analysis or EDA) for binary classification in machine learning. Given a dataset of feature vectors, this method defines an empirical feature map transforming the training and test data into new data with components having Gaussian empirical distributions. This map is an empirical version of the Gaussian copula used in probability and mathematical finance. The purpose is to form a feature mapped dataset as close as possible to Gaussian, after which standard quadratic discriminants can be used for classification. We discuss this method in general, and apply it to some datasets in computational biology.

preprint2012arXiv

On Some Integrated Approaches to Inference

We present arguments for the formulation of unified approach to different standard continuous inference methods from partial information. It is claimed that an explicit partition of information into a priori (prior knowledge) and a posteriori information (data) is an important way of standardizing inference approaches so that they can be compared on a normative scale, and so that notions of optimal algorithms become farther-reaching. The inference methods considered include neural network approaches, information-based complexity, and Monte Carlo, spline, and regularization methods. The model is an extension of currently used continuous complexity models, with a class of algorithms in the form of optimization methods, in which an optimization functional (involving the data) is minimized. This extends the family of current approaches in continuous complexity theory, which include the use of interpolatory algorithms in worst and average case settings.

preprint2012arXiv

On the probabilistic continuous complexity conjecture

In this paper we prove the probabilistic continuous complexity conjecture. In continuous complexity theory, this states that the complexity of solving a continuous problem with probability approaching 1 converges (in this limit) to the complexity of solving the same problem in its worst case. We prove the conjecture holds if and only if space of problem elements is uniformly convex. The non-uniformly convex case has a striking counterexample in the problem of identifying a Brownian path in Wiener space, where it is shown that probabilistic complexity converges to only half of the worst case complexity in this limit.

preprint2012arXiv

Relationships among Interpolation Bases of Wavelet Spaces and Approximation Spaces

A multiresolution analysis is a nested chain of related approximation spaces.This nesting in turn implies relationships among interpolation bases in the approximation spaces and their derived wavelet spaces. Using these relationships, a necessary and sufficient condition is given for existence of interpolation wavelets, via analysis of the corresponding scaling functions. It is also shown that any interpolation function for an approximation space plays the role of a special type of scaling function (an interpolation scaling function) when the corresponding family of approximation spaces forms a multiresolution analysis. Based on these interpolation scaling functions, a new algorithm is proposed for constructing corresponding interpolation wavelets (when they exist in a multiresolution analysis). In simulations, our theorems are tested for several typical wavelet spaces, demonstrating our theorems for existence of interpolation wavelets and for constructing them in a general multiresolution analysis.

preprint1994arXiv

Pointwise convergence of wavelet expansions

In this note we announce that under general hypotheses, wavelet-type expansions (of functions in $L^p,\ 1\leq p \leq \infty$, in one or more dimensions) converge pointwise almost everywhere, and identify the Lebesgue set of a function as a set of full measure on which they converge. It is shown that unlike the Fourier summation kernel, wavelet summation kernels $P_j$ are bounded by radial decreasing $L^1$ convolution kernels. As a corollary it follows that best $L^2$ spline approximations on uniform meshes converge pointwise almost everywhere. Moreover, summation of wavelet expansions is partially insensitive to order of summation. \footnote We also give necessary and sufficient conditions for given rates of convergence of wavelet expansions in the sup norm. Such expansions have order of convergence $s$ if and only if the basic wavelet $ψ$ is in the homogeneous Sobolev space $H^{-s-d/2}_h$. We also present equivalent necessary and sufficient conditions on the scaling function. The above results hold in one and in multiple dimensions.

Mark A. Kon

What is connected

Connect this record

See the researcher in context

Building this map preview

6 published item(s)

A complexity analysis of statistical learning algorithms

Empirical Normalization for Quadratic Discriminant Analysis and Classifying Cancer Subtypes

On Some Integrated Approaches to Inference

On the probabilistic continuous complexity conjecture

Relationships among Interpolation Bases of Wavelet Spaces and Approximation Spaces

Pointwise convergence of wavelet expansions