Source author record

Eytan Domany

Eytan Domany appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

cond-mat.stat-mech Applications Artificial Intelligence cond-mat Information Theory Machine Learning math.IT Methodology physics.comp-ph

Catalog footprint

What is connected

7works

9topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2013arXiv

Temperature Integration: an efficient procedure for calculation of free energy differences

We propose a method, Temperature Integration, which allows an efficient calculation of free energy differences between two systems of interest, with the same degrees of freedom, which may have rough energy landscapes. The method is based on calculating, for each single system, the difference between the values of lnZ at two temperatures, using a Parallel Tempering procedure. If our two systems of interest have the same phase space volume, they have the same values of lnZ at high-T, and we can obtain the free energy difference between them, using the two single-system calculations described above. If the phase space volume of a system is known, our method can be used to calculate its absolute (versus relative) free energy as well. We apply our method and demonstrate its efficiency on a toy model of hard rods on a 1-dimensional ring.

preprint2012arXiv

On the Number of Samples Needed to Learn the Correct Structure of a Bayesian Network

Bayesian Networks (BNs) are useful tools giving a natural and compact representation of joint probability distributions. In many applications one needs to learn a Bayesian Network (BN) from data. In this context, it is important to understand the number of samples needed in order to guarantee a successful learning. Previous work have studied BNs sample complexity, yet it mainly focused on the requirement that the learned distribution will be close to the original distribution which generated the data. In this work, we study a different aspect of the learning, namely the number of samples needed in order to learn the correct structure of the network. We give both asymptotic results, valid in the large sample limit, and experimental results, demonstrating the learning behavior for feasible sample sizes. We show that structure learning is a more difficult task, compared to approximating the correct distribution, in the sense that it requires a much larger number of samples, regardless of the computational power available for the learner.

preprint2012arXiv

Ranking Under Uncertainty

Ranking objects is a simple and natural procedure for organizing data. It is often performed by assigning a quality score to each object according to its relevance to the problem at hand. Ranking is widely used for object selection, when resources are limited and it is necessary to select a subset of most relevant objects for further processing. In real world situations, the object's scores are often calculated from noisy measurements, casting doubt on the ranking reliability. We introduce an analytical method for assessing the influence of noise levels on the ranking reliability. We use two similarity measures for reliability evaluation, Top-K-List overlap and Kendall's tau measure, and show that the former is much more sensitive to noise than the latter. We apply our method to gene selection in a series of microarray experiments of several cancer types. The results indicate that the reliability of the lists obtained from these experiments is very poor, and that experiment sizes which are necessary for attaining reasonably stable Top-K-Lists are much larger than those currently available. Simulations support our analytical results.

preprint2011arXiv

FDR control with adaptive procedures and FDR monotonicity

The steep rise in availability and usage of high-throughput technologies in biology brought with it a clear need for methods to control the False Discovery Rate (FDR) in multiple tests. Benjamini and Hochberg (BH) introduced in 1995 a simple procedure and proved that it provided a bound on the expected value, $\mathit{FDR}\leq q$. Since then, many authors tried to improve the BH bound, with one approach being designing adaptive procedures, which aim at estimating the number of true null hypothesis in order to get a better FDR bound. Our two main rigorous results are the following: (i) a theorem that provides a bound on the FDR for adaptive procedures that use any estimator for the number of true hypotheses ($m_0$), (ii) a theorem that proves a monotonicity property of general BH-like procedures, both for the case where the hypotheses are independent. We also propose two improved procedures for which we prove FDR control for the independent case, and demonstrate their advantages over several available bounds, on simulated data and on a large number of gene expression data sets. Both applications are simple and involve a similar amount of computation as the original BH procedure. We compare the performance of our proposed procedures with BH and other procedures and find that in most cases we get more power for the same level of statistical significance.

preprint2005arXiv

Taylor series expansions for the entropy rate of Hidden Markov Processes

Finding the entropy rate of Hidden Markov Processes is an active research topic, of both theoretical and practical importance. A recently used approach is studying the asymptotic behavior of the entropy rate in various regimes. In this paper we generalize and prove a previous conjecture relating the entropy rate to entropies of finite systems. Building on our new theorems, we establish series expansions for the entropy rate in two different regimes. We also study the radius of convergence of the two series expansions.

preprint1999arXiv

Flowing sand - a possible physical realization of Directed Percolation

A simple model for flowing sand on an inclined plane is introduced. The model is related to recent experiments by Douady and Daerr [Nature 399, 241 (1999)] and reproduces some of the experimentally observed features. Avalanches of intermediate size appear to be compact, placing the critical behavior of the model into the universality class of compact directed percolation. On very large scales, however, the avalanches break up into several branches leading to a crossover from compact to ordinary directed percolation. Thus, systems of flowing granular matter on an inclined plane could serve as a first physical realization of directed percolation.

preprint1995arXiv

Topological model of soap froth evolution with deterministic T2-processes

We introduce a topological model for the evolution of 2d soap froth. The topological rearrangements (T2 processes) are deterministic (unlike the standard stochastic model): the final topology depends on the areas of the neighboring cells. The new model gives agreement with experiments in the transient regime, where the previous models failed qualitatively, and also improves agreement in the scaling state.