Researcher profile

Zeina Shreif

Zeina Shreif contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - Emerging
6works
0followers
9topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

6 published item(s)

preprint2015arXiv

The jigsaw puzzle of sequence phenotype inference: Piecing together Shannon entropy, importance sampling, and Empirical Bayes

A nucleotide sequence 35 base pairs long can take 1,180,591,620,717,411,303,424 possible values. An example of systems biology datasets, protein binding microarrays, contain activity data from about 40000 such sequences. The discrepancy between the number of possible configurations and the available activities is enormous. Thus, albeit that systems biology datasets are large in absolute terms, they oftentimes require methods developed for rare events due to the combinatorial increase in the number of possible configurations of biological systems. A plethora of techniques for handling large datasets, such as Empirical Bayes, or rare events, such as importance sampling, have been developed in the literature, but these cannot always be simultaneously utilized. Here we introduce a principled approach to Empirical Bayes based on importance sampling, information theory, and theoretical physics in the general context of sequence phenotype model induction. We present the analytical calculations that underlie our approach. We demonstrate the computational efficiency of the approach on concrete examples, and demonstrate its efficacy by applying the theory to publicly available protein binding microarray transcription factor datasets and to data on synthetic cAMP-regulated enhancer sequences. As further demonstrations, we find transcription factor binding motifs, predict the activity of new sequences and extract the locations of transcription factor binding sites. In summary, we present a novel method that is efficient (requiring minimal computational time and reasonable amounts of memory), has high predictive power that is comparable with that of models with hundreds of parameters, and has a limited number of optimized parameters, proportional to the sequence length.

preprint2011arXiv

Scaling Behavior of Quantum Nanosystems: Emergence of Quasi-particles, Collective Modes, and Mixed Exchange Symmetry States

Quantum nanosystems such as graphene nanoribbons or superconducting nanoparticles are studied via a multiscale approach. Long space-time dynamics is derived using a perturbation expansion in the ratio of the nearest-neighbor distance to a nanometer-scale characteristic length, and a theorem on the equivalence of long-time averages and expectation values. This dynamics is shown to satisfy a coarse-grained wave equation (CGWE) which takes a Schrödinger-like form with modified masses and interactions. The scaling of space and time is determined by the orders of magnitude of various contributions to the N-body potential. If the spatial scale of the coarse-graining is too large, the CGWE would imply an unbounded growth of gradients; if it is too short, the system's size would display uncontrolled growth inappropriate for the bound states of interest, i.e., collective motion or migration within a stable nano-assembly. The balance of these two extremes removes arbitrariness in the choice of the scaling of space-time. Since the long-scale dynamics of each fermion involves its interaction with many others, we hypothesize that the solutions of the CGWE have mean-field character to good approximation, i.e., can be factorized into single-particle functions. This leads to a Coarse-grained Mean-field (CGMF) approximation that is distinct in character from traditional Hartree-Fock theory. A variational principle is used to derive equations for the single-particle functions. This theme is developed and used to derive an equation for low-lying disturbances from the ground state corresponding to long wavelength density disturbances or long-scale migration. An algorithm for the efficient simulation of quantum nanosystems is suggested.

preprint2010arXiv

Liquid-Crystal Transitions: A First Principles Multiscale Approach

A rigorous theory of liquid-crystal transitions is developed starting from the Liouville equation. The starting point is an all-atom description and a set of order parameter field variables that are shown to evolve slowly via Newton's equations. The separation of timescales between that of atomic collisions and the order parameter fields enables the derivation of rigorous equations for stochastic order parameter field dynamics. When the fields provide a measure of the spatial profile of the probability of molecular position, orientation, and internal structure, a theory of liquid-crystal transitions emerges. The theory uses the all-atom/continuum approach developed earlier to obtain a functional generalization of the Smoluchowski equation wherein key atomic details are embedded. The equivalent non-local Langevin equations are derived and computational aspects are discussed. The theory enables simulations that are much less computationally intensive than molecular dynamics and thus does not require oversimplification of the system's constituent components. The equations obtained do not include factors that require calibration and can thus be applicable to various phase transitions which overcomes the limitations of phenomenological field models. The relation of the theory to phenomenological descriptions of Nematic and Smectic phase transitions, and the possible existence of other types of transitions involving intermolecular structural parameters are discussed.

preprint2010arXiv

Multiscaling for Classical Nanosystems: Derivation of Smoluchowski and Fokker-Planck Equations

Using multiscale analysis and methods of statistical physics, we show that a solution to the N-atom Liouville Equation can be decomposed via an expansion in terms of a smallness parameter epsilon, wherein the long scale time behavior depends upon a reduced probability density that is a function of slow-evolving order parameters. This reduced probability density is shown to satisfy the Smoluchowski equation up to order epsilon squared for a given range of initial conditions. Furthermore, under the additional assumption that the nanoparticle momentum evolves on a slow time scale, we show that this reduced probability density satisfies a Fokker-Planck equation up to the same order in epsilon. This approach applies to a broad range of problems in the nanosciences.

preprint2010arXiv

Self-Assembly of Nanocomponents into Composite Structures: Derivation and Simulation of Langevin Equations

The kinetics of the self-assembly of nanocomponents into a virus, nanocapsule, or other composite structure is analyzed via a multiscale approach. The objective is to achieve predictability and to preserve key atomic-scale features that underlie the formation and stability of the composite structures. We start with an all-atom description, the Liouville equation, and the order parameters characterizing nanoscale features of the system. An equation of Smoluchowski type for the stochastic dynamics of the order parameters is derived from the Liouville equation via a multiscale perturbation technique. The self-assembly of composite structures from nanocomponents with internal atomic structure is analyzed and growth rates are derived. Applications include the assembly of a viral capsid from capsomers, a ribosome from its major subunits, and composite materials from fibers and nanoparticles. Our approach overcomes errors in other coarse-graining methods which neglect the influence of the nanoscale configuration on the atomistic fluctuations. We account for the effect of order parameters on the statistics of the atomistic fluctuations which contribute to the entropic and average forces driving order parameter evolution. This approach enables an efficient algorithm for computer simulation of self-assembly, whereas other methods severely limit the timestep due to the separation of diffusional and complexing characteristic times. Given that our approach does not require recalibration with each new application, it provides a way to estimate assembly rates and thereby facilitate the discovery of self-assembly pathways and kinetic dead-end structures.

preprint2010arXiv

Stochastic Dynamics of Bionanosystems: Multiscale Analysis and Specialized Ensembles

An approach for simulating bionanosystems, such as viruses and ribosomes, is presented. This calibration-free approach is based on an all-atom description for bionanosystems, a universal interatomic force field, and a multiscale perspective. The supramillion-atom nature of these bionanosystems prohibits the use of a direct molecular dynamics approach for phenomena like viral structural transitions or self-assembly that develop over milliseconds or longer. A key element of these multiscale systems is the cross-talk between, and consequent strong coupling of, processes over many scales in space and time. We elucidate the role of interscale cross-talk and overcome bionanosystem simulation difficulties with automated construction of order parameters (OPs) describing supra-nanometer scale structural features, construction of OP dependent ensembles describing the statistical properties of atomistic variables that ultimately contribute to the entropies driving the dynamics of the OPs, and the derivation of a rigorous equation for the stochastic dynamics of the OPs. Since the atomic scale features of the system are treated statistically, several ensembles are constructed that reflect various experimental conditions. The theory provides a basis for a practical, quantitative bionanosystem modeling approach that preserves the cross-talk between the atomic and nanoscale features. A method for integrating information from nanotechnical experimental data in the derivation of equations of stochastic OP dynamics is also introduced.