Source author record

Eugene I. Shakhnovich

Eugene I. Shakhnovich appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Catalog footprint

What is connected

12works
10topics
4close collaborators

Actions

Connect this record

Log in to claim

Research graph

See the researcher in context

Open full explorer

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

12 published item(s)

preprint2016arXiv

Structure-based prediction of protein-folding transition paths

We propose a general theory to describe the distribution of protein-folding transition paths. We show that transition paths follow a predictable sequence of high-free-energy transient states that are separated by free-energy barriers. Each transient state corresponds to the assembly of one or more discrete, cooperative units, which are determined directly from the native structure. We show that the transition state on a folding pathway is reached when a small number of critical contacts are formed between a specific set of substructures, after which folding proceeds downhill in free energy. This approach suggests a natural resolution for distinguishing parallel folding pathways and provides a simple means to predict the rate-limiting step in a folding reaction. Our theory identifies a common folding mechanism for proteins with diverse native structures and establishes general principles for the self-assembly of polymers with specific interactions.

preprint2012arXiv

Soluble oligomerization provides a beneficial fitness effect on destabilizing mutations

Mutations create the genetic diversity on which selective pressures can act, yet also create structural instability in proteins. How, then, is it possible for organisms to ameliorate mutation-induced perturbations of protein stability while maintaining biological fitness and gaining a selective advantage? Here we used a new technique of site-specific chromosomal mutagenesis to introduce a selected set of mostly destabilizing mutations into folA - an essential chromosomal gene of E. coli encoding dihydrofolate reductase (DHFR) - to determine how changes in protein stability, activity and abundance affect fitness. In total, 27 E.coli strains carrying mutant DHFR were created. We found no significant correlation between protein stability and its catalytic activity nor between catalytic activity and fitness in a limited range of variation of catalytic activity observed in mutants. The stability of these mutants is strongly correlated with their intracellular abundance; suggesting that protein homeostatic machinery plays an active role in maintaining intracellular concentrations of proteins. Fitness also shows a significant correlation with intracellular abundance of soluble DHFR in cells growing at 30oC. At 42oC, on the other hand, the picture was mixed, yet remarkable: a few strains carrying mutant DHFR proteins aggregated rendering them nonviable, but, intriguingly, the majority exhibited fitness higher than wild type. We found that mutational destabilization of DHFR proteins in E. coli is counterbalanced at 42oC by their soluble oligomerization, thereby restoring structural stability and protecting against aggregation.

preprint2011arXiv

Multi-scale sequence correlations increase proteome structural disorder and promiscuity

Numerous experiments demonstrate a high level of promiscuity and structural disorder in organismal proteomes. Here we ask the question what makes a protein promiscuous, i.e., prone to non-specific interactions, and structurally disordered. We predict that multi-scale correlations of amino acid positions within protein sequences statistically enhance the propensity for promiscuous intra- and inter-protein binding. We show that sequence correlations between amino acids of the same type are statistically enhanced in structurally disordered proteins and in hubs of organismal proteomes. We also show that structurally disordered proteins possess a significantly higher degree of sequence order than structurally ordered proteins. We develop an analytical theory for this effect and predict the robustness of our conclusions with respect to the amino acid composition and the form of the microscopic potential between the interacting sequences. Our findings have implications for understanding molecular mechanisms of protein aggregation diseases induced by the extension of sequence repeats.

preprint2011arXiv

Sequence correlations shape protein promiscuity

We predict analytically that diagonal correlations of amino acid positions within protein sequences statistically enhance protein propensity for nonspecific binding. We use the term 'promiscuity' to describe such nonspecific binding. Diagonal correlations represent statistically significant repeats of sequence patterns where amino acids of the same type are clustered together. The predicted effect is qualitatively robust with respect to the form of the microscopic interaction potentials and the average amino acid composition. Our analytical results provide an explanation for the enhanced diagonal correlations observed in hubs of eukaryotic organismal proteomes [J. Mol. Biol. 409, 439 (2011)]. We suggest experiments that will allow direct testing of the predicted effect.

preprint2010arXiv

Optimality of mutation and selection in germinal centers

The population dynamics theory of B cells in a typical germinal center could play an important role in revealing how affinity maturation is achieved. However, the existing models encountered some conflicts with experiments. To resolve these conflicts, we present a coarse-grained model to calculate the B cell population development in affinity maturation, which allows a comprehensive analysis of its parameter space to look for optimal values of mutation rate, selection strength, and initial antibody-antigen binding level that maximize the affinity improvement. With these optimized parameters, the model is compatible with the experimental observations such as the ~100-fold affinity improvements, the number of mutations, the hypermutation rate, and the "all or none" phenomenon. Moreover, we study the reasons behind the optimal parameters. The optimal mutation rate, in agreement with the hypermutation rate in vivo, results from a tradeoff between accumulating enough beneficial mutations and avoiding too many deleterious or lethal mutations. The optimal selection strength evolves as a balance between the need for affinity improvement and the requirement to pass the population bottleneck. These findings point to the conclusion that germinal centers have been optimized by evolution to generate strong affinity antibodies effectively and rapidly. In addition, we study the enhancement of affinity improvement due to B cell migration between germinal centers. These results could enhance our understandings to the functions of germinal centers.

preprint2010arXiv

Protein abundances and interactions coevolve to promote functional complexes while suppressing non-specific binding

How do living cells achieve sufficient abundances of functional protein complexes while minimizing promiscuous non-functional interactions? Here we study this problem using a first-principle model of the cell whose phenotypic traits are directly determined from its genome through biophysical properties of protein structures and binding interactions in crowded cellular environment. The model cell includes three independent prototypical pathways, whose topologies of Protein-Protein Interaction (PPI) sub-networks are different, but whose contributions to the cell fitness are equal. Model cells evolve through genotypic mutations and phenotypic protein copy number variations. We found a strong relationship between evolved physical-chemical properties of protein interactions and their abundances due to a "frustration" effect: strengthening of functional interactions brings about hydrophobic interfaces, which make proteins prone to promiscuous binding. The balancing act is achieved by lowering concentrations of hub proteins while raising solubilities and abundances of functional monomers. Based on these principles we generated and analyzed a possible realization of the proteome-wide PPI network in yeast. In this simulation we found that high-throughput affinity capture - mass spectroscopy experiments can detect functional interactions with high fidelity only for high abundance proteins while missing most interactions for low abundance proteins.

preprint2009arXiv

Slowly replicating lytic viruses: pseudolysogenic persistence and within-host competition

We study the population dynamics of lytic viruses which replicate slowly in dividing host cells within an organism or cell culture, and find a range of viral replication rates that allows viruses to persist, avoiding extinction of host cells or dilution of viruses at too rapid or too slow viral replication. For the within-host competition between multiple viral strains, a strain with a "stable" replication rate could outcompete another strain with a higher or lower replication rate, therefore natural selection of viruses stabilizes the viral persistence. However, when strains with higher and lower than the "stable" value replication rates are both present, competition between strains does not result in dominance of one strain, but in their coexistence.

preprint2009arXiv

Thymic selection of T-cell receptors as an extreme value problem

T lymphocytes (T cells) orchestrate adaptive immune responses upon activation. T cell activation requires sufficiently strong binding of T cell receptors (TCRs) on their surface to short peptides (p) derived from foreign proteins, which are bound to major histocompatibility (MHC) gene products (displayed on antigen presenting cells). A diverse and self-tolerant T cell repertoire is selected in the thymus. We map thymic selection processes to an extreme value problem and provide an analytic expression for the amino acid compositions of selected TCRs (which enable its recognition functions).

preprint2008arXiv

Sensitivity dependent model of protein-protein interaction networks

The scale free structure p(k)~k^{-gamma} of protein-protein interaction networks can be reproduced by a static physical model in simulation. We inspect the model theoretically, and find the key reason for the model to generate apparent scale free degree distributions. This explanation provides a generic mechanism of "scale free" networks. Moreover, we predict the dependence of gamma on experimental protein concentrations or other sensitivity factors in detecting interactions, and find experimental evidence to support the prediction.

preprint2007arXiv

Positive and negative design in stability and thermal adaptation of natural proteins

The aim of this work is to elucidate how physical principles of protein design are reflected in natural sequences that evolved in response to the thermal conditions of the environment. Using an exactly solvable lattice model, we design sequences with selected thermal properties. Compositional analysis of designed model sequences and natural proteomes reveals a specific trend in amino acid compositions in response to the requirement of stability at elevated environmental temperature, i.e. the increase of fractions of hydrophobic and charged amino acid residues at the expense of polar ones. We show that this from both ends of hydrophobicity scale trend is due to positive (to stabilize the native state) and negative (to destabilize misfolded states) components of protein design. Negative design strengthens specific repulsive nonnative interactions that appear in misfolded structures. A pressure to preserve specific repulsive interactions in non-native conformations may result in correlated mutations between amino acids which are far apart in the native state but may be in contact in misfolded conformations. Such correlated mutations are indeed found in TIM barrel and other proteins.

preprint2006arXiv

Protein and DNA sequence determinants of thermophilic adaptation

Prokaryotes living at extreme environmental temperatures exhibit pronounced signatures in the amino acid composition of their proteins and nucleotide compositions of their genomes reflective of adaptation to their thermal environments. However, despite significant efforts, the definitive answer of what are the genomic and proteomic compositional determinants of Optimal Growth Temperature of prokaryotic organisms remained elusive. Here the authors performed a comprehensive analysis of amino acid and nucleotide compositional signatures of thermophylic adaptation by exhaustively evaluating all combinations of amino acids and nucleotides as possible determinants of Optimal Growth Temperature for all prokaryotic organisms with fully sequences genomes.. The authors discovered that total concentration of seven amino acids in proteomes, IVYWREL, serves as a universal proteomic predictor of Optimal Growth Temperature in prokaryotes. Resolving the old-standing controversy the authors determined that the variation in nucleotide composition (increase of purine load, or A+G content with temperature) is largely a consequence of thermal adaptation of proteins. However, the frequency with which A and G nucleotides appear as nearest neighbors in genome sequences is strongly and independently correlated with Optimal Growth Temperature. as a result of codon bias in corresponding genomes. Together these results provide a complete picture of proteomic and genomic determinants of thermophilic adaptation.

preprint2005arXiv

Entropic stabilization of proteins and its proteomic consequences

We report here a new entropic mechanism of protein thermostability due to residual dynamics of rotamer isomerization in native state. All-atom simulations show that Lysines have much greater number of accessible rotamers than Arginines in folded states of proteins. This finding suggests that Lysines would preferentially entropically stabilize the native state. Indeed we show in computational experiments that Arginine-to-Lysine amino acid substitutions result in noticeable stabilization of proteins. We then hypothesize that if evolution uses this physical mechanisms in its strategies of thermophilic adaptation then hyperthermostable organisms would have much greater content of Lysines in their proteomes than of comparable in size and similarly charged Arginines.. Consistent with that, high-throughput comparative analysis of complete proteomes shows extremely strong bias towards Arginine-to-Lysine replacement in hyperthermophilic organisms and overall much greater content of Lysines than Arginines in hyperthermophiles. This finding cannot be explained by GC compositional biases. Our study provides an example of how analysis of a delicate physical mechanism of thermostability helps to resolve a puzzle in comparative genomics as to why aminoacid compositions of hyperthermophilic proteomes are significantly biased towards Lysines but not Arginines