Powered by OpenAIRE graph
image/svg+xml Jakob Voss, based on art designer at PLoS, modified by Wikipedia users Nina and Beao Closed Access logo, derived from PLoS Open Access logo. This version with transparent background. http://commons.wikimedia.org/wiki/File:Closed_Access_logo_transparent.svg Jakob Voss, based on art designer at PLoS, modified by Wikipedia users Nina and Beao Electrophoresisarrow_drop_down
image/svg+xml Jakob Voss, based on art designer at PLoS, modified by Wikipedia users Nina and Beao Closed Access logo, derived from PLoS Open Access logo. This version with transparent background. http://commons.wikimedia.org/wiki/File:Closed_Access_logo_transparent.svg Jakob Voss, based on art designer at PLoS, modified by Wikipedia users Nina and Beao
Electrophoresis
Article . 1997 . Peer-reviewed
License: Wiley Online Library User Agreement
Data sources: Crossref
Electrophoresis
Article . 1997
versions View all 2 versions

Comparing the predicted and observed properties of proteins encoded in the genome of Escherichia coli K‐12

Authors: A J, Link; K, Robison; G M, Church;

Comparing the predicted and observed properties of proteins encoded in the genome of Escherichia coli K‐12

Abstract

AbstractMining the emerging abundance of microbial genome sequences for hypotheses is an exciting prospect of “functional genomics”. At the forefront of this effort, we compared the predictions of the complete Escherichia coli genomic sequence with the observed gene products by assessing 381 proteins for their mature N‐termini, in vivo abundances, isoelectric points, molecular masses, and cellular locations. Two‐dimensional gel electrophoresis (2‐DE) and Edman sequencing were combined to sequence Coomassie‐stained 2‐DE spots representing the abundant proteins of wild‐type E. coli K‐12 strains. Greater than 90% of the abundant proteins in the E. coli proteome lie in a small isoelectric point and molecular mass window of 4–7 and 10–100 kDa, respectively. We identified several highly abundant proteins, YjbJ, YjbP, YggX, HdeA, and AhpC, which would not have been predicted from the genomic sequence alone. Of the 223 uniquely identified loci, 60% of the encoded proteins are proteolytically processed. As previously reported, the initiator methionine was efficiently cleaved when the penultimate amino acid was serine or alanine. In contrast, when the penultimate amino acid was threonine, glycine, or proline, cleavage was variable, and valine did not signal cleavage. Although signal peptide cleavage sites tended to follow predicted rules, the length of the putative signal sequence was occassionally greater than the consensus. For proteins predicted to be in the cytoplasm or inner membrane, the N‐terminal amino acids were highly constrained compared to proteins localized to the periplasm or outer membrane. Although cytoplasmic proteins follow the N‐end rule for protein stability, proteins in the periplasm or outer membrane do not follow this rule; several have N‐terminal amino acids predicted to destabilize the proteins. Surprisingly, 18% of the identified 2‐DE spots represent isoforms in which protein products of the same gene have different observed pI and Mr, suggesting they are post‐translationally processed. Although most of the predicted and observed values for isoelectric point and molecular mass show reasonable concordance, for several proteins the observed values significantly deviate from the expected values. Such discrepancies may represent either highly processed proteins or misinterpretations of the genomic sequence. Our data suggest that AhpC, CspC, and HdeA exist as covalent homomultimers, and that IcdA exists as at least three isoforms even under conditions in which covalent modification is not predicted. We enriched for proteins based on subcellular location and found several proteins in unexpected subcellular locations.

Related Organizations
Keywords

Sequence Homology, Amino Acid, Molecular Sequence Data, Gene Expression, Peptide Mapping, Molecular Weight, Open Reading Frames, Bacterial Proteins, Escherichia coli, Electrophoresis, Gel, Two-Dimensional, Amino Acid Sequence, Isoelectric Point, Sequence Analysis, Genome, Bacterial

  • BIP!
    Impact byBIP!
    citations
    This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    329
    popularity
    This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
    Top 1%
    influence
    This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    Top 1%
    impulse
    This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
    Top 1%
Powered by OpenAIRE graph
citations
This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Citations provided by BIP!
popularity
This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
BIP!Popularity provided by BIP!
influence
This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Influence provided by BIP!
impulse
This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
BIP!Impulse provided by BIP!
329
Top 1%
Top 1%
Top 1%