Skip Navigation


Briefings in Bioinformatics Advance Access originally published online on March 15, 2008
Briefings in Bioinformatics 2008 9(3):210-219; doi:10.1093/bib/bbn010
This Article
Right arrow Full Text
Right arrow Full Text (PDF)
Right arrow All Versions of this Article:
9/3/210    most recent
bbn010v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (12)
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Sammut, S. J.
Right arrow Articles by Bateman, A.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Sammut, S. J.
Right arrow Articles by Bateman, A.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© The Author 2008. Published by Oxford University Press. For Permissions, please email: journals.permissions@oxfordjournals.org

Pfam 10 years on: 10 000 families and still growing

Stephen John Sammut, Robert D. Finn and Alex Bateman

Corresponding author. Alex Bateman, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton Hall, Hinxton, Cambridgeshire, CB10 1SA, UK. Tel: 44-1223-494950; Fax: 44-1223-494919; E-mail: agb{at}sanger.ac.uk

Classifications of proteins into groups of related sequences are in some respects like a periodic table for biology, allowing us to understand the underlying molecular biology of any organism. Pfam is a large collection of protein domains and families. Its scientific goal is to provide a complete and accurate classification of protein families and domains. The next release of the database will contain over 10 000 entries, which leads us to reflect on how far we are from completing this work. Currently Pfam matches 72% of known protein sequences, but for proteins with known structure Pfam matches 95%, which we believe represents the likely upper bound. Based on our analysis a further 28 000 families would be required to achieve this level of coverage for the current sequence database. We also show that as more sequences are added to the sequence databases the fraction of sequences that Pfam matches is reduced, suggesting that continued addition of new families is essential to maintain its relevance.

Keywords: Pfam, protein families, classification, coverage, hidden Markov model

Submitted: October 15, 2007. Received (in revised form): February 6, 2008.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
DNA ResHome page
K. Mochida, T. Yoshida, T. Sakurai, K. Yamaguchi-Shinozaki, K. Shinozaki, and L.-S. P. Tran
In silico Analysis of Transcription Factor Repertoire and Prediction of Stress Responsive Transcription Factors in Soybean
DNA Res, December 1, 2009; 16(6): 353 - 369.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
A. Schlicker and M. Albrecht
FunSimMat update: new features for exploring functional similarity
Nucleic Acids Res., November 18, 2009; (2009) gkp979v1.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
M. Levitt
Nature of the protein universe
PNAS, July 7, 2009; 106(27): 11079 - 11084.
[Abstract] [Full Text] [PDF]


Home page
Brief BioinformHome page
J. Skolnick and M. Brylinski
FINDSITE: a combined evolution/structure-based approach to protein function prediction
Brief Bioinform, July 1, 2009; 10(4): 378 - 391.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
A. Rose, S. Lorenzen, A. Goede, B. Gruening, and P. W. Hildebrand
RHYTHM--a server to predict the orientation of transmembrane helices in channels and membrane-coils
Nucleic Acids Res., July 1, 2009; 37(suppl_2): W575 - W580.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
L. G. Pell, V. Kanelis, L. W. Donaldson, P. Lynne Howell, and A. R. Davidson
The phage {lambda} major tail protein structure reveals a common evolution for long-tailed phages and the type VI bacterial secretion system
PNAS, March 17, 2009; 106(11): 4160 - 4165.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
F. Diella, S. Chabanis, K. Luck, C. Chica, C. Ramu, C. Nerlov, and T. J. Gibson
KEPE--a motif frequently superimposed on sumoylation sites in metazoan chromatin proteins and transcription factors
Bioinformatics, January 1, 2009; 25(1): 1 - 5.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
T. Lima, A. H. Auchincloss, E. Coudert, G. Keller, K. Michoud, C. Rivoire, V. Bulliard, E. de Castro, C. Lachaize, D. Baratin, et al.
HAMAP: a database of completely sequenced microbial proteome sets and manually curated microbial protein families in UniProtKB/Swiss-Prot
Nucleic Acids Res., January 1, 2009; 37(suppl_1): D471 - D478.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
S. Rajasekaran, S. Balla, P. Gradie, M. R. Gryk, K. Kadaveru, V. Kundeti, M. W. Maciejewski, T. Mi, N. Rubino, J. Vyas, et al.
Minimotif miner 2nd release: a database and web system for motif search
Nucleic Acids Res., January 1, 2009; 37(suppl_1): D185 - D190.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
D. J. Rigden and M. Y. Galperin
Sequence analysis of GerM and SpoVS, uncharacterized bacterial 'sporulation' proteins with widespread phylogenetic distribution
Bioinformatics, August 15, 2008; 24(16): 1793 - 1797.
[Abstract] [Full Text] [PDF]



Disclaimer: Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.