Format

Send to

Choose Destination
Acta Crystallogr D Biol Crystallogr. 2007 Sep;63(Pt 9):941-50. Epub 2007 Aug 17.

Quality of protein crystal structures.

Author information

1
University of Iowa, Department of Biochemistry, Iowa City, IA, USA.

Abstract

The genomics era has seen the propagation of numerous databases containing easily accessible data that are routinely used by investigators to interpret results and generate new ideas. Most investigators consider data extracted from scientific databases to be error-free. However, data generated by all experimental techniques contain errors and some, including the coordinates in the Protein Data Bank (PDB), also integrate the subjective interpretations of experimentalists. This paper explores the determinants of protein structure quality metrics used routinely by protein crystallographers. These metrics are available for most structures in the database, including the R factor, R(free), real-space correlation coefficient, Ramachandran violations etc. All structures in the PDB were analyzed for their overall quality based on nine different quality metrics. Multivariate statistical analysis revealed that while technological improvements have increased the number of structures determined, the overall quality of structures has remained constant. The quality of structures deposited by structural genomics initiatives are generally better than the quality of structures from individual investigator laboratories. The most striking result is the association between structure quality and the journal in which the structure was first published. The worst offenders are the apparently high-impact general science journals. The rush to publish high-impact work in the competitive atmosphere may have led to the proliferation of poor-quality structures.

PMID:
17704562
DOI:
10.1107/S0907444907033847
[Indexed for MEDLINE]

Supplemental Content

Full text links

Icon for International Union of Crystallography
Loading ...
Support Center