Send to

Choose Destination
Proteins. 2015 Sep;83(9):1593-603. doi: 10.1002/prot.24840. Epub 2015 Aug 1.

Identification of universal selectivity-determining positions in cytochrome P450 monooxygenases by systematic sequence-based literature mining.

Author information

Institute of Technical Biochemistry, University of Stuttgart, Allmandring 31, 70569, Stuttgart, Germany.


Cytochrome P450 monooxygenases (CYPs) are a large, highly diverse protein family with a common fold. The sequences, structures, and functions of CYPs have been extensively studied resulting in more than 53,000 scientific articles. A sequence-based literature mining algorithm was designed to systematically analyze this wealth of information on SNPs, designed mutations, structural interactions, or functional roles of individual residues. Structurally corresponding positions in different CYPs were compared and universal selectivity-determining positions were identified. Based on the Cytochrome P450 Engineering Database ( and a standard numbering scheme for all CYPs, 4000 residues in 168 CYPs mentioned in 2400 articles could be assigned to 440 structurally corresponding standard positions of the CYP fold, covering 96% of all standard positions. Seventeen individual standard positions were mentioned in the context of more than 32 different CYPs. The majority of these most frequently mentioned positions are located on the six substrate recognition sites and are involved in control of selectivity, such as the well-studied position 87 in CYP102A1 (P450(BM-3)) which was mentioned in the articles on 63 different CYPs. The recurrent citation of the 17 frequently mentioned positions for different CYPs suggests their universal functional relevance.


CYP; data mining; database; hotspot; mutant

[Indexed for MEDLINE]

Supplemental Content

Full text links

Icon for Wiley
Loading ...
Support Center