FEBS Lett. 2006 Mar 6;580(6):1649-53. Epub 2006 Feb 17.

A complete small molecule dataset from the protein data bank.

Author information

The Blueprint Initiative, Suite 101, 200 Elm Street, Toronto, Ont., Canada M5T 1K4.


A complete set of 6300 small molecule ligands was extracted from the Protein Data Bank and deposited online in PubChem as data source 'SMID'. The set's major improvement over prior methods is the inclusion of cyclic polypeptides and branched polysaccharides, with an unambiguous nomenclature, in addition to normal monomeric ligands. Only the best available example of each ligand structure is retained, and an additional dataset is maintained containing coordinates for all examples of each structure. Attempts are made to correct ambiguous atomic elements and other common errors, and a perception algorithm is used to determine bond order and aromaticity when no other information is available.
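
The split between a "best example" dataset and an "all examples" dataset can be illustrated with a minimal sketch. The abstract does not state how the best example is chosen, so the ranking below (fewest missing atoms, then finest resolution) is purely an assumption, and the names `LigandExample`, `structure_key`, `quality`, and `split_datasets` are hypothetical, as are the PDB identifiers in the usage lines.

```python
from dataclasses import dataclass
from typing import Dict, Iterable, List, Tuple


@dataclass(frozen=True)
class LigandExample:
    """One occurrence of a ligand in a PDB entry (hypothetical record layout)."""
    structure_key: str   # assumed canonical identifier for the ligand structure
    pdb_id: str          # entry the example was taken from
    resolution: float    # in angstroms; a smaller value means finer detail
    missing_atoms: int   # atoms of the ligand absent from the coordinates


def quality(example: LigandExample) -> Tuple[int, float]:
    # Assumed ranking, not the paper's criterion:
    # prefer fewer missing atoms, then finer resolution.
    return (example.missing_atoms, example.resolution)


def split_datasets(
    examples: Iterable[LigandExample],
) -> Tuple[Dict[str, LigandExample], Dict[str, List[LigandExample]]]:
    """Return (best example per structure, all examples per structure)."""
    all_examples: Dict[str, List[LigandExample]] = {}
    for example in examples:
        all_examples.setdefault(example.structure_key, []).append(example)
    best = {key: min(group, key=quality) for key, group in all_examples.items()}
    return best, all_examples


# Usage with made-up records.
records = [
    LigandExample("ATP", "1ABC", 2.5, 0),
    LigandExample("ATP", "2DEF", 1.8, 0),
    LigandExample("ATP", "3GHI", 1.2, 3),
    LigandExample("HEM", "4JKL", 2.0, 1),
]
best, everything = split_datasets(records)
print(best["ATP"].pdb_id, len(everything["ATP"]))
```

Keeping both mappings mirrors the two datasets described above: one representative per structure for deposition, and the full set of coordinates for every occurrence.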

[Indexed for MEDLINE]