Send to

Choose Destination
Int Arch Allergy Immunol. 2002 Aug;128(4):280-91.

Bioinformatic methods for allergenicity assessment using a comprehensive allergen database.

Author information

Monsanto Company, Product Safety Center, St. Louis, Mo., USA.



A principal aim of the safety assessment of genetically modified crops is to prevent the introduction of known or clinically cross-reactive allergens. Current bioinformatic tools and a database of allergens and gliadins were tested for the ability to identify potential allergens by analyzing 6 Bacillus thuringiensis insecticidal proteins, 3 common non-allergenic food proteins and 50 randomly selected corn (Zea mays) proteins.


Protein sequences were compared to allergens using the FASTA algorithm and by searching for matches of 6, 7 or 8 contiguous identical amino acids.


No significant sequence similarities or matches of 8 contiguous amino acids were found with the B. thuringiensis or food proteins. Surprisingly, 41 of 50 corn proteins matched at least one allergen with 6 contiguous identical amino acids. Only 7 of 50 corn proteins matched an allergen with 8 contiguous identical amino acids. When assessed for overall structural similarity to allergens, these 7 plus 2 additional corn proteins shared >or=35% identity in an overlap of >or=80 amino acids, but only 6 of the 7 were similar across the length of the protein, or shared >50% identity to an allergen.


An evaluation of a protein by the FASTA algorithm is the most predictive of a clinically relevant cross-reactive allergen. An additional search for matches of 8 amino acids may provide an added margin of safety when assessing the potential allergenicity of a protein, but a search with a 6-amino-acid window produces many random, irrelevant matches.

[Indexed for MEDLINE]

Supplemental Content

Full text links

Icon for S. Karger AG, Basel, Switzerland
Loading ...
Support Center