query is a nucleotide or protein sequence, not a text term character string comparison against all the sequences in the target database rigorous statistics used to identify statistically significant matches