IDF Based DNA Sequence Retrieval Data Base: BLAST NT

IDF searching of nucleotide and amino acid sequence data bases provides accurate and the fastest access to sequence data from major online data bases. IDF searching beats all forms of BLAST, including MEGABLAST. To use, replace the sample in the text box below with query sequence in FASTA format. Note: first line with ">" is required. Results will be presented on the next screen. Depending on system activity, searches take about 10 seconds or less. The NT database was last updated Jan 11, 2009. It contains all 7,968,837 sequences of 10,000 or fewer base pairs (sequences longer than 10,000 were not indexed). Click Here for weight distribution. Click here for Mumps Compiler & MDH Homepage

IDF Scoring
Smith Waterman Scoring
Smith Waterman Scoring - show aligns
IDF Weight Factors
Low Wgt:
High Wgt:
No Weighting:    
Retrieve sequences SW Settings     
Gap:
Mismatch:
Match:
Citation:
O'Kane, K.C., The Effect of Inverse Document Frequency Weights on Indexed Sequence Retrieval, Online Journal of Bioinformatics, Volume 6 (2) 162-173, 2005.