Jump to content

formatdb

From Wikipedia, the free encyclopedia


formatdb is a discontinued software tool that was used in molecular bioinformatics to format protein or nucleotide databases for BLAST. It has been replaced by makeblastdb and the NCBI "strongly encourage[s]"[1] users to stop using formatdb.

formatdb must be used in order to format protein or nucleotide source databases before these databases can be searched by BLAST.[2] The source database may be in either FASTA or ASN.1 format. Although the FASTA format is most often used as input to formatdb, the use of ASN.1 is advantageous for those who are using ASN.1 as the common source for other formats such as the GenBank report.

The opposite of operation of formatdb, extracting sequences from a blast formatted database, can be achieved by using the fastacmd program, which comes in the same package.[3]

In the BLAST+ version, formatdb has been succeeded by makeblastdb.[4]

References

[edit]
  1. ^ "Download BLAST Software and Databases Documentation".
  2. ^ Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ (October 1990). "Basic local alignment search tool". J. Mol. Biol. 215 (3): 403–10. doi:10.1016/S0022-2836(05)80360-2. PMID 2231712. S2CID 14441902.{{cite journal}}: CS1 maint: multiple names: authors list (link)
  3. ^ "formatdb and fastacmd". www.ncbi.nlm.nih.gov. Archived from the original on 2007-09-12.
  4. ^ BLAST Command Line Applications User Manual. NCBI Bookshelf. 2009-10-16.
[edit]
  • NCBI BLAST Database Format — a description of the BLAST database format with a sample program to dump the contents of the database.