blastdbcmd
Extract sequences from BLAST databases
TLDR
Get database information
SYNOPSIS
blastdbcmd [-db database] [-entry id] [options]
DESCRIPTION
blastdbcmd is a utility for extracting sequences and metadata from BLAST databases. It can retrieve individual sequences by accession, extract all sequences, display database statistics, and generate custom reports.
The tool is part of the NCBI BLAST+ suite and works with databases created by makeblastdb or downloaded from NCBI.
PARAMETERS
-db name
BLAST database name or path-entry id
Sequence identifier(s) to retrieve; use "all" for entire database-entry_batch file
File containing list of sequence identifiers-out file
Output file (default: stdout)-outfmt format
Custom output format string-info
Display database information-list path
List databases in specified path-recursive
Search directories recursively (with -list)-show_blastdb_search_path
Display BLAST database search paths-tax_info
Display taxonomy information (requires taxonomy database)-range start-stop
Extract subsequence range
OUTPUT FORMAT TOKENS
%a - Accession
%t - Title
%s - Sequence
%l - Sequence length
%T - Taxonomy ID
%S - Scientific name
%L - Common name
CAVEATS
Requires pre-formatted BLAST databases. Taxonomy information requires the BLAST taxonomy database to be installed. Large extractions may require significant time and disk space.
SEE ALSO
makeblastdb(1), blastn(1), blastp(1), blastx(1)
