Command line reference for PASR

Table of contents

 

pasr

usage: aastk pasr [-h] -q QUERY -s SEED [-o OUTPUT] [-m MATRIX] [-n THREADS] [-d DB_PATH] [-k KEY_COLUMN] [-b BLOCK] [-c CHUNK] [--sensitivity {fast,sensitive,mid-sensitive,very-sensitive,ultra-sensitive,faster}] [--update] [-y YAML] [--sql] [--keep] [--svg] [--force]

Options:

-h, --help
        Show this help message and exit

Required arguments:

-q, --query QUERY
        Path to query FASTA
-s, --seed SEED
        Path to FASTA files containing seed sequences for database creation

Optional:

-o, --output OUTPUT
        Desired output directory (default: current working directory)
-m, --matrix MATRIX
        Choose BLOSUM substitution matrix (BLOSUM 45 or BLOSUM 62)
-n, --threads THREADS
        Number of threads to be used (default: 1)
-d, --db_path DB_PATH
        Path to SQLite database
-k, --key_column KEY_COLUMN
        Column index in the BLAST tab file to pull unique IDs from (default is 0)
-b, --block BLOCK
        Choose diamond blastp sequence block size in billions of letters (default: 6)
-c, --chunk CHUNK
        Choose number of chunks for diamond blastp index processing (default: 2)
--sensitivity {fast,sensitive,mid-sensitive,very-sensitive,ultra-sensitive,faster}
        Set the sensitivity level for the DIAMOND search: fast, sensitive, mid-sensitive, very-sensitive, ultra-sensitive, or faster (default: fast)
--update
        Update subset of data using metadata yaml file
-y, --yaml YAML
        Path to metadata yaml file
--sql
        Use AASTK SQLite database for sequence retrieval
--keep
        Keep intermediate files in specified paths
--svg
        Generate plot in SVG format
--force
        Set flag to overwrite existing files in specified path

 

build

usage: aastk build [-h] -s SEED -d DB [-n THREADS] [--force]

Options:

-h, --help
        Show this help message and exit

Required arguments:

-s, --seed SEED
        Path to FASTA files containing seed sequences for database creation
-d, --db DB
        Specify path to DIAMOND database

Optional:

-n, --threads THREADS
        Number of threads to be used (default: 1)
--force
        Set flag to overwrite existing files in specified path

 

usage: aastk search [-h] -d DB -q QUERY [-o OUTPUT] [-n THREADS] [-b BLOCK] [-c CHUNK] [--sensitivity {fast,sensitive,mid-sensitive,very-sensitive,ultra-sensitive,faster}] [--force]

Options:

-h, --help
        Show this help message and exit

Required arguments:

-d, --db DB
        Specify path to DIAMOND database
-q, --query QUERY
        Path to query FASTA

Optional:

-o, --output OUTPUT
        Desired output directory (default: current working directory)
-n, --threads THREADS
        Number of threads to be used (default: 1)
-b, --block BLOCK
        Choose diamond blastp sequence block size in billions of letters (default: 6)
-c, --chunk CHUNK
        Choose number of chunks for diamond blastp index processing (default: 2)
--sensitivity {fast,sensitive,mid-sensitive,very-sensitive,ultra-sensitive,faster}
        Set the sensitivity level for the DIAMOND search: fast, sensitive, mid-sensitive, very-sensitive, ultra-sensitive, or faster (default: fast)
--force
        Set flag to overwrite existing files in specified path

 

get_hit_seqs

usage: aastk get_hit_seqs [-h] (--sql | -q QUERY) -t TABULAR [-o OUTPUT] [-d DB_PATH] [-n THREADS] [-k KEY_COLUMN] [--force]

Options:

-h, --help
        Show this help message and exit

Mutually exclusive required arguments:

--sql
        Use AASTK SQLite database for sequence retrieval
-q, --query QUERY
        Path to query FASTA

Required arguments:

-t, --tabular TABULAR
        Path to tabular BLAST/DIAMOND output file

Optional:

-o, --output OUTPUT
        Desired output directory (default: current working directory)
-d, --db_path DB_PATH
        Path to SQLite database
-n, --threads THREADS
        Number of threads to be used (default: 1)
-k, --key_column KEY_COLUMN
        Column index in the BLAST tab file to pull unique IDs from (default is 0)
--force
        Set flag to overwrite existing files in specified path

 

max_score

usage: aastk max_score [-h] -e EXTRACTED [-o OUTPUT] [-m MATRIX] [--force]

Options:

-h, --help
        Show this help message and exit

Required arguments:

-e, --extracted EXTRACTED
        Path to FASTA file containing extracted matching sequences

Optional:

-o, --output OUTPUT
        Desired output directory (default: current working directory)
-m, --matrix MATRIX
        Choose BLOSUM substitution matrix (BLOSUM 45 or BLOSUM 62)
--force
        Set flag to overwrite existing files in specified path

 

bsr

usage: aastk bsr [-h] (-c COLUMN_INFO_PATH | -s SCORE_COLUMN) -t TABULAR -m MAX_SCORES [-o OUTPUT] [-k KEY_COLUMN] [--force]

Options:

-h, --help
        Show this help message and exit

Mutually exclusive required arguments:

-c, --column_info_path COLUMN_INFO_PATH
        Path to json file containing columns the blast output file generated by aastk search
-s, --score_column SCORE_COLUMN
        Index of the raw score column of BLAST tabular output file. Counting starts at 1

Required arguments:

-t, --tabular TABULAR
        Path to tabular BLAST/DIAMOND output file
-m, --max_scores MAX_SCORES
        Path to file containing max self scores

Optional:

-o, --output OUTPUT
        Desired output directory (default: current working directory)
-k, --key_column KEY_COLUMN
        Column index in the BLAST tab file to pull unique IDs from (default is 0)
--force
        Set flag to overwrite existing files in specified path

 

pasr_plot

usage: aastk pasr_plot [-h] -b BSR [-o OUTPUT] [--svg] [--force] [--update] [-y YAML]

Options:

-h, --help
        Show this help message and exit

Required arguments:

-b, --bsr BSR
        Path to blast score ratio input file

Optional:

-o, --output OUTPUT
        Desired output directory (default: current working directory)
--svg
        Generate plot in SVG format
--force
        Set flag to overwrite existing files in specified path
--update
        Update subset of data using metadata yaml file
-y, --yaml YAML
        Path to metadata yaml file

 

pasr_select

usage: aastk pasr_select [-h] (-y YAML | --params) -m MATCHED --bsr BSR [-o OUTPUT] [--create_yaml] [-l SELFMIN] [-u SELFMAX] [-d DBMIN] [-b BSR_CUTOFF] [--force]

Options:

-h, --help
        Show this help message and exit

Mutually exclusive required arguments:

-y, --yaml YAML
        Path to metadata yaml file
--params
        Select subset of matched sequences based on command line parameters

Required arguments:

-m, --matched MATCHED
        FASTA file containing matched sequences from previous PASR run
--bsr BSR
        Path to blast score ratio input file

Optional:

-o, --output OUTPUT
        Desired output directory (default: current working directory)
--create_yaml
        Create metadata yaml file from command line parameters
-l, --selfmin SELFMIN
        Lower cutoff for self score range to be included in updated dataset
-u, --selfmax SELFMAX
        Upper cutoff for self score range to be included in updated dataset
-d, --dbmin DBMIN
        Lower database score cutoff for inclusion in updated dataset
-b, --bsr_cutoff BSR_CUTOFF
        Blast Score Ratio cutoff for inclusion in updated dataset
--force
        Set flag to overwrite existing files in specified path