Command line reference for CASM

Table of contents

 

casm

usage: aastk casm [-h] (--subset SUBSET | --subset_size SUBSET_SIZE) --fasta FASTA [-n THREADS] [-o OUTPUT] [-d DB_PATH] [-p PERPLEXITY] [-i ITERATIONS] [--exaggeration EXAGGERATION] [--metadata_protein METADATA_PROTEIN] [--force] [-e EARLY_CLUST] [-f FULL_CLUST] [-m MATRIX_PATH] [--metadata_matrix METADATA_MATRIX] [--keep] [-s] [--svg]

Options:

-h, --help
        Show this help message and exit

Mutually exclusive required arguments:

--subset SUBSET
        Path to subset fasta to use as DIAMOND alignment reference set
--subset_size SUBSET_SIZE
        Number of sequences to randomly subset from input FASTA file

Required arguments:

--fasta FASTA
        Path to FASTA file

Optional:

-n, --threads THREADS
        Number of threads to be used (default: 1)
-o, --output OUTPUT
        Desired output directory (default: current working directory)
-d, --db_path DB_PATH
        Path to SQLite database
-p, --perplexity PERPLEXITY
        Perplexity value for tSNE clustering
-i, --iterations ITERATIONS
        Number of clustering iterations
--exaggeration EXAGGERATION
        Exaggeration value for tSNE clustering
--metadata_protein METADATA_PROTEIN
        Select metadata for plotting; run "aastk metadata_categories" to view available categories
--force
        Set flag to overwrite existing files in specified path
-e, --early_clust EARLY_CLUST
        Path to early clustering TSV file
-f, --full_clust FULL_CLUST
        Path to full clustering TSV file
-m, --matrix_path MATRIX_PATH
        Path to .npy file containing alignment matrix
--metadata_matrix METADATA_MATRIX
        Path to matrix metadata file
--keep
        Keep intermediate files in specified paths
-s, --show
        Show cluster number at cluster centers in output plots
--svg
        Generate plot in SVG format

 

matrix

usage: aastk matrix [-h] (-s SUBSET | --subset_size SUBSET_SIZE) -f FASTA [-o OUTPUT] [-n THREADS] [--force]

Options:

-h, --help
        Show this help message and exit

Mutually exclusive required arguments:

-s, --subset SUBSET
        Path to subset fasta to use as DIAMOND alignment reference set
--subset_size SUBSET_SIZE
        Number of sequences to randomly subset from input FASTA file

Required arguments:

-f, --fasta FASTA
        Path to FASTA file

Optional:

-o, --output OUTPUT
        Desired output directory (default: current working directory)
-n, --threads THREADS
        Number of threads to be used (default: 1)
--force
        Set flag to overwrite existing files in specified path

 

cluster

usage: aastk cluster [-h] -m MATRIX_PATH --metadata_matrix METADATA_MATRIX [-o OUTPUT] [-p PERPLEXITY] [-i ITERATIONS] [-e EXAGGERATION] [-n THREADS] [--force]

Options:

-h, --help
        Show this help message and exit

Required arguments:

-m, --matrix_path MATRIX_PATH
        Path to .npy file containing alignment matrix
--metadata_matrix METADATA_MATRIX
        Path to matrix metadata file

Optional:

-o, --output OUTPUT
        Desired output directory (default: current working directory)
-p, --perplexity PERPLEXITY
        Perplexity value for tSNE clustering
-i, --iterations ITERATIONS
        Number of clustering iterations
-e, --exaggeration EXAGGERATION
        Exaggeration value for tSNE clustering
-n, --threads THREADS
        Number of threads to be used (default: 1)
--force
        Set flag to overwrite existing files in specified path

 

casm_plot

usage: aastk casm_plot [-h] -e EARLY_CLUST -f FULL_CLUST [-o OUTPUT] [-d DB_PATH] [--metadata_protein METADATA_PROTEIN] [-s] [--svg] [--force]

Options:

-h, --help
        Show this help message and exit

Required arguments:

-e, --early_clust EARLY_CLUST
        Path to early clustering TSV file
-f, --full_clust FULL_CLUST
        Path to full clustering TSV file

Optional:

-o, --output OUTPUT
        Desired output directory (default: current working directory)
-d, --db_path DB_PATH
        Path to SQLite database
--metadata_protein METADATA_PROTEIN
        Select metadata for plotting; run "aastk metadata_categories" to view available categories
-s, --show
        Show cluster number at cluster centers in output plots
--svg
        Generate plot in SVG format
--force
        Set flag to overwrite existing files in specified path

 

casm_select

usage: aastk casm_select [-h] --full_clust FULL_CLUST -f FASTA -n NO_CLUSTER [-o OUTPUT] [--force]

Options:

-h, --help
        Show this help message and exit

Required arguments:

--full_clust FULL_CLUST
        Path to full clustering TSV file
-f, --fasta FASTA
        Path to FASTA file
-n, --no_cluster NO_CLUSTER
        Number of cluster of choice in TSV file

Optional:

-o, --output OUTPUT
        Desired output directory (default: current working directory)
--force
        Set flag to overwrite existing files in specified path