We are very happy to announce the first release of the the amino acid sequence toolkit (AASTK).
AASTK is a suite of tools designed to leverage the genomic diversity captured by the GlobDB to create and analyze datasets of homologous proteins. Current functionality of AASTK includes tree-of-life scale dataset building, curation, and maintenance, as well as clustering of protein datasets, genomic context analysis, and metadata retrieval.