Release r232 of the GlobDB is in preparation and will be based on the upcoming GTDB release. The existing data sources in the GlobDB will be dereplicated against the new GTDB release. In addition, we will include genomes from the data sources listed below if they represent species that are not present in the INSDC databases or in previous GlobDB releases.
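As a rough illustration of the inclusion rule described above, the following minimal sketch keeps one genome per species that is not already covered. It is not the GlobDB pipeline: the `Genome` class, the `select_novel` function, the accessions, and the species labels are all hypothetical, and the species assignment is assumed to have been computed beforehand (for example by ANI-based clustering).

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Genome:
    accession: str  # hypothetical genome identifier
    species: str    # hypothetical species-cluster label, assumed precomputed


def select_novel(candidates, known_species):
    """Keep one genome per species that is absent from known_species."""
    selected = {}
    for genome in candidates:
        if genome.species in known_species:
            continue  # species already represented, skip this genome
        # The first genome seen for a new species becomes its representative.
        selected.setdefault(genome.species, genome)
    return list(selected.values())


# Species assumed to be present already (e.g. in INSDC or an earlier release).
known = {"sp_A", "sp_B"}
candidates = [
    Genome("MAG_001", "sp_A"),
    Genome("MAG_002", "sp_C"),
    Genome("MAG_003", "sp_C"),
    Genome("MAG_004", "sp_D"),
]
novel = select_novel(candidates, known)
print([g.accession for g in novel])  # ['MAG_002', 'MAG_004']
```

Picking the first genome encountered is an arbitrary choice made for brevity; a real workflow would more likely choose the representative by genome quality.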
If you have suggestions for additional data sources to include, please contact Daan Speth.
New data sources for GlobDB release 232 (in no particular order):
gcMeta 2025: a global repository of metagenome-assembled genomes enabling cross-ecosystem microbial discovery and function research
https://doi.org/10.1093/nar/gkaf1115
Functional traits and adaptation of lake microbiomes on the Tibetan Plateau
https://doi.org/10.1186/s40168-024-01979-7
A deep metagenomic atlas of Qinghai-Xizang Plateau lakes reveals their microbial diversity and salinity adaptation mechanisms
https://doi.org/10.1016/j.celrep.2025.116483
proGenomes4: providing 2 million accurately and consistently annotated high-quality prokaryotic genomes
https://doi.org/10.1093/nar/gkaf1208
A genome and gene catalog of glacier microbiomes
https://doi.org/10.1038/s41587-022-01367-2
A holistic genome dataset of bacteria, archaea and viruses of the Pearl River estuary
https://doi.org/10.1038/s41597-022-01153-4
236 metagenome-assembled microbial genomes from rivers along a latitudinal gradient
https://doi.org/10.1038/s41597-025-05888-8
Metagenome-assembled genomes from microbial communities in lab-scale anaerobic bioreactors treating simulated dairy wastewater
https://doi.org/10.1128/mra.00487-25
Crop root bacterial and viral genomes reveal unexplored species and microbiome patterns
https://doi.org/10.1016/j.cell.2025.02.013
Expanded catalogue of metagenome-assembled genomes reveals resistome characteristics and athletic performance-associated microbes in horse
https://doi.org/10.1186/s40168-022-01448-z
Metagenome sequencing and 768 microbial genomes from cold seep in South China Sea
https://doi.org/10.1038/s41597-022-01586-x
The prokaryote MGnify catalogs released since the previous GlobDB release
https://www.ebi.ac.uk/metagenomics
All released "direct submission" entries in the categories "Bacteria", "Archaea" and "Metagenomes" in the Genome Warehouse (GWH) of the CNCB-NGDC (China National Center for Bioinformation / National Genomics Data Center)
https://ngdc.cncb.ac.cn/gwh/
Comments
TPMC-S
The TPMC-S dataset will also be included:
Data-mining of sediment microbiomes of the Tibetan Plateau revealed a genomic repository of ancient lineages and adaptive evolution of Asgardarchaeota
https://spj.science.org/doi/abs/10.34133/research.1213