NCTC 3000 project
A comprehensive resource of bacterial type and reference genomes
NCTC 3000 is a collaborative Whole Genome Sequencing (WGS) project that was established in 2013 between the UK Health Security Agency (formerly Public Health England), the Wellcome Sanger Institute (WSI) and Pacific Biosciences (PacBio). The project successfully generated 3,305 PacBio long-read sequence datasets for 2,915 NCTC strains, all of which have been made publicly available under BioProject PRJEB6403.
The NCTC 3000 dataset has further enabled the generation of high-quality genome assemblies and annotations. To date, 2,228 genome assemblies and annotations have been published in the ENA/GenBank/DDBJ databases, around a third of which have achieved ‘Complete Genome’ status. The data are already widely used by the bacterial research community for a broad array of projects and tasks.
The annotated bacterial genomes generated by the NCTC 3000 project can be accessed via:
- the European Nucleotide Archive (ENA)
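For programmatic access, the run records under BioProject PRJEB6403 can be listed through the ENA Portal API's file report endpoint. The following is a minimal sketch rather than an official NCTC or ENA recipe: the function names are hypothetical, and the chosen fields (`run_accession`, `scientific_name`, `instrument_platform`, `submitted_ftp`) are an assumption about which columns are useful for this project.

```python
import csv
import io
from urllib.parse import urlencode
from urllib.request import urlopen

# ENA Portal API file report endpoint.
ENA_FILEREPORT = "https://www.ebi.ac.uk/ena/portal/api/filereport"

# Assumed selection of columns; adjust to the fields you need.
DEFAULT_FIELDS = (
    "run_accession",
    "scientific_name",
    "instrument_platform",
    "submitted_ftp",
)


def build_filereport_url(accession="PRJEB6403", fields=DEFAULT_FIELDS):
    """Build a file report URL for all read runs under a project accession."""
    query = urlencode(
        {
            "accession": accession,
            "result": "read_run",
            "fields": ",".join(fields),
            "format": "tsv",
        }
    )
    return f"{ENA_FILEREPORT}?{query}"


def parse_filereport(tsv_text):
    """Parse the tab-separated report into a list of dicts, one per run."""
    return list(csv.DictReader(io.StringIO(tsv_text), delimiter="\t"))


def fetch_filereport(accession="PRJEB6403", fields=DEFAULT_FIELDS):
    """Download and parse the report (requires network access)."""
    with urlopen(build_filereport_url(accession, fields)) as response:
        return parse_filereport(response.read().decode("utf-8"))
```

Calling `fetch_filereport()` should return one record per sequencing run in the project; each record's `submitted_ftp` value, where populated, points at the submitted data files.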
References
Dicks J, Fazal MA, Oliver K, et al. NCTC3000: a century of bacterial strain collecting leads to a rich genomic data resource. Microb Genom. 2023;9(5):mgen000976. doi:10.1099/mgen.0.000976