Fusobacterium Genomics Using MinION and Illumina Sequencing Enables Genome Completion and Correction

SM Todd, RE Settlage, KK Lahmers, DJ Slade - Msphere, 2018 - Am Soc Microbiol
Msphere, 2018Am Soc Microbiol
Understanding the virulence mechanisms of human pathogens from the genus
Fusobacterium has been hindered by a lack of properly assembled and annotated
genomes. Here we report the first complete genomes for seven Fusobacterium strains, as
well as resequencing of the reference strain Fusobacterium nucleatum subsp. nucleatum
ATCC 25586 (total of seven species; total of eight genomes). A highly efficient and cost-
effective sequencing pipeline was achieved using sample multiplexing for short-read …
Abstract
Understanding the virulence mechanisms of human pathogens from the genus Fusobacterium has been hindered by a lack of properly assembled and annotated genomes. Here we report the first complete genomes for seven Fusobacterium strains, as well as resequencing of the reference strain Fusobacterium nucleatum subsp. nucleatum ATCC 25586 (total of seven species; total of eight genomes). A highly efficient and cost-effective sequencing pipeline was achieved using sample multiplexing for short-read Illumina (150 bp) and long-read Oxford Nanopore MinION (>80 kbp) platforms, coupled with genome assembly using the open-source software Unicycler. Compared to currently available draft assemblies (previously 24 to 67 contigs), these genomes are highly accurate and consist of only one complete chromosome. We present the complete genome sequence of F. nucleatum subsp. nucleatum ATCC 23726, a genetically tractable and biomedically important strain and, in addition, reveal that the previous F. nucleatum subsp. nucleatum ATCC 25586 genome assembly contains a 452-kb genomic inversion that has been corrected using our sequencing and assembly pipeline. To enable genomic analyses by the scientific community, we concurrently used these genomes to launch FusoPortal, a repository of interactive and downloadable genomic data, genome maps, gene annotations, and protein functional analyses and classifications. In summary, this report provides detailed methods for accurately sequencing, assembling, and annotating Fusobacterium genomes, while focusing on using open-source software to foster the availability of reproducible and open data. This resource will enhance efforts to properly identify virulence proteins that may contribute to a repertoire of diseases that includes periodontitis, preterm birth, and colorectal cancer.
IMPORTANCE Fusobacterium spp. are Gram-negative, oral bacteria that are increasingly associated with human pathologies as diverse as periodontitis, preterm birth, and colorectal cancer. While a recent surge in F. nucleatum research has increased our understanding of this human pathogen, a lack of complete genomes has hindered the identification and characterization of associated host-pathogen virulence factors. Here we report the first eight complete Fusobacterium genomes sequenced using an Oxford Nanopore MinION and Illumina sequencing pipeline and assembled using the open-source program Unicycler. These genomes are highly accurate, and seven of the genomes represent the first complete sequences for each strain. In summary, the FusoPortal resource provides a publicly available resource that will guide future genetic, bioinformatic, and biochemical experiments to characterize this genus of emerging human pathogens.
American Society for Microbiology