Generating High-Quality Genome Assemblies from Metagenomic Sequencing

last updated: October 29, 2024

The decreasing costs in genomic sequencing over the past decade have inspired researchers to apply shotgun next-generation sequencing to entire microbial communities. While the reads generated typically cannot be assembled cleanly into individual genomes, there is often enough information produced to identify most microbes present in the population. However, this approach lacks sufficient resolution to link taxonomy with function.

Applications ranging from clinical microbiome analysis to environmental metagenomics require the production of high-quality genome assemblies for each microbial member of a community. With the goal of yielding such information from metagenomics experiments, Ami Bhatt and collaborators at Stanford University developed a method that uses a new assembler alongside a 10x Genomics long-range workflow that includes automated DNA size selection. In this bioRxiv preprint, the authors report that their novel technique enables successful sequencing of human and marine microbiome samples.

The assembler, named Athena, can produce high-quality, de novo individual draft genomes from microbial communities. It works by analyzing the “read clouds” produced by 10x Genomics technology, which links short sequencing reads to provide long-range genomic data.

Bhatt and her colleagues first tested the ability of the approach to assemble a mock microbial community, and then evaluated samples collected from the human gut and the sea floor. In the latter sample types, the Athena-powered method was able to generate contiguous assemblies of individual microbes present as compared to short read and SLR assembly. Indeed, Bhatt et al. report that their “approach combines the advantages of both short read and [synthetic long read] approaches, and is capable of producing many highly contiguous drafts (>200kb N50, <10 contigs) with as little as 20x raw short-read coverage.” This is particularly significant in the case of the marine sediment community, as this type of sample is significantly more microbially complex than a human stool sample.

The workflow includes DNA size-selection via the BluePippin instrument as a key part of the sample prep process for the 10x Genomics Chromium platform used for library sequencing. This process ensures optimal results from the long-range technology, allowing it to focus on the large DNA fragments that will generate the most useful information from the linked short reads.

Bhatt et al. note that their pipeline is cost-effective and will allow scientists to perform experiments “at a price point that gives it relevance to the broader microbiome community.” Importantly, they conclude that their novel “approach will be a significant step forward in enabling comparative genomics for bacteria, enabling fine-grained inspection of microbial evolution within complex communities.”

Sage Science

Genomics and Epigenetics | Thermo Fisher

Reducing GC Bias in WGS: Moving Beyond PCR

ByThermo Fisher

WGS technologies have seen significant progress since the completion of the Human Genome Project in 2003. First-generation Sanger Sequencers were limited by lengthy run times, high expenses, and throughputs that read only tens of kilobases per run. The arrival of second-generation sequencers in the mid-2000s brought about the plummeting of sequencing costs and run times,…

Genomics and Epigenetics

A Crash Course in Epigenetics Part 3: Regulated regulation

ByJudith R. Brouwer

Epigenetics is the most rapidly expanding field in biology. In the second article in this series, I discussed which experimental techniques have been crucial in gaining insight into epigenetic processes. I will now shed light on what those and other methods have taught us. As described in the first article, it has been long understood…

Genomics and Epigenetics | Sage Science

Why DNA Size Selection Matters in NGS Pipelines

BySage Science

Of all the sample prep steps necessary for next generation sequencing, DNA size selection may have the greatest impact on quality of results. After all, ineffective sizing can waste sequencing capacity on low molecular weight material such as adapter-dimers or primer-dimers, while imprecise sizing can prevent bioinformaticians from producing accurate assemblies. High-quality size selection can…

Genomics and Epigenetics

How Bisulfite Pyrosequencing Works

ByKirsten Hogg

Bisulfite pyrosequencing is becoming a routine technique in molecular biology labs as a method to precisely measure DNA methylation levels right down to the single base. The technique allows for detailed and high resolution analysis of DNA methylation at specific genomic regions. How to detect the 5th base? Methylation of any of the four nucleotides…

Genomics and Epigenetics

How To Identify Conserved Elements In Genes

ByLaura-Nadine Schuhmacher

Conserved elements are stretches of DNA sequence that are under purifying selection. That means mutations leading to a change of function in this part of the DNA are detrimental to the organism and will not become fixed in the genome, but rather discarded by natural selection. The level of conservation between species gives an idea…

Genomics and Epigenetics

Some Sanger Sequencing Tips and Tricks

ByJames Hadfield

Sanger sequencing is still a workhorse of most molecular biology labs. Even with the advent of next-generation sequencing we still need to sequence our clones and PCR products. In this article I have listed some of the tips and tricks we used in our Sanger services. (1)Dilution of BigDye: I’d expect this to be a…

About Us

Marketing

Generating High-Quality Genome Assemblies from Metagenomic Sequencing

Reducing GC Bias in WGS: Moving Beyond PCR

A Crash Course in Epigenetics Part 3: Regulated regulation

Why DNA Size Selection Matters in NGS Pipelines

How Bisulfite Pyrosequencing Works

How To Identify Conserved Elements In Genes

Some Sanger Sequencing Tips and Tricks

10 Things Every Molecular Biologist Should Know

About Us

Marketing

Generating High-Quality Genome Assemblies from Metagenomic Sequencing

More 'Genomics and Epigenetics' articles

10 Things Every Molecular Biologist Should Know