For Long-Read Sequencers, Size Selection Is Key

It took scientists a little while to warm up to long-read sequencing, but now you couldn’t pry most of them away from their sequencers with a crowbar. Long reads — we’re talking 10,000 bases and more — provide a level of contiguity and completeness in genome assemblies that simply isn’t possible with short-read sequencers. They can reveal full structural variants and accurately represent long, repetitive regions that flummox their short-read counterparts.

For example, scientists sequencing microbial genomes have discovered that they can often generate fully closed assemblies with long reads, representing the whole genome in a single contig. With more complex organisms, it’s not uncommon to hear about assemblies that have one contig to represent each chromosome. With short reads, assemblies are far more fragmented, split into hundreds or even thousands of small pieces that are difficult to place in the correct order and orientation.

There are two vendors in long-read sequencing today: PacBio and Oxford Nanopore Technologies (ONT). Others are waiting in the wings. Scientists using either of these platforms don’t want just long reads; they want the longest reads. And that’s where automated DNA size selection comes in.

Long-read sequencers are limited most by the length of the fragments fed into them. You can have a machine capable of producing 100,000-base reads, but if you load only 500-base DNA fragments, you can’t get the benefit of long-read data. In some cases, these sequencers preferentially sequence smaller fragments, so even if you had a mix of long and short fragments in your library, you’d wind up with much shorter average read lengths than the instrument is capable of producing.
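To make the effect of preferential loading concrete, here is a minimal sketch in Python. It assumes a toy model that is not taken from either vendor: the chance that a fragment gets sequenced is proportional to its length raised to the power `-bias`. The function name, the `bias` parameter, and the fragment lengths are all hypothetical and chosen only for illustration.

```python
def expected_read_length(fragment_lengths, bias=1.0):
    """Expected read length when loading probability scales as length ** -bias.

    bias = 0 means every fragment is equally likely to be sequenced.
    bias = 1 (an assumed value, not a measured one) favors short
    fragments in inverse proportion to their length.
    """
    weights = [length ** -bias for length in fragment_lengths]
    weighted_total = sum(w * length for w, length in zip(weights, fragment_lengths))
    return weighted_total / sum(weights)


# Hypothetical library: equal numbers of 500-base and 20,000-base fragments.
library = [500] * 50 + [20_000] * 50

unbiased = expected_read_length(library, bias=0.0)
biased = expected_read_length(library, bias=1.0)
print(f"No loading bias:   {unbiased:,.0f} bases")
print(f"With loading bias: {biased:,.0f} bases")
```

In this toy library the expected read length falls from about 10,250 bases with no bias to under 1,000 bases with the assumed bias, even though half of the fragments are 20,000 bases long. Real loading behavior depends on the instrument and chemistry, so treat the numbers as an illustration of the trend only.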

Users of sequencers from both PacBio and ONT have shown that size selection can be used to remove the smaller fragments from a library prior to sequencing. This step may seem trivial, but studies show that it can double the average read length generated simply by focusing the sequencer on the longest DNA fragments available.
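The arithmetic behind this gain can be sketched in a few lines of Python. Actual size selection is a physical, gel-based step, so the code below only mimics its effect on a list of fragment lengths. The `size_select` helper, the 10,000-base cutoff, and the library values are hypothetical and are not data from the studies mentioned.

```python
from statistics import mean


def size_select(fragment_lengths, min_length):
    """Keep only fragments at or above the cutoff, as a size-selection step would."""
    return [length for length in fragment_lengths if length >= min_length]


# Hypothetical library with a mix of short and long fragments (lengths in bases).
library = [1_000, 2_000, 3_000, 5_000, 7_000, 10_000, 15_000, 20_000, 30_000]

# Assumed cutoff of 10,000 bases; real cutoffs depend on the protocol.
selected = size_select(library, min_length=10_000)

print(f"Mean length before selection: {mean(library):,.0f} bases")
print(f"Mean length after selection:  {mean(selected):,.0f} bases")
```

For this made-up library, dropping everything shorter than 10,000 bases raises the mean fragment length from roughly 10,300 to 18,750 bases. The size of the gain in a real experiment depends on the starting fragment distribution and the cutoff chosen.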

Here’s a great example from blogger Lex Nederbragt with nice data and charts. In a more recent study of the human genome, scientists from the Icahn School of Medicine at Mount Sinai and several other institutions reported the first diploid human genome sequence and noted that size selection was essential for maximizing read length. “Without selection, smaller 2000 – 7000 bp molecules dominate the zero-mode waveguide loading distribution, decreasing the sub-readlength,” the researchers noted in the supplementary materials.

At a recent ONT user group meeting, scientist and blogger Keith Robison reported that the company had begun using the BluePippin™ automated size selection platform to increase average read lengths; some users demonstrated the ability to enrich for reads at least 20 Kb long. At a PacBio user group event last fall, CSO Jonas Korlach introduced a protocol for generating libraries of at least 30 Kb by using the BluePippin with Diagenode shearing.

To learn more, check out the long-read sequencing resources listed below.

Resources

Sage Protocols for PacBio

Scientist Profile: Long Reads at Mount Sinai

App Note: 7 Kb+ Libraries

Sage Science

