Beginner's Guide to Next Generation Sequencing

The completion of the Human Genome Project in 2003 ushered in a new era of rapid, affordable, and accurate genome analysis—called Next Generation Sequencing (NGS). NGS builds upon “first generation sequencing” technologies to yield accurate and cost-effective sequencing results.

Fred Sanger sequenced the first whole DNA genome, the virus phage X174, in 1977. In that same year, Sanger developed the future backbone of the genome era: DNA sequencing technology. His technique used the chain termination method. This is now seen as the “first-generation technology” of genome sequencing.

Since Sanger sequencing (or the chain termination method) is the first generation of sequencing technology, understanding it is greatly important. The Human Genome Project began with Sanger sequencing technology.

The Sanger Sequencing Method

The Sanger sequencing method relies on dideoxynucleotides (ddNTPs),a type of deoxynucleoside triphosphates (dNTPs), that lack a 3′ hydroxyl group and have a hydrogen atom instead . When these bases bind to the growing DNA sequence, they terminate replication as they cannot bind other bases.

Choose a free resource to help you move forward

EBOOK

Gene Editing 101 is your guide to understanding, designing, and performing CRISPR experiments, exploring how this revolutionary technology is driving advances across health, diagnostics, agriculture, and energy, and covering how to design gRNA, choose a Cas9 format, screen with CRISPR, use advanced CRISPR approaches, and more.

GET YOUR COPY

DOWNLOAD

Bitesize Bio’s blood collection tube chart explains each tube type, cap color, and essential properties in a clear format, further divided into serum and plasma tubes so you can pick with confidence. Grab your free chart, pin it up, and streamline your blood collection process today.

GET YOUR COPY

To perform Sanger Sequencing, you add your primers to a solution containing the genetic information to be sequenced, then divide up the solution into four PCR reactions. Each reaction contains a with dNTP mix with one of the four nucleotides substituted with a ddNTP (A, T, G, and C ddNTP groups).

At the end of the PCR, each of your four reactions will yield PCR products of various lengths because replication is randomly terminated. By running the samples on a gel with 4 lanes, you can piece together the sequence as each sequence has been replicated from the same original material. Here is an example where the ddNTPs are in bold and the dNTPs are not:

Your sequence is ATGCTCAG. Your four reactions give you:

Reaction with ddATP: A, ATGCTCA, ATGCTCAG
Reaction with ddGTP: ATG, ATGCTCAG
Reaction with ddCTP: ATGC, ATGCTC, ATGCTCAG
Reaction with ddTTP: AT, ATGCT, ATGCTCAG

All the reactions once run a gel would look something like this (Image by Olwen Reina):

Each band denotes the different lengths code. For example, the band is the right under the “A” symbolizes the sequence: “ATGCTCA”

Still confused?

Let’s imagine a party game. The game is a guessing game. Here is how it is played:

You are thinking of a number and the group has to guess it. The tricky part is that the number is 200-digits in length. You are reading the digits of the number in your head without making a sound. Every so often a person interrupts you, and you tell them the single digit you were just thinking and where it is in the sequence of 200.

Each time you are interrupted, you have to start again. You leave after a few hours and the group has to figure out the 200-digit number. They have to piece together the information you gave them, for example the 25^th number was 5, the 40^th number was 0, and so on. Using the information from their interruptions, they can repeat the number they gave you.

While this sounds like the lamest game in the world, it works very well for sequencing!

Unfortunately, it is slow, expensive, and (previously) relies on radioactive materials. This pushed scientists to develop new and better forms of genome sequencing.

Next Generation Sequencing Technologies

The biggest advances in genome sequencing have been increasing speed and accuracy, resulting in a reduction in manpower and cost. These advances unlock striking applications, like NGS, which unearthed the fungus behind the Irish potato famine. Newer approaches push further still — see this CRISPR-inspired method that targets large, repetitive DNA elements.

The speed is thanks to parallel analysis and high-throughput technology.

The Sanger Sequencing Method

Sequencing machines have improved wildly since Sanger developed his method.

In 1987, Applied Biosystems became the first to introduce an automatic sequencing machine, called the AB370. The machine relied on a method called capillary electrophoresis that used Sanger’s chain-terminating method without needing a gel.

First, fluorescently labeled chain-terminating ddNTPs are added to the PCR reaction mix. By the Sanger sequencing method, PCR products of various lengths are created, and then separated according to their size. Size is measured by the PCR product’s overall negative charge. The more negative the charge, the longer the fragment.

Take the above example of ddNTP incorporation and imagine that the bold letters are ddNTPs with a fluorochrome attached. As the fragments are pulled toward the positive electrode of a capillary (see image below), they pass a laser beam that triggers a flash of light from the fluorochrome attached to the ddNTP that is characteristic of the base type (for example, green for A, yellow for T, blue for G, red for C). In this way, the genome is carefully read.

The biggest difference here was speed and cost. AB370 could detect 96 bases at one time, 500K bases per day, and had a read length of 600 bases by using a parallel analysis and high-throughput setup. This form of sequencing became the primary tool for completing the Human Genome Project.

How Does NGS Compare to First-Generation Sequencing?

NGS is characterized by improved accuracy and speed, but also reduced manpower and cost.

There has never been a time when it has been as cheap, convenient, or straightforward to sequence a genome. Arguably, the biggest improvement has been the development of parallel analysis, which has increased the sequencing speed.

The only thing slowing us down now is the interpretation of results! That interpretation increasingly touches new biology, like alternative polyadenylation.

Sometimes data interpretation gets complicated — here’s what to do when whole genome sequencing yields unexpected results. One free tool that helps is Galaxy, an NGS workflow management system. It also pays to know some Linux for high-throughput sequencing analysis and to be aware of batch effects, which can distort results across runs.

If you’re running many samples in parallel, it’s worth understanding probability theory and molecular barcodes. It’s also essential to quantify your NGS libraries accurately before pooling them and ensuring clean input before sequencing. For single-cell workflows specifically, see our tips to optimize nuclei extraction and counting.

You made it to the end—nice work! If you’re the kind of scientist who likes figuring things out without wasting half a day on trial and error, you’ll love our newsletter. Get 3 quick reads a week, packed with hard-won lab wisdom. Join FREE here.

Olwen Reina

I am a Clinical Research Coordinator at the U.S. Department of Veterans Affairs with a background in basic research, writing, mentoring and teaching. I studied Natural Science at Trinity College Dublin, Ireland, specializing in biochemistry with immunology and I am currently undergoing ACRP (Association of Clinical Research Professionals) certification. In my spare time, I enjoy studying HTML/CSS and SEO, doing acroyoga, making kombucha, salsa dancing, voluntary community projects and eating sushi. Feel free to send me a note with any writing opportunities or to say hello.

About Us

Marketing

Bitesize Bio Search

A Beginner’s Guide to Next Generation Sequencing (NGS) Technology

The Sanger Sequencing Method

Still confused?

Next Generation Sequencing Technologies

The Sanger Sequencing Method

How Does NGS Compare to First-Generation Sequencing?

A Crash Course in Epigenetics Part 1: An intro to epigenetics

How DNA Microarrays are Built

A Crash Course in Epigenetics Part 4: Disease mechanisms and therapeutic targets

How to Confirm Your CRISPR-cas9 Genome Editing Was Successful

How to Get Started with Algae

DNA Shuffling Like a Pro

Beyond Bulk RNA-seq: Revealing Cell-specific Gene Expression

**Get help with everything* lab-related.**

10 Things Every Molecular Biologist Should Know

Get practical lab wisdom like this in your inbox

About Us

Marketing

Bitesize Bio Search

A Beginner’s Guide to Next Generation Sequencing (NGS) Technology

The Sanger Sequencing Method

Still confused?

Next Generation Sequencing Technologies

The Sanger Sequencing Method

How Does NGS Compare to First-Generation Sequencing?

More 'Genomics and Epigenetics' articles

Beyond Bulk RNA-seq: Revealing Cell-specific Gene Expression

Get help with everything* lab-related.

10 Things Every Molecular Biologist Should Know

Get practical lab wisdom like this in your inbox

**Get help with everything* lab-related.**