Demystifying NGS: Depth Coverage and Deep Sequencing

Next-generation sequencing (NGS) is not a three-headed monster. However, it can be a difficult concept to grasp, especially when you are getting started. There is a lot of new terminology and a whole new world to discover, both at the lab bench and in interpreting your results.

It helps to start somewhere. So, let’s start!

Depth of Coverage

Depth of coverage is the number of reads that cover a given nucleotide position in an experiment. Most NGS protocols start by randomly fragmenting the genome into short pieces. These fragments are then sequenced and aligned. The alignment tiles the short sequences into a longer contiguous sequence. For tiling to succeed, you need different reads with significant overlaps, so that you can align them with confidence. Note the keyword: random. Because fragmentation is random, you need a large number of fragments to be sure of finding sequences that overlap on their flanking regions and can be tiled together. It’s almost like putting together a sequence puzzle.

Therefore, the greater the depth of coverage, the more significant overlaps we have to align our sequence correctly. This gives us robust results with better mapping quality.
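To make the definition concrete, here is a minimal Python sketch that counts how many reads cover each position of a small region. The read coordinates are made up for illustration, and the function names (`per_base_depth`, `mean_depth`) are hypothetical helpers rather than part of any sequencing toolkit. Real pipelines compute depth from alignment files with dedicated tools.

```python
def per_base_depth(reads, region_length):
    """Count how many reads cover each position of a region.

    reads: list of (start, end) alignments, 0-based, end-exclusive.
    These coordinates are illustrative, not real data.
    """
    # Difference array: +1 where a read starts, -1 just past where it ends.
    delta = [0] * (region_length + 1)
    for start, end in reads:
        start = max(start, 0)
        end = min(end, region_length)
        if start < end:
            delta[start] += 1
            delta[end] -= 1

    # A running sum of the changes gives the depth at each position.
    depth = []
    running = 0
    for change in delta[:region_length]:
        running += change
        depth.append(running)
    return depth


def mean_depth(depth):
    """Average depth across the region (0.0 for an empty region)."""
    return sum(depth) / len(depth) if depth else 0.0


# Four overlapping reads tiled across a 10-base region.
reads = [(0, 5), (2, 8), (4, 10), (7, 10)]
depth = per_base_depth(reads, 10)
print(depth)              # [1, 1, 2, 2, 3, 2, 2, 3, 2, 2]
print(mean_depth(depth))  # 2.0
```

Note how the ends of the region are covered by a single read while the middle is covered by two or three. Those overlapping stretches are what make confident tiling possible.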

High average read depth is also important for accuracy and confidence. Small sequencing errors occur, but are easily discarded with good coverage: correct reads outnumber these individual errors, and make them statistically irrelevant.
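As a rough illustration of how correct reads outnumber an isolated error, the sketch below takes the bases that different reads report at one position and picks the most common one. This simple majority vote is a simplification chosen for the example: real variant callers also weigh base and mapping quality. The function name `consensus_base` is hypothetical.

```python
from collections import Counter


def consensus_base(pileup):
    """Return the most common base at a position and its share of the reads.

    pileup: string of bases, one per read covering the position.
    """
    counts = Counter(pileup)
    base, n = counts.most_common(1)[0]
    return base, n / len(pileup)


# 20 reads at one position: 19 agree, 1 carries a sequencing error.
print(consensus_base("A" * 19 + "G"))  # ('A', 0.95)

# With only 2 reads, one error leaves the call ambiguous (a 50/50 split).
print(consensus_base("AG")[1])         # 0.5
```

At 20x the single erroneous G is clearly outvoted. At 2x the same error cannot be told apart from a real difference.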

Which brings us to our next topic…

Deep Sequencing

Deep sequencing takes the concept of depth of coverage one step further. In some experiments, you need very high read depth to be confident of the sequence. This is especially important for heterogeneous samples, such as tumor samples or mosaics. By increasing the coverage, we are much more likely to call a variant, even if it is present in only a small percentage of cells in our sample. We can also distinguish true variants from sequencing errors, as we have more reads on which to base the distinction.

Let’s imagine we are analyzing a tumor sample. Normal cell contamination is common in cancer samples, so we assume that we have a population of cells with no mutations (normal cells) and a population of cells with mutations (tumor cells). We do not know for sure the ratio of the two populations in our sample, so maximum accuracy is very important. With deep sequencing, we can call a variant in a population of cells comprising as little as 1% of the original sample.
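To see why depth matters for rare variants, here is a back-of-the-envelope sketch using a simple binomial model. It assumes each read independently carries the variant with a probability equal to the variant fraction, and it ignores sequencing error. The threshold of 5 supporting reads is an arbitrary value chosen for illustration, not a recommendation, and `prob_at_least` is a hypothetical helper.

```python
from math import comb


def prob_at_least(k, depth, fraction):
    """P(X >= k) for X ~ Binomial(depth, fraction).

    k: minimum number of variant-supporting reads we want to see
    depth: number of reads covering the position
    fraction: fraction of reads expected to carry the variant
    """
    below = sum(
        comb(depth, i) * fraction**i * (1 - fraction) ** (depth - i)
        for i in range(k)
    )
    return 1.0 - below


# Chance of seeing at least 5 variant reads when 1% of reads carry the variant.
for depth in (30, 100, 1000, 5000):
    p = prob_at_least(5, depth, 0.01)
    print(f"{depth:>5}x: {p:.4f}")
```

Under these assumptions, a variant carried by 1% of reads is almost never supported by 5 reads at 30x, but is very likely to be at 1000x and above. Keep in mind that the fraction of reads carrying a variant can be lower than the fraction of cells carrying it, for example when the variant is heterozygous.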

With the high depth of coverage associated with deep sequencing, bioinformatic tools can also detect insertions and deletions (even larger ones that are not detected by Sanger sequencing, for example) by observing the reads and the differences in coverage. If a region has far fewer reads than expected, it may contain a deletion. On the other hand, far more reads may signify a duplication or an insertion.
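The coverage-based idea can be sketched as follows: compare the depth of each window along the genome to a baseline and flag windows that fall well below or above it. The median baseline, the 0.6 and 1.4 cutoffs, and the depth values are all assumptions made for this example. Real tools also correct for effects such as GC content and mappability, and `flag_copy_number` is a hypothetical helper.

```python
from statistics import median


def flag_copy_number(window_depths, loss_ratio=0.6, gain_ratio=1.4):
    """Label each window by its depth relative to the median depth.

    loss_ratio and gain_ratio are illustrative cutoffs, not standard values.
    """
    baseline = median(window_depths)
    calls = []
    for d in window_depths:
        ratio = d / baseline if baseline else 0.0
        if ratio <= loss_ratio:
            calls.append("possible deletion")
        elif ratio >= gain_ratio:
            calls.append("possible duplication")
        else:
            calls.append("normal")
    return calls


# Mean depth in seven consecutive windows (made-up numbers).
depths = [100, 98, 103, 48, 101, 152, 99]
for d, call in zip(depths, flag_copy_number(depths)):
    print(d, call)
```

Here the window at roughly half the baseline depth is flagged as a possible deletion, and the window at roughly one and a half times the baseline as a possible duplication.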

Deep sequencing is a powerful tool, both in research and diagnostics, and it is essential to understand how important it can become—especially when analyzing very heterogeneous samples.
