The Advanced User’s Guide to Sequencing Alignment Software

Bitesize Bio Search

Search below to delve into the Bitesize Bio archive. Here, you’ll find over two decades of the best articles, live events, podcasts, and resources, created by real experts and passionate mentors, to help you improve as a bioscientist. Whether you’re looking to learn something new or dig deep into a topic, you’ll find trustworthy, human-crafted content that’s ready to inspire and guide you.

Whether you’re employing sequencing gels, Sanger-based methods, or the latest in pyrosequencing or ion torrent technologies, obtaining, manipulating and analyzing your sequences has never been easier. Depending on what your goals are, you need to understand the pros and cons of the software. There is a lot of software out there, so do you your due diligence. Many are offered for free, but some aren’t worth the time. Caveat emptor! Here, we’ll talk about the differences between different sequencing alignment packages, and how to choose the one that’s right for you.

Alignment algorithms

Alignment algorithms, what are they? In the vast majority of cases, 3 or more sequences are being aligned (as opposed to “pairwise”), so we’ll be examining algorithms in this context. Generally, algorithms are simply computational commands that allow the software to recognize areas of similarity which may be associated with specific features that have been more highly conserved than other regions. They take into a account gaps, similarities and differences between 3 or more sequences. The most commonly used algorithms are:

a) The Clustal family: ClustalW, ClustalW2, Clustal V, ClustalX and ClustalOmega

b) MUSCLE (MUltiple Sequence Comparison by Log-Expectation)

Image Larger Volumes with the UltraMicroscope Choros™

From: Miltenyi Biotech

Trust Your Quantification with the DeNovix DS-8X Rapid Eight Channel, 1µL UV-Vis Spectrophotometer

From: DeNovix

MEGA, and most others, offer both algorithms. ClustalW has been the workhorse for many applications and will most likely be the best fit for your work. ClustalOmega is the “latest and greatest”; however, if comparing large sets of sequences, MUSCLE will do the job nicely. Many have reported that MUSCLE tends to produce more reliable alignments, regardless of number, in less time so you may want to try them both with your data sets. Click the links to learn more about Clustal and MUSCLE.

Sequence data output formats

Don’t worry about differing formats- most software suites will convert to the required ASCII text format. Click here to learn about different sequence outputs.

User interface/web-based platforms

I’m a big fan of the free “MEGA” software (Molecular Evolution Genetic Analysis) and have used it for years. It offers more tools, support, a great GUI (graphic user interface) and numerous “how to” tutorials to get rookies and advanced users up and running quickly. Not all software suites offer integrated web-based sequence capture, e.g. from NCBI’s GenBank. For many, having this capability is a huge advantage which will save you time from constantly having to download and integrate web-based FASTA files from online sequence databases.

When using web-based servers like GenBank, remember you’re competing for resources with many others, so be prepared to wait. These are ok, but have serious limitations when it comes to manipulating your data in any way. If all you want to do is align 10-20 sequences, sure, give it a lash. But, if you’re going to try to align 100 Drosophila genomes, you will most likely crash the server…and make a lot of new friends. You’ve been warned.

Different software options

Here are a few of the programs that I’ve used over the years, and where to find them:

Designer	$$$	Website
MEGA	Free Download	https://www.megasoftware.net/
Lasergene	Pay Licensed	https://www.dnastar.com/t-products-lasergene.aspx
BioNumerics	Pay Licensed	https://www.applied-maths.com/
Bioedit	Free Download	https://www.mbio.ncsu.edu/bioedit/bioedit.html
NCBI/GenBank	Free web-based	https://blast.ncbi.nlm.nih.gov/
EMBL-EBI	Free web-based	https://www.ebi.ac.uk/Tools/msa/clustalw2/
TCoffee	Free web-based	https://www.ebi.ac.uk/Tools/msa/tcoffee/help/
PRALINE	Free web-based	https://www.ibi.vu.nl/programs/pralinewww/

What’s your favorite sequencing alignment software?

Jason Garner

Jason has an MS in Cell and Molecular Biology from University of Texas at San Antonio. He has a passion for virology, immunology, infectious disease and forensics, and has been involved in business development for the past 10 years working in senior roles with Bio-Rad, Qiagen and currently, Thermo Fisher Scientific.

About Us

Marketing

Bitesize Bio Search

The Advanced User’s Guide to Sequencing Alignment Software

Alignment algorithms

Image Larger Volumes with the UltraMicroscope Choros™

Trust Your Quantification with the DeNovix DS-8X Rapid Eight Channel, 1µL UV-Vis Spectrophotometer

Sequence data output formats

User interface/web-based platforms

Different software options

Controls and Tips for TA Cloning

Can’t Find pFavorite? Organize Your Plasmid Library

Using Synthetic DNA For Long Term Data Storage

The EMSA – Teaching an Old Dog New Tricks

Defeat RNAse Contamination Using Bleach in Your RNA Agarose Gel

3 More DNA ligation Tips

Are You Leaning On a Crutch? Spotting Habits and Routines That Hold You Back

10 Things Every Molecular Biologist Should Know

About Us

Marketing

Bitesize Bio Search

The Advanced User’s Guide to Sequencing Alignment Software

Alignment algorithms

Sequence data output formats

User interface/web-based platforms

Different software options

More 'DNA / RNA Manipulation and Analysis' articles

Are You Leaning On a Crutch? Spotting Habits and Routines That Hold You Back

10 Things Every Molecular Biologist Should Know