New Channels on Bitesize Bio

To help you find information on exactly what you need we're implementing channels, a new way to browse content

Each channel is focused on a specific technique or area and authored/presented by hand-picked authors who are experts in their field. Make sure you don't miss a thing by checking the box below for each channel that interests you.

In return we'll send you one email per month that brings you the latest from your chosen channel(s), along with free members-only content.

Check out our upcoming new channels; Flow Cytometry and Cell Culture, we'll be launching them very soon!

I would like to receive the newsletters for the following channels

Cell Culture
Flow Cytomery
Microscopy & Imaging
Next Generation Sequencing
Writing, Publishing and Presenting
Cloning & Expression


My email address is:

lab-databases

Make Your Excel Worksheets Database Ready

by in Organization & Productivity
From the Bitesize Bio channel

In a previous article (Tips for Constructing Lab Databases in Excel by Emily Crow), BitesizeBio readers began a brief, but spirited commentary on the application of using true databases (MySQL, Access, etc.) versus Excel “databases”.

While Excel can be quite useful to organize information (for example, an inventory of reagents, plasmids, laboratory items – even your own experimental projects), true databases are superior when it comes to storing, retrieving and querying data, integrity and comparisons between two or more sets.

But, how to transfer your information from your Excel worksheets into a database? Some may not care for the responsibility that database administration entails. Others may find that the task begins with a steep learning curve. You don’t have to be an IT wizard; work with your informatics group to make your Excel worksheets database-ready.

Know your Bioinformatician

A bioinformatician may already be the person administrating everything behind the scenes. He or she collects various project files and combines their contents into a single file. A good working relationship with your informatics group is beneficial at all stages, from the start of new projects, to projects in motion and all the way through to final analysis. If you have not already done so…introduce yourself and say hello to them!

Know the table structure

Your bioinformatician designed the database from the ground up (and maybe even asked for input along the way). They know the structure inside and out and can provide you with a list of category names for each data set. Using this, you will get a very clear idea of what should and should not be included into the database.

Organize your worksheet

Now it’s as simple as copying/pasting columns of your Excel data over to a fresh new worksheet. Use category information as a guide to arrange everything into an exact order. To make things cleaner, exclude extraneous entries, personal comments unrelated to projects and any special formatting.

Save and submit your database-ready worksheet

Save your reorganized data in Excel file format. Or select tab- or comma- delimited. Do note that for character delimited formats, each worksheet must be saved individually. This method is simple enough as it can enable your bioinformatician to import the data directly, with very few additional modifications.

Why Databases Are Useful

* Centralized data – the database exists in a single location where it is backed up regularly. No more searching computer after computer for worksheets that may or may not exist!

* Primary keys (unique identifiers) – this ensures each entry in your database is unique, even if you have a multi-part project where every part begins sample labeling at number 1.

* Relational Databases – compare multiple data sets to standard libraries to see which entries are identical.

* Data integrity – query the data, then query again and again. The original data is still there. In Excel, it is all too easy to sort only a portion, format entries based on the wrong type, or even accidentally edit the cells. Undo/redo your actions? Unless you saved an untouched, duplicate copy it can be difficult to start at the beginning again.

Whether you use Excel to manage information, have your own customized database on the side or even administrate all of your laboratory’s data, we would love to hear about your experiences. What works best for your lab? And what were some of the challenges along the way? Please share your story with us, in the comments below.

Articles in your inbox

Enter your email to be informed when we publish more articles like this on BsB, and also get access to all of these goodies:

  • Free ebooks and audiobooks on the topics that matter to you
  • Access to Member’s-only articles and Videos
  • Advance notice of new webinars and eBooks
  • Access to make comments and ask questions on BsB



What to read next

Scientists: Can any of us REALLY multi-task?

Unlike Nick Oswald I think I can multitask in the lab. If I organize my day efficiently and perform lots of experiments and other tasks in parallel, I get more done. But there is a school of thought – the one described in Nick’s article – that says no-one can really multitask, that our brains [...]

Make Every Day As Effective As Deadline Day

I’m sure you’ve heard of Parkinson’s Law, or at least the modern-day generalisation of it. It states that “Work expands so as to fill the time available for its completion.”  When I first heard this, back in the mists of time, I thought Mr Parkinson was damn right, had a chuckle at a very accurate [...]

How To Get Organized With Reference Managers for Science – ReadCube

In my last post on reference managers I discussed Mendeley, a well-established reference manager and the one I’m most familiar with. Today I am going to tell you about ReadCube, a more recent addition to the referencing software market brought to you from Labtiva. ReadCube was created by two Harvard students, Siniša Hrvatin and Robert [...]

About the author

Jason Erk

Jason holds a BS in Biochemistry and is a faculty research assistant at an Oregon university. While current research efforts span the fields of behavior, neuroscience, and molecular biology, he manages copious amounts of data and freezers full of...

What do you think?

One comment

  1. from on

    nothing will break the experimentalists central dogma of Data -> Excel -> Powerpoint. It’s grafted into students from day 1 of University.

    The informatics groups at my last institution were focused on writing tools for Taverna that could suck out the data from Excel and re-format it itself. They wanted the process to be as invisible as possible as re-education repeatedly failed.

Subscribe to Channels

To receive information about any of our new channels click on the button below.
subscribe to the channel newsletter »

Write for us

Have a short tip, a written
article or a video you'd like
to see published?
write for us »