How to use Mauve to align genomes and identify orthologs?Bordetella species encode on their genomes multiple copies of insertion sequences (IS) elements. In species such as Bordetella pertussis, identical repeats of IS481 can be present in over 200 times and comprise over 6% of the total genome. These sequences provide sites for homologous recombination which are the origin of extensive genome remodeling. As a result, it is often difficult to identify orthologous genes. This is where Mauve comes in. This tool allows you to compare multiple genomes at the same time and see to where exactly the genes have moved. Mauve was developed at the University of Wisconsin-Madison with a team including Aaron Darling, Bob Mau, and Nicole Perna and several others.
Now it’s time to get to work. Here you will find a step-by-step set of instructions to help guide you through the genome alignment process.
|Mauve can be downloaded for free here.
Start Mauve by going to your computer’s program list and clicking the Mauve program icon.
|Once the home page for Mauve has opened, click ‘File’ and then ‘Align with progressiveMauve’.|
|You will now be prompted to add a sequence. Click the ‘Add Sequence’ button to browse for the sequence files.|
||Be sure to select at least 2 sequences in .gbk format. Once you’ve chosen all the files you wish to align.
Important: you can only align .gbk files!
|Now you must choose a space for the output file. An empty folder titled “Alignments” should be appropriate.|
|Time to choose a file name for these alignments once they’re finished. A simple name like
“[Bacterial species name here] Alignments” Should work fine.
|Click the ‘Align’ button, and you’re ready to go!|
|And now we wait. Be sure to wait til all the alignments have finished. Keep in mind, the more alignments running at a time, the longer this step will take.|
|Once the alignments are complete, hit the ‘close’ button.|
|The alignments should now be visible in you Mauve program window. Take a glance at and compare the sequences aligned. Chances are they are very different! Hovering your mouse over the length of the alignments should show you where that gene is located in each of your strains.|
|Now it’s time to export the orthologs. To do this, simply hit the ‘Tools’ tab, ‘Export’, and then ‘Export Orthologs’.|
|Be sure to select the proper Identity and Coverage ranges. We suggest using a Min Identity of 60 and a Max of 100 as well as a Min Coverage of 70 and a Max of 100. Once the values are set, click the ‘Browse’ button.|
|Within your newly created “Alignments” folder, save the file as “ [Bacterial species name here] Orthologs “.|
|Hit ‘Save’ and ‘Export’.|
|The Orthologs file should now be accessible. Open the file.|
|You may be asked to select a program in which to view these Orthologs. We suggest using Microsoft Excel.|
|Once Excel has opened, you may be notified that the version you are about to view is a “Read-Only” version. This is fine. Open as a “Read-Only” file anyway.|
|You should now have your list of Orthologs in Excel. There is a chance the Orthologs have been jumbled. Each strain should occupy only one column. If multiple strains have mixed in a column, you will need to sort them. We suggest using the ‘Sort and Filter’ function in the upper right hand corner of the ‘Home’ tab.
CAUTION: Be very careful not to move an Ortholog vertically within a column. Only horizontal movement should occur.
More detailed information on the different parameters for alignments in Mauve can be found here.Mauve