Skip to main content Skip to navigation

ThermoPhyl FAQ

FAQ

1. How is ThermoPhyl different than other primer design programs like PrimerBlast?

A: ThermoPhyl fills a slightly different niche space than other programs as summarized in the following table:

Software Database size Remote/Local FISH Assays
(single oligo)
Traditional PCR Assays
(two primers)
Quantitative PCR Assays
(two primers and probe)
Through put Summary output of
sensitivity and specificity
Primer
Blast
Large (Genbank) R N Y N Low N
Probe
Check
Linked to public databases R Y Y N High N
Primique Small to Large
(user-defined)
R N Y N High w/ speed limitations N
Thermo
Phyl
Small to Large
(user-defined)
L Y Y Y High Y
2. What’s the best way to design candidate assays?

A: For traditional PCR, the most efficient way is to compile all (or a representative set) of your target sequences into a single fasta file, and simply feed this into BatchPrimer3. For qPCR, we have had good results with ABI’s PrimerExpress program, although each sequence must be input singly and the output compiled into a single file. We typically generate 50 candidate assays per target sequence, which can be defined in the primer design program.

3. My output from BatchPrimer3 is formatted strangely. How do I change it?

A: No need to worry. ThermoPhyl has an option to use the output from Batch Primer3 as is. Just select the appropriate option in the introductory screen.

4. If I design so many assays for the similar sequences in my target group, won’t I end up with a bunch of redundant assays?

A: Good question. Yes, you probably will, but not to worry, ThermoPhyl will detect these and filter them down to a unique list.

5. Can I test primers to target all the sequences in a fasta file?

A: Yes, just don't specify any sequences in your target list (i.e. save it as an empty file).

6. ThermoPhyl tells me I have some discrepancy between my target list and my sequence database and so can’t calculate properly the number of matches to targets vs non-targets.

A: Common causes of this problem include strange characters (such as &*,!%) in sequence names (in the “outside_world.fas”) file and/or the target list (“target_list.txt” file), non-unique identifiers (‘AY100’ with file of sequence names which include AY100908, AY100909, etc.), and duplicate entries in the sequence database (“outside_world.fas” file). Correcting these errors should resolve the problem. ThermoPhyl will still run despite these errors, but the sensitivity and specificity calculations will be adversely affected.

Common problems with sequence databases include duplicate entries, gibberish or inconsistent naming schemes, etc. Remember, “Garbage In, Garbage Out”.

7. ThermoPhyl doesn’t find any matches in my dataset.

A: This can reflect the truth (although matches should always be found among your target sequences!), but more often is an artifact of either having the columns in the assay file (“candidate_assays.txt”) in the wrong order (correct order is Forward, Reverse or Forward, Probe, Reverse). Another common problem is having the Reverse Primer sequences reverse-complemented. They should be listed as true Reverse Primers as you would order the oligos (i.e. written 5’ – 3’ for the opposite strand; this is format used by Batch Primer3 and Primer Express to output their assays). The script automatically reverse-complements them to search on the same strand as the Forward Primer. Finally, the primer file (“candidate_assays.txt”) should have no special characters other than tab-delimitations and newline characters as in the example.

Finally, make sure you aren’t mixing ‘T’s and ‘U’s in your primers and your outside world file.

8. How long does ThermoPhyl take to run?

A: In tests on moderately fast WinXP machines (e.g. 2 GHz Pentium CPU w/ 3 Gb of RAM), we have tested 5,000 candidate qPCR assays against a database with 5,000 taxa, for 25 million comparisons in about 2.5 hrs. Most users will have many fewer comparisons than this and for most applications, ThermoPhyl produces output in seconds to minutes.

9. Why doesn’t ThermoPhyl search for degenerate bases/mismatches in my probe(s) and/or primers?

A: Two reasons:

First, because the premise of ThermoPhyl is to start with thermodynamically optimized assays, any introduction of degenerate bases is considered to be a compromise. For qPCR, ABI recommends against the use of degenerate primers or probes. Ideally, one should be able to find a sensitive and specific assay for a given target group without introducing degenerate bases.

Second, ThermoPhyl is designed to summarize large datasets and the output of searches with varying numbers of mismatches would overwhelm most users and defeat the purpose of the program. We always recommend comparing several of the best assays to your alignment file, and if absolutely necessary, introducing degeneracies at that point.

10. Can ThermoPhyl evaluate FISH probes as well?

A: Yes. Just select the option for a single oligo in the opening screen. The input file ‘candidate_assays.txt’ should have just a single column.

11. What are the output files and how do I view them?

A: ThermoPhyl produces two output files. The first file (‘sorted_search_results.txt’) is a summary table with a sorted tally of the number of target and non-target sequences matched by each candidate assay. The second file (‘raw_search_results.txt’) contains details of each match, including the amplicon length, the position on the DNA strand where each primer/probe matched, etc. These files can be viewed conveniently in a spreadsheet like MS Excel.

12. Is the ‘R_motif’ sequence printed out by ThermoPhyl ready to order as a Reverse Primer?

A: No, it needs to be reverse-complemented. ThermoPhyl prints it out this way to facilitate searching on the same strand by the user when comparing output.

13. Everything worked fine, now can I just order the first assay on the list?

A: For some applications, that may be OK, but in general we highly recommend looking at several of the highly ranked assays in a multiple alignment file (the ARB editor is really good for this) to confirm sensitivity and specificity, check for possible degeneracies, etc.