Multiple sequence alignment for phylogenetic purposes

David A. Morrison

doi:10.1071/SB06020

L. A. S. JOHNSON REVIEW

Next Contents Vol 19(6)

Multiple sequence alignment for phylogenetic purposes

David A. Morrison

+ Author Affiliations

- Author Affiliations

Department of Parasitology (SWEPAR), National Veterinary Institute and Swedish University of Agricultural Sciences, 751 89 Uppsala, Sweden. Email: David.Morrison@bvf.slu.se

Australian Systematic Botany 19(6) 479-539 https://doi.org/10.1071/SB06020
Submitted: 3 July 2006 Accepted: 30 October 2006 Published: 14 December 2006

Abstract

I have addressed the biological rather than bioinformatics aspects of molecular sequence alignment by covering a series of topics that have been under-valued, particularly within the context of phylogenetic analysis. First, phylogenetic analysis is only one of the many objectives of sequence alignment, and the most appropriate multiple alignment may not be the same for all of these purposes. Phylogenetic alignment thus occupies a specific place within a broader context. Second, homology assessment plays an intricate role in phylogenetic analysis, with sequence alignment consisting of primary homology assessment and tree building being secondary homology assessment. The objective of phylogenetic alignment thus distinguishes it from other sorts of alignment. Third, I summarise what is known about the serious limitations of using phenetic similarity as a criterion for automated multiple alignment, and provide an overview of what is currently being done to improve these computerised procedures. This synthesises information that is apparently not widely known among phylogeneticists. Fourth, I then consider the recent development of automated procedures for combining alignment and tree building, thus integrating primary and secondary homology assessment. Finally, I outline various strategies for increasing the biological content of sequence alignment procedures, which consists of taking into account known evolutionary processes when making alignment decisions. These procedures can be objective and repeatable, and can involve computerised algorithms to automate much of the work. Perhaps the most important suggestion is that alignment should be seen as a process where new sequences are added to a pre-existing alignment that has been manually curated by the biologist.

References

Aagesen L, Petersen G, Seberg O (2005) Sequence length variation, indel costs, and congruence in sensitivity analysis. Cladistics 21, 15–30.

Aboitiz F (1987) Letter to the editor. Cell 51, 515–516.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Achaz G, Boyer F, Rocha EPC, Viari AC (2006) Repseek, a tool to retrieve approximate repeats from large DNA sequences. Bioinformatics in press ,
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Al-Lazikani B, Sheinerman FB, Honig B (2001) Combining multiple structure and sequence alignments to improve sequence detection and alignment: application to the SH2 domains of janus kinases. Proceedings of the National Academy of Sciences USA 98, 14 796–14 801.
| Crossref | GoogleScholarGoogle Scholar |

Allison L, Wallace CS (1994) The posterior probability distribution of alignments and its application to parameter estimation of evolutionary trees and to optimization of multiple alignments. Journal of Molecular Evolution 39, 418–430.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Allison L , Wallace CS , Yee CN (1992) Minimum message length encoding, evolutionary trees and multiple alignment. In ‘Proceedings of the Hawaii international conference on system sciences (HICSS-25).’ pp. 663–674. (IEEE Press: Piscataway)

Althaus E, Caprara A, Lenhof H-P, Reinert K (2002) Multiple sequence alignment with arbitrary gap costs: computing an optimal solution using polyhedral combinatorics. Bioinformatics 18, S4–S16.
| PubMed |

Anbarasu LA, Narayanasamy P, Sundararajan V (2000) Multiple molecular sequence alignment by island parallel genetic algorithm. Current Science 78, 858–863.

Andersen ES, Rosenblad MA, Larsen N, Westergaard JC, Burks J, Wower IK, Wower J, Gorodkin J, Samuelsson T, Zwieb C (2006) The tmRDB and SRPDB resources. Nucleic Acids Research 34, D163–D168.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Anwar T, Khan AU (2006) SSRscanner: a program for reporting distribution and location of simple sequence repeats. Bioinformation 1, 89–91.

Apostolico A, Giancarlo R (1998) Sequence alignment in molecular biology. Journal of Computational Biology 5, 173–196.
| PubMed |

Armougom F, Moretti S, Poirot O, Audic S, Dumas P, Schaeli B, Keduas V, Notredame C (2006) Expresso: automatic incorporation of structural information in multiple sequence alignments using 3D-Coffee. Nucleic Acids Research 34, W604–W608.
| PubMed |

Arvestad L (1997) Aligning coding DNA in the presence of frame-shift errors. Lecture Notes in Computer Science 1264, 180–190.

Badger JH, Eisen JA, Ward NL (2005) Genomic analysis of Hyphomonas neptunium contradicts 16S rRNA gene-based phylogenetic analysis: implications for the taxonomy of the orders ‘Rhodobacterales’ and Caulobacterales. International Journal of Systematic and Evolutionary Microbiology 55, 1021–1026.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Bafna V, Tang H, Zhang S (2006) Consensus folding of unaligned RNA sequences revisited. Journal of Computational Biology 13, 283–295.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Bahr A, Thompson JD, Thierry J-C, Poch O (2001) BAliBASE (Benchmark Alignment dataBASE): enhancements for repeats, transmembrane sequences and circular permutations. Nucleic Acids Research 29, 323–326.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Barta JR (1997) Investigating phylogenetic relationships within the Apicomplexa using sequence data: the search for homology. Methods 13, 81–88.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Barton GJ, Sternberg MJE (1987) A strategy for the rapid multiple alignment of protein sequences: confidence levels from tertiary structure comparisons. Journal of Molecular Biology 198, 327–337.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Batzoglou S (2005) The many faces of sequence alignment. Briefings in Bioinformatics 6, 6–22.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Bauer M, Klau GW, Reinert K (2005a) Fast and accurate structural RNA alignment by progressive lagrangian optimization. Lecture Notes in Computer Science 3695, 217–228.

Bauer M, Klau GW, Reinert K (2005b) Multiple structural RNA alignment with lagrangian relaxation. Lecture Notes in Computer Science 3692, 303–314.

Baumel A, Ainouche ML, Bayer RJ, Ainouche AK, Misset MT (2002) Molecular phylogeny of hybridizing species from the genus Spartina Schreb. (Poaceae). Molecular Phylogenetics and Evolution 22, 303–314.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Beebe NW, Cooper RD, Morrison DA, Ellis JT (2000) Subset partitioning of the ribosomal DNA small subunit and its effects on the phylogeny of the Anopheles punctulatus group. Insect Molecular Biology 9, 515–520.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Bell LH, Coggins JR, Milner-White EJ (1993) Mix’n’Match: an improved multiple sequence alignment procedure for distantly related proteins using secondary structure predictions, designed to be independent of the choice of gap penalty and scoring matrix. Protein Engineering 6, 683–690.
| PubMed |

Belshaw R, Quicke DLJ (2002) Robustness of ancestral state estimates: evolution of life history strategy in ichneumonoid parasitoids. Systematic Biology 51, 450–477.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Benner SA, Cohen MA, Gonnet GH (1993) Empirical and structural models for insertions and deletions in the divergent evolution of proteins. Journal of Molecular Biology 229, 1065–1082.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Benson G (1997) Sequence alignment with tandem duplication. Journal of Computational Biology 4, 351–367.
| PubMed |

Benson G (1999) Tandem Repeats Finder: a program to analyze DNA sequences. Nucleic Acids Research 27, 573–580.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Bininda-Emonds ORP (2005) TransAlign: using amino acids to facilitate the multiple alignment of protein-coding DNA sequences. BMC Bioinformatics 6, 156.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Bishop MJ, Thompson EA (1986) Maximum likelihood alignment of DNA sequences. Journal of Molecular Biology 190, 159–165.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Blackshields G, Wallace IM, Larkin M, Higgins DG (2006) Analysis and comparison of benchmarks for multiple sequence alignment. In Silico Biology 6, 0030.
| PubMed |

Blaisdell BE (1986) A measure of the similarity of sets of sequences not requiring sequence alignment. Proceedings of the National Academy of Sciences USA 83, 5155–5159.
| Crossref | GoogleScholarGoogle Scholar |

Bledsoe AH, Sheldon FH (1990) Molecular homology and DNA hybridization. Journal of Molecular Evolution 30, 425–433.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Boeva V, Regnier M, Papatsenko D, Makeev V (2006) Short fuzzy tandem repeats in genomic sequences, identification, and possible role in regulation of gene expression. Bioinformatics 22, 676–684.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Bonizzoni P, Della Vedova G (2001) The complexity of multiple sequence alignment with SP-score that is a metric. Theoretical Computer Science 259, 63–79.
| Crossref | GoogleScholarGoogle Scholar |

Brawley SH (1999) Submission and retrieval of an aligned set of nucleic acid sequences. Journal of Phycology 35, 433–437.
| Crossref | GoogleScholarGoogle Scholar |

Brenner SE, Chothia C, Hubbard TJ (1998) Assessing sequence comparison methods with reliable structurally-identified distant evolutionary relationships. Proceedings of the National Academy of Sciences USA 95, 6073–6078.
| Crossref | GoogleScholarGoogle Scholar |

Briffeuil P, Baudoux G, Lambert C, De Bolle X, Vinals C, Feytmans E, Depiereux E (1998) Comparative analysis of seven multiple protein sequence alignment servers: clues to enhance reliability of predictions. Bioinformatics 14, 357–366.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Britten RJ, Rowen L, Williams J, Cameron RA (2003) Majority of divergence between closely related DNA samples is due to indels. Proceedings of the National Academy of Sciences USA 100, 4661–4665.
| Crossref | GoogleScholarGoogle Scholar |

Brower AVZ, Schawaroch V (1996) Three steps of homology assessment. Cladistics 12, 265–272.

Brown JW (1999) The ribonuclease P database. Nucleic Acids Research 27, 314.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Bucka-Lassen K, Caprani O, Hein J (1999) Combining many multiple alignments in one improved alignment. Bioinformatics 15, 122–130.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Butler AB, Saidel WM (2000) Defining sameness: historical, biological, and generative homology. BioEssays 22, 846–853.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Campagna D, Romualdi C, Vitulo N, Del Favero M, Lexa M, Cannata N, Valle G (2005) RAP: a new computer program for de novo identification of repeated sequences in whole genomes. Bioinformatics 21, 582–588.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Cannone JJ, Subramanian S, Schnare MN, Collett JR, D’Souza LM , et al. (2002) The Comparative RNA Web (CRW) Site: an online database of comparative sequence and structure information for ribosomal, intron, and other RNAs. BMC Bioinformatics 3, 2.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Carfi A, Pares S, Duée E, Galleni M, Duez C, Frère JM, Dideberg O (1995) The 3-D structure of a zinc metallo-β-lactamase from Bacillus cereus reveals a new type of protein fold. EMBO Journal 14, 4914–4921.
| PubMed |

Cartmill M (1994) A critique of homology as a morphological concept. American Journal of Physical Anthropology 94, 115–123.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Cartwright RA (2005) DNA assembly with gaps (DAWG): simulating sequence evolution. Bioinformatics 21, iii31–iii38.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Castelo AT, Martins W, Gao GR (2002) TROLL—tandem repeat occurrence locator. Bioinformatics 18, 634–636.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Catherinot V, Labesse G (2004) ViTO: tool for refinement of protein sequence–structure alignments. Bioinformatics 20, 3694–3696.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Cerchio S, Tucker P (1998) Influence of alignment on the mtDNA phylogeny of Cetacea: questionable support for a Mysticeti / Physeteroidea clade. Systematic Biology 47, 336–344.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Chain P, Kurtz S, Ohlebusch E, Slezak T (2003) An applications-focused review of comparative genomics tools: capabilities, limitations, and future challenges. Briefings in Bioinformatics 4, 105–123.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Chakrabarti S, Bhardwaj N, Anand PA, Sowdhamini R (2004) Improvement of alignment accuracy utilizing sequentially conserved motifs. BMC Bioinformatics 5, 167.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Chakrabarti S, Lanczycki CJ, Panchenko AR, Przytycka TM, Thiessen PA, Bryant SH (2006) Refining multiple sequence alignments with conserved core regions. Nucleic Acids Research 34, 2598–2606.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Chan SC, Wong AKC, Chiu DKY (1992) A survey of multiple sequence comparison methods. Bulletin of Mathematical Biology 54, 563–598.
| PubMed |

Chang MSS, Benner SA (2004) Empirical analysis of protein insertions and deletions determining parameters for the correct placement of gaps in protein sequence alignments. Journal of Molecular Biology 341, 617–631.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Chenna R, Sugawara H, Koike T, Lopez R, Gibson TJ, Higgins DG, Thompson JD (2003) Multiple sequence alignment with the Clustal series of programs. Nucleic Acids Research 31, 3497–3500.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Chiaromonte F , Yap VB , Miller W (2002) Scoring pairwise genomic sequence alignments. In ‘Proceedings of the 7th Pacific Symposium on Biocomputing 2002, Lihue, Hawaii’. pp. 115–126.

Chindelevitch L, Li Z, Blais E, Blanchette M (2006) On the inference of parsimonious indel evolutionary scenarios. Journal of Bioinformatics and Computational Biology 4, 721–744.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Clamp M, Cuff J, Searle SM, Barton GJ (2004) The Jalview java alignment editor. Bioinformatics 20, 426–427.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Cognato AI, Vogler AP (2001) Exploring data interaction and nucleotide alignment in a multiple gene analysis of Ips (Coleoptera: Scolytinae). Systematic Biology 50, 758–780.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Cole JR, Chai B, Farris RJ, Wang Q, Kulam SA, McGarrell DM, Garrity GM, Tiedje JM (2005) The Ribosomal Database Project (RDP-II): sequences and tools for high-throughput rRNA analysis. Nucleic Acids Research 33, D294–D296.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Cooper A, Lalueza-Fox C, Anderson S, Rambaut A, Austin J, Ward R (2001) Complete mitochondrial genome sequences of two extinct moas clarify ratite evolution. Nature 409, 704–707.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Corpet F (1988) Multiple sequence alignment with hierarchical clustering. Nucleic Acids Research 16, 10 881–10 890.

Corpet F, Michot B (1994) RNAlign program: alignment of RNA sequences using both primary and secondary structures. Computer Applications in the Biosciences 10, 389–399.
| PubMed |

Cozzetto D, Tramontano A (2005) Relationship between multiple sequence alignments and quality of protein comparative models. Proteins: Structure, Function, and Bioinformatics 58, 151–157.
| Crossref | GoogleScholarGoogle Scholar |

Croan DG, Morrison DA, Ellis JT (1997) Evolution of the genus Leishmania revealed by comparison of DNA and RNA polymerase gene sequences. Molecular and Biochemical Parasitology 89, 149–159.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Dalli D, Wilm A, Mainz I, Steger G (2006) STRAL: progressive alignment of non-coding RNA using base pairing probability vectors in quadratic time. Bioinformatics 22, 1593–1599.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Darling ACE, Mau B, Blattner FR, Perna NT (2004) Mauve: multiple alignment of conserved genomic sequence with rearrangements. Genome Research 14, 1394–1403.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

De Laet JE (2005) Parsimony and the problem of inapplicables in sequence data. In ‘Parsimony, phylogeny, and genomics.’ (Ed. VA Albert) pp. 81–116. (Oxford University Press: Oxford)

Deléage G, Clerc FF, Roux B, Gautheron DC (1988) ANTHEPROT: a package for protein sequence analysis using a microcomputer. Computer Applications in the Biosciences 4, 351–356.
| PubMed |

De Rijk P, De Wachter R (1993) DCSE, an interactive tool for sequence alignment and secondary structure research. Bioinformatics 9, 735–740.

DeSantis TZ, Hugenholtz P, Larsen N, Rojas M, Brodie EL, Keller K, Huber T, Dalevi D, Hu P, Andersen GL (2006a) Greengenes, a chimera-checked 16S rRNA gene database and workbench compatible with ARB. Applied and Environmental Microbiology 72, 5069–5072.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

DeSantis TZ, Hugenholtz P, Keller K, Brodie EL, Larsen N, Piceno YM, Phan R, Andersen GL (2006b) NAST: a multiple sequence alignment server for comparative analysis of 16S rRNA genes. Nucleic Acids Research 34, W394–W399.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Dewey CN, Pachter L (2006) Evolution at the nucleotide level: the problem of multiple whole-genome alignment. Human Molecular Genetics 15, R51–R56.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Do CB, Mahabhashyam MSP, Brudno M, Batzoglou S (2005) ProbCons: probabilistic consistency-based multiple sequence alignment. Genome Research 15, 330–340.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Domingues FS, Lackner P, Andreeva A, Sippl MJ (2000) Structure-based evaluation of sequence comparison and fold recognition alignment accuracy. Journal of Molecular Biology 297, 1003–1013.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Donoghue MJ , Sanderson MJ (1994) Complexity and homology in plants. In ‘Homology: the hierarchical basis of comparative biology’. (Ed. BK Hall) pp. 393–421. (Academic Press: San Diego)

Doolittle RF (1981) Similar amino acid sequences: chance or common ancestry? Science 214, 149–159.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Duret L , Abdeddaïm S (2000) Multiple alignments for structural, functional, or phylogenetic analyses of homologous sequences. In ‘Bioinformatics: sequence, structure, and databanks.’ (Ed. D Higgins, W Taylor) pp. 51–76. (Oxford University Press: Oxford)

Ebedes J, Datta A (2004) Multiple sequence alignment in parallel on a workstation cluster. Bioinformatics 20, 1193–1195.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Eddy SR (1998) Profile hidden markov models. Bioinformatics 14, 755–763.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Eddy SR (2002a) A memory efficient dynamic programming algorithm for optimal structural alignment of a sequence to an RNA secondary structure. BMC Bioinformatics 3, 18.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Eddy SR (2002b) Computational genomics of noncoding RNA genes. Cell 109, 137–140.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Edgar RC (2004a) Local homology recognition and distance measures in linear time using compressed amino acid alphabets. Nucleic Acids Research 32, 380–385.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Edgar RC (2004b) MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Research 32, 1792–1797.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Edgar RC (2004c) MUSCLE: a multiple sequence alignment method with reduced time and space complexity. BMC Bioinformatics 5, 113.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Edgar RC, Sjölander K (2004) A comparison of scoring functions for protein sequence profile alignment. Bioinformatics 20, 1301–1308.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Edgar RC, Batzoglou S (2006) Multiple sequence alignment. Current Opinion in Structural Biology 16, 368–373.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Elias I (2003) Settling the intractability of multiple alignment. Lecture Notes in Computer Science 2906, 352–363.

Ellis J, Morrison D (1995) Effects of sequence alignment on the phylogeny of Sarcocystis deduced from 18S rDNA sequences. Parasitology Research 81, 696–699.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Errami M, Geourjon C, Deléage G (2003) Conservation of amino acids into multiple alignments involved in pairwise interactions in three-dimensional protein structures. Journal of Bioinformatics and Computational Biology 1, 505–520.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Feng D-F, Doolittle RF (1987) Progressive sequence alignment as a prerequisite to correct phylogenetic trees. Journal of Molecular Evolution 25, 351–360.
| PubMed |

Finn RD, Mistry J, Schuster-Böckler B, Griffiths-Jones S, Hollich V , et al. (2006) Pfam: clans, web tools and services. Nucleic Acids Research 34, D247–D251.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Fitch WM (2000) Homology: a personal view on some of the problems. Trends in Genetics 16, 227–231.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Fitch WM, Smith TF (1983) Optimal sequence alignments. Proceedings of the National Academy of Sciences USA 80, 1382–1386.
| Crossref | GoogleScholarGoogle Scholar |

Fleißner R (2004) ‘Sequence alignment and phylogenetic inference.’ (Logos Verlag: Berlin)

Fleissner R, Metzler D, von Haeseler A (2005) Simultaneous statistical multiple alignment and phylogeny reconstruction. Systematic Biology 54, 548–561.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Frith MC, Hansen U, Spouge JL, Weng Z (2004) Finding functional sequence elements by multiple local alignment. Nucleic Acids Research 32, 189–200.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Gagnon S, Bourbeau D, Levesque RC (1996) Secondary structures and features of the 18S, 5.8S and 26S ribosomal RNAs from the Apicomplexan parasite Toxoplasma gondii. Gene 173, 129–135.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Gardner PP, Giegerich R (2004) A comprehensive comparison of comparative RNA structure prediction approaches. BMC Bioinformatics 5, 140.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Gardner PP, Wilm A, Washietl S (2005) A benchmark of multiple sequence alignment programs upon structural RNAs. Nucleic Acids Research 33, 2433–2439.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Geiger DL (2002) Stretch coding and block coding: two new strategies to represent questionably aligned DNA sequences. Journal of Molecular Evolution 54, 191–199.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Gille C, Frömmel C (2001) STRAP: editor for structural alignments of proteins. Bioinformatics 17, 377–378.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Gillespie JJ (2004) Characterizing regions of ambiguous alignment caused by the expansion and contraction of hairpin-stem loops in ribosomal RNA molecules. Molecular Phylogenetics and Evolution 33, 936–943.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Gillespie JJ, Yoder MJ, Wharton RA (2005a) Predicted secondary structure for 28S and 18S rRNA from Ichneumonoidea (Insecta: Hymenoptera: Apocrita): impact on sequence alignment and phylogeny estimation. Journal of Molecular Evolution 61, 114–137.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Gillespie JJ, McKenna CH, Yoder MJ, Gutell RR, Johnston JS, Kathirithamby J, Cognato AI (2005b) Assessing the odd secondary structural properties of nuclear small subunit ribosomal RNA sequences (18S) of the twisted-wing parasites (Insecta: Strepsiptera). Insect Molecular Biology 14, 625–643.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Giribet G (2001) Exploring the behavior of POY, a program for direct optimization of molecular data. Cladistics 17, S60–S70.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Giribet G (2002) Relationship among metazoan phyla as inferred from 18S rRNA sequence data: a methodological approach. In ‘Molecular systematics and evolution: theory and practice’. (Eds R DeSalle, G Giribet, W Wheeler) pp. 85–101. (Birkhäuser Verlag: Basel)

Giribet G (2005) Generating implied alignments under direct optimization using POY. Cladistics 21, 396–402.
| Crossref | GoogleScholarGoogle Scholar |

Giribet G, Wheeler WC (1999) On gaps. Molecular Phylogenetics and Evolution 13, 132–143.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Giribet G , Wheeler WC , Muona J (2002) DNA multiple sequence alignments. In ‘Molecular systematics and evolution: theory and practice’. (Eds R DeSalle, G Giribet, W Wheeler) pp. 107–114. (Birkhäuser Verlag: Basel)

Gonnet GH, Korostensky C, Benner S (2000) Evaluation measures of multiple sequence alignments. Journal of Computational Biology 7, 261–276.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Gotoh O (1982) An improved algorithm for matching biological sequences. Journal of Molecular Biology 162, 705–708.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Gotoh O (1990) Consistency of optimal sequence alignments. Bulletin of Mathematical Biology 52, 509–525.
| PubMed |

Gotoh O (1995) A weighting scheme and algorithm for aligning many phylogenetically related sequences. Computer Applications in the Biosciences 11, 543–551.
| PubMed |

Gotoh O (1996) Significant improvement in accuracy of multiple protein sequence alignments by iterative refinement as assessed by reference to structural alignments. Journal of Molecular Biology 264, 823–838.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Gotoh O (1999) Multiple sequence alignment: algorithms and applications. Advances in Biophysics 36, 159–206.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Gough J (2005) Convergent evolution of domain architectures is rare. Bioinformatics 21, 1464–1471.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Graham SW, Reeves PA, Burns ACE, Olmstead RG (2000) Microstructural changes in noncoding chloroplast DNA: interpretation, evolution, and utility of indels and inversions in basal angiosperm phylogenetic inference. International Journal of Plant Sciences 161, S83–S96.
| Crossref | GoogleScholarGoogle Scholar |

Grasso C, Lee C (2004) Combining partial order alignment and progressive multiple sequence alignment increases alignment speed and scalability to very large alignment problems. Bioinformatics 20, 1546–1556.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Greenberg HJ, Hart WE, Lancia G (2004) Opportunities for combinatorial optimization in computational biology. INFORMS Journal on Computing 16, 211–231.
| Crossref | GoogleScholarGoogle Scholar |

Griffiths-Jones S (2005) RALEE—RNA alignment editor in emacs. Bioinformatics 21, 257–259.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Griffiths-Jones S, Moxon S, Marshall M, Khanna A, Eddy SR, Bateman A (2005) Rfam: annotating non-coding RNAs in complete genomes. Nucleic Acids Research 33, D121–D124.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Gu X, Li W-H (1995) The size distribution of insertions and deletions in human and rodent pseudogenes suggests the logarithmic gap penalty for sequence alignment. Journal of Molecular Evolution 40, 464–473.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Gueneau de Novoa P, Williams KP (2004) The tmRNA website: reductive evolution of tmRNA in plastids and other endosymbionts. Nucleic Acids Research 32, D104–D108.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Gupta SK, Kececioglu JD, Schäffer AA (1995) Improving the practical space and time efficiency of the shortest-paths approach to sum-of-pairs multiple sequence alignment. Journal of Computational Biology 2, 459–472.
| PubMed |

Gutell RR, Lee JC, Cannone JJ (2002) The accuracy of ribosomal RNA comparative structure models. Current Opinion in Structural Biology 12, 301–310.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Hall TA (1999) BioEdit: a user-friendly biological sequence alignment editor and analysis program for Windows 95/98/NT. Nucleic Acids Symposium Series 41, 95–98.

Hancock JM, Vogler AP (2000) How slippage-derived sequences are incorporated into rRNA variable-region secondary structure: implications for phylogeny reconstruction. Molecular Phylogenetics and Evolution 14, 366–374.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Haszprunar G (1998) Parsimony analysis as a specific kind of homology estimation and the implications for character weighting. Molecular Phylogenetics and Evolution 9, 333–339.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Heger A, Holm L (2000) Rapid automatic detection and alignment of repeats in protein sequences. Proteins: Structure, Function, and Genetics 41, 224–237.
| Crossref | GoogleScholarGoogle Scholar |

Hein J (1990) Unified approach to alignment and phylogenies. Methods in Enzymology 183, 626–645.
| PubMed |

Hein J (1994) An algorithm combining DNA and protein alignment. Journal of Theoretical Biology 167, 169–174.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Hein J, Støvlbæk J (1996) Combined DNA and protein alignment. Methods in Enzymology 266, 402–418.
| PubMed |

Helm M, Brulé H, Friede D, Giegé R, Pütz J, Florentz C (2000) Search for characteristic structural features of mammalian mitochondrial tRNAs. RNA 6, 1356–1379.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Henneke CM (1989) A multiple sequence alignment algorithm for homologous proteins using secondary structure information and optionally keying alignments to functionally important sites. Computer Applications in the Biosciences 5, 141–150.
| PubMed |

Hennig W (1966) ‘Phylogenetic systematics.’ [Transl. DD Davis, R Zangerl from W Hennig (1950) ‘Grundzüge einer theorie der phylogenetischen systematik.’ (Deutscher Zentralverlag: Berlin)] (University of Illinois Press: Urbana)

Henikoff S (1991) Playing with blocks: some pitfalls of forcing multiple alignments. The New Biologist 3, 1148–1154.
| PubMed |

Heringa J (1999) Two strategies for sequence comparison: profile-preprocessed and secondary structure-induced multiple alignment. Computers and Chemistry 23, 341–364.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Hickson RE, Simon C, Cooper A, Spicer GS, Sullivan J, Penny D (1996) Conserved sequence motifs, alignment, and secondary structure for the third domain of animal 12S rRNA. Molecular Biology and Evolution 13, 150–169.
| PubMed |

Hickson RE, Simon C, Perrey SW (2000) The performance of several multiple-sequence alignment programs in relation to secondary-structure features for an rRNA sequence. Molecular Biology and Evolution 17, 530–539.
| PubMed |

Higgins DG, Thompson JD, Gibson TJ (1996) Using CLUSTAL for multiple sequence alignments. Methods in Enzymology 266, 383–402.
| PubMed |

Higgins DG, Blackshields G, Wallace IM (2005) Mind the gaps: progress in progressive alignment. Proceedings of the National Academy of Sciences USA 102, 10 411–10 412.
| Crossref | GoogleScholarGoogle Scholar |

Higgs PG (2000) RNA secondary structure: physical and computational aspects. Quarterly Reviews of Biophysics 33, 199–253.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Hillis DM (1994) Homology in molecular biology. In ‘Homology: the hierarchical basis of comparative biology’. (Ed. BK Hall) pp. 339–368. (Academic Press: San Diego)

Hirosawa M, Totoki Y, Hoshida M, Ishikawa M (1995) Comprehensive study of iterative algorithms of multiple sequence alignment. Computer Applications in the Biosciences 11, 13–18.
| PubMed |

Hofacker IL, Bernhart SHF, Stadler PF (2004) Alignment of RNA base pairing probability matrices. Bioinformatics 20, 2222–2227.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Hogeweg P, Hesper B (1984) The alignment of sets of sequences and the construction of phyletic trees: an integrated method. Journal of Molecular Evolution 20, 175–186.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Holm L, Sander C (1996) Mapping the protein universe. Science 273, 595–603.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Holmes I (2005) Accelerated probabilistic inference of RNA structure evolution. BMC Bioinformatics 6, 73.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Holmes I, Durbin R (1998) Dynamic programming alignment accuracy. Journal of Computational Biology 5, 493–504.
| PubMed |

Hoot SB, Douglas AW (1998) Phylogeny of the Proteaceae based on atpB and atpB–rbcL intergenic spacer region sequences. Australian Systematic Botany 11, 301–320.
| Crossref | GoogleScholarGoogle Scholar |

Hua Y, Jiang T, Wu B (1999) Aligning DNA sequences to minimize the change in protein. Journal of Combinatorial Optimization 3, 227–245.
| Crossref | GoogleScholarGoogle Scholar |

Huang X, Miller W (1991) A time-efficient, linear-space local similarity algorithm. Advances in Applied Mathematics 12, 337–357.
| Crossref | GoogleScholarGoogle Scholar |

Hudak J , McClure MA (1999) A comparative analysis of computational motif-detection methods. In ‘Proceedings of the 4th Pacific Symposium on Biocomputing 1999, Hawaii’. pp. 138–149.

Janies DA, Wheeler WC (2001) Efficiency of parallel direct optimization. Cladistics 17, S71–S82.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Jennings AJ, Edge CM, Sternberg MJE (2001) An approach to improving multiple alignments of protein sequences using predicted secondary structure. Protein Engineering 14, 227–231.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Jeon Y-S, Chung H, Park S, Hur I, Lee J-H, Chun J (2005) jPHYDIT: a JAVA-based integrated environment for molecular phylogeny of ribosomal RNA sequences. Bioinformatics 21, 3171–3173.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Jiang T , Lawler EL , Wang L (1994) Aligning sequences via an evolutionary tree: complexity and approximation. In ‘Proceedings of the 26th annual ACM symposium on theory of computing’. pp. 760–769. (ACM Press: New York)

Johnson MS, Sali A, Blundell TL (1990) Phylogenetic relationships from three-dimensional protein structures. Methods in Enzymology 183, 670–690.
| PubMed |

Johnson R (1982) Parsimony principles in phylogenetic systematics: a critical re-appraisal. Evolutionary Theory 6, 79–90.

Just W (2001) Computational complexity of multiple sequence alignment with SP-score. Journal of Computational Biology 8, 615–623.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Just W, Della Vedova G (2004) Multiple sequence alignment as a facility location problem. INFORMS Journal on Computing 16, 430–440.
| Crossref | GoogleScholarGoogle Scholar |

Karaca M, Bilgen M, Onus AN, Ince AG, Elmasulu SY (2005) Exact Tandem Repeats Analyzer (E-TRA): a new program for DNA sequence mining. Journal of Genetics 84, 49–54.
| PubMed |

Karp RM (2002) Mathematical challenges from genomics and molecular biology. Notices of the AMS 49, 544–553.

Karplus K, Hu B (2001) Evaluation of protein multiple alignments by SAM-T99 using the BAliBASE multiple alignment test set. Bioinformatics 17, 713–720.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Katoh K, Misawa K, Kuma K, Miyata T (2002) MAFFT: a novel method for rapid multiple sequence alignment based on fast fourier transform. Nucleic Acids Research 30, 3059–3066.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Katoh K, Kuma K, Toh H, Miyata T (2005a) MAFFT version 5: improvement in accuracy of multiple sequence alignment. Nucleic Acids Research 33, 511–518.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Katoh K, Kuma K, Miyata T, Toh H (2005b) Improvement in the accuracy of multiple sequence alignment program MAFFT. Genome Informatics 16, 22–33.
| PubMed |

Kawakita A, Sota T, Ascher JS, Ito M, Tanaka H, Kato M (2003) Evolution and phylogenetic utility of alignment gaps within intron sequences of three nuclear genes in bumble genes (Bombus). Molecular Biology and Evolution 20, 87–92.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Kececioglu J , Starrett D (2004) Aligning alignments exactly. In ‘Proceedings of the 8th ACM conference on research in computational molecular biology (RECOMB’04)’. pp. 85–96. (ACM Press: New York)

Kececioglu J, Kim E (2006) Simple and fast inverse alignment. Lecture Notes in Computer Science 3909, 441–455.

Keightley PD, Johnson T (2004) MCALIGN: stochastic alignment of noncoding DNA sequences based on an evolutionary model of sequence evolution. Genome Research 14, 442–450.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Kelchner SA (2000) The evolution of non-coding chloroplast DNA and its application in plant systematics. Annals of the Missouri Botanical Garden 87, 482–498.
| Crossref | GoogleScholarGoogle Scholar |

Kelchner SA (2002) Group II introns as phylogenetic tools: structure, function, and evolutionary constraints. American Journal of Botany 89, 1651–1669.

Kelchner SA, Wendel JF (1996) Hairpins create minute inversions in non-coding regions of chloroplast DNA. Current Genetics 30, 259–262.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Kelchner SA, Clark LG (1997) Molecular evolution and phylogenetic utility of the chloroplast rpl16 intron in Chusquea and the Bambusoideae (Poaceae). Molecular Phylogenetics and Evolution 8, 385–397.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Kjer KM (1995) Use of rRNA secondary structure in phylogenetic studies to identify homologous positions: an example of alignment and data presentation from the frogs. Molecular Phylogenetics and Evolution 4, 314–330.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Kjer KM (1997) An alignment template for amphibian 12S rRNA, domain III: conserved primary and secondary structural motifs. Journal of Herpetology 31, 599–604.
| Crossref | GoogleScholarGoogle Scholar |

Kjer KM (2004) Aligned 18S and insect phylogeny. Systematic Biology 53, 506–514.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Kjer KM, Baldridge GD, Fallon AM (1994) Mosquito large subunit ribosomal RNA: simultaneous alignment of primary and secondary structure. Biochimica et Biophysica Acta 1217, 147–155.
| PubMed |

Kjer KM, Gillespie JJ, Ober KA (2006) Opinions on multiple sequence alignment, and an empirical comparison of repeatability and accuracy between POY and structural alignment. Systematic Biology in press ,

Kleinjung J, Douglas N, Heringa J (2002) Parallelized multiple alignment. Bioinformatics 18, 1270–1271.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Knudsen B, Miyamoto M (2003) Sequence alignments and pair hidden markov models using evolutionary history. Journal of Molecular Biology 333, 453–460.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Kolodny R, Koehl P, Levitt M (2005) Comprehensive evaluation of protein structure alignment methods: scoring by geometric measures. Journal of Molecular Biology 346, 1173–1188.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Kreitman M (1983) Nucleotide polymorphism at the alcohol dehydrogenase locus of Drosophila melanogaster. Nature 304, 412–417.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Kroken S, Taylor JW (2001) Outcrossing and recombination in the lichenized fungus Letharia. Fungal Genetics and Biology 34, 83–92.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Kurtz S, Schleiermacher C (1999) REPuter: fast computation of maximal repeats in complete genomes. Bioinformatics 15, 426–427.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Lambert C, Van Campenhout J-M, DeBolle X, Depiereux E (2003) Review of common sequence alignment methods: clues to enhance reliability. Current Genomics 4, 131–146.
| Crossref | GoogleScholarGoogle Scholar |

Lancia G, Ravi R (1999) GESTALT: genomic steiner aligments. Lecture Notes in Computer Science 1645, 101–114.

Lassmann T, Sonnhammer ELL (2002) Quality assessment of multiple alignment programs. FEBS Letters 529, 126–130.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Lassmann T, Sonnhammer ELL (2005) Automatic assessment of alignment quality. Nucleic Acids Research 33, 7120–7128.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Laurenne NM, Broad GR, Quicke DLJ (2006) Direct optimization and multiple alignment of 28S D2–D3 rDNA sequences: problems with indels on the way to a molecular phylogeny of the cryptine ichneumon wasps (Insecta: Hymenoptera). Cladistics 22, 442–473.
| Crossref | GoogleScholarGoogle Scholar |

Lawrence CJ, Malmberg RL, Muszynski MG, Dawe RK (2002) Maximum likelihood methods reveal conservation of function among closely related kinesin families. Journal of Molecular Evolution 54, 42–53.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Lawrence CJ, Zmasek CM, Dawe RK, Malmberg RL (2004) LumberJack: a heuristic tool for sequence alignment exploration and phylogenetic inference. Bioinformatics 20, 1977–1979.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Lebrun E, Santini JM, Brugna M, Ducluzeau A-L, Ouchane S, Schoepp-Cothenet B, Baymann F, Nitschke W (2006) The rieske protein: a case study on the pitfalls of multiple sequence alignments and phylogenetic reconstruction. Molecular Biology and Evolution 23, 1180–1191.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Lecompte O, Thompson JD, Plewniak F, Thierry J-C, Poch O (2001) Multiple alignment of complete sequences (MACS) in the post-genomic era. Gene 270, 17–30.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Lee MSY (2001) Unalignable sequences and molecular evolution. Trends in Ecology and Evolution 16, 681–685.
| Crossref | GoogleScholarGoogle Scholar |

Lenhof H-P, Reinert K, Vingron M (1998) A polyhedral approach to RNA sequence structure alignment. Journal of Computational Biology 5, 517–530.
| PubMed |

Li K-B (2003) ClustalW-MPI: ClustalW analysis using distributed and parallel computing. Bioinformatics 19, 1585–1586.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Lombard V, Camon EB, Parkinson HE, Hingamp P, Stoesser G, Redaschi N (2002) EMBL-Align: a new public nucleotide and amino acid multiple sequence alignment database. Bioinformatics 18, 763–764.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Löytynoja A, Milinkovitch MC (2001) SOAP, cleaning multiple alignments from unstable blocks. Bioinformatics 17, 573–574.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Löytynoja A, Milinkovitch MC (2003) A hidden markov model for progressive multiple alignment. Bioinformatics 19, 1505–1513.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Löytynoja A, Goldman N (2005) An algorithm for progressive multiple alignment of sequences with insertions. Proceedings of the National Academy of Sciences USA 102, 10 557–10 562.
| Crossref | GoogleScholarGoogle Scholar |

Lu CL, Huang YP (2005) A memory-efficient algorithm for multiple sequence alignment with constraints. Bioinformatics 21, 23–30.

Ludwig W, Strunk O, Westram R, Richter L, Meier H , et al. (2004) ARB: a software environment for sequence data. Nucleic Acids Research 32, 1363–1371.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Lunter G , Drummond AJ , Miklós I , Hein J (2005) Statistical alignment: recent progress, new applications, and challenges. In ‘Statistical methods in molecular evolution’. (Ed. R Nielsen) pp. 375–405. (Springer: New York)

Manohar A , Batzoglou S (2005) TreeRefiner: a tool for refining a multiple alignment on a phylogenetic tree. In ‘Proceedings of the 2005 IEEE computational systems bioinformatics conference (CSB’05)’. pp. 111–119. (IEEE Press: Piscataway)

Marchler-Bauer A, Panchenko AR, Ariel N, Bryant SH (2002) Comparison of sequence and structure alignments for protein domains. Proteins: Structure, Function, and Genetics 48, 439–446.
| Crossref | GoogleScholarGoogle Scholar |

Marchler-Bauer A, Anderson JB, Cherukuri PF, DeWeese-Scott C, Geer LY , et al. (2005) CDD: a conserved domain database for protein classification. Nucleic Acids Research 33, D192–D196.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Margulies EH, Chen CW, Green ED (2006) Differences between pair-wise and multi-sequence alignment methods affect vertebrate genome comparisons. Trends in Genetics 22, 187–193.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Marsden B, Abagyan R (2004) SAD—a normalized structural alignment database: improving sequence–structure alignments. Bioinformatics 20, 2333–2344.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Marti-Renom MA, Madhusudhan MS, Sali A (2004) Alignment of protein sequences by their profiles. Protein Science 13, 1071–1087.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

May ACW (2004) Percent sequence identity: the need to be explicit. Structure 12, 737–738.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

McClure MA, Vasi TK, Fitch WM (1994) Comparative analysis of multiple protein-sequence alignment methods. Molecular Biology and Evolution 11, 571–592.
| PubMed |

Mecham J, Clement M, Snell Q, Freestone T, Seppi K, Crandall K (2006) Jumpstarting phylogenetic analysis. International Journal of Bioinformatics Research and Applications 2, 19–35.

Miklós I, Lunter GA, Holmes I (2004) A “long indel” model for evolutionary sequence alignment. Molecular Biology and Evolution 21, 529–540.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Milinkovitch MC, LeDuc RG, Adachi J, Farnir F, Georges M, Hasegawa M (1996) Effects of character weighting and species sampling on phylogeny reconstruction: a case study based on DNA sequence data in cetaceans. Genetics 144, 1817–1833.
| PubMed |

Miller W (2001) Comparison of genomic DNA sequences: solved and unsolved problems. Bioinformatics 17, 391–397.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Mindell DP (1991) Aligning DNA sequences: homology and phylogenetic weighting. In ‘Phylogenetic analysis of DNA sequences’. (Eds MM Miyamoto, J Cracraft) pp. 73–89. (Oxford University Press: New York)

Morell V (1996) TreeBASE: the roots of phylogeny. Science 273, 569.
| Crossref | GoogleScholarGoogle Scholar |

Morgenstern B (1999) DIALIGN 2: improvement of the segment-to-segment approach to multiple sequence alignment. Bioinformatics 15, 211–218.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Morgenstern B, Prohaska SJ, Pohler D, Stadler PF (2006) Multiple sequence alignment with user-defined anchor points. Algorithms for Molecular Biology 1, 6.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Morris P, Cobabe E (1991) Cuvier meets Watson and Crick: the utility of molecules as classical homologies. Biological Journal of the Linnean Society 44, 307–324.

Morrison DA (2006) Phylogenetic analyses of parasites in the new millennium. Advances in Parasitology 63, 1–124.

Morrison DA, Ellis JT (1997) Effects of nucleotide sequence alignment on phylogeny estimation: a case study of 18S rDNAs of Apicomplexa. Molecular Biology and Evolution 14, 428–441.
| PubMed |

Mugridge NB, Morrison DA, Jäkel T, Heckeroth AR, Tenter AM, Johnson AM (2000) Effects of sequence alignment and structural domains of ribosomal DNA on phylogeny reconstruction for the protozoan family Sarcocystidae. Molecular Biology and Evolution 17, 1842–1853.
| PubMed |

Myers G, Selznick S, Zhang Z, Miller W (1996) Progressive multiple alignment with constraints. Journal of Computational Biology 3, 563–572.
| PubMed |

Nguyen HD, Yoshihara I, Yamamori K, Yasunaga M (2002) Aligning multiple protein sequences by parallel hybrid genetic algorithm. Genome Informatics 13, 123–132.
| PubMed |

Nicholas HB, Ropelewski AJ, Deerfield DW (2002) Strategies for multiple sequence alignment. BioTechniques 32, 572–591.
| PubMed |

Notredame C (2002) Recent progress in multiple sequence alignment: a survey. Pharmacogenomics 3, 131–144.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Notredame C, O’Brien EA, Higgins DG (1997) RAGA: RNA sequence alignment by genetic algorithm. Nucleic Acids Research 25, 4570–4580.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Notredame C, Holm L, Higgins DG (1998) COFFEE: an objective function for multiple sequence alignments. Bioinformatics 14, 407–422.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Notredame C, Higgins DG, Heringa J (2000) T-coffee: a novel method for fast and accurate multiple sequence alignment. Journal of Molecular Biology 302, 205–217.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Nozaki Y, Bellgard M (2005) Statistical evaluation and comparison of a pairwise alignment algorithm that a priori assigns the number of gaps rather than employing gap penalties. Bioinformatics 21, 1421–1428.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

O’Brien EA, Notredame C, Higgins DG (1998) Optimization of ribosomal RNA profile alignments. Bioinformatics 14, 332–341.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

O’Donnell K, Kistler HC, Tacke BK, Casper HH (2000) Gene genealogies reveal global phylogeographic structure and reproductive isolation among lineages of Fusarium graminearum, the fungus causing wheat scab. Proceedings of the National Academy of Sciences USA 97, 7905–7910.
| Crossref | GoogleScholarGoogle Scholar |

Ogden TH, Rosenberg MS (2006) Multiple sequence alignment accuracy and phylogenetic inference. Systematic Biology 55, 314–328.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Ohlson T, Wallner B, Elofsson A (2004) Profile–profile methods provide improved fold recognition: a study of different profile–profile alignment methods. Proteins: Structure, Function, and Bioinformatics 57, 188–197.
| Crossref | GoogleScholarGoogle Scholar |

Oliver T, Schmidt B, Nathan D, Clemens R, Maskell D (2005) Using reconfigurable hardware to accelerate multiple sequence alignment with ClustalW. Bioinformatics 21, 3431–3432.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Ophir R, Graur D (1997) Patterns and rates of indel evolution in processed pseudogenes from humans and murids. Gene 205, 191–202.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

O’Sullivan O, Suhre K, Abergel C, Higgins DG, Notredame C (2004) 3DCoffee: combining protein sequences and structures within multiple sequence alignments. Journal of Molecular Biology 340, 385–395.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Page RDM (2000) Comparative analysis of secondary structure of insect mitochondrial small subunit ribosomal RNA using maximum weighted matching. Nucleic Acids Research 28, 3839–3845.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Parida L, Floratos A, Rigoutsos I (1999) An approximation algorithm for alignment of multiple sequences using motif discovery. Journal of Combinatorial Optimization 3, 247–275.
| Crossref | GoogleScholarGoogle Scholar |

Parmentier G, Trystram D, Zola J (2004) Cache-based parallelization of multiple sequence alignment problem. Lecture Notes in Computer Science 3149, 1005–1012.

Pascarella S, Argos P (1992) Analysis of insertions / deletions in protein structures. Journal of Molecular Biology 224, 461–471.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Patterson C (1988) Homology in classical and molecular biology. Molecular Biology and Evolution 5, 603–625.
| PubMed |

Pearson WR, Sierk ML (2005) The limits of protein sequence comparison? Current Opinion in Structural Biology 15, 254–260.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Pedersen CNS, Lyngsø R, Hein J (1998) Comparison of coding DNA. Lecture Notes in Computer Science 1448, 153–173.

Pei J, Grishin NV (2006) MUMMALS: multiple sequence alignment improved by using hidden markov models with local structural information. Nucleic Acids Research 34, 4364–4374.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Pei J, Sadreyev R, Grishin NV (2003) PCMA: fast and accurate multiple sequence alignment based on profile consistency. Bioinformatics 19, 427–428.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Petersen G, Seberg O, Aagesen L, Frederiksen S (2004) An empirical test of the treatment of indels during optimization alignment based on the phylogeny of the genus Secale (Poaceae). Molecular Phylogenetics and Evolution 30, 733–742.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Pettersson EU, Ljunggren EL, Morrison DA, Mattsson JG (2005) Functional analysis and localisation of a class delta glutathione S-transferase from Sarcoptes scabiei. International Journal for Parasitology 35, 39–48.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Phillips A (2006) Homology assessment and molecular sequence alignment. Journal of Biomedical Informatics 39, 18–33.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Phillips A, Janies D, Wheeler W (2000) Multiple sequence alignment in phylogenetic analysis. Molecular Phylogenetics and Evolution 16, 317–330.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Pible O, Imbert G, Pellequer J-L (2005) INTERALIGN: interactive alignment editor for distantly related protein sequences. Bioinformatics 21, 3166–3167.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

de Pinna MCC (1991) Concepts and tests of homology in the cladistic paradigm. Cladistics 7, 367–394.
| Crossref | GoogleScholarGoogle Scholar |

Poch O, Delarue M (1996) Converting sequence block alignments into structural insights. Methods in Enzymology 266, 662–680.
| PubMed |

Pollard DA, Bergman CM, Stoye J, Celniker SE, Eisen MB (2004) Benchmarking tools for the alignment of functional noncoding DNA. BMC Bioinformatics 5, 6.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Ponting CP , Birney E (2000) Identification of domains from protein sequences. In ‘Protein structure prediction: methods and protocols’. (Ed. DM Webster) pp. 53–69. (Humana Press: Totowa)

Qian B, Goldstein RA (2001) Distribution of indel lengths. Proteins: Structure, Function, and Genetics 45, 102–104.
| Crossref | GoogleScholarGoogle Scholar |

Raghava GPS, Searle SMJ, Audley PC, Barber JD, Barton GJ (2003) OXBench: a benchmark for evaluation of protein multiple sequence alignment accuracy. BMC Bioinformatics 4, 47.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Rainaldi G, Volpicella M, Licciulli F, Liuni S, Gallerani R, Ceci LR (2003) PLMItRNA, a database on the heterogeneous genetic origin of mitochondrial tRNA genes and tRNAs in photosynthetic eukaryotes. Nucleic Acids Research 31, 436–438.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Raphael B, Zhi D, Tang H, Pevzner P (2004) A novel method for multiple alignment of sequences with repeated and shuffled elements. Genome Research 14, 2336–2346.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Redelings BD, Suchard MA (2005) Joint bayesian estimation of alignment and phylogeny. Systematic Biology 54, 401–418.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Reeck GR, de Haën C, Teller DC, Doolittle RF, Fitch WM , et al. (1987) “Homology” in proteins and nucleic acids: a terminology muddle and a way out of it. Cell 50, 667.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Reese JT, Pearson WR (2002) Empirical determination of effective gap penalties for sequence comparison. Bioinformatics 18, 1500–1507.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Reinert K, Stoye J, Will T (2000) An iterative method for faster sum-of-pairs multiple sequence alignment. Bioinformatics 16, 808–814.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Riaz T, Wang Y, Li K-B (2004) Multiple sequence alignment using tabu search. Conferences in Research and Practice in Information Technology 29, 223–232.

Riaz T, Wang Y, Li K-B (2005) Tabu search algorithm for post-processing multiple sequence alignment. Journal of Bioinformatics and Computational Biology 3, 145–156.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Rice KA, Donoghue MJ, Olmstead RG (1997) Analyzing large data sets: rbcL 500 revisited. Systematic Biology 46, 554–563.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Rieppel O (1994) Homology, topology, and typology: the history of modern debates. In ‘Homology: the hierarchical basis of comparative biology’. (Ed. BK Hall) pp. 63–100. (Academic Press: San Diego)

Rieppel O, Kearney M (2002) Similarity. Biological Journal of the Linnean Society 75, 59–82.
| Crossref | GoogleScholarGoogle Scholar |

Rinsma-Melchert I (1993) The expected number of matches in optimal global sequence alignments. New Zealand Journal of Botany 31, 219–230.

Rodriguez R , Vriend G (1997) Professional gambling. In ‘Biomolecular structure and dynamics: recent experimental and theoretical advances’. (Eds G Vergoten, T Theophanides) pp. 79–120. (Kluwer Academic Publishers: Dordrecht)

Rosenberg MS (2005a) Evolutionary distance estimation and fidelity of pair wise sequence alignment. BMC Bioinformatics 6, 102.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Rosenberg MS (2005b) MySSP: non-stationary evolutionary sequence simulation, including indels. Evolutionary Bioinformatics Online 1, 81–83.

Roshan U, Livesay DR (2006) Probalign: multiple sequence alignment using partition function posterior probabilities. Bioinformatics 22, 2715–2721.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Rost B, Valencia A (1996) Pitfalls of protein sequence analysis. Current Opinion in Biotechnology 7, 457–461.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Sadreyev RI, Grishin NV (2004) Estimates of statistical significance for comparison of individual positions in multiple sequence alignments. BMC Bioinformatics 5, 106.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Sammeth M, Heringa J (2006) Global multiple-sequence alignment with repeats. Proteins: Structure, Function, and Bioinformatics 64, 263–274.
| Crossref | GoogleScholarGoogle Scholar |

Sammeth M, Weniger T, Harmsen D, Stoye J (2005) Alignment of tandem repeats with excision, duplication, substitution and indels (EDSI). Lecture Notes in Computer Science 3692, 276–290.

Sanchis A, Michelana JM, Latorre A, Quicke DLJ, Gärdenfors U, Belshaw R (2001) The phylogenetic analysis of variable-length sequence data: elongation factor-1α introns in European populations of the parasitoid wasp genus Pauesia (Hymenoptera: Braconidae: Aphidiinae). Molecular Biology and Evolution 18, 1117–1131.
| PubMed |

Sankoff D , Cedergren RJ (1983) Simultaneous comparison of three or more sequences related by a tree. In ‘Time warps, string edits, and macromolecules: the theory and practice of sequence comparison’. (Eds D Sankoff, JB Kruskal) pp. 253–264. (Addison-Wesley: Reading)

Sankoff D, Morel C, Cedergren RJ (1973) Evolution of 5S RNA and the non-randomness of base replacement. Nature 245, 232–234.
| Crossref | GoogleScholarGoogle Scholar |

Sauder JM, Arthur JW, Dunbrack RL (2000) Large-scale comparison of protein sequence alignment algorithms with structure alignments. Proteins: Structure, Function, and Genetics 40, 6–22.
| Crossref | GoogleScholarGoogle Scholar |

Schmollinger M, Nieselt K, Kaufmann M, Morgenstern B (2004) DIALIGN P: fast pair-wise and multiple sequence alignment using parallel processors. BMC Bioinformatics 5, 128.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Schuler GD, Altschul SF, Lipman DJ (1991) A workbench for multiple alignment construction and analysis. Proteins 9, 180–190.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Schultes EA, Hraber PT, LaBean TH (1999) Estimating the contributions of selection and self-organization in RNA secondary structure. Journal of Molecular Evolution 49, 76–83.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Schultz J, Maisel S, Gerlach D, Müller T, Wolf M (2005) A common core of secondary structure of the internal transcribed spacer 2 (ITS2) throughout the Eukaryota. RNA 11, 361–364.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Schwikowski B, Vingron M (1997a) The deferred path heuristic for the generalized tree alignment problem. Journal of Computational Biology 4, 415–431.
| PubMed |

Schwikowski B, Vingron M (1997b) A clustering approach to generalized tree alignment with application to Alu repeats. Lecture Notes in Computer Science 1278, 115–124.

Schwikowski B, Vingron M (2003) Sequence graphs: boosting iterated dynamic programming using locally suboptimal solutions. Discrete Applied Mathematics 127, 95–117.
| Crossref | GoogleScholarGoogle Scholar |

Shakhnovich BE (2005) Improving the precision of the structure–function relationship by considering phylogenetic context. PLoS Computational Biology 1, e9.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Shull VL, Vogler AP, Baker MD, Maddison DR, Hammond PM (2001) Sequence alignment of 18S ribosomal RNA and the basal relationships of adephagan beetles: evidence for monophyly of aquatic families and the placement of Trachypachidae. Systematic Biology 50, 945–969.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Siddharthan R (2006) Sigma: multiple alignment of weakly-conserved non-coding DNA sequence. BMC Bioinformatics 7, 143.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Siebert S, Backofen R (2005) MARNA: multiple alignment and consensus structure prediction of RNAs based on sequence structure comparisons. Bioinformatics 21, 3352–3359.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Simmons MP (2004) Independence of alignment and tree search. Molecular Phylogenetics and Evolution 31, 874–879.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Simmons MP, Ochoterena H (2000) Gaps as characters in sequence-based phylogenetic analysis. Systematic Biology 49, 369–381.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Simmons MP, Freudenstein JV (2003) The effects of increasing genetic distance on alignment of, and tree construction from, rDNA internal transcribed spacer sequences. Molecular Phylogenetics and Evolution 26, 444–451.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Simmons MP, Carr TG, O’Neill K (2004) Relative character-state space, amount of potential phylogenetic information, and heterogeneity of nucleotide and amino acid characters. Molecular Phylogenetics and Evolution 32, 913–926.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Simossis VA, Heringa J (2004) Integrating protein secondary structure prediction and multiple sequence alignment. Current Protein and Peptide Science 5, 249–266.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Simossis VA, Heringa J (2005) PRALINE: a multiple sequence alignment toolbox that integrates homology-extended and secondary structure information. Nucleic Acids Research 33, W289–W294.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Simossis VA, Kleinjung J, Heringa J (2005) Homology-extended sequence alignment. Nucleic Acids Research 33, 816–824.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Slowinski JB (1998) The number of multiple alignments. Molecular Phylogenetics and Evolution 10, 264–266.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Sluys R (1996) The notion of homology in current comparative biology. Journal of Zoological Systematics and Evolutionary Research 34, 145–152.

Smith NGC, Hurst LD (1998) Sensitivity of patterns of molecular evolution to alterations in methodology: a critique of Hughes and Yeager. Journal of Molecular Evolution 47, 493–500.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

del Sol Mesa A, Pazos F, Valencia A (2003) Automatic methods for predicting functionally important residues. Journal of Molecular Biology 326, 1289–1302.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Sprinzl M, Vassilenko KS (2005) Compilation of tRNA sequences and sequences of tRNA genes. Nucleic Acids Research 33, D139–D140.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Stebbings LA, Mizuguchi K (2004) HOMSTRAD: recent developments of the homologous protein structure alignment database. Nucleic Acids Research 32, D203–D207.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Stocsits RR, Hofaker IL, Fried C, Stadler PF (2005) Multiple sequence alignments of partially coding nucleic acid sequences. BMC Bioinformatics 6, 160.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Stoye J, Evers D, Meyer F (1998) Rose: generating sequence families. Bioinformatics 14, 157–163.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Subramanian AR, Weyer-Menkhoff J, Kaufmann M, Morgenstern B (2005) DIALIGN-T: an improved algorithm for segment-based multiple sequence alignment. BMC Bioinformatics 6, 66.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Sze S-H, Lu Y, Yang Q (2006) A polynomial time solvable formulation of multiple sequence alignment. Journal of Computational Biology 13, 309–319.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Szklarczyk R, Heringa J (2004) Tracking repeats using significance and transitivity. Bioinformatics 20, i311–i317.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Szymanski M, Barciszewska MZ, Erdmann VA, Barciszewski J (2002) 5S ribosomal RNA database. Nucleic Acids Research 30, 176–178.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Taylor WR (1986) Identification of protein sequence homology by consensus template alignment. Journal of Molecular Biology 188, 233–258.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Taylor WR (1987) Multiple sequence alignment by a pairwise algorithm. Computer Applications in the Biosciences 3, 81–87.
| PubMed |

Taylor WR (1996) Multiple protein sequence alignment: algorithms and gap insertion. Methods in Enzymology 266, 343–367.
| PubMed |

Teeling H, Gloeckner FO (2006) RibAlign: a software tool and database for eubacterial phylogeny based on concatenated ribosomal protein subunits. BMC Bioinformatics 7, 66.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Telford MJ, Wise MJ, Gowri-Shankar V (2005) Consideration of RNA secondary structure significantly improves likelihood-based estimates of phylogeny: examples from the Bilateria. Molecular Biology and Evolution 22, 1129–1136.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Terry MD, Whiting MF (2005) Comparison of two alignment techniques within a single complex data set: POY versus Clustal. Cladistics 21, 272–281.
| Crossref | GoogleScholarGoogle Scholar |

Thébault P, Monestié A, Higgins DG (1999) MIAH: automatic alignment of eukaryotic SSU rRNAs. Bioinformatics 15, 341–342.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Thompson JD, Higgins DG, Gibson TJ (1994) CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Research 22, 4673–4680.
| PubMed |

Thompson JD, Gibson TJ, Plewniak F, Jeanmougin F, Higgins DG (1997) The CLUSTAL-X windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools. Nucleic Acids Research 25, 4876–4882.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Thompson JD, Plewniak F, Poch O (1999a) BAliBASE: a benchmark alignment database for the evaluation of multiple alignment programs. Bioinformatics 15, 87–88.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Thompson JD, Plewniak F, Poch O (1999b) A comprehensive comparison of multiple sequence alignment programs. Nucleic Acids Research 27, 2682–2690.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Thompson JD, Plewniak F, Thierry J-C, Poch O (2000) DbClustal: rapid and reliable global multiple alignments of protein sequences detected by database searches. Nucleic Acids Research 28, 2919–2926.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Thompson JD, Plewniak F, Ripp R, Thierry J-C, Poch O (2001) Towards a reliable objective function for multiple sequence alignments. Journal of Molecular Biology 314, 937–951.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Thompson JD, Thierry JC, Poch O (2003) RASCAL: rapid scanning and correction of multiple sequence alignments. Bioinformatics 19, 1155–1161.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Thompson JD, Koehl P, Ripp R, Poch O (2005) BAliBASE 3.0: latest developments of the multiple sequence alignment benchmark. Proteins: Structure, Function, and Bioinformatics 61, 127–136.
| Crossref | GoogleScholarGoogle Scholar |

Thomsen R , Fogel GB , Krink T (2002) A Clustal alignment improver using evolutionary algorithms. In ‘Proccedings of the fourth congress on evolutionary computation (CEC-2002)’. (Eds DB Fogel, X Yao, G Greenwood, H Iba, P Marrow, M Shackleton) pp. 121–126. (IEEE Press: Piscataway)

Thomsen R , Fogel GB , Krink T (2003) Improvement of Clustal-derived sequence alignments with evolutionary algorithms. In ‘Proccedings of the fifth congress on evolutionary computation (CEC-2003)’. (Eds DR Sarker, R Reynolds, H Abbass, KC Tan, B McKay, D Essam, T Gedeon) pp. 1499–1507. (IEEE Press: Piscataway)

Thorne JL, Kishino H (1992) Freeing phylogenies from artifacts of alignment. Molecular Biology and Evolution 9, 1148–1162.
| PubMed |

Thorne JL, Churchill GA (1995) Estimation and reliability of molecular sequence alignments. Biometrics 51, 100–113.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Thorne JL, Kishino H, Felsenstein J (1991) An evolutionary model for maximum likelihood alignment of DNA sequences. Journal of Molecular Evolution 33, 114–124.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Thorne JL, Kishino H, Felsenstein J (1992) Inching toward reality: an improved likelihood model for sequence evolution. Journal of Molecular Evolution 34, 3–16.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Titus TA, Frost DR (1996) Molecular homology assessment and phylogeny in the lizard family Opluridae (Squamata: Iguania). Molecular Phylogenetics and Evolution 6, 49–62.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Touzet H, Perriquet O (2004) CARNAC: folding families of related RNAs. Nucleic Acids Research 32, W142–W145.
| PubMed |

Trystram D, Zola J (2005) Parallel multiple sequence alignment with decentralized cache support. Lecture Notes in Computer Science 3648, 1217–1226.

Tsai YT, Huang YP, Yu CT, Lu CL (2004) MuSiC: a tool for multiple sequence alignment with constraints. Bioinformatics 20, 2309–2311.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Tyson H (1992) Relationships between amino acid sequences determined through optimum alignments, clustering, and specific distance patterns: application to a group of scorpion toxins. Genome 35, 360–371.
| PubMed |

van Valen L (1982) Homology and causes. Journal of Morphology 173, 305–312.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Van Walle I, Lasters I, Wyns L (2004) Align-m—a new algorithm for multiple alignment of highly divergent sequences. Bioinformatics 20, 1428–1435.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Van Walle I, Lasters I, Wyns L (2005) SABmark—a benchmark for sequence alignment that covers the entire known fold space. Bioinformatics 21, 1267–1268.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Varani G , Pardi A (1994) Structure of RNA. In ‘RNA–protein interactions’. (Eds K Nagai, IW Mattaj) pp. 1–24. (IRL Press: Oxford)

Vingron M (1999) Sequence alignment and phylogeny construction. In ‘Mathematical support for molecular biology’. (Eds M Farach-Colton, FS Roberts, M Vingron, M Waterman) pp. 53–64. (American Mathematical Society: Providence)

Vingron M, Waterman MS (1994) Sequence alignments and penalty choice: review of concepts, case studies and implications. Journal of Molecular Biology 235, 1–12.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Vingron M, von Haeseler A (1997) Towards integration of multiple alignment and phylogenetic tree construction. Journal of Computational Biology 4, 23–34.
| PubMed |

Vogt G, Etzold T, Argos P (1995) An assessment of amino acid exchange matrices in aligning protein sequences: the twilight zone revisited. Journal of Molecular Biology 249, 816–831.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Vogt L (2002) Testing and weighting characters. Organisms, Diversity and Evolution 2, 319–333.
| Crossref | GoogleScholarGoogle Scholar |

Wagner GP (1989) The biological homology concept. Annual Review of Ecology and Systematics 20, 51–69.
| Crossref | GoogleScholarGoogle Scholar |

Wallace IM, Blackshields G, Higgins DG (2005a) Multiple sequence alignments. Current Opinion in Structural Biology 15, 261–266.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Wallace IM, O’Sullivan O, Higgins DG (2005b) Evaluation of iterative alignment algorithms for multiple alignment. Bioinformatics 21, 1408–1414.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Wallace IM, O’Sullivan O, Higgins DG, Notredame C (2006) M-Coffee: combining multiple sequence alignment methods with T-Coffee. Nucleic Acids Research 34, 1692–1699.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Wang G, Dunbrack RL (2004) Scoring profile-to-profile sequence alignments. Protein Science 13, 1612–1626.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Wang L, Jiang T (1994) On the complexity of multiple sequence alignment. Journal of Computational Biology 1, 337–348.
| PubMed |

Wang Y, Li K-B (2004) An adaptive and iterative algorithm for refining multiple sequence alignment. Computational Biology and Chemistry 28, 141–148.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Wareham HT (1995) A simplified proof of the NP- and MAX SNP-hardness of multiple sequence tree alignment. Journal of Computational Biology 2, 509–514.
| PubMed |

Waterman MS (1995) ‘Introduction to computational biology: maps, sequences and genomes.’ (Chapman & Hall: London)

Wegner K, Jansen S, Wuchty S, Gauges R, Kummer U (2004) CombAlign: a protein sequence comparison algorithm considering recombinations. In Silico Biology 4, 0021.

Wegnez M (1987) Letter to the editor. Cell 51, 516.
| Crossref | GoogleScholarGoogle Scholar |

Wernersson R, Pedersen AG (2003) RevTrans: multiple alignment of coding DNA from aligned amino acid sequences. Nucleic Acids Research 31, 3537–3539.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Westbrook J, Feng Z, Chen L, Yang H, Berman HM (2003) The Protein Data Bank and structural genomics. Nucleic Acids Research 31, 489–491.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Wexler Y, Yakhini Z, Kashi Y, Geiger D (2005) Finding approximate tandem repeats in genomic sequences. Journal of Computational Biology 12, 928–942.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Wheeler WC (1993) The triangle inequality and character analysis. Molecular Biology and Evolution 10, 707–712.

Wheeler WC (1994) Sources of ambiguity in nucleic acid sequence alignment. In ‘Molecular ecology and evolution: approaches and applications’. (Eds B Schierwater, B Streit, GP Wagner, R DeSalle) pp. 323–352. (Birkhäuser Verlag: Basel)

Wheeler WC (1995) Sequence alignment, parameter sensitivity, and phylogenetic analysis of molecular data. Systematic Biology 44, 321–331.
| Crossref | GoogleScholarGoogle Scholar |

Wheeler W (1996) Optimization alignment: the end of multiple sequence alignment in phylogenetics? Cladistics 12, 1–9.
| Crossref | GoogleScholarGoogle Scholar |

Wheeler W (1998) Alignment characters, dynamic programming and heuristic solutions. In ‘Molecular approaches to ecology and evolution’. (Eds R DeSalle, B Schierwater) pp. 243–251. (Birkhäuser Verlag: Basel)

Wheeler WC (1999) Fixed character states and the optimization of molecular sequence data. Cladistics 15, 379–385.
| Crossref | GoogleScholarGoogle Scholar |

Wheeler W (2001 a) Homology and DNA sequence data. In ‘The character concept in evolutionary biology’. (Ed. GP Wagner) pp. 303–317. (Academic Press: San Diego)

Wheeler W (2001b) Homology and the optimization of DNA sequence data. Cladistics 17, S3–S11.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Wheeler WC (2002) Optimization alignment: down, up, error, and improvements. In ‘Techniques in molecular systematics and evolution’. (Eds R DeSalle, G Giribet, W Wheeler) pp. 55–69. (Birkhäuser Verlag: Basel)

Wheeler WC (2003a) Iterative pass optimization of sequence data. Cladistics 19, 254–260.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Wheeler WC (2003b) Implied alignment: a synapomorphy-based multiple-sequence alignment method and its use in cladogram search. Cladistics 19, 261–268.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Wheeler WC (2003c) Search-based optimization. Cladistics 19, 348–355.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Wheeler WC (2005) Alignment, dynamic homology, and optimization. In ‘Parsimony, phylogeny, and genomics’. (Ed. VA Albert) pp. 71–80. (Oxford University Press: Oxford)

Wheeler WC (2006) Dynamic homology and the likelihood criterion. Cladistics 22, 157–170.
| Crossref | GoogleScholarGoogle Scholar |

Wheeler WC, Gladstein DS (1994) MALIGN: a multiple sequence alignment program. Journal of Heredity 85, 417–418.

Whelan S, de Bakker PIW, Quevillon E, Rodriguez N, Goldman N (2006) PANDIT: an evolution-centric database of protein and associated nucleotide domains with inferred trees. Nucleic Acids Research 34, D327–D331.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Whiting AS, Sites JW, Pellegrino KCM, Rodrigues MT (2006) Comparing alignment methods for inferring the history of the new world lizard genus Mabuya (Squamata: Scincidae). Molecular Phylogenetics and Evolution 38, 719–730.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Williams DM (1993) A note on molecular homology: multiple patterns from single datasets. Cladistics 9, 233–245.
| Crossref | GoogleScholarGoogle Scholar |

Winnepenninckx B, Backeljau T (1996) 18S rRNA alignments derived from different secondary structure models can produce alternative phylogenies. Journal of Zoological Systematics and Evolutionary Research 34, 135–143.

Winter WP, Walsh KA, Neurath H (1968) Homology as applied to proteins. Science 162, 1433.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Wrabl JO, Grishin NV (2004) Gaps in structurally similar proteins: towards improvement of multiple sequence alignment. Proteins: Structure, Function, and Bioinformatics 54, 71–87.
| Crossref | GoogleScholarGoogle Scholar |

Wuyts J, Perrière G, Van de Peer Y (2004) The European ribosomal RNA database. Nucleic Acids Research 32, D101–D103.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Xiao L, Sulaiman IM, Ryan UM, Zhou L, Atwill ER, Tischler ML, Zhang X, Fayer R, Lal AA (2002) Host adaptation and host-parasite co-evolution in Cryptosporidium: implications for taxonomy and public health. International Journal for Parasitology 32, 1773–1785.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Yamada S, Gotoh O, Yamana H (2004) Extension of Prrn: implementation of a doubly nested randomized iterative refinement strategy under a piecewise linear gap cost. Genome Informatics 15, P082.

Yu H , Deng M (2005) ClustalY: speed up the guide tree building for ClustalW. In ‘Proceedings of the eighth international conference on high-performance computing in Asia-Pacific region (HPCASIA’05)’. pp. 608–610. (IEEE Press: Piscataway)

Yuan J, Amend A, Borkowski J, DeMarco R, Bailey W, Liu Y, Xie G, Blevins R (1999) MULTICLUSTAL: a systematic method for surveying ClustalW alignment parameters. Bioinformatics 15, 862–863.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Zhang X , Kahveci T (2006) A new approach for alignment of multiple proteins. In ‘Proceedings of the 11th Pacific Symposium on Biocomputing 2006, Hawaii’. pp. 339–350.

Zhou H, Zhou Y (2005) SPEM: improving multiple sequence alignment with sequence profiles and predicted secondary structure. Bioinformatics 21, 3615–3621.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Zhu J, Liu JS, Lawrence CE (1998) Bayesian adaptive sequence alignment algorithms. Bioinformatics 14, 25–39.
| Crossref | GoogleScholarGoogle Scholar | PubMed |

Zwieb C (1997) The uRNA database. Nucleic Acids Research 25, 102–103.
| Crossref | GoogleScholarGoogle Scholar | PubMed |