Академический Документы
Профессиональный Документы
Культура Документы
55(5)715-728,2006
Copyright Society of Systematic Biologists
ISSN: 1063-5157 print / 1076-836X online
DO1:10.1080/10635150600969864
Abstract.DNA barcoding and DNA taxonomy have recently been proposed as solutions to the crisis of taxonomy and
received significant attention from scientific journals, grant agencies, natural history museums, and mainstream media.
Here, we test two key claims of molecular taxonomy using 1333 mitochondrial COI sequences for 449 species of Diptera. We
investigate whether sequences can be used for species identification ("DNA barcoding") and find a relatively low success
rate (<70%) based on tree-based and newly proposed species identification criteria. Misidentifications are due to wide
overlap between intra- and interspecific genetic variability, which causes 6.5% of all query sequences to have allospecific
or a mixture of allo- and conspecific (3.6%) best-matching barcodes. Even when two COI sequences are identical, there is a
6% chance that they belong to different species. We also find that 21% of all species lack unique barcodes when consensus
sequences of all conspecific sequences are used. Lastly, we test whether DNA sequences yield an unambiguous species-level
taxonomy when sequence profiles are assembled based on pairwise distance thresholds. We find many sequence triplets for
which two of the three pairwise distances remain below the threshold, whereas the third exceeds it; i.e., it is impossible to
consistently delimit species based on pairwise distances. Furthermore, for species profiles based on a 3% threshold, only 47%
of all profiles are consistent with currently accepted species limits, 20% contain more than one species, and 33% only some
sequences from one species; i.e., adopting such a DNA taxonomy would require the redescription of a large proportion of
the known species, thus worsening the taxonomic impediment. We conclude with an outlook on the prospects of obtaining
complete barcode databases and the future use of DNA sequences in a modern integrative taxonomy. [Cytochrome oxidase
I; genetic distance; pairwise distance; species identification.]
715
Downloaded from http://sysbio.oxfordjournals.org at BIBLIOTECA CENTRAL UNIV ESTADUAL CAMPINAS on March 11, 2010
Department of Biological Sciences, National University of Singapore, 14 Science Drive 4, Singapore 117543, Singapore;
E-mail: dbsmr@nus.edu. sg (R.M.)
716
SYSTEMATIC BIOLOGY
VOL. 55
Downloaded from http://sysbio.oxfordjournals.org at BIBLIOTECA CENTRAL UNIV ESTADUAL CAMPINAS on March 11, 2010
2006
717
'Best match"
Ambiguous/unidentified
Query without conspecific
sequence in data set
("singleton")
(a) Query without conspecific
sequence in data set
OR
(b) Query sister group to
conspecifics
OR
(c) Query in polytomy with at
least one conspecific and one
allospecific sequence
Sequence(s) with smallest
distance to query a mixture of
allo- and conspecific
sequences
(a) Sequence(s) with smallest
distance to query a mixture of
allo- and conspecific
sequences and within the 95th
percentile of all infraspecific
distances
(b) No match within 95 percentile
of all infraspecific distances
As in "best match", but with at
least one allospecific sequence
being more similar to query
than the least similar
conspecific sequence
Misidentification
Query with conspecific sequences
in multiple clusters
(a) Query at least one node into
an allospecific clade
OR
(b) Query in polytomy with only
allospecific sequences
Downloaded from http://sysbio.oxfordjournals.org at BIBLIOTECA CENTRAL UNIV ESTADUAL CAMPINAS on March 11, 2010
Successful identification
Query clusters with all
conspecific sequences
718
VOL. 55
SYSTEMATIC BIOLOGY
Downloaded from http://sysbio.oxfordjournals.org at BIBLIOTECA CENTRAL UNIV ESTADUAL CAMPINAS on March 11, 2010
2006
719
We tested the viability of threshold values for distinguishing intra- from interspecific variability for 2%, 3%,
4%, 5%, and 6% thresholds. For this purpose, TaxonDNA
finds for each query a set of barcodes for which each sequence in the set has at least one other sequence within
the threshold distance. For all clusters, we determined
whether the largest observed distance exceeded the
threshold distance and whether they correspond to currently accepted species (= contains all sequences for
one species). If not, we determined whether it contained
sequences for several species (error 1) and/or not all
sequences for the same species (error 2).
RESULTS
Downloaded from http://sysbio.oxfordjournals.org at BIBLIOTECA CENTRAL UNIV ESTADUAL CAMPINAS on March 11, 2010
quences were from the same species, the identification was considered a success, whereas mismatched
names were counted as failures. Several equally good
best matches from different species were considered
ambiguous.
2. "Best close match." We used TaxonDNA to plot the
relative frequency of intraspecific distances in order
to determine the threshold value below 95% of all intraspecific distances are found. All queries without
barcode match below the threshold value remained
unidentified. For the remaining queries, their identity was compared to the species identity of their closest barcode. If the name was identical, the query was
considered an identification success. The identification was considered a failure when the names were
mismatched and considered ambiguous when several
equally good best matches were found that belonged
to a minimum of two species.
3. "All species barcodes." We assembled for each query a
list of all barcodes sorted by similarity to the query using the same threshold as for best close match. Queries
were considered a success when they were followed
by all conspecific barcodes as long as there were at
least two barcodes for the species. Queries were considered ambiguous when they were followed by only
one conspecific barcode or only some of the conspecific sequences. Queries followed by all conspecific sequences from the "wrong" species were considered
misidentified.
VOL. 55
SYSTEMATIC BIOLOGY
720
TABLE 2. Intra- and interspecific sequence variability and identification success based on "All species barcodes."
Sequence
overlap
Total overlap
(% observations)
0-15.5%
0-15.5%
0-15.5%
0-15.5%
987
975
664
406
(99)
(99)
(100)
(100)
is very inconsistent across genera, with some essentially lacking overlap whereas others have little separation. Species barcodes sensu stricto constructed as the
union-based consensus of all conspecific barcodes revealed that 34 species are involved in 22 cases of two
species sharing identical barcodes (Appendix 2; available online at http://systematicbiology.org). Twentyfive of these species belong to the 117 species in the
data set with multiple sequences that have a minimum overlap of 300 bp; i.e., when multiple individuals are sequenced 21% of the species lack unique
barcodes.
90% Overlap
(% observations)
Queries with
allospecific
identical match
"All species
barcodes"
success
2.31-3.34 (5.2)
2.29-3.34 (5.6)
1.81-3.25 (9.3)
0.84-5.14 (33.1)
6% (35)
6% (35)
7.9% (28)
16.9% (28)
40.6%
41.7%
34.3%
33.1%
TABLE 3. Identification success based on "best match" and "best close match."
"Best match"
Sequence overlap
Success
Ambiguous
Misidentification
300
400
500
600
67.7%
68.1%
65.0%
55.9%
5.3%
5.4%
5.9%
8.5%
27.1%
26.5%
29.0%
35.7%
Success
66.9%
67.5%
64.1%
55.1%
Ambiguous
Misidentification
No match
3.6%
3.7%
4.2%
6.3%
6.5%
6.0%
5.7%
11.2%
23.0%
22.8%
26.0%
27.4%
Downloaded from http://sysbio.oxfordjournals.org at BIBLIOTECA CENTRAL UNIV ESTADUAL CAMPINAS on March 11, 2010
300
400
500
600
Number of
conspecific
sequences
2006
721
interspecific
intraspecific
2%
4%
6%
8%
16% 18%
Pairwise Distances
FIGURE 2. Overlap between intra- and interspecific genetic variability for congeneric sequences.
Many biologists have argued that the future of descriptive taxonomy will depend on successfully embracing new techniques. Many ideas have been proposed
(Godfray, 2002) and much progress has been achieved
by, for example, digitizing taxonomic information, using new microscopic techniques, and developing interactive identification tools (De Ley et al., 2005; Dunn, 2003;
Klaus et al., 2003; Marshall, 2003; Prendini, 2005; Sperling, 2003; Thacker, 2003; Wheeler, 2004; Will et al., 2005).
It is only natural that the discussion has now turned to the
contribution of DNA sequences. Uncontroversial is their
great promise for associating morphologically disparate
life history stages, associating males and females, solving species limits for polymorphic and cryptic species,
and identifying artefacts made of materials derived from
endangered species (Birstein et al., 1998; Hebert et al.,
2003a; Marshall, 2003; Paquin and Hedin, 2004; Palumbi
and Cipriano, 1998; Tautz et al., 2003). However, controversial is the extent to which DNA sequences and biologists only versed in molecular techniques should and
can replace the morphological tools and retiring traditional taxonomists. Most systematists would argue that
a more straightforward solution to employing molecular
techniques is hiring new taxonomists (Lee, 2004; Minelli,
2003; Will and Rubinoff, 2004).
Downloaded from http://sysbio.oxfordjournals.org at BIBLIOTECA CENTRAL UNIV ESTADUAL CAMPINAS on March 11, 2010
0%
Parsimony
Bayes
NJ
Aedes
Anastrepha
Anopheles
Asphondylia
Bactrocera
Calliphora
Chiastocheta
Chironomus
Chrysomya
Culicoides
Liriomyza
Lucilia
Paragus
Phytomyza
Sapromyza
Scathophaga
Subgenus
Drosophila
Sophophora
Total
Genus
31/3
16/5
4/2
10/4
34/3
11/4
93/33
111/47
915/243
430 (47.0%)/55(22.6%)
368 (40.2%)/53 (21.8%)
410 (44.8%)/55 (22.6%)
22/6
2/1
153/5
11/2
27/2
15/2
22/4
2/1
10/3
18/4
18/3
16/2
0/0
9/4
14/3
24/5
Bayes
22/6
2/1
155/6
11/2
27/2
15/2
22/4
2/1
10/3
18/4
18/3
5/1
0/0
9/4
14/3
24/5
Par
22/6
2/1
155/6
42/3
27/2
15/2
22/4
2/1
10/3
18/4
18/3
5/1
0/0
9/4
14/3
24/5
NJ
32/14
45/15
188/11
42/3
27/2
15/2
50/9
43/34
16/9
86/6
19/4
50/7
12/7
19/14
32/11
35/15
No. of
seq./spp.
61/3
64/6
3/1
35/6
29/1
31/1
0/0
0/0
28/5
10/2
0/0
67/1
0/0
44/5
8/3
0/0
14/4
2/1
Par
34/2
61/6
3/1
35/6
31/2
31/1
0/0
0/0
28/5
10/2
0/0
67/1
0/0
34/4
8/3
0/0
14/4
2/1
Bayes
31/2
63/6
3/1
35/6
29/1
0/0
0/0
0/0
28/5
10/2
0/0
67/1
0/0
44/5
8/3
0/0
14/4
2/1
NJ
Failed seq./spp.
28
37
7
8
4
0
0
0
0
31
6
1
1
1
4
10
4
9
Singletons
(= ambiguous)
55
28
13
10
179
40
27
13
34
3
5
81
14
35
0
2
13
23
NJ
35
31
13
13
:177
38
24
15
27
2
6
82
16
39
0
2
14
24
Bayes
575 (62.8%)
550 (60.1%)
558 (61.0%)
32
28
14
11
178
39
26
14
28
3
5
81
15
39
0
2
11
24
Par
Successful sequences
38
80
19
33
8
2
0
2
16
39
11
4
5
14
12
17
17
12
NJ
329 (36.0%)
354 (38.7%)
346 (37.8%)
60
79
18
32
9
3
1
1
22
39
11
4
4
10
12
17
21
11
Par
57
76
19
31
10
4
3
0
23
41
10
3
3
10
12
17
17
10
Bayes
Ambiguous identification
Success rate for tree-based species identification. Singleton = sequence without conspecific representation in data set.
Successful seq./spp.
TABLE 4.
0
1
1
0
0
0
0
0
0
1
0
1
0
0
1
1
1
4
0
2
1
0
0
0
0
1
0
1
0
1
0
0
0
0
1
4
0
2
1
0
0
0
0
1
0
0
1
0
0
2
0
0
3
11 (1.2%)
11 (1.2%)
11 (1.2%)
Bayes
Par
NJ
Misident. sequences
Downloaded from http://sysbio.oxfordjournals.org at BIBLIOTECA CENTRAL UNIV ESTADUAL CAMPINAS on March 11, 2010
I-H
2006
723
TABLE 5. DNA taxonomy: clusters based on fixed pairwise-distance thresholds for all sequences from species with intraspecific sequences
(>300 bp overlap: 987 sequences, 117 species).
Threshold
pairwise distance
125
106
95
82
75
Profiles with
threshold violations
15
14
13
14
9
(12%)
(13%)
(14%)
(17%)
(12%)
Maximum
pairwise distance
4.8%
4.8%
6.1%
8.5%
12.3%
55 (44%)/105
50 (47%)/85
47 (49%)/73
37 (45%)/59
37 (49%)/54
Max. no.
of species per profile
Overall, we find that for our data set the identification success rates for DNA barcoding are unacceptably low and never exceed 70% (Fig. 3). Furthermore,
this low success rate does not improve even if sequences with greater sequence overlap are used (Tables
2 and 3). However, larger data sets will be needed to
rigorously test this prediction. In Diptera identification,
success actually declines with increasing sequence overlap, which we believe to be due to the relatively small
number of sequences with more than 500 base pairs of
overlap. Whether these success rates are high enough to
justify the considerable expense for barcoding all species
and obtaining sequences for unidentified specimens remains a judgement call for the user. However, we doubt
that DNA barcoding has a bright future unless the identification success rates can be increased considerably. Incidentally, our best identification rate of 68% is in line
with the rates reported in Will and Rubinoff (2004) and
decades of research on intraspecific sequence variability in a wide range of taxa (summarized in Funk and
Omland, 2003). Funk and Omland (2003) found that 23%
of the 2319 assayed species did not form monophyletic
groups and that in two-thirds of these cases, the "polyphyly" was supported by bootstrap values above 70%.
For arthropods, the rate was even higher (26.5% of 702
100%
ffl no match
E3 misidentified
ambiguous
success
300 bp
400 bp
500 bp
Basepair Overlap
600 bp
FIGURE 3. Sequence overlap and identification success based on best match (1st bar), best close match (2nd bar), and all species barcodes
(3rd bar).
Downloaded from http://sysbio.oxfordjournals.org at BIBLIOTECA CENTRAL UNIV ESTADUAL CAMPINAS on March 11, 2010
2%
3%
4%
5%
6%
No. of
DNA profiles
724
SYSTEMATIC BIOLOGY
2005). Given the problems with tie-trees and the problematic assumption of species monophyly, we do not believe that trees are a promising tool (see also DeSalle
et al., 2005; Pozhitkov and Tautz, 2002; Steinke et al.,
2005). This is particularly so because each tree has many
nested clades and it is thus impossible to decide based
on tree topology alone which node on the tree delimits a
population, species, or supraspecific taxon (see our Homo
sapiens-great ape example in the introduction). Instead,
some kind of similarity criterion has to be used in conjunction with the tree before one can decide whether a
particular query is conspecific or allospecific to its closest match. Given that a similarity metric will also be
needed for tree-based identification, we would argue
that one may as well abandon the tree-based approaches
and instead directly use the similarity metric as has been
implemented in several recent algorithms (e.g., Blaxter
et al., 2005; Steinke et al., 2005). We furthermore find
that for our data set, similarity-based techniques outperform tree-based identification. For example, the success
rate of tree-based identification is below 50% (Table 4),
whereas best close match yields a much higher success
rate (67%) and has a relatively low incidence of misidentification (6%; Table 3). Therefore, if we had to use DNA
sequences for species identification, we would prefer this
technique, although a large proportion of the queries remain unidentified because they lack close matches in the
barcode database. We nevertheless consider this result
preferable over using an identification technique like
best match that yields marginally higher success rates
(68%) but also a large number of misidentifications (27%).
All species barcodes is the most conservative identification technique, but given its low success rate of 39% it
will probably only be justifiable for forensic purposes.
Is a Complete Barcode Database Obtainable?
Best match would perform much better if it was applied to a data set from which single-sequence species
have been removed. However, in a real-life situation
it is impossible to know whether a new query is coming from a "new" species or from a species that is already represented in the data set (Scotland et al., 2003;
Seberg, 2004; Will and Rubinoff, 2004). Of course, the
other solution would be using a database that has barcodes for all species on the planet. Creating such a
database may alleviate some of the problems with DNA
barcoding and is thus the goal of the Barcoding Life
Consortium. However, we agree with many systematists that this goal is completely unrealistic (Marshall,
2003). The existing strategies for creating a complete
barcode database mostly rely on tissues from existing
cryo-collections and museum specimens. However, the
former will only yield barcodes for relatively few, mostly
common, and/or widespread species. The proposed use
of tissues from identified specimens in natural history
museums is equally inadequate and problematic.
First, museums can at best provide identified specimens for described speciesfor poorly known groups
Downloaded from http://sysbio.oxfordjournals.org at BIBLIOTECA CENTRAL UNIV ESTADUAL CAMPINAS on March 11, 2010
VOL. 55
2006
725
DNA Taxonomy
49%
47%
1 45%
U
3 43%
300 bp overlap
39%
37%
400 bp overlap
500 bp overlap
600 bp overlap
35%
2%
3%
4%
5%
6%
7%
Downloaded from http://sysbio.oxfordjournals.org at BIBLIOTECA CENTRAL UNIV ESTADUAL CAMPINAS on March 11, 2010
726
VOL. 55
SYSTEMATIC BIOLOGY
CONCLUSIONS
REFERENCES
Armstrong, K. F., and S. L. Ball. 2005. DNA barcodes for biosecurity:
Invasive species identification. Philos. Trans. R. Soc. Lond. B Biol.
Sci. 360:1813-1823.
Avise, J. C. 2000. Phylogeography: The history and formation of species.
Harvard University Press, Cambridge, Massachusetts.
Avise, J. C, and D. Walker. 1999. Species realities and numbers in sexual vertebrates: Perspectives from an asexually transmitted genome.
Proc. Natl. Art. Sci. USA 96:992-995.
Backeljau, T., L. De Bruyn, H. De Wolf, K. Jordaens, S. Van Dongen, and
B. Winnepennincks. 1996. Multiple UPGMA and neighbor-joining
trees and the performance of some computer packages. Mol. Biol.
Evol. 13:309-313.
Downloaded from http://sysbio.oxfordjournals.org at BIBLIOTECA CENTRAL UNIV ESTADUAL CAMPINAS on March 11, 2010
2006
727
Downloaded from http://sysbio.oxfordjournals.org at BIBLIOTECA CENTRAL UNIV ESTADUAL CAMPINAS on March 11, 2010
Baker, C. S., M. L. Dalebout, S. Lavery, and H. A. Ross. 2003. www.DNA- Hebert, P. D. N., A. Cywinska, S. L. Ball, and J. R. deWaard. 2003a.
surveillance: Applied molecular taxonomy for species conservation
Biological identifications through DNA barcodes. Proc. Roy. Soc. Ser.
and discovery. TREE 18:271-272.
B 270:313-321.
Barrett, R. D. H., and P. D. N. Hebert. 2005. Identifying spiders through Hebert, P. D. N., and T. R. Gregory. 2005. The promise of DNA barcoding
DNA barcodes. Can. J. Zool. 83:481^191.
for taxonomy. Syst. Biol. 54:852-859.
Birstein, V. J., P. Doukakis, B. Sorkin, and R. Desalle. 1998. Population Hebert, P. D. N., S. Ratnasingham, and J. R. deWaard. 2003b. Barcodaggregation analysis of three caviar-producing species of sturgeons
ing animal life: Cytochrome c oxidase subunit 1 divergences among
closely related species. Proc. Roy. Soc. Ser. B 270:S96-S99.
and implications for the species identification of black caviar. Cons.
Biol. 12:766-775.
Hebert, P. D. N., S. Ratnasingham, and R. Dooh. 2004a. Barcodes of
Life, http://www.barcodinglife.com/
Blaxter, M. 2003. Counting angels with DNA. Nature 421:122-124.
Blaxter, M., and R. Floyd. 2003. Molecular taxonomies for biodiversity Hebert, P. D. N., M. Stoeckle, T. S. Zemlak, and C. M. Francis. 2004b.
surveys: Already a reality. TREE 18:268-269.
Identification of birds through DNA barcodes. PLoS Biol. 2:1657Blaxter, M., J. Mann, T. Chapman, F. Thomas, C. Whitton, R. Floyd, and
1663.
E. Abebe. 2005. Defining operational taxonomic units using DNA Hogg, I. D., and P. D. N. Hebert. 2004. Biological identification of springbarcode data. Philos. Trans. R. Soc. Lond. B Biol. Sci. 360:1935-1943.
tails (Hexapoda: Collembola) from the Canadian Arctic, using mitoBond, J. E. 2004. Systematics of the Californian euctenizine spider genus
chondrial DNA barcodes. Can. J. Zool. 82:749-754.
Apomastus (Araneae: Mygalomorphae: Cyrtaucheniidae): The rela- Janzen, D. 2004. Now is the time. Philos. Trans. R. Soc. Lond. B Biol.
tionship between molecular and morphological taxonomy. Invert.
Sci. 359:731-732.
Syst. 18:361-376.
Johns, G. C, and J. C. Avise. 1998. A comparative summary of genetic
Bond, J. E., and P. Sierwald. 2003. Molecular taxonomy of the Anadenobo- distances in the vertebrates from the mitochondrial cytochrome b
lus excisus (Diplopoda: Spirobolida: Rhinocricidae) species-group on
gene. Mol. Biol. Evol. 15:1481-1490.
the Caribbean island of Jamaica. Invert. Syst. 17:515-528.
Klaus, A. V., V. L. Kulasekera, and V. Schawaroch. 2003. ThreeChu, K. H., H. Y. Ho, C. P. Li, and T. Y. Chan. 2003. Molecular phyloge- dimensional visualization of insect morphology using confocal laser
netics of the mitten crab species in Eriocheir, sensu lato (Brachyura:
scanning microscopy. J. Microsc. 212:107-121.
Grapsidae). J. Crust. Zool. 23:738-746.
Lee, M. S. Y. 2004. The molecularisation of taxonomy. Inv. Syst. 18:1-6.
Chu, K. H., J. Tong, and T. Y. Chan. 1999. Mitochondrial cytochrome Lipscomb, D., N. Platnick, and Q. Wheeler. 2003. The intellectual
content of taxonomy: A comment on DNA taxonomy. TREE 18:65-66.
oxidase I sequence divergence in some Chinese species of Charybdis (Crustacea: Decapoda: Portunidae). Biochem. Syst. Ecol. 27:461- Marshall, E. 2005. Taxonomy: Will DNA Bar Codes Breathe Life Into
468.
Classification? Science 307:1037.
Crisp, M. D., and G. T. Chandler. 1996. Paraphyletic species. Telopea Marshall, S. 2003. The real costs of insect identification [opinion page].
6:813-844.
Newsl. Biol. Surv. Can. (Terrestrial Arthropods) 22:15-18.
De Ley, P., I. T. De Ley, K. Morris, E. Abebe, M. Mundo-Ocampo, Meier, R., and T. Dikow. 2004. Significance of specimen databases from
M. Yoder, J. Heras, D. Waumann, A. Rocha-Olivares, A. H. J. Burr,
taxonomic revisions for estimating and mapping the global species
J. G. Baldwin, and W. K. Thomas. 2005. An integrated approach to
diversity of invertebrates and repatriating reliable and complete
fast and informative morphological vouchering of nematodes for apspecimen data. Cons. Biol. 18:478-488.
plications in molecular barcoding. Philos. Trans. R. Soc. Lond. B Biol. Meyer, C. P., and G. Paulay. 2005. DNA barcoding: Error rates based
Sci. 360:1945-1958.
on comprehensive sampling. PLoS Biol. 3:2229-2238.
DeSalle, R., M. G. Egan, and M. Siddall. 2005. The unholy trinity: Tax- Moritz, C, and C. Cicero. 2004. DNA barcoding: Promise and pitfalls.
onomy, species delimitation and DNA barcoding. Philos. Trans. R.
PLoS Biol. 2:1529-1531.
Soc. Lond. B Biol. Sci. 360:1905-1916.
Nilsson, R. H., K. H. Larsson, and B. M. Ursing. 2004. GalaxieCGI
Dunn, C. P. 2003. Keeping taxonomy based in morphology. TREE
scripts for sequence identification through automated phylogenetic
18:270-271.
analysis. Bioinformatics 20:1447-1452.
Ebach, M. C, and C. Holdrege. 2005. DNA barcoding is no substitute Nylander, J. A. A. 2004. MrModelTest, version 2. Evolutionary Biology
for taxonomy. Nature 434:697.
Centre, Uppsala University.
Ferguson, J. W. H. 2002. On the use of genetic divergence for identifying Page, R. D. M., and E. C. Holmes. 1998. Molecular evolution: A phylospecies. Biol. J. Linn. Soc. 75:509-516.
genetic approach. Blackwell Science, Oxford, UK.
Floyd, R., E. Abebe, A. Papert, and M. Blaxter. 2002. Molecular barcodes Palumbi, S. R., and F. Cipriano. 1998. Species identification using gefor soil nematode identification. Mol. Ecol. 11:839-850.
netic tools: The value of nuclear and mitochondrial gene sequences
Frost, D. R., H. M. Crafts, L. A. Fitzgerald, and T. A. Titus. 1998. Gein whale conservation. J. Hered. 89:459-464.
ographic variation, species recognition, and molecular evolution of Panchen, A. L. 1992. Classification, evolution, and the nature of biology.
cytochrome oxidase I in the Tropidurus spinulosus complex (Iguania: Cambridge University Press, Cambridge, UK.
Tropiduridae). Copeia 1998:839-851.
Paquin, P., and M. Hedin, M. 2004. The power and perils of "molecFunk, D. J., and K. E. Omland. 2003. Species-level paraphyly and polyular taxonomy": A case study of eyeless and endangered Cicurina
phyly: Frequency, causes, and consequences, with insights from an(Araneae: Dictynidae) from Texas caves. Mol. Ecol. 13: 3239-3255.
Pons, J., and A. P. Vogler. 2005. Complex pattern of coalescence and fast
imal mitochondrial DNA. Ann. Rev. Ecol. Syst. 34:397-423.
Gibbs, M. J., J. S. Armstrong, and A. J. Gibbs. 2005. Individual sequences evolution of a mitochondrial rRNA pseudogene in a recent radiation
in large sets of gene sequences may be distinguished efficiently by
of tiger beetles. Mol. Biol. Evol. 22:991-1000.
combinations of shared sub-sequences. Bmc Bioinformatics 6.
Pennisi, E. 2003. Modernizing the tree of life. Science 300:1692-1697.
Godfray, H. C. J. 2002. Challenges for taxonomy. Nature 417:17-19.
Pestano, J., R. P. Brown, N. M. Suarez, and M. Baez. 2003. DiversificaGodfray, H. C. J., and S. Rnapp. 2004. Taxonomy for the twentytion of sympatric Sapromyza (Diptera: Lauxaniidae) from Madeira:
first centuryIntroduction. Philos. Trans. R. Soc. Lond. B Biol. Sci.
Six morphological species but only four mtDNA lineages. Mol. Phy359:559-569.
logenet. Evol. 27:422-428.
Goloboff, P., J. S. Farris, and K. Nixon. 2003. T.N.T.: Tree analysis using Ponder, W., and D. Lunney. 1999. The other 99%The conservation
new technology. Program and documentation, available from the
and biodiversity of invertebrates. Trans. Roy. Zool. Soc. NSW.
authors, and at www.zmuc.dk/public/phylogeny.
Pozhitkov, A. E., and D. Tautz. 2002. An algorithm and program for
Hajibabaei, M., J. R. DeWaard, N. V. Ivanova, S. Ratnasingham, R. T.
finding sequence specific oligonucleotide probes for species identification. BMC Bioinformatics 3.
Dooh, S. L. Kirk, P. M. Mackie, and P. D. N. Hebert. 2005. Critical
factors for assembling a high volume of DNA barcodes. Philos. Trans. Prendini, L. 2005. Comment on "Identifying spiders through DNA barR. Soc. Lond. B Biol. Sci. 360:1959-1967.
codes." Can. J. Zool. 83:498-504.
Quicke, D. L. J. 2004. The world of DNA barcoding and morphology
Harris, D. J. 2003. Can you bank on GenBank? TREE 18:317-319.
Collision or synergism and what of the future? Systematist 23: 8-12.
Hebert, P. D. N., and R. D. H. Barrett. 2005. Reply to the comment by
L. Prendini on "Identifying spiders through DNA barcodes." Can. J. Ronquist, F., and J. P. Huelsenbeck. 2003. MrBayes 3: Bayesian phyloZool. 83:481-491.
genetic inference under mixed models. Bioinformatics 19:1572-1574.
728
SYSTEMATIC BIOLOGY
Downloaded from http://sysbio.oxfordjournals.org at BIBLIOTECA CENTRAL UNIV ESTADUAL CAMPINAS on March 11, 2010
VOL. 55