Академический Документы
Профессиональный Документы
Культура Документы
www.elsevier.com/locate/ympev
Abstract
Phylogenetic relationships among the nine spiral-horn antelope species of the African bovid tribe Tragelaphini are controversial.
In particular, mitochondrial DNA sequencing studies are not congruent with previous morphological investigations. To test the utility of nuclear DNA intron markers at lower taxonomic levels and to provide additional data pertinent to tragelaphid evolution, we
sequenced four nuclear DNA segments (MGF, PRKCI, SPTBN, and THY) and combined these data with mitochondrial DNA
sequences from three genes (cytochrome b, 12S rRNA, and 16S rRNA). Our molecular supermatrix comprised 4682 characters which
were analyzed independently and in combination. Parsimony and model based phylogenetic analyses of the combined nuclear DNA
data are congruent with those derived from the analysis of mitochondrial gene sequences. The corroboration between nuclear and
mtDNA gene trees reject the possibility that genetic processes such as lineage sorting, gene duplication/deletion and hybrid speciation account for the conXict evident in the previously published phylogenies. It suggests rather that the morphological characters
used to delimit the Tragelaphid species are subject to convergent evolution. Divergence times among species, calculated using a
relaxed Bayesian molecular clock, are consistent with hypotheses proposing that climatic oscillations and their impact on habitats
were the major forces driving speciation in the tribe Tragelaphini.
2005 Elsevier Inc. All rights reserved.
Keywords: Bovidae; Tragelaphini; Systematics; Phylogeny; Molecular clock
1. Introduction
Congruence among multiple data sets is arguably the
most reliable indicator of phylogenetic accuracy
(Cracraft and Helm-Bychowski, 1991; Miyamoto and
Cracraft, 1991; Miyamoto and Fitch, 1995; SwoVord,
1991). Current phylogenetic analyses thus often utilize a
multigene approach and analyses frequently include
morphological and/or palaeontological data (see
Flores-Villela et al., 2000; Gatesy et al., 2003; Nylander
et al., 2004 among others). One group providing a case in
point is the Cetartiodactyla. Initial investigations were
based mainly on morphological and palaeontological
1055-7903/$ - see front matter 2005 Elsevier Inc. All rights reserved.
doi:10.1016/j.ympev.2005.01.018
625
acters becoming Wxed in only a portion of the descendents of a polymorphic ancestor (Hillis, 1999).
The aims of this study were Wrst to test the utility of
nuclear DNA markers in a group that experienced a
fairly rapid radiation during the last 15 My (Hassanin
and Douzery, 2003; Matthee and Robinson, 1999; Vrba,
1985). Second, resolution at the nuclear DNA level could
clarify the apparent conXict that exists between the
mtDNA and morphological topologies. Additionally it
was anticipated that a more comprehensive molecular
data set would allow for greater accuracy in determining
of the pattern and timing of episodes of speciation
within the group, as well as providing further insights on
the factors thought to be driving speciation in these African antelope. This is particularly pertinent since suggestions that species inhabiting moist forest habitats (T.
euryceros, T. spekei, and T. scriptus) are derived, and
those adapted to more arid environments (T. imberbis,
T. angasi, T. strepsiceros, T. oryx, and T. derbianus) are
more basal, rest on a single, maternally inherited locus
(Matthee and Robinson, 1999).
626
Table 1
Taxonomic sampling used in the study
Species name (Vernacular name)
DNA region
Mitochondrial DNA
Nuclear DNA
12S rRNA
16S rRNA
Cyt b
MGF
PRKCI
SPTBN
THY
AY667206*
AF091710d
AY667207*
AY667208*
AF091696d
AY667209*
AF091698d
AY667210*
AY667211*
AY667212*
AF091691d
AY667213*
AF091692d
AY667214*
AF091693d
U86963c
NC001567a
AF091688d
U86964c
AY667193*
AY667194*
U87060c
AY667195*
AY667196*
M86493b
AY667197*
AY667198*
AY667199*
AY667200*
AY667201*
AY667202*
AY667203*
AY667204*
AY667205*
U87013c
NC001567a
U87061c
U87014c
AF022062e
AF036278d
AF022057e
AF022063e
AF036280d
AF036279d
AF091633d
AF022066e
AY667215*
AY667216*
AF036276d
AF022065e
AJ222680d
AF036277d
AF022067e
AJ222679d
NC001567a
AF036275d
AF036289d
AY667177*
AY667178*
AY667179*
AY667180*
AY667181*
AY667182*
AY667183*
AY667184*
AY667185*
AY667186*
AY667187*
AY667188*
AY667189*
AY667190*
AY667191*
AF165724g
AF165716g
AY667192*
AF165780g
AY667162*
AY667163*
AY667164*
AY667165*
AY667166*
AY667167*
AY667168*
AY667169*
AY667170*
AY667171*
AY667172*
AY667173*
AY667174*
AY667175*
AY667176*
AF165725g
AY029293f
AF210175h
AF165781g
AY667232*
AY667233*
AY667234*
AY667235*
AY667236*
AY667237*
AY667238*
AY667239*
AY667240*
AY667241*
AY667242*
AY667243*
AY667244*
AY667245*
AY667246*
AF165726g
AF165718g
AF210197h
AF165782g
AY667217*
AY667218*
AY667219*
AY667220*
AY667221*
AY667222*
AY667223*
AY667224*
AY667225*
AY667226*
AY667227*
AY667228*
AY667229*
AY667230*
AY667231*
AF165729g
AF65721g
AF210219h
AF165785g
Cycle sequencing was performed using Big Dye chemistry (Version 3.1, Applied Biosystems) with the resulting
products analyzed on a 3100 ABI automated sequencer.
Sequences were aligned using the default settings of
CLUSTAL X (Thompson et al., 1997) and optimized
manually to ensure accuracy. Areas of ambiguously
aligned sites were limited to a single gene and this region
was excluded from all analysis (positions 149159 in
aligned MGF data set). Heterozygous sites were coded
using the IUB codes. The exon and intron boundaries
were deWned from previously published sequence data
for each DNA region (Matthee et al., 2001; Matthee and
Davis, 2001). To further ensure the homology of the
DNA sequences, exon regions were translated into
amino acids and screened for functionality. No stop
codons or insertions were present in the exonic regions.
All new sequences were deposited in GenBank, accession
numbers are shown in Table 1.
2.3. Data partitioning
Each DNA region was analyzed separately. Additionally, data sets were partitioned and analyzed according
to origin of marker (i.e., mitochondrial or nuclear), and
all characters and taxa were merged into a single data
matrix or molecular supermatrix. To explore inconsistencies among data partitions, pair-wise incongruence
length diVerences (ILD) were calculated in PAUP*
4.0b10 (SwoVord, 2003) excluding uninformative characters (Hipp et al., 2004). To further explore the homogeneity of signal among data partitions the crossed
ShimodairaHasegawa tests (crossed SH tests, Shimodaira and Hasegawa, 1999) were performed in PAUP*
4.0b10. In these tests the most likely topology obtained
by the mitochondrial, nuclear partitions and the supermatrix were compared in pair-wise fashion.
2.4. Data analysis
Parsimony and maximum likelihood searches were
performed in PAUP* 4.0b10 using the heuristic search
option with 100 replicates of random taxon addition and
TBR branch swapping. For the maximum likelihood
analyses the optimal evolutionary model and model
parameters were estimated applying the Akaike information criteria (AIC, Akaike, 1974) using MODELTEST
3.06 (Posada and Crandall, 1998). All character transformations were equally weighted and alignment gaps
treated as missing characters in parsimony. Nodal support for parsimony and maximum likelihood trees was
627
3. Results
3.1. Data description
The aligned supermatrix contained 7 DNA regions
and 4682 characters, 3149 bp non-protein coding and
1533 bp protein coding characters. The diVerent DNA
regions were characterized by variation in base composition and modes of evolution (Table 2). As typically
expected the mitochondrial partition contained a higher
proportion of variable characters (28%) when compared
to that of the nuclear partition (12.7%). As expected
given the higher proportion of variable characters in the
628
Table 2
Patterns of sequence variability among the independent DNA regions, mitochondrial partition, nuclear partition, and supermatrix
DNA
partition
Total
(bp)
% Protein
coding
% Variable
characters
Gamma distribution
( value)
Nucleotide frequencies
%A
%T
%C
%G
12S rRNA
16S rRNA
Cyt b
MGF
PRKCI
SPTBN
THY
Mitochondrial
Nuclear
Supermatrix
591
348
1140
674
667
765
667
2079
2603
4682
0
0
100
0
11
16
34
55
15
33
15.6
23.9
35.8
14.1
9.60
16.1
9.90
28.0
12.7
19.5
0.736
0.489
2.023
0.499
NA
0.505
0.139
0.799
0.275
0.623
36.1
34.9
31.9
32.3
34.3
23.2
30.1
34.9
29.8
31.6
22.7
24.6
25.9
34.6
38.7
27.2
32.6
23.7
32.3
28.6
24.7
21.3
28.9
18.1
11.2
24.8
19.3
28.0
18.8
22.9
16.5
19.2
13.4
15.0
16.0
24.8
18.1
13.4
19.1
16.9
In the case of nuclear sequence data % protein coding refers to Xanking exon regions.
Table 3
Summary of the topologies obtained from analysis of the independent DNA regions, mitochondrial partition, nuclear partition, and supermatrix
Data partition
16S rRNA
Cyt b
MGF
PRKCI
SPTBN
THY
Mitochondrial
Nuclear
Supermatrix
Pars.
ML
Bayesian
Pars.
ML
Bayesian
Pars.
ML
Bayesian
Pars.
ML
Bayesian
Pars.
ML
Bayesian
Pars.
ML
Bayesian
Pars.
ML
Bayesian
Pars.
ML
Bayesian
Pars.
ML
Bayesian
Pars.
ML
Bayesian
No. of pars.
informative
characters
No. of equally
parsimonious
trees
Tree length
CI
RI
51
94
142
0.5510
0.5882
61
151
0.5547
0.7311
290
904
0.4861
0.6346
37
116
0.7547
0.8632
17
104
52
0.8261
0.9149
37
142
0.7736
0.8667
31
24
75
0.7209
0.8400
550
1699
0.4753
0.5527
145
491
0.7162
0.8215
700
2203
0.5043
0.5938
No. of topologies
in 95% posterior
interval
14,096
2015
212
7299
13,242
517
10,078
22
18
Nodes
A
51
51
0.70
72
73
0.99
81
100
1.00
100
99
1.00
96
93
1.00
89
80
1.00
100
100
1.00
98
98
1.00
100
100
1.00
100
100
1.00
X
X
X
77
63
1.00
86
88
0.99
X
88
X
X
X
X
X
57
X
X
X
X
95
95
1.00
X
X
X
93
71
0.97
X
X
X
X
X
X
67
79
1.00
X
X
X
X
X
X
92
85
0.97
X
X
X
78
87
1.00
85
94
1.00
95
99
1.00
X
X
X
X
X
X
X
X
X
X
X
X
X
X
X
X
X
X
X
X
X
X
X
X
96
94
1.00
81
75
1.00
X
X
X
X
X
X
95
94
1.00
X
X
X
X
X
X
95
96
1.00
X
X
X
94
94
1.00
65
86
1.00
99
100
1.00
X
X
X
66
70
0.95
X
X
X
X
X
X
X
X
X
X
X
X
X
X
X
X
X
X
58
75
1.00
62
76
1.00
X
X
X
X
X
X
90
93
1.00
X
X
X
X
X
X
X
X
X
X
X
X
96
100
1.00
X
X
X
100
98
1.00
X
X
X
X
X
X
58
70
0.79
X
X
X
X
X
0.64
97
90
1.00
X
X
X
50
83
0.90
93
87
1.00
100
100
1.00
12S rRNA
Method
The number of parsimony informative characters, number of trees, optimal tree length, consistency index (CI), and retention index (RI) values are given for each parsimony tree, as well as, the
number of topologies found within the 95% posterior intervals of each partition. Nodes AH correspond to those in Figs. 1 and 2, bootstrap values (>50) are given, X indicates where a node
was not retrieved.
629
630
Fig. 1. Topologies retrieved by (A) the combined mitochondrial DNA data and (B) the combined nuclear DNA data. Values given above the
branches represent the molecular clock estimates of divergence times for nodes labelled AH with standard deviations in parenthesis. Values below
the branches represent Bayesian posterior probabilities, maximum likelihood bootstrap values, and parsimony bootstrap values.
631
Fig. 2. Topology retrieved by the combined analysis of all molecular data. Values given above the branches represent the molecular clock estimates of
divergence times for nodes labelled AH with standard deviations in parenthesis. These divergence values represent the age of the most recent common ancestor of the taxa contained within that clade. Values below the branches represent Bayesian posterior probabilities, maximum likelihood
bootstrap values, parsimony bootstrap values. Solid bars indicate deletion events and stippled bars represent insertion events. The single homoplasious deletion is highlighted with *.
632
Fig. 3. The number of unambiguous synapomorphies, partitioned Bremer support and nodal signiWcance values for each DNA region at nodes AH.
Positive PBS values indicate that a particular gene region contributes phylogenetic signal at that node, while a negative value (shown in black) indicates that a data set conXicts with the supermatrix at a particular node.
4. Discussion
4.1. Nuclear intron markers
The nuclear introns selected in the present investigation have been applied successfully to explain evolutionary relationships among the Cetartiodactyla (Matthee
633
634
Acknowledgments
We acknowledge E. Harley (University of Cape
Town), M. Harman (Powell Cotton Museum), J. Sakwa
(University of Pretoria), and the Pretoria Zoological
Gardens for providing samples. John Gatesy and three
anonymous reviewers are also thanked for their useful
comments on the manuscript. Financial assistance was
provided by the National Research Foundation of
South Africa (GUN 2053662).
References
Alden, P.C., Estes, R.D., Schlitter, D., MacBride, B., 1995. National
Audubon Society Field Guide to African Wildlife. Knopf, New
York.
Allard, M., Miyamoto, M., Jarecki, L., Kraus, F., Tennant, M., 1992.
DNA systematics and evolution of the artiodactyls family Bovidae.
Proc. Natl. Acad. Sci. USA 89, 39723976.
Akaike, H., 1974. A new look at the statistical model identiWcation.
IEEE Trans. Autom. Contr. 19, 716723.
Anderson, S., de Bruijn, M.H., Coulson, A.R., Eperon, I.C., Sanger, F.,
Young, I.G., 1982. Complete sequence of bovine mitochondrial
DNA. Conserved features of the mammalian genome. J. Mol. Biol.
156, 683717.
Ansell, W.F.H., 1971. Order Artiodactyla. In: Meester, J., Setzer, H.W.
(Eds.), The Mammals of Africa: An IdentiWcation Manual. Smithsonian Institute Press, Washington.
Bremer, K., 1988. The limits of amino acid sequence data in angiosperm phylogenetic reconstruction. Evolution 42, 795803.
Bremer, K., 1994. Branch support and tree stability. Cladistics 10, 295
304.
Bronner, G.N., HoVmann, M., Taylor, P.J., Chimimba, C.T., Best, P.B.,
Matthee, C.A., Robinson, T.J., 2003. A revised systematic checklist
of the extant mammals of the southern African subregion. Durban
Mus. Novit. 28, 56106.
Buckley, T.R., Arensburger, P., Simon, C., Chambers, G.K., 2002. Combining data, bayesian phylogenetics, and the origin of the New Zealand Circada genera. Syst. Biol. 51, 418.
Cerling, T.E., Harris, J.M., MacFadden, B.J., Leakey, M.G., Quade,
J., Eisenmann, V., Ehleringer, J.R., 1997. Global vegetation
change through the Miocene/Pliocene boundary. Nature 389,
153158.
Corneli, P.S., 2003. Complete mitochondrial genomes and eutherian
evolution. J. Mammal. Evol. 9, 281305.
Cracraft, J., Helm-Bychowski, K., 1991. Parsimony and phylogenetic inference using DNA sequences: some methodological
635
636
Vrba, E.S., 1985. African Bovidae: evolutionary events since the Miocene. S. Afr. J. Sci. 81, 263266.
Waddell, P.J., Kishino, H., Ota, R., 2001. A phylogenetic foundation
for comparative mammalian genomics. Genome Informat. 12, 141
154.
Walker, E.P., 1964. Mammals of the World. Johns Hopkins Press, Baltimore.
Wilcox, T.P., Zwickl, D.J., Heath, T.A., Hillis, D.M., 2002. Phylogenetic
relationships of the dwarf boas and a comparison of Bayesian and