Академический Документы
Профессиональный Документы
Культура Документы
Analytical Biochemistry
journal homepage: www.elsevier.com/locate/yabio
a r t i c l e i n f o a b s t r a c t
Article history: Metagenomic studies traditionally rely on cloning polymerase chain reaction (PCR) products and
Received 9 March 2009 sequencing multiple clones. However, this approach is tedious and expensive, thereby limiting the range
Available online 3 May 2009 and scale of questions that can be addressed. Recent developments in DNA sequencing technologies
enable a dramatic increase in throughput via parallel in-depth analysis of many samples with limited
Keywords: sample processing and lower costs. We directly compared the traditional cloning approach with a bar-
Saliva coded pyrosequencing method to see whether the latter accurately describes microbiome diversity in
Microbiome diversity
human saliva. Our results indicate that despite the shorter read lengths, the pyrosequencing approach
Barcoded pyrosequencing
Cloning
provides a description of the human salivary microbiome that is in good agreement with results based
on the traditional cloning and sequencing approach.
Ó 2009 Elsevier Inc. All rights reserved.
Analysis of 16S ribosomal RNA (rRNA)1 sequence variation has ing, not a direct comparison between the pyrosequencing and
revolutionized the study of bacterial diversity in various commu- cloning approaches, and concluded that the pyrosequencing ap-
nities [1–12]. However, the traditional way to carry out such proach would describe with reasonable accuracy the diversity
metagenomic analyses relies on cloning polymerase chain reaction of microbial communities within the mouse and human gut
(PCR) products and sequencing multiple clones [1–5], a process and the Guerrero Negro microbial mat ecosystems down to the
that is tedious and expensive. Therefore, metagenomic studies genus level [6–8]. In addition, a few studies of microbiome
have been limited in the number of samples and number of se- diversity in environmental and human fecal samples have been
quences obtained per sample, and this in turn limits the range carried out using the pyrosequencing approach alone [9–12]
and scale of questions that can be addressed. Recent improve- without any comparison with the traditional cloning approach.
ments in DNA sequencing technology (e.g., barcoded pyrosequenc- Here we directly compare the results of the barcoded pyrose-
ing on the Genome Sequencer FLX/454 Life Sciences platform) quencing approach with a previous study of cloned sequences
enable a dramatic increase in throughput via parallel in-depth from the same samples [14] to assess the reliability and informa-
analysis of many samples with limited sample processing and tiveness of the former.
lower costs [13].
However, such advances generally come at the cost of shorter
sequence read lengths, and this could potentially decrease taxo- Materials and methods
nomic classification fidelity. Even though recent developments in
pyrosequencing technology are enabling longer read lengths, it is Samples
still of interest to know whether shorter sequences limit the dis-
criminatory power of each read and whether they are informa-
DNA extracts from saliva samples collected from 12 individuals,
tive enough for metagenomic analyses. To date, this issue has
each from a different geographic location, were used in this study.
been addressed by only a few studies that used in silico model-
The geographic locations of the sampled groups were described
elsewhere [14]. The same DNA samples were used in a previous
study of the human saliva microbiome in which a portion
* Corresponding author. Fax: +49 34 13550555.
(500 bp) of the 16S rRNA gene was amplified and cloned
E-mail address: nasidze@eva.mpg.de (I. Nasidze).
1
Abbreviations used: rRNA, ribosomal RNA; PCR, polymerase chain reaction; R1,
and approximately 120 clones were sequenced from each individ-
region 1; R2, region 2; C, cloning; RDPII, Ribosomal Database Project II. ual [14].
0003-2697/$ - see front matter Ó 2009 Elsevier Inc. All rights reserved.
doi:10.1016/j.ab.2009.04.034
Comparative analysis by pyrosequencing and cloning / I. Nasidze et al. / Anal. Biochem. 391 (2009) 64–68 65
Fig. 2. Heat plot of the abundance of each bacterial genus in each individual, based on the partial 16S rRNA sequences. Each numbered horizontal row corresponds to a genus;
the genus name corresponding to each number can be found in Supplementary Table 1 (see Supplementary material). Each block of three columns, separated by vertical lines,
is an individual saliva sample. Column C in each block refers to the results obtained by the cloning approach, whereas columns R1 and R2 indicate results for region 1 and
region 2, respectively, obtained by the pyrosequencing approach. The abundance of each genus is indicated by the grayscale value according to the scale at the bottom of the
plot.
the variability in the saliva microbiome is shared among individu- the three datasets within an individual was 0.732, whereas be-
als [14]. Nevertheless, the average correlation coefficient among tween individuals it was 0.441, which is significantly lower
Comparative analysis by pyrosequencing and cloning / I. Nasidze et al. / Anal. Biochem. 391 (2009) 64–68 67
Table 1
Numbers of sequence reads and genera detected by different approaches.
Individual Number Number of Number of Number of Number of Number of Number of Number of Number of Number of
of reads C detected reads R1 detected reads R2 detected reads detected reads detected
genera genera genera R1 + R2 genera C + R1 + R2 genera
Germany 116 14 141 22 117 18 258 26 374 28
China 118 26 148 26 204 27 352 34 470 42
Philippines 115 16 119 18 342 23 461 27 576 30
Poland 118 15 164 17 271 21 435 27 553 29
Oakland 112 15 126 22 385 25 511 27 623 29
Turkey 120 20 160 23 187 20 347 26 467 28
Georgia 124 21 107 24 292 27 399 36 523 43
Bolivia 127 20 95 18 315 20 410 25 537 33
Argentina 115 16 102 12 271 17 373 22 488 26
Louisiana 102 19 138 20 298 23 436 29 538 38
Congo 124 15 151 11 137 12 288 17 412 22
South 138 20 116 17 212 20 328 24 466 28
Africa
Total 1429 52 1567 68 3031 54 4598 79 6027 89
Fig. 5. Pairwise correlation matrix between datasets both within individuals and between individuals. The correlation coefficient values are indicated by the grayscale value
according to the scale at the bottom of the matrix.