Академический Документы
Профессиональный Документы
Культура Документы
INTRODUCTION
We have developed a high throughput SNP genotyping system that offers speed
and flexibility, produces reliable, high quality data and supports large-scale
genotyping projects for disease research, including association and linkage
analysis. This SNPlex Genotyping System is based on multiplex OLA/PCR and
capillary electrophoresis for high throughput genotyping. The assay uses an
optimized universal set of ZipChute reagents for accurate read-out on highthroughput capillary electrophoresis platforms. Genomic DNA is interrogated with
multiplexed sets of ligation probes targeting currently up to 48 specific SNP loci
in each reaction. Such a multiplex reaction utilizes less than 1 ng of gDNA per
SNP genotype. A pair of universal PCR primers amplifies all ligation products in
a multiplex simultaneously. Amplicons containing internal universal cZipCode
oligonucleotide sequences are hybridized to a corresponding mix of universal
fluorescent ZipChute reagents. ZipChute reagents contain sequences complementary to the cZipCode oligonucleotide sequences and exhibit unique preoptimized mobilities during electrophoresis. Genotypes are determined by identification of specifically hybridized ZipChute reagents that are eluted and identified
by capillary electrophoresis and subsequently associated with target SNPs using
the GeneMapper Analysis Software. Using a 48-plex format with the Applied
Biosystems 3730xl instrument and commercially available robotics systems one
can process approximately 1.5 million genotypes in 5 days. Statistics for 1,250
population validated SNPs are presented including the in silico assay design,
conversion rate, accuracy and call rate.
Results
A total of 1250 high-quality SNPs were selected from Applera Corp. resequencing and discovery project. The SNP targets were submitted to the
automated assay and pool design pipeline (Figure 3). Successfully designed and
synthesized OLA probe sets were tested against a panel of 92 DNA samples
from Coriell Cell Repositories. Genotype data were compared to TaqMan probebased assays (Ranade et al., 2001) and sequencing data.
ZipCode1
GER
A
ASO1
spacer
ASO-Linker 1
rev primer
GER
site
ZipCode2 GER
ASO2
LSO
spacer
ASO-Linker 2
Activate probes
Perform OLA
Allelic
Discrimination
LSO-Linker
Step 1: Activation
A
P
P
Step 2: Ligation
G
Purification
gDNA
Step 3: Purification
Ligation Product
Amplification
Capture on SA-plates
Step 4: Amplification
Purification
Biotin
Step 5: Capture
Anneal Reporter
Probe
Read Out on CE
Streptavidin
Fluorescent
Label
ZipChute
Probe
Mobility
Modifier
Step 6: Hybridization
Total
~16 hrs
Step 7: Elution
Allele Calling
1250
100
Successful designs
1166
93.3
762
61.0
404
32.3
OLA probes were designed using proprietary probe design algorithm. Both DNA
strands were initially targeted for probe design.
Percent
SNPs submitted
For 84 SNPs both strands failed design due to common incompatible sequence
motifs, secondary structure, and repeat sequences within the genome.
Based on probe design rules one strand is favored over the other based on
sequence composition and secondary structure. Our design filters indicated that
second strand synthesis should not be attempted.
Step 1: Detection
Total
Percent
1928
100.0
1773
92.0
1166
100.0
1068
91.6
762
100.0
705
92.5
Tested Designs
Passed Designs
Bin1 + Bin2 = Allele Identification
Applied Biosystems 3730xl DNA Analyzer
Step 3:
Genotype Clustering
References
High-Throughput Genotyping with Single Nucleotide Polymorphisms (2001). Ranade, K., Chang, M.-S., Ting, C.T., Pei, D., Hsiao, C.-F., Olivier, M., Pesich, R., Hebert, J., Chen, Y.-D., Dzau, V. J., Curb, D., Olshen, R., Risch,
N., Cox, D. R. and Botstein, D. Genome Res., 11, 12621268.
Acknowledgements
*The author wishes to acknowledge the following groups at Applied Biosystems for their support of this project:
Genomic Applications R&D, Pilot Operations R&D, Bioinformatics, Global Oligo Operations, Product Test, Genotyping
Applications Marketing, Consumables Development and Manufacturing, and Analysis Software R&D.
Note
The PCR process is covered by patents owned by Roche Molecular Systems, Inc. and F. Hoffmann-La Roche Ltd.
Applied Biosystems and GeneMapper are registered trademarks and AB (Design), Applera, SNPbrowser, SNPlex,
ZipChute, and ZipCode are trademarks of Applera Corporation or its subsidiaries in the US and/or certain other
countries.
TaqMan is a registered trademark of Roche Molecular Systems, Inc.
Genotype clustering with polar plotting and genotype calling, using GeneMapper Analysis Software version 3.5
99.6%
99.9%
99.2%
98.5%
96.4%
Start Design
Batch
Multiplex
Pool Design
myScience
Assay Design
XML
Web Portal
SNP Specificity
SNP sequence
Input flat-file
Input validation
Web Portal
myScience
Probe Sets
Manufacturing
Review Design
Report
SNP sequence
Input XML file
Genome
Assembly
Assay Information
File (XML)
Probe Sets
(48-plex)
Experimental design. A set of SNPs were selected for which both sequencing
and TaqMan probe based assay data were available. This set is a subset of SNPs
with genotype information from both the Applera Genome Initiative and the
TaqMan Assays-on-Demand SNP Genotyping Products. Selected assays were
required to have between 9 and 14 high-quality sequencing reads for DNA
samples in common between the re-sequencing data and the TaqMan data. A
further criterion was to select only SNPs whose minor allele frequencies
determined by 5-nuclease were at least 0.10 with at least one of the major
populations. These assays were previously tested against a panel of 89 DNA
samples from the Coriell Institute Cell Repository to obtain allele frequencies.
Call rate
Concordance
Precision
12
98.68%
99.79%
99.97%
One additional SNP set, consisting of 48 high quality SNPs with minor allele
frequencies of typically >0.1 with four major populations (Caucasian, African
American, Japanese, Chinese) was selected from Applera Corp. SNP discovery
project. This 48-plex SNP set was run through the whole SNPlex system 12
times. For the 46 SNPs that always passed the automated GeneMapper analysis
across all twelve runs, call rate, concordance, and precision were calculated.
Concordance was determined based on 34,891 genotypes that were in common
between SNPlex and the TaqMan Assays on Demand SNP genotyping products.
Of the 576 possible assays 560 (97.2%) passed the automatic genotyping
pipeline. Of the 16 failed assays one SNP failed in all twelve runs, a second SNP
failed in 4 out of 12 runs due to insufficient cluster separation.
Conclusions
We show initial results of SNP genotyping that were achieved
with the SNPlex Genotyping System, an assay for the high
throughput determination of SNP genotypes. Genotypes are
detected with low consumption of gDNA (~0.8 ng/genotype), high
reproducibility and accuracy, and high throughput on readily
available capillary electrophoresis based read-out systems. In
this study that used population validated SNPs with a minor allele
frequency of at least 5% in at least one major population, the
overall system conversion rate was ~85% (93% in silico design
conversion x 92% assay conversion). The SNPbrowser software
can be used to select these SNPs and to display their position
within LD and haplotype blocks. Where possible, those SNPs can
be used as backbone SNPs for the required application. At a 48plex level one can conceivably generate approximately 1.5 million
genotypes in 5 days on an Applied Biosystems 3730xl system
and commercially available robotics systems. Genotypes are
called automatically, using GeneMapper software, with the option
of manual editing. The system is currently configured at 48-plex
per reaction; however, the future system enhancements include
increased throughput through higher multiplexed reactions, such
as 96- and 192-plex per reaction.