Академический Документы
Профессиональный Документы
Культура Документы
Recent advances in proteomics technologies provide tre- have recently gained significant attention due to the power of
mendous opportunities for biomarker-related clinical ap- these technologies for analyzing complex protein mixtures
plications; however, the distinctive characteristics of hu- and their potential for identifying novel markers indicative of
man biofluids such as the high dynamic range in protein disease. It is widely believed that many complex human dis-
abundances and extreme complexity of the proteomes eases, including cancers, might be more effectively cured if
present tremendous challenges. In this review we sum- specific disease biomarkers were available to enable detec-
marize recent advances in LC-MS-based proteomics pro-
tion and treatment at very early stages of disease (3). Despite
filing and its applications in clinical proteomics as well
noteworthy efforts, only a handful of cancer biomarkers have
as discuss the major challenges associated with imple-
1
From the Biological Sciences Division and Environmental Molecular The abbreviations used are: FDA, Food and Drug Administration;
Sciences Laboratory, Pacific Northwest National Laboratory, SCX, strong cation exchange chromatography; NET, normalized elu-
Richland, Washington 99352 tion time; AMT, accurate mass and time; IMS, ion mobility spectrom-
Received, May 2, 2006, and in revised form, July 25, 2006 etry; 2D, two-dimensional; RPLC, reversed phase LC; MARS, multiple
Published, MCP Papers in Press, August 3, 2006, DOI 10.1074/ affinity removal system; HUPO, Human Proteome Organization; LPS,
mcp.M600162-MCP200 lipopolysaccharide; MRM, multiple reaction monitoring.
This paper is available on line at http://www.mcponline.org Molecular & Cellular Proteomics 5.10 1727
LC-MS-based Clinical Proteomics
TABLE I
Challenges and limitations of current LC-MS-based proteomics technologies applied to biomarker discovery
Challenge Current techniques for addressing the challenge Limitations
Dynamic range of Immunoaffinity depletion and multidimensional Low throughput, requires relatively large
measurements fractionation coupled with high resolution LC-MS sample sizes
or MS/MS instrumentation
Sensitivity Small inner diameter LC column (50 m or less) Issues in robustness and expense
coupled with nanoflow electrospray ionization and
advanced MS instrumentation (i.e. FTICR, LTQ-FT)
Reproducibility and Platform automation (including sample processing), Variations from multistep sample
quantitation label-free direct quantitation, and isotope labeling- processing, ionization suppression and
based quantitation instrument variations, labeling efficiencies
Throughput Automated fast LC and gas phase ion mobility Limited dynamic range or coverage
separations
False positive identifications Improved database searching algorithms and Lack of consensus
statistical models
(MS/MS)-based proteomics technologies offer highly sensi- and therapeutic target discovery (21), offering a promising al-
tive analytical capabilities and a relatively large dynamic range ternative to direct tissue analysis. In the following review, we
FIG. 1. A component diagram of an LC-MS protein profiling platform. FFE, free flow electrophoresis; 1D, one-dimensional; iTRAQ,
isobaric tags for relative and absolute quantitation.
3) accurate quantitation of relative protein abundances across throughput can be severely reduced. Other key performance
many clinical samples, and 4) high throughput capable of factors are the confidence of protein identifications and the
analyzing large numbers of clinical samples to provide suffi- quantitative accuracy, which determine the ability of the plat-
cient statistical power needed to address biological variability. form to confidently identify a potential biomarker based on the
FIG. 2. A typical LC-FTICR analysis of an IgY-12 depleted human plasma sample. A, the base peak chromatogram. B, a 2D display of
⬃2,800 identified species at the mass and NET space. The analysis was performed using a Bruker 9.4-tesla FTICR instrument coupled with
an LC system equipped with a 150-m-inner diameter and 65-cm-long capillary column operated at 5,000 p.s.i.
TABLE II
The proteome coverage and estimated dynamic range offered by current LC-MS technologies
A pooled reference plasma sample from healthy individuals was used for this evaluation. A prepacked 4.6 ⫻ 50-mm (loading capacity, 15
l of plasma) MARS affinity column (Agilent, Palo Alto, CA) and a 7 ⫻ 52-mm (loading capacity, 25 l of plasma) ProteomeLab IgY-12 affinity
column (Beckman Coulter, Fullerton, CA) were used for the depletion of high abundance proteins. For each method, the samples were
processed in triplicate and individually analyzed using a 150-m-inner diameter and 65-cm-long column coupled with either a Finnigan LTQ
system (MS/MS) or a Bruker 9.4-tesla FTICR instrument. 10 and 5 g of peptide samples were loaded for each LC-MS/MS and LC-FTICR
analyses, respectively. 300 g of peptides were used for each SCX fractionation. The LC and SCX operations were the same as described
previously (31). Peptides were filtered with a confidence level ⬎95% based on reversed database evaluation (32), and proteins were identified
with at least two different peptides. ALS, acid-labile subunit; vWF, von Willebrand factor; SAA, serum amyloid A; CRP, C-reactive protein;
HGFA, hepatocyte growth factor activator; MSF, megakaryocyte-stimulating factor; EGFR, epidermal growth factor receptor; APOC2,
apolipoprotein C-II; B2M, 2-microglobulin; NAP1L1, nucleosome assembly protein 1-like1; MMP2, matrix metallopeptidase 2; 1D, one-
dimensional. We note that more relaxed indentification criteria would considerably expand the numbers of peptides and proteins identified by
all approaches.
Replicate Estimated
Methods Overlap Identified low abundance proteins dynamic range
1 2 3 of coverage
Non-depleted plasma and 1D LC-MS/MS
Peptides 1,398 1,213 1,466 972 ALS, 25 g/ml; Factor XII, 30 g/ml;
Proteins 99 97 102 96 APOC2, 35 g/ml ⬃103
MARS depletion and 1D LC-MS/MS
data2 with all proteins identified using a minimum of two sion effects for different proteins/peptides within the complex
different peptides. As shown, the single LC-MS/MS analysis sample.
only identifies ⬃100 proteins with high confidence and pro- One key area of recent advances in LC-MS technologies is
vides a dynamic range of ⬃103. With the removal of either the the improvement associated with capillary LC instrumentation
top six (MARS) or top 12 (IgY-12) abundant proteins, the that provides enhanced peak capacities and dynamic range
overall dynamic range is enhanced to ⬃105. LC-FTICR shows of detection needed to analyze clinical samples. These im-
greater coverage for both peptide and protein identifications provements have been achieved primarily through the use of
compared with LC-MS/MS, and the dynamic range is esti- very high pressure (10 –20 kp.s.i.), very small porous particles
mated to be similar to that observed for LC-MS/MS. (It should (3 m or less), smaller inner diameter columns (50-m inner
be noted that presently unassigned peptides probably include diameter or less), nanoelectrospray interfaces, and relatively
many more proteins.) When IgY-12 depletion and SCX frac- long columns and long gradients for separations (33–35). For
tionation are combined with LC-MS/MS, a dynamic range of example, high efficiency separations with peak capacities of
106–107 can be achieved, allowing identification of nearly 500 ⬃1,000 have been achieved by using 15–75-m-inner diam-
proteins in plasma with high confidence including many at the eter and 85-cm-long capillary columns packed with 3-m
low ng/ml level, and 2D LC-FTICR analyses would be ex- C18-bonded silica particles operated at 10 kp.s.i. By using
pected to increase this by approximately another order of smaller inner diameter columns (e.g. 15 m) (34), the sensi-
magnitude. Note, however, that this dynamic range still falls 3 tivity of the system continues to increase inversely as the
orders of magnitude short for detecting pg/ml protein con- mobile phase flow rates drop to as low as 20 nl/min, demon-
centrations. In addition, it should be noted that not all the strating the advantages of ESI-MS analyses at very low liquid
proteins within the estimated dynamic range will be detected flow rates (36, 37). More recently, the use of 20 kp.s.i. capillary
due to the differences in digestion efficiency and ion suppres- LC columns packed with 1.4 –3-m porous C18-bonded silica
particles has been demonstrated to provide chromatographic
2
V. A. Petyuk, W. J. Qian, M. H. Chin, H. Wang, E. A. Livesay,
peak capacities of 1,000 –1,500 for complex peptide and me-
M. E. Monroe, J. N. Adkins, N. Jaitly, D. J. Anderson, D. G. Camp, tabolite mixtures (35). Although these very high pressure sys-
D. J. Smith, and R. D. Smith, manuscript submitted. tems present technical challenges for robust automated op-
erations, the recently commercialized Waters nanoACQUITY followed by in-gel digestion has also been used for plasma
UPLC System that takes advantage of 1.7-m sized particles protein fractionation prior to LC-MS/MS (52). A number of
and operates at ⬎10 kp.s.i. demonstrates the feasibility of recent large scale proteome profiling studies have combined
such high performance systems for routine applications. With different protein- and peptide-level fractionation techniques
further improvements in robustness, these “ultraperformance” (e.g. PF2D (45), SCX/RPLC (54), free flow electrophoresis-IEF/
systems may become a powerful component for separating RPLC (47), ZOOM/SDS-PAGE (50), and Rotofor/RPLC/SDS-
complex mixtures such as human biofluids while concurrently PAGE (49) protein fractionation) with peptide-level LC-MS/MS
providing the high dynamic range needed for candidate analyses to achieve more comprehensive coverage of the
biomarker discovery applications. plasma proteome.
An alternative to plasma protein fractionation is to specifi-
MULTIDIMENSIONAL FRACTIONATION STRATEGIES COUPLED WITH cally enrich functional “subproteomes” such as the glycopro-
LC-MS FOR IMPROVED PROTEOME COVERAGE teome or the cysteinyl subproteome by using chemical tag-
Given the tremendous dynamic range of protein abun- ging or capture agents; this significantly reduces overall
dances and the extraordinary complexity of human biofluid sample complexity and enhances detection of low abundance
proteomes, many different fractionation techniques have proteins. For example, we have recently demonstrated a sim-
been developed and applied in a multidimensional fashion to ple procedure for effectively enriching cysteinyl peptides from
enhance dynamic range of detection and improve proteome complex proteomes (including human biofluids (55)) that pro-
coverage (13). Multicomponent immunoaffinity removal of vides significantly improved proteome coverage when used
sample analyses per day per MS instrument. Several reports (commonly helium or N2) and a uniform electric field estab-
have explored the use of smaller particle-packed columns lished along the axis of separation. Mixtures of peptides,
or monolithic columns for fast LC separations (10 min or proteins, or small molecules are separated by their gas phase
less) as well as multiplex column systems to significantly cross-sections (size) in addition to charge, and knowledge of
improve the throughput (60, 61). However, it is unclear their mobility provides another separation dimension to aid in
whether sufficient separation power can be achieved with identification.
these fast liquid phase separations because the increase in The power of IMS has been advanced by several recent
the solvent gradient speed can degrade the separation peak technical developments. IMS coupled with a TOF MS platform
capacity (60), which in turn reduces the overall dynamic and combinatorial libraries (65) has been recently demon-
range of detection. Other strategies for achieving robust strated for analysis of proteolytic digests (66). Because an
fast separations include liquid phase chromatographic and IMS separation typically requires 1–100 ms and has a resolv-
electrophoretic separations on a microfluidic chip platform ing power of 50 –200, a single species IMS peak exits the drift
(62– 64). Such chip-based separation devices also have the tube over a ⬃0.1–1-ms period. Generation of a typical TOF
advantage of providing better robustness, reliability, and MS spectrum requires ⬃30 –100 s, which allows multiple
ease of operation. mass spectra to be obtained during the “elution” of an IMS
Very fast (millisecond scale) gas phase separations based peak. More recently, LC has been coupled to IMS-TOF MS via
on ion mobility spectrometry (IMS; a separation method that is an ESI interface, providing 2D separations prior to MS anal-
somewhat analogous to electrophoresis in the gas phase) are ysis (67). Despite enormous potential for high throughput
another powerful alternative to liquid phase separations for analyses of complex samples, the application of IMS-TOF MS
significant improvement in throughput. At its simplest, an IMS has been limited by low sensitivity due to ion losses at the
stage consists of a drift tube filled with a non-reactive gas IMS-MS interface; however, the recent implementation of
FIG. 4. Schematic diagram of a prototype ESI-IMS-Q-TOF instrumentation platform that uses electrodynamic ion funnel interfaces
at both ends of the IMS drift tube and, as a result, provides very high sensitivity from high speed analyses. Reproduced with permission
TABLE III
Comparison of peptide and protein identifications from a plasma proteome profiling dataset analyzed using different criteria (59)
Average Estimated
Peptides Proteins Multipeptide
Filtering criteria Difference in stringency peptides false positive
identified identifieda proteins
per protein rateb
%
assignments from the data (79). This approach has been measurements for peptide/protein identifications. The utility
directly compared with the reversed database approach for of accurate mass measurements initially was demonstrated in
analyzing the same dataset derived from human plasma (59). the “peptide mass fingerprinting” approach for protein iden-
Following filtering with reversed database criteria, 6,279 tification in which a set of peptide fragments unique to each
unique peptides were identified from this dataset with ⬎95% protein are created by digestion, and the mass of these pep-
confidence, whereas 6,341 unique peptides were identified by tide fragments is used as a “fingerprint” to identify the original
PeptideProphet using a minimum computed probability of protein (84 – 86). Thus far, this approach has been limited to
0.95. Approximately 95% of peptides were common between simple protein mixtures or single proteins. The more recently
the two datasets, suggesting comparable results from these reported AMT tag approach utilizes accurate LC retention
two statistical approaches. The use of ProteinProphet, an- time measurements in addition to accurate mass measure-
other statistical model that computes the probability of the ments to identify peptides and has been successfully applied
presence of proteins, addresses the issue of whether pep- to global proteome profiling, including the human plasma
tides are present in more than one entry in the protein data- proteome (31, 87). With the AMT tag approach, peptides are
base (protein redundancy problem) (80). The list of identified identified by matching LC-MS observed mass and normalized
peptides from both the PeptideProphet and the reversed da- elution time (NET) features to AMT tags in the pre-established
tabase filtering approaches can serve as input for Protein- reference database (look-up table of peptides) with a given
Prophet to generate a list of non-redundant protein identifi- mass error and NET error tolerances (typically 1–5 ppm for
cations. Several other statistical methods have been recently mass and 1–3% for NET). The potential false positive identi-
described for evaluating peptide assignments from MS/MS fications resulting from random matching of features to the
spectra (81– 83). Ideally universal acceptance of a statistical reference database are indicated on histograms of mass error
model that optimizes both sensitivity and specificity for con- (the difference between observed mass and calculated mass
fident peptide identifications from MS/MS spectra will allow for the matched peptide in the database) exemplified in Fig.
cross-comparison of protein profiling results from different 6A for a human plasma dataset analyzed by LC-FTICR. Note
laboratories, which currently remains as an unresolved that the use of the NET constraint significantly reduces the
challenge. level of random matches as indicated by the background level
Similar challenges exist for evaluating false positive identi- for each histogram. Similar to the reversed database ap-
fications from MS-only approaches that utilize accurate mass proach for MS/MS, we have recently applied a shifted data-
base approach for evaluating the false positive rate in the served to be up-regulated upon LPS administration. Several
AMT tag process.2 As shown in Fig. 6B, an ⬃3% false positive other studies have shown that this peptide hits approach can be
rate for this human plasma dataset was estimated as the ratio used as a semiquantitative approach for initial screening when
of the area beneath the curve that represents matches to the applied with proper controls and with adequate thresholds
shifted database (black squares) and the area beneath the (90 –93).
curve that represents matches to the normal database within More recently, we have demonstrated 16O/18O labeling
a ⫾2 ppm window (gray circles). In addition to being used for combined with the AMT tag strategy as an effective global
direct identification in the MS-only approach, the accurate quantitative approach for quantifying relative protein abun-
mass information also has been utilized for improving the dance differences in human plasma (31). By incubating tryptic
confidence of peptide identifications by MS/MS through ap- peptides in 18O water (55, 94) in the presence of trypsin, the
18
plication of the new generation of LTQ-FT and LTQ-Orbitrap O atoms are incorporated into the carboxyl terminus of
mass spectrometers (88, 89). tryptically cleaved peptides via a postdigestion trypsin-cata-
lyzed oxygen exchange reaction. The 16O/18O-labeled pep-
QUANTITATION STRATEGIES tide pairs provide a 4-Da mass difference (Fig. 7A), which
The ability to quantitatively measure relative protein abun- allows a high resolution mass spectrometer such as FTICR or
dance differences between different clinical samples is essen- TOF to effectively resolve the 16O- and 18O-labeled peptide
tial for identifying candidate protein biomarkers; however, the pairs and accurately measure the relative abundances. The
vast majority of proteomics work related to biomarker discov- advantage is that all types of samples (e.g. tissues, cells, and
ery published to date has been qualitative, highlighting the biological fluids) can be effectively labeled using this simple
need for more robust quantitative approaches for such appli- and specific enzyme-catalyzed reaction. Fig. 7A shows a
cations. Our initial application for comparative proteome anal- partial 2D display of detected peptide pairs in mass versus
ysis of human plasma following lipopolysaccharide (LPS) ad- time dimensions. The 18O/16O-labeled peptides are readily
ministration involved a semiquantitative strategy based on the visualized as co-eluting pairs (4 Da apart), and the abundance
total number of peptide identifications per protein (peptide ratio can be precisely calculated for each 18O/16O pair. In this
hits or spectrum count) (74). In this study, standard SCX-LC- initial comparative analysis demonstration of two human
MS/MS analysis was performed at the 0-h time point (control) plasma samples obtained from a healthy individual prior to
and a 9-h time point following LPS administration, and pep- (control) and following LPS administration, relative abundance
tide hits were used to obtain a relative quantitative measure differences between the two plasma samples were quantified
between the control and 9-h time point. Several known in- for a total of 429 plasma proteins. Fig. 7B shows the normal-
flammatory response and acute phase proteins were ob- ized -fold changes in 429 quantified proteins and demon-
strates the significant changes in abundance for a set of capillary columns for separations (36, 37). It is well demon-
proteins following LPS administration. The combined 16O/18O strated that smaller inner diameter columns with lower flow
labeling-AMT tag strategy can also be easily coupled with rates provide significantly higher sensitivity than larger inner
subsequent peptide-level fractionation approaches such as diameter columns with higher flow rates (34) because of the
cysteinyl peptide enrichment (55) and SCX fractionation. significant improvements in both ionization and MS sampling
Other stable isotope labeling methods based on relative efficiencies. Reversed phase packed nanoscale LC and mon-
peptide/protein abundance measurements include metabolic olithic nanoscale LC separations have been developed and
labeling (95–97) and chemical labeling of specific functional coupled to ESI for improved ionization and quantitation (34,
groups using reagents such as ICAT (98) and iTRAQ (isobaric 105). As ionization efficiencies are increased for nanoelectro-
tags for relative and absolute quantitation) (99, 100) have been spray, detection biases are decreased because undesired
routinely used for quantitative proteomics analysis. In clinical matrix effects and/or ion suppression effects are either re-
proteomics applications, these stable isotope labeling tech- duced or eliminated (104 –106), providing the basis for im-
niques are well suited for detecting accurate changes in pair- proved quantitation. With further improvements to the ro-
wise comparisons provided the samples can be effectively bustness of these nano-LC-ESI-MS systems, label-free
labeled; however, it is often challenging to compare across a quantitation may be widely applied in clinical applications.
large number of clinical samples. One alternative to the use of Another challenge for quantitative clinical proteomics appli-
these labeling techniques is the use of a labeled reference cations is the variability introduced during multiple steps of
sample (often a pooled composite) that is spiked into each sample processing. With continued development of cleanup
due to individual genetic variability (i.e. gender, race, etc.) healthy control subjects present a challenge for identifying
and/or to contributing environmental factors such as diet, disease-specific differences. To address these challenges
overall health, detrimental environmental exposures, etc. The and increase the confidence of discovery results, it is essen-
complexity of human diseases presents another degree of tial for the discovery platform to be able to analyze a relatively
challenge. For example, in human cancer, each tumor type large number of clinical samples in a high throughput manner
typically consists of a number of subtypes that differ with to obtain sufficient statistical power.
regard to their spectrum of genetic alterations (107). There- Other proteomics studies have also described the effects of
fore, a potential candidate biomarker of disease may be ele- human heterogeneity in specific model systems. Hu et al. (15)
vated only in a certain percentage of the pool of disease performed a limited study that compared both intra- and
patients. interindividual variability of human cerebrospinal fluid samples
The implications of human heterogeneity in the context of obtained from six individuals. Specific proteins were observed
LC-MS-based proteomics experiments centers mostly on the to fluctuate over time with the same individual, but overall
measured quantitative values for peptide/protein identifica- there was a higher concordance of interindividual results
tions. Fig. 8 shows an initial evaluation of the technical vari- than across individuals. Interestingly results from measuring
ation and biological variations of human and mouse plasma intraindividual protein levels suggested that certain proteins
samples based on the Pearson correlation of the identified tended to fluctuate more than others, calling into question
peptide intensities between any two individual samples. The the effectiveness of using these proteins as potential dis-
technical replicate results (Fig. 8A; nine individually processed ease markers. Other studies include a report by Zhan and
samples from one pooled reference plasma) show overall Desiderio (108) that showed the heterogeneity in 2D gel
good correlation (0.94 ⫾ 0.02), which suggests relatively good electrophoresis human pituitary proteome analysis and an
reproducibility of the overall analytical platform. The increased interesting review by Mann et al. (109) that overviewed the
variation among human subjects (Fig. 8B) appears obvious on effects of genotypic and phenotypic variations in evalua-
the basis of significantly reduced average correlation coeffi- tions of the hemostatic proteome. They reported that “nor-
cients (0.85 ⫾ 0.06) compared with the technical replicate mal” pro- and anticoagulant concentrations were observed
results; whereas mouse plasma samples (Fig. 8C) show only to vary significantly and influence downstream responses,
slightly reduced correlation (0.92 ⫾ 0.05), which suggests demonstrating how heterogeneity in individual phenotypes
relatively small biological variation in these inbred mouse should influence diagnosis and therapy for hemorrhagic and
models. Such large variations observed among different thrombotic diseases.
Designing experiments to minimize biological variability is of these two approaches may lead to more effective biomar-
imperative for clinical studies. One example is to analyze a ker discovery.
serial sample set, i.e. plasma or biopsy tissue samples, from
the same individual over a time course or disease progres- TARGETED PROTEOMICS APPROACHES
sion; this in theory will alleviate a majority of heterogeneity The majority of proteomics applications in the search for
effects, but such samples are traditionally more difficult to candidate biomarkers to date have been focused on global
obtain in addition to the fact that most patients do not have a proteome characterization focused on identifying multiple
“control” blood or tissue sample in storage for comparison protein differences (candidate biomarkers) that correlate
against a possible disease diagnosis. For most studies that with specific human diseases; however, as discussed pre-
use cross-sectional approaches, it is desirable to match the viously, there are many challenges associated with applying
patients and controls in terms of age, sex, race, weight, and such a strategy to the discovery of low abundance candi-
even diet if possible. A recent study reported the potential date marker proteins. An alternative strategy for biomarker
utility of pooling for reducing the effects of biological variation discovery that complements global profiling is the targeted
in microarray studies while retaining the accuracy of identify- proteomics approach that involves quantitative MS to
ing differentially expressed genes when biological replicates measure a hypothesis-generated list of candidates (112).
are retained in the study design and providing the additional The targeted proteomics strategy often provides greater
benefit of a great reduction in the total number of samples to sensitivity and allows for detection of low abundance can-
be analyzed (110). Such a strategy might be explored and didate proteins. Anderson and Hunter (113) recently dem-
an integral component for an effective LC-MS profiling plat- * Portions of the reviewed research were supported by the United
form suitable for clinical applications. States Department of Energy (DOE) Office of Biological and Environ-
mental Research; the National Institutes of Health through the Na-
CONCLUSIONS AND PERSPECTIVES tional Center for Research Resources Grant RR018522, NIGMS Large
The amount of effort placed into the development and Scale Collaborative Research Grant U54 GM-62119-02, NIDDK Grant
application of effective proteomics profiling of serum/plasma R21 DK070146, and NIDA Grant 1P30DA01562501; the Entertain-
ment Industry Foundation (EIF) and the EIF Women’s Cancer Re-
and other clinical samples has increased tremendously over search Fund; and the Laboratory Directed Research Development
the last several years. With the emergence of more effective program at Pacific Northwest National Laboratory. Our laboratories
LC-MS technologies and the variety of fractionation ap- are located in the Environmental Molecular Sciences Laboratory, a
proaches, the number of proteins detectable in human plasma national scientific user facility sponsored by the DOE and located at
by global profiling has been greatly expanded (e.g. 889 pro- Pacific Northwest National Laboratory, which is operated by Battelle
Memorial Institute for the DOE under Contract DE-AC05-76RL0 1830.
teins with ⬎95% confidence reported in the recent HUPO
The costs of publication of this article were defrayed in part by the
study and 1,494 proteins with ⬎99% confidence, including payment of page charges. This article must therefore be hereby
confident identification of many low ng/ml level plasma pro- marked “advertisement” in accordance with 18 U.S.C. Section 1734
teins, in our recent study (59)). Although this level of detection solely to indicate this fact.
still falls short of the 10 orders of magnitude in dynamic range ‡ To whom correspondence should be addressed: Environmental
Molecular Sciences Laboratory, Pacific Northwest National Labora-
that encompasses plasma protein abundances, it still offers
tory, P. O. Box 999, MSIN: K8-98, Richland, WA 99352. E-mail:
significant potential for the discovery of novel candidate bi- rds@pnl.gov.
omarkers from clinical plasma/serum samples.
18
R. G., and Morrison, R. S. (2005) Biomarkers: mining the biofluid pro- O labeling and the accurate mass and time tag approach. Mol. Cell.
teome. Mol. Cell. Proteomics 4, 409 – 418 Proteomics 4, 700 –709
13. Lee, H. J., Lee, E. Y., Kwon, M. S., and Paik, Y. K. (2006) Biomarker 32. Qian, W. J., Liu, T., Monroe, M. E., Strittmatter, E. F., Jacobs, J. M.,
discovery from the plasma proteome using multidimensional fraction- Kangas, L. J., Petritis, K., Camp, D. G., and Smith, R. D. (2005) Prob-
ation proteomics. Curr. Opin. Chem. Biol. 10, 42– 49 ability-based evaluation of peptide and protein identifications from tan-
14. Wright, M. E., Han, D. K., and Aebersold, R. (2005) Mass spectrometry- dem mass spectrometry and SEQUEST analysis: the human proteome.
based expression profiling of clinical prostate cancer. Mol. Cell. Pro- J. Proteome Res. 4, 53– 62
teomics 4, 545–554 33. Tolley, L., Jorgenson, J. W., and Moseley, M. A. (2001) Very high pressure
15. Hu, Y., Malone, J. P., Fagan, A. M., Townsend, R. R., and Holtzman, D. M. gradient LC/MS/MS. Anal. Chem. 73, 2985–2991
(2005) Comparative proteomic analysis of intra- and interindividual var- 34. Shen, Y., Zhao, R., Berger, S. J., Anderson, G. A., Rodriguez, N., and
iation in human cerebrospinal fluid. Mol. Cell. Proteomics 4, 2000 –2009 Smith, R. D. (2002) High-efficiency nanoscale liquid chromatography
16. Wattiez, R., and Falmagne, P. (2005) Proteomics of bronchoalveolar la- coupled on-line with mass spectrometry using nanoelectrospray ioni-
vage fluid. J. Chromatogr. B Anal. Technol. Biomed. Life Sci. 815, zation for proteomics. Anal. Chem. 74, 4235– 4249
169 –178 35. Shen, Y., Zhang, R., Moore, R. J., Kim, J., Metz, T. O., Hixson, K. K., Zhao,
17. Liao, H., Wu, J., Kuhn, E., Chin, W., Chang, B., Jones, M. D., O’Neil, S., R., Livesay, E. A., Udseth, H. R., and Smith, R. D. (2005) Automated 20
Clauser, K. R., Karl, J., Hasler, F., Roubenoff, R., Zolg, W., and Guild, kpsi RPLC-MS and MS/MS with chromatographic peak capacities of
B. C. (2004) Use of mass spectrometry to identify protein biomarkers of 1000 –1500 and capabilities in proteomics and metabolomics. Anal.
disease severity in the synovial fluid and serum of patients with rheu- Chem. 77, 3090 –3100
matoid arthritis. Arthritis Rheum. 0, 3792–3803 36. Wilm, M. S., and Mann, M. (1994) Electrospray and Taylor-Cone theory,
18. Varnum, S. M., Covington, C. C., Woodbury, R. L., Petritis, K., Kangas, Dole’s beam of macromolecules at last? Int. J. Mass Spectrom. Ion
L. J., Abdullah, M. S., Pounds, J. G., Smith, R. D., and Zangar, R. C. Process. 136, 167–180
(2003) Proteomic characterization of nipple aspirate fluid: identification 37. Smith, R. D., Shen, Y., and Tang, K. (2004) Ultrasensitive and quantitative
of potential biomarkers of breast cancer. Breast Cancer Res. Treat. 80, analyses from combined separations-mass spectrometry for the char-
87–97 acterization of proteomes. Acc. Chem. Res. 37, 269 –278
Andrews, P., Omenn, G. S., and Hanash, S. M. (2005) A wide range of of-flight mass spectrometry approach. Int. J. Mass Spectrom. 212,
protein isoforms in serum and plasma uncovered by a quantitative intact 97–109
protein analysis system. Proteomics 5, 3343–3352 68. Tang, K., Shvartsburg, A. A., Lee, H. N., Prior, D. C., Buschbach, M. A., Li,
50. Tang, H. Y., Ali-Khan, N., Echan, L. A., Levenkova, N., Rux, J. J., and F., Tolmachev, A. V., Anderson, G. A., and Smith, R. D. (2005) High-
Speicher, D. W. (2005) A novel four-dimensional strategy combining sensitivity ion mobility spectrometry/mass spectrometry using electro-
protein and peptide separation methods enables detection of low- dynamic ion funnel interfaces. Anal. Chem. 77, 3330 –3339
abundance proteins in human plasma and serum proteomes. Proteom- 69. Shen, Y., Jacobs, J. M., Camp, D. G., Fang, R., Moore, R. J., Smith, R. D.,
ics 5, 3329 –3342 Xiao, W., Davis, R. W., and Tompkins, R. G. (2004) High efficiency
51. Herbert, B., and Righetti, P. G. (2000) A turning point in proteome analysis: SCXLC/RPLC/MS/MS for high dynamic range characterization of the
sample prefractionation via multicompartment electrolyzers with iso- human plasma proteome. Anal. Chem. 76, 1134 –1144
electric membranes. Electrophoresis 21, 3639 –3648 70. Anderson, N. L., Polanski, M., Pieper, R., Gatlin, T., Tirumalai, R. S.,
52. Tu, C. J., Dai, J., Li, S. J., Sheng, Q. H., Deng, W. J., Xia, Q. C., and Zeng, Conrads, T. P., Veenstra, T. D., Adkins, J. N., Pounds, J. G., Fagan, R.,
R. (2005) High-sensitivity analysis of human plasma proteome by im- and Lobley, A. (2004) The human plasma proteome: a nonredundant list
mobilized isoelectric focusing fractionation coupled to mass spectrom- developed by combination of four separate sources. Mol. Cell. Pro-
etry identification. J. Proteome Res. 4, 1265–1273 teomics 3, 311–316
53. Andersen, J. S., Lam, Y. W., Leung, A. K., Ong, S. E., Lyon, C. E., Lamond, 71. States, D. J., Omenn, G. S., Blackwell, T. W., Fermin, D., Eng, J., Speicher,
A. I., and Mann, M. (2005) Nucleolar proteome dynamics. Nature 433, D. W., and Hanash, S. M. (2006) Challenges in deriving high-confidence
77– 83 protein identifications from data gathered by a HUPO plasma proteome
54. Jin, W. H., Dai, J., Li, S. J., Xia, Q. C., Zou, H. F., and Zeng, R. (2005) collaborative study. Nat. Biotechnol. 24, 333–338
Human plasma proteome analysis by multidimensional chromatography 72. Peng, J., Elias, J. E., Thoreen, C. C., Licklider, L. J., and Gygi, S. P. (2003)
prefractionation and linear ion trap mass spectrometry identification. J. Evaluation of multidimensional chromatography coupled with tandem
Proteome Res. 4, 613– 619 mass spectrometry (LC/LC-MS/MS) for large-scale protein analysis: the
55. Liu, T., Qian, W. J., Strittmatter, E. F., Camp, D. G., Anderson, G. A., Thrall, yeast proteome. J. Proteome Res. 2, 43–50
B. D., and Smith, R. D. (2004) High throughput comparative proteome 73. Tirumalai, R. S., Chan, K. C., Prieto, D. A., Issaq, H. J., Conrads, T. P., and
molecular mass searching of peptide fragments in protein sequence with multidimensional liquid chromatography and tandem mass spec-
databases. Proc. Natl. Acad. Sci. U. S. A. 90, 5011–5015 trometry. J. Proteome Res. 4, 377–386
85. Pappin, D. J., Hojrup, P., and Bleasby, A. J. (1993) Rapid identification of 101. Wang, W., Zhou, H., Lin, H., Roy, S., Shaler, T. A., Hill, L. R., Norton, S.,
proteins by peptide-mass fingerprinting. Curr. Biol. 3, 327–332 Kumar, P., Anderle, M., and Beker, C. H. (2003) Quantification of pro-
86. Yates, J. R., Speicher, S., Griffin, P. R., and Hunkapiller, T. (1993) Peptide teins and metabolites by mass spectrometry without isotope labeling or
mass maps: a highly informative approach to protein identification. spiked standards. Anal. Chem. 75, 4818 – 4826
Analytical Biochemistry 214, 397– 408 102. Chelius, D., and Bondarenko, P. V. (2002) Quantitative profiling of proteins
87. Zimmer, J. S., Monroe, M. E., Qian, W. J., and Smith, R. D. (2006) in complex mixtures using liquid chromatography and mass spectrom-
Advances in proteomics data analysis and display using an accurate etry. J. Proteome Res. 1, 317–323
mass and time tag approach. Mass Spectrom. Rev. 25, 450 – 482 103. Fang, R., Elias, D. A., Monroe, M. E., Shen, Y., McIntosh, M., Wang, P.,
88. Olsen, J. V., and Mann, M. (2004) Improved peptide identification in Goddard, C. D., Callister, S. J., Moore, R. J., Gorby, Y. A., Adkins, J. N.,
proteomics by two consecutive stages of mass spectrometric fragmen- Fredrickson, J. K., Lipton, M. S., and Smith, R. D. (2006) Differential
tation. Proc. Natl. Acad. Sci. U. S. A. 101, 13417–13422 label-free quantitative proteomic analysis of Shewanella oneidensis cul-
89. Dieguez-Acuna, F. J., Gerber, S. A., Kodama, S., Elias, J. E., Beausoleil, tured under aerobic and suboxic conditions by accurate mass and time
S. A., Faustman, D., and Gygi, S. P. (2005) Characterization of mouse tag approach. Mol. Cell. Proteomics 5, 714 –725
spleen cells by subtractive proteomics. Mol. Cell. Proteomics 4, 104. Tang, K., Page, J. S., and Smith, R. D. (2004) Charge competition and the
1459 –1470 linear dynamic range of detection in electrospray ionization mass spec-
90. Gao, J., Opiteck, G. J., Friedrichs, M. S., Dongre, A. R., and Hefta, S. A. trometry. J. Am. Soc. Mass Spectrom. 15, 1416 –1423
(2003) Changes in the protein expression of yeast as a function of 105. Luo, Q., Shen, Y., Hixson, K. K., Zhao, R., Yang, F., Moore, R. J., Mottaz,
carbon source. J. Proteome Res. 2, 643– 649 H. M., and Smith, R. D. (2005) Preparation of 20-m-i.d. silica-based
91. Liu, H., Sadygov, R. G., and Yates, J. R. (2004) A model for random monolithic columns and their performance for proteomics analyses.
sampling and estimation of relative protein abundance in shotgun pro- Anal. Chem. 77, 5028 –5035
teomics. Anal. Chem. 76, 4193– 4201 106. Juraschek, R., Dulcks, T., and Karas, M. (1999) Nanoelectrospray—more
92. Jacobs, J. M., Diamond, D. L., Chan, E. Y., Gritsenko, M. A., Qian, W. J., than just a minimized-flow electrospray ionization source. J. Am. Soc.