MCP Danish Cancer Society
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


Originally published In Press as doi:10.1074/mcp.M600326-MCP200 on March 5, 2007.
This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Supplemental Data
Right arrow All Versions of this Article:
M600326-MCP200v1
6/6/1059    most recent
Right arrow Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when eLetters are posted
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow Glossary
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Heller, M.
Right arrow Articles by Haeberli, A.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Heller, M.
Right arrow Articles by Haeberli, A.
Social Bookmarking
 Add to CiteULike   Add to Complore   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati  
What's this?

Molecular & Cellular Proteomics 6:1059-1072, 2007.
© 2007 by The American Society for Biochemistry and Molecular Biology, Inc.


Research

Compositional Protein Analysis of High Density Lipoproteins in Hypercholesterolemia by Shotgun LC-MS/MS and Probabilistic Peptide Scoring*,S

Manfred Heller{ddagger},§, Evelyn Schlappritzi{ddagger}, Daniel Stalder{ddagger}, Jean-Marc Nuoffer and André Haeberli{ddagger}

From the {ddagger} Laboratory of Thrombosis Research, Department of Clinical Research, University of Bern and Institute of Clinical Chemistry, University Hospital and University Children's Hospital, 3010 Bern, Switzerland


    ABSTRACT
 TOP
 ABSTRACT
 EXPERIMENTAL PROCEDURES
 RESULTS
 DISCUSSION
 CONCLUDING REMARKS
 REFERENCES
 
A protein of a biological sample is usually quantified by immunological techniques based on antibodies. Mass spectrometry offers alternative approaches that are not dependent on antibody affinity and avidity, protein isoforms, quaternary structures, or steric hindrance of antibody-antigen recognition in case of multiprotein complexes. One approach is the use of stable isotope-labeled internal standards; another is the direct exploitation of mass spectrometric signals recorded by LC-MS/MS analysis of protein digests. Here we assessed the peptide match score summation index based on probabilistic peptide scores calculated by the PHENYX protein identification engine for absolute protein quantification in accordance with the protein abundance index as proposed by Mann and co-workers (Rappsilber, J., Ryder, U., Lamond, A. I., and Mann, M. (2002) Large-scale proteomic analysis of the human spliceosome. Genome Res. 12, 1231–1245). Using synthetic protein mixtures, we demonstrated that this approach works well, although proteins can have different response factors. Applied to high density lipoproteins (HDLs), this new approach compared favorably to alternative protein quantitation methods like UV detection of protein peaks separated by capillary electrophoresis or quantitation of protein spots on SDS-PAGE. We compared the protein composition of a well defined HDL density class isolated from plasma of seven hypercholesterolemia subjects having low or high HDL cholesterol with HDL from nine normolipidemia subjects. The quantitative protein patterns distinguished individuals according to the corresponding concentration and distribution of cholesterol from serum lipid measurements of the same samples and revealed that hypercholesterolemia in unrelated individuals is the result of different deficiencies. The presented approach is complementary to HDL lipid analysis; does not rely on complicated sample treatment, e.g. chemical reactions, or antibodies; and can be used for projective clinical studies of larger patient groups.


Functional proteomics aims at identifying and quantifying proteins of biological systems. Often protein quantity is determined as a relative concentration difference between two states, e.g. non-stimulated cells versus stimulated cells. Two-dimensional gel electrophoresis (2DE)1 is capable of displaying thousands of protein spots on one gel. Changes in protein quantities are measured by densitometry of stained spots. Covalent staining of proteins with fluorescent cyanine dyes (DIGE technology) enables comparison of several samples on the same gel (1). More recently, alternative shotgun LC-MS/MS-based approaches have been developed for global proteome analysis to obviate problems inherent to 2DE. When combined with differential isotope labeling of proteins, shotgun proteomics is a fast, sensitive, and powerful technique for comparative proteomics. There are several approaches available for incorporating stable isotope labels, and the most appropriate one depends on sample source and type. When proteome samples are of human origin, such as human plasma and serum, it is not practical to label proteins metabolically (2, 3). Chemical labeling strategies have been developed that offer post-biosynthesis/bioprocess labeling options such as alkylation, esterification, or acetylation (47) as well as endoprotease-catalyzed labeling with 18O at the C terminus of peptides (8, 9).

All the chemical and enzymatic reactions do have shortcomings however. They rely on the assumption that all proteins and their isoforms and all amino acid side chains of one type, e.g. free sulfhydryl group of cysteines, have the same chemical properties and react equally well with any chemical reagent. This is clearly not the case as every protein chemist can confirm. Further disadvantages are high costs and the need to perform pairwise analyses that prevents retrospective or prospective studies with clinical samples. New promising approaches exploit the wealth of data produced by LC-MS/MS of protein digests, such as chromatographic peptide peak intensity (1012), spectrum sampling (SpS) (12, 13), protein abundance index (PAI) (14, 15), or probabilistic peptide scores (PMSS) (16). PMSS is an extension of spectrum sampling. It adds up all the probabilistic peptide scores, as calculated by the OLAV/PHENYX peptide identification engine, that identify the same protein instead of counting just the number of spectra. It can be assumed that peptide concentrations in two different but compositionally similar samples will be the same when the protein concentrations before digestion were the same. In data-dependent LC-MS/MS it can therefore be inferred that a similar number of fragment spectra on these protein-specific peptides will be conducted. Accordingly the number of spectra identifying one protein will become smaller if the concentration in one sample is lower than in the other one. SpS and PMSS are based on this principle, and Colinge et al. (16) have shown that the latter approach compares favorably with the spectral counting approach. Recent publications have presented evidence that integrated ion chromatographic peak areas may be more robust than SpS for comparative protein quantitation especially if only one or two peptides of one particular protein can be identified (12). However, integrated peak areas can be influenced by different effects including ion suppression, by limited ion trapping capacity of mass spectrometers, or simply by the parameters applied to create extracted ion chromatograms, e.g. m/z tolerance, background subtraction, etc. Fundamental to all these approaches are correct peptide identifications.

Absolute protein quantitation in complex protein mixtures can be determined by the staining intensity of a protein spot on 2DE relative to the total intensity on the gel. However, this approach relies on complete separation of all proteins without any loss, or equal losses between repetitions, in the first dimensional gel and an equal staining efficiency for all proteins. Unfortunately these prerequisites are difficult, if not impossible, to achieve with 2DE. Alternatively isotope-labeled synthetic peptides have been used as internal standards with LC-MS/MS (17). Such an approach is costly when many proteins are to be investigated, and difficulties arise in spiking appropriate amounts of synthetic peptides to avoid suppression effects. Mann and co-workers (14, 15) have shown recently that SpS can be used for absolute protein abundance estimation called PAI or emPAI. PAI is calculated by the number of observed peptides (SpS) divided by the number of observable peptides (defined as theoretically possible peptides produced by enzymatic proteolysis of a protein). For absolute protein quantity PAI is divided by the sum of all PAIs. The emPAI is equal to 10PAI 1. The authors found that PAI values correlated better with logarithmic protein concentrations for unknown reasons.

Probabilistic identification scores add some kind of quality measure to each spectrum count because the more ions are available for fragmentation the better the resulting fragment spectrum quality can be, resulting in an increased identification score. Therefore, we pursued the possibility to use probabilistic identification scores for protein quantitation of high density lipoproteins.

Lipoproteins shuttle hydrophobic metabolites, especially lipids, from sites of absorption and production to sites of storage and excretion through the blood system of mammals. Derangements in lipid metabolism can have different causes, and elevated lipid levels in blood are a high risk factor for development of cardiovascular disease, especially if hypercholesterolemia is associated with high LDL cholesterol (LDL-C) and low HDL cholesterol (HDL-C) concentrations. It has been recognized that reduced or elevated levels of certain apolipoproteins can cause life-threatening health conditions, including an increased risk to develop atherosclerosis. For example, apoE–/– mice that are deficient in apolipoprotein E, a ligand of lipoprotein receptors, develop atherosclerotic lesions resembling those observed in humans (18). Elevated HDL-C is considered as antiatherogenic due to its reverse cholesterol transport function. It has been postulated that a specific HDL subclass enriched in apoA-I (LpA-I) is the main antiatherogenic factor. However, contradictory literature about this issue exists as reviewed by Alaupovic (19). The different results published might be explained by the use of different antibody-based immunological techniques with different antibodies and standards to measure apolipoprotein concentrations. Possibly due to such controversies, in clinical chemistry laboratories lipoproteins are still characterized only through lipid measurements and not through their protein composition.

Here we present results of a pilot study on protein compositional analysis of total HDL based on antibody-free mass spectrometry technology. The data presented in this study indicate the possibility that such an approach has the potential to clarify existing controversial results, reveals new facts about lipid metabolism mechanisms, and might help to define atherosclerotic risk factors based on HDL protein composition rather than cholesterol levels alone.


    EXPERIMENTAL PROCEDURES
 TOP
 ABSTRACT
 EXPERIMENTAL PROCEDURES
 RESULTS
 DISCUSSION
 CONCLUDING REMARKS
 REFERENCES
 
All reagents were at least of analytical purity grade. Potassium bromide, Coomassie Brilliant Blue R-250, glycine, TFA, and formic acid (FA) were from Merck (VWR International, Dietikon, Switzerland). Sodium chloride, EDTA, boric acid, DTT, iodoacetamide, and ammonium bicarbonate were from Fluka (Buchs, Switzerland). Bromphenol blue and Tris were from Sigma. Acrylamide-Bis solution 37.5:1 (Serva) and sequencing grade modified trypsin (Promega) were purchased from Catalys (Wallisellen, Switzerland). LC-MS grade (Chromasolv) ACN was from Riedel-de-Haën/Fluka (Buchs, Switzerland). Deionized water was in-house prepared on a Milli-Q apparatus (Millipore, Volketswil, Switzerland).

Standard Protein Mixtures—
Bovine serum albumin (ALBU_BOVIN, P02769, molecular mass = 66,433 Da) was from Merck. Bovine fetuin (FETUA_BOVIN, P12763, molecular mass = 36,353 Da) and chicken lysozyme (LYSC_CHICK, P00698, molecular mass = 14,313 Da) were from Calbiochem (VWR International). Recombinant human growth hormone (SOMA_HUMAN, P01241, molecular mass = 22,129 Da) from Novo Nordisk Pharma was generously provided by P. Mullis (Department of Clinical Research, University of Bern, Switzerland). Chicken actin (ACTS_CHICK, P68139, molecular mass = 41,817 Da), jack bean concanavalin A (CONA_CANEN, P02866, molecular mass = 25,616 Da), bovine insulin (INS_BOVIN, P01317, molecular mass = 5,739.6 Da), bovine ß-lactoglobulin {alpha}- and ß-form (LACB_BOVIN, P02754, molecular mass = 18,281 Da), horse holomyoglobin (MYG_HORSE, P68082, molecular mass = 17,568 Da), and rabbit parvalbumin (PRVA_RABIT, P02624, molecular mass = 11,934 Da) were from Sigma. A stock solution of each standard protein was prepared in 20 mM Tris/HCl, pH 8.0, 50 mM NaCl (except SOMA that came as a 10 mg/ml solution), and aliquots were stored at –40 °C. The exact protein concentration of each solution was determined by quantitative amino acid analysis at the Analytical Research and Analysis Facility of the University of Bern (Johann Schaller, Department of Chemistry and Biochemistry, University of Bern, Bern, Switzerland). Defined amounts of these proteins were mixed to create five different mixtures with ALBU, CONA, FETUA, LACB, LYSC, MYG, and THYG building a stable matrix with ACTS, PRVA, INS, and SOMA added in at different ratios as given in Table I. PRVA was contaminated with 17.2% (w/w) triose-phosphate isomerase (TPIS_RABIT, P00939, molecular mass = 26,625 Da).


View this table:
[in this window]
[in a new window]

 
TABLE I Theoretical and PMSSI measured protein composition of five standard protein mixtures with protein concentrations given as percent weight

Shaded in light gray are proteins kept as a constant matrix in all five mixtures. TPIS, triose-phosphate isomerase; theor., theoretical.

 
Blood Samples—
Plasma samples of hypercholesterolemia (HC) donors were from related subjects C1, C2, and C3 and unrelated subject C5 with severe hypercholesterolemia (high LDL-C, low HDL-C) who were treated with LDL lipopheresis and three individuals not treated with LDL lipopheresis, namely C4 with high LDL-C/low HDL-C and the two sisters C6 and C7 with high LDL-C and high HDL-C (serum lipid levels are given in Table II). The control, normolipidemia (N) group consisted of nine donors named N1 through N9. Donors N1 and N2 provided two different blood samples each that were collected several months apart, denoted as N1-2 and N2-2, respectively. Furthermore N3 was a pool of plasma from >10,000 healthy donors kindly provided by ZLB Behring AG (Bern, Switzerland). Apart from the N3 sample, all blood samples were collected in 9-ml EDTA KE S-Monovette tubes (Sarstedt, Sevelen, Switzerland) from donors giving their consent on the use of their blood for this study. Cells were removed by centrifugation for 20 min at 2,000 x g at 20 °C. Protease activity in plasma supernatant was inhibited by the addition of protease inhibitor Complete solution (Roche Diagnostics) according to the manufacturer's recommendations. Aliquots of 1–1.5 ml of plasma supernatant were distributed in 2-ml polypropylene tubes with screw caps and stored at –40 °C until use. Serum lipid levels or HDL-C were determined from serum or isolated HDL, respectively, by homogenous, colorimetric, enzymatic assays from Roche Diagnostics on a Roche/Hitachi Modular analyzer according to standard operating protocols (Roche Diagnostics) at the Institute of Clinical Chemistry of the University Hospital in Bern, Switzerland.


View this table:
[in this window]
[in a new window]

 
TABLE II Characterization of donor plasma and isolated HDL samples used in this study

f, female; m, male; Chol, cholesterol; TG, triglyceride.

 
Isolation of Lipoproteins by Salt Gradient Ultracentrifugation—
Sudan Black (0.2% in DMSO) was added to plasma at a final concentration of 0.008% (w/v) and incubated for 30 min at room temperature. Unbound dye and any particulate material were removed by centrifugation for 5 min at 16,000 x g. The density of 1 ml of plasma was adjusted to 1.24 kg/liter by the addition of 381.6 mg of dry potassium bromide (KBr). Polyallomer Quick-Seal® tubes (Beckman) were filled by placing subsequently the following solutions at the bottom of the tube: first 1.08 ml of salt solution of density 1.06 kg/liter (KBr dissolved in 0.9% (w/v) NaCl, 0.1% (w/v) EDTA, pH 7.4), then 1.0 ml of salt solution of density 1.12 kg/liter, and finally 1 ml of plasma solution of density 1.24 kg/liter. Tubes were filled to the top with the 1.06 kg/liter salt solution, sealed, and centrifuged with a TLA-100.3 rotor in a Beckman TL-100 centrifuge for 4 h at 541,000 x g. After centrifugation, the main HDL population was visible as a ring in the middle of the tubes. Tubes were punctured 1.7 cm below the upper rim of the tubes with a needle attached to a 1-ml syringe, and 1 ± 0.05 ml was aspirated into the syringe. The average density of this HDL fraction was 1.12 kg/liter. The HDL preparations were desalted and concentrated in 50 mM ammonium bicarbonate as described previously (20). Final protein concentration was measured by A280 nm on a NanoDrop® UV/visible spectrophotometer (NanoDrop, Wilmington, DE). One optical density unit was considered to be equivalent to a protein concentration of 1 mg/ml. Errors on absorption values due to light dispersion by HDL particles were not detected by recording A280 nm values of serial dilutions of HDL.

Gel Electrophoresis and Protein Identification—
Desalted HDL preparations were analyzed by 1DE and/or 2DE, protein bands from 1DE or protein spots from 2DE were cut from gels, and proteins were digested with trypsin and identified by LC-MS/MS as described previously (20).

Shotgun LC-MS/MS—
Protein samples were diluted to 1.0 mg/ml with 50 mM ammonium bicarbonate. An aliquot of 5 µl was denatured by addition of 1 µl of 0.3 M DTT in water and 6 µl of ACN followed by incubation for 30 min at 40 °C. Samples were diluted with 18 µl of 50 mM ammonium bicarbonate containing 5 mM CaCl2, and digestion of proteins was started by addition of 2 µl of trypsin solution (50 ng/µl) and incubation at ~30 °C on an orbital thermoshaker 5437 (Eppendorf, Dübendorf, Switzerland). After 6 h a new trypsin aliquot was added to complete the digestion overnight. Trypsin activity was quenched by dilution with 0.1% formic acid to a final protein concentration of 33 ng/µl. When not stated differently, 500 ng of protein digest was loaded onto a self-made microbore column (0.15-mm inner diameter x 60-mm length) at a flow rate of ~4 µl/min solvent A (0.1% FA in water/ACN (98:2)). Columns were packed with GROM-SIL 300 Octyl-6 MB, 5 µm, reversed phase material (Grom GmbH, Rottenburg-Haiflingen, Germany). Columns were developed by a biphasic ACN gradient of 0–5% solvent B (0.1% FA in water/ACN (4.9:95)) in 1 min followed by 5–40% solvent B in 60 min at a flow rate of ~3 µl/min. The column effluent was directly coupled to an Esquire3000+ ion trap mass spectrometer from Bruker Daltonics (Bremen, Germany) via a capillary ESI source operated at 3,700 volts. The gain control (ion charge control) was set to 30,000 with a maximum accumulation time of 200 ms. CID was triggered on the two most abundant, not singly charged peptide ions in the m/z range of 360–1400. Precursors were set in an exclusion list for 0.5 min. CID spectra interpretation was performed with PHENYX on the Vital-IT server operated by GeneBio (Geneva, Switzerland) against the latest release of the UniProt-Swiss-Prot protein database allowing variable modifications of Met oxidation, Asn/Gln deamidation, and formylation of free amino groups. Parent and fragment mass tolerances were set to 2.0 and 0.8 Da, respectively. Up to three missed cleavages and half-tryptic peptides were allowed. All peptide identifications on proteins known to be present in the analyzed sample reaching a p value of ≤0.00001 were accepted for protein quantitation calculations. We have shown previously that peptide identifications with higher p values are essentially false positive identifications (21). Identification results were extracted into Microsoft Excel by means of an in-house developed perl script. Each sample was analyzed at least four times. Protein identification of the four LC-MS/MS runs of the synthetic protein mixture 1 was additionally performed with SEQUEST and PeptideProphet validation with the Trans-Proteomic Pipeline software suite version 2.8 (D. Goodlett laboratory, University of Washington, Seattle, WA) and MASCOT (Matrix Science) by using the same search parameters.

Absolute Protein Quantitation, PMSSI—
The total protein score was calculated by summing all peptide Z-scores belonging to the same protein (PMSS). PMSS values were corrected by multiplication by the molecular weight of the native protein (considering no post-translational modifications) as given on the EXPASY website (www.expasy.org) and division by the number of theoretically observable peptides, which was defined as peptides following the trypsin cleavage rules (C-terminal to lysine and arginine, not if proline at P1' position) with zero or one missed cleavage and having a molecular mass between 720 and 3,000 Da (equals the typical mass range of peptide identifications achieved on our setup).

These criteria diverge from what we used as an allowance criteria of positive identification during database search (which was optimized for the most complete list of peptide identifications). We chose these criteria for observable peptides nevertheless because >91% of all peptide identifications were on fully trypsin-specific peptides with zero or one missed cleavage (protein-dependent). Most of the remaining identifications were based on missed cleavages due to in-source fragmentation of fully tryptic peptides. Some chymotryptic cleavages were also observed. These nonspecific trypsin identifications are not predictable; hence they cannot be used to define a generalized criterion for observable peptides. However, these deviations did not contribute significantly to the final results.

The weight fraction percentage (% w/w) or peptide match score summation index (PMSSI) of each protein was calculated by dividing the corrected PMSS value of each individual protein by the sum of all corrected PMSS values essentially as published by Ishihama et al. (15). Peptide ion peak areas were extracted with a visual basic script run on the Bruker DataAnalysis software, version 3.1, with background noise subtraction.

Determination of ApoA-I by Capillary Electrophoresis with UV Detection at 200 nm (CE-UV)—
Samples were solubilized in 2% SDS, and phenylalanine as internal standard was added (1 mg/ml final concentration). After incubation in a boiling water bath for 3 min, the samples were diluted five times in electrophoresis buffer: 50 mM sodium borate, pH 9.1, 0.2% SDS, 5% methanol. Capillary electrophoresis was performed in an open capillary with inner diameter of 50 µm and length of 50 cm at 30 kV and 30 °C for 20 min on a 270HT CE system (Applied Biosystems). UV detection occurred on line at a wavelength of 200 nm. Absolute apoA-I quantitation was performed by comparing the area ratios of the apoA-I and the phenylalanine (internal standard) peak using a linear calibration curve. Purity in percent (%) was calculated from the ratio of the peak areas between apoA-I and the total area of all other protein peaks. All samples were analyzed twice.

Statistical Testing—
The non-parametrical, one-way analysis of variance test by Kruskal-Wallis using {chi}2 statistics was used to test whether protein concentrations between the samples of the hypercholesterolemia group and the normolipidemia group were statistically different at a significance level of 95%. Statistical testing was performed on the means of all LC-MS/MS experiments and multiple HDL isolations recorded for each blood sample. Non-parametrical statistics was used because the measured values from the limited population size in this study were not normally distributed as determined by a graphical quantile-quantile plot.


    RESULTS
 TOP
 ABSTRACT
 EXPERIMENTAL PROCEDURES
 RESULTS
 DISCUSSION
 CONCLUDING REMARKS
 REFERENCES
 
Protein Quantitation by Probabilistic Identification Scores—
Evaluation of PMSS with a comparison with SpS for comparative proteomics was published recently (16). Mann and co-workers (14, 15) corrected the number of observed peptides by dividing spectrum counts (SpS) through the number of observable peptides per protein and named it PAI. PAI was shown to correlate well with the absolute protein concentration in the analyzed protein mixture. We sought to combine the PMSS and PAI approach to determine absolute protein concentrations of our clinical samples and named it PMSSI. We prepared five protein mixtures of variable composition with a total of 12 different proteins. Seven proteins served as a constant matrix, whereas the five others were added at different concentrations (Table I). We compared the relationship between spectrum counts and protein scores derived from the three search engines PHENYX, SEQUEST with PeptideProphet validation, and MASCOT. As presented in Fig. 1, there was a linear relationship between PMSS and SpS. Interestingly the correlation of the SEQUEST/PeptideProphet results became almost perfect by applying an acceptance threshold of ≥0.8 probability, whereas there was no improvement by applying more stringent thresholds on MASCOT search results by either using an ion score of >24 or an expect value of ≤1, respectively (the latter result is not shown). PHENYX Z-score correlation remained unchanged (R2 = 0.995) after applying a more stringent acceptance criteria in the form of a p value threshold of ≤10–8. This result indicated that true probabilistic Z-scores, as calculated by PHENYX, are as good to represent protein abundance as SpS and correlated better than MASCOT ion or PeptideProphet probability scores when only first rank identification was used as the sole acceptance criteria for identification.


Figure 1
View larger version (13K):
[in this window]
[in a new window]

 
FIG. 1. Correlation between SpS and PMSS derived from PeptideProphet-validated SEQUEST, PHENYX, and MASCOT search engines. Shown are the protein identification scores from all software tools tested (PeptideProphet/SEQUEST probability score, inverted triangles; PHENYX Z-score, open circles; MASCOT ion score, black dots) of the 12 proteins present in mixture 1, and peptide spectrum sampling positively correlated in a linear relationship. Panel A, all first ranking identifications on proteins known to be present in the sample were considered without any other threshold criteria applied, resulting in correlation coefficients of 0.956 for PeptideProphet/SEQUEST, 0.994 for PHENYX, and 0.975 for MASCOT. Panel B, the correlation was significantly increased to 0.9996 in the case of PeptideProphet/SEQUEST when only identifications with a probability score of ≥0.8 were considered. Applying score thresholds with a corresponding p value of ≤10–8 in the case of PHENYX or an ion score threshold of >24 for MASCOT did not result in improved correlation coefficients (0.995 and 0.973, respectively). However, by applying more stringent acceptance criteria, potentially correct peptide identifications were removed; relatively more were removed with SEQUEST/PeptideProphet and MASCOT than with PHENYX. Logarithmic scales were chosen for display purposes with the given correlation coefficients calculated on non-logarithmic data.

 
We then used the data acquired on the five different protein mixtures to compare the performance of Z-score with peak area-derived protein abundance determination. The summed peptide Z-scores of three different peptides each from the four proteins MYG, LYSC, PRVA, and SOMA of mixtures 1 and 2 were compared with the corresponding peak areas (Supplemental Table 1). The summed peptide Z-scores and the peptide peak areas of each of the four acquisitions were plotted against the theoretical concentrations. Two very similar plots resulted as seen in Fig. 2, panels A and B. There was a correlation between summed Z-scores and peak areas as illustrated in Fig. 2, panels C and D. However, it became apparent by the group of four triangles with elevated values in panels AC that peptides of PRVA caused outliers. The N-terminal peptide AMTELLNAEDIKK, acetylated at the N terminus of alanine, produced more intense peaks as well as summed Z-scores than the other two peptides of PRVA. It turned out that PRVA was incompletely digested by our digestion procedure. A chromatographic peak with a charge envelope spanning from the +9 to the +18 ion was observed at the end of each LC-MS separation gradient whose deconvoluted mass peak of 11,977 Da corresponded exactly with the theoretical molecular weight of acetylated PRVA. Furthermore the integrated peak area of all detected charge states increased linearly with PRVA concentration (result not shown).


Figure 2
View larger version (24K):
[in this window]
[in a new window]

 
FIG. 2. Comparison of label-free protein quantitation by LC-MS peak area and PHENYX protein Z-scores. In panels A and B the response in terms of Z-scores (panel A) and peak area (panel B) as a function of the theoretical protein concentration (% w) in mixture 1 were compared, illustrated by three peptides (sequences in Supplemental Table 1) for each of the proteins MYG (black dots), LYSC (open triangles), PRVA (open circles), and SOMA (filled inverted triangles), respectively. Peak areas were integrated from extracted ion chromatograms with the Bruker DataAnalysis software version 3.1, allowing a mass tolerance of +0.7/–0.3 m/z units. Panel C represents a correlation analysis between the summed peptide Z-scores from the same three peptides of each protein, as used for panels A and B, with the corresponding peptide peak areas based on all data from mixtures 1 and 2. The same correlation analysis was performed with the tryptic peptide of INS (FVNQHLCGSHLVEALYLVCGER) from all five protein mixtures (panel D). In panel E peak area ratios (mixture 1 (Mix1) in relation to mixtures 2–5) of one representative peptide of each of the proteins used were compared with the ratios calculated from the theoretical protein concentration in each mixture. Peak area ratios between 0.2 and 3 correlated well with theoretical concentration ratios; hence ratios of PMSSI-based protein concentration calculations in this range were compared with the corresponding peak area ratios in panel F. Note that CONA values were excluded in this evaluation because of problems with CONA digestion.

 
Other factors influencing peptide signal responses could be different ionization efficiencies at different eluent compositions during the LC run, retention time shifts altering the surrounding matrix of analytes together with eluent composition, limitations on the number of trapped ions by the dynamic ion gain control of the ion trap, inaccuracies of the sample loading mechanism on the HPLC instrument (partial loop filling), and LC-MS system saturation effects. The latter was very obvious when increasing amounts of protein digests were loaded onto the column. Summed protein Z-scores did increase linearly when column loading was increased from 50 to about 500 ng of protein digest. At higher amounts of loaded protein the system became saturated, and no further increase in summed Z-scores together with no additional identifications were possible (Supplemental Fig. 1). Based on this finding, we aimed at always loading 500 ng of protein digest for comparative protein quantitation measurements.

Despite certain signal variations, calculated ratios between peak areas of one representative peptide per protein in mixture 1 and the other four mixtures correlated with the ratios calculated from the theoretical concentrations (R2 = 0.83) (Fig. 2, panel E). This correlation was actually excellent in the range between 0.2 and 3, corroborating the conclusions by others that peak areas can be used for comparative proteomics. We were able to show that within this ratio range the peak area ratios correlated well with PMSS ratio calculations too (R2 = 0.89) (Fig. 2, panel F, and Supplemental Table 1). In summary, peak area ratios are a good tool to validate Z-score-based protein ratios; the latter is quite accurate when dealing with protein concentration differences not bigger than 3–5. At higher ratios, the score-based calculation tends to underestimate the real differences, and proteins with only one or two peptide identifications cannot accurately be quantitated. The underestimation of elevated concentration differences can be explained by a 10-fold smaller dynamic range of peptide identification scores with finite numbers spanning at most 2 orders of magnitude compared with integrated LC-MS peak areas, which can cover 3 orders of magnitude on an ion trap mass spectrometer.

The PMSSI-calculated absolute protein quantities did compare well with the theoretical values as demonstrated in Fig. 3 and Table I. Exceptions were MYG and SOMA, which were determined at higher than theoretical concentrations, and CONA and PRVA, which were detected with lower concentrations than expected. Most tryptic peptides of MYG and SOMA were detected in their +2 and +3 charge state; hence each peptide could be identified twice resulting in higher scores compared with proteins from which peptides were detected most often only with one charge state. CONA and PRVA were only partially digested because only a few peptides could be detected sporadically. Incomplete digestion of a protein is a common fact in protein chemistry and will result not only in a decreased estimation of the concentration of the affected protein but ultimately to an overestimation of the concentration of all other proteins in the sample. However, this error source will apply to all other shotgun LC-MS/MS approaches too. Further validation of the score-based absolute protein quantitation was achieved with HDL samples as outlined further below.


Figure 3
View larger version (15K):
[in this window]
[in a new window]

 
FIG. 3. Correlation analysis between PMSSI-derived protein concentrations and theoretically calculated values. All data points (measured PMSSI concentration versus theoretical concentration) of proteins present as a constant matrix were plotted in panel A. Panel B is the corresponding graph for all proteins that varied in concentration between the five different mixtures. Deviations from the ideal diagonal line of the measured values are discussed in the text. TPIS, triose-phosphate isomerase.

 
Isolation of HDL from Human Plasma—
Initially we isolated HDL by KBr density gradient ultracentrifugation in 33-ml tubes from several milliliters of plasma with unloading of the tubes by aspiration of the content from the bottom and collection of fractions (20). Although this approach offered the possibility of separating HDL into different density classes, several other disadvantages were encountered, like difficulties with reproducibility, long and cumbersome manipulation steps, long centrifugation times, and significant contamination by plasma proteins, respectively. We improved the isolation protocol by using a bench top ultracentrifuge with 3.5-ml tubes, where only 1 ml of plasma was needed, and a three-step KBr density gradient allowing the reproducible harvest of 1 ml of HDL particles with an average density of ~1.12 kg/liter. We also tested the possibility of isolating HDL by immunoaffinity column chromatography with a mouse anti-human apoA-I antibody coupled to Sepharose beads (R. James, University of Geneva, Switzerland). However, such preparations seemed to be composed of a specific subpopulation differing clearly in protein composition from any other HDL class isolated with our earlier method (20). Additionally the immunoaffinity preparations were contaminated with abundant plasma proteins including human immunoglobulins and apolipoprotein B among others (results not shown).

We determined the average HDL yield in the isolated HDL density class to be 37 ± 7% based on the measurement of the HDL-C concentration of 12 different HDL preparations before and after isolation (result not shown). Losses of HDL were indicated by a second Sudan Black-stained ring with a density intermediate between HDL and LDL. Analysis by SDS-PAGE and shotgun LC-MS/MS of this second ring revealed a lipoprotein population that was enriched in apoA-I, C apolipoproteins, and apoE; depleted in apoA-II, apoD, and serum amyloid protein A4 (SAA4); and slightly contaminated with apoB (<1% by weight), respectively (not shown). This additional HDL population corresponded therefore to HDL of lower density and was not included in this study together with HDL of very high density.

Further Validation of Absolute Protein Quantitation by PMSSI—
ApoA-I is the main constituent of HDL. We used CE to separate apoA-I from all other proteins present in HDL and for detection of protein peaks at 200 nm. The apoA-I purity measurements of two different HDL preparations and four different KBr density gradient ultracentrifugation preparations of reconstituted HDL (rHDL) were compared with PMSSI values. As illustrated in Fig. 4, the apoA-I concentrations measured by these two independent methods correlated very well (correlation coefficient of 0.94). Furthermore we analyzed by PMSSI and gel densitometry six different mixtures composed of progressively diluted rHDL (regarded as an apoA-I standard with a purity of ~93%) in a matrix of six proteins, ALBU, FETUA, LACB, LYSC, SOMA, and THYG (Supplemental Fig. 2). ApoA-I concentrations as well as the concentrations of the matrix proteins, estimated by PMSSI, linearly correlated with the corresponding theoretical concentrations (Supplemental Fig. 2), whereas gel densitometry estimations were consistently much lower than expected. There are different possible causes for this discrepancy. Determination of apoA-I from purified HDL by gel image densitometry proved to be difficult because of the following. First, apoA-I co-migrated consistently with all other proteins on 1DE or 2DE as revealed by co-identification of apoA-I in all other spots or bands (20). Second, the low molecular weight proteins apoA-II and apoC-I were not stained representatively on gels. Third, apoA-I bands/spots were very intense making accurate band/spot density detection difficult (Supplemental Fig. 2). Fourth, although apoA-I densitometry data from samples run on one gel were comparable with each other, they were not when compared between different gels (Supplemental Fig. 3). This can be explained by non-reproducible polyacrylamide polymerization effects between gels resulting in minor changes of protein migration and possible staining inhomogeneities between gels, although the same staining procedure was carried out each time. Fifth, apoA-I staining intensities were close to saturation for SYPRO RubyTM when 15 µg of protein was loaded per lane. Loading of less protein could circumvent this problem, although the detection of less abundant protein species would be compromised resulting in an overestimation of apoA-I concentration.


Figure 4
View larger version (10K):
[in this window]
[in a new window]

 
FIG. 4. Validation of LC-MS/MS-derived apoA-I concentration of HDL by CE-UV. Two different HDL and four different rHDL samples prepared from density fractions of an ultracentrifugation run were analyzed by LC-MS/MS and CE-UV. The apoA-I purity values with ±1 S.D. are shown. Linear regression curve fitting with forced intercept at origin resulted in a correlation coefficient of 0.94 and is shown as a dotted line.

 
Absolute Protein Quantitation of HDL—
Applying the PMSSI approach, we determined the protein composition of HDL particles isolated from plasma samples with varying cholesterol levels. Additionally all HDL preparations were analyzed by 1DE and some also by 2DE (Supplemental Figs. 3 and 4). All samples contained traces of serum albumin but were void of any high molecular weight apoB, a marker for very low density lipoprotein or LDL contamination.

With gel electrophoresis, we could detect and identify a total of 16 different proteins in isolated HDL. The nine most abundant ones were reproducibly identified and quantified by shotgun LC-MS/MS (Fig. 5 and Supplemental Figs. 3 and 4). The relative S.D. values within the four LC-MS/MS repetitions were exponentially and inversely related to the actual abundance of a given protein. Relative S.D. medians ranged from 83 to 34 to 4% for protein abundances of 0.9% (apoC-II) to 1.7% (apoE) to 62% (apoA-I), respectively. All measured protein concentrations were plotted in Fig. 5 with each panel representing one protein with the HC group against the N group. Kruskal-Wallis non-parametric statistical tests revealed a statistically highly significant (p < 0.01) reduced concentration for apoA-I in the HC group and increased concentrations for apoC-I, apoC-III, and apoE, respectively. As there were only few samples tested in this pilot study, these differences cannot be regarded as representative for the entire population; however, they were in agreement with the clinical lipid measurements (see "Discussion"). Relative albumin concentrations did not differ between the groups. This observation corroborates our previous notion that albumin most likely associates stoichiometrically with HDL particles (20). If albumin were dispersed after ultracentrifugation in a gradient-like manner with highest concentration at the bottom and lowest concentration at the top of centrifugation tubes, the albumin relative to HDL protein concentration would not be the same. This was obviously not the case.


Figure 5
View larger version (19K):
[in this window]
[in a new window]

 
FIG. 5. Scatter plots and statistical analysis of apolipoprotein concentrations in HDL. The means of all PMSSI values of the nine major HDL proteins of the HC and N groups were compared by the non-parametric Kruskal-Wallis one-way analysis of variance test, and all experimental data points derived from all LC-MS/MS measurements were included in these plots. The test results in terms of p values are given above each scatter plot in parentheses.

 
By comparing protein levels of individual samples, it became evident that HC samples C6 and C7 with high HDL-C resembled more the N group samples in terms of total HDL protein and protein composition than the other five HC members. Furthermore the four plots in Fig. 6 show that the three members with a familial hypercholesterolemia, C1, C2, and C3, formed a group with significantly less apoA-I, and consequently less total HDL protein too, but more apoE relative to all other samples, whereas C4 and C5 were in transition between normolipidemia and the C1, C2, C3 group.


Figure 6
View larger version (14K):
[in this window]
[in a new window]

 
FIG. 6. Correlation of lipid levels with absolute HDL protein concentrations in individual HDL preparations. Two measured parameters were correlated in these scatter plots and a third, denoted in the upper left or right corner of each plot, is represented by the diameter size of each circle. Protein levels are given as absolute concentrations in mg/liter of plasma except for apoA-II in the lower left panel where percent weight of total HDL protein mass was used. HC group members were depicted in orange with different border line thickness for the three family members C1, C2, and C3, white for C4 (thick border line) and C5, dark gray and bright gray for the two siblings C6 and C7, respectively. N group members are in blue tones for females, red, ochre, and magenta tones for males, black for pooled plasma N3, and borders around filled circles indicate replicates from the same donor, respectively. TG, triglyceride.

 
From two individual donors of the N group two different blood draws were taken 9 or 6 months apart, labeled N1 and N1-2, or N2 and N2-2, respectively. With samples N2 and N2-2 two different HDL isolations were performed 10 and 4 months apart, respectively. Despite these time intervals the four N2 HDL protein profiles were very similar, clustering into the same three-dimensional space within the four different plots of Fig. 6 (actual values are given in Supplemental Table 2). One exception was apoD whose measured concentrations in the second analysis of each plasma sample were only about 50% of what was determined with the fresh plasma sample. This result might indicate that apoD association with HDL is sensitive to prolonged storage of plasma samples at –40 °C. Otherwise the PMSSI results indicated that this approach is reproducible, thus making shotgun LC-MS/MS of HDL, with probabilistic identification score-based absolute protein quantitation, a powerful method for the study of clinical samples over extended periods of time.


    DISCUSSION
 TOP
 ABSTRACT
 EXPERIMENTAL PROCEDURES
 RESULTS
 DISCUSSION
 CONCLUDING REMARKS
 REFERENCES
 
The exploitation of chromatographic information from shotgun LC-MS/MS for protein quantitation has emerged recently as a valid alternative to the different isotope labeling strategies. Peptide ion peak integration and spectral sampling have the advantage that no chemical treatment of samples and no expensive isotope-labeled reagents are required. Furthermore the accurate comparison of two differentially isotope-labeled peptide mass signal intensities is generally limited to a difference of about 10 even with the use of higher resolution mass spectrometers like time-of-flight instruments (9). Integrated peak ratios appear to better represent real protein abundance ratios (Fig. 2, panel E), and peak intensity differences with 3 orders of magnitude between two different LC-MS runs are theoretically quantifiable. Spectral sampling can also successfully be used for relative protein abundance measurements (12, 14). We replaced the spectral counts with actual true probabilistic peptide identification scores as suggested by Colinge et al. (16) and were able to show that score-based abundance ratios do behave similarly to integrated peak areas (Figs. 1 and 2). Because true probabilistic identification scores correlate perfectly with spectral sampling, it is also possible to calculate absolute protein abundance through protein scores in accordance to PAI as published by Mann and co-workers (14, 15). Thus, all valid peptide identifications from one LC-MS/MS run determine the total mass of digestible protein present in the sample, making it straightforward to calculate absolute and relative protein abundances (Fig. 3).

However, the method depends on sample complexity, instrument configuration, and LC separation performance. Automatic gain control or ion charge control, as it is called by Bruker, maximizes the trap filling of a selected precursor ion. In theory, fragment spectral quality should be relatively independent of changing precursor ion intensity over the course of a chromatographic peak unless the precursor ion is of low abundance making maximal trap filling impossible within the set maximum ion accumulation time. As a result, fragment spectra of low abundance precursors have generally lower intensities (smaller signal-to-noise ratios) that might result in lower identification scores as opposed to high abundance precursors with good signal-to-noise fragment spectra. The ion trap was programmed to select the two most intense, not singly charged precursors during any full MS scan for subsequent fragmentation (signals one and two). These two precursors were then placed in an exclusion list for 30 s. In the meantime, MS/MS was performed on the two next most intense peaks and so on. However, if all available MS signals had already been submitted for MS/MS before the 30-s exclusion period of signals one and two was over, the instrument returned to them for lack of other suitable precursors. Additionally the third isotopic peak of a high abundance precursor can also be chosen for fragmentation due to its high signal intensity. Therefore, the PMSSI approach is based on the fact that despite the dynamic exclusion list the precursor ion of a particular peptide, being more abundant in sample A than B, is selected, on average, more in sample A than B. Our results did suggest that this was indeed the case if the complexity of the peptide population desorbing from the reversed-phase column is low enough during the entire LC run. Thus, the reversed-phase column development must be adapted to the complexity and the amount of protein digest loaded as well as the scanning speed of the instrument. Indeed it was possible to show with two different samples, each composed of not more than 16 proteins, that a loading of 500 ng of protein digest and an ACN gradient of 60 min was required for best performance (Supplemental Fig. 1). A shortened 40-min gradient resulted in missing a substantial amount of the low abundance peptides that resulted in a changed protein composition biased in favor of the more abundant proteins. A prolongation of the gradient to 80 min had no influence on the outcome compared with the 60-min gradient (results not shown). In conclusion, low abundance precursor ions, derived from low abundance proteins, are stochastically underrepresented by the SpS and PMSSI methods; hence the accuracy of their quantitation suffers as seen by the high relative S.D. values (Supplemental Table 2).

As illustrated in Figs. 3 and 4, PMSSI worked reliably with the exception of proteins that were (a) hydrolyzed by trypsin to well ionizing peptides always forming a doubly and triply charged ion within the m/z range of 500–1,000 during the electrospray ionization process (for example, MYG and SOMA) or (b) incompletely digested proteins like PRVA and CONA. It needs to be mentioned that these deviations will also influence the results of isotope-labeled samples. The comparative labeling strategy has its advantage in the fact that two (with the iTRAQTM currently up to four) samples are compared in a single LC-MS/MS run. However, all labeling strategies rely on chemical reactions that are difficult to equalize with different samples, and it is always a pure comparative approach for only few samples at a time. The PMSSI approach does not rely on any chemical reaction, and it is possible to measure as many replicates as needed, dispersed over extended periods of time, as long as standard operating protocols are followed meticulously (Fig. 6).

We chose a well established HDL isolation protocol based on floatation in a density gradient. This protocol does not allow the separation of HDL subpopulations that differ in apolipoprotein composition, like LpA-I (lipoprotein enriched in apoA-I) or LpA-I/A-II (lipoprotein enriched in apoA-I and apoA-II) (19). However, it allowed the reproducible isolation of HDL particles with an average density of 1.12 kg/liter that were free of any significant contamination by low density lipoproteins or other plasma proteins. In the existing literature, we could not find any quantitative numbers for total HDL protein in plasma because HDL is usually measured in the form of its cholesterol content. The overall recovery of about 1 g/liter HDL protein from normolipidemic plasma is what can be expected with this method.2 The calculated apoA-I levels ranged from 221 to 684 mg/liter, which corresponded to about 1/3 to 1/2 of what is published elsewhere (19). The apparently missing apoA-I is explained in several ways. First, HDL subclasses of higher and lower density were not collected for further analysis in this study as mentioned under "Results." Second, Asztalos et al. (22) reported specific loss of 10% apoA-I associated with small HDL particles during ultracentrifugation caused by denaturing forces and conditions used with this approach. Correspondingly our HDL yields, determined by HDL-C measurements, confirmed the loss of HDL protein in the order of 60%. Third, we used UV absorption measurements at 280 nm for absolute protein quantification. Although this method is rugged and easy to use, protein-specific extinction coefficients differ according to the relative abundance of tyrosine and tryptophan residues in the corresponding amino acid sequence. Despite this shortcoming, UV 280-nm measurements allowed for a robust comparison between different HDL samples.

Another explanation for discrepancies of apoA-I yields is due to the use of antibodies in most published results. Antibodies do have different avidities and have to recognize apolipoproteins that are part of a multimolecular protein-lipid complex, resulting in a possible steric hindrance of antibody-antigen recognition. This possibility was supported by our observation that immunoaffinity-purified HDL differed significantly from different HDL density classes. In contrast, a mass spectrometry-based method, like PMSSI, is independent of quaternary structures because proteins and protein-lipid complexes are dissociated under denaturing conditions followed by proteolysis of proteins into many peptides. Peptides are then separated on a reversed-phase column enabling the mass spectrometer to detect multiple surrogates of many antigens within the course of one analysis. Although the ionization process is biased by the chemical structure of peptides (see overestimation of MYG and SOMA in Fig. 3), this bias can be considered to be randomly distributed over all tryptic peptides of a proteome and should therefore have only a minor effect.

In the following we give possible interpretations of the observed protein compositions in relation to HDL metabolism. However, we would like to emphasize that only a distinct HDL subpopulation of one defined density range was analyzed. Together with the fact that we analyzed only a small number of samples in this preliminary study, the clinical significance of the given interpretations cannot be warranted completely.

Lipid-free apoA-I, small pre-ß HDL, and discoidal HDL are important acceptors for cholesterol and after remodeling through the action of LCAT and other plasma enzymes become spherical HDL. LCAT is activated by apoA-I and apoC-I and esterifies cholesterol molecules of HDL using phosphatidylcholine as fatty acyl donor. Therefore, apoA-I can be considered as the rate-limiting factor for HDL-C (23). Indeed apoA-I mass correlated positively with HDL-C (Fig. 6, top left panel) and total HDL protein (not shown). Additionally the relative apoC-I concentrations in hypercholesterolemia HDL appeared to be increased (Fig. 5), which might indicate a certain compensatory function for LCAT activation by apoC-I in the case of reduced apoA-I levels. Such a compensation mechanism is not obvious in the apoC-I plot of absolute protein concentrations of Fig. 6 (top left). However, if HDL-C were solely dependent on apoA-I in a linear relationship, hypercholesterolemia samples would have HDL-C levels that were significantly lower than the actual measured levels. For instance, C4 with an apoA-I concentration of 344 mg/liter would have a theoretical HDL-C level of 0.66 instead of the measured 1.07 mM when correlated with N1 for instance. The absolute HDL-associated apoC-I concentration of C4 (29.3 mg/liter) was equal to the absolute HDL-associated apoC-I concentrations of normolipidemia samples (mean ± S.D. of 23.9 ± 10.7 mg/liter), corroborating a positive effect of apoC-I on HDL-C in hypercholesterolemia.

As mentioned above, lipid-poor HDL precursors are important acceptors of cellular cholesterol and contribute to lipid metabolism and reverse cholesterol transport. HDL precursors are dissociated from chylomicrons and very low density lipoprotein during lipoprotein lipase-mediated hydrolysis of triglycerides. ApoC-III is an inhibitor of lipoprotein lipase, and increased levels of apoC-III will cause an accumulation of triglycerides in serum and, due to reduced availability of HDL precursors, a reduction in HDL-C. This relationship is indicated in the Fig. 6, top right panel. C2 and C3 did not fit into such a model, indicating that lipid metabolism is a multifactorial mechanism. A defect in lipid clearance might actually explain the two "outliers." ApoE-specific receptors mediate the uptake of apoA-I and apoB lipoproteins. A plot with HDL-associated apoE concentration in relation to total serum cholesterol and the combined apoC-I and apoC-III concentrations as the third dimension (Fig. 6, lower right panel) revealed that normolipidemia samples were aligned at a cholesterol concentration of about 5 mM and had a heterogeneous distribution of apoE (0–22.3 mg/liter) as well as apoC-I + apoC-III (59.5–125.0 mg/liter), respectively. The hypercholesterolemia samples had similar traits with 8.8–28.9 mg/liter apoE and 59.9–172.1 mg/liter for the C apolipoproteins; however, they segregated into two populations, one containing C1, C2, and C3 and the other containing C6 and C7 with C4 and C5 being at the crossroad. C1 to C5 had increased apoE and cholesterol concentrations and low to average C apolipoproteins, whereas C6 and C7 had average apoE but most extreme apoC. There are three common apoE isoforms, apoE-II (Cys112/Cys158), apoE-III (Cys112/Arg158), and apoE-IV (Arg112/Arg158), that strongly affect their binding properties to the LDL receptor and their intracellular recycling (for a review, see Ref. 24). Subjects homozygous for apoE-II have elevated triglyceride, cholesterol, and extracellular apoE. All these characteristics would fit to the traits of C2 and C3. However, we could not detect increased amounts of the more acidic apoE-II isoform by 2DE nor the tryptic peptide 158CLAVYQAGAR167, indicative for the apoE-II form, by LC-MS/MS analyzes. From this, it must be concluded that the familial hypercholesterolemia of subjects C1, C2, and C3 is caused by a lipoprotein receptor defect, and genetic tests are still ongoing to reveal its origin.

We identified serum amyloid A4 as a constitutive member of total HDL, and with samples C2 and C5 we also detected trace amounts of SAA (accession number P02735). Serum amyloid A proteins are acute phase response proteins that are increased during inflammation and are chronically elevated in metabolic syndrome and type 2 diabetes. In vitro experiments had shown that SAA is able to replace apoA-I and apoA-II from HDL, but in vivo results in mice did not confirm this (25, 26). Our results indicate that serum amyloid A4 is constitutively associated with HDL where it is maintained at a constant ratio relative to apoA-I as long as HDL-C is at physiologically normal levels of >1 mM (Fig. 6, lower left panel). In samples with reduced HDL-C, the relative contribution of SAA4 to total HDL protein and apoA-I appeared to increase, whereas relative concentrations of apoA-II remained unaffected. Serum amyloid can form dense but large HDL particles independently of the availability of apoA-I in apoA-I knock-out mice and cell culture (2729). Our finding suggests that SAA4 protein may take over some functionality of apoA-I-enriched lipoproteins but most likely does not contribute to reverse cholesterol transport.

Derangements in lipid metabolism are clinically indicated by the measurement of total serum cholesterol, triglyceride, HDL-C, and LDL-C. Hypercholesterolemia is defined as a severe form with high LDL-C and low HDL-C that is associated with an increased risk to develop atherosclerotic disease and a milder form with high LDL-C and high HDL-C. Our quantitative apolipoprotein results of a certain HDL density class did highlight different causes for hypercholesterolemia. HDL samples of the three related subjects C1, C2, and C3, who do have relatives with atherosclerotic incidences, had a protein pattern that clearly differentiated them from all other samples. Subjects C4 and C5 had less extreme lipid levels and could be categorized by our protein quantitation as having a milder form of hypercholesterolemia. The two sisters, C6 and C7, were differentiated from the other hypercholesterolemia subjects by high HDL-C. In accordance, the HDL apolipoprotein pattern was comparable with that of the normolipidemia group except for their extremely high levels of C apolipoproteins.


    CONCLUDING REMARKS
 TOP
 ABSTRACT
 EXPERIMENTAL PROCEDURES
 RESULTS
 DISCUSSION
 CONCLUDING REMARKS
 REFERENCES
 
The PMSSI approach does not rely on any chemical modification of proteins, and it is possible to measure reproducibly as many replicates as needed at any point in time as long as standard operating protocols are followed meticulously. The PMSSI approach is generally applicable to any protein mixture and is completely independent of avidity and affinity of antibodies that rely on recognition of their antigens, which can be hidden in a multimolecular protein-lipid or protein-protein complex. The presented mass spectrometry-based method for protein quantitation is independent of such quaternary structures. All valid peptide identifications from one LC-MS/MS run determine the total mass of digestible protein present in the sample, making it straight forward to calculate absolute and relative protein abundances in the analyzed sample. To our knowledge this is the first report of a quantitative compositional protein analysis of HDL, and it opens up new possibilities to study HDL biology in even more detail than previously possible.


    ACKNOWLEDGMENTS
 
We thank R. Bolli and P. Lerch (ZLB Behring, Bern, Switzerland) for providing pooled plasma and rHDL as well as running the UV-CE analyses, G. Taylor and D. Goodlett (Department of Medicinal Chemistry, University of Washington, Seattle, WA) for running SEQUEST analyses, R. James (Clinical Diabetes Unit, University Hospital, Geneva, Switzerland) for immunoaffinity isolation of HDL particles, and all persons involved in measuring lipid levels at the Institute of Clinical Chemistry, University Hospital in Bern, Switzerland.


   FOOTNOTES
 
Received, August 23, 2006, and in revised form, February 22, 2007.

Published, MCP Papers in Press, March 5, 2007, DOI 10.1074/mcp.M600326-MCP200

1 The abbreviations used are: 2DE, two-dimensional gel electrophoresis; 1DE, one-dimensional gel electrophoresis (SDS-PAGE); ACTS, actin; ALBU, serum albumin; apoX, apolipoprotein X (X = A-I, A-II, C-I, C-II, C-III, etc.); CE-UV, capillary electrophoresis with UV detection at 200 nm; CONA, concanavalin A; emPAI, exponentially modified protein abundance index; FA, formic acid; FETUA, fetuin; HC, hypercholesterolemia; HDL, high density lipoprotein; HDL-C, HDL cholesterol; INS, insulin; LACB, ß-lactoglobulin; LCAT, lecithin-cholesterol acyltransferase; LDL, low density lipoprotein; LDL-C, LDL cholesterol; LYSC, lysozyme; MYG, myoglobin; N, normolipidemia; PAI, protein abundance index; PMSS, peptide match score summation; PMSSI, peptide match score summation index; PRVA, parvalbumin; rHDL, reconstituted HDL; SAA, serum amyloid protein A; SOMA, growth hormone (somatotropin); SpS, spectrum sampling; THYG, thyroglobulin. Back

2 R. Bolli, personal communication. Back

* This work was supported in part by ZLB Behring. The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. Section 1734 solely to indicate this fact. Back

S The on-line version of this article (available at http://www.mcponline.org) contains supplemental material. Back

§ To whom correspondence should be addressed: Laboratory of Thrombosis Research, Dept. of Clinical Research, Freiburgstrasse, 3010 Bern, Switzerland. Fax: 41-31-632-2683; E-mail: manfred.heller{at}dkf.unibe.ch


    REFERENCES
 TOP
 ABSTRACT
 EXPERIMENTAL PROCEDURES
 RESULTS
 DISCUSSION
 CONCLUDING REMARKS
 REFERENCES
 

  1. Tonge, R., Shaw, J., Middleton, B., Rowlinson, R., Rayner, S., Young, J., Pognan, F., Hawkins, E., Currie, I., and Davison, M. (2001) Validation and development of fluorescence two-dimensional differential gel electrophoresis proteomics technology. Proteomics 1, 377 –396[CrossRef][Medline]

  2. Oda, Y., Huang, K., Cross, F. R., Cowburn, D., and Chait, B. T. (1999) Accurate quantitation of protein expression and site-specific phosphorylation. Proc. Natl. Acad. Sci. U. S. A. 96, 6591 –6596[Abstract/Free Full Text]

  3. Conrads, T. P., Alving, K., Veenstra, T. D., Belov, M. E., Anderson, G. A., Anderson, D. J., Lipton, M. S., Pasa-Tolic, L., Udseth, H. R., Chrisler, W. B., Thrall, B. D., and Smith, R. D. (2001) Quantitative analysis of bacterial and mammalian proteomes using a combination of cysteine affinity tags and 15N-metabolic labelling. Anal. Chem. 73, 2132 –2139[Medline]

  4. Gygi, S. P., Rist, B., Gerber, S. A., Turacek, F., Gelb, M. H., and Aebersold, R. (1999) Quantitative analysis of complex protein mixtures using isotope-coded affinity tags. Nat. Biotechnol. 17, 994 –999[CrossRef][Medline]

  5. Munchbach, M., Quadroni, M., Miotto, G., and James, P. (2000) Quantitation and facilitated de novo sequencing of proteins by isotopic N-terminal labeling of peptides with a fragmentation-directing moiety. Anal. Chem. 72, 4047 –4057[Medline]

  6. Goodlett, D. R., Keller, A., Watts, J. D., Newitt, R., Yi, E. C., Purvine, S., von Haller, P. D., Eng, J. K., Aebersold, R., and Kolker, E. (2001) Differential stable isotope labeling of peptides for quantitation and de novo sequence derivation. Rapid Commun. Mass Spectrom. 15, 1214 –1221[CrossRef][Medline]

  7. Ross, P. L., Huang, Y. N., Marchese, J. N., Williamson, B., Parker, K., Hattan, S., Khainovski, N., Pillai, S., Dey, S., Daniels, S., Purkayastha, S., Juhasz, P., Martin, S., Bartlet-Jones, M., He, F., Jacobson, A., and Pappin, D. J. (2004) Multiplexed protein quantitation in Saccharomyces cerevisiae using amine-reactive isobaric tagging reagents. Mol. Cell. Proteomics 3, 1154 –1169[Abstract/Free Full Text]

  8. Yao, X., Freas, A., Ramirez, J., Demirev, P. A., and Fenselau, C. (2001) Proteolytic 18O labeling for comparative proteomics: Model studies with two serotypes of adenovirus. Anal. Chem. 73, 2836 –2842[Medline]

  9. Heller, M., Mattou, H., Menzel, C., and Yao, X. (2003) Trypsin catalyzed 16O-to-18O exchange for comparative proteomics: tandem mass spectrometry comparison using MALDI-TOF, ESI-QTOF and ESI-ion trap mass spectrometers. J. Am. Soc. Mass Spectrom. 14, 704 –718[CrossRef][Medline]

  10. Chelius, D., and Bondarenko, P. V. (2002) Quantitative profiling of proteins in complex mixtures using liquid chromatography and mass spectrometry. J. Proteome Res. 1, 317 –323[CrossRef][Medline]

  11. Lasonder, E., Ishihama, Y., Andersen, J. S., Vermunt, A. M., Pain, A., Sauerwein, R. W., Elling, W. M., Hall, N., Waters, A. P., Stunnenberg, H. G., and Mann, M. (2002) Analysis of the Plasmodium falciparum proteome by high-accuracy mass spectrometry. Nature 419, 537 –542[CrossRef][Medline]

  12. Old, W. M., Meyer-Arendt, K., Aveline-Wolf, L., Pierce, K. G., Mendoza, A., Sevinsky, J. R., Resing, K. A., and Ahn, N. G. (2005) Comparison of label-free methods for quantifying human proteins by shotgun proteomics. Mol. Cell. Proteomics 4, 1487 –1502[Abstract/Free Full Text]

  13. Liu, H., Sadygov, R. G., and Yates, J. R., III (2004) A model for random sampling and estimation of relative protein abundance in shotgun proteomics. Anal. Chem. 76, 4193 –4201[Medline]

  14. Rappsilber, J., Ryder, U., Lamond, A. I., and Mann, M. (2002) Large-scale proteomic analysis of the human spliceosome. Genome Res. 12, 1231 –1245[Abstract/Free Full Text]

  15. Ishihama, Y., Oda, Y., Tabata, T., Sato, T., Nagasu, T., Rappsilber, J., and Mann, M. (2005) Exponentially modified protein abundance index (emPAI) for estimation of absolute protein amount in proteomics by the number of sequenced peptides per protein. Mol. Cell. Proteomics 4, 1265 –1272[Abstract/Free Full Text]

  16. Colinge, J., Chiappe, D., Lagache, S., Moniatte, M., and Bougueleret, L. (2005) Differential proteomics via probabilistic peptide identification scores. Anal. Chem. 77, 596 –606[Medline]

  17. Gerber, S. A., Rush, J., Stemman, O., Kirschner, M. W., and Gygi, S. P. (2003) Absolute quantification of proteins and phosphoproteins from cell lysates by tandem MS. Proc. Natl. Acad. Sci. U. S. A. 100, 6940 –6945[Abstract/Free Full Text]

  18. Breslow, J. L. (1996) Mouse models of atherosclerosis. Science 272, 685 –688[Abstract]

  19. Alaupovic, P. (2003) The con