|
Advertisement | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Molecular & Cellular Proteomics 7:499-508, 2008.
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| ABSTRACT |
|---|
|
|
|---|
An alternative approach for localization studies is to use antibodies or other affinity reagents to specifically visualize the subcellular localization of proteins. The major advantage of using the antibody-based method is that no genetic constructs are needed and that the possible artifactual effects of protein fusions are avoided. One drawback is that this strategy requires fixed and permeabilized cells, limiting the experiments to dynamically fixed end points. The antibody-based and fluorescent tag methods are therefore complementary for large scale studies of protein localization in cellular systems. The reason that no one, so far, has attempted to use the antibody approach in large scale is most likely the lack of protein binders to an adequate number of proteins (6). The Swedish Human Protein Atlas (HPA)1 program has recently been set up to allow for systematic exploration of the human proteome using antibody-based proteomics (7). This is accomplished by combining high throughput generation of affinity-purified (monospecific) antibodies with protein profiling in a multitude of tissues/cell types assembled in tissue microarrays. Due to the difficulty to acquire spatial information on the fine cellular and subcellular level using immunohistochemistry, we have therefore decided to expand the analysis with confocal microscopy using fluorescently labeled antibodies. Here we report on a pilot study of 466 proteins in three human cell lines as a first attempt to generate an atlas of human proteins using confocal microscopy. This pilot study has generated
3000 high resolution images all available on the Human Protein Atlas portal.
| EXPERIMENTAL PROCEDURES |
|---|
|
|
|---|
Immunofluorescent Sample Preparation—
Immunofluorescent stainings were prepared in 96-well glass bottom plates (Whatman). After coating each well with human 12.5 µg/ml fibronectin, (Sigma-Aldrich) for 1 h followed by washes with PBS, 15,000 cells (10,000 for A-431 cells) were seeded per well and grown for 3 h. All the subsequent steps were performed in 96-well format using a pipetting work station (Matrix 2 x 2, Matrix Technologies Corp., Hudson, NH). After washes with PBS the cells were fixed with ice-cold 4% paraformaldehyde (PFA) (Sigma) in growth media for 15 min and permeabilized using freshly prepared 0.1% Triton X-100 in PBS (Sigma) for 3 x 5 min. Rabbit monospecific HPA antibodies were dissolved to 2 µg/ml in blocking buffer (PBS + 4% fetal bovine serum) containing 1 µg/ml mouse anti-tubulin (Abcam, Cambridge, UK) and 1 µg/ml chicken anti-calreticulin (Abcam). Incubation of primary antibodies was performed overnight at 4 °C. After four washes with PBS, secondary antibodies, goat anti-rabbit Alexa488, rabbit anti-mouse Alexa555, and goat anti-chicken Alexa647 (Molecular Probes, Invitrogen), diluted to 1 µg/ml in blocking buffer were incubated for 1 h at room temperature. The cells were counterstained with the nuclear probe DAPI, diluted to 300 nM, for 4 min. After extensive washes with PBS the wells were filled up with PBS containing 78% glycerol and then sealed. On each plate a negative control well was included where the primary antibodies had been left out. This was used as a reference to distinguish positive from negative staining.
Image Acquisition—
Image acquisition was performed with a LSM 510 Meta confocal laser scanning microscope equipped with a motorized table and a 63x/1.4 numerical aperture oil immersion objective (all equipment from Carl Zeiss GmbH, Jena, Germany). Images were acquired in four sequential steps (one for each dye) to minimize the spectral bleed-through. In each well two lateral images (selected from the center of the cells) and a single z axis slice were recorded. The confocal settings were as follows: 12-bit acquisition; line averaging, 2; pixel time, 1.84 µs; and a pixel size of 80 x 80 nm. The detector gain and offset were adjusted for each sample to use the full dynamic range of the detector and to obtain as high signal to noise ratios as possible. The operator ensured that the acquired images were representative for the well.
Image Annotation—
Image annotation was performed manually using a Web-based annotation document (Supplemental Fig. 4). This enabled linking each image to an antibody identification number and existing data available in our Laboratory Information Management System (7). For each image pair, sample identification tags and microscope parameters were entered, and the image was further visually inspected before the antibody-specific subcellular localization and staining characteristics were described. Based on the image and the detector gain settings, the staining was classified as negative, weak, moderate, or strong. Ten different subcellular localizations were listed in the annotation document: cytoplasm, nucleus, nuclear membrane, nucleolus, mitochondrion, endoplasmic reticulum, Golgi apparatus, cytoskeleton, extracellular matrix, and vesicles. This last category groups organelles as lysosomes, peroxisomes, endosomes, and all types of protein transport bodies. The description of the subcellular compartments was further combined with parameters describing the staining characteristics; smooth, granular, speckled, dotty, fibrous, or clustered. For an indiscernible pattern, the image was annotated as unspecific (Supplemental Fig. 5). For a pattern that did not fall into the 10 subcellular categories, the image was annotated as "other" combined with a comment.
Data Handling—
After scanning of the microplate, images were collected on our servers and compressed into different resolution JPEG images for display purposes. All raw (TIFF) images were kept and stored in the campus facility. Each well of a plate corresponded to a specific antibody and was described by an identification number. This allowed linking the new subcellular information to the existing database with immunohistochemistry data and antibody information (see Fig. 6).
|
|
|
|
| RESULTS |
|---|
|
|
|---|
2.8 million immunohistochemistry images corresponding to 3015 antibodies (version 3.0).
|
Several different cellular probes were tested as reference for identification of organelles and other cellular compartments. Due to limitation of the excitation/emission characteristics of the fluorescent probes used (i.e. to avoid problems associated with spectrally overlapping fluorophores) and the need to maintain a high throughput operation, we decided to limit the pilot study to three cellular probes in addition to the probe for the analyte. The following three probes were selected: (i) DNA stain DAPI for the nucleus, (ii) tubulin as internal control for fixation quality and homogeneity, and (iii) calreticulin for the endoplasmic reticulum.
When preparing samples for immunofluorescence labeling, an important factor is the choice of fixative and detergent to permeabilize the cells. There are many different options available, all with different advantages and disadvantages (11, 12). Several different alternatives to suit a high throughput approach were investigated,2 and examples of PFA/Triton X-100 and PFA/saponin are shown in Supplemental Fig. 1. Although saponin yielded excellent images for proteins localized in the cytoplasm, our results suggest that stronger detergents, such as Triton X-100, are needed to stain proteins localized to mitochondria or nuclei (data not shown) (11). The Triton permeabilization was therefore chosen as the standard procedure for the high throughput effort. In Supplemental Fig. 2, an example of the split view of the four channels is presented showing staining of a putative mitochondrial protein (A) with probes for nucleus (B), tubulin (C), and endoplasmic reticulum (D).
Annotation of Subcellular Localization—
The analysis of three cell lines using four probes was performed for 466 antibodies selected at random from the validated list of antibodies generated within the framework of the protein atlas program. The set of antibodies covers proteins originating from all chromosomes with slightly higher representation on chromosomes 1, 3, 11, 14, 19, 22, and X (see Supplemental Fig. 3). Fig. 2 shows an example of the nine different subcellular localizations annotated in this work, namely cytoplasm, microtubules, nucleus, mitochondria, vesicles, microfilaments, endoplasmic reticulum, Golgi apparatus, and extracellular matrix. The microscopic parameters were set to reach a lateral resolution close to the diffraction limit, i.e. about 200 nm. All images in the atlas were manually annotated using a Web-based annotation tool (Supplemental Fig. 4). The subcellular localization, staining pattern characteristics (e.g. smooth, speckled, etc.), and intensity were manually annotated.
The diversity of staining patterns is illustrated by examples of nuclear staining in Fig. 3. Eight different proteins are shown, all having different distributions within the nucleus (nucleus, nucleoli, and nuclear membrane) as well as varying characteristics of the staining (smooth, granular, speckled, dotty, and clusters of spots). More detailed annotation could provide more information about the function of individual proteins, but the large spectrum of staining patterns makes a more detailed annotation risky and could lead to false positives (13). Therefore the annotations are focused on the main organelles, and to ensure that no information is lost, both the images and annotation comments are accessible for the viewers.
Quality of Antibodies—
All HPA antibodies were diluted to a final concentration of 2 µg/ml. To analyze whether the specificity of the staining could be related to the initial antibody concentration, a diagram showing the relation between the antibody stock concentrations and staining specificity is displayed in Supplemental Fig. 6. It is noteworthy that even though the antibody concentration was normalized there were a higher number of unspecific and negative stainings for low stock concentration compared with higher concentration suggesting that high concentrations of specific antibodies after immunization is an indicator of good antibody quality. However, it is worth pointing out that unspecificity of the staining can also be associated with the fixation/permeabilization protocol used as shown previously (11, 12).
Evaluation of Localization—
In Fig. 4, the distribution of the different subcellular localization is shown for the three cell lines. A, B, and C show protein localization distributed in three major compartments: nucleus, cytoplasm, and nucleus with cytoplasm. A fourth category groups unspecific and negative staining. Antibodies were annotated as unspecific when there was a homogeneous staining all over the cell, as shown in Supplemental Fig. 5A, in contrast to the compartment "nucleus + cytoplasm" (Supplemental Fig. 5B) where the annotator was able to distinguish different staining intensities within the two localizations. Negative staining was identified when laser gain was above a defined threshold about which the noise level is too high compared with signal intensity. Data are similar between U-2OS (Fig. 4A) and A-431 (Fig. 4C) with about 30% of all antibodies located in the cytoplasmic compartment, 17% located in the nuclear compartment, and 30% located in both. The U-251MG cell line (Fig. 4B) had more proteins with an apparent nuclear localization (43%), and this finding needs to be further investigated. For the three cell lines, an average of 18% of antibodies was considered as unspecific or negative.
|
Fig. 4E gives a more detailed overview of subcellular localizations. For this figure, only antibodies specific to one organelle were taken into account. Differences among the three cell lines can be noticed. For example, more antibodies are annotated to the mitochondria compartment in U-2OS cell line (26 antibodies) compared with the other cell lines (21 and four antibodies for U-251MG and A-431, respectively). About 200 antibodies show a strict localization, 200 show localizations in multiple cellular compartments, and about 70 are considered unspecific (or negative). The latter figure can be explained through multiple parameters, independent or inter-related, like sample preparation, antibody specificity in a particular cell line, cell morphology, or annotation.
Comparisons with Gene Ontology Annotations—
In Fig. 5, localization annotations from this study are compared with the Gene Ontology (GO) annotations in a "score map." To be able to link the two annotation ensembles, the first database was populated to connect each HPA antibody with its corresponding cellular component GO terms (data are mined from the antibody information pages available on our servers). A second database links our annotation terms (i.e. nuclear, cytoplasm, mitochondrion, etc.) with GO terms from the cellular component category (available at the Gene Ontology Website) based on term similarity. Finally a color score was generated for our 10 subcellular compartments to evaluate a match or mismatch between our annotations and Gene Ontology (Fig. 5).
Fig. 5 enables a direct comparison between our subcellular annotations of protein localization and GO terminology of the gene products. For instance, the antibody cluster denoted by a in Fig. 5A highlights a good agreement for localization into nuclear and cytoplasmic compartments. b illustrates a group of about 20 proteins annotated to the nucleus and cytoplasmic compartments but without any GO annotations for cytoplasmic localization. However, this localization is supported by other GO information based on molecular functions or biological processes. c indicates a group of proteins for which annotations are in good agreement between our study and GO, and c' shows a mismatch between the two databases highlighting antibodies directed against cytoskeleton proteins as predicted by GO. For some of these antibodies, the sample preparation (especially the fixation step of the immunofluorescence protocol) might have masked antigens or modified protein conformations and ultimately hindered annotations. d represents a group of proteins predicted by the Gene Ontology to belong to the vesicle subcellular compartment. Vesicles (e.g. endosomes or clathrins) are by definition small subcellular organelles and thus are more difficult to annotate as described by Glory and Murphy (14). e shows a set of 20 proteins for which there is no existing GO terms and where our results add new localization information.
Overall the agreement between GO and our annotations is larger than 80%, calculated as the percentage of proteins in which the GO and the experimental data predict the presence (shown in green in Fig. 5) or absence (light blue in Fig. 5) of a protein in a defined subcellular compartment. It is obvious that comparisons of the nuclear and cytoplasmic compartments are more complex than comparisons for the other compartments (Fig. 5). These are the most often annotated compartments, and here the agreement with GO data was lower (45 and 55%, respectively). One explanation for this discrepancy is the limitations due to using only one method for sample preparation. For example the clusters indicated by c' and d in Fig. 5A (cytoskeletal and vesicular proteins, respectively) had in the experimental evaluation been annotated as nuclear or cytoplasmic. This might be expected because some types of proteins will not easily be resolved in their intact subcellular compartment and hence fall into a less resolved compartment, i.e. cytoplasm or nucleus. Another reason for this discrepancy is that annotations often take multiple localizations into account. The nuclear and cytoplasmic compartments could in this context be considered as metacompartments including cytoplasmic and nuclear organelles. This will sometimes lead to false mismatches when comparing with predictive data. Interestingly when compiling data from all three cell lines (Fig. 5, A, B, and C), there is a 60% agreement between the GO prediction and the experimental localization determined here (data not shown). The presented data therefore add valuable experimental information to the Gene Ontology terminology by evaluating subcellular localization in cells with different phenotypes.
Integration into the Human Protein Atlas—
The images and subcellular localization data resulting from this project has been released as a new feature of the currently available Human Protein Atlas. In Fig. 6, a screen shot of the layout of the subcellular atlas is shown. The objective is to include this analysis for every antibody/protein included in the atlas in the future.
| DISCUSSION |
|---|
|
|
|---|
In the past decade, fluorescence microscopy imaging has gone through a dramatic change due to development of new types of fluorescent probes, new advancement in imaging instrumentation (18, 19), and new applications (3, 20). All proteomics studies using cells and imaging rely on three major steps: sample preparation, image acquisition, and data analysis. For the first step, two different techniques are available: live cell studies (21) or immunostaining of fixed cells (16). An advantage of live cell studies compared with fixed cells is the possibility to study temporal patterns of protein expression, which is a parameter of importance in cell behavior. The cell lines used in this study do not have their cell cycles synchronized, meaning that at a fixed time point the cell population exhibits cells randomly distributed in G1, S, G2, and M phases. Manual image acquisition alleviates this problem by choosing a representative region of interest. An attractive expansion of the work described here could therefore be to include specific cell cycle markers (e.g. DNA probe, bromo-2-deoxyuridine incorporation, and cyclin expression) to bring a pseudodynamic input for protein localization (22). New imaging techniques, such as imaging-bleaching cycles could also be use on selected sets of proteins to study protein networks and interactions (23).
Here we performed manual annotation on each image to classify antibody staining patterns into different subcellular compartments. As shown in a recent review, human and computer accuracy scores are comparable for large organelles like mitochondria, endoplasmic reticulum, or cytoskeleton (14). However, human annotation is labor-intensive and is susceptible to annotator subjectivity, and automated image analysis is therefore attractive. Such analysis can be performed using two different principles: (i) co-localization or (ii) pattern recognition. For co-localization analysis, organelle-specific markers are needed for every subcellular compartment to be analyzed, and three-dimensional image acquisitions are desired, making such studies difficult and demanding. A pilot study has been conducted on a reduced set of organelle markers2 demonstrating the feasibility of such an approach, although more technical development is needed to implement such a strategy in a streamlined, high throughput scheme. The pattern recognition strategy has shown feasibility and accuracy in large scale studies (24–26). It can be divided into two groups, supervised and unsupervised classifications. Supervised learning requires an initial training phase before the subcellular location class of the protein can be determined. Unsupervised learning methods are able to define classes based on the distance of objects in a feature space (27). These different approaches are currently being investigated to enable automated image annotation.
Here our manual annotations of 466 antibodies led to more than 80% classified in one or multiple subcellular compartment(s) and about 20% left with undefined localization. Several factors can be responsible for these non-localized antibodies such as (i) the lack of the protein target in the analyzed cell lines, (ii) the failure to permeabilize the compartment in which the protein target is present, or (iii) the failure to recognize the epitope of the protein target due to denaturation of the corresponding protein by the fixation (detergent). More work is needed to distinguish between these alternatives including the use of an alternative antibody recognizing the same target but recognizing another epitope.
As reviewed recently (14), there is a need for additional experimental determination of protein localization, and this project attempts to contribute to this effort. The annotated images in this atlas classify proteins into subcellular compartments and can be compared with existing databases, such as the Gene Ontology consortium, Proteome Analyst WoLF PSORT, or hum-PLOC (see Nucleic Acids Research for an extensive list). The data in this pilot study were evaluated on the degree of agreement in protein localization using the GO database. The results show the usefulness of complementing theoretical prediction methods with experimental procedures, such as the antibody-based method presented or with a fluorescent tagging method as demonstrated earlier (15).
The scope of proteomics is broad from identification and quantification of proteins in cells and tissues (in normal and cancer states) to deeper understanding of molecular processes within cells. The recent addition of cytomics to the "omics" data sets brings knowledge of cell systems and molecular architecture thanks to imaging techniques and molecular biology. The presented subcellular atlas aims to provide a publicly available atlas for subcellular localization of all human proteins and to facilitate studies comparing normal and disease protein profiles with the ultimate objective to discover new biomarkers and targets for the design of diagnostic tools and drugs.
| FOOTNOTES |
|---|
Published, MCP Papers in Press, November 19, 2007, DOI 10.1074/mcp.M700325-MCP200
1 The abbreviations used are: HPA, Human Protein Atlas; GO, Gene Ontology; PFA, paraformaldehyde; DAPI, 4',6-diamidino-2-phenylindole. ![]()
2 L. Barbe and H. Andersson-Svahn, unpublished data. ![]()
* This work was supported by The Royal Academy of Sciences. The Human Proteome Resource program is funded by the Knut and Alice Wallenberg Foundation. The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. Section 1734 solely to indicate this fact. ![]()
S The on-line version of this article (available at http://www.mcponline.org) contains supplemental material. ![]()
|| To whom correspondence should be addressed. E-mail: mathias{at}biotech.kth.se
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
E. Bjorling and M. Uhlen Antibodypedia, a Portal for Sharing Antibody and Antigen Validation Data Mol. Cell. Proteomics, October 1, 2008; 7(10): 2028 - 2037. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Berglund, E. Bjorling, P. Oksvold, L. Fagerberg, A. Asplund, C. Al-Khalili Szigyarto, A. Persson, J. Ottosson, H. Wernerus, P. Nilsson, et al. A Genecentric Human Protein Atlas for Expression Profiles Based on Antibodies Mol. Cell. Proteomics, October 1, 2008; 7(10): 2019 - 2027. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| All ASBMB Journals | Journal of Biological Chemistry |
| Journal of Lipid Research | ASBMB Today |