Supporting Online Material

Size: px
Start display at page:

Download "Supporting Online Material"

Transcription

1 Supporting Online Material Table of contents 1 Supplementary figure 1 2 Supplementary figure 2 3 Supplementary figure 3 4 Supplementary figure 4 5 Supplementary figure 5 6 Supplementary figure 6 7 Supplementary figure 7 8 Supplementary figure 8 9 Supplementary figure 9 10 Supplementary table 1 11 Supplementary table 2 12 Supplementary table 3 13 Supplementary table 4

2 4$"8(+ MNO MNP MNQ MNR MNS MNT MNU MNV!"#$%&'($)*+,-.+$/ 01,,+('12/-3$ 4$*%+'12/-3$&'1.,-.+$/ 0+%"*+$5'12/-3$ 6%*"7'!#&'83"*3'-$33/ 6%*"7'9,&'83"*3'-$33/ 6%*"7'4-&'83"*3'-$33/ 6%*"7'9:&'7$2%,7/ 6%*"7'9,&'7$2%,7/ 6%*"7'4-&'7$2%,7/ 6%*"7'!#&'7$2%,7/ 6%*"7'9:&'8%*723*%'3*.$% 6%*"7'9:&'1,3$-23*%'3*.$%!278&'1*-%,)(*8$/ 0)3$$7&';("+$')23)!.1)('7,5$&',+($%'3.1)(,"5'-$33/ <,7/"3&',+($%'3.1)(,"5'-$33/ =))$75">&'3.1)(,-.+$/!.1)('7,5$&'8$%1"7*3'-$7+$% <,7/"3&'8$%1"7*3'-$7+$% 6,7$'1*%%,; A#*%.&'/+%,1*3'-$33/ B+$%2/'C&'$751$+%"*3'/+%,1* B+$%2/'C&'$75,1$+%"*3'/+%,1* <$/+"/&'/$1"7"D$%2/'52-+ <,7/"3&'09 0E"7'C&'$)"5$%1*3'-$33/ 0E"7'CC&'$)"5$%1*3'-$33/ F/,)(*82/&'09 G*8"7*&'09 9$%#">&'09 A%*3'12-,/*&'09 6%$*/+&'?9 H3*-$7+*&'5$-"52*3'-$33/ H3*-$7+*&'+%,)(,:3*/+/ 63*55$%&'2%,+($3"*3'-$33/ I*33,)"*7'+2:$&'?9 B+$%2/'C&'?9 B+$%2/'CC&'?9 J*/,)(*%.7>&'?9 6%,7-(2/&'?9 9$%#">&'?9 F)"5"5.1"/&'?9 <(.%,"5'83*75&'?9 0*3"#*%.'83*75&'?9 0$1"7*3'#$/"-3$&'?9 H*7-%$*/&'?9 H%,/+*+$&'?9 =5%$7*3 H*%*+(.%,"5'83*75&'?9 H*7-%$*/&'"/3$+'-$33/ 0+,1*-('C&'?9 0+,1*-('CC&'?9?*33:3*55$%&'?9 K2,5$721&'?9 01*33'"7+$/+"7$&'?9 =))$75">&'?9 9,3,7&'?9 L$-+21&'?9 Supplementary figure 1. Dendrogram of protein expression patterns based on protein-encoded genes from the single human chromosome 9. The dendrogram was generated based on all antibodies (n=215) available for genes encoded on chromosome 9. The colored bar show the tissues with similar functional annotation as described in Figure 1. Light blue represent CNS tissues, green = hematopoetic system, pink = mesenchymal cells, yellow = squamous epithelia, dark blue = endocrine cells and the two shades of brown represent glandular and transitional epithelia. The similarity of the dendrogram generated with a smaller set of antibodies and the dendogram

3 generated with the complete set of antibodies (Fig. 1) has a cophenetic correlation coefficient of 0.91.

4 L$"2(+ MNO MNP MNQ MNR MNS MNT MNU MNV!"#$%&'($)*+,-.+$/!012&'3*-%,)(*2$/!"#$%&'45 6#*%.&'/+%,3*7'-$77/ 8+$%0/'9&'$1:3$+%"*7'/+%,3* 8+$%0/'9&'$1:,3$+%"*7'/+%,3* ;":1$.&'27,3$%07"!012&')1$03,-.+$/ <,1/"7&',+($%'7.3)(,":'-$77/ A))$1:"B&'7.3)(,-.+$/!.3)('1,:$&'2$%3"1*7'-$1+$% <,1/"7&'2$%3"1*7'-$1+$% D)":":.3"/&'45 <(.%,":'27*1:&'45 E*1-%$*/&'"/7$+'-$77/ <$/+"/&'!$.:"2'-$77/ A:%$1*7 H*77,)"*1'+0C$&'45 8+$%0/'9&'45 8+$%0/'99&'45 5$%#"B&'45 >%$*/+&'45 >7*::$%&'0%,+($7"*7'-$77/ E7*-$1+*&':$-":0*7'-$77/ E7*-$1+*&'+%,)(,C7*/+/ >%,1-(0/&'45 A))$1:"B&'45 5,7,1&'45 K$-+03&'45 >%*"1'5C&'1$0%,1/ >%*"1'5C&'3,7$-07*%'7*.$% >%*"1'5,&'1$0%,1/ >%*"1'L-&'1$0%,1/ >%*"1'!#&'1$0%,1/ >%*"1'5C&'2%*107*%'7*.$% >%*"1'5,&'27"*7'-$77/ >%*"1'L-&'27"*7'-$77/ Supplementary figure 2. Dendrogram of protein expression patterns based on protein-encoded genes from the single human chromosome 10. The dendrogram was generated based on all antibodies (n=203) available for genes encoded on chromosome 10. The colored bar indicates the tissues with similar functional annotation as described in figure S1. The similarity of the dendrogram generated with a smaller set of antibodies and the dendogram generated with the complete set of antibodies (Fig. 1) has a cophenetic correlation coefficient of 0.89.

5 0$":(+ MNO MNP MNQ MNR MNS MNT MNU!"#$%&'($)*+,-.+$/ 0$*%+'12/-3$&'1.,-.+$/ 4+%"*+$5'12/-3$ 6%*"7'89&':%*723*%'3*.$% 6%*"7'89&'1,3$-23*%'3*.$% 6%*"7'8,&':3"*3'-$33/ 6%*"7'0-&':3"*3'-$33/ 6%*"7'!#&':3"*3'-$33/ 6%*"7'89&'7$2%,7/ 6%*"7'!#&'7$2%,7/ 6%*"7'8,&'7$2%,7/ 6%*"7'0-&'7$2%,7/ 41,,+('12/-3$ ;+$%2/'<&'$751$+%"*3'/+%,1* ;+$%2/'<&'$75,1$+%"*3'/+%,1*!27:&')7$21,-.+$/ ="57$.&':3,1$%23" 6,7$'1*%%,A!.1)('7,5$&',+($%'3.1)(,"5'-$33/?,7/"3&',+($%'3.1)(,"5'-$33/ B))$75"C&'3.1)(,-.+$/!.1)('7,5$&':$%1"7*3'-$7+$%?,7/"3&':$%1"7*3'-$7+$% 4)3$$7&'%$5')23) 4)3$$7&'A("+$')23) D*%*+(.%,"5':3*75&'E8 ="57$.&'+2923$/!"#$%&'E8 D3*-$7+*&'5$-"52*3'-$33/ D3*-$7+*&'+%,)(,93*/+/ F)"5"5.1"/&'E8 D*7-%$*/&'"/3$+'-$33/ 4G"7'<&'$)"5$%1*3'-$33/ 63*55$%&'2%,+($3"*3'-$33/ >%*3'12-,/*&'48 F/,)(*:2/&'48 8$%#"C&'48 H*:"7*&'48?,7/"3&'48 4G"7'<<&'$)"5$%1*3'-$33/ D%,/+*+$&'E8 4$1"7*3'#$/"-3$&'E8 B5%$7*3?$/+"/&'!$.5":'-$33/ 4+,1*-('<&'E8 4+,1*-('<<&'E8 E*3393*55$%&'E8 I2,5$721&'E8 41*33'"7+$/+"7$&'E8 B))$75"C&'E8 8,3,7&'E8 J$-+21&'E8 6%$*/+&'E8 K*33,)"*7'+29$&'E8 6%,7-(2/&'E8 L*/,)(*%.7C&'E8?(.%,"5':3*75&'E8 8$%#"C&'E8 ;+$%2/'<&'E8 ;+$%2/'<<&'E8 D*7-%$*/&'E8 4*3"#*%.':3*75&'E8 Supplementary figure 3. Dendrogram of protein expression patterns based on protein-encoded genes from the single human chromosome 22. The dendrogram was generated based on all antibodies (n=206) available for genes encoded on chromosome 22. The colored bar indicates the tissues with similar functional annotation as described in figure S1. The similarity of the dendrogram generated with a smaller set of antibodies and the dendogram generated with the complete set of antibodies (Fig. 1) has a cophenetic correlation coefficient of 0.92.

6 *+,-./!"#!"$!"%!"&!"'!"(!")!"#$%&'($)*+,-.+$/ 0$*%+'12/-3$&'1.,-.+$/ 4+%"*+$5'12/-3$ 6*7-%$*/&'"/3$+'-$33/ 8%*"7'-$%$9%*3'-,%+$: 8%*"7&'(")),-*1)2/&';3"*3!$<+'#$7+%"-3$&';3"*3 =$%$9$3321&';%*723*%'3*.$% =$%$9$3321&'7$2%,7/ =$%$9$3321&'1,3$-23*%'3*.$%!$<+'#$7+%"-3$&'7$2%,7/ 8%*"7'-$%$9%*3'-,%+$:&'7$2%,7/ 8%*"7'(")),-*1)2/&'7$2%,7/!27;&'1*-%,)(*;$/ 4)3$$7&'%$5')23) 8,7$'1*%%,> 41,,+('12/-3$ A"57$.&';3,1$%23"!27;&')7$21,-.+$/ B#*%.&'/+%,1* C+$%2/'D&'$75,1$+%"*3'/+%,1* C+$%2/'DD&'$75,1$+%"*3'/+%,1* A"57$.&'+2923$/!"#$%&';3*7523*% 4E"7'D&'$)"5$%1*3?,7/"3&'/F2*1,2/ B%*3'12-,/*&'/F2*1,2/ 4E"7'DD&'$)"5$%1*3 G/,)(*;2/&'/F2*1,2/ =$%#":&'/F2*1,2/ H*;"7*&'/F2*1,2/ 63*-$7+*&'5$-"52*3'-$33/ 63*-$7+*&'+%,)(,93*/+/ I*3393*55$%&';3*7523*% 4$1"7*3'#$/"-3$&';3*7523*% G)"5"5.1"/&';3*7523*% 8%$*/+&';3*7523*% 4*3"#*%.';3*75&';3*7523*% 8%,7-(2/&';3*7523*% J*/,)(*%.7:&';3*7523*% K*33,)"*7'+29$&';3*7523*% 83*55$%&'2%,+($3"*3'-$33/ 4+,1*-('D&';3*7523*% 4+,1*-('DD&';3*7523*% L2,5$721&';3*7523*% 41*33'"7+$/+"7$&';3*7523*% =,3,7&';3*7523*% 6*7-%$*/&';3*7523*% 6%,/+*+$&';3*7523*% ''=$%#":&';3*7523*% C+$%2/'D&';3*7523*% C+$%2/'DD&';3*7523*% Supplementary figure 4. One of 3000 dendrograms of protein expression patterns based on protein-encoded genes from 200 randomly chosen antibodies. The colored bar indicates the tissues with similar functional annotation as described in figure S1. The mean cophenetic correlation similarity of 3000 generated dendrograms, using a set of 200 antibodies was 0.90.

7 )*+,-.!"#!"$!"%!"&!"'!"(!"#$%&'($)*+,-.+$/ 01,,+('12/-3$ 4$*%+'12/-3$&'1.,-.+$/ 0+%"*+$5'12/-3$ 6$%$7$3321&'8$2%,8/ 6$%$7$3321&'1,3$-23*%'3*.$% 9%*"8'-$%$7%*3'-,%+$:&'8$2%,8/ 9%*"8'(")),-*1)2/&'8$2%,8/!$;+'#$8+%"-3$&'8$2%,8/ 6$%$7$3321&'<%*823*%'3*.$% 9%*"8'-$%$7%*3'-,%+$: 9%*"8&'(")),-*1)2/&'<3"*3!$;+'#$8+%"-3$&'<3"*3 =+$%2/'>&'$85,1$+%"*3'/+%,1* 0)3$$8&'A("+$')23)!.1)('8,5$&'<$%1"8*3'-$8+$% B,8/"3&'<$%1"8*3'-$8+$% C))$85":&'3.1)(,-.+$/!.1)('8,5$&',+($%'3.1)(,"5'-$33/ B,8/"3&',+($%'3.1)(,"5'-$33/ 9,8$'1*%%,A 0)3$$8&'%$5')23) B,8/"3&'/D2*1,2/ 0E"8'>&'$)"5$%1*3 0E"8'>>&'$)"5$%1*3 6$%#":&'/D2*1,2/ F*<"8*&'/D2*1,2/ H*%*+(.%,"5'<3*85&'<3*8523*% C5%$8*3 B$/+"/&'3$.5"<'-$33/!28<&'1*-%,)(*<$/ B$/+"/&'/$1"8";$%2/'52-+ H3*-$8+*&'5$-"52*3'-$33/ H*8-%$*/&'"/3$+'-$33/ H%,/+*+$&'<3*8523*% 0+,1*-('>&'<3*8523*% 0+,1*-('>>&'<3*8523*% H*8-%$*/&'<3*8523*% 0*3"#*%.'<3*85&'<3*8523*% G)"5"5.1"/&'<3*8523*% B(.%,"5'<3*85&'<3*8523*% H3*-$8+*&'+%,)(,73*/+/ 9%$*/+&'<3*8523*% &'6$%#":&'<3*8523*% 9%,8-(2/&'<3*8523*% I*/,)(*%.8:&'<3*8523*% 0$1"8*3'#$/"-3$&'<3*8523*% J*3373*55$%&'<3*8523*% C))$85":&'<3*8523*% K2,5$821&'<3*8523*% 01*33'"8+$/+"8$&'<3*8523*% 6,3,8&'<3*8523*% L$-+21&'<3*8523*% 93*55$%&'2%,+($3"*3'-$33/ M*33,)"*8'+27$&'<3*8523*% =+$%2/'>&'<3*8523*% =+$%2/'>>&'<3*8523*% Supplementary figure 5. One of 3000 dendrograms of protein expression patterns based on protein-encoded genes from 200 randomly chosen antibodies. The colored bar indicates the tissues with similar functional annotation as described in figure S1. The mean cophenetic correlation similarity of 3000 generated dendrograms, using a set of 200 antibodies was 0.90.

8 +,-./0!"#!"$!"%!"&!"'!"(!")!"*!"#$%&'($)*+,-.+$/ 0$*%+'12/-3$&'1.,-.+$/ 4+%"*+$5'12/-3$!"#$%&'63*7523*% 41,,+('12/-3$ 8#*%.&'/+%,1* 9+$%2/':&'$75,1$+%"*3'/+%,1* 9+$%2/'::&'$75,1$+%"*3'/+%,1* ;%*"7&'(")),-*1)2/&'63"*3 ;%*"7'-$%$<%*3'-,%+$=!$>+'#$7+%"-3$&'63"*3?$%$<$3321&'6%*723*%'3*.$%?$%$<$3321&'7$2%,7/?$%$<$3321&'1,3$-23*%'3*.$% ;%*"7'-$%$<%*3'-,%+$=&'7$2%,7/ A,7/"3&'/B2*1,2/ 8%*3'12-,/*&'/B2*1,2/ 4C"7':&'$)"5$%1*3 D/,)(*62/&'/B2*1,2/ 4C"7'::&'$)"5$%1*3?$%#"=&'/B2*1,2/ F*7-%$*/&'"/3$+'-$33/ A$/+"/&'/$1"7">$%2/'52-+!276&')7$21,-.+$/ F3*-$7+*&'5$-"52*3'-$33/ F*7-%$*/&'63*7523*% 4*3"#*%.'63*75&'63*7523*% A$/+"/&'3$.5"6'-$33/ A(.%,"5'63*75&'63*7523*% G5%$7*3 F*%*+(.%,"5'63*75&'63*7523*% 4+,1*-(':&'63*7523*% 4+,1*-('::&'63*7523*% H2,5$721&'63*7523*% 41*33'"7+$/+"7$&'63*7523*%?,3,7&'63*7523*% G))$75"=&'63*7523*% I$-+21&'63*7523*% 4$1"7*3'#$/"-3$&'63*7523*% D)"5"5.1"/&'63*7523*% J*33<3*55$%&'63*7523*% F%,/+*+$&'63*7523*% ;%$*/+&'63*7523*% F3*-$7+*&'+%,)(,<3*/+/ ;%,7-(2/&'63*7523*% K*/,)(*%.7=&'63*7523*% L*33,)"*7'+2<$&'63*7523*% ;3*55$%&'2%,+($3"*3'-$33/ &'?$%#"=&'63*7523*% 9+$%2/':&'63*7523*% 9+$%2/'::&'63*7523*%!276&'1*-%,)(*6$/ ;,7$'1*%%,M 4)3$$7&'%$5')23) 4)3$$7&'M("+$')23) G))$75"=&'3.1)(,-.+$/!.1)('7,5$&',+($%'3.1)(,"5'-$33/ A,7/"3&',+($%'3.1)(,"5'-$33/!.1)('7,5$&'6$%1"7*3'-$7+$% A,7/"3&'6$%1"7*3'-$7+$% Supplementary figure 6. One of 3000 dendrograms of protein expression patterns based on protein-encoded genes from 200 randomly chosen antibodies. The colored bar indicates the tissues with similar functional annotation as described in figure S1. The mean cophenetic correlation similarity of 3000 generated dendrograms, using a set of 200 antibodies, was 0.90.

9 Supplementary figure 7. Comparison of expression levels for 65 cell types with various sub-fractions of antibodies. The fraction (%) of cells in which a particular protein was detected, including the fraction of cells with the relative expression levels strong (red), moderate (orange) and weak (yellow). Antibodies are arranged according to abundance of the corresponding protein target with cell type-specific proteins to the left and housekeeping proteins to the right. (A) Data from all antibodies (n = 5,936) used in this study. (B) Data from antibodies with a supportive western blot (n = 920). (C) Data from paired antibodies generated to different parts of the same protein and with a correlation coefficient

10 0.5 (n = 518). (D) Data from paired antibodies generated internally and corresponding to different parts of the same protein and with a correlation coefficient 0.5 (n = 184). The difference in intensity distribution of the antibodies between the three subsets and the full set was found to be non-significant using a chi square test statistic.

11 Supplementary figure 8. Comparison of expression levels for 65 cell types with various subfractions of antibodies. Cells are arranged according to the fraction of proteins detected in normal tissues including the fractions of proteins with the relative expression levels strong (red), moderate (orange) and weak (yellow). The black (top) part of the bar represents antibodies with missing data for the particular cell type. A bar displaying the different color codes representing the six major categories of normal cell types (defined in Figure 1) is shown for each cell type. The name of each cell type is shown in table S2 and the results for the various sub-fractions of antibodies are presented in figure S3. (A) Data from all antibodies used in this study (n = 5,936). (B) Data from antibodies with a supportive western blot (n = 920). (C) Data from paired antibodies generated to different parts of the same protein and with a correlation coefficient 0.5 (n = 518). (D) Data from paired antibodies generated internally and corresponding to different parts of the same protein and with a correlation coefficient 0.8 (n = 184). The difference in intensity distribution of the antibodies between the three subsets and the full set was found to be non-significant using a multiple test corrected chi square test statistic for all tissues.

12

13 Supplementary figure 9. Comparison of expression levels for 45 cell lines with various subfractions of antibodies. The fraction (%) of proteins detected in a specific cell line, including the fraction of the three different staining categories: strong (red), moderate (orange) and weak (yellow). The black (top) part of the bar represents antibodies with missing data for the particular cell line. The corresponding name and number of each cell line is shown in Supplementary Table S3. Results from three different datasets are shown: (A) All antibodies (n = 5,352), ordered according to the fraction of proteins detected. (B) Antibodies with supportive results from Western blot analysis (n = 872). (C) Paired antibodies generated to the same protein and with a correlation coefficient 0.5 (n = 310). The correlation coefficients are based on protein expression data which has been normalized using cellular size (5).

14 Supplementary Table S1. The sets of antibodies used in the study. Antibody group Study Fig No of antibodies All tissues Tissues 1,2 Western Tissues S4-S5 Correlation > 0.5 Tissues S4-S5 Correlation > 0.5, HPA paired Tissues Chromosome 9 Tissues S1 Chromosome 10 Tissues S2 Chromosome 22 Tissues S3 Random selection of antibodies Random selection of antibodies Random selection of antibodies Tissues Tissues Tissues All Cells 3 S4-S5 S4 S5 S6 Western Cells S6 Correlation > 0.5 Cells S6 Correlation > 0.8 Cells S6 All IF Cells 4 Number of target genes 5,934 4, Average = 199 Average = 199 Average = 199 5,349 4, ,249 2,179 Comment All antibodies in the study for analysis of 65 cell types All antibodies with a single band of predicted size on Western blot Paired antibodies with cell profiling correlation larger than 0.5 All paired HPA antibodies with cell profiling correlation larger than 0,5 Antibodies towards proteins encoded on human chromosome 9 Antibodies towards proteins encoded on human chromosome 10 Antibodies towards proteins encoded on human chromosome 22 Antibodies selected by random sampling of the complete set of antibodies Antibodies selected by random sampling of the complete set of antibodies Antibodies selected by random sampling of the complete set of antibodies All antibodies used for analysis of 45 cell lines using IHC Antibodies with a single band of predicted size on Western blot Paired antibodies with cell profiling corrleation larger than 0.5 Paired antibodies with cell profiling corrleation larger than 0.8 Antibodies used for analysis of 3 cell lines using confocal microscopy

15 Supplementary Table S2. The total fraction of proteins detected in the 65 human cell types from 48 tissues. The numbers refer to order of cells in Figure 2B. The sub fraction of antibodies is listed in table S1. All refers to all 5,936 antibodies used for analysis of 65 cell types, Western refers to the 920 antibodies with a single band of predicted size using a standardized Western blot assay, Paired refers to the 518 antibodies selected due to the high correlation ( 0.5) of these paired antibodies in the analysis of the cells and HPA are the sub fraction of 92 highly correlated ( 0.5) paired antibodies which has been generated in-house with distinct and non-overlapping epitopes. Supplementary Table S3. The total fraction of proteins detected in the 45 human cell lines. The numbers refer to order of cells in Figure 3B. The sub fraction of antibodies is listed in table S1. All refers to all 5,352 antibodies used for analysis of 45 cell lines, Western refers to the 872 antibodies with a single band of predicted size using a standardized Western blot assay, Corr. > 0.5 refers to the 310 antibodies selected due to the high correlation ( 0.5) of these paired antibodies in the analysis of the cells and Corr. 0.8 are the sub fraction of 88 highly correlated (> 0.8) paired antibodies. Supplementary Table S4. A list of 74 proteins only detected in a single cell type. The query was conducted to show proteins with strong binding exclusively in one of the analyzed 65 cell types (see table S2) and not more than five cells with weak staining.