[HTML][HTML] Proteome sampling by the HLA class I antigen processing pathway

I Hoof, D van Baarle, WH Hildebrand… - PLoS computational …, 2012 - journals.plos.org
I Hoof, D van Baarle, WH Hildebrand, C Keşmir
PLoS computational biology, 2012journals.plos.org
The peptide repertoire that is presented by the set of HLA class I molecules of an individual
is formed by the different players of the antigen processing pathway and the stringent
binding environment of the HLA class I molecules. Peptide elution studies have shown that
only a subset of the human proteome is sampled by the antigen processing machinery and
represented on the cell surface. In our study, we quantified the role of each factor relevant in
shaping the HLA class I peptide repertoire by combining peptide elution data, in silico …
The peptide repertoire that is presented by the set of HLA class I molecules of an individual is formed by the different players of the antigen processing pathway and the stringent binding environment of the HLA class I molecules. Peptide elution studies have shown that only a subset of the human proteome is sampled by the antigen processing machinery and represented on the cell surface. In our study, we quantified the role of each factor relevant in shaping the HLA class I peptide repertoire by combining peptide elution data, in silico predictions of antigen processing and presentation, and data on gene expression and protein abundance. Our results indicate that gene expression level, protein abundance, and rate of potential binding peptides per protein have a clear impact on sampling probability. Furthermore, once a protein is available for the antigen processing machinery in sufficient amounts, C-terminal processing efficiency and binding affinity to the HLA class I molecule determine the identity of the presented peptides. Having studied the impact of each of these factors separately, we subsequently combined all factors in a logistic regression model in order to quantify their relative impact. This model demonstrated the superiority of protein abundance over gene expression level in predicting sampling probability. Being able to discriminate between sampled and non-sampled proteins to a significant degree, our approach can potentially be used to predict the sampling probability of self proteins and of pathogen-derived proteins, which is of importance for the identification of autoimmune antigens and vaccination targets.
PLOS