Submitted on August 7, 2006
Revised on February 12, 2007
Accepted on February 21, 2007
Diversity of translation start sites may define increased complexity of the human short ORFeome
Masaaki Oyama, Hiroko Kozuka-Hata, Yutaka Suzuki, Kentaro Semba, Tadashi Yamamoto, and Sumio Sugano
Institute of Medical Science, University of Tokyo, Tokyo 108-8639
Corresponding Author: moyama{at}ims.u-tokyo.ac.jp
Our previous proteomic analysis of small proteins expressed in human K562 cells provided the first direct evidence of translation of upstream ORFs in human full-length cDNAs (Oyama et al., Genome Res. 2004, 14, 2048-2052). In the present study, we have performed an in-depth proteomic analysis of human K562 and HEK293 cells using a two-dimensional nano-liquid chromatography tandem mass spectrometry system. The results led to the identification of eight protein-coding regions besides 197 small proteins with a theoretical mass less than 20 kDa, which were already annotated coding sequences in the curated mRNA database. In addition to the upstream ORFs in the presumed 5-untranslated regions of mRNAs, bioinformatical analysis based on accumulated 5-end cDNA sequence data provided evidence of novel short coding regions that were likely to be translated from the upstream non-AUG start site or from the new short transcript variants generated by utilization of downstream alternative promoters. Protein expression analysis of the GRINL1A gene revealed that translation from the most upstream start site occurred on the minor alternative splicing transcript, whereas this initiation site was not utilized on the major mRNA, resulting in translation of the downstream ORF from the second initiation codon. These findings reveal a novel post-transcriptional system that can augment the human proteome via the alternative use of diverse translation start sites, coupled with transcriptional regulation through alternative promoters or splicing, leading to increased complexity of short protein-coding regions defined by the human transcriptome.