Abstract
The upstream open reading frame (uORF) is a post-transcriptional regulatory element in the 5′ untranslated region (5′UTR), which modulates the translation levels of main open reading frame (mORF). Earlier studies showed that disturbed uORF-mediated translation control can result in drastic changes in translation levels of mORF, leading to genetic disorders. To date, there has been no systematic investigation into the relationship between variations in patients and uORF status. Here, taking the advantage of several datasets, including gene ontology (GO) annotations and sequence feature analysis, we have examined uORF impacts in human transcripts. GO annotations indicate that uORF-containing genes are enriched in certain features such as oncogenes and transcription factors. Sequence feature analysis reveals that uORF is a factor for determination of the translation initiation site (TIS) in human transcripts. We show that genes with uORFs have lower protein expression levels than genes without uORFs in multiple human tissues. Moreover, by examining three disease variation databases, we identified uORF-altering mutations from a total of 3,740,225 variations, which are highly suspected to be associated with changed levels of gene expression. For an experimental validation, we found four mutations with significant effects on protein expression but with only modest changes in transcription levels. These findings will provide researchers on related diseases with new insights into the importance of known mutations.
Similar content being viewed by others
Notes
UCSC human genome annotation database (hg19, GRCh37), http://hgdownload.cse.ucsc.edu/goldenpath/hg19/database/.
Human mRNA collection, ftp://ftp.ncbi.nlm.nih.gov/refseq/H_sapiens/mRNA_Prot/.
UCSC human genome annotation database (hg19, GRCh37), http://hgdownload.cse.ucsc.edu/goldenPath/hg19/chromosomes/.
Gene Ontology Consortium, http://geneontology.org/.
Human Proteome Map, http://www.humanproteomemap.org/.
ClinVar, ftp://ftp.ncbi.nlm.nih.gov/pub/clinvar/.
TCGA, http://cancergenome.nih.gov/.
AnimalTFDB, http://www.bioguo.org/AnimalTFDB/index.php.
Comprehensive Perl Archive Network, http://www.cpan.org/.
Python, https://www.python.org/.
The Comprehensive R Archive Network, http://cran.r-project.org/.
NCBI Genome Remapping Service, http://www.ncbi.nlm.nih.gov/genome/tools/remap.
Promega, Madison, Wisconsin, USA.
Life Technologies, NY, USA.
TaKaRa Biotechnology, Dalian, China.
Toyobo, Osaka, Japan.
References
Andrews SJ, Rothnagel JA (2014) Emerging evidence for functional peptides encoded by short open reading frames. Nat Rev Genet 15(3):193–204
Ashburner M, Ball CA, Blake JA et al (2000) Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat Genet 25(1):25–29
Barbosa C, Peixeiro I, Romão L (2013) Gene expression regulation by upstream open reading frames and human disease. PLoS Genet 9(8):e1003529
Calvo SE, Pagliarini DJ, Mootha VK (2009) Upstream open reading frames cause widespread reduction of protein expression and are polymorphic among humans. Proc Natl Acad Sci USA 106(18):7507–7512
Chin L, Hahn WC, Getz G, Meyerson M (2011) Making sense of cancer genomic data. Genes Dev 25(6):534–555
Forbes SA, Beare D, Gunasekaran P et al (2014) COSMIC: exploring the world’s knowledge of somatic mutations in human cancer. Nucleic Acids Res 43(database issue):D805–D811
Goujon M, McWilliam H, Li W (2010) A new bioinformatics analysis tools framework at EMBL-EBI. Nucleic Acids Res. doi:10.1093/nar/gkq313
Hayden CA, Jorgensen RA (2007) Identification of novel conserved peptide uORF homology groups in Arabidopsis and rice reveals ancient eukaryotic origin of select groups and preferential association with transcription factor-encoding genes. BMC Biol 5:32
Holmes M, Itaas V, Ananvoranich S (2014) Sustained translational repression of lactate dehydrogenase 1 in Toxoplasma gondii bradyzoites is conferred by a small regulatory RNA hairpin. FEBS J 281(22):5077–5091
Jiang X, Lu X, McNamara G et al (2010) HGAL, a germinal center specific protein, decreases lymphoma cell motility by modulation of the RhoA signaling pathway. Blood 116(24):5217–5227
Jiménez-González AS, Fernández N, Martínez-Salas E, Sánchez de Jiménez E (2014) Functional and structural analysis of maize hsp101 IRES. PLoS One 9(9):e107459
Kim MS, Pinto SM, Getnet D et al (2014) A draft map of the human proteome. Nature 509(7502):575–581
Kozak M (1987) An analysis of 5′-noncoding sequences from 699 vertebrate messenger RNAs. Nucleic Acids Res 15(20):8125–8148
Kozak M (1991) An analysis of vertebrate mRNA sequences: intimations of translational control. J Cell Biol 115(4):887–903
Larkin MA, Blackshields G, Brown NP (2007) Clustal W and Clustal X version 2.0. Bioinformatics 23(21):2947–2948
Lu X, Kazmierczak K, Jiang X et al (2011) Germinal center-specific protein human germinal center associated lymphoma directly interacts with both myosin and actin and increases the binding of myosin to actin. FEBS J 278(11):1922–1931
Marcos T, Ruiz-Martín V, de la Puerta ML et al (2014) Proline-serine-threonine phosphatase interacting protein 1 inhibition of T-cell receptor signaling depends on its SH3 domain. FEBS J 281(17):3844–3854
Morris DR, Geballe AP (2000) Upstream open reading frames as regulators of mRNA translation. Mol Cell Biol 20(23):8635–8642
Munzarová V, Pánek J, Gunišová S et al (2011) Translation reinitiation relies on the interaction between eIF3a/TIF32 and progressively folded cis-acting mRNA elements preceding short uORFs. PLoS Genet 7(7):e1002137
Oner R, Agarwal S, Dimovski AJ et al (1991) The G—A mutation at position +22 3′ to the Cap site of the beta-globin gene as a possible cause for a beta-thalassemia. Hemoglobin 15(1–2):67–76
Pichon X, Wilson LA, Stoneley M et al (2012) RNA binding protein/RNA element interactions and the control of translation. Curr Protein Pept Sci 13(4):294–304
Roy B, Vaughn JN, Kim BH et al (2010) The h subunit of eIF3 promotes reinitiation competence during translation of mRNAs harboring upstream open reading frames. RNA 16(4):748–761
Sachs MS, Geballe AP (2006) Downstream control of upstream open reading frames. Genes Dev 20(8):915–921
Somers J, Pöyry T, Willis AE (2013) A perspective on mammalian upstream open reading frame function. Int J Biochem Cell Biol 45(8):1690–1700
Spriggs KA, Bushell M, Willis AE (2010) Translational regulation of gene expression during conditions of cell stress. Mol Cell 40(2):228–237
Starnes TW, Bennin DA, Bing X et al (2014) The F-BAR protein PSTPIP1 controls extracellular matrix degradation and filopodia formation in macrophages. Blood 123(17):2703–2714
Wen Y, Liu Y, Xu Y et al (2009) Loss-of-function mutations of an inhibitory upstream ORF in the human hairless transcript cause Marie Unna hereditary hypotrichosis. Nat Genet 41(2):228–233
Wethmar K (2014) The regulatory potential of upstream open reading frames in eukaryotic gene expression. Wiley Interdiscip Rev RNA 5(6):765–768
Wethmar K, Smink JJ, Leutz A (2010) Upstream open reading frames: molecular switches in (patho) physiology. BioEssays 32(10):885–893
Acknowledgments
The authors would like to thank Hao Zhu, Dun Liu, Jiang Li and Weiyan Shen for helpful discussions. This study was supported by National Natural Science Foundation of China (NSFC)-Guangdong Joint Fund (No. U1201222), National Public Health Grand Research Foundation (Nos. 201202017 and 201302001), Science and Technology Program of Guangzhou (201300000095) and the National Key Technology Research and Development Program (2012BAI09B01).
Author information
Authors and Affiliations
Corresponding author
Electronic supplementary material
Below is the link to the electronic supplementary material.
Rights and permissions
About this article
Cite this article
Ye, Y., Liang, Y., Yu, Q. et al. Analysis of human upstream open reading frames and impact on gene expression. Hum Genet 134, 605–612 (2015). https://doi.org/10.1007/s00439-015-1544-7
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00439-015-1544-7