C2orf3

2017-05-01   Masato Takimoto , Mao Peizhong 

Identity

HGNC
LOCATION
2p12
IMAGE
Atlas Image
LEGEND
Structural Relation among C2orf3, GCF and GCF2. GCF is an artificial cDNA composed of GCF2 and C2orf3
LOCUSID
ALIAS
C2orf3,DNABF,GCF,TCF9

Abstract

Review on C2orf3, with data on DNA, on the protein encoded, and where the gene isimplicated.

DNA/RNA

Description

Genomic DNA
Genomic size of DNA is about 50 kbp and consists of 17 exons. >
cDNA
GCF was originally discovered as a gene that encodes a transcriptional repressor that binds to a GC-rich sequence of the regulatory regions of human EGF-R gene (Kageyama and Pastan, 1989). However, a following study showed that GCF cDNA is an artificial fusion molecule between two different cDNAs; The 5 side of the cDNA encodes a highly basic region with the sequence-specific binding activity to the GC-rich region and the 3 side encodes most of C2orf3 protein (Takimoto et al., 1999). Reed et al. cloned a full length of the former cDNA that has transcriptional repressor activity with the sequence-specific DNA binding ability, and named this cDNA as GCF2. Following studies had confirmed the transcriptional repressive activity of GCF2 ( Shibutani M et al., 1998: Khachigian LM et al., 1999). The full length of C2orf3 cDNA was cloned and its sequence was determined. The cDNA is composed of 2661 nucleotides and encodes a protein of 781 amino acids. The encoded protein does not contain the highly basic region of the published GCF and has no sequence-specific DNA binding activity to the GC-rich sequence (Takimoto et al., 1999) ( GenBank : AB026911 )
cDNA sequence:
GGGCGGGCACTGAAGCTGCGGCTTGCGGTTCAGCGGGTTCTAGGGCGCCGGGCGCTCGGGCCTCGGCCATGGCTCACAGGCCGAAAAGGACTTTTCGGCA
GCGCGCGGCTGATTCCAGCGACAGCGATGGCGCCGAGGAGTCGCCTGCTGAGCCTGGGGCGCCGAGGGAACTTCCGGTCCCGGGTTCTGCGGAGGAAGAG
CCGCCCTCTGGAGGAGGCCGCGCGCAGGTGGCGGGACTGCCCCACCGGGTTCGGGGCCCTCGTGGCCGGGGCCGGGTCTGGGCGAGCTCCCGGCGTGCCA
CCAAAGCGGCTCCCCGCGCGGACGAAGGCTCAGAATCCAGAACCCTTGATGTGTCCACAGATGAAGAGGATAAAATACATCACTCCTCAGAAAGTAAGGA
TGATCAGGGTTTGTCTTCTGACAGTTCTAGCTCTCTTGGAGAAAAAGAACTTTCATCAACAGTTAAGATCCCAGATGCAGCTTTTATTCAGGCAGCCCGC
AGAAAACGTGAATTGGCCAGGGCCCAAGATGACTATATTTCTTTGGATGTACAACATACCTCCTCCATCTCTGGTATGAAGAGAGAGAGCGAAGATGACC
CTGAGAGTGAGCCTGATGACCATGAAAAGAGAATACCATTTACTCTAAGACCTCAAACACTTAGACAAAGGATGGCTGAGGAATCAATAAGCAGAAATGA
AGAAACAAGTGAAGAAAGTCAGGAAGATGAAAAGCAAGATACTTGGGAACAACAGCAAATGAGGAAAGCAGTTAAAATCATAGAGGAAAGAGACATAGAT
CTTTCCTGTGGCAGTGGATCTTCAAAAGTGAAGAAATTTGATACTTCCATTTCATTTCCGCCAGTAAATTTAGAAATTATAAAGAAGCAATTAAATACTA
GATTAACATTACTACAGGAAACTCACCGCTCACACCTGAGGGAGTATGAAAAATACGTACAAGATGTCAAAAGCTCAAAGAGTACCATCCAGAACCTAGA
GAGTTCATCAAATCAAGCTCTAAATTGTAAATTCTATAAAAGCATGAAAATTTATGTGGAAAATTTAATTGACTGCCTTAATGAAAAGATTATCAACATC
CAAGAAATAGAATCATCCATGCATGCACTCCTTTTAAAACAAGCTATGACCTTTATGAAACGCAGGCAAGATGAATTAAAACATGAATCAACGTATTTAC
AACAGTTATCACGCAAAGATGAGACATCCACAAGTGGAAACTTCTCAGTAGATGAAAAAACTCAGTGGATTTTAGAAGAGATTGAATCTCGAAGGACAAA
AAGAAGACAAGCAAGGGTGCTTTCTGGGAATTGTAACCATCAGGAAGGAACATCTAGTGATGATGAACTGCCTTCAGCAGAGATGATTGACTTCCAAAAA
AGCCAAGGTGACATTTTACAGAAACAGAAGAAAGTTTTTGAAGAAGTGCAAGATGATTTTTGTAACATCCAGAATATTTTGTTGAAATTTCAGCAATGGC
GAGAAAAGTTTCCTGACTCCTATTATGAAGCTTTCATTAGTTTATGCATACCAAAGCTTTTAAATCCCCTAATACGAGTTCAGTTGATTGATTGGAATCC
TCTTAAGTTGGAATCCACAGGTTTAAAAGAGATGCCATGGTTCAAATCTGTAGAAGAATTTATGGATAGCAGTGTAGAAGATTCAAAGAAGGAAAGTAGT
TCAGATAAAAAAGTCTTGTCTGCAATCATCAACAAAACAATTATTCCCCGACTTACAGACTTTGTAGAATTCCTTTGGGATCCTTTGTCAACCTCACAGA
CAACAAGTTTAATAACACATTGCAGAGTGATTCTTGAAGAACATTCCACTTGTGAAAATGAAGTTAGTAAAAGCAGACAGGATTTACTTAAATCCATTGT
TTCAAGAATGAAAAAGGCAGTAGAAGATGATGTTTTTATTCCTCTGTATCCAAAGAGTGCTGTAGAAAACAAAACATCACCTCATTCAAAGTTCCAAGAA
AGACAGTTCTGGTCAGGCCTAAAGCTCTTCCGCAATATTCTTCTTTGGAATGGACTCCTTACAGATGACACCTTGCAAGAACTAGGACTAGGGAAGCTGC
TAAATCGTTACCTTATTATAGCACTTCTCAATGCCACACCTGGGCCAGATGTGGTTAAAAAGTGCAACCAGGTAGCAGCATGTCTACCAGAAAAATGGTT
TGAAAATTCTGCCATGAGGACATCTATTCCACAGCTAGAAAACTTCATTCAGTTTTTATTGCAGTCTGCACATAAATTATCTAGAAGTGAATTCAGGGAT
GAAGTCGAAGAAATAATTCTTATTTTGGTGAAAATAAAAGCTTTGAATCAAGCAGAATCCTTCATAGGAGAGCATCACCTAGACCATCTTAAATCACTAA
TTAAAGAAGATTGAATAAACTTTATTGGAAAATGCTAAAATTTTAATATAGTTACACTCAGTTCCTTTGTTTGAGAAGAAGCTGGTGCCTCTCTCTTCTT
TATTCCCTGTAATAGAAGGTAGGATTTGAAAAAAAGCAGGACTCCACCTCTGTATTCCCCCGTGCTTTACCTTCTGGCATCATGAAAAGCTGCCATGATT
CTGTGGTGTTCTAAGGAATTAAATGCACTGGAGCTTTAAGAGCTCAACGTGTTTCCCTTTG

Transcription

It is suggested that MRPL1 gene which is located closely to C2orf3 with head-head orientation and that they are co-regulated (Anthoni et al., 2007). There are several alternative spliced forms for mRNA expression: The initial report on a cDNA that encompass the full coding region is 2661bp in length (Takimoto et al., 1999).

Proteins

Description

Encodes 781 amino acids
Amino acid sequence: (Uniprot : P16383)
MAHRPKRTFRQRAADSSDSDGAEESPAEPGAPRELPVPGSAEEEPPSGGGRAQVAGLPHRVRGPRGRGRVWASSRRATKAAPRADEGSESRTLDVSTDEE
DKIHHSSESKDDQGLSSDSSSSLGEKELSSTVKIPDAAFIQAARRKRELARAQDDYISLDVQHTSSISGMKRESEDDPESEPDDHEKRIPFTLRPQTLRQ
RMAEESISRNEETSEESQEDEKQDTWEQQQMRKAVKIIEERDIDLSCGSGSSKVKKFDTSISFPPVNLEIIKKQLNTRLTLLQETHRSHLREYEKYVQDV
KSSKSTIQNLESSSNQALNCKFYKSMKIYVENLIDCLNEKIINIQEIESSMHALLLKQAMTFMKRRQDELKHESTYLQQLSRKDETSTSGNFSVDEKTQW
ILEEIESRRTKRRQARVLSGNCNHQEGTSSDDELPSAEMIDFQKSQGDILQKQKKVFEEVQDDFCNIQNILLKFQQWREKFPDSYYEAFISLCIPKLLNP
LIRVQLIDWNPLKLESTGLKEMPWFKSVEEFMDSSVEDSKKESSSDKKVLSAIINKTIIPRLTDFVEFLWDPLSTSQTTSLITHCRVILEEHSTCENEVS
KSRQDLLKSIVSRMKKAVEDDVFIPLYPKSAVENKTSPHSKFQERQFWSGLKLFRNILLWNGLLTDDTLQELGLGKLLNRYLIIALLNATPGPDVVKKCN
QVAACLPEKWFENSAMRTSIPQLENFIQFLLQSAHKLSRSEFRDEVEEIILILVKIKALNQAESFIGEHHLDHLKSLIKED

Expression

C2orf3 protein with molecular weight of 89 kD are observed in human cancer cell line, and localizes in nucleoplasm and nucleolus.

Function

C2orf3 plays a role in pre-mRNA splicing, by forming a complex with DHX15 (hPrp43) and TFIP11 (Yoshimoto et al., 2014). As these proteins are present in post-splicing intron complex, C2orf3 protein may play a role in post-splicing turnover of mRNA. The study with and antibody specific to C2ofr3 protein showed that this protein is present nucleoplasm and nucleoli.
After splicing reaction, pre-mRNA releases intron RNA complex, which contains uridine-rich small nuclear RNAs (snRNAs; U1, U2, U4, U5 and U6) PRPF19complex, hnRNP proteins and TFIP11. A RNA helicase hPrp43 removes the several factors from the complex, leaving lariat RNA intron, which is then subject to linearization by a debranching enzyme DBR1 (Wen et al., 2008; Yoshimoto et al., 2009).C2orf3 protein was shown to form a complex with tuftelin-interacting protein (TFIP11) and hPrp43, which play a role in post-splicing turnover of mRNA. Through its amino terminal, TFIP11 binds to a RNA helicase hPrp43 that plays a role in the dissociation of snRNAs from a lariat intron in vitro. C2orf3 preferentially associates with lariat intron in the splicing reaction and C2orf3-deleted nuclear extracts showed a significant repression of splicing of pre-mRNA in vitro (Yoshimoto et al., 2014). The presence of C2orf3 protein in nucleoli suggest a potential role in rRNA processing/or nucleoli structure (Yoshimoto et al., 2014).

Implicated in

Top note
Although it is not conclusive, C2orf3 is suggested to be a causing gene for dyslexia.
Entity name
Dyslexia
Note
The locus containing the C2orf3 gene on Chromosome 2p12 has been shown to link to dyslexia. It had been reported that the genomic regions responsive for human dyslexia, such DYXC1/EKN1, KIAA0319 and DCDC, and RUBO1, are located on human chromosome 15, 6 and 3, respectively (McGrath et al., 2006). In 1999, It was shown that a new region for dyslexia, DYX3, on human chromosome 2 was identified (Fagerheim et al., 1999). Subsequently, a study on Finnish and German families disclosed that DYX3 was present on chromosome 2p12, spanning 157 kbp. It was shown that there are only three genes, FLJ1339, MRPL1 and C2ORF3, in this region and that the latter two genes are closely located with positions in a head-to-head manner respective for transcriptional orientation, suggesting that both genes are transcriptionally co-regulated (Anthoni et al., 2007). Further analyses on several affected families revealed an overlapping region with risk haplotype within the 157 kbp region, delineating to 16 kbp, which located in an intergenic region between FLJ1339 and MRPL1/C2ORF3 genes. There is no SNP marker in the coding regions of MRPL1 and C2ORF3 genes by which coding change correlated with dyslexia, and the expressions of both genes are significant lower in carriers with risk haplotype compared with non-carriers. These results suggested that the 16 kbp region plays a role for transcriptional regulatory element and mutation in this element might lead to reduced expression of the genes, which could be a cause of dyslexia. While the expression of FLJ1339 in human brain, the expressions of MRPL1/C2ORF3 are high and significantly correlated with those of other dyslexia candidate genes of whichexpressions are also high in brain. Especially, the expression of C2ORF3 was correlated across the different parts of brain with those of other dyslexia candidate genes, DYXC1, RUBO1 and DCDC2 (Anthoni et al., 2007). Neuroimaging analyses revealed a significant association between a SNP marker and white matter volume of the posterior parts of the corpus callosum and cingulum (Scerri et al., 2012). In contrary to the studies described above, the studies on the populations of Australia and Inida showed non-significant association for the SNP marker for MRPL1/C2ORF3 with dyslexia (Paracchini et al., 2011 : Venkatesh et al., 2013).

Bibliography

Pubmed IDLast YearTitleAuthors
173098792007A locus on 2p12 containing the co-regulated MRPL19 and C2ORF3 genes is associated to dyslexia.Anthoni H et al
105077211999A new gene (DYX3) for dyslexia is located on chromosome 2.Fagerheim T et al
25562181989Molecular cloning and characterization of a human DNA binding factor that represses transcription.Kageyama R et al
103645631999GC factor 2 represses platelet-derived growth factor A-chain gene transcription and is itself induced by arterial injury.Khachigian LM et al
167818912006Breakthroughs in the search for dyslexia candidate genes.McGrath LM et al
208462472011Analysis of dyslexia candidate genes in the Raine cohort representing the general Australian population.Paracchini S et al
97052901998Molecular cloning and characterization of a transcription regulator with homology to GC-binding factor.Reed AL et al
232097102012The dyslexia candidate locus on 2p12 is associated with general cognitive ability and white matter structure.Scerri TS et al
95069911998Transcriptional down-regulation of epidermal growth factor receptors by nerve growth factor treatment of PC12 cells.Shibutani M et al
239548682013Lack of association between genetic polymorphisms in ROBO1, MRPL19/C2ORF3 and THEM2 with developmental dyslexia.Venkatesh SK et al
191653502008TFIP11 interacts with mDEAH9, an RNA helicase involved in spliceosome disassembly.Wen X et al
243046932014Identification of a novel component C2ORF3 in the lariat-intron complex: lack of C2ORF3 interferes with pre-mRNA splicing via intron turnover pathway.Yoshimoto R et al

Other Information

Locus ID:

NCBI: 6936
MIM: 189901
HGNC: 1317
Ensembl: ENSG00000005436

Variants:

dbSNP: 6936
ClinVar: 6936
TCGA: ENSG00000005436
COSMIC: GCFC2

RNA/Proteins

Gene IDTranscript IDUniprot
ENSG00000005436ENST00000321027P16383
ENSG00000005436ENST00000409857P16383
ENSG00000005436ENST00000427862H7C335
ENSG00000005436ENST00000442309C9JII4
ENSG00000005436ENST00000470285S4R465
ENSG00000005436ENST00000470503P16383
ENSG00000005436ENST00000472230S4R3C1
ENSG00000005436ENST00000492826S4R461
ENSG00000005436ENST00000541687P16383

Expression (GTEx)

0
5
10
15
20

Pathways

PathwaySourceExternal ID
Gene ExpressionREACTOMER-HSA-74160
Processing of Capped Intron-Containing Pre-mRNAREACTOMER-HSA-72203
mRNA SplicingREACTOMER-HSA-72172
mRNA Splicing - Major PathwayREACTOMER-HSA-72163

References

Pubmed IDYearTitleCitations
203796142010Personalized smoking cessation: interactions between nicotine dose, dependence and quit-success genotype score.62
173098792007A locus on 2p12 containing the co-regulated MRPL19 and C2ORF3 genes is associated to dyslexia.42
173098792007A locus on 2p12 containing the co-regulated MRPL19 and C2ORF3 genes is associated to dyslexia.42
208462472011Analysis of dyslexia candidate genes in the Raine cohort representing the general Australian population.24
201892452010Genome-wide association study of childhood acute lymphoblastic leukemia in Korea.16
243046932014Identification of a novel component C2ORF3 in the lariat-intron complex: lack of C2ORF3 interferes with pre-mRNA splicing via intron turnover pathway.7
269154042016Transcriptional regulation of IER5 in response to radiation in HepG2.5

Citation

Masato Takimoto ; Mao Peizhong

C2orf3

Atlas Genet Cytogenet Oncol Haematol. 2017-05-01

Online version: http://atlasgeneticsoncology.org/gene/54327/c2orf3