Database | ID | Description |
---|---|---|
ProSiteProfiles | PS50103 | Zinc finger C3H1-type profile. |
SMART | SM00451 | ZnF_U1_5 |
MobiDBLite | mobidb-lite | consensus disorder prediction |
SMART | SM00356 | c3hfinal6 |
PANTHER | PTHR16465 | NUCLEASE-RELATED |
Gene3D | G3DSA:3.30.160.60 | Classic Zinc Finger |
Pfam | PF06220 | U1 zinc finger |
SUPERFAMILY | SSF57667 | beta-beta-alpha zinc fingers |
Pfam | PF00642 | Zinc finger C-x8-C-x5-C-x3-H type (and similar) |
SUPERFAMILY | SSF90229 | CCCH zinc finger |
Gene3D | G3DSA:4.10.1000.10 | - |
KEGG | K13152 | U11/U12 small nuclear ribonucleoprotein 20 kDa protein |
KOG | KOG3454 | U1 snRNP-specific protein C; [A] |
MapolyID | Mapoly0054s0011 | - |
GO | GO:0005689 | U12-type spliceosomal complex |
GO | GO:0003676 | nucleic acid binding |
GO | GO:0046872 | metal ion binding |
GO | GO:0008270 | zinc ion binding |
Gene symbol | Product | Transcript ID | Status |
---|---|---|---|
MpC3H18 | transcription factor, C3H | - | Provisional |
Sequences: |
Gene UTR + CDS + intron |
>Mp4g15460.1 AGGAGCGCGC TGACGAGTTG CTGCGGACTG TGGCTGCCTC CCCCTGTTGG TATTGGCTCG ATCCTGTTGC TGCTGTTTAC TGGTGAGTCT CTTGCTCGGG AATGTCTCCA TCATCAGCTT GTGGGTATTA TATCGATGAG AGCTATGGCA TTCCACCTGA TGGACTGGTT AGGAAATTCA GCCGAAATTC AGGCCTTTGT CATCGAGAAA TCCTTCATGC GTATCTGAAA GAATAGAGGA TTCGGAAGAT TTCAAGGTTC GGTTGAGGAT CAATGGGCTG TGACGAGATG TGGTTCGTTT GTCATCCAAT GACTGAGGGT TTAGACGGAA GTACAACCCA TGTCAGCTAA GGGTCAAGGG TTCAGCACTT TGTCATCACC ACGATGCAGA GCATCACTTC ATAATTTGTT TGTGGGCCTA GGTTTTTTGC AGTAGGCTTT TGTTGGCTCT GGAAGAGATT CTCATACCTG CGGGAAATCG CTTCTAGCTT CTCAACAGGT TTTGCGGAAA GAAATGCAGA CCTTAGATTG TGTAGAAATT TCGTGTTTTC AGGCGTAAGA CTTTGTGCTG TGCATTTGCT GCGATGTCAG AGATTACAAT TCATAACAAT TCATGATGGG AAAGGTTCTC GATTCTCAAC TGATCTTTGC AGACGAAAGA GGTGGACACT ACCTGAGATT GGAAGTGAGA AGCGAGAATG CATGCGCTTG CCCAGTCTGA ACGGACTTCA AGCTCACTGA TGAGAAGAAA TTGGACAACC AGCGGGACAC AATATTAGAA GCAACCTTAG GACGTGTACA TGGATCGGGA GAGAGCAGCG CAAAGAAACT TAACCTGTGA GCAACCCCAT AGCAGCCACT TGCAGTTTCT CTAGATACTT TACAAGGGAG GTAGGAGATG CAGATCCTGC AGCCTCTCCT GCACAATCTC TGGTATCGCA ACGCAGAAAT TTATGTTAAG TTTTTCGAAC AATTCTTCCG CTTATGAAGC TCGTGGTCAT GATCTAGGAG GCACGTCTTT CTTGAAAGAG CGCAAGTGTC TCTACGTCCA TGTACCCTCG TAGAGAAGGA TGCCGACCCC AAAATATGTC TGTGACTACT GTGACAAAAC ATTTCATGAT ACACCAGCTT CTCGCAAGCG ACACATGCAA GGAATAACGC ATCAACGAGC TGTGAAAGCT TGGTATGATT CCTTCAGAGG TTAAAATCTT CACCTCCTGA AGCTTTTGTT TCTGCACCTT CAACCTTACT TGCAAAGGAG ACTCGTGTTA AGCTTGTTCA CTCGACCATG TATGCTATCG TGCATTTGCA GATAAAGATC AAGGTGGAAA CCGTGGAGTT TGCGCATATT TCCAGCGAAC GGTATGCTGA TCTATTTAGT GAAAGCTCAT GTGATACAGG CCATAAATTA CGCTCTTCTT TTATGGTGTA GATCTCCATG TTGTGTTCTC TCCCTTGCCT TCGACCACGT TCATCTTTTA GCACAACAAA GAAGTTTATC CTACCGAAAT TGCTGGCTCA AAGTTCATGT GCTTCTCTGC TGGCAACTGG TAACATACAT GATGGTTTGC ACATTACAGG GCACATGCAA TTATGGATCG AATTGTCAGT ACGCGCATAT AACACATTCA GGTCCTTCTT ATGCAGCAGC CGGGATGTCA GGTTAGTACA GAGGTGGAGA TCCGTTCTGA TTCTCGTGAT TCTCTGTTGT GAGATAATCA AACCAGCTCA TCATATCAAG TAGATTACAA AATCTGACAA CTGGAGCTGG GTTTGGATTA GCCACCAGAA AATGTAGCAT 
TGACCAGTAG TATACACCTA ATCATGACGC ACAGGCATAG TCTTCTTCAC CTTCACATCG ACCTAATTTG CAGCATATGC GAGTAAAATA TCTTCGTCTG GTGCATCCGG TCAAAACAAA ATGAGATGTT TCTGTTATGA GGTGTTTAAA GAGATTGGTT TCGGAACTGG AATATAGTAG GCACAGCTAT AGATGTCTAC GATGAAAAGA AATACTGTCC TGATGCGCAA ATTGTCGGAC TTCAGCAGTA AAGAGAATAG TGGATATTTT GCAGTGGATG AACAAACTTT GTGCATTCGT GCTTGTTACA GTGTCAAACA TTTCTCAAAG TGACTTGGGG CCTTCTCCAA CGAATGCAGC GCATGGGGCG CCCTCAGCTA CTTCCGCATC TGTTCCTGGT AGGTATTCTT GTCCGCTCTT TCGGTGGAGT TTTTTATCAT CTTCAATTCG GAATGTGCTT GGAACTAAAG ACTTCTGCCA AAGCCAACAG TAACAGGTGT AACACTGAAA GTAGGTCAGA GTTGTGATAC TGATACTTTT CTTCTGTGGC ATATGGTGGT TCATCGCCAG TAAGCAGAGT TGCTGAGAAC AAACTTCCTC CCTCTCTACA ACCTCCGCCA GAAGATGGTT ATCCATCTCT ACCTTTCGTA GATTGGGGTT AGTGATTTAG AGAAGCCAAA GTACGTATTG ACACAACATC TCACATGCTT TCTCGCTTTC CTTAGACTTT CGAAGTGAAA TTTTGCAGTC CCAGGTTTAA AGCTTAGCAA ATATCACAAA TTTATCACTC GAAACTCTCG AACGGAACGA AAAT |
mRNA UTR + CDS |
>Mp4g15460.1 AGGAGCGCGC TGACGAGTTG CTGCGGACTG TGGCTGCCTC CCCCTGTTGG TATTGGCTCG ATCCTGTTGC TGCTGTTTAC TGACGAAAGA GGTGGACACT ACCTGAGATT GGAAGTGAGA AGCGAGAATG CATGCGCTTG CCCAGTCTGA ACGGACTTCA AGCTCACTGA TGAGAAGAAA TTGGACAACC AGCGGGACAC AATATTAGAA GCAACCTTAG GACGTGTACA TGGATCGGGA GAGAGCAGCG CAAAGAAACT TAACCTGAGG CACGTCTTTC TTGAAAGAGC GCAAGTGTCT CTACGTCCAT GTACCCTCGT AGAGAAGGAT GCCGACCCCA AAATATGTCT GTGACTACTG TGACAAAACA TTTCATGATA CACCAGCTTC TCGCAAGCGA CACATGCAAG GAATAACGCA TCAACGAGCT GTGAAAGCTT GGTATGATTC CTTCAGAGAT AAAGATCAAG GTGGAAACCG TGGAGTTTGC GCATATTTCC AGCGAACGGG CACATGCAAT TATGGATCGA ATTGTCAGTA CGCGCATATA ACACATTCAG GTCCTTCTTA TGCAGCAGCC GGGATGTCAG TGTCAAACAT TTCTCAAAGT GACTTGGGGC CTTCTCCAAC GAATGCAGCG CATGGGGCGC CCTCAGCTAC TTCCGCATCT GTTCCTGTAA GCAGAGTTGC TGAGAACAAA CTTCCTCCCT CTCTACAACC TCCGCCAGAA GATGGTTATC CATCTCTACC TTTCGTAGAT TGGGGTTAGT GATTTAGAGA AGCCAAAGTA CGTATTGACA CAACATCTCA CATGCTTTCT CGCTTTCCTT AGACTTTCGA AGTGAAATTT TGCAGTCCCA GGTTTAAAGC TTAGCAAATA TCACAAATTT ATCACTCGAA ACTCTCGAAC GGAACGAAAA T |
CDS |
>Mp4g15460.1 ATGCCGACCC CAAAATATGT CTGTGACTAC TGTGACAAAA CATTTCATGA TACACCAGCT TCTCGCAAGC GACACATGCA AGGAATAACG CATCAACGAG CTGTGAAAGC TTGGTATGAT TCCTTCAGAG ATAAAGATCA AGGTGGAAAC CGTGGAGTTT GCGCATATTT CCAGCGAACG GGCACATGCA ATTATGGATC GAATTGTCAG TACGCGCATA TAACACATTC AGGTCCTTCT TATGCAGCAG CCGGGATGTC AGTGTCAAAC ATTTCTCAAA GTGACTTGGG GCCTTCTCCA ACGAATGCAG CGCATGGGGC GCCCTCAGCT ACTTCCGCAT CTGTTCCTGT AAGCAGAGTT GCTGAGAACA AACTTCCTCC CTCTCTACAA CCTCCGCCAG AAGATGGTTA TCCATCTCTA CCTTTCGTAG ATTGGGGTTA G |
Protein |
>Mp4g15460.1 MPTPKYVCDY CDKTFHDTPA SRKRHMQGIT HQRAVKAWYD SFRDKDQGGN RGVCAYFQRT GTCNYGSNCQ YAHITHSGPS YAAAGMSVSN ISQSDLGPSP TNAAHGAPSA TSASVPVSRV AENKLPPSLQ PPPEDGYPSL PFVDWG |
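
The CDS and protein listings above can be cross-checked against each other and against the Pfam PF00642 description (C-x8-C-x5-C-x3-H). The following is a minimal sketch, not part of the original record: it assumes the standard nuclear genetic code, and the helper names (`clean`, `translate`, `find_ccch`) are hypothetical and chosen only for illustration. The regular expression is a literal reading of the Pfam description text, not the profile model that Pfam itself uses.

```python
import re

# Sequences copied from the Mp4g15460.1 CDS and Protein listings above.
CDS = """
ATGCCGACCC CAAAATATGT CTGTGACTAC TGTGACAAAA CATTTCATGA TACACCAGCT
TCTCGCAAGC GACACATGCA AGGAATAACG CATCAACGAG CTGTGAAAGC TTGGTATGAT
TCCTTCAGAG ATAAAGATCA AGGTGGAAAC CGTGGAGTTT GCGCATATTT CCAGCGAACG
GGCACATGCA ATTATGGATC GAATTGTCAG TACGCGCATA TAACACATTC AGGTCCTTCT
TATGCAGCAG CCGGGATGTC AGTGTCAAAC ATTTCTCAAA GTGACTTGGG GCCTTCTCCA
ACGAATGCAG CGCATGGGGC GCCCTCAGCT ACTTCCGCAT CTGTTCCTGT AAGCAGAGTT
GCTGAGAACA AACTTCCTCC CTCTCTACAA CCTCCGCCAG AAGATGGTTA TCCATCTCTA
CCTTTCGTAG ATTGGGGTTA G
"""

PROTEIN = """
MPTPKYVCDY CDKTFHDTPA SRKRHMQGIT HQRAVKAWYD SFRDKDQGGN RGVCAYFQRT
GTCNYGSNCQ YAHITHSGPS YAAAGMSVSN ISQSDLGPSP TNAAHGAPSA TSASVPVSRV
AENKLPPSLQ PPPEDGYPSL PFVDWG
"""

# Standard genetic code (assumption), built from the usual TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Literal reading of "C-x8-C-x5-C-x3-H": fixed spacings, any residue for x.
CCCH = re.compile(r"C.{8}C.{5}C.{3}H")


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listings."""
    return "".join(seq.split()).upper()


def translate(nt: str) -> str:
    """Translate a nucleotide sequence codon by codon ('*' marks a stop)."""
    nt = clean(nt)
    usable = len(nt) - len(nt) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[nt[i:i + 3]] for i in range(0, usable, 3))


def find_ccch(protein: str) -> list[tuple[int, int, str]]:
    """Return (start, end, match) with 1-based inclusive coordinates."""
    return [(m.start() + 1, m.end(), m.group())
            for m in CCCH.finditer(clean(protein))]


if __name__ == "__main__":
    translated = translate(CDS)
    print("CDS matches protein:", translated.rstrip("*") == clean(PROTEIN))
    for start, end, hit in find_ccch(PROTEIN):
        print(f"CCCH-like motif at {start}-{end}: {hit}")
```

Under these assumptions, the translated CDS agrees with the listed protein, and the pattern matches a single stretch in the middle of the sequence. A match from a simple regular expression is only a sanity check; the domain assignments in the table come from the respective database models.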