Database | ID | Description |
---|---|---|
CDD | cd00074 | H2A |
PRINTS | PR00620 | Histone H2A signature |
FunFam | G3DSA:1.10.20.10:FF:000009 | Histone H2A |
Pfam | PF16211 | C-terminus of histone H2A |
MobiDBLite | mobidb-lite | consensus disorder prediction |
Pfam | PF00125 | Core histone H2A/H2B/H3/H4 |
ProSitePatterns | PS00046 | Histone H2A signature. |
PANTHER | PTHR23430 | HISTONE H2A |
SUPERFAMILY | SSF47113 | Histone-fold |
SMART | SM00414 | h2a4 |
Gene3D | G3DSA:1.10.20.10 | Histone, subunit A |
KEGG | K11251 | histone H2A |
KOG | KOG1756 | Histone 2A; [B] |
MapolyID | Mapoly0090s0046 | - |
GO | GO:0030527 | structural constituent of chromatin |
GO | GO:0000786 | nucleosome |
GO | GO:0003677 | DNA binding |
GO | GO:0046982 | protein heterodimerization activity |
GO | GO:0000790 | chromatin |
GO | GO:0006342 | heterochromatin formation |
Gene symbol | Product | Transcript ID | Status |
---|---|---|---|
MpH2A.X.2 | Chromatin related protein | Provisional |
Sequences: |
Gene UTR + CDS + intron |
>Mp4g21750.1 AAAGGACGAA TGAAGAATGA ATGAGAGCCC AAGTGGCACT CCGGATTTGC CTGGATCCCG AATCTCGAGT CTGGGTACGA TCGGAACAAG TCATTTTGGA GGCGGAAATG CGCGGGTCGT GGTCAGACTC ATGCCCAGGC GCATCTGACA TGCACTTCAA GAGGGAGGCG AAGAGTCCTG TGGAGATCTC CGTCTCAAGT TTAGGTTGCG AGGCAAGCAG GATAACATAT GAGTAACTGT TTGCGAGTCT CGTTAGAATC GGTGGGAAGA CGAGTGCGTG GATTGACGAA ATTGTTTCTG GCAGTGGGTC TGACCATCTG GCGAGCGATT GTGACTGCGC TGTCTGACGT ATTCTTCTTC TTCGTCCAGT TCATCCCCTG CTCTTTGTGT TCGGACACCA AGGCCGAAGA AGTCGGCGAT CTGCAAGCCG CCCGGATTTT TTAGAATGAC GGCCGCAGGA GGAGGAGGCA GGGGCAAGGC GAAAGGGTCC AAATCCGTGT CTCGATCCCA GAAAGCAGGT CTGCAGTTTC CTGTGGGGAG AATTGCGAGG TTTCTGAAGG CGGGAAAGTA TGCTGAGCGA GTCGGAGCTG GAGCCCCCGT CTATTTAGCT GCAGTTATGG AATATCTGGC TGCCGAGGTT CTATATCCCC GCGCTTTGTG AATTGAAGAC CCCGTGGGCA TCCTTACATG CTGCTCTGTA TAAGTAGAAA CGCTCTGCGT CGTGAGTTTG GAATCCGTCG AACGCGTAAT TGTCTTACTG CGGCTGCTTG CCTTTAGAAC ATAGAAGTCT AGAGCCAGCG TAAAACGAAT GCATTCTGTG CAGGTCTTGA TAGATCGAGG TTCAACCAAG AGGATGCTTT TAGAATCAAG ATGCTAAGAA GACGTTGTTA GATCTGTACT GTATATTCTT CTTGTCAGTA TTTTTTGTGT ATTCTTCTGT CTTTTGGTGT CGCCGAATCT AACGGGGGCA GTAATTCAAT GTCTGGCCAA GAATTTCTTC AGTGAGGATT GTGTATTCTT ACTCTGTAGA AACATGTAAT GGCTCGTGAG AATGTAGGTG TTGGAATTAG CCGGGAATGC TGCGAGAGAC AACAAGAAGT CCAGGATCAT TCCGCGCCAT ATTCAGCTGG CTGTGAGAAA TGACGAGGAG CTCAGCAAAT TACTGGGTAC GGTGGTTATT GCGAATGGTG GCGTTCTACC CAACATTCAC AGCTCCCTCC TCCCGAAGAA GACAGGCAAG GGTGGGAAAG GAGAGATAGA GGGTATGTCT CAAGAATTTT GAGACTCTCC AGGCTCCAGC GGCTGTAATT TTTTGCTTTC ACCAACAGGT GTTCAACAGC TAGAGTTAGT TGACTCTGAT CGCCCTTTGA GATTGACAAG AACGATTTTT CTCTGATCTT TGATCATTTC TCTCTTATGT AAATTTTTTC CTATTAACGA AGAGCTTGTT TGTTCCTTTG GAGCGGTACA CAAATCCAAT CAAGTCCCTC ACGCCTGGTG AAACTCTTGT AGAAGGGAGA GATCGTTTTT GCCTTTCTCA CAGCAAAAAA AGCCAGATTA GTTCACTCA |
mRNA UTR + CDS |
>Mp4g21750.1 AAAGGACGAA TGAAGAATGA ATGAGAGCCC AAGTGGCACT CCGGATTTGC CTGGATCCCG AATCTCGAGT CTGGGTACGA TCGGAACAAG TCATTTTGGA GGCGGAAATG CGCGGGTCGT GGTCAGACTC ATGCCCAGGC GCATCTGACA TGCACTTCAA GAGGGAGGCG AAGAGTCCTG TGGAGATCTC CGTCTCAAGT TTAGGTTGCG AGGCAAGCAG GATAACATAT GAGTAACTGT TTGCGAGTCT CGTTAGAATC GGTGGGAAGA CGAGTGCGTG GATTGACGAA ATTGTTTCTG GCAGTGGGTC TGACCATCTG GCGAGCGATT GTGACTGCGC TGTCTGACGT ATTCTTCTTC TTCGTCCAGT TCATCCCCTG CTCTTTGTGT TCGGACACCA AGGCCGAAGA AGTCGGCGAT CTGCAAGCCG CCCGGATTTT TTAGAATGAC GGCCGCAGGA GGAGGAGGCA GGGGCAAGGC GAAAGGGTCC AAATCCGTGT CTCGATCCCA GAAAGCAGGT CTGCAGTTTC CTGTGGGGAG AATTGCGAGG TTTCTGAAGG CGGGAAAGTA TGCTGAGCGA GTCGGAGCTG GAGCCCCCGT CTATTTAGCT GCAGTTATGG AATATCTGGC TGCCGAGGTG TTGGAATTAG CCGGGAATGC TGCGAGAGAC AACAAGAAGT CCAGGATCAT TCCGCGCCAT ATTCAGCTGG CTGTGAGAAA TGACGAGGAG CTCAGCAAAT TACTGGGTAC GGTGGTTATT GCGAATGGTG GCGTTCTACC CAACATTCAC AGCTCCCTCC TCCCGAAGAA GACAGGCAAG GGTGGGAAAG GAGAGATAGA GGGTATGTCT CAAGAATTTT GAGACTCTCC AGGCTCCAGC GGCTGTAATT TTTTGCTTTC ACCAACAGGT GTTCAACAGC TAGAGTTAGT TGACTCTGAT CGCCCTTTGA GATTGACAAG AACGATTTTT CTCTGATCTT TGATCATTTC TCTCTTATGT AAATTTTTTC CTATTAACGA AGAGCTTGTT TGTTCCTTTG GAGCGGTACA CAAATCCAAT CAAGTCCCTC ACGCCTGGTG AAACTCTTGT AGAAGGGAGA GATCGTTTTT GCCTTTCTCA CAGCAAAAAA AGCCAGATTA GTTCACTCA |
CDS |
>Mp4g21750.1 ATGACGGCCG CAGGAGGAGG AGGCAGGGGC AAGGCGAAAG GGTCCAAATC CGTGTCTCGA TCCCAGAAAG CAGGTCTGCA GTTTCCTGTG GGGAGAATTG CGAGGTTTCT GAAGGCGGGA AAGTATGCTG AGCGAGTCGG AGCTGGAGCC CCCGTCTATT TAGCTGCAGT TATGGAATAT CTGGCTGCCG AGGTGTTGGA ATTAGCCGGG AATGCTGCGA GAGACAACAA GAAGTCCAGG ATCATTCCGC GCCATATTCA GCTGGCTGTG AGAAATGACG AGGAGCTCAG CAAATTACTG GGTACGGTGG TTATTGCGAA TGGTGGCGTT CTACCCAACA TTCACAGCTC CCTCCTCCCG AAGAAGACAG GCAAGGGTGG GAAAGGAGAG ATAGAGGGTA TGTCTCAAGA ATTTTGA |
Protein |
>Mp4g21750.1 MTAAGGGGRG KAKGSKSVSR SQKAGLQFPV GRIARFLKAG KYAERVGAGA PVYLAAVMEY LAAEVLELAG NAARDNKKSR IIPRHIQLAV RNDEELSKLL GTVVIANGGV LPNIHSSLLP KKTGKGGKGE IEGMSQEF |