Database | ID | Description |
---|---|---|
Gene3D | G3DSA:1.10.20.10 | Histone, subunit A |
Pfam | PF16211 | C-terminus of histone H2A |
SUPERFAMILY | SSF47113 | Histone-fold |
Pfam | PF00125 | Core histone H2A/H2B/H3/H4 |
PANTHER | PTHR23430 | HISTONE H2A |
SMART | SM00414 | h2a4 |
CDD | cd00074 | H2A |
ProSitePatterns | PS00046 | Histone H2A signature. |
PRINTS | PR00620 | Histone H2A signature |
FunFam | G3DSA:1.10.20.10:FF:000009 | Histone H2A |
KEGG | K11251 | histone H2A |
KOG | KOG1756 | Histone 2A; [B] |
MapolyID | Mapoly0007s0226 | - |
GO | GO:0030527 | structural constituent of chromatin |
GO | GO:0003677 | DNA binding |
GO | GO:0000786 | nucleosome |
GO | GO:0046982 | protein heterodimerization activity |
GO | GO:0000790 | chromatin |
GO | GO:0006342 | heterochromatin formation |
Gene symbol | Product | Transcript ID | Status |
---|---|---|---|
MpH2A | Chromatin related protein | Provisional |
Sequences: |
Gene UTR + CDS + intron |
>Mp3g02370.1 CCGACGTAGC CTCGAAGGAG ACAAACATTC ACCTTGCATC TTCGTCATTT GAATTCGAAC TCGATACTGA GATCCAGATT CAGCTCGTCC GCGTCTTCCA ACAGACAGAC TCAGCTCCGT TCGAGAAGTC GGCGCGATGT CTCGAGGAAA GGTCACTGGC AAGAAGACTG TGTCCCGCAG CCAGAAGGCA GGCTTGCAGT TCCCTGTGGG CAGAATTGCC CGATTCCTGA AGAAAGGAAG ATATGCGACA CGAGTCGGCG CAGGAGCTCC AGTTTATCTC GCTGCAGTCT TGGAGTATCT CGCAGCAGAG GTCGGTTTCG GATTCGATGC AATTCGATTC GTCTTTGGCG TTGCGGTTAG TTCTTCTGGC GAGTATTTTC TCGTGGTTGG TCCGGGCTAA GCTTGTTGTG ACGATGTTGT TCAGGTCCTG GAGCTCGCAG GCAATGCGTC AAGAGACAAC AAGAAGACCA GAATTGGGCC CCGTCACATG CAGCTCGCTG TGAGAAATGA CGAGGAGTTA AGCAAGCTGC TCGCCAGTGT GACAATTGCC AATGGTGGTG TTCTCCCAAA TATCCACTCA GTGTTGCTGC CCAAGAAGTC AGGAAAACCT GCCCTCTCGC TCGCCGAGGG AAAAGAGTAG ATTCTTGCTT GATGTGGAGA TCGCCGACGT CTGGAAGATC GATCTCAGAA ATGGTTCGTA GAAATAATTC AGTTATGGGT GCCCACATGC CGTCCAGATG ATGTTTCCTC ACAAACGTCT CCCACAGATG CCGAAGTAGA GAGTTCAGCA GCCTGCTGAA ATATTAGATA CAACAAAGAG AACGAAAGAT GTGAATATTA CCGACACCCT GATGCTCAAC GCGGGAGATG TCATGTACAA ATGTTGAGCC CAATTTCTGA ATCAAATATC GGAATAGATA AATTTTTGTT ACGAGAT |
mRNA UTR + CDS |
>Mp3g02370.1 CCGACGTAGC CTCGAAGGAG ACAAACATTC ACCTTGCATC TTCGTCATTT GAATTCGAAC TCGATACTGA GATCCAGATT CAGCTCGTCC GCGTCTTCCA ACAGACAGAC TCAGCTCCGT TCGAGAAGTC GGCGCGATGT CTCGAGGAAA GGTCACTGGC AAGAAGACTG TGTCCCGCAG CCAGAAGGCA GGCTTGCAGT TCCCTGTGGG CAGAATTGCC CGATTCCTGA AGAAAGGAAG ATATGCGACA CGAGTCGGCG CAGGAGCTCC AGTTTATCTC GCTGCAGTCT TGGAGTATCT CGCAGCAGAG GTCCTGGAGC TCGCAGGCAA TGCGTCAAGA GACAACAAGA AGACCAGAAT TGGGCCCCGT CACATGCAGC TCGCTGTGAG AAATGACGAG GAGTTAAGCA AGCTGCTCGC CAGTGTGACA ATTGCCAATG GTGGTGTTCT CCCAAATATC CACTCAGTGT TGCTGCCCAA GAAGTCAGGA AAACCTGCCC TCTCGCTCGC CGAGGGAAAA GAGTAGATTC TTGCTTGATG TGGAGATCGC CGACGTCTGG AAGATCGATC TCAGAAATGG TTCGTAGAAA TAATTCAGTT ATGGGTGCCC ACATGCCGTC CAGATGATGT TTCCTCACAA ACGTCTCCCA CAGATGCCGA AGTAGAGAGT TCAGCAGCCT GCTGAAATAT TAGATACAAC AAAGAGAACG AAAGATGTGA ATATTACCGA CACCCTGATG CTCAACGCGG GAGATGTCAT GTACAAATGT TGAGCCCAAT TTCTGAATCA AATATCGGAA TAGATAAATT TTTGTTACGA GAT |
CDS |
>Mp3g02370.1 ATGTCTCGAG GAAAGGTCAC TGGCAAGAAG ACTGTGTCCC GCAGCCAGAA GGCAGGCTTG CAGTTCCCTG TGGGCAGAAT TGCCCGATTC CTGAAGAAAG GAAGATATGC GACACGAGTC GGCGCAGGAG CTCCAGTTTA TCTCGCTGCA GTCTTGGAGT ATCTCGCAGC AGAGGTCCTG GAGCTCGCAG GCAATGCGTC AAGAGACAAC AAGAAGACCA GAATTGGGCC CCGTCACATG CAGCTCGCTG TGAGAAATGA CGAGGAGTTA AGCAAGCTGC TCGCCAGTGT GACAATTGCC AATGGTGGTG TTCTCCCAAA TATCCACTCA GTGTTGCTGC CCAAGAAGTC AGGAAAACCT GCCCTCTCGC TCGCCGAGGG AAAAGAGTAG |
Protein |
>Mp3g02370.1 MSRGKVTGKK TVSRSQKAGL QFPVGRIARF LKKGRYATRV GAGAPVYLAA VLEYLAAEVL ELAGNASRDN KKTRIGPRHM QLAVRNDEEL SKLLASVTIA NGGVLPNIHS VLLPKKSGKP ALSLAEGKE |