Database | ID | Description |
---|---|---|
FunFam | G3DSA:1.10.20.10:FF:000085 | Histone H3.2 |
PRINTS | PR00622 | Histone H3 signature |
PANTHER | PTHR11426 | HISTONE H3 |
SMART | SM00428 | h35 |
Pfam | PF00125 | Core histone H2A/H2B/H3/H4 |
Gene3D | G3DSA:1.10.20.10 | Histone, subunit A |
MobiDBLite | mobidb-lite | consensus disorder prediction |
SUPERFAMILY | SSF47113 | Histone-fold |
ProSitePatterns | PS00959 | Histone H3 signature 2. |
KOG | KOG1745 | Histones H3 and H4; N-term missing; [B] |
MapolyID | Mapoly0089s0012 | - |
GO | GO:0046982 | protein heterodimerization activity |
GO | GO:0030527 | structural constituent of chromatin |
GO | GO:0000786 | nucleosome |
GO | GO:0003677 | DNA binding |
Gene symbol | Product | Transcript ID | Status |
---|---|---|---|
MpcenH3 | Chromatin related protein | Provisional |
Sequences: |
Gene UTR + CDS + intron |
>Mp3g22050.1 CCTGGACACG CGCCCGTTCT CAAGCCTTGA AAGAACACGT GGCACCGGCT TGGGAACTGT GTTGCATGGT TGGTAATCTA CTCTGACCGG CAGGGGCTTT TGCGGACGAC TGGTCGTCGT AGGTCACAAG AGTCTGGGGC CTTGGCAAGA TTTCCCGCCA AAGAGCGCAC GAACATTGGC GGGAACGATC GACAGGGTCG CAAAGTCCCG AAGACGCTAG AGCAGTTCGC ACCTATTTCG ATTGTGAGTC GGGGCCAAGT AAGCACGTCG AATCCTGAAG TCTCGAAATC TGCCCACGAT AGTCGATTGG AGAGGTGCTG GCAGATGCAA AAATGACGGA GGCTGTGGAT TGGTGACGTC ATCGTCTGGG TTCTTGATCG CTCACTTTTT CACTCGGATT ACGACCCCGG ACGAGGAAAT CAGGTCTCGG GTTTTAGGTG ATTGATGGGT CTCGACAGTG TGTTCGGGAT TTGCATGTTG ATCGTTAGCA GTCAACCGGG AATTCATGGA CGGAATATGT TCGGGAACAG AGGACAGGAT CCTTTCATAG TGACGGTCCC AATCGATCAA TTCGCCTCCT AATAAAGGGA CCCAAACTAT CAGGGTGGAG ATTTCGAATG CCACTTAGTG AAAGGAAATC AAGGCTCGGG TGGTGAGAAA TGCTCGACAG TATTCGGCTT AGTGATGGTG GAACTTAGGT CCAGTTTAGG AAAACAGATC TCACAATAGA ACAGATCGTT CTTGGTTTCT TGTGCCGATC TAGCCCTTCA GCGCAGCTGC CCTTCAGCTT CATTTGGATC ATTCATTGAC GGTTCTATCA AATGCAGCTG GTGACCTTTG GTGGCCCTTG TTCTACCATG ATGATTAGAA GAAGCTTGGA AACCTGCACG TGCATCCGTT TGTGTCGAAC TGCCTTATAA CTTCCTAACT GATTTTTTGG GGATGTGATT ATTGCTCTGT ATGTGAGGAG ACGCGACGGC TTGACATGTA AGAAACTCGT TTCCGCATAA ATAGCTCTCG ATGATATTTT GTAGCTTTTA TTGAACCGTA GGGTTTTCGT GGCTTGTGTG TGAAGATGTA TTTCCCAAGA TGGTCTTAGG ATCGTTGTGC CGATTCGGGA AATGTAGACT GATTTGTATG AGTGAAGGGT TTCGTCGGAC TCAGAGTAAT TGTTAAGGCA TCTGTTTATG TCGAGTTTTT CTGAATTTCA GCTGGATGAC CGATCCGATG GAGGAAATGC AGCGATTTAT GTTCTCCTGT TTTCTTGGAG TATCGGGAAG TTGTAGACTT TGACCTGTGT GAAAGTTTCT TTCTTCGTCT TTTGCCTCAA AGCAGTTTCA TCCCAGGACA GCTGGAAATA GAGTTGGGGG TCTGCTGCTT CTCCGTTTCT CTGATTCTGA AATCATCTTG GGTGATCGCA TGTGCTGCTC CTCCTCCTGC TTGCCTTCTT CTTCATTCTC CTACTCTTCC TGTTCTACTT GTTGTCGCTG TTTCTCGTGG TAGTGTTGTT GTTGTTAACC TTTCCTTTTT TCCAACTGCA TCCATCGGTC CATACAAGAA ATCTCTGCAC ACCCAAAGTA CACAGCTGAT CGTGGGCATG TGTTGCTGCT GCAGGAGGAG GTCTGGAGTT CGAGGAAAAG CTAAGGACTA GCTGTCGAGC CAAAACAAAG GGAACAGAGT GTCACCGGAC AATCATGTCC ACCAGGGAAA AGAAACACTG AAAGAGTATT ATCTTTTTTG TGACCTTTGG AGATCAGTGG ATTGATGTAA AGGTGCTATG GCCAGAAGGA AAACAATTCC GCGGCGAAAC CAACGTGACA ACAGGCGTAA TGCAGGGGCA GCCCCATCAC CAGCATCAGG TGGCCGAGCA CCGCCAGGTA GGCGTAGGGC GAGACAGGGC ACGGTCGCAT TGCGCGAGAT TCGTCATTAT CAAAAGACAT TCGAACTGTT GTTAAGAGCC CTTCCATTCG CCCGATTGGT AAGGGAGATT GGACAAAGTG TATCGAATAG AGTCACTCGT TGGACAGCTG AAGCCCTTGT AGCTCTTCAA GAAGCTGTAG AAGATTACAT TGTCCATCTG TTTGAGGATA CAAATCTCTG TGCTATACAC GCTAAACGGG TGACAATCAT GCCCAAGGAT TTGCACCTTG CAAGACGCAT TCGTGGTGTT AATCAGTAGT AAGATTAGAT TGTTCGGTAG CAAATTAATC GCCAGCTTTT GATTCTGGTA CTAGGTAGCG GTCTGGAATT GAGACGAATT GGACATGCCT TAAAGGTATG GGGGCATCAG ATTTTCTTGT AAGCCTACTC ATTTCCGTTG TGTGCAGTCT TACTCACAGT TAGCAATGCA CACATTCTGA AGACGTATGT CACTCACGAA ACCTAAAGAA TTACATGGAC CTCCACAGAC GTCCAGTTCT TGCAACTTCC AGCGACACTT TCTATAATTG AAAGACCTTT TACACTGC |
mRNA UTR + CDS |
>Mp3g22050.1 CCTGGACACG CGCCCGTTCT CAAGCCTTGA AAGAACACGT GGCACCGGCT TGGGAACTGT GTTGCATGGT TGGTAATCTA CTCTGACCGG CAGGGGCTTT TGCGGACGAC TGGTCGTCGT AGGTCACAAG AGTCTGGGGC CTTGGCAAGA TTTCCCGCCA AAGAGCGCAC GAACATTGGC GGGAACGATC GACAGGGTCG CAAAGTCCCG AAGACGCTAG AGCAGTTCGC ACCTATTTCG ATTGTGAGTC GGGGCCAAGT AAGCACGTCG AATCCTGAAG TCTCGAAATC TGCCCACGAT AGTCGATTGG AGAGGTGCTG GCAGATGCAA AAATGACGGA GGCTGTGGAT TGGTGACGTC ATCGTCTGGG TTCTTGATCG CTCACTTTTT CACTCGGATT ACGACCCCGG ACGAGGAAAT CAGGTCTCGG GTTTTAGGTG ATTGATGGGT CTCGACAGTG TGTTCGGGAT TTGCATGTTG ATCGTTAGCA GTCAACCGGG AATTCATGGA CGGAATATGT TCGGGAACAG AGGACAGGAT CCTTTCATAG TGACGGTCCC AATCGATCAA TTCGCCTCCT AATAAAGGGA CCCAAACTAT CAGGGTGGAG ATTTCGAATG CCACTTAGTG AAAGGAAATC AAGGCTCGGG TGGTGAGAAA TGCTCGACAG TATTCGGCTT AGTGATGGTG GAACTTAGGT CCAGTTTAGG AAAACAGATC TCACAATAGA ACAGATCGTT CTTGGTTTCT TGTGCCGATC TAGCCCTTCA GCGCAGCTGC CCTTCAGCTT CATTTGGATC ATTCATTGAC GGTTCTATCA AATGCAGCTG GTGACCTTTG GTGGCCCTTG TTCTACCATG ATGATTAGAA GAAGCTTGGA AACCTGCACG TGCATCCGTT TGTGTCGAAC TGCCTTATAA CTTCCTAACT GATTTTTTGG GGATGTGATT ATTGCTCTGT ATGTGAGGAG ACGCGACGGC TTGACATGAG GAGGTCTGGA GTTCGAGGAA AAGCTAAGGA CTAGCTGTCG AGCCAAAACA AAGGGAACAG AGTGTCACCG GACAATCATG TCCACCAGGG AAAAGAAACA CTGAAAGAGT ATTATCTTTT TTGTGACCTT TGGAGATCAG TGGATTGATG TAAAGGTGCT ATGGCCAGAA GGAAAACAAT TCCGCGGCGA AACCAACGTG ACAACAGGCG TAATGCAGGG GCAGCCCCAT CACCAGCATC AGGTGGCCGA GCACCGCCAG GTAGGCGTAG GGCGAGACAG GGCACGGTCG CATTGCGCGA GATTCGTCAT TATCAAAAGA CATTCGAACT GTTGTTAAGA GCCCTTCCAT TCGCCCGATT GGTAAGGGAG ATTGGACAAA GTGTATCGAA TAGAGTCACT CGTTGGACAG CTGAAGCCCT TGTAGCTCTT CAAGAAGCTG TAGAAGATTA CATTGTCCAT CTGTTTGAGG ATACAAATCT CTGTGCTATA CACGCTAAAC GGGTGACAAT CATGCCCAAG GATTTGCACC TTGCAAGACG CATTCGTGGT GTTAATCAGT AGTAAGATTA GATTGTTCGG TAGCAAATTA ATCGCCAGCT TTTGATTCTG GTACTAGGTA GCGGTCTGGA ATTGAGACGA ATTGGACATG CCTTAAAGGT ATGGGGGCAT CAGATTTTCT TGTAAGCCTA CTCATTTCCG TTGTGTGCAG TCTTACTCAC AGTTAGCAAT GCACACATTC TGAAGACGTA TGTCACTCAC GAAACCTAAA GAATTACATG GACCTCCACA GACGTCCAGT TCTTGCAACT TCCAGCGACA CTTTCTATAA TTGAAAGACC TTTTACACTG C |
CDS |
>Mp3g22050.1 ATGGCCAGAA GGAAAACAAT TCCGCGGCGA AACCAACGTG ACAACAGGCG TAATGCAGGG GCAGCCCCAT CACCAGCATC AGGTGGCCGA GCACCGCCAG GTAGGCGTAG GGCGAGACAG GGCACGGTCG CATTGCGCGA GATTCGTCAT TATCAAAAGA CATTCGAACT GTTGTTAAGA GCCCTTCCAT TCGCCCGATT GGTAAGGGAG ATTGGACAAA GTGTATCGAA TAGAGTCACT CGTTGGACAG CTGAAGCCCT TGTAGCTCTT CAAGAAGCTG TAGAAGATTA CATTGTCCAT CTGTTTGAGG ATACAAATCT CTGTGCTATA CACGCTAAAC GGGTGACAAT CATGCCCAAG GATTTGCACC TTGCAAGACG CATTCGTGGT GTTAATCAGT AG |
Protein |
>Mp3g22050.1 MARRKTIPRR NQRDNRRNAG AAPSPASGGR APPGRRRARQ GTVALREIRH YQKTFELLLR ALPFARLVRE IGQSVSNRVT RWTAEALVAL QEAVEDYIVH LFEDTNLCAI HAKRVTIMPK DLHLARRIRG VNQ |
Sequences: |
Gene UTR + CDS + intron |
>Mp3g22050.2 GTGGCTTGTG TGTGAAGATG TATTTCCCAA GATGGTCTTA GGATCGTTGT GCCGATTCGG GAAATGTAGA CTGATTTGTA TGAGTGAAGG GTTTCGTCGG ACTCAGAGTA ATTGTTAAGG CATCTGTTTA TGTCGAGTTT TTCTGAATTT CAGCTGGATG ACCGATCCGA TGGAGGAAAT GCAGCGATTT ATGTTCTCCT GTTTTCTTGG AGTATCGGGA AGTTGTAGAC TTTGACCTGT GTGAAAGTTT CTTTCTTCGT CTTTTGCCTC AAAGCAGTTT CATCCCAGGA CAGCTGGAAA TAGAGTTGGG GGTCTGCTGC TTCTCCGTTT CTCTGATTCT GAAATCATCT TGGGTGATCG CATGTGCTGC TCCTCCTCCT GCTTGCCTTC TTCTTCATTC TCCTACTCTT CCTGTTCTAC TTGTTGTCGC TGTTTCTCGT GGTAGTGTTG TTGTTGTTAA CCTTTCCTTT TTTCCAACTG CATCCATCGG TCCATACAAG AAATCTCTGC ACACCCAAAG TACACAGCTG ATCGTGGGCA TGTGTTGCTG CTGCAGGAGG AGGTCTGGAG TTCGAGGAAA AGCTAAGGAC TAGCTGTCGA GCCAAAACAA AGGGAACAGA GTGTCACCGG ACAATCATGT CCACCAGGGA AAAGAAACAC TGAAAGAGTA TTATCTTTTT TGTGACCTTT GGAGATCAGT GGATTGATGT AAAGGTGCTA TGGCCAGAAG GAAAACAATT CCGCGGCGAA ACCAACGTGA CAACAGGCGT AATGCAGGGG CAGCCCCATC ACCAGCATCA GGTGGCCGAG CACCGCCAGG TAGGCGTAGG GCGAGACAGG GCACGGTCGC ATTGCGCGAG ATTCGTCATT ATCAAAAGAC ATTCGAACTG TTGTTAAGAG CCCTTCCATT CGCCCGATTG GTAAGGGAGA TTGGACAAAG TGTATCGAAT AGAGTCACTC GTTGGACAGC TGAAGCCCTT GTAGCTCTTC AAGAAGCTGT AGAAGATTAC ATTGTCCATC TGTTTGAGGA TACAAATCTC TGTGCTATAC ACGCTAAACG GGTGACAATC ATGCCCAAGG ATTTGCACCT TGCAAGACGC ATTCGTGGTG TTAATCAGTA GTAAGATTAG ATTGTTCGGT AGCAAATTAA TCGCCAGCTT TTGATTCTGG TACTAGGTAG CGGTCTGGAA TTGAGACGAA TTGGACATGC CTTAAAGGTA TGGGGGCATC AGATTTTCTT GTAAGCCTAC TCATTTCCGT TGTGTGCAGT CTTACTCACA GTTAGCAATG CACACATTCT GAAGACGTAT GTCACTCACG AAACCTAAAG AATTACATGG ACCTCCACAG ACGTCCAGTT CTTGCAACTT CCAGCGACAC TTTCTATAAT TGAAAGACCT TTTACACTGC |
mRNA UTR + CDS |
>Mp3g22050.2 GTGGCTTGTG TGTGAAGATG TATTTCCCAA GATGGTCTTA GGATCGTTGT GCCGATTCGG GAAATGTAGA CTGATTTGTA TGAGTGAAGG GTTTCGTCGG ACTCAGAGTA ATTGTTAAGG CATCTGTTTA TGTCGAGTTT TTCTGAATTT CAGCTGGATG ACCGATCCGA TGGAGGAAAT GCAGCGATTT ATGTTCTCCT GTTTTCTTGG AGTATCGGGA AGTTGTAGAC TTTGACCTGT GTGAAAGTTT CTTTCTTCGT CTTTTGCCTC AAAGCAGTTT CATCCCAGGA CAGCTGGAAA TAGAGTTGGG GGTCTGCTGC TTCTCCGTTT CTCTGATTCT GAAATCATCT TGGGTGATCG CATGAGGAGG TCTGGAGTTC GAGGAAAAGC TAAGGACTAG CTGTCGAGCC AAAACAAAGG GAACAGAGTG TCACCGGACA ATCATGTCCA CCAGGGAAAA GAAACACTGA AAGAGTATTA TCTTTTTTGT GACCTTTGGA GATCAGTGGA TTGATGTAAA GGTGCTATGG CCAGAAGGAA AACAATTCCG CGGCGAAACC AACGTGACAA CAGGCGTAAT GCAGGGGCAG CCCCATCACC AGCATCAGGT GGCCGAGCAC CGCCAGGTAG GCGTAGGGCG AGACAGGGCA CGGTCGCATT GCGCGAGATT CGTCATTATC AAAAGACATT CGAACTGTTG TTAAGAGCCC TTCCATTCGC CCGATTGGTA AGGGAGATTG GACAAAGTGT ATCGAATAGA GTCACTCGTT GGACAGCTGA AGCCCTTGTA GCTCTTCAAG AAGCTGTAGA AGATTACATT GTCCATCTGT TTGAGGATAC AAATCTCTGT GCTATACACG CTAAACGGGT GACAATCATG CCCAAGGATT TGCACCTTGC AAGACGCATT CGTGGTGTTA ATCAGTAGTA AGATTAGATT GTTCGGTAGC AAATTAATCG CCAGCTTTTG ATTCTGGTAC TAGGTAGCGG TCTGGAATTG AGACGAATTG GACATGCCTT AAAGGTATGG GGGCATCAGA TTTTCTTGTA AGCCTACTCA TTTCCGTTGT GTGCAGTCTT ACTCACAGTT AGCAATGCAC ACATTCTGAA GACGTATGTC ACTCACGAAA CCTAAAGAAT TACATGGACC TCCACAGACG TCCAGTTCTT GCAACTTCCA GCGACACTTT CTATAATTGA AAGACCTTTT ACACTGC |
CDS |
>Mp3g22050.2 ATGGCCAGAA GGAAAACAAT TCCGCGGCGA AACCAACGTG ACAACAGGCG TAATGCAGGG GCAGCCCCAT CACCAGCATC AGGTGGCCGA GCACCGCCAG GTAGGCGTAG GGCGAGACAG GGCACGGTCG CATTGCGCGA GATTCGTCAT TATCAAAAGA CATTCGAACT GTTGTTAAGA GCCCTTCCAT TCGCCCGATT GGTAAGGGAG ATTGGACAAA GTGTATCGAA TAGAGTCACT CGTTGGACAG CTGAAGCCCT TGTAGCTCTT CAAGAAGCTG TAGAAGATTA CATTGTCCAT CTGTTTGAGG ATACAAATCT CTGTGCTATA CACGCTAAAC GGGTGACAAT CATGCCCAAG GATTTGCACC TTGCAAGACG CATTCGTGGT GTTAATCAGT AG |
Protein |
>Mp3g22050.2 MARRKTIPRR NQRDNRRNAG AAPSPASGGR APPGRRRARQ GTVALREIRH YQKTFELLLR ALPFARLVRE IGQSVSNRVT RWTAEALVAL QEAVEDYIVH LFEDTNLCAI HAKRVTIMPK DLHLARRIRG VNQ |