Database | ID | Description |
---|---|---|
ProSitePatterns | PS01360 | Zinc finger MYND-type signature. |
ProSiteProfiles | PS50280 | SET domain profile. |
Gene3D | G3DSA:6.10.140.2220 | - |
Pfam | PF00856 | SET domain |
Gene3D | G3DSA:2.170.270.10 | SET domain |
Gene3D | G3DSA:1.25.40.10 | Tetratricopeptide repeat domain |
Pfam | PF01753 | MYND finger |
Gene3D | G3DSA:1.10.220.160 | - |
SMART | SM00317 | set_7 |
PANTHER | PTHR12197 | HISTONE-LYSINE N-METHYLTRANSFERASE SMYD |
ProSiteProfiles | PS50865 | Zinc finger MYND-type profile. |
SUPERFAMILY | SSF82199 | SET domain |
KEGG | K11426 | [histone H3]-lysine4/36 N-trimethyltransferase SMYD [EC:2.1.1.354 2.1.1.357] |
KOG | KOG2084 | Predicted histone tail methylase containing SET domain; [B] |
MapolyID | Mapoly0101s0047 | - |
GO | GO:0005634 | nucleus |
GO | GO:0034968 | obsolete histone lysine methylation |
GO | GO:0005515 | protein binding |
GO | GO:0018024 | obsolete histone lysine N-methyltransferase activity |
Gene symbol | Product | Transcript ID | Status |
---|---|---|---|
MpASHH2 | Chromatin related protein | | Provisional |
Sequences: |
Gene UTR + CDS + intron |
>Mp4g21010.1 GCGGCGCTCG GAAGCGGATT TTCGAAAGCA GCGCTCAAGA ATTCATGTGC TCGAAGGACG AGACCTCGAG GTTGCAAAAT ATCGTCCAAT TTTTTTCTAC TGATCCAATC TACGTTCGAG GATACGGAAG CGATCACGGA TTGTTCGCCA GCTTGCGGGA GTTCATTCAG AGCTGAGGAT TGTTCGGTCC GGCGAGGTCG ACCGATTTGC GTTTGGGCCA CGACCACTTT GTTGCAAGGC GATAAGGAGT CTGTGATTAG ATTGCTTCCT TTTTGGAGCG TGACAGGAGT CTGCGAGATT GCAAATCTGG ATTGAGCATT CGATTGCTTG TAAATTTTTA GATGGAGCAG TATTTAGTCG ACCACGGCTT GAAAGTCACT GTTGTCGACA CCAAAGGCCG TTGCCTCGTT GCTGATCGCG ATTTTAGCCC TGGTATGATT CTACAATATT CGCTTTTATC TCTGCTAAAG TGGTCAGACT AATGTCCTCA TTTTTTTGTT ATAGTTTCTG CTTTTTGCTT GATTCAATTG TCTATTGAAT ATAATAATAT AGTGTAAAAT TGTCCTGCTT GAATGGGCGA TTTGGAATCT CGGAAGATTT TCCTGGGTGT GAGGGCGTTG CTCTGTGATT CCGAGTTTTT GTTTGGCTGT CTGCCGTAAG ATGAACGAGC AAGGTGCTGT GTTTCTTTCA GGTGAGATAG TTTTGGACCA AGAGCCCTAC TCATCGGTGC TAGATGCCGA GTCAAAGAGC CTACGATGCG ATGCGTGCTT TAGATGCTCA GAAAATTTGC AGAGATGCTC TGCCTGCAAG AGTGTTAGCT ATTGCTGCAC CAGCTGCCAG GTATCATAAC TGAACTAGGT TCTAAAAAAC CTTCCGCTTA TCAAGATTTA GAACTCTAGA TCTTCAAGCA AGCAAACACA AGGCGTGGAC TCGTCTTGCT TCCTGATAAA CCCTCGCTCT GCACAAGTAC CTTAAAATCA CCAGTATTTT GTTCAAGTGT TCTTACTCTG GTCTAAATAC GGCTTGGATA TTTTCCAGAG GAAGGAGTGG AAACTGCACA AGAGCGAATG TCAGATGATG GTGAAGCTCA GTCAAGCGAA ACAAAAGTTG CCACCTCCGT CATTACGGCT GATAGTTAGG CTTGTCATTA AGAGAAGATT GCAAGCTAGC AATGTGAGTC TTCAACTTTT TCCAGCATGA GATGGAACTT TCGGAGAGAA CCTGGCATTT CATTCTAGTT TTTTCAATTA GTCCAAGTTC TTAGAATGAA GGCCGGTTAC TAGCAATTAC GTTTCACTTT GAGATGTTTG GTTCTGGAAA TGATTGATGA ATTTGGTAGT TAGTGGTCTT TAGTGAAGAT CAGGTGATCG GGTTAAACTT ACCGAACTTT ATCTCCTGGA ATTTGGAGAG GTTAGACGGG AGCAAACTTT AAACTTTGAC AGCTTACTTT GTGCCAAGAC GTGTATCTAG ATTATTCATG TCGAAGGTGT TTATCCCTCC GTGGTAATTC TCTGATTGAA GGATTTTGTC CTATGCTGAC TTATACTTCC CTTAAACCAT GGGTAATGAT TTGACGAACA ATTTTTAAGT TTACTTACAA ATGTTACAAA CGCTGTGCAT GTCCTTTAGG TCTTCCCCCG AACAAACGTC GATAATTTTG AGATTGTGGA GGCTTTACCT ACTCGTATCC TCTTTTGCAG CTTCGATGGC TCTTGTATAT AATTTCTCTT GTCAATTTCT GTGTGCGTTC AGTTTTCTGC ATGTTCAGCT ATCGTGAGCT CCAGCTATCT CAACGTTGTT 
CGTTACTTGT CTTCAGATGC TCATTTCTTC GAACGTTTGT CTTTAAGCTA GAAGAATAGC TGTAAAAGGC CTTTGGAGAC CTGTCCGTTT TATACTTGAC TTCCTACAGA TTTTTCTGAG ACAGGAGAGG AGCGTTTGGT TCTATACGCC CAAATGTCGA ATCTGGTGAA AGCAATTGTT ACTCCCCTTG AGGTGGATCT CAAAGAAACT TGTGAATTGT TCTGCAGGGT AAGTAAGATC GTTCTGTACC CCAAGAAGAG CTGTCTTGCT CTGTCGTATT GCGCTGTAGC AGCCCTGCTT AAAACTGGCA GGATAGTGCA AATTAAAAAG TATAACAAAA AGCTTTTATG ATTCTATGTC GTCAATATTG TCTCCATAAT TCAAGTTATT CTCGCAGATT GCGTGTAATG CACATACCAT TTGTGATGAC GAGTTCAGAC CAGTTGGCAC TGGTTTGTAT CCCGTTGTAT CGATCATCAA CCACAGGTTA GACCATCCCT GTCTCTCCCA TCGTACTATA CCAGGTGTTT AGTTGATGTG ATTATATATG CCATGTGTGC CAAGCCTTAT TTTTTTCCAT CAATATCTGA AACCTCAGTA TCAGGACCGG AAAAGGATTG TAATTTTGAT CAATGGCTTA CTTTAAGATC AGAGAATTTC TAACATGAAC TTCAGCGCAC ATTTTGCAGC TGTGCCCCAA ACTCAGTTCT TTTGTTTGAT GGGAGGAGAG CCATCGTGCG AGCCATACAG AAAATAGACA GAGGAACTGA GGTGAAGTTT GAAAGCTTTG AAACTTTAAT TGCATTTCAT CATAAAAGTT TCAAAGAATA CTATCTCCTG CGGCTATACC ACGCACTTTC AGTAACACAC TACCCTTTGT CAGGAGGGTC CCTGTTTTTA TTTTCACACT AACACTGTCA AACTTTTAAC AGCTGAATGC AGGCGAAGTT CGGACCCAAG CTGAGCATGT CCATTCATTT GATTTGAACG CAGGTGACAT TAAGTTATGT GGAGCTTGCC GGCAGCACCA AAACTCGTCA GAAGGCTCTC AAAGAACAAT ATTATTTCTC TTGTCACTGC ACACGCTGCA TGAATGCGGT CCTTATCTTG ATAACCCTAT CCTGCGTTGT TGTGTAAATC TGCAACTAGT TCCAACCAGA TGGGACATAT TATCTTCATC CTGTGGTACA TTGGATCTTA TGTTACGACC CTTGAAATTC GCTTCTTGTA GTAGGAAAAC CCTCACAGCC AGCAGCGCAA ACGGCCAGAT TTCTGTTTCT GAAATAAGAC TTGCAGATTT CATAGGAAAC GGTATCATAA GATAAGGATA TCATGTTGAC AGCTTATGCA ACAGATATCA ATGTTATTAT CTGATTTTAT ATTGGTGTGT AGGAAACCAC AGAAGGAACA AGAGAAGATT CCATCCTTGA GGGTTATAAA TGCTCCAATG AGCAATGTGA TGGTGCCGTA ATACAAGAGA GAGGTAAAAT ATCCAGCTAA GTTTGTAGAT ACACTCATAG GGATAACTGG GTGGGGGGGG AGGACAAGGT TGTTGACACC AACTGTCTTC ATTAGCTTCG TTGTGTGCGT CATGTATACT ACAGACTCTA CTTATTTCAC TTACTGCGCA GATCAACCTT ATTTTCGGTG TATACTCTGT GGTTTAACTC ACGATGGAGA CAAGTTCAAG CGCTTGGAAT CAATGGCAAC TAAGCTGACG GAGGAAGCCA ACGCAGCAGT TAAATCCGGC AGTATCCAGA AGACACGTTA CAAGCTTTTG TCACGGCTCA GACGAACTTG TCCACAATTT 
TGAGACACCT TAAGTCATCT TTTAGATCTT GCATCGCCAG GCAGCCGACT ACATAAGCTG CAGCATTCCG ACTGTGATTT GTTTTTGGTT TTTGACTATA CATCTCCTAG TTGTACGATT TTACTGCAGT TCTGTGCTTT ATGTAATTCT TAACGAGGCC AGACGATGCA CGCGCGCGCT CCATCTTCGA ACAAACAGAA GCTATCCAGA CAACTTTATA CCACAGATAC TCAGTACATC TTATGCGAAC AAGGGATGGT TTGCTGAAGG TACTGTTTCA TCGAGTGTCG ATATGTCGAG GCCTATCGCA GGGCTTTGCT TCTTTAAGGT TGACCTGAGA AAGTTTACCA TGTTTACATT TACACCTTGG AGCTGTCTGT TGACTGTGTA GAGTTGTTAC ATAACATCTA GACGCAAGCT GTATCCTTAT CCTGTTCTAA AAATCTCGAG TTTGTAATAC TGTCATATCA AATTTGCAGG TATGTATGTC TCTTAAGGAT TGGAATGCGG CGCTGAAGTA TTGTCATTTG ACACTTCCTG CTTATGAAAG TAAGCTTCTG ATCATAACGT ACAAGCTGCC AACTTGGCAT CTTGCTTATT TCTTTGTATC AACTGGAGAT GTCATCTTAT TTGTTACTAC ATTGAATCAT TTGGATTGAA TGTAAATTCT TGATCGACTG AAAGAAATTG CATCTTATTT TCACCTTGCG GATCCAGGAT CATATTCAAT GAAATCGCCA CTTGTTGGGC TTCAGTACTA TACTCTCGGC AAATTACAAT GGTAATACTG TGACTCCTTA ACTTCCTAGA TTTCCGATTT CTTCCGGAGT TGCATCCTAA TAAGTGACTT GATATACAGT AATTATTTTT GTCATTGGGT GGAAGCGAAT TGTACACTGA CTGTATCTCG CGTGGCAGGT TCCTCGGCGA TTCCTTGGAA GAAATTAGTG GTGTTGAAGT TTTGAATCGT GCTAAAGAGA TACTGTCAAT AACGCACGGT TCTTCATCGA AACTCGTTCA GGAACTCTCT AGCATGCTTC TAGAGGTAAA CATGGAGGCG GCATACAGAG TTCAAAAAGG ATTAATGAAA TGACGGTGTG CAAATGTGAA CAGCTGAAGT GGCTTCCAAT AATAAAATTT GAAATCGTAA ATACTATTAT GGGGTGACAG TTATGTGCCT CGGGGAGGAA GTCACGCGTG GTCATTTGTT ATCTGTCTAC ATGTACCTCT TTTATAATGA AACTGACCTC CAGTTTACTC AACGTCGTCG TTGACGGTTG GTTGTCCATA ATATCAGTGC TCGATGCGTG GACACCTGGT TGCCTTACGT ACTTCTCAA |
mRNA UTR + CDS |
>Mp4g21010.1 GCGGCGCTCG GAAGCGGATT TTCGAAAGCA GCGCTCAAGA ATTCATGTGC TCGAAGGACG AGACCTCGAG GTTGCAAAAT ATCGTCCAAT TTTTTTCTAC TGATCCAATC TACGTTCGAG GATACGGAAG CGATCACGGA TTGTTCGCCA GCTTGCGGGA GTTCATTCAG AGCTGAGGAT TGTTCGGTCC GGCGAGGTCG ACCGATTTGC GTTTGGGCCA CGACCACTTT GTTGCAAGGC GATAAGGAGT CTGTGATTAG ATTGCTTCCT TTTTGGAGCG TGACAGGAGT CTGCGAGATT GCAAATCTGG ATTGAGCATT CGATTGCTTG TAAATTTTTA GATGGAGCAG TATTTAGTCG ACCACGGCTT GAAAGTCACT GTTGTCGACA CCAAAGGCCG TTGCCTCGTT GCTGATCGCG ATTTTAGCCC TGGTGAGATA GTTTTGGACC AAGAGCCCTA CTCATCGGTG CTAGATGCCG AGTCAAAGAG CCTACGATGC GATGCGTGCT TTAGATGCTC AGAAAATTTG CAGAGATGCT CTGCCTGCAA GAGTGTTAGC TATTGCTGCA CCAGCTGCCA GAGGAAGGAG TGGAAACTGC ACAAGAGCGA ATGTCAGATG ATGGTGAAGC TCAGTCAAGC GAAACAAAAG TTGCCACCTC CGTCATTACG GCTGATAGTT AGGCTTGTCA TTAAGAGAAG ATTGCAAGCT AGCAATGTCT TCCCCCGAAC AAACGTCGAT AATTTTGAGA TTGTGGAGGC TTTACCTACT CATTTTTCTG AGACAGGAGA GGAGCGTTTG GTTCTATACG CCCAAATGTC GAATCTGGTG AAAGCAATTG TTACTCCCCT TGAGGTGGAT CTCAAAGAAA CTTGTGAATT GTTCTGCAGG ATTGCGTGTA ATGCACATAC CATTTGTGAT GACGAGTTCA GACCAGTTGG CACTGGTTTG TATCCCGTTG TATCGATCAT CAACCACAGC TGTGCCCCAA ACTCAGTTCT TTTGTTTGAT GGGAGGAGAG CCATCGTGCG AGCCATACAG AAAATAGACA GAGGAACTGA GGTGACATTA AGTTATGTGG AGCTTGCCGG CAGCACCAAA ACTCGTCAGA AGGCTCTCAA AGAACAATAT TATTTCTCTT GTCACTGCAC ACGCTGCATG AATGCGGAAA CCACAGAAGG AACAAGAGAA GATTCCATCC TTGAGGGTTA TAAATGCTCC AATGAGCAAT GTGATGGTGC CGTAATACAA GAGAGAGATC AACCTTATTT TCGGTGTATA CTCTGTGGTT TAACTCACGA TGGAGACAAG TTCAAGCGCT TGGAATCAAT GGCAACTAAG CTGACGGAGG AAGCCAACGC AGCAGTTAAA TCCGGCAACG ATGCACGCGC GCGCTCCATC TTCGAACAAA CAGAAGCTAT CCAGACAACT TTATACCACA GATACTCAGT ACATCTTATG CGAACAAGGG ATGGTTTGCT GAAGGTATGT ATGTCTCTTA AGGATTGGAA TGCGGCGCTG AAGTATTGTC ATTTGACACT TCCTGCTTAT GAAAGATCAT ATTCAATGAA ATCGCCACTT GTTGGGCTTC AGTACTATAC TCTCGGCAAA TTACAATGGT TCCTCGGCGA TTCCTTGGAA GAAATTAGTG GTGTTGAAGT TTTGAATCGT GCTAAAGAGA TACTGTCAAT AACGCACGGT TCTTCATCGA AACTCGTTCA GGAACTCTCT AGCATGCTTC TAGAGGTAAA CATGGAGGCG GCATACAGAG TTCAAAAAGG ATTAATGAAA TGACGGTGTG 
CAAATGTGAA CAGCTGAAGT GGCTTCCAAT AATAAAATTT GAAATCGTAA ATACTATTAT GGGGTGACAG TTATGTGCCT CGGGGAGGAA GTCACGCGTG GTCATTTGTT ATCTGTCTAC ATGTACCTCT TTTATAATGA AACTGACCTC CAGTTTACTC AACGTCGTCG TTGACGGTTG GTTGTCCATA ATATCAGTGC TCGATGCGTG GACACCTGGT TGCCTTACGT ACTTCTCAA |
CDS |
>Mp4g21010.1 ATGGAGCAGT ATTTAGTCGA CCACGGCTTG AAAGTCACTG TTGTCGACAC CAAAGGCCGT TGCCTCGTTG CTGATCGCGA TTTTAGCCCT GGTGAGATAG TTTTGGACCA AGAGCCCTAC TCATCGGTGC TAGATGCCGA GTCAAAGAGC CTACGATGCG ATGCGTGCTT TAGATGCTCA GAAAATTTGC AGAGATGCTC TGCCTGCAAG AGTGTTAGCT ATTGCTGCAC CAGCTGCCAG AGGAAGGAGT GGAAACTGCA CAAGAGCGAA TGTCAGATGA TGGTGAAGCT CAGTCAAGCG AAACAAAAGT TGCCACCTCC GTCATTACGG CTGATAGTTA GGCTTGTCAT TAAGAGAAGA TTGCAAGCTA GCAATGTCTT CCCCCGAACA AACGTCGATA ATTTTGAGAT TGTGGAGGCT TTACCTACTC ATTTTTCTGA GACAGGAGAG GAGCGTTTGG TTCTATACGC CCAAATGTCG AATCTGGTGA AAGCAATTGT TACTCCCCTT GAGGTGGATC TCAAAGAAAC TTGTGAATTG TTCTGCAGGA TTGCGTGTAA TGCACATACC ATTTGTGATG ACGAGTTCAG ACCAGTTGGC ACTGGTTTGT ATCCCGTTGT ATCGATCATC AACCACAGCT GTGCCCCAAA CTCAGTTCTT TTGTTTGATG GGAGGAGAGC CATCGTGCGA GCCATACAGA AAATAGACAG AGGAACTGAG GTGACATTAA GTTATGTGGA GCTTGCCGGC AGCACCAAAA CTCGTCAGAA GGCTCTCAAA GAACAATATT ATTTCTCTTG TCACTGCACA CGCTGCATGA ATGCGGAAAC CACAGAAGGA ACAAGAGAAG ATTCCATCCT TGAGGGTTAT AAATGCTCCA ATGAGCAATG TGATGGTGCC GTAATACAAG AGAGAGATCA ACCTTATTTT CGGTGTATAC TCTGTGGTTT AACTCACGAT GGAGACAAGT TCAAGCGCTT GGAATCAATG GCAACTAAGC TGACGGAGGA AGCCAACGCA GCAGTTAAAT CCGGCAACGA TGCACGCGCG CGCTCCATCT TCGAACAAAC AGAAGCTATC CAGACAACTT TATACCACAG ATACTCAGTA CATCTTATGC GAACAAGGGA TGGTTTGCTG AAGGTATGTA TGTCTCTTAA GGATTGGAAT GCGGCGCTGA AGTATTGTCA TTTGACACTT CCTGCTTATG AAAGATCATA TTCAATGAAA TCGCCACTTG TTGGGCTTCA GTACTATACT CTCGGCAAAT TACAATGGTT CCTCGGCGAT TCCTTGGAAG AAATTAGTGG TGTTGAAGTT TTGAATCGTG CTAAAGAGAT ACTGTCAATA ACGCACGGTT CTTCATCGAA ACTCGTTCAG GAACTCTCTA GCATGCTTCT AGAGGTAAAC ATGGAGGCGG CATACAGAGT TCAAAAAGGA TTAATGAAAT GA |
Protein |
>Mp4g21010.1 MEQYLVDHGL KVTVVDTKGR CLVADRDFSP GEIVLDQEPY SSVLDAESKS LRCDACFRCS ENLQRCSACK SVSYCCTSCQ RKEWKLHKSE CQMMVKLSQA KQKLPPPSLR LIVRLVIKRR LQASNVFPRT NVDNFEIVEA LPTHFSETGE ERLVLYAQMS NLVKAIVTPL EVDLKETCEL FCRIACNAHT ICDDEFRPVG TGLYPVVSII NHSCAPNSVL LFDGRRAIVR AIQKIDRGTE VTLSYVELAG STKTRQKALK EQYYFSCHCT RCMNAETTEG TREDSILEGY KCSNEQCDGA VIQERDQPYF RCILCGLTHD GDKFKRLESM ATKLTEEANA AVKSGNDARA RSIFEQTEAI QTTLYHRYSV HLMRTRDGLL KVCMSLKDWN AALKYCHLTL PAYERSYSMK SPLVGLQYYT LGKLQWFLGD SLEEISGVEV LNRAKEILSI THGSSSKLVQ ELSSMLLEVN MEAAYRVQKG LMK |
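The CDS and Protein entries above describe the same transcript, so translating the CDS should reproduce the listed protein. The following is a minimal sketch of that check, assuming the standard nuclear genetic code and the 10-character space-grouped layout used on this page. The helper names `clean` and `translate` are illustrative and not part of any database tooling.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W"   # first base T
         "LLLLPPPPHHQQRRRR"   # first base C
         "IIIMTTTTNNKKSSRR"   # first base A
         "VVVVAAAADDEEGGGG")  # first base G
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq_text):
    """Remove the '>' header token, grouping spaces, and trailing table pipes."""
    tokens = seq_text.replace("|", " ").split()
    if tokens and tokens[0].startswith(">"):
        tokens = tokens[1:]
    return "".join(tokens).upper()


def translate(cds):
    """Translate a CDS codon by codon, stopping at the first stop codon.

    Codons not found in the table (e.g. containing N) are rendered as 'X'.
    """
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the CDS entry, copied in the page's grouped layout.
cds_start = clean(">Mp4g21010.1 ATGGAGCAGT ATTTAGTCGA CCACGGCTTG")
print(translate(cds_start))  # MEQYLVDHGL, the first block of the Protein entry
```

Passing the full CDS line to `clean` and then `translate` gives a string that can be compared against the Protein entry after it has been run through `clean` as well.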