Database | ID | Description |
---|---|---|
SUPERFAMILY | SSF53901 | Thiolase-like |
FunFam | G3DSA:3.40.47.10:FF:000025 | Chalcone synthase 2 |
Gene3D | G3DSA:3.40.47.10 | - |
Pfam | PF02797 | Chalcone and stilbene synthases, C-terminal domain |
Pfam | PF00195 | Chalcone and stilbene synthases, N-terminal domain |
PANTHER | PTHR11877 | HYDROXYMETHYLGLUTARYL-COA SYNTHASE |
CDD | cd00831 | CHS_like |
PIRSF | PIRSF000451 | PKS_III |
FunFam | G3DSA:3.40.47.10:FF:000014 | Chalcone synthase 1 |
MapolyID | Mapoly0014s0122 | - |
GO | GO:0009058 | biosynthetic process |
GO | GO:0030639 | polyketide biosynthetic process |
GO | GO:0016746 | acyltransferase activity |
GO | GO:0016747 | acyltransferase activity, transferring groups other than amino-acyl groups |

Gene symbol | Product | Transcript ID | Status |
---|---|---|---|
MpPKS/CHS1 | Secondary metabolism enzyme | - | Provisional |
Sequences: |
Gene UTR + CDS + intron |
>Mp1g11030.1 TCGCACCTTC CTAGGCAATT CTACTCCAAT AGCAAGAATT TGGATCTATC CTCAATGGCG GCCCGAGCAG TGGCCCTTGC ACGCTTCGAC ACCGATGCAG CTGCAGGAAA CTGCGGGAGT CTGATCTCTG GTGCTGGGAC GACGGAGAGG TGTGCACGTC CAGGAAAAGC TACGGTTCTG GCCATCGGAC GAGCTGTACC GGGTGTGGTT GTCAAGCAAG AGGGCATGGC AGAAAGATAC CTGCGTGATG TCCACCGCGA TGATGATCCC ATTCTTCTTG CGAAACTTCA ACGACTCTGT GAGCCAAAGT TCTAAGACTC GAAACATATA TATTTATTCC AGCCTGTCAA ACTGGGGATT ACGTGAATTC AATGCAATAG ATGAACCTGA TTGGGTCGGT AGATGTAGAG TGTAGGCGAA TGCACGAGCT GAATGTCGAG GTGTACGTTT GTCCTCTCTG TGGAAAGCCT TGAACGTCTT GTTTCTTGTA TACTGTTTCG TCATGGTGAT TCAACATGAT ATGTTCACAG AACTTGACGC AATGTGCGAA AAAGCCAACT GATGTCTTTT GAATTTGAAC ATTTGAATGT GCAGGCACGA ACACAACGGT GAAGACGAGG TACACGGTGC TAACGGAGAA AATGCTGAAA GAGAATCCGG GATTTTTGAT CGAGGGGGCG AGTACCGTGA AGCAGAGGCT GGAGATATCA GCCGATGCTG TGACGAAGCT CGGGGTGGAG GCGGCGAAGA AAGCGATGGA GGAATGGGGC AGGCCTGCGG GGGACATCAC GCACCTGGTC TACGTGTCGT CGAGCGAGGT CCGACTGCCG GGGGGAGACC TGCACATAGC GATCAATTTG GGACTGCGGA ATGATGTCAA CCGAGTGATG TTGTACATGC TCGGCTGCTG CGGGGGAGCG GCGGGCATCC GAGTCGCCAA AGACCTGGCG GAAAATAATC CCGGCAGTCG TGTTCTGCTC ACCACCAGCG AGACGTGCTT GATCGGCTAC CGCGCTCCTC ATCCGGACCG CCCCTACGAT CTCGTCGGAG CTGCGCTTTT CGGAGACGGG GCTGCTGCAA TGATCATCGG CGCGGATCCC ATTCCTGTTG TCGAGACGCC GTTCTTCGAG TTTCATTGGG CGGGCCAGAG CTTCATTCCC CAGACGGAGC GCACCATCGA GGGCACTCTG ACCGAAGAGG GCATAATCTT CACCCTCGGC AGAGAGCTGC CGAAACTAAT CGAGCACAAC ATCGACGAAT TTGCCCGAGG GCTGCTGAAG AAGATCGGAC TGGACCTCAA ATACGAGGAT CTCTTCTGGG CTGTGCACCC CGGCGGCCCT GCAATTTTGA ATGCTGTCGA GAAGCAGCTG AATCTCCCCC GAGACAAACT ACTTTGCAGC CGGCAGGTTC TCTCGGACTA CGGCAACATC AACAGCAACA CCATCATCTA CGTTCTCGAC TACATGCGCA AGGCCAGTCT GCAGAAGCGC AAAGATCTGC AATGCGGCCT GTCCACGGAT GAAGATCCCG AGTGGGGTTT CATGCTGGCC TTCGGCCCCG GAGTGACCAT CGAAGGCATG GTGGCTCGGA ACCTCGTGTA GTTCCCTCGA TGTCAACCAG AGGGTCGGTC GCGGGTCCGC GGGCCAAAAC ATTCTGCAAT ACGCATTTAA CAAATAACTC TGTTCCAAGA TCGAACACTT GCGACATCGA TACCGTTTTG AAGTTGGTAG GTAGGTGGTG CTCTGTAGAG GATGTATATG ATAATGACAT ATTATGCTAA TGAAGATGGA TTCGGTTTTT 
GTAATGGTAG ACCGGGGGAT CACCTTGGAA CTTTCTGCTG CCGATGGCTG TCATCCTTGT TTCCAGTGCC TCTCGATCGT CTCAACGACG GGTCATTCGC AGTCTCACGA TAATGAACTT CCA |
mRNA UTR + CDS |
>Mp1g11030.1 TCGCACCTTC CTAGGCAATT CTACTCCAAT AGCAAGAATT TGGATCTATC CTCAATGGCG GCCCGAGCAG TGGCCCTTGC ACGCTTCGAC ACCGATGCAG CTGCAGGAAA CTGCGGGAGT CTGATCTCTG GTGCTGGGAC GACGGAGAGG TGTGCACGTC CAGGAAAAGC TACGGTTCTG GCCATCGGAC GAGCTGTACC GGGTGTGGTT GTCAAGCAAG AGGGCATGGC AGAAAGATAC CTGCGTGATG TCCACCGCGA TGATGATCCC ATTCTTCTTG CGAAACTTCA ACGACTCTGC ACGAACACAA CGGTGAAGAC GAGGTACACG GTGCTAACGG AGAAAATGCT GAAAGAGAAT CCGGGATTTT TGATCGAGGG GGCGAGTACC GTGAAGCAGA GGCTGGAGAT ATCAGCCGAT GCTGTGACGA AGCTCGGGGT GGAGGCGGCG AAGAAAGCGA TGGAGGAATG GGGCAGGCCT GCGGGGGACA TCACGCACCT GGTCTACGTG TCGTCGAGCG AGGTCCGACT GCCGGGGGGA GACCTGCACA TAGCGATCAA TTTGGGACTG CGGAATGATG TCAACCGAGT GATGTTGTAC ATGCTCGGCT GCTGCGGGGG AGCGGCGGGC ATCCGAGTCG CCAAAGACCT GGCGGAAAAT AATCCCGGCA GTCGTGTTCT GCTCACCACC AGCGAGACGT GCTTGATCGG CTACCGCGCT CCTCATCCGG ACCGCCCCTA CGATCTCGTC GGAGCTGCGC TTTTCGGAGA CGGGGCTGCT GCAATGATCA TCGGCGCGGA TCCCATTCCT GTTGTCGAGA CGCCGTTCTT CGAGTTTCAT TGGGCGGGCC AGAGCTTCAT TCCCCAGACG GAGCGCACCA TCGAGGGCAC TCTGACCGAA GAGGGCATAA TCTTCACCCT CGGCAGAGAG CTGCCGAAAC TAATCGAGCA CAACATCGAC GAATTTGCCC GAGGGCTGCT GAAGAAGATC GGACTGGACC TCAAATACGA GGATCTCTTC TGGGCTGTGC ACCCCGGCGG CCCTGCAATT TTGAATGCTG TCGAGAAGCA GCTGAATCTC CCCCGAGACA AACTACTTTG CAGCCGGCAG GTTCTCTCGG ACTACGGCAA CATCAACAGC AACACCATCA TCTACGTTCT CGACTACATG CGCAAGGCCA GTCTGCAGAA GCGCAAAGAT CTGCAATGCG GCCTGTCCAC GGATGAAGAT CCCGAGTGGG GTTTCATGCT GGCCTTCGGC CCCGGAGTGA CCATCGAAGG CATGGTGGCT CGGAACCTCG TGTAGTTCCC TCGATGTCAA CCAGAGGGTC GGTCGCGGGT CCGCGGGCCA AAACATTCTG CAATACGCAT TTAACAAATA ACTCTGTTCC AAGATCGAAC ACTTGCGACA TCGATACCGT TTTGAAGTTG GTAGGTAGGT GGTGCTCTGT AGAGGATGTA TATGATAATG ACATATTATG CTAATGAAGA TGGATTCGGT TTTTGTAATG GTAGACCGGG GGATCACCTT GGAACTTTCT GCTGCCGATG GCTGTCATCC TTGTTTCCAG TGCCTCTCGA TCGTCTCAAC GACGGGTCAT TCGCAGTCTC ACGATAATGA ACTTCCA |
CDS |
>Mp1g11030.1 ATGGCGGCCC GAGCAGTGGC CCTTGCACGC TTCGACACCG ATGCAGCTGC AGGAAACTGC GGGAGTCTGA TCTCTGGTGC TGGGACGACG GAGAGGTGTG CACGTCCAGG AAAAGCTACG GTTCTGGCCA TCGGACGAGC TGTACCGGGT GTGGTTGTCA AGCAAGAGGG CATGGCAGAA AGATACCTGC GTGATGTCCA CCGCGATGAT GATCCCATTC TTCTTGCGAA ACTTCAACGA CTCTGCACGA ACACAACGGT GAAGACGAGG TACACGGTGC TAACGGAGAA AATGCTGAAA GAGAATCCGG GATTTTTGAT CGAGGGGGCG AGTACCGTGA AGCAGAGGCT GGAGATATCA GCCGATGCTG TGACGAAGCT CGGGGTGGAG GCGGCGAAGA AAGCGATGGA GGAATGGGGC AGGCCTGCGG GGGACATCAC GCACCTGGTC TACGTGTCGT CGAGCGAGGT CCGACTGCCG GGGGGAGACC TGCACATAGC GATCAATTTG GGACTGCGGA ATGATGTCAA CCGAGTGATG TTGTACATGC TCGGCTGCTG CGGGGGAGCG GCGGGCATCC GAGTCGCCAA AGACCTGGCG GAAAATAATC CCGGCAGTCG TGTTCTGCTC ACCACCAGCG AGACGTGCTT GATCGGCTAC CGCGCTCCTC ATCCGGACCG CCCCTACGAT CTCGTCGGAG CTGCGCTTTT CGGAGACGGG GCTGCTGCAA TGATCATCGG CGCGGATCCC ATTCCTGTTG TCGAGACGCC GTTCTTCGAG TTTCATTGGG CGGGCCAGAG CTTCATTCCC CAGACGGAGC GCACCATCGA GGGCACTCTG ACCGAAGAGG GCATAATCTT CACCCTCGGC AGAGAGCTGC CGAAACTAAT CGAGCACAAC ATCGACGAAT TTGCCCGAGG GCTGCTGAAG AAGATCGGAC TGGACCTCAA ATACGAGGAT CTCTTCTGGG CTGTGCACCC CGGCGGCCCT GCAATTTTGA ATGCTGTCGA GAAGCAGCTG AATCTCCCCC GAGACAAACT ACTTTGCAGC CGGCAGGTTC TCTCGGACTA CGGCAACATC AACAGCAACA CCATCATCTA CGTTCTCGAC TACATGCGCA AGGCCAGTCT GCAGAAGCGC AAAGATCTGC AATGCGGCCT GTCCACGGAT GAAGATCCCG AGTGGGGTTT CATGCTGGCC TTCGGCCCCG GAGTGACCAT CGAAGGCATG GTGGCTCGGA ACCTCGTGTA G |
Protein |
>Mp1g11030.1 MAARAVALAR FDTDAAAGNC GSLISGAGTT ERCARPGKAT VLAIGRAVPG VVVKQEGMAE RYLRDVHRDD DPILLAKLQR LCTNTTVKTR YTVLTEKMLK ENPGFLIEGA STVKQRLEIS ADAVTKLGVE AAKKAMEEWG RPAGDITHLV YVSSSEVRLP GGDLHIAINL GLRNDVNRVM LYMLGCCGGA AGIRVAKDLA ENNPGSRVLL TTSETCLIGY RAPHPDRPYD LVGAALFGDG AAAMIIGADP IPVVETPFFE FHWAGQSFIP QTERTIEGTL TEEGIIFTLG RELPKLIEHN IDEFARGLLK KIGLDLKYED LFWAVHPGGP AILNAVEKQL NLPRDKLLCS RQVLSDYGNI NSNTIIYVLD YMRKASLQKR KDLQCGLSTD EDPEWGFMLA FGPGVTIEGM VARNLV |
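
The CDS and protein records above can be checked against each other by translating the CDS. The following is a minimal sketch, not part of the original record. It assumes the standard nuclear genetic code, and the helper names `clean_sequence` and `translate` are hypothetical. Only the first 30 and last 27 nucleotides of the CDS are used, copied from the listing above.

```python
from itertools import product

# Standard genetic code (assumed here), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(record: str) -> str:
    """Strip an optional '>header' token, table pipes, and the
    spaces that split the sequence into blocks of ten."""
    tokens = record.replace("|", " ").split()
    if tokens and tokens[0].startswith(">"):
        tokens = tokens[1:]
    return "".join(tokens).upper()


def translate(cds: str) -> str:
    """Translate an in-frame CDS; unknown codons become 'X'."""
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3)
    )


# Excerpts copied from the CDS record above (start and end only).
cds_start = ">Mp1g11030.1 ATGGCGGCCC GAGCAGTGGC CCTTGCACGC"
cds_end = "GGCATG GTGGCTCGGA ACCTCGTGTA G |"

print(translate(clean_sequence(cds_start)))  # MAARAVALAR
print(translate(clean_sequence(cds_end)))    # GMVARNLV*
```

The two outputs match the first ten and last eight residues of the protein record, with `*` marking the TAG stop codon. Passing the full CDS record through the same two functions would extend the check to the whole sequence.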