Database | ID | Description |
---|---|---|
PIRSF | PIRSF000451 | PKS_III |
MobiDBLite | mobidb-lite | consensus disorder prediction |
Pfam | PF02797 | Chalcone and stilbene synthases, C-terminal domain |
SUPERFAMILY | SSF53901 | Thiolase-like |
PANTHER | PTHR11877 | HYDROXYMETHYLGLUTARYL-COA SYNTHASE |
Gene3D | G3DSA:3.40.47.10 | - |
FunFam | G3DSA:3.40.47.10:FF:000025 | Chalcone synthase 2 |
FunFam | G3DSA:3.40.47.10:FF:000014 | Chalcone synthase 1 |
Pfam | PF00195 | Chalcone and stilbene synthases, N-terminal domain |
CDD | cd00831 | CHS_like |
MapolyID | Mapoly0020s0082 | - |
GO | GO:0009058 | biosynthetic process |
GO | GO:0030639 | polyketide biosynthetic process |
GO | GO:0016746 | acyltransferase activity |
GO | GO:0016747 | acyltransferase activity, transferring groups other than amino-acyl groups |
Gene symbol | Product | Transcript ID | Status |
---|---|---|---|
MpPKS/CHS19 | Secondary metabolism enzyme | Provisional |
Sequences: |
Gene UTR + CDS + intron |
>Mp4g23190.1 GCTACCACCA CCATACGCAC GTTTTTTTTT ATCCATATCA TCTTCAAGAG AACCTTGATA CGTCTCATGG GAGTATCCAT CGAAGTGTGG AATCGGTGAG GAGATTTGGG TGCTCGAACT GACAGTGTCG ATTCAGAGTT CCTCAATTTC AAAATTCTTT GATTTGGTCT GTGATGCTTA TTAACATTGT GGTACCCACT TCGGCGATGT GGTTCTGAAC ATTATTTTTC TGGTTTCATT CGAAGCACTG CAAAGTGTGG ACCAGTGTTC TGTGGAGGAA AAAAAACTTT CATTCCCCTG TTTTGAGCAT TATTACCGCC TGAGCTGCAC GGTGGACGTG TCTGGGCTGA GGTTCAGGCC GTGTGGAATT TTAAAATAGC TGCAAACGCT GCTATTTAAC TCCACACGTA ATAACAGAAT GGTCTGCAGC TCTTCTGCAA ATAGCCGGAG GATAAGTTAG GAACAGGAAG AATTCAATAT CGTATCAAAG TTTGGATCGA CCACATAAAC CTCAGTTCTG TTATTTTTTC CTCTCAATTC GAATCGTTCT GCTCCTAGTT TACGACTGAA CGTAAAGACT CGATTTCAAT CCTCATGCTG CTGGTAGTTA GCATCTAGTA CATTCCTGGA TACCACATAT CGAGGCTCGA AGACTCTCTA TCATTTGCCT TGGTTTTAAA CCGAAAACTT TGTATGATAT TGTGCCGGAA TATCGGAGCA TTCAAAAATT CATCAGGAGT TCCTGGATCC TTTTCAACCA TTGTAATTAC ATTGGTAATC TGAAGAAGGT CTGCATGGGG AAAGAACAAC CCTCGAGCTC GAGCCTGACG GGTGATGTGA ACGGGAGTCG GGAGAAGATG GCGACCAGAG TTCTGTCTTC CCAAGAGAAC TTCGAGAAAC TGATGGCAGA TCTGGCAAGA CCCAACGGCC ACGTATACAG CCAAAGCCAA AGCCAAAGCG GATCGGGCCA GAACGGTGCG GGGACGAGCA TCGTGGCAAA GAACACTGCG AGCATTTTGG CCATCGGGAA AGCCCTGCCA CCCAACCGGA TCTGTCAGAG CACTTACACG GACTTTTACT TTCGAGTCAC TCACTGCTCC CACAAGACGG AGCTCAAAAA CAGGATGCAA AGAATCTGTA AGTTTGCGTC TCAACTTTTT AGCCTCTACT ACTCCTCGAC GCTAGATTGG CTACCTGACA CATCATTATG CATGCTTTGC AATCACGCAG TGGACCGATG CAGGGCTTGA TTTTGTCCTC GTTTTCAACT CTGTCTATAT ATGCTGATTA CTATCTTCTG CTATCTTCGT TTCAAAACCT TGGCGTGCAT GGACCCATAT TTCTGTTAGA ACTCGTCTTA GTTCATGCGT ATTGAGCGGC ACTCATGACA TCGTTTTCAC CTGAGTGATT GATTGATTTT CTCTCGGGTG GTCGAGCTTC TCACGAGAGT TGGTCAAATA TATCCGCTCG ATTTTTCACA TCCGGTTTCA GATATCTCTT CAGTGCACTC AAGAGACTCG TAGCTTTCTT TTCATAATGG TAGTAATCAG TACTCCCTGG TCTGTAGATT AGGCTCGACT TTTGGTGCTA AAAATGAGAT CAGCTGAGAT AAATGCAAAA GTAAGTAAAG TAAAGTAAGA CATCGATCGA AGTTAGAATT GTTTACAGAG CAGACCATTC ACCGATTACA CCTGAAGGAA TGCAGTTTTG AACCGAAGAG AACTGTGCCA GTACGATATA ATATTACAAA TCTCCTCTAA AAGCTCGAAG ATTCACTCGG TCCTAGAAAG ACTTTCATCC ATCTCCAAAT CAATCCCCTT CAAAACGGCA GTGATATCAT GCTCCAAAAC ACTCGGCGCA GAAAAGAGAA ATTCTGCGAC TTTTTGCATG GGAATGGCAG CTGAGTGATG TGGGCATGGC GTGCAGGCGA CAAGTCGGGG ATCAACACCA GGTACCTGCT GCTGGACGAG GAGGCGCTGA AGGAGCACTC GGAGTTTTAC ACGCCGGGCC AGGCGAGCAT CGAGCAGCGG CACGACCTGC TGGAGGAGGC GGTGCCGAAG CTGGCGGCGC AGGCCGCAGC CAGCGCGCTG GAGGAGTGGG GACGGCCGGC CTGCGACGTC ACGCACCTCA TCGTCGTCAC GCTGAGCGGC GTCGCGATCC CCGGCGCCGA CGTGCGCCTG GTCAAGCTGC TGGGGCTGCG CGAGGACGTG AGCCGAGTGA TGCTGTACAT GCTCGGCTGC TATGCCGGGG TCACAGCTCT CCGGCTGGCC AAGGATCTGG CGGAGAATAA TCCCGGCAGC CGAGTGCTCA TAGCCTGCAG CGAGATGACT GCCACGACGT TCCGAGCTCC GAGCGAGAAA TCCATGTATG ACATCGTCGG AGCTTCGCTG TTCGGCGATG GAGCGGTGGG AGTGATCGTG GGAGCCAAGC CCAGGCCCGG CATCGAGCGA TCGATTTTCG AAATTCACTG GGCCGGCGTC TCGCTGGCTC CTGACACGGA GCATGTGGTT CAAGGCAAGC TCAAGCCCGA CGGATTGTAC TTCTTCTTGG ACAAAAGCTT GCCCGGGCTG GTGGGCAAGC ACATCGCGCC CTTCTGCCGG AGCCTGCTGG ACCATGCCCC CGAGAACCTG AACCTCGGAT TCAACGAGGT CTTCTGGGCC GTGCACCCCG GCGGTCCGGC CATCCTGAAC ACCGTCGAGG AGCAGCTGCT GCTGAATTCC GAGAAGCTGA GGGCCAGCAG GGACGTCCTG GCCAACTATG GCAACGTTTC TGCTTCCTCC GTTCTGTACG TGCTCGATGA GCTCAGACAC CGACCCGGGC AAGAGGAATG GGGCGCTGCC TTGGCTTTCG GACCGGGAAT TACATTCGAG GGAGTTTTGC TTCGCAGAAA TGTGAATCAT CGATAAATAG ATTCAGTGTC GAACAAAGAA CTGCACGACG ATAATAACCC AAACCCATTA CCCATGTACT GCAGCAAAAC ATCTCTCCAA ATTTCCCACC AATCTCTAGC TACACAATAT GAAATTTATC CACGGAGTAT ATATCATCAG CCGCTTTGGT CAGCACCCTT TTTTCGTTAC GCTTTCGTTT AGCCATAATG ATGCAGGCCT CTAGTCACAA GGTTAGCTAG AGGCCAGAGT AAAGATGATG TCATTTAGAT TGGTAACTTA CGTTTCTATC GTGGCTCCTC GATTTCAGGT CCTGCTTAAT ACATGAGTAC TGTGTGGTAC CTCAGAGAGT AGTCAGCAGG TACCCGGAAG TCAGGCTTTT GATAAATAAG CTCCAGTTTC TTGAGCCTGA GAAAGATTAC TGAGTCCAAC TGAGTTAAGT GTTCACGAGA CGTTGCAATC ATTTCTAGAA TAATTCTGGC TTTA |
mRNA UTR + CDS |
>Mp4g23190.1 GCTACCACCA CCATACGCAC GTTTTTTTTT ATCCATATCA TCTTCAAGAG AACCTTGATA CGTCTCATGG GAGTATCCAT CGAAGTGTGG AATCGGTGAG GAGATTTGGG TGCTCGAACT GACAGTGTCG ATTCAGAGTT CCTCAATTTC AAAATTCTTT GATTTGGTCT GTGATGCTTA TTAACATTGT GGTACCCACT TCGGCGATGT GGTTCTGAAC ATTATTTTTC TGGTTTCATT CGAAGCACTG CAAAGTGTGG ACCAGTGTTC TGTGGAGGAA AAAAAACTTT CATTCCCCTG TTTTGAGCAT TATTACCGCC TGAGCTGCAC GGTGGACGTG TCTGGGCTGA GGTTCAGGCC GTGTGGAATT TTAAAATAGC TGCAAACGCT GCTATTTAAC TCCACACGTA ATAACAGAAT GGTCTGCAGC TCTTCTGCAA ATAGCCGGAG GATAAGTTAG GAACAGGAAG AATTCAATAT CGTATCAAAG TTTGGATCGA CCACATAAAC CTCAGTTCTG TTATTTTTTC CTCTCAATTC GAATCGTTCT GCTCCTAGTT TACGACTGAA CGTAAAGACT CGATTTCAAT CCTCATGCTG CTGGTAGTTA GCATCTAGTA CATTCCTGGA TACCACATAT CGAGGCTCGA AGACTCTCTA TCATTTGCCT TGGTTTTAAA CCGAAAACTT TGTATGATAT TGTGCCGGAA TATCGGAGCA TTCAAAAATT CATCAGGAGT TCCTGGATCC TTTTCAACCA TTGTAATTAC ATTGGTAATC TGAAGAAGGT CTGCATGGGG AAAGAACAAC CCTCGAGCTC GAGCCTGACG GGTGATGTGA ACGGGAGTCG GGAGAAGATG GCGACCAGAG TTCTGTCTTC CCAAGAGAAC TTCGAGAAAC TGATGGCAGA TCTGGCAAGA CCCAACGGCC ACGTATACAG CCAAAGCCAA AGCCAAAGCG GATCGGGCCA GAACGGTGCG GGGACGAGCA TCGTGGCAAA GAACACTGCG AGCATTTTGG CCATCGGGAA AGCCCTGCCA CCCAACCGGA TCTGTCAGAG CACTTACACG GACTTTTACT TTCGAGTCAC TCACTGCTCC CACAAGACGG AGCTCAAAAA CAGGATGCAA AGAATCTGCG ACAAGTCGGG GATCAACACC AGGTACCTGC TGCTGGACGA GGAGGCGCTG AAGGAGCACT CGGAGTTTTA CACGCCGGGC CAGGCGAGCA TCGAGCAGCG GCACGACCTG CTGGAGGAGG CGGTGCCGAA GCTGGCGGCG CAGGCCGCAG CCAGCGCGCT GGAGGAGTGG GGACGGCCGG CCTGCGACGT CACGCACCTC ATCGTCGTCA CGCTGAGCGG CGTCGCGATC CCCGGCGCCG ACGTGCGCCT GGTCAAGCTG CTGGGGCTGC GCGAGGACGT GAGCCGAGTG ATGCTGTACA TGCTCGGCTG CTATGCCGGG GTCACAGCTC TCCGGCTGGC CAAGGATCTG GCGGAGAATA ATCCCGGCAG CCGAGTGCTC ATAGCCTGCA GCGAGATGAC TGCCACGACG TTCCGAGCTC CGAGCGAGAA ATCCATGTAT GACATCGTCG GAGCTTCGCT GTTCGGCGAT GGAGCGGTGG GAGTGATCGT GGGAGCCAAG CCCAGGCCCG GCATCGAGCG ATCGATTTTC GAAATTCACT GGGCCGGCGT CTCGCTGGCT CCTGACACGG AGCATGTGGT TCAAGGCAAG CTCAAGCCCG ACGGATTGTA CTTCTTCTTG GACAAAAGCT TGCCCGGGCT GGTGGGCAAG CACATCGCGC CCTTCTGCCG GAGCCTGCTG GACCATGCCC CCGAGAACCT GAACCTCGGA TTCAACGAGG TCTTCTGGGC CGTGCACCCC GGCGGTCCGG CCATCCTGAA CACCGTCGAG GAGCAGCTGC TGCTGAATTC CGAGAAGCTG AGGGCCAGCA GGGACGTCCT GGCCAACTAT GGCAACGTTT CTGCTTCCTC CGTTCTGTAC GTGCTCGATG AGCTCAGACA CCGACCCGGG CAAGAGGAAT GGGGCGCTGC CTTGGCTTTC GGACCGGGAA TTACATTCGA GGGAGTTTTG CTTCGCAGAA ATGTGAATCA TCGATAAATA GATTCAGTGT CGAACAAAGA ACTGCACGAC GATAATAACC CAAACCCATT ACCCATGTAC TGCAGCAAAA CATCTCTCCA AATTTCCCAC CAATCTCTAG CTACACAATA TGAAATTTAT CCACGGAGTA TATATCATCA GCCGCTTTGG TCAGCACCCT TTTTTCGTTA CGCTTTCGTT TAGCCATAAT GATGCAGGCC TCTAGTCACA AGGTTAGCTA GAGGCCAGAG TAAAGATGAT GTCATTTAGA TTGGTAACTT ACGTTTCTAT CGTGGCTCCT CGATTTCAGG TCCTGCTTAA TACATGAGTA CTGTGTGGTA CCTCAGAGAG TAGTCAGCAG GTACCCGGAA GTCAGGCTTT TGATAAATAA GCTCCAGTTT CTTGAGCCTG AGAAAGATTA CTGAGTCCAA CTGAGTTAAG TGTTCACGAG ACGTTGCAAT CATTTCTAGA ATAATTCTGG CTTTA |
CDS |
>Mp4g23190.1 ATGGGGAAAG AACAACCCTC GAGCTCGAGC CTGACGGGTG ATGTGAACGG GAGTCGGGAG AAGATGGCGA CCAGAGTTCT GTCTTCCCAA GAGAACTTCG AGAAACTGAT GGCAGATCTG GCAAGACCCA ACGGCCACGT ATACAGCCAA AGCCAAAGCC AAAGCGGATC GGGCCAGAAC GGTGCGGGGA CGAGCATCGT GGCAAAGAAC ACTGCGAGCA TTTTGGCCAT CGGGAAAGCC CTGCCACCCA ACCGGATCTG TCAGAGCACT TACACGGACT TTTACTTTCG AGTCACTCAC TGCTCCCACA AGACGGAGCT CAAAAACAGG ATGCAAAGAA TCTGCGACAA GTCGGGGATC AACACCAGGT ACCTGCTGCT GGACGAGGAG GCGCTGAAGG AGCACTCGGA GTTTTACACG CCGGGCCAGG CGAGCATCGA GCAGCGGCAC GACCTGCTGG AGGAGGCGGT GCCGAAGCTG GCGGCGCAGG CCGCAGCCAG CGCGCTGGAG GAGTGGGGAC GGCCGGCCTG CGACGTCACG CACCTCATCG TCGTCACGCT GAGCGGCGTC GCGATCCCCG GCGCCGACGT GCGCCTGGTC AAGCTGCTGG GGCTGCGCGA GGACGTGAGC CGAGTGATGC TGTACATGCT CGGCTGCTAT GCCGGGGTCA CAGCTCTCCG GCTGGCCAAG GATCTGGCGG AGAATAATCC CGGCAGCCGA GTGCTCATAG CCTGCAGCGA GATGACTGCC ACGACGTTCC GAGCTCCGAG CGAGAAATCC ATGTATGACA TCGTCGGAGC TTCGCTGTTC GGCGATGGAG CGGTGGGAGT GATCGTGGGA GCCAAGCCCA GGCCCGGCAT CGAGCGATCG ATTTTCGAAA TTCACTGGGC CGGCGTCTCG CTGGCTCCTG ACACGGAGCA TGTGGTTCAA GGCAAGCTCA AGCCCGACGG ATTGTACTTC TTCTTGGACA AAAGCTTGCC CGGGCTGGTG GGCAAGCACA TCGCGCCCTT CTGCCGGAGC CTGCTGGACC ATGCCCCCGA GAACCTGAAC CTCGGATTCA ACGAGGTCTT CTGGGCCGTG CACCCCGGCG GTCCGGCCAT CCTGAACACC GTCGAGGAGC AGCTGCTGCT GAATTCCGAG AAGCTGAGGG CCAGCAGGGA CGTCCTGGCC AACTATGGCA ACGTTTCTGC TTCCTCCGTT CTGTACGTGC TCGATGAGCT CAGACACCGA CCCGGGCAAG AGGAATGGGG CGCTGCCTTG GCTTTCGGAC CGGGAATTAC ATTCGAGGGA GTTTTGCTTC GCAGAAATGT GAATCATCGA TAA |
Protein |
>Mp4g23190.1 MGKEQPSSSS LTGDVNGSRE KMATRVLSSQ ENFEKLMADL ARPNGHVYSQ SQSQSGSGQN GAGTSIVAKN TASILAIGKA LPPNRICQST YTDFYFRVTH CSHKTELKNR MQRICDKSGI NTRYLLLDEE ALKEHSEFYT PGQASIEQRH DLLEEAVPKL AAQAAASALE EWGRPACDVT HLIVVTLSGV AIPGADVRLV KLLGLREDVS RVMLYMLGCY AGVTALRLAK DLAENNPGSR VLIACSEMTA TTFRAPSEKS MYDIVGASLF GDGAVGVIVG AKPRPGIERS IFEIHWAGVS LAPDTEHVVQ GKLKPDGLYF FLDKSLPGLV GKHIAPFCRS LLDHAPENLN LGFNEVFWAV HPGGPAILNT VEEQLLLNSE KLRASRDVLA NYGNVSASSV LYVLDELRHR PGQEEWGAAL AFGPGITFEG VLLRRNVNHR |