Database | ID | Description |
---|---|---|
Pfam | PF08491 | Squalene epoxidase |
Gene3D | G3DSA:3.50.50.60 | - |
FunFam | G3DSA:3.50.50.60:FF:000074 | Squalene monooxygenase 2 |
Pfam | PF13450 | NAD(P)-binding Rossmann-like domain |
PRINTS | PR00420 | Aromatic-ring hydroxylase (flavoprotein monooxygenase) signature |
PANTHER | PTHR10835 | SQUALENE MONOOXYGENASE |
SUPERFAMILY | SSF51905 | FAD/NAD(P)-binding domain |
KEGG | K00511 | squalene monooxygenase [EC:1.14.14.17] |
KOG | KOG1298 | Squalene monooxygenase; [I] |
MapolyID | Mapoly0001s0179 | - |
GO | GO:0050660 | flavin adenine dinucleotide binding |
GO | GO:0004506 | squalene monooxygenase activity |
GO | GO:0016126 | sterol biosynthetic process |
GO | GO:0005783 | endoplasmic reticulum |
GO | GO:0016020 | membrane |
Gene symbol | Product | Transcript ID | Status |
---|---|---|---|
MpSQE | Secondary metabolism enzyme, Terpene biosynthesis enzymes | Provisional |
Sequences: |
Gene UTR + CDS + intron |
>Mp1g18410.1 GTGCCGGCAG TCGGGCCGCG CTCGAGGAAA CGAGTCCGCG CGAGGCAGAA CAAGTTCGCG GAGCGCATCC GAGCGATCCA ATTGTTCGAG CGGCGCAGGA GCGGAAGGAG GGATCGCGAG GGGCGCTGTG ATGGAGCAGG GGCTGGAGTG CCGAGGGTGG TGGCAGAGGT TGGAATTTGT TGCGGCCGTG GCGAGCTTGG CCGTGGCCTC GGTCTTGCTG TGGAGACTGA AATCCGGTGA CACTTCGCGG AGGAAGCTGA GCGCGAAGGT GGTGGATGCG AGGCTCGAGG TGCCCACGGG CGACGAGCCT CGGGTCGATG CCATCATTGT GGGAGCGGGA GTGGCCGGGG CGGCCTTAGC GTACACTCTG GGTAAGGATG GGAGGAGAGT GCTGGTTTTG GAACGGGATT TGAACGAGCC GGATCGGATT GTGGGCGAGC TGCTGCAACC GGGAGGGTAT TTGAAGCTGG TGAGTCTAGG TCTGGCCGAT TGTGTCGATG GGATCGACGC GCAAAAGGTT TTCGGGTATG CGCTCTTCAA GGATGGGCGA TCGGCCAAGG TGGGGTATCC GCTGGAAGGA TTTTCGGATG ATGTTGCCGG GCGCAGTTTT CACAACGGAC GCTTTATTCA GAAATTGCGC GAGAAGGCCG CCTCGCTCGC CACTGTCACG CTGAAGCAGG CAACTGTTCT TGGCTTGATC GAAGAGAATG GAACGGTCAC TGGAGTGCGC TTCAAGGGTC CCGACGGGAG CCAGATTCAG GTCCATGCAC CTCTGACCTT CGTCTGCGAC GGATGTTTCT CGAATTTGAG GCGAAACCTT TGCGATCCTC AGGTCGAGGT TCCATCGTGC TTCGTGGGCT TGGTGCTTGA AGATTGCGTG CTGCCTCACA CGAGTCACGG GCATGTGGTG CTCGCGGATC CATCCCCTAT CTTGTTCTAC CCCATTAGTA GCACGGAGGT ACGGTGTCTA GTTGATTTCC CGGGCCAAAA GGTTCCCACC ATTGCAAATG GCGATATGGC CAAATATCTC CTCACGCACG TCGCACCTCA GTTACCCGCG CAACTCAAGC AGCCTTTCAT CAATGCTGTG GAAAAGGGTA ATATCCGATC TATGCCGAAC AAGAGTATGC CCGCTCACCC GCGACCAACT CCTGGAGCAT TACTTATGGG AGACGCGTTC AACATGCGAC ATCCCCTCAC CGGAGGCGGT ATGACCGTCG CGCTGTCGGA TATCCTAGTG TTGCAGGAAA TGCTGAAGCC TCTGCATGAC TTCCAAGATC CCTCGGCCCT GTGCGATTAC CTGCAAGCGT TTTACACCCG TCGTAAGCCA GTTGCGTCGA CTATCAATAC CCTGGCCGGG GCTCTGTACA GAGTGTTTTG CGCTTCACCG GATGAGGCCA TGAAGGAAAT GAGGCAGGCA TGCTTCGATT ATTTGAGCCT GGGAGGGGTT TTCTCCTCTG GCCCGGTGGC TCTTCTATCG GGACTCAACC CCCGGCCCTT GAGTTTGGTC ACGCACTTCT TCGCGGTCGC CCTCTACGGT GTCGGCCGAC TGATGCTTCC ATTTCCTTCG ATCAAAAGTA TGTGGATTTC GGCGCGGCTA ATTCAGGGAG CGTCGATGAT CATCTTTCCA ATCATCAAAG CCGAGGGAGT TTTGCAAATG TTTTTCCCCA GAATGGTTCC TACTTATCAC AGATCACCTC CCAAAGAATA GTGCTGGCGG TACTTAGTGG GGAGCAAGAA CGTGTAATTC TTTTTTTTTG GTAGTTTCGG CAGTGGTTCA CGTGGAAGAA TACGCATACG AGTGCAGGAA ATTATGTTAG TCATCAGTTC CGCAGGGTGG ACATGTAACT GTAAAGTTGC CTGGTAGAGA GTTATTAATC TGTCAAAAAC TTTAAGCACA TTCTCCTGGA GGCTTTTCCC ACGTTTCAGG GAGCTACGCG CGGTGGAACA AAAGCACCCT TTAGTACTTC CCATCGACTA GTCCACGAGA ACGGCAGATC GGGATTGCCG TGGTTTGACT TGTGGACAAC ATTGCCGTGG TTGGACTCGG GACTACACGA AATTCGGCAG AGATGTCATA GTTCAATGTC GAGGAGGTTT CTCGCCTGTT GATCACGCTT AAGAAGTCAC GGAAGAGGTG ACACATAAGC CTTTACACTG AAATGCTATG ATGTCGTATG ATGTAGGAGC TAACTGTGTG TTCACATCGA TGTTTCTTGG TGATCCGTCT GGATTGGATT TCTGCTCAAG TGTGAAGGCC TTTCCATGTG TAATTTTGTC GATTTGTATC AGTGAAGCGG AAGACTTGTC GAGAAGCAGG TTTTTTTCTG ATACTAAAGC GAAAGCTTTA TGCTCTGTAC ATGTTTCGAT ATTGTGGACT GAAATCAACC TATCCACTTT CCTCTGAATG AATTTGTCGA TTGTCATAAA CACGAGCTCG TCAGAGGATT CTTATTCGTT TTGTTTCCAT CCGC |
mRNA UTR + CDS |
>Mp1g18410.1 GTGCCGGCAG TCGGGCCGCG CTCGAGGAAA CGAGTCCGCG CGAGGCAGAA CAAGTTCGCG GAGCGCATCC GAGCGATCCA ATTGTTCGAG CGGCGCAGGA GCGGAAGGAG GGATCGCGAG GGGCGCTGTG ATGGAGCAGG GGCTGGAGTG CCGAGGGTGG TGGCAGAGGT TGGAATTTGT TGCGGCCGTG GCGAGCTTGG CCGTGGCCTC GGTCTTGCTG TGGAGACTGA AATCCGGTGA CACTTCGCGG AGGAAGCTGA GCGCGAAGGT GGTGGATGCG AGGCTCGAGG TGCCCACGGG CGACGAGCCT CGGGTCGATG CCATCATTGT GGGAGCGGGA GTGGCCGGGG CGGCCTTAGC GTACACTCTG GGTAAGGATG GGAGGAGAGT GCTGGTTTTG GAACGGGATT TGAACGAGCC GGATCGGATT GTGGGCGAGC TGCTGCAACC GGGAGGGTAT TTGAAGCTGG TGAGTCTAGG TCTGGCCGAT TGTGTCGATG GGATCGACGC GCAAAAGGTT TTCGGGTATG CGCTCTTCAA GGATGGGCGA TCGGCCAAGG TGGGGTATCC GCTGGAAGGA TTTTCGGATG ATGTTGCCGG GCGCAGTTTT CACAACGGAC GCTTTATTCA GAAATTGCGC GAGAAGGCCG CCTCGCTCGC CACTGTCACG CTGAAGCAGG CAACTGTTCT TGGCTTGATC GAAGAGAATG GAACGGTCAC TGGAGTGCGC TTCAAGGGTC CCGACGGGAG CCAGATTCAG GTCCATGCAC CTCTGACCTT CGTCTGCGAC GGATGTTTCT CGAATTTGAG GCGAAACCTT TGCGATCCTC AGGTCGAGGT TCCATCGTGC TTCGTGGGCT TGGTGCTTGA AGATTGCGTG CTGCCTCACA CGAGTCACGG GCATGTGGTG CTCGCGGATC CATCCCCTAT CTTGTTCTAC CCCATTAGTA GCACGGAGGT ACGGTGTCTA GTTGATTTCC CGGGCCAAAA GGTTCCCACC ATTGCAAATG GCGATATGGC CAAATATCTC CTCACGCACG TCGCACCTCA GTTACCCGCG CAACTCAAGC AGCCTTTCAT CAATGCTGTG GAAAAGGGTA ATATCCGATC TATGCCGAAC AAGAGTATGC CCGCTCACCC GCGACCAACT CCTGGAGCAT TACTTATGGG AGACGCGTTC AACATGCGAC ATCCCCTCAC CGGAGGCGGT ATGACCGTCG CGCTGTCGGA TATCCTAGTG TTGCAGGAAA TGCTGAAGCC TCTGCATGAC TTCCAAGATC CCTCGGCCCT GTGCGATTAC CTGCAAGCGT TTTACACCCG TCGTAAGCCA GTTGCGTCGA CTATCAATAC CCTGGCCGGG GCTCTGTACA GAGTGTTTTG CGCTTCACCG GATGAGGCCA TGAAGGAAAT GAGGCAGGCA TGCTTCGATT ATTTGAGCCT GGGAGGGGTT TTCTCCTCTG GCCCGGTGGC TCTTCTATCG GGACTCAACC CCCGGCCCTT GAGTTTGGTC ACGCACTTCT TCGCGGTCGC CCTCTACGGT GTCGGCCGAC TGATGCTTCC ATTTCCTTCG ATCAAAAGTA TGTGGATTTC GGCGCGGCTA ATTCAGGGAG CGTCGATGAT CATCTTTCCA ATCATCAAAG CCGAGGGAGT TTTGCAAATG TTTTTCCCCA GAATGGTTCC TACTTATCAC AGATCACCTC CCAAAGAATA GTGCTGGCGG TACTTAGTGG GGAGCAAGAA CGTGTAATTC TTTTTTTTTG GTAGTTTCGG CAGTGGTTCA CGTGGAAGAA TACGCATACG AGTGCAGGAA ATTATGTTAG TCATCAGTTC CGCAGGGTGG ACATGTAACT GTAAAGTTGC CTGGTAGAGA GTTATTAATC TGTCAAAAAC TTTAAGCACA TTCTCCTGGA GGCTTTTCCC ACGTTTCAGG GAGCTACGCG CGGTGGAACA AAAGCACCCT TTAGTACTTC CCATCGACTA GTCCACGAGA ACGGCAGATC GGGATTGCCG TGGTTTGACT TGTGGACAAC ATTGCCGTGG TTGGACTCGG GACTACACGA AATTCGGCAG AGATGTCATA GTTCAATGTC GAGGAGGTTT CTCGCCTGTT GATCACGCTT AAGAAGTCAC GGAAGAGGTG ACACATAAGC CTTTACACTG AAATGCTATG ATGTCGTATG ATGTAGGAGC TAACTGTGTG TTCACATCGA TGTTTCTTGG TGATCCGTCT GGATTGGATT TCTGCTCAAG TGTGAAGGCC TTTCCATGTG TAATTTTGTC GATTTGTATC AGTGAAGCGG AAGACTTGTC GAGAAGCAGG TTTTTTTCTG ATACTAAAGC GAAAGCTTTA TGCTCTGTAC ATGTTTCGAT ATTGTGGACT GAAATCAACC TATCCACTTT CCTCTGAATG AATTTGTCGA TTGTCATAAA CACGAGCTCG TCAGAGGATT CTTATTCGTT TTGTTTCCAT CCGC |
CDS |
>Mp1g18410.1 ATGGAGCAGG GGCTGGAGTG CCGAGGGTGG TGGCAGAGGT TGGAATTTGT TGCGGCCGTG GCGAGCTTGG CCGTGGCCTC GGTCTTGCTG TGGAGACTGA AATCCGGTGA CACTTCGCGG AGGAAGCTGA GCGCGAAGGT GGTGGATGCG AGGCTCGAGG TGCCCACGGG CGACGAGCCT CGGGTCGATG CCATCATTGT GGGAGCGGGA GTGGCCGGGG CGGCCTTAGC GTACACTCTG GGTAAGGATG GGAGGAGAGT GCTGGTTTTG GAACGGGATT TGAACGAGCC GGATCGGATT GTGGGCGAGC TGCTGCAACC GGGAGGGTAT TTGAAGCTGG TGAGTCTAGG TCTGGCCGAT TGTGTCGATG GGATCGACGC GCAAAAGGTT TTCGGGTATG CGCTCTTCAA GGATGGGCGA TCGGCCAAGG TGGGGTATCC GCTGGAAGGA TTTTCGGATG ATGTTGCCGG GCGCAGTTTT CACAACGGAC GCTTTATTCA GAAATTGCGC GAGAAGGCCG CCTCGCTCGC CACTGTCACG CTGAAGCAGG CAACTGTTCT TGGCTTGATC GAAGAGAATG GAACGGTCAC TGGAGTGCGC TTCAAGGGTC CCGACGGGAG CCAGATTCAG GTCCATGCAC CTCTGACCTT CGTCTGCGAC GGATGTTTCT CGAATTTGAG GCGAAACCTT TGCGATCCTC AGGTCGAGGT TCCATCGTGC TTCGTGGGCT TGGTGCTTGA AGATTGCGTG CTGCCTCACA CGAGTCACGG GCATGTGGTG CTCGCGGATC CATCCCCTAT CTTGTTCTAC CCCATTAGTA GCACGGAGGT ACGGTGTCTA GTTGATTTCC CGGGCCAAAA GGTTCCCACC ATTGCAAATG GCGATATGGC CAAATATCTC CTCACGCACG TCGCACCTCA GTTACCCGCG CAACTCAAGC AGCCTTTCAT CAATGCTGTG GAAAAGGGTA ATATCCGATC TATGCCGAAC AAGAGTATGC CCGCTCACCC GCGACCAACT CCTGGAGCAT TACTTATGGG AGACGCGTTC AACATGCGAC ATCCCCTCAC CGGAGGCGGT ATGACCGTCG CGCTGTCGGA TATCCTAGTG TTGCAGGAAA TGCTGAAGCC TCTGCATGAC TTCCAAGATC CCTCGGCCCT GTGCGATTAC CTGCAAGCGT TTTACACCCG TCGTAAGCCA GTTGCGTCGA CTATCAATAC CCTGGCCGGG GCTCTGTACA GAGTGTTTTG CGCTTCACCG GATGAGGCCA TGAAGGAAAT GAGGCAGGCA TGCTTCGATT ATTTGAGCCT GGGAGGGGTT TTCTCCTCTG GCCCGGTGGC TCTTCTATCG GGACTCAACC CCCGGCCCTT GAGTTTGGTC ACGCACTTCT TCGCGGTCGC CCTCTACGGT GTCGGCCGAC TGATGCTTCC ATTTCCTTCG ATCAAAAGTA TGTGGATTTC GGCGCGGCTA ATTCAGGGAG CGTCGATGAT CATCTTTCCA ATCATCAAAG CCGAGGGAGT TTTGCAAATG TTTTTCCCCA GAATGGTTCC TACTTATCAC AGATCACCTC CCAAAGAATA G |
Protein |
>Mp1g18410.1 MEQGLECRGW WQRLEFVAAV ASLAVASVLL WRLKSGDTSR RKLSAKVVDA RLEVPTGDEP RVDAIIVGAG VAGAALAYTL GKDGRRVLVL ERDLNEPDRI VGELLQPGGY LKLVSLGLAD CVDGIDAQKV FGYALFKDGR SAKVGYPLEG FSDDVAGRSF HNGRFIQKLR EKAASLATVT LKQATVLGLI EENGTVTGVR FKGPDGSQIQ VHAPLTFVCD GCFSNLRRNL CDPQVEVPSC FVGLVLEDCV LPHTSHGHVV LADPSPILFY PISSTEVRCL VDFPGQKVPT IANGDMAKYL LTHVAPQLPA QLKQPFINAV EKGNIRSMPN KSMPAHPRPT PGALLMGDAF NMRHPLTGGG MTVALSDILV LQEMLKPLHD FQDPSALCDY LQAFYTRRKP VASTINTLAG ALYRVFCASP DEAMKEMRQA CFDYLSLGGV FSSGPVALLS GLNPRPLSLV THFFAVALYG VGRLMLPFPS IKSMWISARL IQGASMIIFP IIKAEGVLQM FFPRMVPTYH RSPPKE |