Database | ID | Description |
---|---|---|
KOG | KOG4282 | Transcription factor GT-2 and related proteins, contains trihelix DNA-binding/SANT domain; C-term missing; [K] |
MobiDBLite | mobidb-lite | consensus disorder prediction |
ProSiteProfiles | PS50090 | Myb-like domain profile. |
SUPERFAMILY | SSF46689 | Homeodomain-like |
PANTHER | PTHR47211 | TRIHELIX TRANSCRIPTION FACTOR ASR3 |
PANTHER | PTHR47211:SF2 | TRIHELIX TRANSCRIPTION FACTOR ASR3 |
Pfam | PF13837 | Myb/SANT-like DNA-binding domain |
Gene3D | G3DSA:1.10.10.60 | - |
CDD | cd12203 | GT1 |
MapolyID | Mapoly0043s0010 | - |
Gene symbol | Product | Transcript ID | Status |
---|---|---|---|
MpTRIHELIX18 | transcription factor, Trihelix | Published |
Sequences: |
Gene UTR + CDS + intron |
>Mp1g06180.1 ACTCAGAGAG AAGGGAAGGG AAGGGAAGGG GAGGGAGGAG CTCAGGGATG GAGGAGAGCG AAGGAGTTGT CGAGGATGGA ATACAGCGGC CGACGACAAC GACCACCACA ACGACGACGA CGACGACGAC AACGTCTCTG GGAGATGGGG AGTGCGATGA GTCGTCGGTA GTGCAGGAGC CGAAGAAGAA GAAGCAGAAG AAAAACGGGA GCTTCCGCGC GAGCACGACG AAGAAGAGGA GGTGCAGCGA TGGCGTCGAG GGTTTCGCAG GTGGATTGGC AGGAGGAGTG GGCTCAATTC TCAAAGAGCA TGACGAGGAC GAGGAGGACG AGGACGATGA TGACGATGAC GCGTCGGATG AGCCCGAGCA GGTGCAATTT TACATCGCTG TTGATGGGGA TGGTGATGGT GATGTTGATG CCGATGCCGA GCACGATGGA TGTTGGAATG GAAGTGAAGG TGCTGCAGGA GGAGGAGGAG GAGCAGGAGC AGCGAAGGAT TTGCGCATCG AAATCCCGCA TTCAGTGCCG ACGTCGGAAG TTGTGGAAAC GACGACGACG ACGACCACCG AGAACGAGGA TGACGAGGAC GACGAGGACG ACGAGGACGG GGAGGGAGAT GTTATTCCGG AATTTGACCC TGACCACAAA GGGCACAGTC TCTTCACCGA AGAATTGCGA GCTGCGGTGG AGGAGGGATC GGCGCTGCAG TTGCCTCCGA TGATGGCTGT TATGGACTCG CATTTTCCAT CGGATGAGGA GCAGCTGCAG CAACAGCTTC TCCTCATGGA CGGCTCGAAA GATGCCGTGC ATGTGAACGG GGGAGCATTG CAAGGGCAGG TGCCCTTGCC CTTGCCGCCA TTGGCGACCG TGACGCCCTC CACGAACACG ACGTACGAAG GTTTGAACGA GCATTTGGAT GTGATTGCGT TCAAAATTTA CTCCGCTTTA CAGCAGGCGA ATTCGCATCC TCCAGCTGTA TCGTCTCAAG CGGGAGAGGG CGATGGTCAT TATCAGCAGC TCGTCGACCA AGAACAAGGA GGGGACAAGA ATCACGCCGC GGGTGGCATC GACGAAGGGG AGCGGACGCC TCGTCATCCT CGATGGACCA GGGAAGAAAC TGCGGTTCTT CTGGCCGGCA AGAGAAAGCG GGACGAGGAG ATTAGATCTC AGCAGTCGGG ATCGAAGTTT GCTCTGAGTG CCACGGAGCG ATGGGATTCC ATTTCGAATC ACTGTAAAGC GCATGGAGTC GACAGGGATG CTCATCAGTG TCGGAAGCGA TGGAGTAATC TGGCCGCCGA TTTCAAGAAA ATCAGGACTT GGCACAAGAT GAGCGGCGTC GAGTCGTACT GGTCTATGCG GGGCGATAGT CGTAGGAAGA ACAAGCTTCC TGGGAGTTTC GATCCCGAGG TGTACGCGGA CATGGAAGCT ACCTGCGATT CTAATCCTCA CAGGAAGAGA AAGCTCGTCC CCATGGCTGG CACCGCGGAA GACGTCGACG ACCACCAACA TGATCCTCTG GCCTCCGTGT TGGAAAGCAG CGGCAGGGCC ATGCAGGCAG CTTTAGCCCG AAACATCCAA GCGCAGATAG AGGCCCACAA TCAGAATAGC GAGCTGGACA GGAATCAGAG GAAGGAGCAA GGGGACAATC TGGTGGGAGT TCTGGGGTTA CTGGCCGATT CTTTGGCCAA AATCGCAGCG AAAATGTGAG GCTGTGTATT CGCATGCAAC TCGCAAGATT CGAACATGCG TGCTGTATGT GTGGCCGCCT CTGCCACCAT GAGCCGCCAA GAAGCTGCAG CACCGAGGAT AGACAGTCAA TTACTCTCGA TGCCTGGAGT TCGAAAAGGA GCTCTTCCTC GTGGAAACAT CCCAAAGGCA TTGTCTGGGA TTGAGTCGGT CAGCGTCTTT GTCAGTTTTC CTTCCGTGCA TCTCATTCGT CAGTGGTGAC CTAGCTGGAC GGCGTTGATC ACGAGGTACT CTCCGTTCGA GCCCAGCTCG TCACAGCTAT CAAGCATGAA AGCTGCAGAG TCCTCGGTCG CTAGAAATGC CACATCGGAC TTTTCATCGA TTATTCACGA AACGACACTT CCGTGTAACC GGACGCTTCC TGATGACGAG CTTGGCATCC AGGTTGGCAT CCAGGTTGAA ATGTAGGCAG CGCGTAGGGA CTGCATGAAT TGAGGTACTT CAGTGAGTCG AAGAGCGTGG AGACTAGTAT ATTCTCTGCC TCATTGCTTC CCACAGTGAT CGTTTTCCGT GTGGGATTCT AGGATCATTC CACAGTACTG AGTTGCAGAC ATTGAATTGG TATATTATTC AGAACAATTC AGTGTTAAGG ATTGAGAAAT GCCATATGAT TTTCGGATGC A |
mRNA UTR + CDS |
>Mp1g06180.1 ACTCAGAGAG AAGGGAAGGG AAGGGAAGGG GAGGGAGGAG CTCAGGGATG GAGGAGAGCG AAGGAGTTGT CGAGGATGGA ATACAGCGGC CGACGACAAC GACCACCACA ACGACGACGA CGACGACGAC AACGTCTCTG GGAGATGGGG AGTGCGATGA GTCGTCGGTA GTGCAGGAGC CGAAGAAGAA GAAGCAGAAG AAAAACGGGA GCTTCCGCGC GAGCACGACG AAGAAGAGGA GGTGCAGCGA TGGCGTCGAG GGTTTCGCAG GTGGATTGGC AGGAGGAGTG GGCTCAATTC TCAAAGAGCA TGACGAGGAC GAGGAGGACG AGGACGATGA TGACGATGAC GCGTCGGATG AGCCCGAGCA GGTGCAATTT TACATCGCTG TTGATGGGGA TGGTGATGGT GATGTTGATG CCGATGCCGA GCACGATGGA TGTTGGAATG GAAGTGAAGG TGCTGCAGGA GGAGGAGGAG GAGCAGGAGC AGCGAAGGAT TTGCGCATCG AAATCCCGCA TTCAGTGCCG ACGTCGGAAG TTGTGGAAAC GACGACGACG ACGACCACCG AGAACGAGGA TGACGAGGAC GACGAGGACG ACGAGGACGG GGAGGGAGAT GTTATTCCGG AATTTGACCC TGACCACAAA GGGCACAGTC TCTTCACCGA AGAATTGCGA GCTGCGGTGG AGGAGGGATC GGCGCTGCAG TTGCCTCCGA TGATGGCTGT TATGGACTCG CATTTTCCAT CGGATGAGGA GCAGCTGCAG CAACAGCTTC TCCTCATGGA CGGCTCGAAA GATGCCGTGC ATGTGAACGG GGGAGCATTG CAAGGGCAGG TGCCCTTGCC CTTGCCGCCA TTGGCGACCG TGACGCCCTC CACGAACACG ACGTACGAAG GTTTGAACGA GCATTTGGAT GTGATTGCGT TCAAAATTTA CTCCGCTTTA CAGCAGGCGA ATTCGCATCC TCCAGCTGTA TCGTCTCAAG CGGGAGAGGG CGATGGTCAT TATCAGCAGC TCGTCGACCA AGAACAAGGA GGGGACAAGA ATCACGCCGC GGGTGGCATC GACGAAGGGG AGCGGACGCC TCGTCATCCT CGATGGACCA GGGAAGAAAC TGCGGTTCTT CTGGCCGGCA AGAGAAAGCG GGACGAGGAG ATTAGATCTC AGCAGTCGGG ATCGAAGTTT GCTCTGAGTG CCACGGAGCG ATGGGATTCC ATTTCGAATC ACTGTAAAGC GCATGGAGTC GACAGGGATG CTCATCAGTG TCGGAAGCGA TGGAGTAATC TGGCCGCCGA TTTCAAGAAA ATCAGGACTT GGCACAAGAT GAGCGGCGTC GAGTCGTACT GGTCTATGCG GGGCGATAGT CGTAGGAAGA ACAAGCTTCC TGGGAGTTTC GATCCCGAGG TGTACGCGGA CATGGAAGCT ACCTGCGATT CTAATCCTCA CAGGAAGAGA AAGCTCGTCC CCATGGCTGG CACCGCGGAA GACGTCGACG ACCACCAACA TGATCCTCTG GCCTCCGTGT TGGAAAGCAG CGGCAGGGCC ATGCAGGCAG CTTTAGCCCG AAACATCCAA GCGCAGATAG AGGCCCACAA TCAGAATAGC GAGCTGGACA GGAATCAGAG GAAGGAGCAA GGGGACAATC TGGTGGGAGT TCTGGGGTTA CTGGCCGATT CTTTGGCCAA AATCGCAGCG AAAATGTGAG GCTGTGTATT CGCATGCAAC TCGCAAGATT CGAACATGCG TGCTGTATGT GTGGCCGCCT CTGCCACCAT GAGCCGCCAA GAAGCTGCAG CACCGAGGAT AGACAGTCAA TTACTCTCGA TGCCTGGAGT TCGAAAAGGA GCTCTTCCTC GTGGAAACAT CCCAAAGGCA TTGTCTGGGA TTGAGTCGGT CAGCGTCTTT GTCAGTTTTC CTTCCGTGCA TCTCATTCGT CAGTGGTGAC CTAGCTGGAC GGCGTTGATC ACGAGGTACT CTCCGTTCGA GCCCAGCTCG TCACAGCTAT CAAGCATGAA AGCTGCAGAG TCCTCGGTCG CTAGAAATGC CACATCGGAC TTTTCATCGA TTATTCACGA AACGACACTT CCGTGTAACC GGACGCTTCC TGATGACGAG CTTGGCATCC AGGTTGGCAT CCAGGTTGAA ATGTAGGCAG CGCGTAGGGA CTGCATGAAT TGAGGTACTT CAGTGAGTCG AAGAGCGTGG AGACTAGTAT ATTCTCTGCC TCATTGCTTC CCACAGTGAT CGTTTTCCGT GTGGGATTCT AGGATCATTC CACAGTACTG AGTTGCAGAC ATTGAATTGG TATATTATTC AGAACAATTC AGTGTTAAGG ATTGAGAAAT GCCATATGAT TTTCGGATGC A |
CDS |
>Mp1g06180.1 ATGGAGGAGA GCGAAGGAGT TGTCGAGGAT GGAATACAGC GGCCGACGAC AACGACCACC ACAACGACGA CGACGACGAC GACAACGTCT CTGGGAGATG GGGAGTGCGA TGAGTCGTCG GTAGTGCAGG AGCCGAAGAA GAAGAAGCAG AAGAAAAACG GGAGCTTCCG CGCGAGCACG ACGAAGAAGA GGAGGTGCAG CGATGGCGTC GAGGGTTTCG CAGGTGGATT GGCAGGAGGA GTGGGCTCAA TTCTCAAAGA GCATGACGAG GACGAGGAGG ACGAGGACGA TGATGACGAT GACGCGTCGG ATGAGCCCGA GCAGGTGCAA TTTTACATCG CTGTTGATGG GGATGGTGAT GGTGATGTTG ATGCCGATGC CGAGCACGAT GGATGTTGGA ATGGAAGTGA AGGTGCTGCA GGAGGAGGAG GAGGAGCAGG AGCAGCGAAG GATTTGCGCA TCGAAATCCC GCATTCAGTG CCGACGTCGG AAGTTGTGGA AACGACGACG ACGACGACCA CCGAGAACGA GGATGACGAG GACGACGAGG ACGACGAGGA CGGGGAGGGA GATGTTATTC CGGAATTTGA CCCTGACCAC AAAGGGCACA GTCTCTTCAC CGAAGAATTG CGAGCTGCGG TGGAGGAGGG ATCGGCGCTG CAGTTGCCTC CGATGATGGC TGTTATGGAC TCGCATTTTC CATCGGATGA GGAGCAGCTG CAGCAACAGC TTCTCCTCAT GGACGGCTCG AAAGATGCCG TGCATGTGAA CGGGGGAGCA TTGCAAGGGC AGGTGCCCTT GCCCTTGCCG CCATTGGCGA CCGTGACGCC CTCCACGAAC ACGACGTACG AAGGTTTGAA CGAGCATTTG GATGTGATTG CGTTCAAAAT TTACTCCGCT TTACAGCAGG CGAATTCGCA TCCTCCAGCT GTATCGTCTC AAGCGGGAGA GGGCGATGGT CATTATCAGC AGCTCGTCGA CCAAGAACAA GGAGGGGACA AGAATCACGC CGCGGGTGGC ATCGACGAAG GGGAGCGGAC GCCTCGTCAT CCTCGATGGA CCAGGGAAGA AACTGCGGTT CTTCTGGCCG GCAAGAGAAA GCGGGACGAG GAGATTAGAT CTCAGCAGTC GGGATCGAAG TTTGCTCTGA GTGCCACGGA GCGATGGGAT TCCATTTCGA ATCACTGTAA AGCGCATGGA GTCGACAGGG ATGCTCATCA GTGTCGGAAG CGATGGAGTA ATCTGGCCGC CGATTTCAAG AAAATCAGGA CTTGGCACAA GATGAGCGGC GTCGAGTCGT ACTGGTCTAT GCGGGGCGAT AGTCGTAGGA AGAACAAGCT TCCTGGGAGT TTCGATCCCG AGGTGTACGC GGACATGGAA GCTACCTGCG ATTCTAATCC TCACAGGAAG AGAAAGCTCG TCCCCATGGC TGGCACCGCG GAAGACGTCG ACGACCACCA ACATGATCCT CTGGCCTCCG TGTTGGAAAG CAGCGGCAGG GCCATGCAGG CAGCTTTAGC CCGAAACATC CAAGCGCAGA TAGAGGCCCA CAATCAGAAT AGCGAGCTGG ACAGGAATCA GAGGAAGGAG CAAGGGGACA ATCTGGTGGG AGTTCTGGGG TTACTGGCCG ATTCTTTGGC CAAAATCGCA GCGAAAATGT GA |
Protein |
>Mp1g06180.1 MEESEGVVED GIQRPTTTTT TTTTTTTTTS LGDGECDESS VVQEPKKKKQ KKNGSFRAST TKKRRCSDGV EGFAGGLAGG VGSILKEHDE DEEDEDDDDD DASDEPEQVQ FYIAVDGDGD GDVDADAEHD GCWNGSEGAA GGGGGAGAAK DLRIEIPHSV PTSEVVETTT TTTTENEDDE DDEDDEDGEG DVIPEFDPDH KGHSLFTEEL RAAVEEGSAL QLPPMMAVMD SHFPSDEEQL QQQLLLMDGS KDAVHVNGGA LQGQVPLPLP PLATVTPSTN TTYEGLNEHL DVIAFKIYSA LQQANSHPPA VSSQAGEGDG HYQQLVDQEQ GGDKNHAAGG IDEGERTPRH PRWTREETAV LLAGKRKRDE EIRSQQSGSK FALSATERWD SISNHCKAHG VDRDAHQCRK RWSNLAADFK KIRTWHKMSG VESYWSMRGD SRRKNKLPGS FDPEVYADME ATCDSNPHRK RKLVPMAGTA EDVDDHQHDP LASVLESSGR AMQAALARNI QAQIEAHNQN SELDRNQRKE QGDNLVGVLG LLADSLAKIA AKM |