Database | ID | Description |
---|---|---|
MobiDBLite | mobidb-lite | consensus disorder prediction |
PANTHER | PTHR15467 | ZINC-FINGERS AND HOMEOBOXES RELATED |
SUPERFAMILY | SSF46689 | Homeodomain-like |
Coils | Coil | Coil |
ProSiteProfiles | PS50071 | 'Homeobox' domain profile. |
Gene3D | G3DSA:1.10.10.60 | - |
SMART | SM00389 | HOX_1 |
MapolyID | Mapoly0029s0145 | - |
GO | GO:0003677 | DNA binding |
GO | GO:0005634 | nucleus |
GO | GO:0000981 | DNA-binding transcription factor activity, RNA polymerase II-specific |
GO | GO:0006357 | regulation of transcription by RNA polymerase II |
Sequences: |
Gene UTR + CDS + intron |
>Mp1g01010.1 GAATCCACGA AGTAAATTTC TGCCTTTTCT TGGAAGAACA AAAAGGCGGC ACTCAAATCT CTCAATAGTG GGAATTCCCG TCGGAAAGCA GGGAGGTAAT GATTCTGGAA GGGCTTCCGC TTGGAAGAAG GGCATTGATC TCATAAACGC GACCACGGAT TGACGCCCTG ACGGTAGCTC TCAAGTCGTC GTAGGTTGGG GGTTTAACGA GGTTTCGCGT GCTGACGAAT GGGACTCATG GCGATGGCGG TGCAAAGTGC GAGCTTCTGC TCTTTCAGCA CCAGACCGCC AGTGGCAGCT TCAGCCTGGA GCGAATGGAG GGGTGTGCCC TTGCGCAGGG AAATCATTGG GTTAGTGAAA TTCGGAATCA GAAAGAAAAC TCTGCAGGTC GTTAGTAGAA GAGGTGGTGG CGGTGGCAGA GGAGGGCCTG CAAAACCTCG GAATGTTAGC CCCCGCGGTC GCTCTGTGCA ACAGCAGCAG CAGGAAGCTC TACCGAAGTC GTTAATTGCT AAGGTTGGCT CGGTTGGGAA GGTCGATTGT GGGTTGGGAA TCGTGTTAAT TCTGGTATCA AACGAAATAT TGTTGTGAGG CTTTCACAGC GTTCCGTTCA CGTGGATATA TTTTGGCTGG AAGGTCGAGT CTGATATGCT TTGTTTTTGG AGGAAATGGA TTGGTGGAGA AATTGTTGTC GTTGTGGTGG AAGAGGTTAC ATAAAACTTC CTCCACGAAG AAAGACGTTG CTCATTCAAT AAGTATTTTA TCTCATTAGG GTCTGCATTA TACGTCTTTT GTTTTGCTGC AGAAAACTAG ACAAGAAGAA GAAGACGAAG AGGATGCTCT AGCTGAAGCT GCGCTAGAAG CGCTGTTTGC GCAATTGGAG AAGGACTTGG AAGATGGCGC AATGTCAGAA GACGATGATG ATGATGAAGA CTTCACTGAA GAGGAGGTTA TTGCTCTTGA GGATGAGCTA GAAGCTGCTC TGCTGGGTCT CGATTATGAG CAGCAGACTG TCATCGACTC GGAGGCAGGC AAAAGTGAAG TAGGTCCATC ACAAAGCAAG CGGTTGGCAA AGGAGGACGC TGAATTAATT TTGGATGATG AAGAGGAGGA CGAGGAAGAG CAGAGGGTTG TGAGCTTGGA AACGTGGCAG CTCAAAAAGC TAGCTGCCGC TGCGGAGATA GGCCGTCGAA ACATAAATGT AAGCTAGTCC AGAACACTTG TCTGTAAGCG CAAAACACTT GTGTGTGAAC ATTAGTTTGA AAGTTATGAT TTGCTGCTGG ATTATGGATT GAATCTCATT TGTTCTAATA CCGTTGCATG GTTCTCGGCT CTGCGATCAG AGCCGAGAAT AGAGCTAATA CTGTACTGTA CATGCATTCA ACTTTGCTAA AATATTAGCA ACCAGTCGAA CCTCCAATAG TCAGAGGGCG CAATGGGTTA AGAAGTTTGT CACTAGTGAA CTGGTGTGGT GTTTCTGGTG CAATATATCA ATCTACCTTG TGTTTGCGCA CAGGTGAAAG CTCTAGCTGC GGAGGTTGGT ATGGATAGGT CCGACGTTCT CGCTTGGTTG AAGAACCCAC CACCGGAACT TCTGATGCTT GGTGCAACAA TGGGGCTCGA AGAAGATTCG GAGGACGCCG ATCAAGTCAA CGAAGAGTCT GAAGATGCTG CTTCAAGGAC TCCTGCAAAG ATTTCTACTT CAACAAACAA GCAAATACCA GAGAGTCAGG GATTCGCTCC CGAAACATGG TACAATCAAA AGAGATTAAA GAAGGAACAC GTAGCGACGT TTGAGCGAGT ATTTCGTCGA ACGAAACGAC CCACGGTATG CCTTACTTCG CTTTGTTGTT CTTGTTGTGT TTTGATATAC TCTTTCCTTC CACATCTTTT CTCCACAGTG AGCTGGATAG AGATAAGAAA TGGCTGGTCT TGTGTGCTCA TCGAGTGAAC TCAACACTAA AACATGATCA AACTAATCTC CTACATCGCT TTGCTTCAAC CTTGATCATA CATTTGATAT TAGATTGTAA ATTAGTGTTT ATGATCTACT TTGACCTTTC GAATTCCCCT TACCACTGTT ATAGCCAGTG GTGGAAACGA TAATATTGTC CAGACTTGAG TTTTGTTCAC TTGTTTAGCT CGACTCTTCT TCTTTACGCT TGACTGATCT TAAGTCGTTT ACACAGCCGA GAAGTTGGTT TGTTCTGAAG TGTCTCAAGG TTAACGGTTC CGAGCTGATC TTTGTTTCCT TGTGTTCATT ACGCAGAATG CCATGATTCA AAATTTGGTC GAGCTAACGC ATGTCCCTAG GAAGCGGATT GTTGAGTGGT TTGACCATAA AAGGCAGGAA TTAGATCCCT CCCAACGATT GGAACCGCTG AGATGACGAG ATAAGAAGGT ATGGAAAATC TACTTATAGA TCCCGGAATG AAGAAAGTGA GCCGGAGCTT GATCTCCAAT AATATCTCGC GTCTAAGAAT TACCGTGAAT GGATGCACGG AAATTATGAA CGACGTACTT GGAAACCATA CCATCTCCGC TTTCCGAAGT GCCTCTAGAA AGTGGTTGAT AAGAACCGGG ATGAAGAAGA GGTTAATTGT TTCTCTTGAA AACTAGGTCA GCCCTACCCC AACAAATTGT TGGGGTGCTT ACATTTGCCG CAGAGGATTT CAAGGACTGC TAAAGCTTTA GAATAAGAAG AGTTATAAGT ATGCAATTCT TCAGTTTTGT AGTTTATAAT GTAGATTATT CCTCTTAGAT TCTGGATTCC ACCTAGTCTT GGAACTGTAA ATTATGGAGA AGACGAGGAA GTAAAATGAT GCAAATGAAC AGATCGATGG AACTGCACAC GATCTCATGA AGATTTTCTT AGTGATAGTG GATCATTAGG CACGTAGTAG GTAATTCTGC GTCCATGCTC ACCCCTTCTA TTTCTGGGTG CCATCAAGCT GGCCCGTTTC TCTTCCATCG TCAATAAGAT GGTTAAAAGA TTATATTG |
mRNA UTR + CDS |
>Mp1g01010.1 GAATCCACGA AGTAAATTTC TGCCTTTTCT TGGAAGAACA AAAAGGCGGC ACTCAAATCT CTCAATAGTG GGAATTCCCG TCGGAAAGCA GGGAGGTAAT GATTCTGGAA GGGCTTCCGC TTGGAAGAAG GGCATTGATC TCATAAACGC GACCACGGAT TGACGCCCTG ACGGTAGCTC TCAAGTCGTC GTAGGTTGGG GGTTTAACGA GGTTTCGCGT GCTGACGAAT GGGACTCATG GCGATGGCGG TGCAAAGTGC GAGCTTCTGC TCTTTCAGCA CCAGACCGCC AGTGGCAGCT TCAGCCTGGA GCGAATGGAG GGGTGTGCCC TTGCGCAGGG AAATCATTGG GTTAGTGAAA TTCGGAATCA GAAAGAAAAC TCTGCAGGTC GTTAGTAGAA GAGGTGGTGG CGGTGGCAGA GGAGGGCCTG CAAAACCTCG GAATGTTAGC CCCCGCGGTC GCTCTGTGCA ACAGCAGCAG CAGGAAGCTC TACCGAAGTC GTTAATTGCT AAGAAAACTA GACAAGAAGA AGAAGACGAA GAGGATGCTC TAGCTGAAGC TGCGCTAGAA GCGCTGTTTG CGCAATTGGA GAAGGACTTG GAAGATGGCG CAATGTCAGA AGACGATGAT GATGATGAAG ACTTCACTGA AGAGGAGGTT ATTGCTCTTG AGGATGAGCT AGAAGCTGCT CTGCTGGGTC TCGATTATGA GCAGCAGACT GTCATCGACT CGGAGGCAGG CAAAAGTGAA GTAGGTCCAT CACAAAGCAA GCGGTTGGCA AAGGAGGACG CTGAATTAAT TTTGGATGAT GAAGAGGAGG ACGAGGAAGA GCAGAGGGTT GTGAGCTTGG AAACGTGGCA GCTCAAAAAG CTAGCTGCCG CTGCGGAGAT AGGCCGTCGA AACATAAATG TGAAAGCTCT AGCTGCGGAG GTTGGTATGG ATAGGTCCGA CGTTCTCGCT TGGTTGAAGA ACCCACCACC GGAACTTCTG ATGCTTGGTG CAACAATGGG GCTCGAAGAA GATTCGGAGG ACGCCGATCA AGTCAACGAA GAGTCTGAAG ATGCTGCTTC AAGGACTCCT GCAAAGATTT CTACTTCAAC AAACAAGCAA ATACCAGAGA GTCAGGGATT CGCTCCCGAA ACATGGTACA ATCAAAAGAG ATTAAAGAAG GAACACGTAG CGACGTTTGA GCGAGTATTT CGTCGAACGA AACGACCCAC GAATGCCATG ATTCAAAATT TGGTCGAGCT AACGCATGTC CCTAGGAAGC GGATTGTTGA GTGGTTTGAC CATAAAAGGC AGGAATTAGA TCCCTCCCAA CGATTGGAAC CGCTGAGATG ACGAGATAAG AAGGTATGGA AAATCTACTT ATAGATCCCG GAATGAAGAA AGTGAGCCGG AGCTTGATCT CCAATAATAT CTCGCGTCTA AGAATTACCG TGAATGGATG CACGGAAATT ATGAACGACG TACTTGGAAA CCATACCATC TCCGCTTTCC GAAGTGCCTC TAGAAAGTGG TTGATAAGAA CCGGGATGAA GAAGAGGTTA ATTGTTTCTC TTGAAAACTA GGTCAGCCCT ACCCCAACAA ATTGTTGGGG TGCTTACATT TGCCGCAGAG GATTTCAAGG ACTGCTAAAG CTTTAGAATA AGAAGAGTTA TAAGTATGCA ATTCTTCAGT TTTGTAGTTT ATAATGTAGA TTATTCCTCT TAGATTCTGG ATTCCACCTA GTCTTGGAAC TGTAAATTAT GGAGAAGACG AGGAAGTAAA ATGATGCAAA TGAACAGATC GATGGAACTG CACACGATCT CATGAAGATT TTCTTAGTGA TAGTGGATCA TTAGGCACGT AGTAGGTAAT TCTGCGTCCA TGCTCACCCC TTCTATTTCT GGGTGCCATC AAGCTGGCCC GTTTCTCTTC CATCGTCAAT AAGATGGTTA AAAGATTATA TTG |
CDS |
>Mp1g01010.1 ATGGGACTCA TGGCGATGGC GGTGCAAAGT GCGAGCTTCT GCTCTTTCAG CACCAGACCG CCAGTGGCAG CTTCAGCCTG GAGCGAATGG AGGGGTGTGC CCTTGCGCAG GGAAATCATT GGGTTAGTGA AATTCGGAAT CAGAAAGAAA ACTCTGCAGG TCGTTAGTAG AAGAGGTGGT GGCGGTGGCA GAGGAGGGCC TGCAAAACCT CGGAATGTTA GCCCCCGCGG TCGCTCTGTG CAACAGCAGC AGCAGGAAGC TCTACCGAAG TCGTTAATTG CTAAGAAAAC TAGACAAGAA GAAGAAGACG AAGAGGATGC TCTAGCTGAA GCTGCGCTAG AAGCGCTGTT TGCGCAATTG GAGAAGGACT TGGAAGATGG CGCAATGTCA GAAGACGATG ATGATGATGA AGACTTCACT GAAGAGGAGG TTATTGCTCT TGAGGATGAG CTAGAAGCTG CTCTGCTGGG TCTCGATTAT GAGCAGCAGA CTGTCATCGA CTCGGAGGCA GGCAAAAGTG AAGTAGGTCC ATCACAAAGC AAGCGGTTGG CAAAGGAGGA CGCTGAATTA ATTTTGGATG ATGAAGAGGA GGACGAGGAA GAGCAGAGGG TTGTGAGCTT GGAAACGTGG CAGCTCAAAA AGCTAGCTGC CGCTGCGGAG ATAGGCCGTC GAAACATAAA TGTGAAAGCT CTAGCTGCGG AGGTTGGTAT GGATAGGTCC GACGTTCTCG CTTGGTTGAA GAACCCACCA CCGGAACTTC TGATGCTTGG TGCAACAATG GGGCTCGAAG AAGATTCGGA GGACGCCGAT CAAGTCAACG AAGAGTCTGA AGATGCTGCT TCAAGGACTC CTGCAAAGAT TTCTACTTCA ACAAACAAGC AAATACCAGA GAGTCAGGGA TTCGCTCCCG AAACATGGTA CAATCAAAAG AGATTAAAGA AGGAACACGT AGCGACGTTT GAGCGAGTAT TTCGTCGAAC GAAACGACCC ACGAATGCCA TGATTCAAAA TTTGGTCGAG CTAACGCATG TCCCTAGGAA GCGGATTGTT GAGTGGTTTG ACCATAAAAG GCAGGAATTA GATCCCTCCC AACGATTGGA ACCGCTGAGA TGA |
Protein |
>Mp1g01010.1 MGLMAMAVQS ASFCSFSTRP PVAASAWSEW RGVPLRREII GLVKFGIRKK TLQVVSRRGG GGGRGGPAKP RNVSPRGRSV QQQQQEALPK SLIAKKTRQE EEDEEDALAE AALEALFAQL EKDLEDGAMS EDDDDDEDFT EEEVIALEDE LEAALLGLDY EQQTVIDSEA GKSEVGPSQS KRLAKEDAEL ILDDEEEDEE EQRVVSLETW QLKKLAAAAE IGRRNINVKA LAAEVGMDRS DVLAWLKNPP PELLMLGATM GLEEDSEDAD QVNEESEDAA SRTPAKISTS TNKQIPESQG FAPETWYNQK RLKKEHVATF ERVFRRTKRP TNAMIQNLVE LTHVPRKRIV EWFDHKRQEL DPSQRLEPLR |