Database | ID | Description |
---|---|---|
SMART | SM00389 | HOX_1 |
ProSitePatterns | PS00027 | 'Homeobox' domain signature. |
Pfam | PF00046 | Homeodomain |
SUPERFAMILY | SSF46689 | Homeodomain-like |
ProSiteProfiles | PS50071 | 'Homeobox' domain profile. |
FunFam | G3DSA:1.10.10.60:FF:000577 | Homeobox-leucine zipper protein 18 |
CDD | cd00086 | homeodomain |
Coils | Coil | Coil |
Pfam | PF02183 | Homeobox associated leucine zipper |
Gene3D | G3DSA:1.10.10.60 | - |
MobiDBLite | mobidb-lite | consensus disorder prediction |
Pfam | PF04618 | HD-ZIP protein N terminus |
SMART | SM00340 | halz |
PANTHER | PTHR45714 | HOMEOBOX-LEUCINE ZIPPER PROTEIN HAT14 |
KEGG | K09338 | homeobox-leucine zipper protein |
KOG | KOG0483 | Transcription factor HEX, contains HOX and HALZ domains; [K] |
MapolyID | Mapoly0069s0069 | - |
GO | GO:0043565 | sequence-specific DNA binding |
GO | GO:0005634 | nucleus |
GO | GO:0003677 | DNA binding |
GO | GO:0006355 | regulation of DNA-templated transcription |
GO | GO:0000981 | DNA-binding transcription factor activity, RNA polymerase II-specific |
Sequences: |
Gene UTR + CDS + intron |
>Mp2g24200.1 CAGCAGCTGG TCTGAAATCC CCCACCTCCT TACATTCGAG CAACACAATT GGCATTCTAG TTTCTCTCGT GACCCGTCTC CCACTGCAAG TCGCTCTCAC CTCCCTCTCT CTCTCACTCT CTCTCTTTCT CGGGTCTCCT CTCCTGCTCG CCTATCTCTA TCTCGTCCGC TTTCCAGCTC CAGTCATTCT CTCATCCGCT TGCGTGCTCG CTCTCCCTTC CAATCTCTGT CAATCTTTCT CCCTCTCTCT CTCTCTCACT CTCACTCTCT TCTCGGCCTG CCCCACTGGG CGAGTGCTGT GTTGCGCACA CAAGTCGAAC ATTCCACTGT TAGTTTGTCT GTATGACTGT CCGGCTCTAG CAGGCAAGGC GAAGAAGGTT GGCCTGGCGG GCAGAGGCTG GAATTTGAAG AAGAAACCAG CAGTCGCGTT CCATCGTCTT CTGCTTCTTC TTCTTCTTGC TCCGTGCTCG CTTTTCGAAC GAGTTTCGTA GCGAGTGCCA TTCGCTCGGC GTACAGACCA GCCCAGCTCT CTGTGGCAGG CAGGCAGGCA GGCAGGCAGG CCTTGCTCTC TCCTGCAGCT CTGGAACAGG AAGGCTCTCG AAAGACTGCG AGGTAACGCG CAGCTCCTTC TCGGCTTCGC TGTTCTCGGT TCTCTGCAAT CTCTCATTCA CGATCGGAGC TCGGCGGGTC GAGCATCACG GATTGCTCAG ACGAACCATT TCGGCTGTGC GTGCCGGGAG GAACCCGTCG TCCGTCAATC AGTTGGTCTT TCATGCTGCT GACGTCGAGG ATTTTCTCGT CTCAGAGCGT GGCGACGTGA GCCTGAGAAA TACGGACGGC TTGGCTTGGT TCTGGTGGTG GTGGTGGTGT CGGTGTCGGA GCCGGCTCTG AACACGACGA GGAAGCAAGA CTTGACAATC AGTCTAGATG ATGCTCCTAG CGGAGGGGAA GGATAGATTA CGGGAGGAGG TGCAGCGGCT TAGGGCGATG CAAGGCTCCG GAGCGAGCTT GAGCCTCAGC GTAGACTTGA GCATGATGGT AGCGAGGAGC CAGGACGATT GCTCCACCGT AAGACGGCAG TGCCCCGTGC AACTGGATCT GCTGCCCATG CCCACTTTTC CATCGGCTGC AGCAGCCCCG CACCCTCCCA CGCAGCAGTC GCAGTCGCAG TCCCACAATC CCTTCCCGTG GAAACCTCTC TTCAACAGAT CCCGTTCCAA CTCTGCCAGT GGTATGCATT CTAGCGCCTC TACCTCTCGT GTCTTTTTGT TTTCGCTCCT CTGCTCGTGA AGCACTGTCA TGCGTCTTCC ATTTCATCTG TTGCCTCTGT TGGTTCTCTG TTGATTCTGT ACGTCCTCGC TGCGCAATCG GCGCGCGTGT TGGGTGCAAT TTGCTGTCCC TGACGTGCAA TTGGTGCTCA TTGTGCTCAG GTAATGAAGA CATCGAGGCG GCGGAAACTT ACTCCTCGGG ACACGGAGGA TCACCTCGAG GAATCGACGT CAATCAAATC CCGTCTTCAG CCGATTGCGA CGACGTGGTG GTCTCGTCCC CCAACAGCAT CAATCTGAAA AGGGAGAGGG AGAAAACGCA CGATTTTGAC CTGGAACGCG ACCGCACGTG TGACATTTCC TCGAGAGGCA GCGACGAGGA AGAAGGGGGC ACCACGAGGA AGAAGCTCAG ACTGTCCAAG GAGCAGTCGG CTCTCCTCGA GGAAAGCTTC AAAGAGCACA GCACCCTCAA CCCGGTACTA ATCTCACCCG CCTGCCTCTT CGTCCGTTTC CACCCTTGTG TCCTCTTCTT 
GCTCATGGCC TGCAAAGACT CTCTGGCCTT CTTCGTCCAT TCTAACTCCA TTGTTGTTCG GCTTTTCAGA AGCAAAAGAA CGCACTCGCC AAGCAGTTGA ACTTGCGGCC TCGCCAAGTC GAAGTTTGGT TCCAAAACAG AAGAGCAAGG TACTTTCACG ATTCATTGGA ATTGGTTCAG TTCGCAGCTG TCGTTGTTTG GCGTAGTGTT TGGTATAAGG GCCTGGCTCG GCTGGATTGG ATGAAAGCGA GATCTTATTT ACGATGATGG TGGTCGAGTG AGTCGGTAGA GGAAAGAGTC GTTCTGTCAG TATCTGGTAT TTGTCAAGGG GCGGGCGGGG CTAGCGAGCG ATGTGAGTGA AGGTCGGAAG GGCACGACTG GACCCGAGTC TGACGTGAAG GCTGATGATT GTGCAGGACC AAGCTGAAAC AAACCGAGGT CGATTGCGAG CTTCTGAAGC GATGCTGCGA GACTCTGACG GAGGAGAACC GGAGACTGCA GAAGGAGCTG CAGGAGCTGA GAGCGCTGAA GGTGGCTCCT CCGTGTGTCA TTGCGCACGA CTTCTACATG CCGCTACCTG CCGCCACCCT GACCATGTGC CCATCCTGTG AAAGAGTAAC CACCATGGAC AACAAGACAC TTGCCTTCGC CAAGCCCGGA TTCTCCCACT ACTCGCAATC CTCGGCCGCA TGTTAGAGGC GGCCGGATCG AAGCACAGAC AGGAAGGAAG GATGGAGGGT GAAGAAGCCA CAGGAAAGGT GGAACGAACG GTAGGTAGGT TGTATACATT CACATGACAC ACAGACAGAC ATCGAGGACT CTTAAAGGCC TCCTGTAATA TTGTTACTCA CTCACTCACT CACACTCAAG AGCTCAATGA TCATTTGTGT ATTTCTCAAG TTAGCATAGA ATTCTTCCAC ACTGGAAGCT TGTAATTGAC AACAATGTAG ACATCATGTT AAAAACGGTC GCAGTGTACA CGATCTAGAT TTTTTAAAGA TCTGAGCAGC TGCCCAAATT GAAGATCAGT GGTCTGATCG CACTCAATGC AGCGAGATGA TAGTGCATGC AGGCAGCCAG CCATGTAAGT AGTTGGTAGG ATAGAGGTAT GGATGATCTT TGTGCAACCA GCAGTTCCAT ACATGAGTTT GACCCTGAGT GATTATACTG TCAATGTCTC AATTCTGCCG GCCCCTCGGC AGTCTGGCCC TGAGTGTTTC ATTCTTGAGC AAAATCGACG GAGCGAATCT ACGATCAAAC CCGGGTTCAC ATTTTTGTAT GGCCTTGTTG AAATAAAGAT GCTTTACTGG AAACTTCTCA |
mRNA UTR + CDS |
>Mp2g24200.1 CAGCAGCTGG TCTGAAATCC CCCACCTCCT TACATTCGAG CAACACAATT GGCATTCTAG TTTCTCTCGT GACCCGTCTC CCACTGCAAG TCGCTCTCAC CTCCCTCTCT CTCTCACTCT CTCTCTTTCT CGGGTCTCCT CTCCTGCTCG CCTATCTCTA TCTCGTCCGC TTTCCAGCTC CAGTCATTCT CTCATCCGCT TGCGTGCTCG CTCTCCCTTC CAATCTCTGT CAATCTTTCT CCCTCTCTCT CTCTCTCACT CTCACTCTCT TCTCGGCCTG CCCCACTGGG CGAGTGCTGT GTTGCGCACA CAAGTCGAAC ATTCCACTGT TAGTTTGTCT GTATGACTGT CCGGCTCTAG CAGGCAAGGC GAAGAAGGTT GGCCTGGCGG GCAGAGGCTG GAATTTGAAG AAGAAACCAG CAGTCGCGTT CCATCGTCTT CTGCTTCTTC TTCTTCTTGC TCCGTGCTCG CTTTTCGAAC GAGTTTCGTA GCGAGTGCCA TTCGCTCGGC GTACAGACCA GCCCAGCTCT CTGTGGCAGG CAGGCAGGCA GGCAGGCAGG CCTTGCTCTC TCCTGCAGCT CTGGAACAGG AAGGCTCTCG AAAGACTGCG AGGTAACGCG CAGCTCCTTC TCGGCTTCGC TGTTCTCGGT TCTCTGCAAT CTCTCATTCA CGATCGGAGC TCGGCGGGTC GAGCATCACG GATTGCTCAG ACGAACCATT TCGGCTGTGC GTGCCGGGAG GAACCCGTCG TCCGTCAATC AGTTGGTCTT TCATGCTGCT GACGTCGAGG ATTTTCTCGT CTCAGAGCGT GGCGACGTGA GCCTGAGAAA TACGGACGGC TTGGCTTGGT TCTGGTGGTG GTGGTGGTGT CGGTGTCGGA GCCGGCTCTG AACACGACGA GGAAGCAAGA CTTGACAATC AGTCTAGATG ATGCTCCTAG CGGAGGGGAA GGATAGATTA CGGGAGGAGG TGCAGCGGCT TAGGGCGATG CAAGGCTCCG GAGCGAGCTT GAGCCTCAGC GTAGACTTGA GCATGATGGT AGCGAGGAGC CAGGACGATT GCTCCACCGT AAGACGGCAG TGCCCCGTGC AACTGGATCT GCTGCCCATG CCCACTTTTC CATCGGCTGC AGCAGCCCCG CACCCTCCCA CGCAGCAGTC GCAGTCGCAG TCCCACAATC CCTTCCCGTG GAAACCTCTC TTCAACAGAT CCCGTTCCAA CTCTGCCAGT GGTAATGAAG ACATCGAGGC GGCGGAAACT TACTCCTCGG GACACGGAGG ATCACCTCGA GGAATCGACG TCAATCAAAT CCCGTCTTCA GCCGATTGCG ACGACGTGGT GGTCTCGTCC CCCAACAGCA TCAATCTGAA AAGGGAGAGG GAGAAAACGC ACGATTTTGA CCTGGAACGC GACCGCACGT GTGACATTTC CTCGAGAGGC AGCGACGAGG AAGAAGGGGG CACCACGAGG AAGAAGCTCA GACTGTCCAA GGAGCAGTCG GCTCTCCTCG AGGAAAGCTT CAAAGAGCAC AGCACCCTCA ACCCGAAGCA AAAGAACGCA CTCGCCAAGC AGTTGAACTT GCGGCCTCGC CAAGTCGAAG TTTGGTTCCA AAACAGAAGA GCAAGGACCA AGCTGAAACA AACCGAGGTC GATTGCGAGC TTCTGAAGCG ATGCTGCGAG ACTCTGACGG AGGAGAACCG GAGACTGCAG AAGGAGCTGC AGGAGCTGAG AGCGCTGAAG GTGGCTCCTC CGTGTGTCAT TGCGCACGAC TTCTACATGC CGCTACCTGC CGCCACCCTG 
ACCATGTGCC CATCCTGTGA AAGAGTAACC ACCATGGACA ACAAGACACT TGCCTTCGCC AAGCCCGGAT TCTCCCACTA CTCGCAATCC TCGGCCGCAT GTTAGAGGCG GCCGGATCGA AGCACAGACA GGAAGGAAGG ATGGAGGGTG AAGAAGCCAC AGGAAAGGTG GAACGAACGG TAGGTAGGTT GTATACATTC ACATGACACA CAGACAGACA TCGAGGACTC TTAAAGGCCT CCTGTAATAT TGTTACTCAC TCACTCACTC ACACTCAAGA GCTCAATGAT CATTTGTGTA TTTCTCAAGT TAGCATAGAA TTCTTCCACA CTGGAAGCTT GTAATTGACA ACAATGTAGA CATCATGTTA AAAACGGTCG CAGTGTACAC GATCTAGATT TTTTAAAGAT CTGAGCAGCT GCCCAAATTG AAGATCAGTG GTCTGATCGC ACTCAATGCA GCGAGATGAT AGTGCATGCA GGCAGCCAGC CATGTAAGTA GTTGGTAGGA TAGAGGTATG GATGATCTTT GTGCAACCAG CAGTTCCATA CATGAGTTTG ACCCTGAGTG ATTATACTGT CAATGTCTCA ATTCTGCCGG CCCCTCGGCA GTCTGGCCCT GAGTGTTTCA TTCTTGAGCA AAATCGACGG AGCGAATCTA CGATCAAACC CGGGTTCACA TTTTTGTATG GCCTTGTTGA AATAAAGATG CTTTACTGGA AACTTCTCA |
CDS |
>Mp2g24200.1 ATGATGCTCC TAGCGGAGGG GAAGGATAGA TTACGGGAGG AGGTGCAGCG GCTTAGGGCG ATGCAAGGCT CCGGAGCGAG CTTGAGCCTC AGCGTAGACT TGAGCATGAT GGTAGCGAGG AGCCAGGACG ATTGCTCCAC CGTAAGACGG CAGTGCCCCG TGCAACTGGA TCTGCTGCCC ATGCCCACTT TTCCATCGGC TGCAGCAGCC CCGCACCCTC CCACGCAGCA GTCGCAGTCG CAGTCCCACA ATCCCTTCCC GTGGAAACCT CTCTTCAACA GATCCCGTTC CAACTCTGCC AGTGGTAATG AAGACATCGA GGCGGCGGAA ACTTACTCCT CGGGACACGG AGGATCACCT CGAGGAATCG ACGTCAATCA AATCCCGTCT TCAGCCGATT GCGACGACGT GGTGGTCTCG TCCCCCAACA GCATCAATCT GAAAAGGGAG AGGGAGAAAA CGCACGATTT TGACCTGGAA CGCGACCGCA CGTGTGACAT TTCCTCGAGA GGCAGCGACG AGGAAGAAGG GGGCACCACG AGGAAGAAGC TCAGACTGTC CAAGGAGCAG TCGGCTCTCC TCGAGGAAAG CTTCAAAGAG CACAGCACCC TCAACCCGAA GCAAAAGAAC GCACTCGCCA AGCAGTTGAA CTTGCGGCCT CGCCAAGTCG AAGTTTGGTT CCAAAACAGA AGAGCAAGGA CCAAGCTGAA ACAAACCGAG GTCGATTGCG AGCTTCTGAA GCGATGCTGC GAGACTCTGA CGGAGGAGAA CCGGAGACTG CAGAAGGAGC TGCAGGAGCT GAGAGCGCTG AAGGTGGCTC CTCCGTGTGT CATTGCGCAC GACTTCTACA TGCCGCTACC TGCCGCCACC CTGACCATGT GCCCATCCTG TGAAAGAGTA ACCACCATGG ACAACAAGAC ACTTGCCTTC GCCAAGCCCG GATTCTCCCA CTACTCGCAA TCCTCGGCCG CATGTTAG |
Protein |
>Mp2g24200.1 MMLLAEGKDR LREEVQRLRA MQGSGASLSL SVDLSMMVAR SQDDCSTVRR QCPVQLDLLP MPTFPSAAAA PHPPTQQSQS QSHNPFPWKP LFNRSRSNSA SGNEDIEAAE TYSSGHGGSP RGIDVNQIPS SADCDDVVVS SPNSINLKRE REKTHDFDLE RDRTCDISSR GSDEEEGGTT RKKLRLSKEQ SALLEESFKE HSTLNPKQKN ALAKQLNLRP RQVEVWFQNR RARTKLKQTE VDCELLKRCC ETLTEENRRL QKELQELRAL KVAPPCVIAH DFYMPLPAAT LTMCPSCERV TTMDNKTLAF AKPGFSHYSQ SSAAC |
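
The CDS and Protein entries above are related by translation, so one can check that the listed CDS yields the listed protein. The following is a minimal sketch, assuming the standard nuclear genetic code applies to this record and that the spaces in the sequence text are only display grouping in blocks of 10; the helper names `clean` and `translate` are hypothetical and not part of any database tooling.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order (assumption:
# the standard nuclear code is the right one for this record).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq_text: str) -> str:
    """Remove the whitespace that groups the sequence into blocks of 10."""
    return "".join(seq_text.split()).upper()


def translate(cds_text: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    cds = clean(cds_text)
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        amino = CODON_TABLE[cds[i:i + 3]]
        if amino == "*":  # stop codon ends the protein
            break
        protein.append(amino)
    return "".join(protein)


# First 30 nt of the CDS entry, copied with its original spacing.
print(translate("ATGATGCTCC TAGCGGAGGG GAAGGATAGA"))
```

Passing the full CDS string and comparing the result with `clean()` of the Protein entry would extend the same check to the whole record.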