Database | ID | Description |
---|---|---|
CDD | cd00086 | homeodomain |
MobiDBLite | mobidb-lite | consensus disorder prediction |
Pfam | PF05920 | Homeobox KN domain |
SMART | SM00389 | HOX_1 |
ProSitePatterns | PS00027 | 'Homeobox' domain signature. |
Gene3D | G3DSA:1.10.10.60 | - |
ProSiteProfiles | PS50071 | 'Homeobox' domain profile. |
SUPERFAMILY | SSF46689 | Homeodomain-like |
PANTHER | PTHR11850 | HOMEOBOX PROTEIN TRANSCRIPTION FACTORS |
KOG | KOG0774 | Transcription factor PBX and related HOX domain proteins; N-term missing; C-term missing; [K] |
MapolyID | Mapoly0132s0008 | - |
GO | GO:0000978 | RNA polymerase II cis-regulatory region sequence-specific DNA binding |
GO | GO:0006357 | regulation of transcription by RNA polymerase II |
GO | GO:0003677 | DNA binding |
GO | GO:0006355 | regulation of DNA-templated transcription |
GO | GO:0000981 | DNA-binding transcription factor activity, RNA polymerase II-specific |
Sequences: |
Gene UTR + CDS + intron |
>Mp4g09650.1 GATTATGAAG CTCGTCACTC ACAAGAGGTG CTTTCGCCTG TTCCATATCC TCAAACATAC GTGGAAGAGG TGTTGGCCAC CCTTGCGTGC TGCTCTCAAC GACCAGAATT GCCTCTCGTC CCATCAGGAG ATAATGACAA TCTTAATGCA GTCAGGGGCT GTGAAGCTGA AGACCCACAA GAAGTCTGTG TTAGTGCTTC TTCTCCTCCA GGATGTGTGG AGGAGGTGCT GGACAAGGAC ATGGACATGG ATAGCTCCTC CGACCAATTG GCATGGGCTC CTGTACCATC ACCAGATGAT GAAATCGTTA CTGAAATGAG GGCTTGTGAG CCTTGTAACA GAGGAGAGGA ATTTGCAAGT GCTCCTATTT CTCAGGGCAG TATGCGAGAG GTGCTGGAAT CTAACCACCC CACTTGCTCT CAACAATTGG AGTTTACTCC CAGGCCAGCA CCAGTTGATG ACCATGTTAA AGCAACCAAC AGCATTCTCG ATACAAATGA TGACTCTTCC ACTGGATCTT TAACTTCATC GAGACGTAAA GAGCTGGAAG GCACCGGATC CAGGTCGGTC CACACACCGG AATCGCTCCA AATCTTGAGG TCAGTACATT TCGAAACGTT TGCTTCCCTC CTTGCTTGAA TTTAGATGAC CAGGGCACGC CTAAGCTATT AATCAAGTAC ATTGCTCAAA AGACAAGGCC GAAGTACCGA ATTTTTATTC ATTTAACCAC CACTCTCTCC CCTTTGCCAC AGAACCTGGT GGGAGAAGAA TGAGCTCATT CCATATCCGA GTCAACGAGA AGTCTATGAA ATGTCACGGA GGACCAACCT TACATTGCAA CAGGTTACTG CCAGCATCCT CACATCTATT ACAGTGGAAT AGCTGCCATC TATAGAGAAA GGTTGCATGT TTCCTCCATC GCGAAAGCGG AAGGATTGTT TAAATGAGCA TCACTTGAAA TTGTGCTCTA GATCTCGGAC TGGTTCAAGA ACGAAAGGAG CAGAAAATGG AGAGACGATC CTCGAAAGAA GAAGGCGAAT AGACTGCCAC CACGGGCAAC ACGACTCTTG AGGTGACATA ACTCTTTGCT TCGCCTTTTT CCATGAATAT CTCATTTCTC GGCGTTATCC AAACAATATG ATATCCTCTT CATATTTGAC CTCTGTTTCT TGAAGCCATT TGTGCCTGGA CTATTGTGGG AAATTTGATT TGTACTTACA TGTACTTTGC TGTAAATCTT TCCATGAAGT TCTCATTCTT ACTTTTCAAC TTGCATTCTG AAACTCGTAG TACATGGGCA TCCGAACATC TGAGTCATCC AAACCCCTCT CGGGAGCAAA GGGAAGATTT AGCAAAGGAA TCTGGAGTTA CCTATGTGCA GGTAAGTGCA TTTTGCTATA GATATTCTAG GGTGACACTA CCTTGCTTAT AGCAACACAT AACGCTTCAC TCTTCAAAAG GTTCCAAATG CATAGTATGC AACATCCGAA AAGTAATCAG TTGATGGAAT AAGAGCTTAC AAGTGACATT CTAACGCTGC ATTTAGTTTG AAGATCCGAG AAAAAGACGG ACTTGTATCA GTGAAAAGAT AAAGATGAAG CATTGCAAAT TTACTAATTT TATACTTGTG ACTGTAACTT GCCCCTCTAT TATCACTTTC TTATCACTAT TAATGGATGA AAGATAAGAG AACTATTTTC TCAATTTTTT GCAGGTTTCG AATTGGTTCA TGAACTTCCG AAAGAGGAGC AAGGCTCGAA TAAGGAAAGC TAAAGCCTCG AATATTCCAT GAAC |
mRNA UTR + CDS |
>Mp4g09650.1 GATTATGAAG CTCGTCACTC ACAAGAGGTG CTTTCGCCTG TTCCATATCC TCAAACATAC GTGGAAGAGG TGTTGGCCAC CCTTGCGTGC TGCTCTCAAC GACCAGAATT GCCTCTCGTC CCATCAGGAG ATAATGACAA TCTTAATGCA GTCAGGGGCT GTGAAGCTGA AGACCCACAA GAAGTCTGTG TTAGTGCTTC TTCTCCTCCA GGATGTGTGG AGGAGGTGCT GGACAAGGAC ATGGACATGG ATAGCTCCTC CGACCAATTG GCATGGGCTC CTGTACCATC ACCAGATGAT GAAATCGTTA CTGAAATGAG GGCTTGTGAG CCTTGTAACA GAGGAGAGGA ATTTGCAAGT GCTCCTATTT CTCAGGGCAG TATGCGAGAG GTGCTGGAAT CTAACCACCC CACTTGCTCT CAACAATTGG AGTTTACTCC CAGGCCAGCA CCAGTTGATG ACCATGTTAA AGCAACCAAC AGCATTCTCG ATACAAATGA TGACTCTTCC ACTGGATCTT TAACTTCATC GAGACGTAAA GAGCTGGAAG GCACCGGATC CAGGTCGGTC CACACACCGG AATCGCTCCA AATCTTGAGA ACCTGGTGGG AGAAGAATGA GCTCATTCCA TATCCGAGTC AACGAGAAGT CTATGAAATG TCACGGAGGA CCAACCTTAC ATTGCAACAG ATCTCGGACT GGTTCAAGAA CGAAAGGAGC AGAAAATGGA GAGACGATCC TCGAAAGAAG AAGGCGAATA GACTGCCACC ACGGGCAACA CGACTCTTGA GTACATGGGC ATCCGAACAT CTGAGTCATC CAAACCCCTC TCGGGAGCAA AGGGAAGATT TAGCAAAGGA ATCTGGAGTT ACCTATGTGC AGGTTTCGAA TTGGTTCATG AACTTCCGAA AGAGGAGCAA GGCTCGAATA AGGAAAGCTA AAGCCTCGAA TATTCCATGA AC |
CDS |
>Mp4g09650.1 ATGGACATGG ATAGCTCCTC CGACCAATTG GCATGGGCTC CTGTACCATC ACCAGATGAT GAAATCGTTA CTGAAATGAG GGCTTGTGAG CCTTGTAACA GAGGAGAGGA ATTTGCAAGT GCTCCTATTT CTCAGGGCAG TATGCGAGAG GTGCTGGAAT CTAACCACCC CACTTGCTCT CAACAATTGG AGTTTACTCC CAGGCCAGCA CCAGTTGATG ACCATGTTAA AGCAACCAAC AGCATTCTCG ATACAAATGA TGACTCTTCC ACTGGATCTT TAACTTCATC GAGACGTAAA GAGCTGGAAG GCACCGGATC CAGGTCGGTC CACACACCGG AATCGCTCCA AATCTTGAGA ACCTGGTGGG AGAAGAATGA GCTCATTCCA TATCCGAGTC AACGAGAAGT CTATGAAATG TCACGGAGGA CCAACCTTAC ATTGCAACAG ATCTCGGACT GGTTCAAGAA CGAAAGGAGC AGAAAATGGA GAGACGATCC TCGAAAGAAG AAGGCGAATA GACTGCCACC ACGGGCAACA CGACTCTTGA GTACATGGGC ATCCGAACAT CTGAGTCATC CAAACCCCTC TCGGGAGCAA AGGGAAGATT TAGCAAAGGA ATCTGGAGTT ACCTATGTGC AGGTTTCGAA TTGGTTCATG AACTTCCGAA AGAGGAGCAA GGCTCGAATA AGGAAAGCTA AAGCCTCGAA TATTCCATGA |
Protein |
>Mp4g09650.1 MDMDSSSDQL AWAPVPSPDD EIVTEMRACE PCNRGEEFAS APISQGSMRE VLESNHPTCS QQLEFTPRPA PVDDHVKATN SILDTNDDSS TGSLTSSRRK ELEGTGSRSV HTPESLQILR TWWEKNELIP YPSQREVYEM SRRTNLTLQQ ISDWFKNERS RKWRDDPRKK KANRLPPRAT RLLSTWASEH LSHPNPSREQ REDLAKESGV TYVQVSNWFM NFRKRSKARI RKAKASNIP |