Database | ID | Description |
---|---|---|
Gene3D | G3DSA:2.130.10.10 | - |
FunFam | G3DSA:2.130.10.10:FF:000512 | WD-40 repeat-containing protein MSI1 |
Pfam | PF00400 | WD domain, G-beta repeat |
ProSiteProfiles | PS50294 | Trp-Asp (WD) repeats circular profile. |
SMART | SM00320 | WD40_4 |
ProSiteProfiles | PS50082 | Trp-Asp (WD) repeats profile. |
ProSitePatterns | PS00678 | Trp-Asp (WD) repeats signature. |
PRINTS | PR00320 | G protein beta WD-40 repeat signature |
PANTHER | PTHR22850 | WD40 REPEAT FAMILY |
Pfam | PF12265 | Histone-binding protein RBBP4 or subunit C of CAF1 complex |
CDD | cd00200 | WD40 |
SUPERFAMILY | SSF50978 | WD40 repeat-like |
KEGG | K10752 | histone-binding protein RBBP4 |
KOG | KOG0264 | Nucleosome remodeling factor, subunit CAF1/NURF55/MSI1; [B] |
MapolyID | Mapoly0101s0048 | - |
GO | GO:0005634 | nucleus |
GO | GO:0031491 | nucleosome binding |
GO | GO:0006338 | chromatin remodeling |
GO | GO:0005515 | protein binding |
GO | GO:0006355 | regulation of DNA-templated transcription |
GO | GO:0042393 | histone binding |
Gene symbol | Product | Transcript ID | Status |
---|---|---|---|
MpMSI | Chromatin related protein | Provisional |
Sequences: |
Gene UTR + CDS + intron |
>Mp4g21020.1 GTTGGTTGGC GCGGCCCCTT CCCGCGGCAC GAGCACGCTC ATTTGTATCG AGCACAACGC GAACGACAGG AGCAAGGGGA AGACATTTGA CCTGCCCCCC GACGACGACG ACGCCAATGC CCCGCTCTTC CATCTCCTCG GCCCTCCTTC TGTGCATCGA CGATGAGCTG AGGCCTTCGA GCGGCCCCGA AGATTCATCG AGCTGAAGCA AGCGAGCAGC AGCCATCGAG GCCGATTCTT TTCTCTTCTG CTGCTGCTTC TTCTTCGTCT TCCTCCTCGA GCGCTTGCTT GCGCGCAGGC GTTAGAGGGA AATCCTATGT GTAGTTCCAG TCTGTGGTCG TGTCGATGAT TCTTTATGTT GTCGAGGTGA GGTCAGCGCT GGGTTGGGTC GGGCCAGTGT GCTGTGATTT GTGCTGTGTC GCGTGAGGCA GGGTGAAGGA CTCTGGAGGC ATTGATTTGA TTTTGGGCCG TGCTGGGGGG TGTGTTTGTG TGTGTGTGGG AAGGAGGGCA TTGTGAGGGA GGGATTCGTG GGATTTTTGG GGCAAGTGCG GTAAGAGAGA CGAGGATTCA GAAAGAGGGA GAGGAATAGG GAGAGGGAGG GAAGAGATGG CGAAGGAGGA TGATGAGTAC CGTGATGAGA TGGAGGAGAG GCTGGTGAAC GAGGAGTATA AGATCTGGAA GAAGAATACG CCTTTTCTCT ATGATTTGGT GATAACTCAT GCTATGGAAT GGCCCAGCCT CACTGTGCAA TGGTTGCCAG ACCGGCTGGA GCCAGCTGGG AAAGACTACT CGGTACAGAA ATTGATACTT GGTACGCACA CCTCAGACAA TGAACCCAAT TATTTGATGC TCGCAGAAGT CCAACTTCCC CTCGATGACA CGGAAAACGA TGCTCGCCAT TACGATGACG AGCGCGGTGA GATTGGGGGA TTTGGCGTTG CGAGCGGCAA GGTGAAGCAA CGGGTGAACC CTATTTCCCA GTTTCTCCCT CCCACTCGTT TATTCTTTTT ACAATTTTAG AATGTCCTTC ATGGCCCCTT TCTGCTCGCT TGTTATTGTT CTAAATGTAA GTGTTTGGAT TCATGACGGT GTTGAGTTCT TCGCCGCTTA GTTTCGATAG CGTCTTGTGT TGGTAATGTC CTGGTCGGAA TGAGACAAAC TGCGAGTGAA CTTGATTGCA AACCCAGACG CAGGTCAAGC GGTGCACATT AAATGGTTTC TTGTCTAACG CTTTTGCCAA CTTGAAGCAG TCATTAACTC TCAGGTTGGC TTGTTTTCGA ACTAGATTTT GCAGAAATGT GTAGACAGAG AGTTGGTGCT TTATCCACTT CTTTTGTCTG TGGAACAATT ATTGGAGGTC ATTTTATGAT CATGTTCTGT GAGGGGCTTG TCTCTATGAG TCTGTTCTCC AGATTGTGGG TTGGTCACTT TCTGAGAGCC TTCTCCGTTT GATTCCTTAG GTGCAAGTCA TACAGCAAAT CAACCATGAG GGAGAGGTGA ACAGAGCGCG TTACATGCCC CAGAATCCCT TCCTAATTGC TACGAAAACG GTCAGTGCGG AAGTCTATGT GTTCGACTAC AGCAAACACC CATCCAAGCC ACCACAAGAA GGCAATTGCA ATCCTGATTT GCGTCTCCGA GGCCACCGAA CAGAGGGCTA TGGCCTTTCT TGGAGCTCTT TCAGAGAGGG TCATCTTCTG AGTGGTTCAG ACGACTCTCA AATTTGTCTG TGGGACATCA GCTCTGCTTC CAAATCCACT CGAGTTCTGG ATGCCAAACA GACCTTTCAG GTGTGCTTCT CGTTCAGCTC TAGCTTGCCT CGACACAACT GTTACATCTG CCCATAAACA AACTTCTTTG CGCAACTGTT GCTGTTTGAA TTTTCTGCGC TTGCTTACAG ATGGTTTTAT GTTTGTTAGT TCTTTTTAAG GGCTGAGTAC TCATGTTCTC GCCAACCACT TGAACAAGGA AAGCATTTAG CACCCATTAC AACTTCTGTA CCTTTTTGCT TCTTCTCATC ACGCCTGTAT GGATCTTGCA GGGCCATGTT GGCGTGGTTG AAGACGTCGC TTGGCACTTG AGACACGACT ACCTTTTTGG TTCCGTTGGA GATGACAGGC AGCTATTGAT ATGGGATACC AGGACGTCAA CTGCGGAAAA GCCATTACAC GCAATTGATG CACATCAGGC TGAGGTACAT GCTCACTTTG TTGACAGGTG TTTTAGATTT ATCTTACCGT CCGGACTGAG TTCATTGAAA GTCACAGGTG TTATTCTGAA TCAAATAGTT GCAATTGAGT AACGGTGTCA CTGGATTGGC AGGTGAATTG CTTAGCCTTT AATCCCTTCA ACGAGTACCT CCTCGCCACA GGCTCTGCCG ACAGGACAGT GGCACTATAT GATCTACGCA AGTTGTCCAA GTGCCTACAC ACTTTCGTCA ACCACGCGTA CGTACATACA TTTCACTCTG AAAGATATTA GACATCTATT TGAAATGAGG TCCAATTGGA AGAATGTCGC TCGATGTGAT GTGAAACTGA AACTTCACTG GTCTGTGCAG TGAGGAGGTT TTCCAAATTG GGTGGAGTCC AATGAACCAA ACGATCCTAG CGTCATGTGG GGCGGACAAA AGATTAATGG TGTGGGATTT AAGCAGGTGA GAAATCTTGT GTTCCGGTTA TTTGATTTGT TGGTTTATCT GTAGTAGCAG GCAGAGTTTC AACGCGCTTC TTGCAGTATC TATTTCTGAA ACCTGTATCT GGCCTCCTTA ACATCTTTGA GCTCATGCCG GGTCAAGTAC CCTGCGTGGC TTCATATCAT AGTTACGTGA TTCTCCTGAC AATGATGGAC TGCTACAACC TTTATTACAC TGGCTTTGTA TGTTTTTTTT TGGGAGGGGC TAGTTAGTCC TCGGTTTGGG CTAATTATCC CACACCCTAG GCTAATGTGC ACTACTTCTC CCATTTTCCT GTGCCCGCTA AGATTATATC ACACAGCCGT AGTAAACATA GGGATAGCTT GGACCAATCA CAGAGCTCTT TTCTGTATAA TATAGTAAGG TGAGGGATAA TTAGCCCAAA CTGAGGGATA ACTAGCCCCT CCCCTTTTTT TCCTACCGTT AAGTTTACCT CCTTTTGTTA TGGAACTAAT ACTTCTGAAC TAATACCTCT GAACTATTAC CTCCCTACAA CAGAGTCGGT GAAGAGCAAT TACCCGAAGA CGCAGAGGAT GGACCACCGG AGCTTCTCTT TATTCACGGA GGCCATACCA GTAAAATATC AGATTTTTCA TGGAATCCAA ATGAACCCTG GCTTATATCT AGTGTGGCAG AAGATAATAT TCTCCAGCTG TGGCAAATGG CTGAGAACAT ATATCATGAC GAGGAAGATG GTCCTGCTGA GGACATGCCA GGTCTAATGT AGACTAGTGA GAAACAAGTA TAACAACGTC CTACAGGTAC GAGAACCGTT TGTAATAACA GAGATCTTAT GGACTATCTA TTCTGGTCAG CCAAGTCAAA CTAGAGATCA TTAAAGACCT GTTGCATCGA CGGCGGCTTT GTGTTGTTTC ACGGGAGTAC CCTTCTAATG TTTAGTGTAA TCGGGTGGTA CTACCATGGC ACTTGTATCA CGGATAGGGG ACGAAGTCCA GCCCTAGGCG GGAACCACAT TTTAAGTAGA GGCAACGAGG CACCACTTTG TAAATTGACA TCCACATCAA TGGAAATCTT TTGCACAATT TTCATTC |
mRNA UTR + CDS |
>Mp4g21020.1 GTTGGTTGGC GCGGCCCCTT CCCGCGGCAC GAGCACGCTC ATTTGTATCG AGCACAACGC GAACGACAGG AGCAAGGGGA AGACATTTGA CCTGCCCCCC GACGACGACG ACGCCAATGC CCCGCTCTTC CATCTCCTCG GCCCTCCTTC TGTGCATCGA CGATGAGCTG AGGCCTTCGA GCGGCCCCGA AGATTCATCG AGCTGAAGCA AGCGAGCAGC AGCCATCGAG GCCGATTCTT TTCTCTTCTG CTGCTGCTTC TTCTTCGTCT TCCTCCTCGA GCGCTTGCTT GCGCGCAGGC GTTAGAGGGA AATCCTATGT GTAGTTCCAG TCTGTGGTCG TGTCGATGAT TCTTTATGTT GTCGAGGTGA GGTCAGCGCT GGGTTGGGTC GGGCCAGTGT GCTGTGATTT GTGCTGTGTC GCGTGAGGCA GGGTGAAGGA CTCTGGAGGC ATTGATTTGA TTTTGGGCCG TGCTGGGGGG TGTGTTTGTG TGTGTGTGGG AAGGAGGGCA TTGTGAGGGA GGGATTCGTG GGATTTTTGG GGCAAGTGCG GTAAGAGAGA CGAGGATTCA GAAAGAGGGA GAGGAATAGG GAGAGGGAGG GAAGAGATGG CGAAGGAGGA TGATGAGTAC CGTGATGAGA TGGAGGAGAG GCTGGTGAAC GAGGAGTATA AGATCTGGAA GAAGAATACG CCTTTTCTCT ATGATTTGGT GATAACTCAT GCTATGGAAT GGCCCAGCCT CACTGTGCAA TGGTTGCCAG ACCGGCTGGA GCCAGCTGGG AAAGACTACT CGGTACAGAA ATTGATACTT GGTACGCACA CCTCAGACAA TGAACCCAAT TATTTGATGC TCGCAGAAGT CCAACTTCCC CTCGATGACA CGGAAAACGA TGCTCGCCAT TACGATGACG AGCGCGGTGA GATTGGGGGA TTTGGCGTTG CGAGCGGCAA GGTGAAGCAA CGGGTGCAAG TCATACAGCA AATCAACCAT GAGGGAGAGG TGAACAGAGC GCGTTACATG CCCCAGAATC CCTTCCTAAT TGCTACGAAA ACGGTCAGTG CGGAAGTCTA TGTGTTCGAC TACAGCAAAC ACCCATCCAA GCCACCACAA GAAGGCAATT GCAATCCTGA TTTGCGTCTC CGAGGCCACC GAACAGAGGG CTATGGCCTT TCTTGGAGCT CTTTCAGAGA GGGTCATCTT CTGAGTGGTT CAGACGACTC TCAAATTTGT CTGTGGGACA TCAGCTCTGC TTCCAAATCC ACTCGAGTTC TGGATGCCAA ACAGACCTTT CAGGGCCATG TTGGCGTGGT TGAAGACGTC GCTTGGCACT TGAGACACGA CTACCTTTTT GGTTCCGTTG GAGATGACAG GCAGCTATTG ATATGGGATA CCAGGACGTC AACTGCGGAA AAGCCATTAC ACGCAATTGA TGCACATCAG GCTGAGGTGA ATTGCTTAGC CTTTAATCCC TTCAACGAGT ACCTCCTCGC CACAGGCTCT GCCGACAGGA CAGTGGCACT ATATGATCTA CGCAAGTTGT CCAAGTGCCT ACACACTTTC GTCAACCACG CTGAGGAGGT TTTCCAAATT GGGTGGAGTC CAATGAACCA AACGATCCTA GCGTCATGTG GGGCGGACAA AAGATTAATG GTGTGGGATT TAAGCAGAGT CGGTGAAGAG CAATTACCCG AAGACGCAGA GGATGGACCA CCGGAGCTTC TCTTTATTCA CGGAGGCCAT ACCAGTAAAA TATCAGATTT TTCATGGAAT CCAAATGAAC CCTGGCTTAT ATCTAGTGTG GCAGAAGATA ATATTCTCCA GCTGTGGCAA ATGGCTGAGA ACATATATCA TGACGAGGAA GATGGTCCTG CTGAGGACAT GCCAGGTCTA ATGTAGACTA GTGAGAAACA AGTATAACAA CGTCCTACAG GTACGAGAAC CGTTTGTAAT AACAGAGATC TTATGGACTA TCTATTCTGG TCAGCCAAGT CAAACTAGAG ATCATTAAAG ACCTGTTGCA TCGACGGCGG CTTTGTGTTG TTTCACGGGA GTACCCTTCT AATGTTTAGT GTAATCGGGT GGTACTACCA TGGCACTTGT ATCACGGATA GGGGACGAAG TCCAGCCCTA GGCGGGAACC ACATTTTAAG TAGAGGCAAC GAGGCACCAC TTTGTAAATT GACATCCACA TCAATGGAAA TCTTTTGCAC AATTTTCATT C |
CDS |
>Mp4g21020.1 ATGGCGAAGG AGGATGATGA GTACCGTGAT GAGATGGAGG AGAGGCTGGT GAACGAGGAG TATAAGATCT GGAAGAAGAA TACGCCTTTT CTCTATGATT TGGTGATAAC TCATGCTATG GAATGGCCCA GCCTCACTGT GCAATGGTTG CCAGACCGGC TGGAGCCAGC TGGGAAAGAC TACTCGGTAC AGAAATTGAT ACTTGGTACG CACACCTCAG ACAATGAACC CAATTATTTG ATGCTCGCAG AAGTCCAACT TCCCCTCGAT GACACGGAAA ACGATGCTCG CCATTACGAT GACGAGCGCG GTGAGATTGG GGGATTTGGC GTTGCGAGCG GCAAGGTGAA GCAACGGGTG CAAGTCATAC AGCAAATCAA CCATGAGGGA GAGGTGAACA GAGCGCGTTA CATGCCCCAG AATCCCTTCC TAATTGCTAC GAAAACGGTC AGTGCGGAAG TCTATGTGTT CGACTACAGC AAACACCCAT CCAAGCCACC ACAAGAAGGC AATTGCAATC CTGATTTGCG TCTCCGAGGC CACCGAACAG AGGGCTATGG CCTTTCTTGG AGCTCTTTCA GAGAGGGTCA TCTTCTGAGT GGTTCAGACG ACTCTCAAAT TTGTCTGTGG GACATCAGCT CTGCTTCCAA ATCCACTCGA GTTCTGGATG CCAAACAGAC CTTTCAGGGC CATGTTGGCG TGGTTGAAGA CGTCGCTTGG CACTTGAGAC ACGACTACCT TTTTGGTTCC GTTGGAGATG ACAGGCAGCT ATTGATATGG GATACCAGGA CGTCAACTGC GGAAAAGCCA TTACACGCAA TTGATGCACA TCAGGCTGAG GTGAATTGCT TAGCCTTTAA TCCCTTCAAC GAGTACCTCC TCGCCACAGG CTCTGCCGAC AGGACAGTGG CACTATATGA TCTACGCAAG TTGTCCAAGT GCCTACACAC TTTCGTCAAC CACGCTGAGG AGGTTTTCCA AATTGGGTGG AGTCCAATGA ACCAAACGAT CCTAGCGTCA TGTGGGGCGG ACAAAAGATT AATGGTGTGG GATTTAAGCA GAGTCGGTGA AGAGCAATTA CCCGAAGACG CAGAGGATGG ACCACCGGAG CTTCTCTTTA TTCACGGAGG CCATACCAGT AAAATATCAG ATTTTTCATG GAATCCAAAT GAACCCTGGC TTATATCTAG TGTGGCAGAA GATAATATTC TCCAGCTGTG GCAAATGGCT GAGAACATAT ATCATGACGA GGAAGATGGT CCTGCTGAGG ACATGCCAGG TCTAATGTAG |
Protein |
>Mp4g21020.1 MAKEDDEYRD EMEERLVNEE YKIWKKNTPF LYDLVITHAM EWPSLTVQWL PDRLEPAGKD YSVQKLILGT HTSDNEPNYL MLAEVQLPLD DTENDARHYD DERGEIGGFG VASGKVKQRV QVIQQINHEG EVNRARYMPQ NPFLIATKTV SAEVYVFDYS KHPSKPPQEG NCNPDLRLRG HRTEGYGLSW SSFREGHLLS GSDDSQICLW DISSASKSTR VLDAKQTFQG HVGVVEDVAW HLRHDYLFGS VGDDRQLLIW DTRTSTAEKP LHAIDAHQAE VNCLAFNPFN EYLLATGSAD RTVALYDLRK LSKCLHTFVN HAEEVFQIGW SPMNQTILAS CGADKRLMVW DLSRVGEEQL PEDAEDGPPE LLFIHGGHTS KISDFSWNPN EPWLISSVAE DNILQLWQMA ENIYHDEEDG PAEDMPGLM |