AT2G40030.1
Subcellular Consensus
(Prediction and Experimental) min: :max .
SUBAcon:nucleus 1.000 ASURE: nucleus What is SUBAcon? |
|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Experimental Localisations and PPI |
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
SUBAcon links
AGI-AGI relationships |
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Description (TAIR10) | protein_coding : nuclear RNA polymerase D1B | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Curator Summary (TAIR10) |
Encodes the unique largest subunit of nuclear DNA-dependent RNA polymerase V; homologous to budding yeast RPB1 and the E. coli RNA polymerase beta prime subunit. Required for normal RNA-directed DNA methylation at non-CG methylation sites and transgene silencing. | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Computational Description (TAIR10) |
nuclear RNA polymerase D1B (NRPD1B); CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF3223 (InterPro:IPR021602), RNA polymerase, N-terminal (InterPro:IPR006592), RNA polymerase, alpha subunit (InterPro:IPR000722), RNA polymerase Rpb1, domain 3 (InterPro:IPR007066), RNA polymerase Rpb1, domain 1 (InterPro:IPR007080), RNA polymerase Rpb1, domain 5 (InterPro:IPR007081); BEST Arabidopsis thaliana protein match is: nuclear RNA polymerase D1A (TAIR:AT1G63020.2); Has 52919 Blast hits to 31940 proteins in 6835 species: Archae - 366; Bacteria - 10380; Metazoa - 13235; Fungi - 6920; Plants - 7147; Viruses - 757; Other Eukaryotes - 14114 (source: NCBI BLink). | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Protein Annotations |
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Coordinates (TAIR10) | chr2:+:16715089..16723406 | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Molecular Weight (calculated) | 218239.00 Da | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
IEP (calculated) | 6.16 | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
GRAVY (calculated) | -0.61 | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Length | 1976 amino acids | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sequence (TAIR10) (BLAST) |
0001: MEEESTSEIL DGEIVGITFA LASHHEICIQ SISESAINHP SQLTNAFLGL PLEFGKCESC GATEPDKCEG HFGYIQLPVP IYHPAHVNEL KQMLSLLCLK 0101: CLKIKKAKGT SGGLADRLLG VCCEEASQIS IKDRASDGAS YLELKLPSRS RLQPGCWNFL ERYGYRYGSD YTRPLLAREV KEILRRIPEE SRKKLTAKGH 0201: IPQEGYILEY LPVPPNCLSV PEASDGFSTM SVDPSRIELK DVLKKVIAIK SSRSGETNFE SHKAEASEMF RVVDTYLQVR GTAKAARNID MRYGVSKISD 0301: SSSSKAWTEK MRTLFIRKGS GFSSRSVITG DAYRHVNEVG IPIEIAQRIT FEERVSVHNR GYLQKLVDDK LCLSYTQGST TYSLRDGSKG HTELKPGQVV 0401: HRRVMDGDVV FINRPPTTHK HSLQALRVYV HEDNTVKINP LMCSPLSADF DGDCVHLFYP QSLSAKAEVM ELFSVEKQLL SSHTGQLILQ MGSDSLLSLR 0501: VMLERVFLDK ATAQQLAMYG SLSLPPPALR KSSKSGPAWT VFQILQLAFP ERLSCKGDRF LVDGSDLLKF DFGVDAMGSI INEIVTSIFL EKGPKETLGF 0601: FDSLQPLLME SLFAEGFSLS LEDLSMSRAD MDVIHNLIIR EISPMVSRLR LSYRDELQLE NSIHKVKEVA ANFMLKSYSI RNLIDIKSNS AITKLVQQTG 0701: FLGLQLSDKK KFYTKTLVED MAIFCKRKYG RISSSGDFGI VKGCFFHGLD PYEEMAHSIA AREVIVRSSR GLAEPGTLFK NLMAVLRDIV ITNDGTVRNT 0801: CSNSVIQFKY GVDSERGHQG LFEAGEPVGV LAATAMSNPA YKAVLDSSPN SNSSWELMKE VLLCKVNFQN TTNDRRVILY LNECHCGKRF CQENAACTVR 0901: NKLNKVSLKD TAVEFLVEYR KQPTISEIFG IDSCLHGHIH LNKTLLQDWN ISMQDIHQKC EDVINSLGQK KKKKATDDFK RTSLSVSECC SFRDPCGSKG 1001: SDMPCLTFSY NATDPDLERT LDVLCNTVYP VLLEIVIKGD SRICSANIIW NSSDMTTWIR NRHASRRGEW VLDVTVEKSA VKQSGDAWRV VIDSCLSVLH 1101: LIDTKRSIPY SVKQVQELLG LSCAFEQAVQ RLSASVRMVS KGVLKEHIIL LANNMTCSGT MLGFNSGGYK ALTRSLNIKA PFTEATLIAP RKCFEKAAEK 1201: CHTDSLSTVV GSCSWGKRVD VGTGSQFELL WNQKETGLDD KEETDVYSFL QMVISTTNAD AFVSSPGFDV TEEEMAEWAE SPERDSALGE PKFEDSADFQ 1301: NLHDEGKPSG ANWEKSSSWD NGCSGGSEWG VSKSTGGEAN PESNWEKTTN VEKEDAWSSW NTRKDAQESS KSDSGGAWGI KTKDADADTT PNWETSPAPK 1401: DSIVPENNEP TSDVWGHKSV SDKSWDKKNW GTESAPAAWG STDAAVWGSS DKKNSETESD AAAWGSRDKN NSDVGSGAGV LGPWNKKSSE TESNGATWGS 1501: SDKTKSGAAA WNSWDKKNIE TDSEPAAWGS QGKKNSETES GPAAWGAWDK KKSETEPGPA GWGMGDKKNS ETELGPAAMG NWDKKKSDTK SGPAAWGSTD 1601: AAAWGSSDKN NSETESDAAA WGSRNKKTSE IESGAGAWGS WGQPSPTAED KDTNEDDRNP WVSLKETKSR EKDDKERSQW GNPAKKFPSS GGWSNGGGAD 1701: WKGNRNHTPR PPRSEDNLAP MFTATRQRLD SFTSEEQELL SDVEPVMRTL RKIMHPSAYP DGDPISDDDK TFVLEKILNF HPQKETKLGS GVDFITVDKH 1801: TIFSDSRCFF VVSTDGAKQD FSYRKSLNNY LMKKYPDRAE EFIDKYFTKP RPSGNRDRNN QDATPPGEEQ SQPPNQSIGN GGDDFQTQTQ SQSPSQTRAQ 1901: SPSQAQAQSP SQTQSQSQSQ SQSQSQSQSQ SQSQSQSQSQ SQSQSQSPSQ TQTQSPSQTQ AQAQSPSSQS PSQTQT |
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
See Also |
|
Citation
If you find this resource useful please cite one of the following publications:
Hooper CM, Castleden I, Tanz SK, Aryamanesh, and Millar, AH (2017) SUBA4: the interactive data analysis centre for Arabidopsis subcellular protein locations Nucleic Acids Res. Jan 4;45(D1):D1064-D1074. doi: 10.1093/nar/gkw1041 (PubMed)
Hooper CM, Tanz SK, Castleden IR, Vacher MA, Small ID, Millar AH (2014) "SUBAcon: a consensus algorithm for unifying the subcellular localization data of the Arabidopsis proteome. Bioinformatics." 1;30(23):3356-64. (Bioinformatics) (PubMed)