AT5G23880.1
Subcellular Consensus
(Prediction and Experimental)
SUBAcon:
nucleus 1.000
Experimental Localisations and PPI
FP MS/MS PPI
  • PMID:30961429 (2019): nucleus
  • PMID:28865150 (2017): extracellular region, plant-type cell wall
  • PMID:21433285 (2011): plasma membrane
Description (TAIR10) protein_coding : cleavage and polyadenylation specificity factor 100
Curator Summary (TAIR10)
Encodes a protein similar to the 100 kDa subunit of cleavage and polyadenylation specificity factor (CPSF), the factor responsible for recognition of the AAUAAA motif during mRNA polyadenylation. The protein interacts with a portion of a nuclear poly(A) polymerase. It is likely to be part of the mRNA 3'-end formation apparatus.
Computational Description (TAIR10)
cleavage and polyadenylation specificity factor 100 (CPSF100); FUNCTIONS IN: protein binding, DNA binding; INVOLVED IN: mRNA cleavage, mRNA polyadenylation, posttranscriptional gene silencing by RNA, embryo development ending in seed dormancy; LOCATED IN: mRNA cleavage and polyadenylation specificity factor complex, nucleus; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Beta-Casp domain (InterPro:IPR022712), RNA-metabolising metallo-beta-lactamase (InterPro:IPR011108), Beta-lactamase-like (InterPro:IPR001279); BEST Arabidopsis thaliana protein match is: cleavage and polyadenylation specificity factor 73-I (TAIR:AT1G61010.3); Has 1807 Blast hits to 1807 proteins in 277 species: Archaea - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink).
Protein Annotations
BioGrid:17728; DIP:DIP-40385N; eggNOG:COG1236; eggNOG:KOG1135
EMBL:AB005244; EMBL:AF283277; EMBL:AY034982; EMBL:BT004374
EMBL:CP002688; EnsemblPlants:AT5G23880; EnsemblPlants:AT5G23880.1; entrez:832453
Gene3D:3.60.15.10; GeneID:832453; Genevisible:Q9LKF9; GO:GO:0003723
GO:GO:0005634; GO:GO:0005737; GO:GO:0005847; GO:GO:0006378
GO:GO:0006379; GO:GO:0009506; GO:GO:0035194; Gramene:AT5G23880.1
hmmpanther:PTHR11203; hmmpanther:PTHR11203:SF5; HOGENOM:HOG000264343; InParanoid:Q9LKF9
IntAct:Q9LKF9; InterPro:IPR001279; InterPro:IPR011108; InterPro:IPR022712
InterPro:IPR025069; InterPro:IPR027075; iPTMnet:Q9LKF9; KEGG:ath:AT5G23880
KO:K14402; ncoils:Coil; OMA:LWCSNGT; PANTHER:PTHR11203:SF5
PaxDb:Q9LKF9; Pfam:PF07521; Pfam:PF10996; Pfam:PF13299
Pfam:PF16661; Pfam:Q9LKF9; PhylomeDB:Q9LKF9; PRIDE:Q9LKF9
PRO:PR:Q9LKF9; ProteinModelPortal:Q9LKF9; Proteomes:UP000006548; Reactome:R-ATH-72163
Reactome:R-ATH-72187; Reactome:R-ATH-77595; RefSeq:NP_197776.1; SMART:SM00849
SMART:SM01027; SMR:Q9LKF9; STRING:3702.AT5G23880.1; SUPFAM:SSF56281
TAIR:AT5G23880; tair10-symbols:ATCPSF100; tair10-symbols:CPSF100; tair10-symbols:EMB1265
tair10-symbols:ESP5; UniGene:At.25191; UniProt:Q9LKF9
Coordinates (TAIR10) chr5:+:8052550..8058147
Molecular Weight (calculated) 82144.00 Da
IEP (calculated) 4.97
GRAVY (calculated) -0.16
Length 739 amino acids
Sequence (TAIR10)
001: MGTSVQVTPL CGVYNENPLS YLVSIDGFNF LIDCGWNDLF DTSLLEPLSR VASTIDAVLL SHPDTLHIGA LPYAMKQLGL SAPVYATEPV HRLGLLTMYD
101: QFLSRKQVSD FDLFTLDDID SAFQNVIRLT YSQNYHLSGK GEGIVIAPHV AGHMLGGSIW RITKDGEDVI YAVDYNHRKE RHLNGTVLQS FVRPAVLITD
201: AYHALYTNQT ARQQRDKEFL DTISKHLEVG GNVLLPVDTA GRVLELLLIL EQHWSQRGFS FPIYFLTYVS SSTIDYVKSF LEWMSDSISK SFETSRDNAF
301: LLRHVTLLIN KTDLDNAPPG PKVVLASMAS LEAGFAREIF VEWANDPRNL VLFTETGQFG TLARMLQSAP PPKFVKVTMS KRVPLAGEEL IAYEEEQNRL
401: KREEALRASL VKEEETKASH GSDDNSSEPM IIDTKTTHDV IGSHGPAYKD ILIDGFVPPS SSVAPMFPYY DNTSEWDDFG EIINPDDYVI KDEDMDRGAM
501: HNGGDVDGRL DEATASLMLD TRPSKVMSNE LIVTVSCSLV KMDYEGRSDG RSIKSMIAHV SPLKLVLVHA IAEATEHLKQ HCLNNICPHV YAPQIEETVD
601: VTSDLCAYKV QLSEKLMSNV IFKKLGDSEV AWVDSEVGKT ERDMRSLLPM PGAASPHKPV LVGDLKIADF KQFLSSKGVQ VEFAGGGALR CGEYVTLRKV
701: GPTGQKGGAS GPQQILIEGP LCEDYYKIRD YLYSQFYLL
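
The length and GRAVY values listed above are derived from this sequence. As a rough sketch of how such figures can be recomputed, the Python below strips the position offsets and spacing from a numbered listing and averages Kyte-Doolittle hydropathy values over the residues. The function names `parse_numbered_sequence` and `gravy` are hypothetical, and treating GRAVY as a plain Kyte-Doolittle mean is an assumption about how the calculated value was produced, not a description of the SUBA pipeline.

```python
import re

# Kyte-Doolittle hydropathy scale for the 20 standard amino acids.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def parse_numbered_sequence(text):
    """Remove leading 'NNN:' offsets and all whitespace from a listing."""
    parts = []
    for line in text.splitlines():
        line = re.sub(r"^\s*\d+:\s*", "", line)  # drop the position offset
        parts.append("".join(line.split()))      # drop the 10-residue spacing
    return "".join(parts)


def gravy(seq):
    """Mean Kyte-Doolittle hydropathy over all residues (assumed GRAVY method)."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


# Example on the final line of the listing only; paste the full listing
# to recompute the whole-protein values.
tail = parse_numbered_sequence("701: GPTGQKGGAS GPQQILIEGP LCEDYYKIRD YLYSQFYLL")
print(len(tail), round(gravy(tail), 2))
```

A residue outside the 20 standard amino acids (for example `X`) raises a `KeyError` here, which is deliberate: silently skipping it would change the average.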
Citation
If you find this resource useful please cite one of the following publications:

Hooper CM, Castleden IR, Tanz SK, Aryamanesh N, Millar AH (2017) SUBA4: the interactive data analysis centre for Arabidopsis subcellular protein locations. Nucleic Acids Res. 45(D1):D1064-D1074. doi: 10.1093/nar/gkw1041

Hooper CM, Tanz SK, Castleden IR, Vacher MA, Small ID, Millar AH (2014) SUBAcon: a consensus algorithm for unifying the subcellular localization data of the Arabidopsis proteome. Bioinformatics 30(23):3356-64.