suba logo
AT4G02070.1
Subcellular Consensus
(Prediction and Experimental)
min: heatmap :max

.
SUBAcon:
nucleus 1.000
What is SUBAcon?
Experimental Localisations and PPI
FP MS/MS PPI
SUBAcon links
AGI-AGI relationships
Coexpression PPI
no PPI data
Description (TAIR10) protein_coding : MUTS homolog 6
Curator
Summary (TAIR10)
encodes a DNA mismatch repair homolog of human MutS gene, MSH6. There are four MutS genes in Arabidopsis, MSH2, MSH3, MSH6, and MSH7, which all act as heterodimers and bind to 51-mer duplexes. MSH2*MSH6 bound the (+T) substrate strongly, (T/G) well, and (+AAG) no better than it did a (T/A) homoduplex.
Computational
Description (TAIR10)
MUTS homolog 6 (MSH6); FUNCTIONS IN: damaged DNA binding; INVOLVED IN: mismatch repair; LOCATED IN: chloroplast; EXPRESSED IN: 16 plant structures; EXPRESSED DURING: 8 growth stages; CONTAINS InterPro DOMAIN/s: DNA mismatch repair protein Msh6 (InterPro:IPR017261), DNA mismatch repair protein MutS, clamp (InterPro:IPR007861), DNA mismatch repair protein MutS, connector (InterPro:IPR007860), DNA mismatch repair protein MutS, core (InterPro:IPR007696), DNA mismatch repair protein MutS-like, N-terminal (InterPro:IPR007695), DNA mismatch repair protein MutS, N-terminal (InterPro:IPR016151), DNA mismatch repair protein MutS, C-terminal (InterPro:IPR000432), DNA mismatch repair protein MutS-homologue MSH6 (InterPro:IPR015536), Tudor domain (InterPro:IPR002999); BEST Arabidopsis thaliana protein match is: homolog of DNA mismatch repair protein MSH3 (TAIR:AT4G25540.1); Has 29702 Blast hits to 21889 proteins in 2967 species: Archae - 234; Bacteria - 11721; Metazoa - 3243; Fungi - 2118; Plants - 3658; Viruses - 802; Other Eukaryotes - 7926 (source: NCBI BLink).
Protein Annotations
eggNOG:COG0249eggNOG:KOG0217EMBL:AF001308EMBL:AF001535
EMBL:AJ245967EMBL:AL161493EMBL:CP002687EnsemblPlants:AT4G02070
EnsemblPlants:AT4G02070.1entrez:828147ExpressionAtlas:O04716Gene3D:3.40.1170.10
Gene3D:3.40.50.300GeneID:828147Genevisible:O04716GO:GO:0000400
GO:GO:0003684GO:GO:0005524GO:GO:0006290GO:GO:0006298
GO:GO:0009411GO:GO:0032137GO:GO:0032138GO:GO:0032301
GO:GO:0043570GO:GO:0045910hmmpanther:PTHR11361hmmpanther:PTHR11361:SF89
HOGENOM:HOG000243127InParanoid:O04716InterPro:IPR000432InterPro:IPR002999
InterPro:IPR007695InterPro:IPR007696InterPro:IPR007860InterPro:IPR007861
InterPro:IPR016151InterPro:IPR027417ncoils:CoilOMA:ALKDCMR
PaxDb:O04716Pfam:O04716Pfam:PF00488Pfam:PF01624
Pfam:PF05188Pfam:PF05190Pfam:PF05192PhylomeDB:O04716
PIR:T01508PRIDE:O04716PRO:PR:O04716PROSITE:PS00486
ProteinModelPortal:O04716Proteomes:UP000006548Reactome:R-ATH-5358565RefSeq:NP_192116.1
scanprosite:PS00486SMART:SM00333SMART:SM00533SMART:SM00534
SMR:O04716STRING:3702.AT4G02070.1SUPFAM:SSF48334SUPFAM:SSF52540
SUPFAM:SSF53150SUPFAM:SSF55271SUPFAM:SSF63748TAIR:AT4G02070
tair10-symbols:ATMSH6tair10-symbols:MSH6tair10-symbols:MSH6-1UniGene:At.34340
UniProt:O04716
Coordinates (TAIR10) chr4:+:906079..912930
Molecular Weight (calculated) 146805.00 Da
IEP (calculated) 6.62
GRAVY (calculated) -0.50
Length 1324 amino acids
Sequence (TAIR10)
(BLAST)
0001: MAPSRRQISG RSPLVNQQRQ ITSFFGKSAS SSSSPSPSPS PSLSNKKTPK SNNPNPKSPS PSPSPPKKTP KLNPNPSSNL PARSPSPGPD TPSPVQSKFK
0101: KPLLVIGQTP SPPQSVVITY GDEVVGKQVR VYWPLDKKWY DGSVTFYDKG EGKHVVEYED GEEESLDLGK EKTEWVVGEK SGDRFNRLKR GASALRKVVT
0201: DSDDDVEMGN VEEDKSDGDD SSDEDWGKNV GKEVCESEED DVELVDENEM DEEELVEEKD EETSKVNRVS KTDSRKRKTS EVTKSGGEKK SKTDTGTILK
0301: GFKASVVEPA KKIGQADRVV KGLEDNVLDG DALARFGARD SEKFRFLGVD RRDAKRRRPT DENYDPRTLY LPPDFVKKLT GGQRQWWEFK AKHMDKVVFF
0401: KMGKFYELFE MDAHVGAKEL DIQYMKGEQP HCGFPEKNFS VNIEKLVRKG YRVLVVEQTE TPDQLEQRRK ETGSKDKVVK REVCAVVTKG TLTDGEMLLT
0501: NPDASYLMAL TEGGESLTNP TAEHNFGVCL VDVATQKIIL GQFKDDQDCS ALSCLLSEMR PVEIIKPAKV LSYATERTIV RQTRNPLVNN LVPLSEFWDS
0601: EKTIYEVGII YKRINCQPSS AYSSEGKILG DGSSFLPKML SELATEDKNG SLALSALGGA IYYLRQAFLD ESLLRFAKFE SLPYCDFSNV NEKQHMVLDA
0701: AALENLEIFE NSRNGGYSGT LYAQLNQCIT ASGKRLLKTW LARPLYNTEL IKERQDAVAI LRGENLPYSL EFRKSLSRLP DMERLIARMF SSIEASGRNG
0801: DKVVLYEDTA KKQVQEFIST LRGCETMAEA CSSLRAILKH DTSRRLLHLL TPGQSLPNIS SSIKYFKDAF DWVEAHNSGR VIPHEGADEE YDCACKTVEE
0901: FESSLKKHLK EQRKLLGDAS INYVTVGKDE YLLEVPESLS GSVPHDYELC SSKKGVSRYW TPTIKKLLKE LSQAKSEKES ALKSISQRLI GRFCEHQEKW
1001: RQLVSATAEL DVLISLAFAS DSYEGVRCRP VISGSTSDGV PHLSATGLGH PVLRGDSLGR GSFVPNNVKI GGAEKASFIL LTGPNMGGKS TLLRQVCLAV
1101: ILAQIGADVP AETFEVSPVD KICVRMGAKD HIMAGQSTFL TELSETAVML TSATRNSLVV LDELGRGTAT SDGQAIAESV LEHFIEKVQC RGFFSTHYHR
1201: LSVDYQTNPK VSLCHMACQI GEGIGGVEEV TFLYRLTPGA CPKSYGVNVA RLAGLPDYVL QRAVIKSQEF EALYGKNHRK TDHKLAAMIK QIISSVASDS
1301: DYSASKDSLC ELHSMANTFL RLTN
See Also
Citation
If you find this resource useful please cite one of the following publications:

Hooper CM, Castleden I, Tanz SK, Aryamanesh, and Millar, AH (2017) SUBA4: the interactive data analysis centre for Arabidopsis subcellular protein locations Nucleic Acids Res. Jan 4;45(D1):D1064-D1074. doi: 10.1093/nar/gkw1041 (PubMed)

Hooper CM, Tanz SK, Castleden IR, Vacher MA, Small ID, Millar AH (2014) "SUBAcon: a consensus algorithm for unifying the subcellular localization data of the Arabidopsis proteome. Bioinformatics." 1;30(23):3356-64. (Bioinformatics) (PubMed)