The predicted subcellular localisation of the sugarcane proteome
Renato Vicentini A B and Marcelo Menossi A
A Departamento de Genética e Evolução, Laboratório de Genoma Funcional, Instituto de Biologia, CP 6109, Universidade Estadual de Campinas - UNICAMP, 13083-970, Campinas, SP, Brazil.
B Corresponding author. Email: shinapes@unicamp.br
Functional Plant Biology 36(3) 242-250 https://doi.org/10.1071/FP08252
Submitted: 30 September 2008 Accepted: 12 January 2009 Published: 2 March 2009
Abstract
Plant cells are highly organised, and many biological processes are associated with specialised subcellular structures. Subcellular localisation is a key feature of proteins because it is related to biological function. The subcellular localisation of proteins can be predicted, providing information that is particularly relevant to those proteins with unknown or putative function. We performed the first in silico genome-wide subcellular localisation analysis for the sugarcane transcriptome (with 11 882 predicted proteins) and found that most of the proteins were localised in four compartments: nucleus (44%), cytosol (19%), mitochondria (12%) and secretory destinations (11%). We also showed that ~19% of the proteins were localised in multiple compartments. Other results allowed identification of a potential set of sugarcane proteins that could show dual targeting by the use of N-truncated forms that started from the nearest downstream in-frame AUG codons. This study was a first step in increasing knowledge about the subcellular localisation of the sugarcane proteome.
Additional keywords: dual targeting, N-truncated, PWMSubLoc, Saccharum ssp.
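The abstract mentions N-truncated forms that start from the nearest downstream in-frame AUG codon. The following is a minimal sketch of that idea only, not the pipeline used in the study: it assumes the input is a coding sequence that begins at the annotated start codon, and the function names `nearest_downstream_inframe_aug` and `n_truncated_form` are hypothetical, chosen here for illustration.

```python
from typing import Optional

# Standard stop codons in RNA notation.
STOP_CODONS = {"UAA", "UAG", "UGA"}


def nearest_downstream_inframe_aug(cds: str) -> Optional[int]:
    """Return the 0-based nucleotide index of the first in-frame AUG
    downstream of the annotated start codon, or None if a stop codon
    or the end of the sequence is reached first.

    Assumption: `cds` starts at the annotated AUG (DNA or RNA letters).
    """
    seq = cds.upper().replace("T", "U")
    if not seq.startswith("AUG"):
        raise ValueError("sequence must begin with the annotated start codon")
    # Step codon by codon so that only in-frame triplets are examined.
    for i in range(3, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if codon in STOP_CODONS:
            return None
        if codon == "AUG":
            return i
    return None


def n_truncated_form(cds: str) -> Optional[str]:
    """Return the coding sequence (RNA letters) restarted at the nearest
    downstream in-frame AUG, or None if there is no such codon."""
    idx = nearest_downstream_inframe_aug(cds)
    if idx is None:
        return None
    return cds.upper().replace("T", "U")[idx:]
```

Under these assumptions, the truncated sequence could be translated and passed to a localisation predictor alongside the full-length form, so that the two predictions can be compared for possible dual targeting.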
Acknowledgements
RV was supported by a fellowship from the UNIEMP Institute and MM received a research fellowship from CNPq. This work was partially supported by grant 05/58104–0 from FAPESP, awarded to MM.