Identification of Novel Plant Peroxisomal Targeting Signals by a Combination of Machine Learning Methods and in Vivo Subcellular Targeting Analyses
Author(s) -
Thomas Lingner,
Amr R. A. Kataya,
Gerardo E. Antonicelli,
Aline Benichou,
Kjersti Nilssen,
XiongYan Chen,
Tanja Siemsen,
Burkhard Morgenstern,
Peter Meinicke,
Sigrun Reumann
Publication year - 2011
Publication title -
the plant cell
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 5.324
H-Index - 341
eISSN - 1532-298X
pISSN - 1040-4651
DOI - 10.1105/tpc.111.084095
Subject(s) - biology , peroxisome , arabidopsis , proteome , computational biology , peroxisomal targeting signal , arabidopsis thaliana , gene , genetics , mutant
In the postgenomic era, accurate prediction tools are essential for identification of the proteomes of cell organelles. Prediction methods have been developed for peroxisome-targeted proteins in animals and fungi but are missing specifically for plants. For development of a predictor for plant proteins carrying peroxisome targeting signals type 1 (PTS1), we assembled more than 2500 homologous plant sequences, mainly from EST databases. We applied a discriminative machine learning approach to derive two different prediction methods, both of which showed high prediction accuracy and recognized specific targeting-enhancing patterns in the regions upstream of the PTS1 tripeptides. Upon application of these methods to the Arabidopsis thaliana genome, 392 gene models were predicted to be peroxisome targeted. These predictions were extensively tested in vivo, resulting in a high experimental verification rate of Arabidopsis proteins previously not known to be peroxisomal. The prediction methods were able to correctly infer novel PTS1 tripeptides, which even included novel residues. Twenty-three newly predicted PTS1 tripeptides were experimentally confirmed, and a high variability of the plant PTS1 motif was discovered. These prediction methods will be instrumental in identifying low-abundance and stress-inducible peroxisomal proteins and defining the entire peroxisomal proteome of Arabidopsis and agronomically important crop plants.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom