The online version of this article (doi:10.1186/1475-2875-11-375) contains supplementary material, which is available to authorized users.
The authors declare that they have no competing interests.
AMN performed all data analyses. AMN, RSR and AMR worked on scripts for data integration. DAM and SSR participated in the experimental validation of new gene models. CJFF coordinated the field work. LHC, CFAB and AMN conceived and designed the study. CFAB coordinated the study. CFAB, AMN and AMR wrote the manuscript. All authors read and approved the final manuscript.
The signal peptide is one of the most important motifs involved in protein trafficking, and it ultimately influences protein function. Given the expected functional conservation among orthologs, it was hypothesized that divergence in signal peptides within orthologous groups is mainly due to misannotation of the N-terminal protein sequence. Thus, discrepancies in signal peptide prediction among orthologous proteins were used to identify misannotated proteins in five Plasmodium species.
Signal peptide prediction (SignalP) and orthology assignment (OrthoMCL) were combined in an innovative strategy to identify orthologous groups showing discrepancies in signal peptide prediction among their protein members (Mixed groups). In a comparative analysis, the multiple alignments and gene models for each of these groups were visually inspected in search of misannotated proteins and, whenever possible, alternative gene models were proposed. Thresholds for signal peptide prediction parameters were also modified to reduce their impact as a possible source of discrepancy among orthologs. New gene models were validated by RT-PCR (a few examples) or against previously published experimental evidence (ApiLoc).
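The grouping step described above can be illustrated with a minimal Python sketch. It is not the authors' pipeline: the protein and group identifiers, the scores and both cutoff values are hypothetical, and the sketch assumes only that each protein has one orthologous group assignment and one numeric signal peptide score. A group is labelled Positive, Negative or Mixed according to whether all, none or some of its members reach the cutoff.

```python
from collections import defaultdict


def classify_groups(group_of, score, cutoff):
    """Label each orthologous group as Positive, Negative or Mixed.

    group_of: protein ID -> orthologous group ID (hypothetical IDs)
    score:    protein ID -> signal peptide score (hypothetical values)
    cutoff:   score at or above which a signal peptide is called
    """
    calls = defaultdict(set)
    for protein, group in group_of.items():
        if protein in score:  # ignore proteins without a prediction
            calls[group].add(score[protein] >= cutoff)

    labels = {}
    for group, seen in calls.items():
        if seen == {True}:
            labels[group] = "Positive"
        elif seen == {False}:
            labels[group] = "Negative"
        else:
            labels[group] = "Mixed"
    return labels


# Hypothetical toy data: three groups, seven proteins.
group_of = {
    "pf_1": "OG_A", "pv_1": "OG_A", "pk_1": "OG_A",
    "pf_2": "OG_B", "pv_2": "OG_B",
    "pb_3": "OG_C", "py_3": "OG_C",
}
score = {
    "pf_1": 0.80, "pv_1": 0.75, "pk_1": 0.42,
    "pf_2": 0.10, "pv_2": 0.05,
    "pb_3": 0.90, "py_3": 0.60,
}

default_labels = classify_groups(group_of, score, cutoff=0.5)
relaxed_labels = classify_groups(group_of, score, cutoff=0.4)

# Mixed groups are the candidates for visual inspection.
mixed = sorted(g for g, label in default_labels.items() if label == "Mixed")
print(mixed)
print(relaxed_labels["OG_A"])
```

In this toy data, OG_A is Mixed at the stricter cutoff only because one member scores just below it, so relaxing the cutoff turns the group Positive. This mirrors the idea of adjusting prediction thresholds to remove discrepancies that come from the predictor rather than from the gene model.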
The rate of misannotated proteins was significantly higher in Mixed groups than in Positive or Negative groups, corroborating the proposed hypothesis. A total of 478 proteins were reannotated, and a change of signal peptide prediction from negative to positive was the most common outcome. Reannotation triggered the conversion of almost 50% of all Mixed groups, and their number was further reduced by optimization of signal peptide prediction parameters.
The methodological novelty proposed here, combining orthology and signal peptide prediction, proved to be an effective strategy for identifying proteins with incorrectly annotated N-terminal sequences. As demonstrated for five Plasmodium species, it may have an important impact on the data available for genome-wide searches for potential vaccine and drug targets and for proteins involved in host/parasite interactions.
Additional file 1: Examples of N-terminal alignments of inspected Mixed groups. The upper panel shows three Mixed groups (OG4_10598, OG4_10633 and OG4_47034) placed in the category No misannotations after visual inspection. Signal peptide predictions, positive (+) or negative (-), are shown to the left of gene identifiers, demonstrating the Mixed nature of these groups. A total of 111 groups belong to this category. The lower panel shows two Mixed groups (OG4_54958 and OG4_54960) in which putative misannotated proteins were identified after visual inspection. Proteins in these groups were reannotated, and a comparison of alignments before and after reannotation is shown with the respective signal peptide predictions to the left of gene identifiers. A total of 331 groups belong to this category (Reannotated). Reannotated P. vivax genes that were submitted to RT-PCR validation of new gene models are indicated by asterisks. (PDF 3 MB) 12936_2012_2561_MOESM1_ESM.pdf
Additional file 2: Description of PCR conditions used to validate new gene models. Sequences of primers, amplicon sizes, annealing temperatures and number of cycles used in amplifications of seven new gene models are shown. For each gene model, three forward primers (control, before and after) and the same reverse primer were used. (XLS 34 KB) 12936_2012_2561_MOESM2_ESM.xls
Additional file 3: List of all reannotated proteins. Reannotated proteins from each species are listed by Gene ID, together with their orthologous group number, signal peptide prediction before and after reannotation, description of the putative gene product and the new sequence proposed. Species: Pb – Plasmodium berghei, Pf – Plasmodium falciparum, Pk – Plasmodium knowlesi, Pv – Plasmodium vivax, and Py – Plasmodium yoelii. (XLS 401 KB) 12936_2012_2561_MOESM3_ESM.xls
Additional file 4: Reannotated proteins with orthologs validated experimentally based on ApiLoc. Reannotated proteins from orthologous groups that previously showed mixed signal peptide prediction are shown. The information includes the status of each group after protein reannotation, the reannotated protein from each group with its species and signal peptide (SP) prediction status before and after reannotation, and the description of the orthologous proteins experimentally validated based on ApiLoc information. Species: Pb – Plasmodium berghei, Pf – Plasmodium falciparum, Pk – Plasmodium knowlesi, Pv – Plasmodium vivax, and Py – Plasmodium yoelii. (XLS 37 KB) 12936_2012_2561_MOESM4_ESM.xls
Additional file 5: Classification of orthologous groups after protein reannotations and optimization of signal peptide prediction parameters. Each orthologous group is classified according to the signal peptide prediction of its proteins as Positive (all proteins of the group have a predicted signal peptide), Negative (all proteins of the group are predicted to lack a signal peptide) or Mixed (proteins with and without a predicted signal peptide in the same group). The classifications were performed before reannotation, after reannotation and after optimization of signal peptide prediction parameters. The group category assigned after visual inspection distinguishes groups without misannotated proteins (No misannotations), groups with all misannotated proteins corrected (Reannotated) and groups with one or more proteins still misannotated (Partially reannotated). (XLS 82 KB) 12936_2012_2561_MOESM5_ESM.xls
Authors’ original file for figure 1: 12936_2012_2561_MOESM6_ESM.pdf
Authors’ original file for figure 2: 12936_2012_2561_MOESM7_ESM.pdf
Authors’ original file for figure 3: 12936_2012_2561_MOESM8_ESM.pdf
Authors’ original file for figure 4: 12936_2012_2561_MOESM9_ESM.pdf
Authors’ original file for figure 5: 12936_2012_2561_MOESM10_ESM.pdf
Authors’ original file for figure 6: 12936_2012_2561_MOESM11_ESM.pdf
Authors’ original file for figure 7: 12936_2012_2561_MOESM12_ESM.pdf
- Improving N-terminal protein annotation of Plasmodium species based on signal peptide prediction of orthologous proteins
Armando de Menezes Neto
Denise A Alvarenga
Antônio M Rezende
Sarah S Resende
Ricardo de Souza Ribeiro
Cor JF Fontes
Luzia H Carvalho
Cristiana F Alves de Brito
- BioMed Central