Skip to main content
Erschienen in: Journal of Medical Systems 4/2017

01.04.2017 | Systems-Level Quality Improvement

SCALEUS: Semantic Web Services Integration for Biomedical Applications

verfasst von: Pedro Sernadela, Lorena González-Castro, José Luís Oliveira

Erschienen in: Journal of Medical Systems | Ausgabe 4/2017

Einloggen, um Zugang zu erhalten

Abstract

In recent years, we have witnessed an explosion of biological data resulting largely from the demands of life science research. The vast majority of these data are freely available via diverse bioinformatics platforms, including relational databases and conventional keyword search applications. This type of approach has achieved great results in the last few years, but proved to be unfeasible when information needs to be combined or shared among different and scattered sources. During recent years, many of these data distribution challenges have been solved with the adoption of semantic web. Despite the evident benefits of this technology, its adoption introduced new challenges related with the migration process, from existent systems to the semantic level. To facilitate this transition, we have developed Scaleus, a semantic web migration tool that can be deployed on top of traditional systems in order to bring knowledge, inference rules, and query federation to the existent data. Targeted at the biomedical domain, this web-based platform offers, in a single package, straightforward data integration and semantic web services that help developers and researchers in the creation process of new semantically enhanced information systems. SCALEUS is available as open source at http://​bioinformatics-ua.​github.​io/​scaleus/​.
Literatur
1.
Zurück zum Zitat van Dijk, E.L., Auger, H., Jaszczyszyn, Y., and Thermes, C., Ten years of next-generation sequencing technology. Trends Genet. 30(9):418–426, 2014.CrossRefPubMed van Dijk, E.L., Auger, H., Jaszczyszyn, Y., and Thermes, C., Ten years of next-generation sequencing technology. Trends Genet. 30(9):418–426, 2014.CrossRefPubMed
2.
Zurück zum Zitat Gomez-Cabrero, D., Abugessaisa, I., Maier, D., Teschendorff, A., Merkenschlager, M., Gisel, A., Ballestar, E., Bongcam-Rudloff, E., Conesa, A., and Tegnér, J., Data integration in the era of omics: current and future challenges. BMC Syst. Biol. 8(Suppl 2):I1, 2014.CrossRefPubMedPubMedCentral Gomez-Cabrero, D., Abugessaisa, I., Maier, D., Teschendorff, A., Merkenschlager, M., Gisel, A., Ballestar, E., Bongcam-Rudloff, E., Conesa, A., and Tegnér, J., Data integration in the era of omics: current and future challenges. BMC Syst. Biol. 8(Suppl 2):I1, 2014.CrossRefPubMedPubMedCentral
3.
Zurück zum Zitat Berners-Lee, T., Hendler, J., and Lassila, O., The semantic web. Sci. Am. 284(5):28–37, 2001.CrossRef Berners-Lee, T., Hendler, J., and Lassila, O., The semantic web. Sci. Am. 284(5):28–37, 2001.CrossRef
4.
Zurück zum Zitat Passin, T., Explorer’s guide to the semantic web, 2004. Passin, T., Explorer’s guide to the semantic web, 2004.
5.
Zurück zum Zitat Machado, C.M., Rebholz-Schuhmann, D., Freitas, A.T., and Couto, F.M., The semantic web in translational medicine: current applications and future directions. Brief. Bioinform. 16(1):89–103, 2015.CrossRefPubMed Machado, C.M., Rebholz-Schuhmann, D., Freitas, A.T., and Couto, F.M., The semantic web in translational medicine: current applications and future directions. Brief. Bioinform. 16(1):89–103, 2015.CrossRefPubMed
6.
Zurück zum Zitat Belleau, F., Nolin, M.-A., Tourigny, N., Rigault, P., and Morissette, J., Bio2RDF: towards a mashup to build bioinformatics knowledge systems. J. Biomed. Inform. 41(5):706–716, 2008.CrossRefPubMed Belleau, F., Nolin, M.-A., Tourigny, N., Rigault, P., and Morissette, J., Bio2RDF: towards a mashup to build bioinformatics knowledge systems. J. Biomed. Inform. 41(5):706–716, 2008.CrossRefPubMed
7.
Zurück zum Zitat Jupp, S., Malone, J., Bolleman, J., Brandizi, M., Davies, M., Garcia, L., Gaulton, A., Gehant, S., Laibe, C., and Redaschi, N., The EBI RDF platform: linked open data for the life sciences. Bioinformatics. 30(9):1338–1339, 2014.CrossRefPubMedPubMedCentral Jupp, S., Malone, J., Bolleman, J., Brandizi, M., Davies, M., Garcia, L., Gaulton, A., Gehant, S., Laibe, C., and Redaschi, N., The EBI RDF platform: linked open data for the life sciences. Bioinformatics. 30(9):1338–1339, 2014.CrossRefPubMedPubMedCentral
8.
Zurück zum Zitat Sernadela, P., Lopes, P., and Oliveira, J.L., A knowledge federation architecture for rare disease patient registries and biobanks. J. Inf. Syst. Eng. Manag. 1(1):83–90, 2016. Sernadela, P., Lopes, P., and Oliveira, J.L., A knowledge federation architecture for rare disease patient registries and biobanks. J. Inf. Syst. Eng. Manag. 1(1):83–90, 2016.
9.
Zurück zum Zitat Freitas, A., Curry, E., Oliveira, J.G., and O’Riain, S., Querying heterogeneous datasets on the linked data web: challenges, approaches, and trends. IEEE Internet Comput. 16(1):24–33, 2012.CrossRef Freitas, A., Curry, E., Oliveira, J.G., and O’Riain, S., Querying heterogeneous datasets on the linked data web: challenges, approaches, and trends. IEEE Internet Comput. 16(1):24–33, 2012.CrossRef
10.
Zurück zum Zitat J. Pathak, R. Kiefer, and C. Chute, Using semantic web technologies for cohort identification from electronic health records for clinical research. AMIA Summits Transl. Sci. Proc. 2012, 2012. J. Pathak, R. Kiefer, and C. Chute, Using semantic web technologies for cohort identification from electronic health records for clinical research. AMIA Summits Transl. Sci. Proc. 2012, 2012.
11.
Zurück zum Zitat C. David, C. Olivier, and B. Guillaume, A survey of RDF storage approaches. ARIMA J., 2012. C. David, C. Olivier, and B. Guillaume, A survey of RDF storage approaches. ARIMA J., 2012.
12.
Zurück zum Zitat O. Erling, Virtuoso, a hybrid RDBMS/graph column store. IEEE Data Eng. Bull., 2012. O. Erling, Virtuoso, a hybrid RDBMS/graph column store. IEEE Data Eng. Bull., 2012.
14.
Zurück zum Zitat Broekstra, J., Kampman, A., and Van Harmelen, F., Sesame: A generic architecture for storing and querying rdf and rdf schema. Semant. Web — ISWC 2002. 2342:54–68, 2002.CrossRef Broekstra, J., Kampman, A., and Van Harmelen, F., Sesame: A generic architecture for storing and querying rdf and rdf schema. Semant. Web — ISWC 2002. 2342:54–68, 2002.CrossRef
15.
Zurück zum Zitat Aasman, J., Allegro graph: RDF triple database. Franz Inc., Cid. Oakl, 2006. Aasman, J., Allegro graph: RDF triple database. Franz Inc., Cid. Oakl, 2006.
16.
Zurück zum Zitat G. E. Modoni, M. Sacco, and W. Terkaj, A survey of RDF store solutions. In 2014 International conference on engineering, Technology and Innovation: Engineering Responsible Innovation in Products and Services, ICE 2014, 2014. G. E. Modoni, M. Sacco, and W. Terkaj, A survey of RDF store solutions. In 2014 International conference on engineering, Technology and Innovation: Engineering Responsible Innovation in Products and Services, ICE 2014, 2014.
17.
Zurück zum Zitat Gurupur, V.P., and Tanik, M.M., A system for building clinical research applications using semantic web-based approach. J. Med. Syst. 36(1):53–59, 2012.CrossRefPubMed Gurupur, V.P., and Tanik, M.M., A system for building clinical research applications using semantic web-based approach. J. Med. Syst. 36(1):53–59, 2012.CrossRefPubMed
18.
Zurück zum Zitat E. Mezghani, E. Exposito, K. Drira, M. Da Silveira, and C. Pruski, A Semantic Big Data Platform for Integrating Heterogeneous Wearable Data in Healthcare. J. Med. Syst., vol. 39, no. 12, p. 185, 2015. E. Mezghani, E. Exposito, K. Drira, M. Da Silveira, and C. Pruski, A Semantic Big Data Platform for Integrating Heterogeneous Wearable Data in Healthcare. J. Med. Syst., vol. 39, no. 12, p. 185, 2015.
19.
Zurück zum Zitat C. Bizer, T. Heath, and T. Berners-Lee, Linked data-the story so far. Int. J. Semant. Web Inf. Syst., 2009. C. Bizer, T. Heath, and T. Berners-Lee, Linked data-the story so far. Int. J. Semant. Web Inf. Syst., 2009.
20.
Zurück zum Zitat S. Schenk, P. Gearon, and A. Passant, SPARQL 1.1 Update. World Wide Web Consort., 2010. S. Schenk, P. Gearon, and A. Passant, SPARQL 1.1 Update. World Wide Web Consort., 2010.
21.
Zurück zum Zitat S. Harris, A. Seaborne, and E. Prud’hommeaux, SPARQL 1.1 query language. W3C Recomm. 21, 2013. S. Harris, A. Seaborne, and E. Prud’hommeaux, SPARQL 1.1 query language. W3C Recomm. 21, 2013.
22.
Zurück zum Zitat C. Forgy, Rete: a fast algorithm for the many pattern/many object pattern match problem. Artif. Intell. 1982. C. Forgy, Rete: a fast algorithm for the many pattern/many object pattern match problem. Artif. Intell. 1982.
23.
Zurück zum Zitat Weibel, S., The Dublin Core: a simple content description model for electronic resources. Bull. Am. Soc. Inf. Sci. Technol. 24(1):9–11, 1997.CrossRef Weibel, S., The Dublin Core: a simple content description model for electronic resources. Bull. Am. Soc. Inf. Sci. Technol. 24(1):9–11, 1997.CrossRef
24.
Zurück zum Zitat R. Thompson, L. Johnston, D. Taruscio, L. Monaco, C. Béroud, I. G. Gut, M. G. Hansson, P.-B. A. ‘t Hoen, G. P. Patrinos, H. Dawkins, M. Ensini, K. Zatloukal, D. Koubi, E. Heslop, J. E. Paschall, M. Posada, P. N. Robinson, K. Bushby, and H. Lochmüller, “RD-Connect: An Integrated Platform Connecting Databases, Registries, Biobanks and Clinical Bioinformatics for Rare Disease Research. J. Gen. Intern. Med. 2014. R. Thompson, L. Johnston, D. Taruscio, L. Monaco, C. Béroud, I. G. Gut, M. G. Hansson, P.-B. A. ‘t Hoen, G. P. Patrinos, H. Dawkins, M. Ensini, K. Zatloukal, D. Koubi, E. Heslop, J. E. Paschall, M. Posada, P. N. Robinson, K. Bushby, and H. Lochmüller, “RD-Connect: An Integrated Platform Connecting Databases, Registries, Biobanks and Clinical Bioinformatics for Rare Disease Research. J. Gen. Intern. Med. 2014.
25.
Zurück zum Zitat Zollino, M., Ponzi, E., Gobbi, G., and Neri, G., The ring 14 syndrome. Eur. J. Med. Genet. 55(5):374–380, 2012.CrossRefPubMed Zollino, M., Ponzi, E., Gobbi, G., and Neri, G., The ring 14 syndrome. Eur. J. Med. Genet. 55(5):374–380, 2012.CrossRefPubMed
26.
Zurück zum Zitat Wilkinson, M.D., Dumontier, M., Aalbersberg, I.J., Appleton, G., Axton, M., Baak, A., Blomberg, N., Boiten, J.-W., da Silva Santos, L.B., Bourne, P.E., Bouwman, J., Brookes, A.J., Clark, T., Crosas, M., Dillo, I., Dumon, O., Edmunds, S., Evelo, C.T., Finkers, R., Gonzalez-Beltran, A., Gray, A.J.G., Groth, P., Goble, C., Grethe, J.S., Heringa, J., ‘t Hoen, P.A.T., Hooft, R., Kuhn, T., Kok, R., Kok, J., Lusher, S.J., Martone, M.E., Mons, A., Packer, A.L., Persson, B., Rocca-Serra, P., Roos, M., van Schaik, R., Sansone, S.-A., Schultes, E., Sengstag, T., Slater, T., Strawn, G., Swertz, M.A., Thompson, M., van der Lei, J., van Mulligen, E., Velterop, J., Waagmeester, A., Wittenburg, P., Wolstencroft, K., Zhao, J., and Mons, B., The FAIR Guiding Principles for scientific data management and stewardship. Sci. Data. 3:160018, 2016. Wilkinson, M.D., Dumontier, M., Aalbersberg, I.J., Appleton, G., Axton, M., Baak, A., Blomberg, N., Boiten, J.-W., da Silva Santos, L.B., Bourne, P.E., Bouwman, J., Brookes, A.J., Clark, T., Crosas, M., Dillo, I., Dumon, O., Edmunds, S., Evelo, C.T., Finkers, R., Gonzalez-Beltran, A., Gray, A.J.G., Groth, P., Goble, C., Grethe, J.S., Heringa, J., ‘t Hoen, P.A.T., Hooft, R., Kuhn, T., Kok, R., Kok, J., Lusher, S.J., Martone, M.E., Mons, A., Packer, A.L., Persson, B., Rocca-Serra, P., Roos, M., van Schaik, R., Sansone, S.-A., Schultes, E., Sengstag, T., Slater, T., Strawn, G., Swertz, M.A., Thompson, M., van der Lei, J., van Mulligen, E., Velterop, J., Waagmeester, A., Wittenburg, P., Wolstencroft, K., Zhao, J., and Mons, B., The FAIR Guiding Principles for scientific data management and stewardship. Sci. Data. 3:160018, 2016.
27.
Zurück zum Zitat Rath, A., Olry, A., Dhombres, F., Brandt, M.M., Urbero, B., and Ayme, S., Representation of rare diseases in health information systems: the orphanet approach to serve a wide range of end users. Hum. Mutat. 33(5):803–808, 2012.CrossRefPubMed Rath, A., Olry, A., Dhombres, F., Brandt, M.M., Urbero, B., and Ayme, S., Representation of rare diseases in health information systems: the orphanet approach to serve a wide range of end users. Hum. Mutat. 33(5):803–808, 2012.CrossRefPubMed
28.
Zurück zum Zitat Barrell, D., Dimmer, E., Huntley, R.P., Binns, D., O’Donovan, C., and Apweiler, R., The GOA database in 2009--an integrated Gene Ontology Annotation resource. Nucleic Acids Res. 37(Database):D396–D403, 2009.CrossRefPubMed Barrell, D., Dimmer, E., Huntley, R.P., Binns, D., O’Donovan, C., and Apweiler, R., The GOA database in 2009--an integrated Gene Ontology Annotation resource. Nucleic Acids Res. 37(Database):D396–D403, 2009.CrossRefPubMed
29.
Zurück zum Zitat Ashburner, M., Ball, C.A., Blake, J.A., Botstein, D., Butler, H., Cherry, J.M., Davis, A.P., Dolinski, K., Dwight, S.S., Eppig, J.T., Harris, M.A., Hill, D.P., Issel-Tarver, L., Kasarskis, A., Lewis, S., Matese, J.C., Richardson, J.E., Ringwald, M., Rubin, G.M., and Sherlock, G., Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat. Genet. 25(1):25–29, May 2000.PubMed Ashburner, M., Ball, C.A., Blake, J.A., Botstein, D., Butler, H., Cherry, J.M., Davis, A.P., Dolinski, K., Dwight, S.S., Eppig, J.T., Harris, M.A., Hill, D.P., Issel-Tarver, L., Kasarskis, A., Lewis, S., Matese, J.C., Richardson, J.E., Ringwald, M., Rubin, G.M., and Sherlock, G., Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat. Genet. 25(1):25–29, May 2000.PubMed
30.
Zurück zum Zitat Bairoch, A., Apweiler, R., Wu, C.H., Barker, W.C., Boeckmann, B., Ferro, S., Gasteiger, E., Huang, H., Lopez, R., Magrane, M., Martin, M.J., Natale, D.A., O’Donovan, C., Redaschi, N., and Yeh, L.-S.L., The Universal Protein Resource (UniProt). Nucleic Acids Res. 33(suppl_1):D154–D159, 2005.PubMed Bairoch, A., Apweiler, R., Wu, C.H., Barker, W.C., Boeckmann, B., Ferro, S., Gasteiger, E., Huang, H., Lopez, R., Magrane, M., Martin, M.J., Natale, D.A., O’Donovan, C., Redaschi, N., and Yeh, L.-S.L., The Universal Protein Resource (UniProt). Nucleic Acids Res. 33(suppl_1):D154–D159, 2005.PubMed
31.
Zurück zum Zitat Kersey, P.J., Duarte, J., Williams, A., Karavidopoulou, Y., Birney, E., and Apweiler, R., The international protein index: an integrated database for proteomics experiments. Proteomics. 4(7):1985–1988, 2004.CrossRefPubMed Kersey, P.J., Duarte, J., Williams, A., Karavidopoulou, Y., Birney, E., and Apweiler, R., The international protein index: an integrated database for proteomics experiments. Proteomics. 4(7):1985–1988, 2004.CrossRefPubMed
32.
Zurück zum Zitat C. Bizer and R. Cyganiak, D2r server-publishing relational databases on the semantic web. 5th Int. Semant. Web Conf., 2006. C. Bizer and R. Cyganiak, D2r server-publishing relational databases on the semantic web. 5th Int. Semant. Web Conf., 2006.
33.
Zurück zum Zitat Pang, C., Sollie, A., Sijtsma, A., Hendriksen, D., Charbon, B., de Haan, M., de Boer, T., Kelpin, F., Jetten, J., van der Velde, J.K., Smidt, N., Sijmons, R., Hillege, H., and Swertz, M.A., SORTA: a system for ontology-based re-coding and technical annotation of biomedical phenotype data. Database. 2015:bav089, 2015.CrossRefPubMedPubMedCentral Pang, C., Sollie, A., Sijtsma, A., Hendriksen, D., Charbon, B., de Haan, M., de Boer, T., Kelpin, F., Jetten, J., van der Velde, J.K., Smidt, N., Sijmons, R., Hillege, H., and Swertz, M.A., SORTA: a system for ontology-based re-coding and technical annotation of biomedical phenotype data. Database. 2015:bav089, 2015.CrossRefPubMedPubMedCentral
34.
Zurück zum Zitat Campos, D., Lourenco, J., Matos, S., and Oliveira, J.L., Egas: a collaborative and interactive document curation platform. Database. 2014, 2014. Campos, D., Lourenco, J., Matos, S., and Oliveira, J.L., Egas: a collaborative and interactive document curation platform. Database. 2014, 2014.
Metadaten
Titel
SCALEUS: Semantic Web Services Integration for Biomedical Applications
verfasst von
Pedro Sernadela
Lorena González-Castro
José Luís Oliveira
Publikationsdatum
01.04.2017
Verlag
Springer US
Erschienen in
Journal of Medical Systems / Ausgabe 4/2017
Print ISSN: 0148-5598
Elektronische ISSN: 1573-689X
DOI
https://doi.org/10.1007/s10916-017-0705-8

Weitere Artikel der Ausgabe 4/2017

Journal of Medical Systems 4/2017 Zur Ausgabe

Systems-Level Quality Improvement

Data Mining in HIV-AIDS Surveillance System