Using EMBL-EBI Services via Web Interface and Programmatically via Web Services.

REST SOAP bioinformatics resources bioinformatics tools programmatic access web services workflows

Journal

Current protocols
ISSN: 2691-1299
Titre abrégé: Curr Protoc
Pays: United States
ID NLM: 101773894

Informations de publication

Date de publication:
Jun 2024
Historique:
medline: 10 6 2024
pubmed: 10 6 2024
entrez: 10 6 2024
Statut: ppublish

Résumé

The European Bioinformatics Institute (EMBL-EBI)'s Job Dispatcher framework provides access to a wide range of core databases and analysis tools that are of key importance in bioinformatics. As well as providing web interfaces to these resources, web services are available using REST and SOAP protocols that enable programmatic access and allow their integration into other applications and analytical workflows and pipelines. This article describes the various options available to researchers and bioinformaticians who would like to use our resources via the web interface employing RESTful web services clients provided in Perl, Python, and Java or who would like to use Docker containers to integrate the resources into analysis pipelines and workflows. © 2024 The Authors. Current Protocols published by Wiley Periodicals LLC. Basic Protocol 1: Retrieving data from EMBL-EBI using Dbfetch via the web interface Alternate Protocol 1: Retrieving data from EMBL-EBI using WSDbfetch via the REST interface Alternate Protocol 2: Retrieving data from EMBL-EBI using Dbfetch via RESTful web services with Python client Support Protocol 1: Installing Python REST web services clients Basic Protocol 2: Sequence similarity search using FASTA search via the web interface Alternate Protocol 3: Sequence similarity search using FASTA via RESTful web services with Perl client Support Protocol 2: Installing Perl REST web services clients Basic Protocol 3: Sequence similarity search using NCBI BLAST+ RESTful web services with Python client Basic Protocol 4: Sequence similarity search using HMMER3 phmmer REST web services with Perl client and Docker Support Protocol 3: Installing Docker and running the EMBL-EBI client container Basic Protocol 5: Protein functional analysis using InterProScan 5 RESTful web services with the Python client and Docker Alternate Protocol 4: Protein functional analysis using InterProScan 5 RESTful web services with the Java client Support Protocol 4: Installing Java web services clients Basic Protocol 6: Multiple sequence alignment using Clustal Omega via web interface Alternate Protocol 5: Multiple sequence alignment using Clustal Omega with Perl client and Docker Support Protocol 5: Exploring the RESTful API with OpenAPI User Inferface.

Identifiants

pubmed: 38857087
doi: 10.1002/cpz1.1065
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

e1065

Informations de copyright

© 2024 The Authors. Current Protocols published by Wiley Periodicals LLC.

Références

Altschul, S. F., Madden, T. L., Schäffer, A. A., Zhang, J., Zhang, Z., Miller, W., & Lipman, D. J. (1997). Gapped BLAST and PSI‐BLAST: A new generation of protein database search programs. Nucleic Acids Research, 25(17), 3389–3402. https://doi.org/10.1093/nar/25.17.3389
Bairoch, A., Boeckmann, B., Ferro, S., & Gasteiger, E. (2004). Swiss‐Prot: Juggling between evolution and stability. Briefings in Bioinformatics, 5, 39–55. https://doi.org/10.1093/bib/5.1.39
Benson, D. A., Cavanaugh, M., Clark, K., Karsch‐Mizrachi, I., Lipman, D. J., Ostell, J., & Sayers, E. W. (2017). GenBank. Nucleic Acids Research, 45, D37–D42. https://doi.org/10.1093/nar/gkw1070
Camacho, C., Coulouris, G., Avagyan, V., Ma, N., Papadopoulos, J., Bealer, K., & Madden, T. L. (2009). BLAST+: Architecture and applications. BMC Bioinformatics, 10, 421. https://doi.org/10.1186/1471‐2105‐10‐421
Cherry, J. M., Hong, E. L., Amundsen, C., Balakrishnan, R., Binkley, G., Chan, E. T., Christie, K. R., Costanzo, M. C., Dwight, S. S., Engel, S. R., Fisk, D. G., Hirschman, J. E., Hitz, B. C., Karra, K., Krieger, C. J., Miyasato, S. R., Nash, R. S., Park, J., Skrzypek, M. S., … Wong, E. D. (2012). Saccharomyces genome database: The genomics resource of budding yeast. Nucleic Acids Research, 40, D&00–D705. https://doi.org/10.1093/nar/gkr1029
Chojnacki, S., Cowley, A., Lee, J., Foix, A., & Lopez, R. (2017). Programmatic access to bioinformatics tools from EMBL‐EBI update: 2017. Nucleic Acids Research, 45, W550–W553. https://doi.org/10.1093/nar/gkx273
Davis, P., Zarowiecki, M., Arnaboldi, V., Becerra, A., Cain, S., Chan, J., Chen, W. J., Cho, J., da Veiga Beltrame, E., Diamantakis, S., Gao, S., Grigoriadis, D., Grove, C. A., Harris, T. W., Kishore, R., Le, T., Lee, R. Y. N., Luypaert, M., Müller, H. M., … Sternberg, P. W. (2022). WormBase in 2022‐data, processes, and tools for analyzing Caenorhabditis elegans. Genetics, 220(4), iyac003. https://doi.org/10.1093/genetics/iyac003
Eddy, S. R. (1998). Profile hidden Markov models. Bioinformatics, 14, 755–763. https://doi.org/10.1093/bioinformatics/14.9.755
Edman, P., Högfeldt, E., Sillén, L. G., & Kinell, P.‐O. (1950). Method for determination of the amino acid sequence in peptides. Acta Chemica Scandinavica, 4, 283–293. https://doi.org/10.3891/acta.chem.scand.04‐0283
Franklin, R. E. (1956). Structure of tobacco mosaic virus: Location of the ribonucleic acid in the tobacco mosaic virus particle. Nature, 177, 928–930. https://doi.org/10.1038/177928b0
Gramates, L. S., Marygold, S. J., Santos, G. D., Urbano, J. M., Antonazzo, G., Matthews, B. B., Rey, A. J., Tabone, C. J., Crosby, M. A., Emmert, D. B., Falls, K., Goodman, J. L., Hu, Y., Ponting, L., Schroeder, A. J., Strelets, V. B., Thurmond, J., Zhou, P., & the FlyBase Consortium. (2017). FlyBase at 25: Looking to the future. Nucleic Acids Research, 45, D663–D671. https://doi.org/10.1093/nar/gkw1016
Hernandez, P., Müller, M., & Appel, R. D. (2006). Automated protein identification by tandem mass spectrometry: Issues and strategies. Mass Spectrometry Reviews, 25, 235–254. https://doi.org/10.1002/mas.20068
Jones, P., Binns, D., Chang, H. Y., Fraser, M., Li, W., McAnulla, C., McWilliam, H., Maslen, J., Mitchell, A., Nuka, G., Pesseat, S., Quinn, A. F., Sangrador‐Vegas, A., Scheremetjew, M., Yong, S. Y., Lopez, R., & Hunter, S. (2014). InterProScan 5: Genome‐scale protein function classification. Bioinformatics, 30, 1236–1240. https://doi.org/10.1093/bioinformatics/btu031
Kersey, P. J., Allen, J. E., Allot, A., Barba, M., Boddu, S., Bolt, B. J., Carvalho‐Silva, D., Christensen, M., Davis, P., Grabmueller, C., Kumar, N., Liu, Z., Maurel, T., Moore, B., McDowall, M. D., Maheswari, U., Naamati, G., Newman, V., Ong, C. K., … Yates, A. (2018). Ensembl Genomes 2018: An integrated omics infrastructure for non‐vertebrate species. Nucleic Acids Research, 46, D802–D808. https://doi.org/10.1093/nar/gkx1011
Kodama, Y., Mashima, J., Kosuge, T., Kaminuma, E., Ogasawara, O., Okubo, K., Nakamura, Y., & Takagi, T. (2018). DNA data bank of Japan: 30th anniversary. Nucleic Acids Research, 46, D30–D35. https://doi.org/10.1093/nar/gkx926
Ladunga, I. (2002). Finding homologs to nucleotide sequences using network BLAST searches. Current Protocols in Bioinformatics, 00, 3.3.1–3.3.25. https://doi.org/10.1002/0471250953.bi0303s00
Larkin, A., Marygold, S. J., Antonazzo, G., Attrill, H., Dos Santos, G., Garapati, P. V., Goodman, J. L., Gramates, L. S., Millburn, G., Strelets, V. B., Tabone, C. J., Thurmond, J., & FlyBase Consortium (2020). FlyBase: Updates to the Drosophila melanogaster knowledge base. Nucleic Acids Research, 49, D899–D907. https://doi.org/10.1093/nar/gkaa1026
Lee, R. Y. N., Howe, K. L., Harris, T. W., Arnaboldi, V., Cain, S., Chan, J., Chen, W. J., Davis, P., Gao, S., Grove, C., Kishore, R., Muller, H. M., Nakamura, C., Nuin, P., Paulini, M., Raciti, D., Rodgers, F., Russell, M., Schindelman, G., … Sternberg, P. W. (2018). WormBase 2017: Molting into a new stage. Nucleic Acids Research, 46, D869–D874. https://doi.org/10.1093/nar/gkx998
Li, W., Cowley, A., Uludag, M., Gur, T., McWilliam, H., Squizzato, S., Park, Y. M., Buso, N., & Lopez, R. (2015). The EMBL‐EBI bioinformatics web and programmatic tools framework. Nucleic Acids Research, 43, W580–W584. https://doi.org/10.1093/nar/gkv279
Lopez, R., Duggan, K., Harte, N., & Kibria, A. (2003). Public services from the European Bioinformatics Institute. Briefings in Bioinformatics, 4, 332–340. https://doi.org/10.1093/bib/4.4.332
Madeira, F., Madhusoodanan, N., Lee, J., Eusebi, A., Niewielska, A., Tivey, A. R. N., Lopez, R., & Butcher, S. (2024). The EMBL‐EBI Job Dispatcher sequence analysis tools framework in 2024. Nucleic Acids Research, gkae241. https://doi.org/10.1093/nar/gkac241
Madeira, F., Park, Y. M., Lee, J., Buso, N., Gur, T., Madhusoodanan, N., Basutkar, P., Tivey, A. R. N., Potter, S. C., Finn, R. D., & Lopez, R. (2019). The EMBL‐EBI search and sequence analysis tools APIs in 2019. Nucleic Acids Research, 47, W597–600. https://doi.org/10.1093/nar/gkz268
Madeira, F., Pearce, M., Tivey, A. R. N., Basutkar, P., Lee, J., Edbali, O., Madhusoodanan, N., Kolesnikov, A., & Lopez, R. (2022). Search and sequence analysis tools services from EMBL‐EBI in 2022. Nucleic Acids Research, 50, W276–W279. https://doi.org/10.1093/nar/gkac240
Martin, F. J., Amode, M. R., Aneja, A., Austine‐Orimoloye, O., Azov, A. G., Barnes, I., Becker, A., Bennett, R., Berry, A., Bhai, J., Bhurji, S. K., Bignell, A., Boddu, S., Branco Lins, P. R., Brooks, L., Ramaraju, S. B., Charkhchi, M., Cockburn, A., Da Rin Fiorretto, L., … Flicek, P. (2023). Ensembl 2023. Nucleic Acids Research, 51, D933–D941. https://doi.org/10.1093/nar/gkac958
McWilliam, H., Li, W., Uludag, M., Squizzato, S., Park, Y. M., Buso, N., Cowley, A. P., & Lopez, R. (2013). Analysis tool web services from the EMBL‐EBI. Nucleic Acids Research, 41, W597–600. https://doi.org/10.1093/nar/gkt376
McWilliam, H., Valentin, F., Goujon, M., Li, W., Narayanasamy, M., Martin, J., Miyar, T., & Lopez, R. (2009). Web services at the European Bioinformatics Institute‐2009. Nucleic Acids Research, 37, W6–W10. https://doi.org/10.1093/nar/gkp302
Mulder, N. J., & Apweiler, R. (2003). The InterPro database and tools for protein domain analysis. Current Protocols in Bioinformatics, 2, 2.7.1–2.7.19. https://doi.org/10.1002/0471250953.bi0207s02
Park, Y. M., Squizzato, S., Buso, N., Gur, T., & Lopez, R. (2017). The EBI search engine: EBI search as a service—Making biological data accessible for all. Nucleic Acids Research, 45, W545–W549. https://doi.org/10.1093/nar/gkx359
Pearson, W. R. (2016). Finding protein and nucleotide similarities with FASTA. Current Protocols in Bioinformatics, 53, 3.9.1–3.9.25. https://doi.org/10.1002/0471250953.bi0309s53
Pearson, W. R., & Lipman, D. J. (1988). Improved tools for biological sequence comparison. Proceedings of the National Academy of Sciences, 85, 2444–2448. https://doi.org/10.1073/pnas.85.8.2444
Pettersson, E., Lundeberg, J., & Ahmadian, A. (2009). Generations of sequencing technologies. Genomics, 93, 105–111. https://doi.org/10.1016/j.ygeno.2008.10.003
Potter, S. C., Luciani, A., Eddy, S. R., Park, Y., Lopez, R., & Finn, R. D. (2018). HMMER web server: 2018 update. Nucleic Acids Research, 46, W200–W204. https://doi.org/10.1093/nar/gky448
Roberts, R. J., & Murray, K. (1976). Restriction endonuclease. Critical Reviews in Biochemistry and Molecular Biology, 4, 123–164. https://doi.org/10.3109/10409237609105456
Sanger, F., & Coulson, A. R. (1975). A rapid method for determining sequences in DNA by primed synthesis with DNA polymerase. Journal of Molecular Biology, 94, 441k–448. https://doi.org/10.1016/0022‐2836(75)90213‐2
Sayers, E. W., Cavanaugh, M., Clark, K., Pruitt, K. D., Sherry, S. T., Yankie, L., & Karsch‐Mizrachi, I. (2024). GenBank 2024 update. Nucleic Acids Research, 52(D1), D134–D137. https://doi.org/10.1093/nar/gkad903
Schwartz, E. M., & Sternberg, P. W. (2004). Searching WormBase for information about Caenorhabditis elegans. Current Protocols in Bioinformatics, 6, 1.8.1–1.8.44. https://doi.org/10.1002/0471250953.bi0108s6
Shank, S. D., Weaver, S., & Pond, S. L. K. (2018). phylotree.js—A JavaScript library for application development and interactive data visualization in phylogenetics. BMC Bioinformatics, 19, 276. https://doi.org/10.1186/s12859‐018‐2283‐2
Sievers, F., & Higgins, D. G. (2014). Clustal Omega. Current Protocols in Bioinformatics, 48, 3.13.1–3.13.16. https://doi.org/10.1002/0471250953.bi0313s48
Sievers, F., & Higgins, D. G. (2018). Clustal Omega for making accurate alignments of many protein sequences. Protein Science, 27, 135–145. https://doi.org/10.1002/pro.3290
Sievers, F., Wilm, A., Dineen, D., Gibson, T. J., Karplus, K., Li, W., Lopez, R., McWilliam, H., Remmert, M., Söding, J., Thompson, J. D., & Higgins, D. G. (2011). Fast, scalable generation of high‐quality protein multiple sequence alignments using Clustal Omega. Molecular Systems Biology, 7, 539. https://doi.org/10.1038/msb.2011.75
Silvester, N., Alako, B., Amid, C., Cerdeño‐Tarrága, A., Clarke, L., Cleland, I., Harrison, P. W., Jayathilaka, S., Kay, S., Keane, T., Leinonen, R., Liu, X., Martínez‐Villacorta, J., Menchi, M., Reddy, K., Pakseresht, N., Rajan, J., Rossello, M., Smirnov, D., … Cochrane, G. (2018). The European nucleotide archive in 2017. Nucleic Acids Research, 46, D36–D40. https://doi.org/10.1093/nar/gkx1125
Skrzypek, M. S., & Hirschman, J. (2011). Using the Saccharomyces Genome Database (SGD) for analysis of genomic information. Current Protocols in Bioinformatics, 35, 1.20.1–1.20.23. https://doi.org/10.1002/0471250953.bi0120s35
Squizzato, S., Park, Y. M., Buso, N., Gur, T., Cowley, A., Li, W., Uludag, M., Pundir, S., Cham, J. A., McWilliam, H., & Lopez, R. (2015). The EBI Search engine: Providing search and retrieval functionality for biological data from EMBL‐EBI. Nucleic Acids Research, 43, W585–W588. https://doi.org/10.1093/nar/gkv316
Tanizawa, Y., Fujisawa, T., Kodama, Y., Kosuge, T., Mashima, J., Tanjo, T., & Nakamura, Y. (2023). DNA Data Bank of Japan (DDBJ) update report 2022. Nucleic Acids Research, 51, D101–D105. https://doi.org/10.1093/nar/gkac1083
UniProt Consortium. (2019). UniProt: A worldwide hub of protein knowledge. Nucleic Acids Research, 47(D1), D506–D515. https://doi.org/10.1093/nar/gky1049
UniProt Consortium. (2023). UniProt: The universal protein knowledgebase in 2023. Nucleic Acids Research, 51, D523–D531. https://doi.org/10.1093/nar/gkac1052
Valentin, F., Squizzato, S., Goujon, M., McWilliam, H., Paern, J., & Lopez, R. (2010). Fast and efficient searching of biological data resources‐using EB‐eye. Briefings in Bioinformatics, 11, 375–384. https://doi.org/10.1093/bib/bbp065
Waterhouse, A. M., Procter, J. B., Martin, D. M. A., Clamp, M., & Barton, G. J. (2009). Jalview version 2‐A multiple sequence alignment editor and analysis workbench. Bioinformatics, 25, 1189–1191. https://doi.org/10.1093/bioinformatics/btp033
Wolfsberg, T. G. (2007). Using the NCBI map viewer to browse genomic sequence data. Current Protocols in Bioinformatics, 16, 1.5.1–1.5.22. https://doi.org/10.1002/0471250953.bi0105s16
Wu, C., & Nebert, D. W. (2004). Update on genome completion and annotations: Protein information resource. Human Genomics, 1, 229–233. https://doi.org/10.1186/1479‐7364‐1‐3‐229
Yuan, D., Ahamed, A., Burgin, J., Cummins, C., Devraj, R., Gueye, K., Gupta, D., Gupta, V., Haseeb, M., Ihsan, M., Ivanov, E., Jayathilaka, S., Kadhirvelu, V. B., Kumar, M., Lathi, A., Leinonen, R., McKinnon, J., Meszaros, L., O'Cathail, C., … Cochrane, G. (2024). The European nucleotide archive in 2023. Nucleic Acids Research, 52(D1), D92–D97. https://doi.org/10.1093/nar/gkad1067
Zerbino, D. R., Achuthan, P., Akanni, W., Amode, M. R., Barrell, D., Bhai, J., Billis, K., Cummins, C., Gall, A., Girón, C. G., Gil, L., Gordon, L., Haggerty, L., Haskell, E., Hourlier, T., Izuogu, O. G., Janacek, S. H., Juettemann, T., To, J. K., … Flicek, P. (2018). Ensembl 2018. Nucleic Acids Research, 46, D754–D761. https://doi.org/10.1093/nar/gkx1098

Auteurs

Fábio Madeira (F)

European Molecular Biology Laboratory-European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, United Kingdom.

Nandana Madhusoodanan (N)

European Molecular Biology Laboratory-European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, United Kingdom.

Joonheung Lee (J)

European Molecular Biology Laboratory-European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, United Kingdom.

Alberto Eusebi (A)

European Molecular Biology Laboratory-European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, United Kingdom.

Ania Niewielska (A)

European Molecular Biology Laboratory-European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, United Kingdom.

Adrian R N Tivey (ARN)

European Molecular Biology Laboratory-European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, United Kingdom.

Stuart Meacham (S)

European Molecular Biology Laboratory-European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, United Kingdom.

Rodrigo Lopez (R)

European Molecular Biology Laboratory-European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, United Kingdom.

Sarah Butcher (S)

European Molecular Biology Laboratory-European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, United Kingdom.

Articles similaires

Selecting optimal software code descriptors-The case of Java.

Yegor Bugayenko, Zamira Kholmatova, Artem Kruglov et al.
1.00
Software Algorithms Programming Languages
Humans Adult Male Female Video Games

Exploring blood-brain barrier passage using atomic weighted vector and machine learning.

Yoan Martínez-López, Paulina Phoobane, Yanaima Jauriga et al.
1.00
Blood-Brain Barrier Machine Learning Humans Support Vector Machine Software
Cephalometry Humans Anatomic Landmarks Software Internet

Classifications MeSH