Using Phaser and ensembles to improve the performance of SIMBAD.

SIMBAD contaminants ensembles molecular-replacement pipeline sequence independent structure solution

Journal

Acta crystallographica. Section D, Structural biology
ISSN: 2059-7983
Titre abrégé: Acta Crystallogr D Struct Biol
Pays: United States
ID NLM: 101676043

Informations de publication

Date de publication:
01 Jan 2020
Historique:
received: 16 07 2019
accepted: 06 11 2019
entrez: 8 1 2020
pubmed: 8 1 2020
medline: 11 6 2020
Statut: ppublish

Résumé

The conventional approach to search-model identification in molecular replacement (MR) is to screen a database of known structures using the target sequence. However, this strategy is not always effective, for example when the relationship between sequence and structural similarity fails or when the crystal contents are not those expected. An alternative approach is to identify suitable search models directly from the experimental data. SIMBAD is a sequence-independent MR pipeline that uses either a crystal lattice search or MR functions to directly locate suitable search models from databases. The previous version of SIMBAD used the fast AMoRe rotation-function search. Here, a new version of SIMBAD which makes use of Phaser and its likelihood scoring to improve the sensitivity of the pipeline is presented. It is shown that the additional compute time potentially required by the more sophisticated scoring is counterbalanced by the greater sensitivity, allowing more cases to trigger early-termination criteria, rather than running to completion. Using Phaser solved 17 out of 25 test cases in comparison to the ten solved with AMoRe, and it is shown that use of ensemble search models produces additional performance benefits.

Identifiants

pubmed: 31909738
pii: S2059798319015031
doi: 10.1107/S2059798319015031
pmc: PMC6939438
doi:

Substances chimiques

Proteins 0

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

1-8

Subventions

Organisme : Biotechnology and Biological Sciences Research Council
ID : BB/L009544/1
Pays : United Kingdom

Informations de copyright

open access.

Références

Acta Crystallogr D Biol Crystallogr. 2010 Jan;66(Pt 1):22-5
pubmed: 20057045
Proc Natl Acad Sci U S A. 2010 Dec 14;107(50):21476-81
pubmed: 21098306
Acta Crystallogr D Biol Crystallogr. 2011 Apr;67(Pt 4):303-12
pubmed: 21460448
Acta Crystallogr D Struct Biol. 2018 Jul 1;74(Pt 7):595-605
pubmed: 29968670
Cell. 1992 Mar 20;68(6):1145-62
pubmed: 1547508
PLoS Comput Biol. 2011 Oct;7(10):e1002195
pubmed: 22039361
Acta Crystallogr D Biol Crystallogr. 2010 Feb;66(Pt 2):213-21
pubmed: 20124702
Structure. 1998 Jan 15;6(1):75-88
pubmed: 9493269
Acta Crystallogr D Biol Crystallogr. 2004 Mar;60(Pt 3):432-8
pubmed: 14993666
Acta Crystallogr D Struct Biol. 2018 Mar 1;74(Pt 3):167-182
pubmed: 29533225
Acta Crystallogr A. 1990 Feb 1;46 ( Pt 2):73-82
pubmed: 2180438
Acta Crystallogr D Biol Crystallogr. 2011 Apr;67(Pt 4):355-67
pubmed: 21460454
Acta Crystallogr D Struct Biol. 2016 Oct 1;72(Pt 10):1081-1089
pubmed: 27710929
Acta Crystallogr D Struct Biol. 2018 Feb 1;74(Pt 2):143-151
pubmed: 29533240
Structure. 2000 Nov 15;8(11):R213-20
pubmed: 11080645
Acta Crystallogr D Struct Biol. 2016 Mar;72(Pt 3):375-87
pubmed: 26960124
Acta Crystallogr D Biol Crystallogr. 2008 Jan;64(Pt 1):1-10
pubmed: 18094461
Acta Crystallogr D Biol Crystallogr. 2013 Nov;69(Pt 11):2266-75
pubmed: 24189239
Acta Crystallogr D Biol Crystallogr. 2008 Jan;64(Pt 1):125-32
pubmed: 18094476
Protein Sci. 2003 Sep;12(9):1865-71
pubmed: 12930986
J Mol Biochem. 2012;1(2):76-85
pubmed: 27882309
Acta Crystallogr D Biol Crystallogr. 2015 Feb;71(Pt 2):338-43
pubmed: 25664744
Acta Crystallogr D Struct Biol. 2019 Dec 1;75(Pt 12):1051-1062
pubmed: 31793899
Acta Crystallogr D Biol Crystallogr. 2001 Oct;57(Pt 10):1373-82
pubmed: 11567148
J Appl Crystallogr. 2007 Aug 1;40(Pt 4):658-674
pubmed: 19461840
Acta Crystallogr D Biol Crystallogr. 2004 Dec;60(Pt 12 Pt 1):2169-83
pubmed: 15572770
Acta Crystallogr D Biol Crystallogr. 2013 Nov;69(Pt 11):2209-15
pubmed: 24189232
Acta Crystallogr D Biol Crystallogr. 1993 Nov 1;49(Pt 6):588-91
pubmed: 15299496
J Mol Biol. 2002 Feb 1;315(5):1129-43
pubmed: 11827481
Acta Crystallogr D Biol Crystallogr. 2012 Dec;68(Pt 12):1622-31
pubmed: 23151627

Auteurs

Adam J Simpkin (AJ)

Institute of Integrative Biology, University of Liverpool, Liverpool L69 7ZB, England.

Felix Simkovic (F)

Institute of Integrative Biology, University of Liverpool, Liverpool L69 7ZB, England.

Jens M H Thomas (JMH)

Institute of Integrative Biology, University of Liverpool, Liverpool L69 7ZB, England.

Martin Savko (M)

Synchrotron SOLEIL, L'Orme des Merisiers, BP 48, 91192 Saint Aubin, Gif-sur-Yvette, France.

Andrey Lebedev (A)

STFC, Rutherford Appleton Laboratory, Harwell Oxford, Didcot OX11 0FA, England.

Ville Uski (V)

STFC, Rutherford Appleton Laboratory, Harwell Oxford, Didcot OX11 0FA, England.

Charles C Ballard (CC)

STFC, Rutherford Appleton Laboratory, Harwell Oxford, Didcot OX11 0FA, England.

Marcin Wojdyr (M)

Global Phasing Ltd, Cambridge CB3 0AX, England.

William Shepard (W)

Synchrotron SOLEIL, L'Orme des Merisiers, BP 48, 91192 Saint Aubin, Gif-sur-Yvette, France.

Daniel J Rigden (DJ)

Institute of Integrative Biology, University of Liverpool, Liverpool L69 7ZB, England.

Ronan M Keegan (RM)

Institute of Integrative Biology, University of Liverpool, Liverpool L69 7ZB, England.

Articles similaires

Selecting optimal software code descriptors-The case of Java.

Yegor Bugayenko, Zamira Kholmatova, Artem Kruglov et al.
1.00
Software Algorithms Programming Languages
Databases, Protein Protein Domains Protein Folding Proteins Deep Learning

Exploring blood-brain barrier passage using atomic weighted vector and machine learning.

Yoan Martínez-López, Paulina Phoobane, Yanaima Jauriga et al.
1.00
Blood-Brain Barrier Machine Learning Humans Support Vector Machine Software
alpha-Synuclein Humans Animals Mice Lewy Body Disease

Classifications MeSH