Combining docking pose rank and structure with deep learning improves protein-ligand binding mode prediction
Joseph A. Morrone
and
Jeffrey K. Weber
and
Tien Huynh
and
Heng Luo
and
Wendy D. Cornell
arXiv e-Print archive - 2019 via Local arXiv
Keywords:
q-bio.BM, physics.bio-ph, stat.ML
First published: 2019/10/07 (5 years ago) Abstract: We present a simple, modular graph-based convolutional neural network that
takes structural information from protein-ligand complexes as input to generate
models for activity and binding mode prediction. Complex structures are
generated by a standard docking procedure and fed into a dual-graph
architecture that includes separate sub-networks for the ligand bonded topology
and the ligand-protein contact map. This network division allows contributions
from ligand identity to be distinguished from effects of protein-ligand
interactions on classification. We show, in agreement with recent literature,
that dataset bias drives many of the promising results on virtual screening
that have previously been reported. However, we also show that our neural
network is capable of learning from protein structural information when, as in
the case of binding mode prediction, an unbiased dataset is constructed. We
develop a deep learning model for binding mode prediction that uses docking
ranking as input in combination with docking structures. This strategy mirrors
past consensus models and outperforms the baseline docking program in a variety
of tests, including on cross-docking datasets that mimic real-world docking use
cases. Furthermore, the magnitudes of network predictions serve as reliable
measures of model confidence