Small-Molecule Library Screening by Docking With Pyrx: Sargis Dallakyan and Arthur J. Olson
Small-Molecule Library Screening by Docking With Pyrx: Sargis Dallakyan and Arthur J. Olson
Small-Molecule Library Screening by Docking With Pyrx: Sargis Dallakyan and Arthur J. Olson
Abstract
Virtual molecular screening is used to dock small-molecule libraries to a macromolecule in order to find
lead compounds with desired biological function. This in silico method is well known for its application in
computer-aided drug design. This chapter describes how to perform small-molecule virtual screening by
docking with PyRx, which is open-source software with an intuitive user interface that runs on all major
operating systems (Linux, Windows, and Mac OS). Specific steps for using PyRx, as well as considerations
for data preparation, docking, and data analysis, are also described.
Key words Virtual molecular screening, Computer-aided drug design, Molecular docking, PubChem,
AutoDock, Vina, Open Babel
1 Introduction
Jonathan E. Hempel et al. (eds.), Chemical Biology: Methods and Protocols, Methods in Molecular Biology,
vol. 1263, DOI 10.1007/978-1-4939-2269-7_19, © Springer Science+Business Media New York 2015
243
244 Sargis Dallakyan and Arthur J. Olson
2 Materials
2.1 Hardware 1. Dell Studio 540S with Intel Core 2 Duo CPU at 2.53 GHz,
and Software 4 GB memory (RAM), ATI Radeon HD 3400 series graphics
card, and 32-bit Windows Vista operating system.
2. Binary distribution of PyRx version 0.8 for Windows available
free from http://pyrx.sourceforge.net.
2.2 Input Files To start with structure-based virtual screening, structures of the
target macromolecule and small molecules are needed as input
files. There are a number of publicly available websites where users
can download these input files. Used in this chapter are DrugBank
[7] to get the structure of zanamivir, PubChem [8] for 3D structure
of sucrose, and Protein Data Bank [9] to get 3D structures of
influenza virus neuraminidase and sialic acid.
1. Open a preferred web browser and visit http://www.drug-
bank.ca/drugs/DB00558, click on SDF link next to
Download, and save that page as DB00558.sdf.
2. Go to http://pubchem.ncbi.nlm.nih.gov/summary/summary.
cgi?cid=5988, click SDF icon on top right, and select 3D SDF: Save.
3. Visit http://www.pdb.org/pdb/explore/explore.do?structure
Id=2BAT, click on Download Files, and select PDB File (Text).
The reason for choosing these particular molecules is that they
are familiar to most of the readers and computations can be run
relatively quickly on a PC. To apply the protocol described in
Subheading 3 to other binding target and ligands, users would
need to obtain input files corresponding to their specific binding
target and ligands. Selection of the binding target depends on the
biological problem of interest, and it is assumed that the 3D struc-
ture of the target is available in PDB format through Protein Data
Bank (http://pdb.org) or other sources (see Note 1). Selection of
ligands depends on whether virtual screening is used for lead dis-
covery or lead optimization. For lead discovery, it is advised to
include as many ligands with diverse shapes, sizes, and composition
as possible. Since individual docking computations are indepen-
dent from each other, users are practically only limited by compu-
tational power available at their disposal. For lead optimization, on
the other hand, ligands are selected to closely match the lead com-
pound [10]. One of the advantages of virtual screening is that it is
not limited to commercially available compounds; a ligand file for
a novel compound not found in any of the databases can also be
used (see Note 2).
3 Methods
3.1 Prepare Input Before input files can be used for virtual screening, they must be
Files for Docking converted to the PDBQT file format suitable for docking with
AutoDock Vina [11].
246 Sargis Dallakyan and Arthur J. Olson
3.2 Run Virtual 1. Select Vina Wizard tab under the Controls panel and click on
Screening Using Vina the Start button.
Wizard 2. Select 558_uff_E = 197.68.pdbqt, 5988.pdbqt and SialicAcid.
pdbqt under the Ligands folder (use the Shift key for selecting
multiple ligands).
3. Select 2BAT under the Macromolecules folder and click on the
Forward button on Vina Wizard.
4. Click on the Maximize button under Vina Search Space and
then click on the Forward button. This starts AutoDock Vina
and docks each ligand, one by one, to neuraminidase (2BAT).
It takes less than 20 min to complete this virtual screening on
a PC mentioned in Subheading 2.1 (see Note 4) (Fig. 1).
5. After virtual screening is completed, PyRx automatically
advances to Analyze Results page, where results of virtual
screening computation can be viewed. AutoDock Vina, by
default, outputs the ten best binding modes for each docking
run (see Note 5). Left-click on Binding Affinity (kcal/mol)
table header cell under Analyze Results tab to sort this table by
predicted binding affinity (see Note 6).
248 Sargis Dallakyan and Arthur J. Olson
Fig. 1 A screenshot of the PyRx virtual screening tool. The table on the left lists predicted binding affinity of
zanamivir (2BAT_558_uff_E = 197.68), sialic acid (2BAT_SalicAcid), and sucrose (2BAT_5988) for influenza
virus neuraminidase (2BAT). The 3D scene on the right shows line drawing and transparent molecular surface
of neuraminidase. Ball-and-stick models for zanamivir and sialic acid are also shown on this 3D scene
4 Notes
Acknowledgements
References
1. Ng R (2008) Drugs: from discovery to approval, 9. Berman HM et al (2002) The protein data bank.
2nd edn. Wiley-Blackwell, Hoboken, NJ Acta Crystallogr D Biol Crystallogr 58:899–907
2. Swinney DC, Anthony J (2011) How were 10. Scior T et al (2012) Recognizing pitfalls in
new medicines discovered? Nat Rev Drug virtual screening: a critical review. J Chem Inf
Discov 10:507–519 Model 52:867–881
3. Jacob RB, Andersen T, McDougal OM 11. Trott O, Olson AJ (2010) AutoDock Vina:
(2012) Accessible high-throughput virtual improving the speed and accuracy of docking with
screening molecular docking software for stu- a new scoring function, efficient optimization, and
dents and educators. PLoS Comput Biol multithreading. J Comput Chem 31:455–461
8:e1002499 12. Rappe AK et al (1992) UFF, a full periodic
4. Lindell SD, Pattenden LC, Shannon J (2009) table force field for molecular mechanics and
Combinatorial chemistry in the agrosciences. molecular dynamics simulations. J Am Chem
Bioorg Med Chem 17:4035–4046 Soc 114:10024–10035
5. Varghese JN et al (1992) The structure of the 13. O'Boyle NM et al (2011) Open Babel: an
complex between influenza virus neuramini- open chemical toolbox. J Cheminform 3:33
dase and sialic acid, the viral receptor. Proteins 14. Perryman AL et al (2010) A dynamic model of
14:327–332 HIV integrase inhibition and drug resistance.
6. McKimm-Breschkin JL (2013) Influenza J Mol Biol 397:600–615
neuraminidase inhibitors: antiviral action and 15. Hanwell MD et al (2012) Avogadro: an
mechanisms of resistance. Influenza Other advanced semantic chemical editor, visualiza-
Respir Viruses 7:25–36 tion, and analysis platform. J Cheminform 4:17
7. Knox C et al (2011) DrugBank 3.0: a compre- 16. Muchmore SW et al (2010) Cheminformatic
hensive resource for ‘omics’ research on drugs. tools for medicinal chemists. J Med Chem
Nucleic Acids Res 39:D1035–D1041 53:4830–4841
8. Li Q et al (2010) PubChem as a public 17. Nicholls A (2008) What do we know and when
resource for drug discovery. Drug Discov do we know it? J Comput Aided Mol Des
Today 15:1052–1057 22:239–255