Frog v1.01.
FRee On line druG conformation generation.

What is Frog ?
Frog is intended to generate 3D for drugs, usually described using a 1D or 2D representation. Frog performs isomer identification from ambiguous compound description. Frog is able to generate multi-conformations per isomer.
Frog is a tool that results from a collaborative work involving several teams at RPBS.
Frog is based on Frowns (http://frowns.sourceforge.net/), a chemoinformatics toolkit written in python, to which several features have been added.
Frog has some limitations (as almost every other package of the field).


c1(nn(c(c1Br)C)CC(Nc1ncccc1C1CCC[NH+]1C)=O)[NH](O)=O ==> #1: c1(nn(c(c1Br)C)CC(Nc1ncccc1[C@H]1CCC[N@H+]1C)=O)[NH](O)=O
#2: c1(nn(c(c1Br)C)CC(Nc1ncccc1[C@H]1CCC[N@@H+]1C)=O)[NH](O)=O
#3: c1(nn(c(c1Br)C)CC(Nc1ncccc1[C@@H]1CCC[N@H+]1C)=O)[NH](O)=O
#4: c1(nn(c(c1Br)C)CC(Nc1ncccc1[C@@H]1CCC[N@@H+]1C)=O)[NH](O)=O


1: isomer1
2: isomer2 3: isomer3 4: isomer4
 
                        

1. History
2. Features
3. Limitations
4. Usage
5. Examples, sample tests
6. Concepts
7. Validation
8. Availability (news since 2009 January)
9. Citations

History:
Features:

1: Weininger, D. SMILES, a Chemical Language and Information System. 1. Introduction to Methodology and Encoding Rules. J. Chem. Inf. Comput. Sci. 1988, 28, 31-36

2: Arthur Dalby et al., J. Chem. Inf. Comput. Sci, 1992, 32, 244-255

Limitations:

3: Halgren T. A.; Merck Molecular Force Field: I. Basis, Form, Scope, Parameterization, and Performance of MMFF94 (490-519), II. MMFF94 van des Waals and Electrostatic Parameters for Intermolecular Interactions (520-552), III. Molecular Geometries and Vibrational Frequencies for MMFF94 (553-586), IV. Conformational Energies and Geometries for MMFF94 (587-615), V. Extension of MMFF94 Using Experimental Data, Additional Computational Data, and Empirical Rules (616-641). J. Comp. Chem., 1996, Vol.17, Nos. 5 & 6


Usage:
Note on conformations / processing:
Using "Quick3D" or "Single", the first conformation not presenting strong steric clashes will be returned.This is intended to provide rapidly a correct 3D geometry of the compound
for one or all its isomers.
Only using "Multi", conformations of low energy are returned. Be aware that this is under current optimisation on two directions: (i) relevance of the lowest energy
conformations (ii) computational speed. At present too large compounds still require important computational time. This is under investigation.

Note on formats:
Smiles files should be on the form of one smiles per line, such as:
O=C(CCCCCCCCCC[C@H]1[C@@H]2[C@@H](c3c(C1)cc(cc3)O)CC[C@@]1([C@@H](CC[C@@H]21)O)C)[N@@](C)CCCC compound_Identifier
O=C(CCCCCCCCCC[C@H]1[C@@H]2[C@@H](c3c(C1)cc(cc3)O)CC[C@@]1([C@@H](CC[C@@H]21)O)C)[N@](C)CCCC  another_compound_identifier
O=C1[C@@H]2[C@H](N[C@H](N1)N)[N@](CCC(CO)CO)[CH]N2 TKinh5_penciclovir_1KI3pdb

The sdf format should be on the form:
sdf description

Examples:

Paste smiles: 
CC(=C(C)C(O)C)F
Select Unambiguate.
Resulting smiles are:

CC(=C(/C)[C@H](O)C)/F
CC(=C(/C)[C@H](O)C)\F
CC(=C(/C)[C@@H](O)C)/F

CC(=C(/C)[C@@H](O)C)\F

A more complex sample test (19 smiles) can be accessed here.
The unambiguation results (38 smiles) can be accessed here.
The results of the quick3D generation (19 compounds) can be accessed here. (mol2 format)
The results of the single generation (33 compounds) can be accessed here. (mol2 format) (the difference between 38 and 33 stands for axial/equatorial conformations possible for some cycles, and the fact that Frog randomly selected a maximum of 8 isomers upon 16 for 1 compound).
The results of the multi generation (10 conformations at max per isomer) can be accessed here. (mol2 format, 275 conformations) (note: for some compounds, less than 10 conformations of low energy were identified).

Some 3D conformations generated using Frog (multiconformations), for which experimental data is available here can be accessed here

Random test upon 992 compounds from Specs, Chembridge and Ambinter, using as values of energetic treshold of 100.0, number of Monte Carlo steps of 100, number of conformations of 10.
The input smiles are here.
The unambiguated smiles (1238) are here.
The mol2 output (12668 conformers) is here.
The log file is here.
Compounds not processed (might not be ADME/Tox compliant) here.
Note: the number of 12668 (i.e. more than 1238 x 10) is due to componds for which axial equatorial conformers have been considered. See for instance compound Chembridge-6439335, 2 smiles to describe the isomers, but 4 conformers considered due to axial/equatorial conformations.

Concepts:

4: Sadowski, J.; Gasteiger, J. From Atoms and Bonds to Three-Dimensional Atomic Coordinates: Automatic Model Builders. Chem. Rev. 1993, 93, 2567-2581

compound.png ==>
graph.png
Compound decomposition as cycles, linkers, appendices.

Validation tests:

Availability:


Citations:

Using Frog, please cite: