-
Notifications
You must be signed in to change notification settings - Fork 0
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
- Loading branch information
GitHub Action
committed
Aug 29, 2024
0 parents
commit 5eb0c67
Showing
73 changed files
with
7,439 additions
and
0 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,4 @@ | ||
# Sphinx build info version 1 | ||
# This file hashes the configuration used when building these files. When it is not found, a full rebuild will be done. | ||
config: 9cf4d0d6805e81865f50473ee73df2e7 | ||
tags: 645f666f9bcd5a90fca523b33c5a78b7 |
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Empty file.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,57 @@ | ||
================= | ||
Functional Groups | ||
================= | ||
|
||
FGUtils provides a class :py:class:`~fgutils.query.FGQuery` to query a | ||
molecules functional groups. | ||
|
||
Functional group tree | ||
===================== | ||
|
||
.. code-block:: | ||
Functional Group Parents Pattern | ||
-------------------------------------------------------------------------- | ||
ether [ROOT] ROR | ||
├── ketal [ether] RC(OR)(OR)R | ||
│ ├── acetal [ketal] RC(OC)(OC)H | ||
│ └── hemiketal [ketal, alcohol] RC(OH)(OR)R | ||
│ └── hemiacetal [hemiketal] RC(OC)(OH)H | ||
├── ester [ketone, ether] RC(=O)OR | ||
│ ├── anhydride [ester] RC(=O)OC(=O)R | ||
│ ├── peroxy_acid [ester, peroxide] RC(=O)OOH | ||
│ ├── carbamate [ester, amide] ROC(=O)N(R)R | ||
│ └── carboxylic_acid [ester, alcohol] RC(=O)OH | ||
├── alcohol [ether] COH | ||
│ ├── hemiketal [ketal, alcohol] RC(OH)(OR)R | ||
│ │ └── hemiacetal [hemiketal] RC(OC)(OH)H | ||
│ ├── carboxylic_acid [ester, alcohol] RC(=O)OH | ||
│ ├── enol [alcohol] C=COH | ||
│ ├── primary_alcohol [alcohol] CCOH | ||
│ │ └── secondary_alcohol [primary_alcohol] C(C)(C)OH | ||
│ │ └── tertiary_alcohol [secondary_alcohol] C(C)(C)(C)OH | ||
│ └── phenol [alcohol] C:COH | ||
└── peroxide [ether] ROOR | ||
└── peroxy_acid [ester, peroxide] RC(=O)OOH | ||
thioether [ROOT] RSR | ||
└── thioester [ketone, thioether] RC(=O)SR | ||
amine [ROOT] RN(R)R | ||
├── amide [ketone, amine] RC(=O)N(R)R | ||
│ └── carbamate [ester, amide] ROC(=O)N(R)R | ||
└── anilin [amine] C:CN(R)R | ||
carbonyl [ROOT] C(=O) | ||
├── ketene [carbonyl] RC(R)=C=O | ||
└── ketone [carbonyl] RC(=O)R | ||
├── amide [ketone, amine] RC(=O)N(R)R | ||
│ └── carbamate [ester, amide] ROC(=O)N(R)R | ||
├── thioester [ketone, thioether] RC(=O)SR | ||
├── ester [ketone, ether] RC(=O)OR | ||
│ ├── anhydride [ester] RC(=O)OC(=O)R | ||
│ ├── peroxy_acid [ester, peroxide] RC(=O)OOH | ||
│ ├── carbamate [ester, amide] ROC(=O)N(R)R | ||
│ └── carboxylic_acid [ester, alcohol] RC(=O)OH | ||
├── acyl_chloride [ketone] RC(=O)Cl | ||
└── aldehyde [ketone] RC(=O)H | ||
nitrose [ROOT] RN=O | ||
└── nitro [nitrose] RN(=O)O | ||
nitrile [ROOT] RC#N |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,15 @@ | ||
=============== | ||
Getting Started | ||
=============== | ||
|
||
Installation | ||
============ | ||
|
||
|
||
Query Functional Groups | ||
======================= | ||
|
||
|
||
|
||
A Simple Reaction Proxy | ||
======================= |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,184 @@ | ||
============ | ||
Graph Syntax | ||
============ | ||
|
||
FGUtils has its own graph description language. The syntax is closely related | ||
to the SMILES format for molecules and reactions. It is kind of an extenstion | ||
to SMILES to support modeling ITS graphs and reaction patterns. To convert the | ||
SMILES-like description into a graph object use the | ||
:py:class:`~fgutils.parse.Parser` class. The Caffeine molecular graph can be | ||
obtained as follows:: | ||
|
||
import matplotlib.pyplot as plt | ||
from fgutils import Parser | ||
from fgutils.vis import plot_as_mol | ||
|
||
parser = Parser() | ||
mol = parser("CN1C=NC2=C1C(=O)N(C(=O)N2C)C") | ||
|
||
fig, ax = plt.subplots(1, 1) | ||
plot_as_mol(mol, ax) | ||
plt.show() | ||
|
||
.. image:: figures/caffeine_example.png | ||
:width: 300 | ||
|
||
Besides parsing common SMILES it is possible to generate molecule-like graphs | ||
with more abstract nodes, i.e., arbitrary node labels. Arbitrary node labels | ||
are surrounded by ``{}`` (e.g. ``{label}``). This abstract labeling can be used | ||
to substitute nodes with specific patterns. In this context the labels are | ||
group names of :py:class:`~fgutils.proxy.ProxyGroup` objects. A ProxyGroup | ||
defines a set of sub-graphs to be replaced for the labeled node. This can be | ||
done by using a :py:class:`~fgutils.proxy.Proxy`. Propyl acetate can be created | ||
by replacing the labeled node with the propyl group:: | ||
|
||
import matplotlib.pyplot as plt | ||
from fgutils import Parser | ||
from fgutils.proxy import MolProxy, ProxyGroup | ||
from fgutils.vis import GraphVisualizer | ||
|
||
pattern = "CC(=O)O{propyl}" | ||
propyl_group = ProxyGroup("propyl", "CCC") | ||
parser = Parser() | ||
proxy = MolProxy(pattern, propyl_group, parser=parser) | ||
|
||
g = parser(pattern) | ||
mol = next(proxy) | ||
|
||
vis = GraphVisualizer() | ||
fig, ax = plt.subplots(1, 2, dpi=100, figsize=(12, 3)) | ||
vis.plot(g, ax[0], title="Core Pattern") | ||
vis.plot(mol, ax[1], title="Generated Molecule") | ||
plt.show() | ||
|
||
.. image:: figures/labeled_node_example.png | ||
:width: 600 | ||
|
||
|
||
.. note:: | ||
|
||
A node can have more than one label. This can be done by separating the | ||
labels with a comma, e.g.: ``{label_1,label_2}``. | ||
|
||
In the example above the ProxyGroup has only one subgraph pattern. In general, | ||
a ProxyGroup is a collection of several possible subgraphs from which one is | ||
selected when a new sample is instantiated. To get more information on how | ||
graphs are sample take a look at the :py:class:`~fgutils.proxy.GraphSampler` | ||
class and the :py:class:`~fgutils.proxy.ProxyGroup` constructor. By default a | ||
pattern has one anchor at index 0. If you need more control over how a subgraph | ||
is inserted into a parent graph you can instantiate the | ||
:py:class:`~fgutils.proxy.ProxyGraph` class. For a ProxyGraph you can provide a | ||
list of anchor node indices. The insertion of the subgraph into the parent | ||
depends on the number of anchor nodes in the subgraph and the number of edges | ||
to the labeled node in the parent. The first edge in the parent connects to the | ||
first anchor node in the subgraph and so forth. The following example | ||
demonstrates the insertion with multiple anchor nodes:: | ||
|
||
import matplotlib.pyplot as plt | ||
from fgutils.proxy import MolProxy, ProxyGroup, ProxyGraph, Parser | ||
from fgutils.vis import GraphVisualizer | ||
|
||
pattern = "N{g}C{g}(O)C" | ||
g_1 = ProxyGroup("g", ProxyGraph("C1CCCCC1", anchor=[1, 3])) | ||
g_2 = ProxyGroup("g", ProxyGraph("C1CCCCC1", anchor=[1, 3, 4])) | ||
|
||
parser = Parser() | ||
proxy1 = MolProxy(pattern, g_1) | ||
proxy2 = MolProxy(pattern, g_2) | ||
|
||
parent_graph = parser(pattern) | ||
mol1 = next(proxy1) | ||
mol2 = next(proxy2) | ||
|
||
vis = GraphVisualizer(show_edge_labels=False) | ||
fig, ax = plt.subplots(1, 3, dpi=200, figsize=(20, 3)) | ||
vis.plot(parent_graph, ax[0], title="parent") | ||
vis.plot(mol1, ax[1], title="2 anchor nodes") | ||
vis.plot(mol2, ax[2], title="3 anchor nodes") | ||
plt.show() | ||
|
||
.. image:: figures/multiple_anchor_example.png | ||
:width: 1000 | ||
|
||
Another extension to the SMILES notation is the encoding of bond changes. This | ||
feature is required to model reaction mechanisms as ITS graph. Changing bonds | ||
are surrounded by ``<>`` (e.g. ``<1, 2>`` for the formation of a double bond | ||
from a single bond). The extended notation allows the automated generation of | ||
reaction examples with complete atom-to-atom maps. The following code snippet | ||
demonstrates the generation of a few Diels-Alder reactions. The **diene** and | ||
**dienophile** groups can of course be extended to increase varaity of the | ||
samples:: | ||
|
||
|
||
import random | ||
import matplotlib.pyplot as plt | ||
from fgutils.proxy import ProxyGroup, ProxyGraph, ReactionProxy | ||
from fgutils.proxy_collection.common import common_groups | ||
from fgutils.vis import plot_reaction, plot_its | ||
from fgutils.chem.its import get_its | ||
|
||
|
||
electron_donating_group = ProxyGroup( | ||
"electron_donating_group", | ||
["{methyl}", "{ethyl}", "{propyl}", "{aryl}", "{amine}"], | ||
) | ||
electron_withdrawing_group = ProxyGroup( | ||
"electron_withdrawing_group", | ||
["{alkohol}", "{ether}", "{aldehyde}", "{ester}", "{nitrile}"], | ||
) | ||
diene_group = ProxyGroup( | ||
"diene", | ||
ProxyGraph("C<2,1>C<1,2>C<2,1>C{electron_donating_group}", anchor=[0, 3]), | ||
) | ||
dienophile_group = ProxyGroup( | ||
"dienophile", | ||
ProxyGraph("C<2,1>C{electron_withdrawing_group}", anchor=[0, 1]), | ||
) | ||
groups = common_groups + [ | ||
electron_donating_group, | ||
electron_withdrawing_group, | ||
diene_group, | ||
dienophile_group, | ||
] | ||
|
||
proxy = ReactionProxy("{diene}1<0,1>{dienophile}<0,1>1", groups) | ||
|
||
n = 4 | ||
fig, ax = plt.subplots(n, 2, width_ratios=[2, 1], figsize=(20, n * 4)) | ||
for i, (g, h) in enumerate(random.sample(list(proxy), n)): | ||
plot_reaction(g, h, ax[i, 0], title="Reaction") | ||
plot_its(get_its(g, h), ax[i, 1], title="ITS Graph") | ||
plt.tight_layout() | ||
plt.show() | ||
|
||
.. image:: figures/diels_alder_example.png | ||
:width: 1000 | ||
|
||
This proxy can now generate Diels-Alder reaction samples. A few of the results | ||
are shown in the figure above. On the left side the reaction and on the right | ||
side the resulting ITS. The results are balanced and have complete atom-to-atom | ||
maps. The atom-to-atom maps are correct as long as the configuration makes | ||
sence in the chemical domain. Note that the synthesizability of the generated | ||
samples can not be guaranteed. It soley depends on what ProxyGroups and | ||
ProxyGraphs are configured. For a comprehensive Diels-Alder reaction proxy take | ||
a look at the | ||
:py:class:`~fgutils.proxy_collection.diels_alder_proxy.DielsAlderProxy` class | ||
and the section TODO. This class is also able to generate negative Diels-Alder | ||
reaction samples, i.e., reactions where a Diels-Alder graph transformation rule | ||
is theoretically applicable but the reaction will never happen in reality. | ||
|
||
.. note:: | ||
|
||
The ``electron_donating_group`` and ``electron_withdrawing_group`` serve as | ||
a collection of other groups to simplify the notation. They consist of a | ||
single node with multiple labels. When iterating the next sample from the | ||
proxy the labeled nodes get replaced by the pattern from one of the groups. | ||
The group/label is chosen randomly with uniform distribution. | ||
|
||
|
||
.. warning:: | ||
|
||
The call ``list(proxy)`` will generate all possible instantiations at once. | ||
Depending on the configuration this can take a long time to complete. If the | ||
core ProxyGroup graph sampling is not unique this can even result in an | ||
endless loop. |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,16 @@ | ||
.. FGUtils documentation master file, created by | ||
sphinx-quickstart on Tue Aug 20 09:10:34 2024. | ||
You can adapt this file completely to your liking, but it should at least | ||
contain the root `toctree` directive. | ||
Welcome to FGUtils's documentation! | ||
=================================== | ||
|
||
.. toctree:: | ||
:maxdepth: 1 | ||
|
||
getting_started | ||
functional_groups | ||
graph_syntax | ||
proxy_collection | ||
references |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,95 @@ | ||
================ | ||
Proxy Collection | ||
================ | ||
|
||
|
||
.. _diels-alder_reaction_proxy: | ||
|
||
Diels-Alder Reaction Proxy | ||
========================== | ||
|
||
The class | ||
:py:class:`~fgutils.proxy_collection.diels_alder_proxy.DielsAlderProxy` | ||
implements a comprehensive Diels-Alder reaction generation proxy. It generates | ||
ring formation reaction samples between diene and dienophile molecules. It can | ||
also be used to generate negative reaction samples, i.e., reactions where a | ||
Diels-Alder graph transformation rule is theoretically applicable but the | ||
reaction will never happen in reality. | ||
|
||
The Diels-Alder Reaction proxy has the following reaction center graphs | ||
configured (ProxyGraphs of the *core* ProxyGroup). Node labels show the atom | ||
type or group name and the node index and edge labels depict the bond type. | ||
Bond changes are depicted as tuples ``(<bond_g>,<bond_h>)`` where the first | ||
entry is the bond in the reactants and the second entry is the bond in the | ||
product. | ||
|
||
|
||
.. image:: figures/proxy_collection_DAReactionProxy.png | ||
:width: 1000 | ||
|
||
The **Intra Molecular Center** is used to generate intra molecular Diels-Alder | ||
reactions. It depicts the entire reaction center. It's easy to identify the [4 | ||
+ 2] cycloaddition reaction center between the diene part (C=CC=C, node 0-3) | ||
and the dienophile part (C=C, node 5-6). Diene and dienophile are connected | ||
by node 4, a graph from the **intra_mol_bridge** group. The **Inter Molecular | ||
Center** depicts the abstracted reaction center for inter molecular Diels-Alder | ||
reactions. | ||
|
||
.. note:: | ||
|
||
Have you noticed that the **Inter Molecular Center** is a multi graph (it | ||
has multiple edges between two vertices)? Molecular graphs are never multi | ||
graphs, the bond order is represented as label on the unique edge between | ||
two atoms. However, to represent this abstract form of the reaction center | ||
it's necessary to have independent edges for both single bond formations. | ||
To parse this type of graphs you must set the ``use_multigraph=True`` | ||
argument in :py:class:`~fgutils.parse.Parser`. | ||
|
||
The next level of abstraction are the **diene** and **dienophile** ProxyGroups. | ||
They provide the structures depicted in the following two figures, | ||
respectively. Anchor nodes, the nodes where the substructures connect to the | ||
parent, are surrounded by ``[]``. | ||
|
||
**Graphs from the diene group:** | ||
|
||
.. image:: figures/proxy_collection_DAReactionProxy_diene.png | ||
:width: 1000 | ||
|
||
**Graphs from the dienophile group:** | ||
|
||
.. image:: figures/proxy_collection_DAReactionProxy_dienophile.png | ||
:width: 1000 | ||
|
||
**Graphs from the electron_donating_group:** | ||
|
||
.. image:: figures/proxy_collection_DAReactionProxy_EDG.png | ||
:width: 1000 | ||
|
||
**Graphs from the electron_withdrawing_group:** | ||
|
||
.. image:: figures/proxy_collection_DAReactionProxy_EWG.png | ||
:width: 1000 | ||
|
||
**A few of the 2873 Diels-Alder reactions generated by the proxy:** | ||
|
||
.. image:: figures/proxy_collection_DAReactionProxy_DA_reactions.png | ||
:width: 1000 | ||
|
||
Invalid Sample Generation (negative data) | ||
----------------------------------------- | ||
|
||
The :py:class:`~fgutils.proxy_collection.diels_alder_proxy.DielsAlderProxy` | ||
provides the possibility to generate Diels-Alder counter examples, i.e., | ||
reactions where base on the structure a Diels-Alder graph transformation rule | ||
would be applicable but the reaction can not happen because the diene is for | ||
example in a fixed s-trans conformation. | ||
|
||
**Graphs from the (s-trans) diene group:** | ||
|
||
.. image:: figures/proxy_collection_DAReactionProxy_s-trans_diene.png | ||
:width: 1000 | ||
|
||
**A few of the 6253 INVALID Diels-Alder reactions generated by the proxy (negative data):** | ||
|
||
.. image:: figures/proxy_collection_DAReactionProxy_DA_reactions_neg.png | ||
:width: 1000 |
Oops, something went wrong.