rdkit.Chem.rdSubstructLibrary module

rdkit.Chem.rdSubstructLibrary.AddPatterns((SubstructLibrary)sslib[, (int)numThreads=1]) None :

Add pattern fingerprints to the given library, use numThreads=-1 to use all available cores

C++ signature :

void AddPatterns(RDKit::SubstructLibraryWrap {lvalue} [,int=1])

AddPatterns( (SubstructLibrary)sslib, (FPHolderBase)patterns [, (int)numThreads=1]) -> None :

Add pattern fingerprints to the given library, use numThreads=-1 to use all available cores

C++ signature :

void AddPatterns(RDKit::SubstructLibraryWrap {lvalue},boost::shared_ptr<RDKit::FPHolderBase> [,int=1])

class rdkit.Chem.rdSubstructLibrary.CachedMolHolder((object)self)

Bases: MolHolderBase

Holds molecules in their binary representation. This allows more molecules to be held in memory at a time

AddMol(mol) -> adds a molecule to the molecule holder, returns index of molecule

AddBinary(data) -> adds a picked molecule molecule to the molecule holder, returns index of molecule

The data is stored as-is, no checking is done for validity.

GetMol(idx) -> return the molecule at index idx

C++ signature :

void __init__(_object*)

AddBinary((CachedMolHolder)self, (str)pickle) int :

Add a binary pickle to the molecule holder, no checking is done on the input data

C++ signature :

unsigned int AddBinary(RDKit::CachedMolHolder {lvalue},std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> >)

class rdkit.Chem.rdSubstructLibrary.CachedSmilesMolHolder((object)self)

Bases: MolHolderBase

Holds molecules as smiles string This allows more molecules to be held in memory at a time

AddMol(mol) -> adds a molecule to the molecule holder, returns index of molecule

AddSmiles(smiles) -> adds a smiles string to the molecule holder, returns index of molecule

The smiles is stored as-is, no checking is done for validity.

GetMol(idx) -> return the molecule at index idx

C++ signature :

void __init__(_object*)

AddSmiles((CachedSmilesMolHolder)self, (str)smiles) int :

Add a trusted smiles string to the molecule holder, no checking is done on the input data

C++ signature :

unsigned int AddSmiles(RDKit::CachedSmilesMolHolder {lvalue},std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> >)

class rdkit.Chem.rdSubstructLibrary.CachedTrustedSmilesMolHolder((object)self)

Bases: MolHolderBase

Holds molecules as trusted smiles string This allows more molecules to be held in memory at a time and avoids RDKit sanitization overhead. See: http://rdkit.blogspot.com/2016/09/avoiding-unnecessary-work-and.html

AddMol(mol) -> adds a molecule to the molecule holder, returns index of molecule

AddSmiles(smiles) -> adds a smiles string to the molecule holder, returns index of molecule

The smiles is stored as-is, no checking is done for validity.

GetMol(idx,s) -> return the molecule at index idx,

note, only light sanitization is done here, for instance the molecules RingInfo is not initialized

C++ signature :

void __init__(_object*)

AddSmiles((CachedTrustedSmilesMolHolder)self, (str)smiles) int :

Add a trusted smiles string to the molecule holder, no checking is done on the input data

C++ signature :

unsigned int AddSmiles(RDKit::CachedTrustedSmilesMolHolder {lvalue},std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> >)

class rdkit.Chem.rdSubstructLibrary.FPHolderBase

Bases: instance

Raises an exception This class cannot be instantiated from Python

AddFingerprint((FPHolderBase)self, (ExplicitBitVect)v) int :

Adds a raw bit vector to the fingerprint database, returns the index of the supplied pattern

C++ signature :

unsigned int AddFingerprint(RDKit::FPHolderBase {lvalue},ExplicitBitVect)

AddMol((FPHolderBase)self, (Mol)m) int :

Adds a molecule to the fingerprint database, returns the index of the new pattern

C++ signature :

unsigned int AddMol(RDKit::FPHolderBase {lvalue},RDKit::ROMol)

GetFingerprint((FPHolderBase)self, (int)idx) ExplicitBitVect :

Return the bit vector at the specified index

C++ signature :

ExplicitBitVect GetFingerprint(RDKit::FPHolderBase {lvalue},unsigned int)

MakeFingerprint((FPHolderBase)self, (Mol)mol) ExplicitBitVect :

Compute the query bits for the holder

C++ signature :

ExplicitBitVect* MakeFingerprint(RDKit::FPHolderBase {lvalue},RDKit::ROMol)

PassesFilter((FPHolderBase)self, (int)idx, (ExplicitBitVect)query) bool :

Returns True if the specified index passes the filter supplied by the query bit vector

C++ signature :

bool PassesFilter(RDKit::FPHolderBase {lvalue},unsigned int,ExplicitBitVect)

class rdkit.Chem.rdSubstructLibrary.KeyFromPropHolder((object)self)

Bases: KeyHolderBase

Holds keys to return external references to the molecules in the molholder. By default use the _Name property but can be overridden to be any property

C++ signature :

void __init__(_object*)

__init__( (object)self, (str)propname) -> None :

C++ signature :

void __init__(_object*,std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> >)

GetPropName((KeyFromPropHolder)self) str :

Return the key for the given molecule index

C++ signature :

std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > GetPropName(RDKit::KeyFromPropHolder {lvalue})

class rdkit.Chem.rdSubstructLibrary.KeyHolderBase

Bases: instance

Raises an exception This class cannot be instantiated from Python

AddKey((KeyHolderBase)self, (str)arg1) int :

Add a key to the key holder, must be manually synced

C++ signature :

unsigned int AddKey(RDKit::KeyHolderBase {lvalue},std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> >)

AddMol((KeyHolderBase)self, (Mol)m) int :

Adds a molecule to the fingerprint database, returns the index of the new pattern

C++ signature :

unsigned int AddMol(RDKit::KeyHolderBase {lvalue},RDKit::ROMol)

GetKey((KeyHolderBase)self, (int)arg1) str :

Return the key at the specified index

C++ signature :

std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > GetKey(RDKit::KeyHolderBase {lvalue},unsigned int)

GetKeys((KeyHolderBase)self, (_vectj)indices) _vectNSt7__cxx1112basic_stringIcSt11char_traitsIcESaIcEEE :

Returns the keys for the given indices as return by GetMatches

ARGUMENTS:
  • indices: The indices of the keys

C++ signature :

std::vector<std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> >, std::allocator<std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > > > GetKeys(RDKit::KeyHolderBase {lvalue},std::vector<unsigned int, std::allocator<unsigned int> >)

class rdkit.Chem.rdSubstructLibrary.MolHolder((object)self)

Bases: MolHolderBase

Holds raw in-memory molecules AddMol(mol) -> adds a molecule to the molecule holder, returns index of molecule GetMol(idx,sanitize=True) -> return the molecule at index idx

C++ signature :

void __init__(_object*)

class rdkit.Chem.rdSubstructLibrary.MolHolderBase

Bases: instance

Raises an exception This class cannot be instantiated from Python

AddMol((MolHolderBase)self, (Mol)m) int :

Adds molecule to the molecule holder

C++ signature :

unsigned int AddMol(RDKit::MolHolderBase {lvalue},RDKit::ROMol)

GetMol((MolHolderBase)self, (int)arg1) Mol :

Returns a particular molecule in the molecule holder

ARGUMENTS:
  • idx: which molecule to return

  • sanitize: if sanitize is False, return the internal molecule state [default True]

NOTE: molecule indices start at 0

C++ signature :

boost::shared_ptr<RDKit::ROMol> GetMol(RDKit::MolHolderBase {lvalue},unsigned int)

class rdkit.Chem.rdSubstructLibrary.PatternHolder((object)self)

Bases: FPHolderBase

Holds fingerprints with optional, user-defined number of bits (default: 2048) used for filtering of molecules.

C++ signature :

void __init__(_object*)

__init__( (object)self, (int)numBits) -> None :

C++ signature :

void __init__(_object*,unsigned int)

class rdkit.Chem.rdSubstructLibrary.SubstructLibrary((object)self)

Bases: instance

SubstructLibrary: This provides a simple API for substructure searching large datasets The SubstructLibrary takes full advantage of available threads during the search operation. Basic operation is simple

>>> from __future__ import print_function
>>> import os
>>> from rdkit import Chem, RDConfig
>>> from rdkit.Chem import rdSubstructLibrary
>>> library = rdSubstructLibrary.SubstructLibrary()
>>> for mol in Chem.SDMolSupplier(os.path.join(RDConfig.RDDataDir,
...                               'NCI', 'first_200.props.sdf')):
...   idx = library.AddMol(mol)
>>> core = Chem.MolFromSmarts('CCCCOC')
>>> indices = library.GetMatches(core)
>>> len(indices)
11

Substructure matching options can be sent into GetMatches:

>>> indices = library.GetMatches(core, useChirality=False)
>>> len(indices)
11

Controlling the number of threads or the maximum number of matches returned: is also available (the default is to run on all cores)

>>> indices = library.GetMatches(core, numThreads=2, maxResults=10)
>>> len(indices)
10

Working on larger datasets:

Molecules are fairly large objects and will limit the number that can be kept in memory. To assist this we supply three other molecule holders:

CachedMolHolder - stores molecules as their pickled representation

CachedSmilesMolHolder - stores molecules internally as smiles strings

CachedTrustedSmilesMolHolder = excepts (and stores) molecules as trusted smiles strings

Using Pattern fingerprints as a pre-filter: Pattern fingerprints provide an easy way to indicate whether the substructure search should be be done at all. This is particularly useful with the Binary and Smiles based molecule holders as they have an expensive molecule creation step in addition to the substructure searching step

>>> library = rdSubstructLibrary.SubstructLibrary(rdSubstructLibrary.CachedSmilesMolHolder(),
...                                               rdSubstructLibrary.PatternHolder())
>>> for mol in Chem.SDMolSupplier(os.path.join(RDConfig.RDDataDir,
...                               'NCI', 'first_200.props.sdf')):
...   idx = library.AddMol(mol)
>>> indices = library.GetMatches(core)
>>> len(indices)
11

This (obviously) takes longer to initialize. However, both the molecule and pattern holders can be populated with raw data, a simple example is below:

>>> import csv
>>> molholder = rdSubstructLibrary.CachedSmilesMolHolder()
>>> pattern_holder = rdSubstructLibrary.PatternHolder()
>>> with open(os.path.join(RDConfig.RDDataDir, 'NCI', 'first_200.tpsa.csv')) as inf:
...   for i, row in enumerate(csv.reader(inf)):
...     if i:
...       idx = molholder.AddSmiles(row[0])
...       idx2 = pattern_holder.AddFingerprint(
...           pattern_holder.MakeFingerprint(Chem.MolFromSmiles(row[0])))
...       assert idx==idx2
>>> library = rdSubstructLibrary.SubstructLibrary(molholder,pattern_holder)
>>> indices = library.GetMatches(core)
>>> len(indices)
11

Finally, the KeyFromPropHolder can be used to use external keys such as compound names. By default the holder uses the ‘_Name’ property but can be changed to any property.

>>> library = rdSubstructLibrary.SubstructLibrary(rdSubstructLibrary.MolHolder(), rdSubstructLibrary.KeyFromPropHolder())
>>> m = Chem.MolFromSmiles('CCC')
>>> m.SetProp('_Name', 'Z11234')
>>> idx = library.AddMol(m)
>>> indices = library.GetMatches(m)
>>> list(library.GetKeyHolder().GetKeys(indices))
['Z11234']
C++ signature :

void __init__(_object*)

__init__( (object)self, (MolHolderBase)molecules) -> None :

C++ signature :

void __init__(_object*,boost::shared_ptr<RDKit::MolHolderBase>)

__init__( (object)self, (MolHolderBase)molecules, (FPHolderBase)fingerprints) -> None :

C++ signature :

void __init__(_object*,boost::shared_ptr<RDKit::MolHolderBase>,boost::shared_ptr<RDKit::FPHolderBase>)

__init__( (object)self, (MolHolderBase)molecules, (KeyHolderBase)keys) -> None :

C++ signature :

void __init__(_object*,boost::shared_ptr<RDKit::MolHolderBase>,boost::shared_ptr<RDKit::KeyHolderBase>)

__init__( (object)self, (MolHolderBase)molecules, (FPHolderBase)fingerprints, (KeyHolderBase)keys) -> None :

C++ signature :

void __init__(_object*,boost::shared_ptr<RDKit::MolHolderBase>,boost::shared_ptr<RDKit::FPHolderBase>,boost::shared_ptr<RDKit::KeyHolderBase>)

__init__( (object)self, (str)pickle) -> None :

C++ signature :

void __init__(_object*,std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> >)

AddMol((SubstructLibrary)self, (Mol)mol) int :

Adds a molecule to the substruct library

C++ signature :

unsigned int AddMol(RDKit::SubstructLibraryWrap {lvalue},RDKit::ROMol)

CountMatches((SubstructLibrary)self, (Mol)query[, (bool)recursionPossible=True[, (bool)useChirality=True[, (bool)useQueryQueryMatches=False[, (int)numThreads=-1]]]]) int :

Get the matches for the query.

Arguments:
  • query: substructure query

  • numThreads: number of threads to use, -1 means all threads

C++ signature :

unsigned int CountMatches(RDKit::SubstructLibraryWrap {lvalue},RDKit::ROMol [,bool=True [,bool=True [,bool=False [,int=-1]]]])

CountMatches( (SubstructLibrary)self, (Mol)query, (int)startIdx, (int)endIdx [, (bool)recursionPossible=True [, (bool)useChirality=True [, (bool)useQueryQueryMatches=False [, (int)numThreads=-1]]]]) -> int :

Get the matches for the query.

Arguments:
  • query: substructure query

  • startIdx: index to search from

  • endIdx: index (non-inclusize) to search to

  • numThreads: number of threads to use, -1 means all threads

C++ signature :

unsigned int CountMatches(RDKit::SubstructLibraryWrap {lvalue},RDKit::ROMol,unsigned int,unsigned int [,bool=True [,bool=True [,bool=False [,int=-1]]]])

CountMatches( (SubstructLibrary)self, (Mol)query, (SubstructMatchParameters)parameters [, (int)numThreads=-1]) -> int :

Get the matches for the query.

Arguments:
  • query: substructure query

  • numThreads: number of threads to use, -1 means all threads

C++ signature :

unsigned int CountMatches(RDKit::SubstructLibraryWrap {lvalue},RDKit::ROMol,RDKit::SubstructMatchParameters [,int=-1])

CountMatches( (SubstructLibrary)self, (Mol)query, (int)startIdx, (int)endIdx, (SubstructMatchParameters)parameters [, (int)numThreads=-1]) -> int :

Get the matches for the query.

Arguments:
  • query: substructure query

  • startIdx: index to search from

  • endIdx: index (non-inclusize) to search to

  • numThreads: number of threads to use, -1 means all threads

C++ signature :

unsigned int CountMatches(RDKit::SubstructLibraryWrap {lvalue},RDKit::ROMol,unsigned int,unsigned int,RDKit::SubstructMatchParameters [,int=-1])

CountMatches( (SubstructLibrary)self, (object)query [, (bool)recursionPossible=True [, (bool)useChirality=True [, (bool)useQueryQueryMatches=False [, (int)numThreads=-1]]]]) -> int :

Get the matches for the query.

Arguments:
  • query: substructure query

  • numThreads: number of threads to use, -1 means all threads

C++ signature :

unsigned int CountMatches(RDKit::SubstructLibraryWrap {lvalue},RDKit::TautomerQuery [,bool=True [,bool=True [,bool=False [,int=-1]]]])

CountMatches( (SubstructLibrary)self, (object)query, (int)startIdx, (int)endIdx [, (bool)recursionPossible=True [, (bool)useChirality=True [, (bool)useQueryQueryMatches=False [, (int)numThreads=-1]]]]) -> int :

Get the matches for the query.

Arguments:
  • query: substructure query

  • startIdx: index to search from

  • endIdx: index (non-inclusize) to search to

  • numThreads: number of threads to use, -1 means all threads

C++ signature :

unsigned int CountMatches(RDKit::SubstructLibraryWrap {lvalue},RDKit::TautomerQuery,unsigned int,unsigned int [,bool=True [,bool=True [,bool=False [,int=-1]]]])

CountMatches( (SubstructLibrary)self, (object)query, (SubstructMatchParameters)parameters [, (int)numThreads=-1]) -> int :

Get the matches for the query.

Arguments:
  • query: substructure query

  • numThreads: number of threads to use, -1 means all threads

C++ signature :

unsigned int CountMatches(RDKit::SubstructLibraryWrap {lvalue},RDKit::TautomerQuery,RDKit::SubstructMatchParameters [,int=-1])

CountMatches( (SubstructLibrary)self, (object)query, (int)startIdx, (int)endIdx, (SubstructMatchParameters)parameters [, (int)numThreads=-1]) -> int :

Get the matches for the query.

Arguments:
  • query: substructure query

  • startIdx: index to search from

  • endIdx: index (non-inclusize) to search to

  • numThreads: number of threads to use, -1 means all threads

C++ signature :

unsigned int CountMatches(RDKit::SubstructLibraryWrap {lvalue},RDKit::TautomerQuery,unsigned int,unsigned int,RDKit::SubstructMatchParameters [,int=-1])

CountMatches( (SubstructLibrary)self, (MolBundle)query [, (bool)recursionPossible=True [, (bool)useChirality=True [, (bool)useQueryQueryMatches=False [, (int)numThreads=-1]]]]) -> int :

Get the matches for the query.

Arguments:
  • query: substructure query

  • numThreads: number of threads to use, -1 means all threads

C++ signature :

unsigned int CountMatches(RDKit::SubstructLibraryWrap {lvalue},RDKit::MolBundle [,bool=True [,bool=True [,bool=False [,int=-1]]]])

CountMatches( (SubstructLibrary)self, (MolBundle)query, (int)startIdx, (int)endIdx [, (bool)recursionPossible=True [, (bool)useChirality=True [, (bool)useQueryQueryMatches=False [, (int)numThreads=-1]]]]) -> int :

Get the matches for the query.

Arguments:
  • query: substructure query

  • startIdx: index to search from

  • endIdx: index (non-inclusize) to search to

  • numThreads: number of threads to use, -1 means all threads

C++ signature :

unsigned int CountMatches(RDKit::SubstructLibraryWrap {lvalue},RDKit::MolBundle,unsigned int,unsigned int [,bool=True [,bool=True [,bool=False [,int=-1]]]])

CountMatches( (SubstructLibrary)self, (MolBundle)query, (SubstructMatchParameters)parameters [, (int)numThreads=-1]) -> int :

Get the matches for the query.

Arguments:
  • query: substructure query

  • numThreads: number of threads to use, -1 means all threads

C++ signature :

unsigned int CountMatches(RDKit::SubstructLibraryWrap {lvalue},RDKit::MolBundle,RDKit::SubstructMatchParameters [,int=-1])

CountMatches( (SubstructLibrary)self, (MolBundle)query, (int)startIdx, (int)endIdx, (SubstructMatchParameters)parameters [, (int)numThreads=-1]) -> int :

Get the matches for the query.

Arguments:
  • query: substructure query

  • startIdx: index to search from

  • endIdx: index (non-inclusize) to search to

  • numThreads: number of threads to use, -1 means all threads

C++ signature :

unsigned int CountMatches(RDKit::SubstructLibraryWrap {lvalue},RDKit::MolBundle,unsigned int,unsigned int,RDKit::SubstructMatchParameters [,int=-1])

CountMatches( (SubstructLibrary)self, (object)query [, (bool)recursionPossible=True [, (bool)useChirality=True [, (bool)useQueryQueryMatches=False [, (int)numThreads=-1]]]]) -> int :

Get the matches for the query.

Arguments:
  • query: substructure query

  • numThreads: number of threads to use, -1 means all threads

C++ signature :

unsigned int CountMatches(RDKit::SubstructLibraryWrap {lvalue},RDKit::GeneralizedSubstruct::ExtendedQueryMol [,bool=True [,bool=True [,bool=False [,int=-1]]]])

CountMatches( (SubstructLibrary)self, (object)query, (int)startIdx, (int)endIdx [, (bool)recursionPossible=True [, (bool)useChirality=True [, (bool)useQueryQueryMatches=False [, (int)numThreads=-1]]]]) -> int :

Get the matches for the query.

Arguments:
  • query: substructure query

  • startIdx: index to search from

  • endIdx: index (non-inclusize) to search to

  • numThreads: number of threads to use, -1 means all threads

C++ signature :

unsigned int CountMatches(RDKit::SubstructLibraryWrap {lvalue},RDKit::GeneralizedSubstruct::ExtendedQueryMol,unsigned int,unsigned int [,bool=True [,bool=True [,bool=False [,int=-1]]]])

CountMatches( (SubstructLibrary)self, (object)query, (SubstructMatchParameters)parameters [, (int)numThreads=-1]) -> int :

Get the matches for the query.

Arguments:
  • query: substructure query

  • numThreads: number of threads to use, -1 means all threads

C++ signature :

unsigned int CountMatches(RDKit::SubstructLibraryWrap {lvalue},RDKit::GeneralizedSubstruct::ExtendedQueryMol,RDKit::SubstructMatchParameters [,int=-1])

CountMatches( (SubstructLibrary)self, (object)query, (int)startIdx, (int)endIdx, (SubstructMatchParameters)parameters [, (int)numThreads=-1]) -> int :

Get the matches for the query.

Arguments:
  • query: substructure query

  • startIdx: index to search from

  • endIdx: index (non-inclusize) to search to

  • numThreads: number of threads to use, -1 means all threads

C++ signature :

unsigned int CountMatches(RDKit::SubstructLibraryWrap {lvalue},RDKit::GeneralizedSubstruct::ExtendedQueryMol,unsigned int,unsigned int,RDKit::SubstructMatchParameters [,int=-1])

GetFpHolder((SubstructLibrary)self) FPHolderBase :
C++ signature :

boost::shared_ptr<RDKit::FPHolderBase> GetFpHolder(RDKit::SubstructLibraryWrap {lvalue})

GetKeyHolder((SubstructLibrary)self) KeyHolderBase :
C++ signature :

boost::shared_ptr<RDKit::KeyHolderBase> GetKeyHolder(RDKit::SubstructLibraryWrap {lvalue})

GetMatches((SubstructLibrary)self, (Mol)query[, (bool)recursionPossible=True[, (bool)useChirality=True[, (bool)useQueryQueryMatches=False[, (int)numThreads=-1[, (int)maxResults=1000]]]]]) _vectj :

Get the matches for the query.

Arguments:
  • query: substructure query

  • numThreads: number of threads to use, -1 means all threads

  • maxResults: maximum number of results to return

C++ signature :

std::vector<unsigned int, std::allocator<unsigned int> > GetMatches(RDKit::SubstructLibraryWrap {lvalue},RDKit::ROMol [,bool=True [,bool=True [,bool=False [,int=-1 [,int=1000]]]]])

GetMatches( (SubstructLibrary)self, (Mol)query, (int)startIdx, (int)endIdx [, (bool)recursionPossible=True [, (bool)useChirality=True [, (bool)useQueryQueryMatches=False [, (int)numThreads=-1 [, (int)maxResults=1000]]]]]) -> _vectj :

Get the matches for the query.

Arguments:
  • query: substructure query

  • startIdx: index to search from

  • endIdx: index (non-inclusize) to search to

  • numThreads: number of threads to use, -1 means all threads

  • maxResults: maximum number of results to return

C++ signature :

std::vector<unsigned int, std::allocator<unsigned int> > GetMatches(RDKit::SubstructLibraryWrap {lvalue},RDKit::ROMol,unsigned int,unsigned int [,bool=True [,bool=True [,bool=False [,int=-1 [,int=1000]]]]])

GetMatches( (SubstructLibrary)self, (Mol)query, (SubstructMatchParameters)parameters [, (int)numThreads=-1 [, (int)maxResults=1000]]) -> _vectj :

Get the matches for the query.

Arguments:
  • query: substructure query

  • numThreads: number of threads to use, -1 means all threads

  • maxResults: maximum number of results to return

C++ signature :

std::vector<unsigned int, std::allocator<unsigned int> > GetMatches(RDKit::SubstructLibraryWrap {lvalue},RDKit::ROMol,RDKit::SubstructMatchParameters [,int=-1 [,int=1000]])

GetMatches( (SubstructLibrary)self, (Mol)query, (int)startIdx, (int)endIdx, (SubstructMatchParameters)parameters [, (int)numThreads=-1 [, (int)maxResults=1000]]) -> _vectj :

Get the matches for the query.

Arguments:
  • query: substructure query

  • startIdx: index to search from

  • endIdx: index (non-inclusize) to search to

  • numThreads: number of threads to use, -1 means all threads

  • maxResults: maximum number of results to return

C++ signature :

std::vector<unsigned int, std::allocator<unsigned int> > GetMatches(RDKit::SubstructLibraryWrap {lvalue},RDKit::ROMol,unsigned int,unsigned int,RDKit::SubstructMatchParameters [,int=-1 [,int=1000]])

GetMatches( (SubstructLibrary)self, (object)query [, (bool)recursionPossible=True [, (bool)useChirality=True [, (bool)useQueryQueryMatches=False [, (int)numThreads=-1 [, (int)maxResults=1000]]]]]) -> _vectj :

Get the matches for the query.

Arguments:
  • query: substructure query

  • numThreads: number of threads to use, -1 means all threads

  • maxResults: maximum number of results to return

C++ signature :

std::vector<unsigned int, std::allocator<unsigned int> > GetMatches(RDKit::SubstructLibraryWrap {lvalue},RDKit::TautomerQuery [,bool=True [,bool=True [,bool=False [,int=-1 [,int=1000]]]]])

GetMatches( (SubstructLibrary)self, (object)query, (int)startIdx, (int)endIdx [, (bool)recursionPossible=True [, (bool)useChirality=True [, (bool)useQueryQueryMatches=False [, (int)numThreads=-1 [, (int)maxResults=1000]]]]]) -> _vectj :

Get the matches for the query.

Arguments:
  • query: substructure query

  • startIdx: index to search from

  • endIdx: index (non-inclusize) to search to

  • numThreads: number of threads to use, -1 means all threads

  • maxResults: maximum number of results to return

C++ signature :

std::vector<unsigned int, std::allocator<unsigned int> > GetMatches(RDKit::SubstructLibraryWrap {lvalue},RDKit::TautomerQuery,unsigned int,unsigned int [,bool=True [,bool=True [,bool=False [,int=-1 [,int=1000]]]]])

GetMatches( (SubstructLibrary)self, (object)query, (SubstructMatchParameters)parameters [, (int)numThreads=-1 [, (int)maxResults=1000]]) -> _vectj :

Get the matches for the query.

Arguments:
  • query: substructure query

  • numThreads: number of threads to use, -1 means all threads

  • maxResults: maximum number of results to return

C++ signature :

std::vector<unsigned int, std::allocator<unsigned int> > GetMatches(RDKit::SubstructLibraryWrap {lvalue},RDKit::TautomerQuery,RDKit::SubstructMatchParameters [,int=-1 [,int=1000]])

GetMatches( (SubstructLibrary)self, (object)query, (int)startIdx, (int)endIdx, (SubstructMatchParameters)parameters [, (int)numThreads=-1 [, (int)maxResults=1000]]) -> _vectj :

Get the matches for the query.

Arguments:
  • query: substructure query

  • startIdx: index to search from

  • endIdx: index (non-inclusize) to search to

  • numThreads: number of threads to use, -1 means all threads

  • maxResults: maximum number of results to return

C++ signature :

std::vector<unsigned int, std::allocator<unsigned int> > GetMatches(RDKit::SubstructLibraryWrap {lvalue},RDKit::TautomerQuery,unsigned int,unsigned int,RDKit::SubstructMatchParameters [,int=-1 [,int=1000]])

GetMatches( (SubstructLibrary)self, (MolBundle)query [, (bool)recursionPossible=True [, (bool)useChirality=True [, (bool)useQueryQueryMatches=False [, (int)numThreads=-1 [, (int)maxResults=1000]]]]]) -> _vectj :

Get the matches for the query.

Arguments:
  • query: substructure query

  • numThreads: number of threads to use, -1 means all threads

  • maxResults: maximum number of results to return

C++ signature :

std::vector<unsigned int, std::allocator<unsigned int> > GetMatches(RDKit::SubstructLibraryWrap {lvalue},RDKit::MolBundle [,bool=True [,bool=True [,bool=False [,int=-1 [,int=1000]]]]])

GetMatches( (SubstructLibrary)self, (MolBundle)query, (int)startIdx, (int)endIdx [, (bool)recursionPossible=True [, (bool)useChirality=True [, (bool)useQueryQueryMatches=False [, (int)numThreads=-1 [, (int)maxResults=1000]]]]]) -> _vectj :

Get the matches for the query.

Arguments:
  • query: substructure query

  • startIdx: index to search from

  • endIdx: index (non-inclusize) to search to

  • numThreads: number of threads to use, -1 means all threads

  • maxResults: maximum number of results to return

C++ signature :

std::vector<unsigned int, std::allocator<unsigned int> > GetMatches(RDKit::SubstructLibraryWrap {lvalue},RDKit::MolBundle,unsigned int,unsigned int [,bool=True [,bool=True [,bool=False [,int=-1 [,int=1000]]]]])

GetMatches( (SubstructLibrary)self, (MolBundle)query, (SubstructMatchParameters)parameters [, (int)numThreads=-1 [, (int)maxResults=1000]]) -> _vectj :

Get the matches for the query.

Arguments:
  • query: substructure query

  • numThreads: number of threads to use, -1 means all threads

  • maxResults: maximum number of results to return

C++ signature :

std::vector<unsigned int, std::allocator<unsigned int> > GetMatches(RDKit::SubstructLibraryWrap {lvalue},RDKit::MolBundle,RDKit::SubstructMatchParameters [,int=-1 [,int=1000]])

GetMatches( (SubstructLibrary)self, (MolBundle)query, (int)startIdx, (int)endIdx, (SubstructMatchParameters)parameters [, (int)numThreads=-1 [, (int)maxResults=1000]]) -> _vectj :

Get the matches for the query.

Arguments:
  • query: substructure query

  • startIdx: index to search from

  • endIdx: index (non-inclusize) to search to

  • numThreads: number of threads to use, -1 means all threads

  • maxResults: maximum number of results to return

C++ signature :

std::vector<unsigned int, std::allocator<unsigned int> > GetMatches(RDKit::SubstructLibraryWrap {lvalue},RDKit::MolBundle,unsigned int,unsigned int,RDKit::SubstructMatchParameters [,int=-1 [,int=1000]])

GetMatches( (SubstructLibrary)self, (object)query [, (bool)recursionPossible=True [, (bool)useChirality=True [, (bool)useQueryQueryMatches=False [, (int)numThreads=-1 [, (int)maxResults=1000]]]]]) -> _vectj :

Get the matches for the query.

Arguments:
  • query: substructure query

  • numThreads: number of threads to use, -1 means all threads

  • maxResults: maximum number of results to return

C++ signature :

std::vector<unsigned int, std::allocator<unsigned int> > GetMatches(RDKit::SubstructLibraryWrap {lvalue},RDKit::GeneralizedSubstruct::ExtendedQueryMol [,bool=True [,bool=True [,bool=False [,int=-1 [,int=1000]]]]])

GetMatches( (SubstructLibrary)self, (object)query, (int)startIdx, (int)endIdx [, (bool)recursionPossible=True [, (bool)useChirality=True [, (bool)useQueryQueryMatches=False [, (int)numThreads=-1 [, (int)maxResults=1000]]]]]) -> _vectj :

Get the matches for the query.

Arguments:
  • query: substructure query

  • startIdx: index to search from

  • endIdx: index (non-inclusize) to search to

  • numThreads: number of threads to use, -1 means all threads

  • maxResults: maximum number of results to return

C++ signature :

std::vector<unsigned int, std::allocator<unsigned int> > GetMatches(RDKit::SubstructLibraryWrap {lvalue},RDKit::GeneralizedSubstruct::ExtendedQueryMol,unsigned int,unsigned int [,bool=True [,bool=True [,bool=False [,int=-1 [,int=1000]]]]])

GetMatches( (SubstructLibrary)self, (object)query, (SubstructMatchParameters)parameters [, (int)numThreads=-1 [, (int)maxResults=1000]]) -> _vectj :

Get the matches for the query.

Arguments:
  • query: substructure query

  • numThreads: number of threads to use, -1 means all threads

  • maxResults: maximum number of results to return

C++ signature :

std::vector<unsigned int, std::allocator<unsigned int> > GetMatches(RDKit::SubstructLibraryWrap {lvalue},RDKit::GeneralizedSubstruct::ExtendedQueryMol,RDKit::SubstructMatchParameters [,int=-1 [,int=1000]])

GetMatches( (SubstructLibrary)self, (object)query, (int)startIdx, (int)endIdx, (SubstructMatchParameters)parameters [, (int)numThreads=-1 [, (int)maxResults=1000]]) -> _vectj :

Get the matches for the query.

Arguments:
  • query: substructure query

  • startIdx: index to search from

  • endIdx: index (non-inclusize) to search to

  • numThreads: number of threads to use, -1 means all threads

  • maxResults: maximum number of results to return

C++ signature :

std::vector<unsigned int, std::allocator<unsigned int> > GetMatches(RDKit::SubstructLibraryWrap {lvalue},RDKit::GeneralizedSubstruct::ExtendedQueryMol,unsigned int,unsigned int,RDKit::SubstructMatchParameters [,int=-1 [,int=1000]])

GetMol((SubstructLibrary)self, (int)idx) Mol :

Returns a particular molecule in the molecule holder

ARGUMENTS:
  • idx: which molecule to return

NOTE: molecule indices start at 0

C++ signature :

boost::shared_ptr<RDKit::ROMol> GetMol(RDKit::SubstructLibraryWrap {lvalue},unsigned int)

GetMolHolder((SubstructLibrary)self) MolHolderBase :
C++ signature :

boost::shared_ptr<RDKit::MolHolderBase> GetMolHolder(RDKit::SubstructLibraryWrap {lvalue})

GetSearchOrder((SubstructLibrary)self) tuple :

Returns the search order for the library

NOTE: molecule indices start at 0

C++ signature :

boost::python::tuple GetSearchOrder(RDKit::SubstructLibraryWrap)

HasMatch((SubstructLibrary)self, (Mol)query[, (bool)recursionPossible=True[, (bool)useChirality=True[, (bool)useQueryQueryMatches=False[, (int)numThreads=-1]]]]) bool :

Get the matches for the query.

Arguments:
  • query: substructure query

  • numThreads: number of threads to use, -1 means all threads

C++ signature :

bool HasMatch(RDKit::SubstructLibraryWrap {lvalue},RDKit::ROMol [,bool=True [,bool=True [,bool=False [,int=-1]]]])

HasMatch( (SubstructLibrary)self, (Mol)query, (int)startIdx, (int)endIdx [, (bool)recursionPossible=True [, (bool)useChirality=True [, (bool)useQueryQueryMatches=False [, (int)numThreads=-1]]]]) -> bool :

Get the matches for the query.

Arguments:
  • query: substructure query

  • startIdx: index to search from

  • endIdx: index (non-inclusize) to search to

  • numThreads: number of threads to use, -1 means all threads

C++ signature :

bool HasMatch(RDKit::SubstructLibraryWrap {lvalue},RDKit::ROMol,unsigned int,unsigned int [,bool=True [,bool=True [,bool=False [,int=-1]]]])

HasMatch( (SubstructLibrary)self, (Mol)query, (SubstructMatchParameters)parameters [, (int)numThreads=-1]) -> bool :

Get the matches for the query.

Arguments:
  • query: substructure query

  • numThreads: number of threads to use, -1 means all threads

C++ signature :

bool HasMatch(RDKit::SubstructLibraryWrap {lvalue},RDKit::ROMol,RDKit::SubstructMatchParameters [,int=-1])

HasMatch( (SubstructLibrary)self, (Mol)query, (int)startIdx, (int)endIdx, (SubstructMatchParameters)parameters [, (int)numThreads=-1]) -> bool :

Get the matches for the query.

Arguments:
  • query: substructure query

  • startIdx: index to search from

  • endIdx: index (non-inclusize) to search to

  • numThreads: number of threads to use, -1 means all threads

C++ signature :

bool HasMatch(RDKit::SubstructLibraryWrap {lvalue},RDKit::ROMol,unsigned int,unsigned int,RDKit::SubstructMatchParameters [,int=-1])

HasMatch( (SubstructLibrary)self, (object)query [, (bool)recursionPossible=True [, (bool)useChirality=True [, (bool)useQueryQueryMatches=False [, (int)numThreads=-1]]]]) -> bool :

Get the matches for the query.

Arguments:
  • query: substructure query

  • numThreads: number of threads to use, -1 means all threads

C++ signature :

bool HasMatch(RDKit::SubstructLibraryWrap {lvalue},RDKit::TautomerQuery [,bool=True [,bool=True [,bool=False [,int=-1]]]])

HasMatch( (SubstructLibrary)self, (object)query, (int)startIdx, (int)endIdx [, (bool)recursionPossible=True [, (bool)useChirality=True [, (bool)useQueryQueryMatches=False [, (int)numThreads=-1]]]]) -> bool :

Get the matches for the query.

Arguments:
  • query: substructure query

  • startIdx: index to search from

  • endIdx: index (non-inclusize) to search to

  • numThreads: number of threads to use, -1 means all threads

C++ signature :

bool HasMatch(RDKit::SubstructLibraryWrap {lvalue},RDKit::TautomerQuery,unsigned int,unsigned int [,bool=True [,bool=True [,bool=False [,int=-1]]]])

HasMatch( (SubstructLibrary)self, (object)query, (SubstructMatchParameters)parameters [, (int)numThreads=-1]) -> bool :

Get the matches for the query.

Arguments:
  • query: substructure query

  • numThreads: number of threads to use, -1 means all threads

C++ signature :

bool HasMatch(RDKit::SubstructLibraryWrap {lvalue},RDKit::TautomerQuery,RDKit::SubstructMatchParameters [,int=-1])

HasMatch( (SubstructLibrary)self, (object)query, (int)startIdx, (int)endIdx, (SubstructMatchParameters)parameters [, (int)numThreads=-1]) -> bool :

Get the matches for the query.

Arguments:
  • query: substructure query

  • startIdx: index to search from

  • endIdx: index (non-inclusize) to search to

  • numThreads: number of threads to use, -1 means all threads

C++ signature :

bool HasMatch(RDKit::SubstructLibraryWrap {lvalue},RDKit::TautomerQuery,unsigned int,unsigned int,RDKit::SubstructMatchParameters [,int=-1])

HasMatch( (SubstructLibrary)self, (MolBundle)query [, (bool)recursionPossible=True [, (bool)useChirality=True [, (bool)useQueryQueryMatches=False [, (int)numThreads=-1]]]]) -> bool :

Get the matches for the query.

Arguments:
  • query: substructure query

  • numThreads: number of threads to use, -1 means all threads

C++ signature :

bool HasMatch(RDKit::SubstructLibraryWrap {lvalue},RDKit::MolBundle [,bool=True [,bool=True [,bool=False [,int=-1]]]])

HasMatch( (SubstructLibrary)self, (MolBundle)query, (int)startIdx, (int)endIdx [, (bool)recursionPossible=True [, (bool)useChirality=True [, (bool)useQueryQueryMatches=False [, (int)numThreads=-1]]]]) -> bool :

Get the matches for the query.

Arguments:
  • query: substructure query

  • startIdx: index to search from

  • endIdx: index (non-inclusize) to search to

  • numThreads: number of threads to use, -1 means all threads

C++ signature :

bool HasMatch(RDKit::SubstructLibraryWrap {lvalue},RDKit::MolBundle,unsigned int,unsigned int [,bool=True [,bool=True [,bool=False [,int=-1]]]])

HasMatch( (SubstructLibrary)self, (MolBundle)query, (SubstructMatchParameters)parameters [, (int)numThreads=-1]) -> bool :

Get the matches for the query.

Arguments:
  • query: substructure query

  • numThreads: number of threads to use, -1 means all threads

C++ signature :

bool HasMatch(RDKit::SubstructLibraryWrap {lvalue},RDKit::MolBundle,RDKit::SubstructMatchParameters [,int=-1])

HasMatch( (SubstructLibrary)self, (MolBundle)query, (int)startIdx, (int)endIdx, (SubstructMatchParameters)parameters [, (int)numThreads=-1]) -> bool :

Get the matches for the query.

Arguments:
  • query: substructure query

  • startIdx: index to search from

  • endIdx: index (non-inclusize) to search to

  • numThreads: number of threads to use, -1 means all threads

C++ signature :

bool HasMatch(RDKit::SubstructLibraryWrap {lvalue},RDKit::MolBundle,unsigned int,unsigned int,RDKit::SubstructMatchParameters [,int=-1])

HasMatch( (SubstructLibrary)self, (object)query [, (bool)recursionPossible=True [, (bool)useChirality=True [, (bool)useQueryQueryMatches=False [, (int)numThreads=-1]]]]) -> bool :

Get the matches for the query.

Arguments:
  • query: substructure query

  • numThreads: number of threads to use, -1 means all threads

C++ signature :

bool HasMatch(RDKit::SubstructLibraryWrap {lvalue},RDKit::GeneralizedSubstruct::ExtendedQueryMol [,bool=True [,bool=True [,bool=False [,int=-1]]]])

HasMatch( (SubstructLibrary)self, (object)query, (int)startIdx, (int)endIdx [, (bool)recursionPossible=True [, (bool)useChirality=True [, (bool)useQueryQueryMatches=False [, (int)numThreads=-1]]]]) -> bool :

Get the matches for the query.

Arguments:
  • query: substructure query

  • startIdx: index to search from

  • endIdx: index (non-inclusize) to search to

  • numThreads: number of threads to use, -1 means all threads

C++ signature :

bool HasMatch(RDKit::SubstructLibraryWrap {lvalue},RDKit::GeneralizedSubstruct::ExtendedQueryMol,unsigned int,unsigned int [,bool=True [,bool=True [,bool=False [,int=-1]]]])

HasMatch( (SubstructLibrary)self, (object)query, (SubstructMatchParameters)parameters [, (int)numThreads=-1]) -> bool :

Get the matches for the query.

Arguments:
  • query: substructure query

  • numThreads: number of threads to use, -1 means all threads

C++ signature :

bool HasMatch(RDKit::SubstructLibraryWrap {lvalue},RDKit::GeneralizedSubstruct::ExtendedQueryMol,RDKit::SubstructMatchParameters [,int=-1])

HasMatch( (SubstructLibrary)self, (object)query, (int)startIdx, (int)endIdx, (SubstructMatchParameters)parameters [, (int)numThreads=-1]) -> bool :

Get the matches for the query.

Arguments:
  • query: substructure query

  • startIdx: index to search from

  • endIdx: index (non-inclusize) to search to

  • numThreads: number of threads to use, -1 means all threads

C++ signature :

bool HasMatch(RDKit::SubstructLibraryWrap {lvalue},RDKit::GeneralizedSubstruct::ExtendedQueryMol,unsigned int,unsigned int,RDKit::SubstructMatchParameters [,int=-1])

InitFromStream((SubstructLibrary)self, (AtomPairsParameters)stream) None :

Deserialize a substructure library from a python bytes stream. Python doesn’t allow seeking operations inside a unicode or string stream anymore so this requires opening a file in binary mode or using an io.ByteIO type object

ARGUMENTS:
  • stream: a binary stream like object

SubstructLibrary.Serialize already writes a binary stream

>>> from rdkit.Chem import rdSubstructLibrary
>>> import io
>>> lib = rdSubstructLibrary.SubstructLibrary()
>>> stream = io.BytesIO( lib.Serialize() )
>>> lib.InitFromStream(stream)

remember to write to text and read from a binary stream

>>> with open('rdkit.sslib', 'w') as f: lib.ToStream(f)
>>> with open('rdkit.sslib', 'rb') as f: lib.InitFromStream(f)
C++ signature :

void InitFromStream(RDKit::SubstructLibraryWrap {lvalue},boost::python::api::object {lvalue})

Serialize((SubstructLibrary)self) object :
C++ signature :

boost::python::api::object Serialize(RDKit::SubstructLibraryWrap)

SetSearchOrder((SubstructLibrary)self, (AtomPairsParameters)seq) None :

Sets the search order for the library

ARGUMENTS:
  • order: sequence of molecule indices

NOTE: molecule indices start at 0

C++ signature :

void SetSearchOrder(RDKit::SubstructLibraryWrap {lvalue},boost::python::api::object)

ToStream((SubstructLibrary)self, (AtomPairsParameters)stream) None :

Serialize a substructure library to a python text stream. The stream can be a file in text mode or an io.StringIO type object

ARGUMENTS:
  • stream: a text or text stream like object

>>> from rdkit.Chem import rdSubstructLibrary
>>> import io
>>> lib = rdSubstructLibrary.SubstructLibrary()
>>> stream = io.StringIO()
>>> lib.ToStream(stream)

or

>>> with open('rdkit.sslib', 'w') as stream:
...  lib.ToStream(stream)
C++ signature :

void ToStream(RDKit::SubstructLibraryWrap,boost::python::api::object {lvalue})

rdkit.Chem.rdSubstructLibrary.SubstructLibraryCanSerialize() bool :

Returns True if the SubstructLibrary is serializable (requires boost serialization

C++ signature :

bool SubstructLibraryCanSerialize()

class rdkit.Chem.rdSubstructLibrary.TautomerPatternHolder((object)self)

Bases: FPHolderBase

Holds tautomeric fingerprints with optional, user-defined number of bits (default: 2048) used for filtering of molecules. These fingerprints are designed to be used with TautomerQueries.

C++ signature :

void __init__(_object*)

__init__( (object)self, (int)numBits) -> None :

C++ signature :

void __init__(_object*,unsigned int)