chemfp.openeye_toolkit module

The chemfp toolkit API wrapper for OpenEye’s OEChem toolkit.

This module is also available as chemfp.openeye.

chemfp.openeye_toolkit.name

The string “openeye”.

chemfp.openeye_toolkit.software

The string used in output file metadata to describe this version of OEChem. For example, “OEChem/20220607”.

chemfp.openeye_toolkit.is_available

True if the OEChem toolkit is available, otherwise False.

This mostly used as chemfp.openeye.is_available as this module cannot be imported if OEChem is not available.

chemfp.openeye_toolkit.circular

The available version of the ‘OpenEye-Circular’ fingerprint type, for example, an instance of chemfp.openeye_types.OpenEyeCircularFingerprintType_v2 with the full type:

OpenEye-Circular/2 numbits=4096 minradius=0 maxradius=5 atype=Arom|AtmNum|Chiral|EqHalo|FCharge|HCount btype=Order
chemfp.openeye_toolkit.maccs166

The available version of the ‘OpenEye-MACCS166’ fingerprint type, for example, an instance of chemfp.openeye_types.OpenEyeMACCSFingerprintType_v3 with the full type:

OpenEye-MACCS166/3
chemfp.openeye_toolkit.mdl_screen

The available version of the ‘OpenEye-MDLScreen’ fingerprint type, for example, an instance of chemfp.openeye_types.OpenEyeMDLScreenFingerprintType_v1 with the full type:

OpenEye-MDLScreen/1
chemfp.openeye_toolkit.molecule_screen

The available version of the ‘OpenEye-MoleculeScreen’ fingerprint type, for example, an instance of chemfp.openeye_types.OpenEyeMoleculeScreenFingerprintType_v1 with the full type:

OpenEye-MoleculeScreen/1
chemfp.openeye_toolkit.path

The available version of the ‘OpenEye-Path’ fingerprint type, for example, an instance of chemfp.openeye_types.OpenEyePathFingerprintType_v2 with the full type:

OpenEye-Path/2 numbits=4096 minbonds=0 maxbonds=5 atype=Arom|AtmNum|Chiral|EqHalo|FCharge|HvyDeg|Hyb btype=Order|Chiral
chemfp.openeye_toolkit.smarts_screen

The available version of the ‘OpenEye-SMARTSScreen’ fingerprint type, for example, an instance of chemfp.openeye_types.OpenEyeSMARTSScreenFingerprintType_v1 with the full type:

OpenEye-SMARTSScreen/1
chemfp.openeye_toolkit.tree

The available version of the ‘OpenEye-Tree’ fingerprint type, for example, an instance of chemfp.openeye_types.OpenEyeTreeFingerprintType_v2 with the full type:

OpenEye-Tree/2 numbits=4096 minbonds=0 maxbonds=4 atype=Arom|AtmNum|Chiral|FCharge|HvyDeg|Hyb btype=Order
chemfp.openeye_toolkit.add_tag(mol, tag, value)

Add an SD tag value to the OEChem molecule

Parameters:
  • mol (an OEChem molecule) – the molecule

  • tag (string) – the SD tag name

  • value (string) – the text for the tag

Returns:

None

chemfp.openeye_toolkit.copy_molecule(mol)

Return a new OEGraphMol which is a copy of the given OEChem molecule

Parameters:

mol (an Open Babel molecule) – the molecule to copy

Returns:

a new OBMol instance

chemfp.openeye_toolkit.create_bytes(mol, format, id=None, writer_args=None, errors='strict', level=None)

Convert an OEChem molecule into a structure record in the given format as a byte string

If id is not None then use it instead of the molecule’s own title. Warning: this may briefly modify the molecule, so may not be thread-safe.

Parameters:
  • mol (an OEChem molecule) – the molecule to use for the output

  • format (a format name string, or Format object) – the output structure format

  • id (a string, or None to use the molecule's own id) – an alternate record id

  • writer_args (a dictionary) – writer arguments passed to the underlying toolkit

  • errors (one of "strict", "report", or "ignore") – specify how to handle errors

  • level (None, a positive integer, or one of the strings 'min', 'default', or 'max') – compression level to use for compressed formats

Returns:

a string

chemfp.openeye_toolkit.create_fasta(mol: Any, *, id: str | None = None, flavor: int | str | None = 'Default', errors: str = 'strict') str | None

Generate a FASTA record from an OEChem molecule

This is equivalent to calling:

create_string(mol, "fasta", id=id, writer_args={...}, errors=errors)

Available bit flag flavors are:

  • ‘Default’ = 0

Parameters:
  • mol (an OEChem molecule) – a molecule object

  • id (None or a string (default: None)) – an alternate identifier for the output record, if relevant

  • flavor (None, integer or string with "|"- or ","-separated terms (default: "Default")) – Output flavor bit flags

  • errors (one of "strict", "ignore", or "log") – specify how to handle errors

Returns:

a string, or None if errors are ignored

chemfp.openeye_toolkit.create_inchi(mol: Any, *, id: str | None = None, flavor: int | str | None = 'Default', delimiter: Literal['to-eol', 'space', 'tab', 'comma', 'whitespace', 'native', ' ', '\t'] | None = None, include_id: bool = True, errors: str = 'strict') str | None

Generate an InChI string and its id from an OEChem molecule

This is equivalent to calling:

create_string(mol, "inchi", id=id, writer_args={...}, errors=errors)

Available bit flag flavors are:

  • ‘Chiral’ = 4

  • ‘FixedHLayer’ = 8

  • ‘Hydrogens’ = 1

  • ‘ReconnectedMetals’ = 16

  • ‘Stereo’ = 2 (in default bit flags)

  • ‘RelativeStereo’ = 32

  • ‘RacemicStereo’ = 64

  • ‘Default’ = 130 (in default bit flags)

Parameters:
  • mol (an OEChem molecule) – a molecule object

  • id (None or a string (default: None)) – an alternate identifier for the output record, if relevant

  • flavor (None, integer or string with "|"- or ","-separated terms (default: "Default")) – Output flavor bit flags

  • delimiter (One of None, 'to-eol', 'space', 'tab', 'comma', 'whitespace', 'native', or the space or tab characters (default: None)) – The separator between the SMILES and the id

  • include_id (Boolean (default: True)) – if true, include the molecule id in the output

  • errors (one of "strict", "ignore", or "log") – specify how to handle errors

Returns:

a string, or None if errors are ignored

chemfp.openeye_toolkit.create_inchikey(mol: Any, *, id: str | None = None, flavor: int | str | None = 'Default', delimiter: Literal['to-eol', 'space', 'tab', 'comma', 'whitespace', 'native', ' ', '\t'] | None = None, include_id: bool = True, errors: str = 'strict') str | None

Generate an InChIKey string and its id from an OEChem molecule

This is equivalent to calling:

create_string(mol, "inchikey", id=id, writer_args={...}, errors=errors)

Available bit flag flavors are:

  • ‘Chiral’ = 4

  • ‘FixedHLayer’ = 8

  • ‘Hydrogens’ = 1

  • ‘RacemicStereo’ = 64

  • ‘ReconnectedMetals’ = 16

  • ‘RelativeStereo’ = 32

  • ‘Stereo’ = 2 (in default bit flags)

  • ‘Default’ = 130 (in default bit flags)

Parameters:
  • mol (an OEChem molecule) – a molecule object

  • id (None or a string (default: None)) – an alternate identifier for the output record, if relevant

  • flavor (None, integer or string with "|"- or ","-separated terms (default: "Default")) – Output flavor bit flags

  • delimiter (One of None, 'to-eol', 'space', 'tab', 'comma', 'whitespace', 'native', or the space or tab characters (default: None)) – The separator between the SMILES and the id

  • include_id (Boolean (default: True)) – if true, include the molecule id in the output

  • errors (one of "strict", "ignore", or "log") – specify how to handle errors

Returns:

a string, or None if errors are ignored

chemfp.openeye_toolkit.create_inchikeystring(mol: Any, *, id: str | None = None, flavor: int | str | None = 'Default', delimiter: Literal['to-eol', 'space', 'tab', 'comma', 'whitespace', 'native', ' ', '\t'] | None = None, errors: str = 'strict') str | None

Generate an InChIKey string from an OEChem molecule

This is equivalent to calling:

create_string(mol, "inchikeystring", id=id, writer_args={...}, errors=errors)

Available bit flag flavors are:

  • ‘Chiral’ = 4

  • ‘FixedHLayer’ = 8

  • ‘Hydrogens’ = 1

  • ‘RacemicStereo’ = 64

  • ‘ReconnectedMetals’ = 16

  • ‘RelativeStereo’ = 32

  • ‘Stereo’ = 2 (in default bit flags)

  • ‘Default’ = 130 (in default bit flags)

Parameters:
  • mol (an OEChem molecule) – a molecule object

  • id (None or a string (default: None)) – an alternate identifier for the output record, if relevant

  • flavor (None, integer or string with "|"- or ","-separated terms (default: "Default")) – Output flavor bit flags

  • delimiter (One of None, 'to-eol', 'space', 'tab', 'comma', 'whitespace', 'native', or the space or tab characters (default: None)) – The separator between the SMILES and the id

  • errors (one of "strict", "ignore", or "log") – specify how to handle errors

Returns:

a string, or None if errors are ignored

chemfp.openeye_toolkit.create_inchistring(mol: Any, *, id: str | None = None, flavor: int | str | None = 'Default', delimiter: Literal['to-eol', 'space', 'tab', 'comma', 'whitespace', 'native', ' ', '\t'] | None = None, errors: str = 'strict') str | None

Generate an InChI string from an OEChem molecule

This is equivalent to calling:

create_string(mol, "inchistring", id=id, writer_args={...}, errors=errors)

Available bit flag flavors are:

  • ‘Chiral’ = 4

  • ‘FixedHLayer’ = 8

  • ‘Hydrogens’ = 1

  • ‘ReconnectedMetals’ = 16

  • ‘Stereo’ = 2 (in default bit flags)

  • ‘RelativeStereo’ = 32

  • ‘RacemicStereo’ = 64

  • ‘Default’ = 130 (in default bit flags)

Parameters:
  • mol (an OEChem molecule) – a molecule object

  • id (None or a string (default: None)) – an alternate identifier for the output record, if relevant

  • flavor (None, integer or string with "|"- or ","-separated terms (default: "Default")) – Output flavor bit flags

  • delimiter (One of None, 'to-eol', 'space', 'tab', 'comma', 'whitespace', 'native', or the space or tab characters (default: None)) – The separator between the SMILES and the id

  • errors (one of "strict", "ignore", or "log") – specify how to handle errors

Returns:

a string, or None if errors are ignored

chemfp.openeye_toolkit.create_molfile(mol: Any, *, id: str | None = None, aromaticity: Literal['none', 'openeye', 'daylight', 'tripos', 'mdl', 'mmff'] | None = None, flavor: int | str | None = 'Default', errors: str = 'strict') str | None

Generate a molfile record from an OEChem molecule

This is equivalent to calling:

create_string(mol, "molfile", id=id, writer_args={...}, errors=errors)

Available bit flag flavors are:

  • ‘Add2D’ = 256 (in default bit flags)

  • ‘CurrentParity’ = 64

  • ‘MCHG’ = 1 (in default bit flags)

  • ‘MDLParity’ = 16 (in default bit flags)

  • ‘MISO’ = 2 (in default bit flags)

  • ‘MRGP’ = 4 (in default bit flags)

  • ‘MV30’ = 8

  • ‘NoParity’ = 32

  • ‘SuppressImp2ExpENHSTE’ = 1024

  • ‘SuppressTimestamps’ = 2048

  • ‘UnsetBad2DStereo’ = 512

  • ‘Default’ = 279 (in default bit flags)

Parameters:
  • mol (an OEChem molecule) – a molecule object

  • id (None or a string (default: None)) – an alternate identifier for the output record, if relevant

  • aromaticity (None, or one of 'none', 'openeye', 'daylight', 'tripos', 'mdl', or 'mmff') – The OEChem aromaticity model

  • flavor (None, integer or string with "|"- or ","-separated terms (default: "Default")) – Output flavor bit flags

  • errors (one of "strict", "ignore", or "log") – specify how to handle errors

Returns:

a string, or None if errors are ignored

chemfp.openeye_toolkit.create_oez(mol: Any, *, id: str | None = None, flavor: int | str | None = 'Default', errors: str = 'strict') str | None

Generate an OEZ string from an OEChem molecule

This is equivalent to calling:

create_string(mol, "oez", id=id, writer_args={...}, errors=errors)

Available bit flag flavors are:

  • ‘Default’ = 0

Parameters:
  • mol (an OEChem molecule) – a molecule object

  • id (None or a string (default: None)) – an alternate identifier for the output record, if relevant

  • flavor (None, integer or string with "|"- or ","-separated terms (default: "Default")) – Output flavor bit flags

  • errors (one of "strict", "ignore", or "log") – specify how to handle errors

Returns:

a string, or None if errors are ignored

chemfp.openeye_toolkit.create_pdb(mol: Any, *, id: str | None = None, flavor: int | str | None = 'Default', errors: str = 'strict') str | None

Generate a PDB record from an OEChem molecule

This is equivalent to calling:

create_string(mol, "pdb", id=id, writer_args={...}, errors=errors)

Available bit flag flavors are:

  • ‘BONDS’ = 1

  • ‘BOTH’ = 4 (in default bit flags)

  • ‘CHARGE’ = 8

  • ‘CurrentResidues’ = 2048

  • ‘DELPHI’ = 24

  • ‘ELEMENT’ = 64 (in default bit flags)

  • ‘FORMALCHARGE’ = 128

  • ‘FormalCrg’ = 128

  • ‘HETBONDS’ = 256 (in default bit flags)

  • ‘NoResidues’ = 1024

  • ‘OEResidues’ = 512 (in default bit flags)

  • ‘ORDERS’ = 2

  • ‘OrderAtoms’ = 4096 (in default bit flags)

  • ‘RADIUS’ = 16

  • ‘TER’ = 32

  • ‘Default’ = 4932 (in default bit flags)

Parameters:
  • mol (an OEChem molecule) – a molecule object

  • id (None or a string (default: None)) – an alternate identifier for the output record, if relevant

  • flavor (None, integer or string with "|"- or ","-separated terms (default: "Default")) – Output flavor bit flags

  • errors (one of "strict", "ignore", or "log") – specify how to handle errors

Returns:

a string, or None if errors are ignored

chemfp.openeye_toolkit.create_sdf(mol: Any, *, id: str | None = None, aromaticity: Literal['none', 'openeye', 'daylight', 'tripos', 'mdl', 'mmff'] | None = None, flavor: int | str | None = 'Default', errors: str = 'strict') str | None

Generate an SDF record from an OEChem molecule

This is equivalent to calling:

create_string(mol, "sdf", id=id, writer_args={...}, errors=errors)

Available bit flag flavors are:

  • ‘Add2D’ = 256 (in default bit flags)

  • ‘CurrentParity’ = 64

  • ‘MCHG’ = 1 (in default bit flags)

  • ‘MDLParity’ = 16 (in default bit flags)

  • ‘MISO’ = 2 (in default bit flags)

  • ‘MRGP’ = 4 (in default bit flags)

  • ‘MV30’ = 8 (in default bit flags)

  • ‘NoParity’ = 32

  • ‘SuppressImp2ExpENHSTE’ = 1024

  • ‘SuppressTimestamps’ = 2048

  • ‘UnsetBad2DStereo’ = 512

  • ‘Default’ = 287 (in default bit flags)

Parameters:
  • mol (an OEChem molecule) – a molecule object

  • id (None or a string (default: None)) – an alternate identifier for the output record, if relevant

  • aromaticity (None, or one of 'none', 'openeye', 'daylight', 'tripos', 'mdl', or 'mmff') – The OEChem aromaticity model

  • flavor (None, integer or string with "|"- or ","-separated terms (default: "Default")) – Output flavor bit flags

  • errors (one of "strict", "ignore", or "log") – specify how to handle errors

Returns:

a string, or None if errors are ignored

chemfp.openeye_toolkit.create_sdf3k(mol: Any, *, id: str | None = None, aromaticity: Literal['none', 'openeye', 'daylight', 'tripos', 'mdl', 'mmff'] | None = None, flavor: int | str | None = 'Default', errors: str = 'strict') str | None

Generate an SDF record in V3000 format from an OEChem molecule

This is equivalent to calling:

create_string(mol, "sdf3k", id=id, writer_args={...}, errors=errors)

Available bit flag flavors are:

  • ‘Add2D’ = 256 (in default bit flags)

  • ‘CurrentParity’ = 64

  • ‘MCHG’ = 1 (in default bit flags)

  • ‘MDLParity’ = 16 (in default bit flags)

  • ‘MISO’ = 2 (in default bit flags)

  • ‘MRGP’ = 4 (in default bit flags)

  • ‘MV30’ = 8 (in default bit flags)

  • ‘NoParity’ = 32

  • ‘SuppressImp2ExpENHSTE’ = 1024

  • ‘SuppressTimestamps’ = 2048

  • ‘UnsetBad2DStereo’ = 512

  • ‘Default’ = 287 (in default bit flags)

Parameters:
  • mol (an OEChem molecule) – a molecule object

  • id (None or a string (default: None)) – an alternate identifier for the output record, if relevant

  • aromaticity (None, or one of 'none', 'openeye', 'daylight', 'tripos', 'mdl', or 'mmff') – The OEChem aromaticity model

  • flavor (None, integer or string with "|"- or ","-separated terms (default: "Default")) – Output flavor bit flags

  • errors (one of "strict", "ignore", or "log") – specify how to handle errors

Returns:

a string, or None if errors are ignored

chemfp.openeye_toolkit.create_sequence(mol: Any, *, id: str | None = None, flavor: int | str | None = 'Default', errors: str = 'strict') str | None

Generate an IUPAC sequence from an OEChem molecule

This is equivalent to calling:

create_string(mol, "sequence", id=id, writer_args={...}, errors=errors)

Available bit flag flavors are:

  • ‘Default’ = 0

Parameters:
  • mol (an OEChem molecule) – a molecule object

  • id (None or a string (default: None)) – an alternate identifier for the output record, if relevant

  • flavor (None, integer or string with "|"- or ","-separated terms (default: "Default")) – Output flavor bit flags

  • errors (one of "strict", "ignore", or "log") – specify how to handle errors

Returns:

a string, or None if errors are ignored

chemfp.openeye_toolkit.create_smi(mol: Any, *, id: str | None = None, aromaticity: Literal['none', 'openeye', 'daylight', 'tripos', 'mdl', 'mmff'] | None = None, flavor: int | str | None = 'Default', cxsmiles: bool = False, errors: str = 'strict') str | None

Generate a SMILES string and its id from an OEChem molecule

This is equivalent to calling:

create_string(mol, "smi", id=id, writer_args={...}, errors=errors)

Available bit flag flavors are:

  • ‘AllBonds’ = 2048

  • ‘AtomMaps’ = 32 (in default bit flags)

  • ‘AtomStereo’ = 8 (in default bit flags)

  • ‘BondStereo’ = 16 (in default bit flags)

  • ‘Canonical’ = 64 (in default bit flags)

  • ‘EnhStereo’ = 4096

  • ‘ExtBonds’ = 512

  • ‘Hydrogens’ = 2

  • ‘ImpHCount’ = 1024

  • ‘Isotopes’ = 1 (in default bit flags)

  • ‘Kekule’ = 128

  • ‘RGroups’ = 4 (in default bit flags)

  • ‘SuperAtoms’ = 256

  • ‘Default’ = 125 (in default bit flags)

Parameters:
  • mol (an OEChem molecule) – a molecule object

  • id (None or a string (default: None)) – an alternate identifier for the output record, if relevant

  • aromaticity (None, or one of 'none', 'openeye', 'daylight', 'tripos', 'mdl', or 'mmff') – The OEChem aromaticity model

  • flavor (None, integer or string with "|"- or ","-separated terms (default: "Default")) – Output flavor bit flags

  • cxsmiles (Boolean (default: False)) – If true, set the output flavor generate ChemAxon CXSMILES extensions

  • errors (one of "strict", "ignore", or "log") – specify how to handle errors

Returns:

a string, or None if errors are ignored

chemfp.openeye_toolkit.create_smiles(mol: Any, *, id: str | None = None, aromaticity: Literal['none', 'openeye', 'daylight', 'tripos', 'mdl', 'mmff'] | None = None, flavor: int | str | None = 'Default', cxsmiles: bool = False, errors: str = 'strict') str | None

Generate a SMILES string and its id from an OEChem molecule

This is equivalent to calling:

create_string(mol, "smi", id=id, writer_args={...}, errors=errors)

Available bit flag flavors are:

  • ‘AllBonds’ = 2048

  • ‘AtomMaps’ = 32 (in default bit flags)

  • ‘AtomStereo’ = 8 (in default bit flags)

  • ‘BondStereo’ = 16 (in default bit flags)

  • ‘Canonical’ = 64 (in default bit flags)

  • ‘EnhStereo’ = 4096

  • ‘ExtBonds’ = 512

  • ‘Hydrogens’ = 2

  • ‘ImpHCount’ = 1024

  • ‘Isotopes’ = 1 (in default bit flags)

  • ‘Kekule’ = 128

  • ‘RGroups’ = 4 (in default bit flags)

  • ‘SuperAtoms’ = 256

  • ‘Default’ = 125 (in default bit flags)

Parameters:
  • mol (an OEChem molecule) – a molecule object

  • id (None or a string (default: None)) – an alternate identifier for the output record, if relevant

  • aromaticity (None, or one of 'none', 'openeye', 'daylight', 'tripos', 'mdl', or 'mmff') – The OEChem aromaticity model

  • flavor (None, integer or string with "|"- or ","-separated terms (default: "Default")) – Output flavor bit flags

  • cxsmiles (Boolean (default: False)) – If true, set the output flavor generate ChemAxon CXSMILES extensions

  • errors (one of "strict", "ignore", or "log") – specify how to handle errors

Returns:

a string, or None if errors are ignored

chemfp.openeye_toolkit.create_smistring(mol: Any, *, id: str | None = None, aromaticity: Literal['none', 'openeye', 'daylight', 'tripos', 'mdl', 'mmff'] | None = None, flavor: int | str | None = 'Default', cxsmiles: bool = False, errors: str = 'strict') str | None

Generate a SMILES string from an OEChem molecule

This is equivalent to calling:

create_string(mol, "smistring", id=id, writer_args={...}, errors=errors)

Available bit flag flavors are:

  • ‘AllBonds’ = 2048

  • ‘AtomMaps’ = 32 (in default bit flags)

  • ‘AtomStereo’ = 8 (in default bit flags)

  • ‘BondStereo’ = 16 (in default bit flags)

  • ‘Canonical’ = 64 (in default bit flags)

  • ‘EnhStereo’ = 4096

  • ‘ExtBonds’ = 512

  • ‘Hydrogens’ = 2

  • ‘ImpHCount’ = 1024

  • ‘Isotopes’ = 1 (in default bit flags)

  • ‘Kekule’ = 128

  • ‘RGroups’ = 4 (in default bit flags)

  • ‘SuperAtoms’ = 256

  • ‘Default’ = 125 (in default bit flags)

Parameters:
  • mol (an OEChem molecule) – a molecule object

  • id (None or a string (default: None)) – an alternate identifier for the output record, if relevant

  • aromaticity (None, or one of 'none', 'openeye', 'daylight', 'tripos', 'mdl', or 'mmff') – The OEChem aromaticity model

  • flavor (None, integer or string with "|"- or ","-separated terms (default: "Default")) – Output flavor bit flags

  • cxsmiles (Boolean (default: False)) – If true, set the output flavor generate ChemAxon CXSMILES extensions

  • errors (one of "strict", "ignore", or "log") – specify how to handle errors

Returns:

a string, or None if errors are ignored

chemfp.openeye_toolkit.create_string(mol, format, id=None, writer_args=None, errors='strict')

Convert an OEChem molecule into a structure record in the given format as a Unicode string

If id is not None then use it instead of the molecule’s own title. Warning: this may briefly modify the molecule, so may not be thread-safe.

Parameters:
  • mol (an OEChem molecule) – the molecule to use for the output

  • format (a format name string, or Format object) – the output structure format

  • id (a string, or None to use the molecule's own id) – an alternate record id

  • writer_args (a dictionary) – writer arguments passed to the underlying toolkit

  • errors (one of "strict", "report", or "ignore") – specify how to handle errors

Returns:

a string

chemfp.openeye_toolkit.get_format(format)

Get the named format, or raise a ValueError

This will raise a ValueError if OEChem does not implement the format format_name or that format is not available.

Parameters:

format_name (a string) – the format name

Returns:

a chemfp.base_toolkit.Format object

chemfp.openeye_toolkit.get_formats(include_unavailable=False)

Get the list of structure formats that OEChem supports

If include_unavailable is True then also include OEChem formats which aren’t available to this specific version of OEChem.

Parameters:

include_unavailable (True or False) – include unavailable formats?

Returns:

a list of chemfp.base_toolkit.Format objects

chemfp.openeye_toolkit.get_id(mol)

Get the molecule’s id using OEChem’s GetTitle()

Parameters:

mol (an OEChem molecule) – the molecule

Returns:

a string

chemfp.openeye_toolkit.get_input_format(format)

Get the named input format, or raise a ValueError

This will raise a ValueError if OEChem does not implement the format format_name or that format is not an input format.

Parameters:

format_name (a string) – the format name

Returns:

a chemfp.base_toolkit.Format object

chemfp.openeye_toolkit.get_input_format_from_source(source=None, format=None)

Get the most appropriate format given the available source and format information

If format is a chemfp.base_toolkit.Format then return it. If it’s a Format-like object with “name” and “compression” attributes use it to make a real Format object with the same attributes. If it’s a string then use it to create a Format object.

If format is None, use the source to auto-detect the format. If auto-detection is not possible, assume it’s an uncompressed SMILES file.

Parameters:
  • source (a filename (as a string), a file object, or None to read from stdin) – the structure data source.

  • format (a Format(-like) object, string, or None) – format information, if known.

Returns:

a chemfp.base_toolkit.Format object

chemfp.openeye_toolkit.get_input_formats()

Get the list of supported OEChem input formats

Returns:

a list of chemfp.base_toolkit.Format objects

chemfp.openeye_toolkit.get_output_format(format)

Get the named format, or raise a ValueError

This will raise a ValueError if OEChem does not implement the format format_name or that format is not an output format.

Parameters:

format_name (a string) – the format name

Returns:

a chemfp.base_toolkit.Format object

chemfp.openeye_toolkit.get_output_format_from_destination(destination=None, format=None)

Get the most appropriate format given the available destination and format information

If format is a chemfp.base_toolkit.Format then return it. If it’s a Format-like object with “name” and “compression” attributes use it to make a real Format object with the same attributes. If it’s a string then use it to create a Format object.

If format is None, use the destination to auto-detect the format. If auto-detection is not possible, assume it’s an uncompressed SMILES file.

Parameters:
  • destination (a filename (as a string), a file object, or None to read from stdin) – the structure data source.

  • format (a Format(-like) object, string, or None) – format information, if known.

Returns:

a chemfp.base_toolkit.Format object

chemfp.openeye_toolkit.get_output_formats()

Get the list of supported OEChem output formats

Returns:

a list of chemfp.base_toolkit.Format objects

chemfp.openeye_toolkit.get_tag(mol, tag)

Get the named SD tag value, or None if it doesn’t exist

Parameters:
  • mol (an OEChem molecule) – the molecule

  • tag (string) – the SD tag name

Returns:

a string, or None

chemfp.openeye_toolkit.get_tag_pairs(mol)

Get a list of all SD tag (name, value) pairs for the molecule

Parameters:

mol (an OEChem molecule) – the molecule

Returns:

a list of (string name, string value) pairs

chemfp.openeye_toolkit.is_licensed()

Return True if the OEChem toolkit license is valid, otherwise False.

This does not check if the OEGraphSim license is valid. I haven’t yet figured out how I want to handle that distinction. In the meanwhile you’ll need to use the OEChem API yourself.

Returns:

True or False

chemfp.openeye_toolkit.is_valid_aromaticity(aromaticity_name)

Return True for known aromaticity names, otherwise False

chemfp.openeye_toolkit.make_id_and_molecule_parser(format, id_tag=None, reader_args=None, errors='strict')

Create a specialized function which takes a record and returns an (id, OEGraphMol) pair

The returned function is optimized for reading many records from individual strings because it only does parameter validation once. The function will reuse the OEGraphMol for successive calls, so make a copy if you want to keep it around. However, I haven’t really noticed much of a performance difference between this and chemfp.openeye_toolkit.parse_id_and_molecule() so I suggest you use that function directly instead of making a specialized function. (Let me know if making a specialized function is useful.)

See chemfp.openeye_toolkit.read_molecules() for details about the other parameters.

Parameters:
  • format (a format name string, or Format object) – the input structure format

  • id_tag (string, or None to use the record title) – SD tag containing the record id

  • reader_args (a dictionary) – reader arguments passed to the underlying toolkit

  • errors (one of "strict", "report", or "ignore") – specify how to handle errors

Returns:

a function of the form parser(record string) -> (id, OEGraphMol)

chemfp.openeye_toolkit.open_fasta_writer(destination: None | str | BinaryIO, *, flavor: int | str | None = 'Default', errors: str = 'strict')

Open a FASTA file to write OEChem molecules

This is mostly equivalent to calling:

open_molecule_writer(destination, "fasta", writer_args={...}, errors=errors)

along with compression based on the destination filename’s extension.

Available bit flag flavors are:

  • ‘Default’ = 0

Parameters:
  • destination (None, a filename string, or a file-like object) – where to write the molecules

  • flavor (None, integer or string with "|"- or ","-separated terms (default: "Default")) – Output flavor bit flags

  • errors (one of "strict", "ignore", or "log") – specify how to handle errors

Returns:

a chemfp.base_toolkit.MoleculeWriter expecting OEChem molecules

chemfp.openeye_toolkit.open_fasta_writer_to_string(*, flavor: int | str | None = 'Default', errors: str = 'strict')

Open a FASTA file to write OEChem molecules to an in-memory string

This is equivalent to calling:

open_molecule_writer_to_string("fasta", writer_args={...}, errors=errors)

Use write_molecules_to_string() to write compressed output.

Available bit flag flavors are:

  • ‘Default’ = 0

Parameters:
  • flavor (None, integer or string with "|"- or ","-separated terms (default: "Default")) – Output flavor bit flags

  • errors (one of "strict", "ignore", or "log") – specify how to handle errors

Returns:

a chemfp.base_toolkit.MoleculeWriter expecting OEChem molecules

chemfp.openeye_toolkit.open_inchi_writer(destination: None | str | BinaryIO, *, flavor: int | str | None = 'Default', delimiter: Literal['to-eol', 'space', 'tab', 'comma', 'whitespace', 'native', ' ', '\t'] | None = None, include_id: bool = True, errors: str = 'strict')

Open an InChI file (with InChI and optional id) to write OEChem molecules

This is mostly equivalent to calling:

open_molecule_writer(destination, "inchi", writer_args={...}, errors=errors)

along with compression based on the destination filename’s extension.

Available bit flag flavors are:

  • ‘Chiral’ = 4

  • ‘FixedHLayer’ = 8

  • ‘Hydrogens’ = 1

  • ‘ReconnectedMetals’ = 16

  • ‘Stereo’ = 2 (in default bit flags)

  • ‘RelativeStereo’ = 32

  • ‘RacemicStereo’ = 64

  • ‘Default’ = 130 (in default bit flags)

Parameters:
  • destination (None, a filename string, or a file-like object) – where to write the molecules

  • flavor (None, integer or string with "|"- or ","-separated terms (default: "Default")) – Output flavor bit flags

  • delimiter (One of None, 'to-eol', 'space', 'tab', 'comma', 'whitespace', 'native', or the space or tab characters (default: None)) – The separator between the SMILES and the id

  • include_id (Boolean (default: True)) – if true, include the molecule id in the output

  • errors (one of "strict", "ignore", or "log") – specify how to handle errors

Returns:

a chemfp.base_toolkit.MoleculeWriter expecting OEChem molecules

chemfp.openeye_toolkit.open_inchi_writer_to_string(*, flavor: int | str | None = 'Default', delimiter: Literal['to-eol', 'space', 'tab', 'comma', 'whitespace', 'native', ' ', '\t'] | None = None, include_id: bool = True, errors: str = 'strict')

Open an InChI file (with InChI and optional id) to write OEChem molecules to an in-memory string

This is equivalent to calling:

open_molecule_writer_to_string("inchi", writer_args={...}, errors=errors)

Use write_molecules_to_string() to write compressed output.

Available bit flag flavors are:

  • ‘Chiral’ = 4

  • ‘FixedHLayer’ = 8

  • ‘Hydrogens’ = 1

  • ‘ReconnectedMetals’ = 16

  • ‘Stereo’ = 2 (in default bit flags)

  • ‘RelativeStereo’ = 32

  • ‘RacemicStereo’ = 64

  • ‘Default’ = 130 (in default bit flags)

Parameters:
  • flavor (None, integer or string with "|"- or ","-separated terms (default: "Default")) – Output flavor bit flags

  • delimiter (One of None, 'to-eol', 'space', 'tab', 'comma', 'whitespace', 'native', or the space or tab characters (default: None)) – The separator between the SMILES and the id

  • include_id (Boolean (default: True)) – if true, include the molecule id in the output

  • errors (one of "strict", "ignore", or "log") – specify how to handle errors

Returns:

a chemfp.base_toolkit.MoleculeWriter expecting OEChem molecules

chemfp.openeye_toolkit.open_inchikey_writer(destination: None | str | BinaryIO, *, flavor: int | str | None = 'Default', delimiter: Literal['to-eol', 'space', 'tab', 'comma', 'whitespace', 'native', ' ', '\t'] | None = None, include_id: bool = True, errors: str = 'strict')

Open an InChIKey file (with InChIKey and optional id) to write OEChem molecules

This is mostly equivalent to calling:

open_molecule_writer(destination, "inchikey", writer_args={...}, errors=errors)

along with compression based on the destination filename’s extension.

Available bit flag flavors are:

  • ‘Chiral’ = 4

  • ‘FixedHLayer’ = 8

  • ‘Hydrogens’ = 1

  • ‘RacemicStereo’ = 64

  • ‘ReconnectedMetals’ = 16

  • ‘RelativeStereo’ = 32

  • ‘Stereo’ = 2 (in default bit flags)

  • ‘Default’ = 130 (in default bit flags)

Parameters:
  • destination (None, a filename string, or a file-like object) – where to write the molecules

  • flavor (None, integer or string with "|"- or ","-separated terms (default: "Default")) – Output flavor bit flags

  • delimiter (One of None, 'to-eol', 'space', 'tab', 'comma', 'whitespace', 'native', or the space or tab characters (default: None)) – The separator between the SMILES and the id

  • include_id (Boolean (default: True)) – if true, include the molecule id in the output

  • errors (one of "strict", "ignore", or "log") – specify how to handle errors

Returns:

a chemfp.base_toolkit.MoleculeWriter expecting OEChem molecules

chemfp.openeye_toolkit.open_inchikey_writer_to_string(*, flavor: int | str | None = 'Default', delimiter: Literal['to-eol', 'space', 'tab', 'comma', 'whitespace', 'native', ' ', '\t'] | None = None, include_id: bool = True, errors: str = 'strict')

Open an InChIKey file (with InChIKey and optional id) to write OEChem molecules to an in-memory string

This is equivalent to calling:

open_molecule_writer_to_string("inchikey", writer_args={...}, errors=errors)

Use write_molecules_to_string() to write compressed output.

Available bit flag flavors are:

  • ‘Chiral’ = 4

  • ‘FixedHLayer’ = 8

  • ‘Hydrogens’ = 1

  • ‘RacemicStereo’ = 64

  • ‘ReconnectedMetals’ = 16

  • ‘RelativeStereo’ = 32

  • ‘Stereo’ = 2 (in default bit flags)

  • ‘Default’ = 130 (in default bit flags)

Parameters:
  • flavor (None, integer or string with "|"- or ","-separated terms (default: "Default")) – Output flavor bit flags

  • delimiter (One of None, 'to-eol', 'space', 'tab', 'comma', 'whitespace', 'native', or the space or tab characters (default: None)) – The separator between the SMILES and the id

  • include_id (Boolean (default: True)) – if true, include the molecule id in the output

  • errors (one of "strict", "ignore", or "log") – specify how to handle errors

Returns:

a chemfp.base_toolkit.MoleculeWriter expecting OEChem molecules

chemfp.openeye_toolkit.open_molecule_writer(destination=None, format=None, writer_args=None, errors='strict', location=None, encoding='utf8', encoding_errors='strict', level=None)

Return a MoleculeWriter which can write OEChem molecules to a destination.

A chemfp.base_toolkit.MoleculeWriter has the methods write_molecule, write_molecules, and write_ids_and_molecules, which are ways to write an OEChem molecule, an OEChem molecule iterator, or an (id, OEChem molecule) pair iterator to a file.

Molecules are written to destination. The output format can be a string like “sdf.gz” or “smi”, a chemfp.base_toolkit.Format, or Format-like object with “name” and “compression” attributes, or None to auto-detect based on the destination. If auto-detection is not possible, the output will be written as uncompressed SMILES.

The writer_args dictionary parameters depend on the format. Every OEChem format supports:

  • aromaticity - one of “default”, “openeye”, “daylight”, “tripos”, “mdl”, “mmff”, or None

  • flavor - a number, string-encoded number, or flavor string

A “flavor string” is a “|” or “,” separated list of format-specific flavor terms. It can be as simple as “Default”, or a more complex string like DEFAULT|-AtomStereo|-BondStero|Canonical to generate a canonical SMILES string without stereo information.

The supported output flavor terms for each format are:

  • SMILES - AtomMaps, AtomStereo, BondStereo, Canonical, ExtBonds, Hydrogens, ImpHCount, Isotopes, Kekule, RGroups, SuperAtoms

  • sdf - CurrentParity, MCHG, MDLParity, MISO, MRGP, MV30, NoParity, Default

  • mol2, mol2h - AtomNames, AtomTypeNames, BondTypeNames, Hydrogens, OrderAtoms, Substructure, Default

  • sln - Default

  • pdb - BONDS, BOTH, CHARGE, CurrentResidues, DELPHI, ELEMENT, FORMALCHARGE, FormalCrg, HETBONDS, NoResidues, OEResidues, ORDERS, OrderAtoms, RADIUS, TER, Default

  • xyz - Charges, Symbols, Default

  • cdx - Default

  • mopac - CHARGES, XYZ, Default

  • mf - Title, Default

  • oeb - Default

  • inchi, inchikey - Chiral, FixedHLayer, Hydrogens, ReconnectedMetals, Stereo, RelativeStereo, RacemicStereo, Default

You can also pass in a numeric value like 123 or a numeric string like “0”.

The errors parameter specifies how to handle errors. “strict” raises an exception, “report” sends a message to stderr and goes to the next record, and “ignore” goes to the next record.

The location parameter takes a chemfp.io.Location instance. If None then a default Location will be created.

Parameters:
  • destination (a filename, file object, or None to write to stdout) – the structure destination

  • format (a format name string, or Format(-like) object, or None to auto-detect) – the output structure format

  • writer_args (a dictionary) – writer parameters passed to the underlying toolkit

  • errors (one of "strict", "report", or "ignore") – specify how to handle errors

  • location (a chemfp.io.Location object, or None) – object used to track writer state information

  • level (None, a positive integer, or one of the strings 'min', 'default', or 'max') – compression level to use for compressed formats (does not affect OEChem)

Returns:

a chemfp.base_toolkit.MoleculeWriter expecting OEChem molecules

chemfp.openeye_toolkit.open_molecule_writer_to_bytes(format, writer_args=None, errors='strict', location=None, level=None)

Return a MoleculeStringWriter which can write OEChem molecule records to a byte string.

See chemfp.openeye_toolkit.open_molecule_writer() for full parameter details.

Use the writer’s chemfp.base_toolkit.MoleculeStringWriter.getvalue() to get the output string as a byte string.

Parameters:
  • format (a format name string, or Format(-like) object, or None to auto-detect) – the output structure format

  • writer_args (a dictionary) – writer arguments passed to the underlying toolkit

  • errors (one of "strict", "report", or "ignore") – specify how to handle errors

  • location (a chemfp.io.Location object, or None) – object used to track writer state information

  • level (None, a positive integer, or one of the strings 'min', 'default', or 'max') – compression level to use for compressed formats (does not affect OEChem)

Returns:

a chemfp.base_toolkit.MoleculeStringWriter expecting OEChem molecules

chemfp.openeye_toolkit.open_molecule_writer_to_string(format, writer_args=None, errors='strict', location=None)

Return a MoleculeStringWriter which can write OEChem molecule records to a Unicode string.

See chemfp.openeye_toolkit.open_molecule_writer() for full parameter details.

Use the writer’s chemfp.base_toolkit.MoleculeStringWriter.getvalue() to get the output string as a Unicode string.

Parameters:
  • format (a format name string, or Format(-like) object, or None to auto-detect) – the output structure format

  • writer_args (a dictionary) – writer arguments passed to the underlying toolkit

  • errors (one of "strict", "report", or "ignore") – specify how to handle errors

  • location (a chemfp.io.Location object, or None) – object used to track writer state information

Returns:

a chemfp.base_toolkit.MoleculeStringWriter expecting OEChem molecules

chemfp.openeye_toolkit.open_oez_writer(destination: None | str | BinaryIO, *, flavor: int | str | None = 'Default', errors: str = 'strict')

Open an OEX file to write OEChem molecules

This is mostly equivalent to calling:

open_molecule_writer(destination, "oez", writer_args={...}, errors=errors)

along with compression based on the destination filename’s extension.

Available bit flag flavors are:

  • ‘Default’ = 0

Parameters:
  • destination (None, a filename string, or a file-like object) – where to write the molecules

  • flavor (None, integer or string with "|"- or ","-separated terms (default: "Default")) – Output flavor bit flags

  • errors (one of "strict", "ignore", or "log") – specify how to handle errors

Returns:

a chemfp.base_toolkit.MoleculeWriter expecting OEChem molecules

chemfp.openeye_toolkit.open_oez_writer_to_string(*, flavor: int | str | None = 'Default', errors: str = 'strict')

Open an OEX file to write OEChem molecules to an in-memory string

This is equivalent to calling:

open_molecule_writer_to_string("oez", writer_args={...}, errors=errors)

Use write_molecules_to_string() to write compressed output.

Available bit flag flavors are:

  • ‘Default’ = 0

Parameters:
  • flavor (None, integer or string with "|"- or ","-separated terms (default: "Default")) – Output flavor bit flags

  • errors (one of "strict", "ignore", or "log") – specify how to handle errors

Returns:

a chemfp.base_toolkit.MoleculeWriter expecting OEChem molecules

chemfp.openeye_toolkit.open_pdb_writer(destination: None | str | BinaryIO, *, flavor: int | str | None = 'Default', errors: str = 'strict')

Open a PDB file to write OEChem molecules

This is mostly equivalent to calling:

open_molecule_writer(destination, "pdb", writer_args={...}, errors=errors)

along with compression based on the destination filename’s extension.

Available bit flag flavors are:

  • ‘BONDS’ = 1

  • ‘BOTH’ = 4 (in default bit flags)

  • ‘CHARGE’ = 8

  • ‘CurrentResidues’ = 2048

  • ‘DELPHI’ = 24

  • ‘ELEMENT’ = 64 (in default bit flags)

  • ‘FORMALCHARGE’ = 128

  • ‘FormalCrg’ = 128

  • ‘HETBONDS’ = 256 (in default bit flags)

  • ‘NoResidues’ = 1024

  • ‘OEResidues’ = 512 (in default bit flags)

  • ‘ORDERS’ = 2

  • ‘OrderAtoms’ = 4096 (in default bit flags)

  • ‘RADIUS’ = 16

  • ‘TER’ = 32

  • ‘Default’ = 4932 (in default bit flags)

Parameters:
  • destination (None, a filename string, or a file-like object) – where to write the molecules

  • flavor (None, integer or string with "|"- or ","-separated terms (default: "Default")) – Output flavor bit flags

  • errors (one of "strict", "ignore", or "log") – specify how to handle errors

Returns:

a chemfp.base_toolkit.MoleculeWriter expecting OEChem molecules

chemfp.openeye_toolkit.open_pdb_writer_to_string(*, flavor: int | str | None = 'Default', errors: str = 'strict')

Open a PDB file to write OEChem molecules to an in-memory string

This is equivalent to calling:

open_molecule_writer_to_string("pdb", writer_args={...}, errors=errors)

Use write_molecules_to_string() to write compressed output.

Available bit flag flavors are:

  • ‘BONDS’ = 1

  • ‘BOTH’ = 4 (in default bit flags)

  • ‘CHARGE’ = 8

  • ‘CurrentResidues’ = 2048

  • ‘DELPHI’ = 24

  • ‘ELEMENT’ = 64 (in default bit flags)

  • ‘FORMALCHARGE’ = 128

  • ‘FormalCrg’ = 128

  • ‘HETBONDS’ = 256 (in default bit flags)

  • ‘NoResidues’ = 1024

  • ‘OEResidues’ = 512 (in default bit flags)

  • ‘ORDERS’ = 2

  • ‘OrderAtoms’ = 4096 (in default bit flags)

  • ‘RADIUS’ = 16

  • ‘TER’ = 32

  • ‘Default’ = 4932 (in default bit flags)

Parameters:
  • flavor (None, integer or string with "|"- or ","-separated terms (default: "Default")) – Output flavor bit flags

  • errors (one of "strict", "ignore", or "log") – specify how to handle errors

Returns:

a chemfp.base_toolkit.MoleculeWriter expecting OEChem molecules

chemfp.openeye_toolkit.open_sdf3k_writer(destination: None | str | BinaryIO, *, aromaticity: Literal['none', 'openeye', 'daylight', 'tripos', 'mdl', 'mmff'] | None = None, flavor: int | str | None = 'Default', errors: str = 'strict')

Open an SDF file in V3000 format to write OEChem molecules

This is mostly equivalent to calling:

open_molecule_writer(destination, "sdf3k", writer_args={...}, errors=errors)

along with compression based on the destination filename’s extension.

Available bit flag flavors are:

  • ‘Add2D’ = 256 (in default bit flags)

  • ‘CurrentParity’ = 64

  • ‘MCHG’ = 1 (in default bit flags)

  • ‘MDLParity’ = 16 (in default bit flags)

  • ‘MISO’ = 2 (in default bit flags)

  • ‘MRGP’ = 4 (in default bit flags)

  • ‘MV30’ = 8 (in default bit flags)

  • ‘NoParity’ = 32

  • ‘SuppressImp2ExpENHSTE’ = 1024

  • ‘SuppressTimestamps’ = 2048

  • ‘UnsetBad2DStereo’ = 512

  • ‘Default’ = 287 (in default bit flags)

Parameters:
  • destination (None, a filename string, or a file-like object) – where to write the molecules

  • aromaticity (None, or one of 'none', 'openeye', 'daylight', 'tripos', 'mdl', or 'mmff') – The OEChem aromaticity model

  • flavor (None, integer or string with "|"- or ","-separated terms (default: "Default")) – Output flavor bit flags

  • errors (one of "strict", "ignore", or "log") – specify how to handle errors

Returns:

a chemfp.base_toolkit.MoleculeWriter expecting OEChem molecules

chemfp.openeye_toolkit.open_sdf3k_writer_to_string(*, aromaticity: Literal['none', 'openeye', 'daylight', 'tripos', 'mdl', 'mmff'] | None = None, flavor: int | str | None = 'Default', errors: str = 'strict')

Open an SDF file in V3000 format to write OEChem molecules to an in-memory string

This is equivalent to calling:

open_molecule_writer_to_string("sdf3k", writer_args={...}, errors=errors)

Use write_molecules_to_string() to write compressed output.

Available bit flag flavors are:

  • ‘Add2D’ = 256 (in default bit flags)

  • ‘CurrentParity’ = 64

  • ‘MCHG’ = 1 (in default bit flags)

  • ‘MDLParity’ = 16 (in default bit flags)

  • ‘MISO’ = 2 (in default bit flags)

  • ‘MRGP’ = 4 (in default bit flags)

  • ‘MV30’ = 8 (in default bit flags)

  • ‘NoParity’ = 32

  • ‘SuppressImp2ExpENHSTE’ = 1024

  • ‘SuppressTimestamps’ = 2048

  • ‘UnsetBad2DStereo’ = 512

  • ‘Default’ = 287 (in default bit flags)

Parameters:
  • aromaticity (None, or one of 'none', 'openeye', 'daylight', 'tripos', 'mdl', or 'mmff') – The OEChem aromaticity model

  • flavor (None, integer or string with "|"- or ","-separated terms (default: "Default")) – Output flavor bit flags

  • errors (one of "strict", "ignore", or "log") – specify how to handle errors

Returns:

a chemfp.base_toolkit.MoleculeWriter expecting OEChem molecules

chemfp.openeye_toolkit.open_sdf_writer(destination: None | str | BinaryIO, *, aromaticity: Literal['none', 'openeye', 'daylight', 'tripos', 'mdl', 'mmff'] | None = None, flavor: int | str | None = 'Default', errors: str = 'strict')

Open an SDF file to write OEChem molecules

This is mostly equivalent to calling:

open_molecule_writer(destination, "sdf", writer_args={...}, errors=errors)

along with compression based on the destination filename’s extension.

Available bit flag flavors are:

  • ‘Add2D’ = 256 (in default bit flags)

  • ‘CurrentParity’ = 64

  • ‘MCHG’ = 1 (in default bit flags)

  • ‘MDLParity’ = 16 (in default bit flags)

  • ‘MISO’ = 2 (in default bit flags)

  • ‘MRGP’ = 4 (in default bit flags)

  • ‘MV30’ = 8 (in default bit flags)

  • ‘NoParity’ = 32

  • ‘SuppressImp2ExpENHSTE’ = 1024

  • ‘SuppressTimestamps’ = 2048

  • ‘UnsetBad2DStereo’ = 512

  • ‘Default’ = 287 (in default bit flags)

Parameters:
  • destination (None, a filename string, or a file-like object) – where to write the molecules

  • aromaticity (None, or one of 'none', 'openeye', 'daylight', 'tripos', 'mdl', or 'mmff') – The OEChem aromaticity model

  • flavor (None, integer or string with "|"- or ","-separated terms (default: "Default")) – Output flavor bit flags

  • errors (one of "strict", "ignore", or "log") – specify how to handle errors

Returns:

a chemfp.base_toolkit.MoleculeWriter expecting OEChem molecules

chemfp.openeye_toolkit.open_sdf_writer_to_string(*, aromaticity: Literal['none', 'openeye', 'daylight', 'tripos', 'mdl', 'mmff'] | None = None, flavor: int | str | None = 'Default', errors: str = 'strict')

Open an SDF file to write OEChem molecules to an in-memory string

This is equivalent to calling:

open_molecule_writer_to_string("sdf", writer_args={...}, errors=errors)

Use write_molecules_to_string() to write compressed output.

Available bit flag flavors are:

  • ‘Add2D’ = 256 (in default bit flags)

  • ‘CurrentParity’ = 64

  • ‘MCHG’ = 1 (in default bit flags)

  • ‘MDLParity’ = 16 (in default bit flags)

  • ‘MISO’ = 2 (in default bit flags)

  • ‘MRGP’ = 4 (in default bit flags)

  • ‘MV30’ = 8 (in default bit flags)

  • ‘NoParity’ = 32

  • ‘SuppressImp2ExpENHSTE’ = 1024

  • ‘SuppressTimestamps’ = 2048

  • ‘UnsetBad2DStereo’ = 512

  • ‘Default’ = 287 (in default bit flags)

Parameters:
  • aromaticity (None, or one of 'none', 'openeye', 'daylight', 'tripos', 'mdl', or 'mmff') – The OEChem aromaticity model

  • flavor (None, integer or string with "|"- or ","-separated terms (default: "Default")) – Output flavor bit flags

  • errors (one of "strict", "ignore", or "log") – specify how to handle errors

Returns:

a chemfp.base_toolkit.MoleculeWriter expecting OEChem molecules

chemfp.openeye_toolkit.open_smi_writer(destination: None | str | BinaryIO, *, aromaticity: Literal['none', 'openeye', 'daylight', 'tripos', 'mdl', 'mmff'] | None = None, flavor: int | str | None = 'Default', cxsmiles: bool = False, errors: str = 'strict')

Open a SMILES file to write OEChem molecules

This is mostly equivalent to calling:

open_molecule_writer(destination, "smi", writer_args={...}, errors=errors)

along with compression based on the destination filename’s extension.

Available bit flag flavors are:

  • ‘AllBonds’ = 2048

  • ‘AtomMaps’ = 32 (in default bit flags)

  • ‘AtomStereo’ = 8 (in default bit flags)

  • ‘BondStereo’ = 16 (in default bit flags)

  • ‘Canonical’ = 64 (in default bit flags)

  • ‘EnhStereo’ = 4096

  • ‘ExtBonds’ = 512

  • ‘Hydrogens’ = 2

  • ‘ImpHCount’ = 1024

  • ‘Isotopes’ = 1 (in default bit flags)

  • ‘Kekule’ = 128

  • ‘RGroups’ = 4 (in default bit flags)

  • ‘SuperAtoms’ = 256

  • ‘Default’ = 125 (in default bit flags)

Parameters:
  • destination (None, a filename string, or a file-like object) – where to write the molecules

  • aromaticity (None, or one of 'none', 'openeye', 'daylight', 'tripos', 'mdl', or 'mmff') – The OEChem aromaticity model

  • flavor (None, integer or string with "|"- or ","-separated terms (default: "Default")) – Output flavor bit flags

  • cxsmiles (Boolean (default: False)) – If true, set the output flavor generate ChemAxon CXSMILES extensions

  • errors (one of "strict", "ignore", or "log") – specify how to handle errors

Returns:

a chemfp.base_toolkit.MoleculeWriter expecting OEChem molecules

chemfp.openeye_toolkit.open_smi_writer_to_string(*, aromaticity: Literal['none', 'openeye', 'daylight', 'tripos', 'mdl', 'mmff'] | None = None, flavor: int | str | None = 'Default', cxsmiles: bool = False, errors: str = 'strict')

Open a SMILES file to write OEChem molecules to an in-memory string

This is equivalent to calling:

open_molecule_writer_to_string("smi", writer_args={...}, errors=errors)

Use write_molecules_to_string() to write compressed output.

Available bit flag flavors are:

  • ‘AllBonds’ = 2048

  • ‘AtomMaps’ = 32 (in default bit flags)

  • ‘AtomStereo’ = 8 (in default bit flags)

  • ‘BondStereo’ = 16 (in default bit flags)

  • ‘Canonical’ = 64 (in default bit flags)

  • ‘EnhStereo’ = 4096

  • ‘ExtBonds’ = 512

  • ‘Hydrogens’ = 2

  • ‘ImpHCount’ = 1024

  • ‘Isotopes’ = 1 (in default bit flags)

  • ‘Kekule’ = 128

  • ‘RGroups’ = 4 (in default bit flags)

  • ‘SuperAtoms’ = 256

  • ‘Default’ = 125 (in default bit flags)

Parameters:
  • aromaticity (None, or one of 'none', 'openeye', 'daylight', 'tripos', 'mdl', or 'mmff') – The OEChem aromaticity model

  • flavor (None, integer or string with "|"- or ","-separated terms (default: "Default")) – Output flavor bit flags

  • cxsmiles (Boolean (default: False)) – If true, set the output flavor generate ChemAxon CXSMILES extensions

  • errors (one of "strict", "ignore", or "log") – specify how to handle errors

Returns:

a chemfp.base_toolkit.MoleculeWriter expecting OEChem molecules

chemfp.openeye_toolkit.parse_fasta(content: str | bytes, *, flavor: int | str | None = 'Default', errors: str = 'strict')

Parse a FASTA record using the OEChem toolkit

This is equivalent to calling:

parse_molecule(content, "fasta", reader_args={...}, errors=errors)

Available bit flag flavors are:

  • ‘CustomResidues’ = 1

  • ‘EmbeddedSMILES’ = 2

  • ‘Default’ = 0

Parameters:
  • flavor (None, integer or string with "|"- or ","-separated terms (default: "Default")) – Input flavor bit flags

  • errors (one of "strict", "ignore", or "log") – specify how to handle errors

Returns:

an OEChem molecule object

chemfp.openeye_toolkit.parse_id_and_molecule(content, format, id_tag=None, reader_args=None, errors='strict')

Parse the first structure record from content and return the (id, OEGraphMol) pair.

content is a string containing a single structure record in format format. (Additional records are ignored). See chemfp.openeye_toolkit.read_molecules() for details about the other parameters.

See chemfp.openeye_toolkit.read_molecules() for details about the other parameters. See chemfp.openeye_toolkit.parse_molecule() if just want the OEGraphMol molecule and not the the (id, OEGraphMol) pair.

Parameters:
  • content (a string) – the string containing a structure record

  • format (a format name string, or Format object) – the input structure format

  • id_tag (string, or None to use the record title) – SD tag containing the record id

  • reader_args (a dictionary) – reader arguments passed to the underlying toolkit

  • errors (one of "strict", "report", or "ignore") – specify how to handle errors

Returns:

an (id, OEGraphMol molecule) pair

chemfp.openeye_toolkit.parse_inchi(content: str | bytes, *, aromaticity: Literal['none', 'openeye', 'daylight', 'tripos', 'mdl', 'mmff'] | None = None, flavor: int | str | None = 'Default', has_header: bool = False, delimiter: Literal['to-eol', 'space', 'tab', 'comma', 'whitespace', 'native', ' ', '\t'] | None = None, errors: str = 'strict')

Parse an InChI string and its id using the OEChem toolkit

This is equivalent to calling:

parse_molecule(content, "inchi", reader_args={...}, errors=errors)

Available bit flag flavors are:

  • ‘Default’ = 0

Parameters:
  • aromaticity (None, or one of 'none', 'openeye', 'daylight', 'tripos', 'mdl', or 'mmff') – The OEChem aromaticity model

  • flavor (None, integer or string with "|"- or ","-separated terms (default: "Default")) – Input flavor bit flags

  • has_header (Boolean (default: False)) – If true, treat the first line of the SMILES file as a header

  • delimiter (One of None, 'to-eol', 'space', 'tab', 'comma', 'whitespace', 'native', or the space or tab characters (default: None)) – The separator between the SMILES and the id

  • errors (one of "strict", "ignore", or "log") – specify how to handle errors

Returns:

an OEChem molecule object

chemfp.openeye_toolkit.parse_inchistring(content: str | bytes, *, flavor: int | str | None = 'Default', delimiter: Literal['to-eol', 'space', 'tab', 'comma', 'whitespace', 'native', ' ', '\t'] | None = None, errors: str = 'strict')

Parse an InChI string using the OEChem toolkit

This is equivalent to calling:

parse_molecule(content, "inchistring", reader_args={...}, errors=errors)

Available bit flag flavors are:

  • ‘Default’ = 0

Parameters:
  • flavor (None, integer or string with "|"- or ","-separated terms (default: "Default")) – Input flavor bit flags

  • delimiter (One of None, 'to-eol', 'space', 'tab', 'comma', 'whitespace', 'native', or the space or tab characters (default: None)) – The separator between the SMILES and the id

  • errors (one of "strict", "ignore", or "log") – specify how to handle errors

Returns:

an OEChem molecule object

chemfp.openeye_toolkit.parse_molecule(content, format, id_tag=None, reader_args=None, errors='strict')

Parse the first structure record from the content string and return an OEGraphMol molecule.

content is a string containing a single structure record in format format. (Additional records are ignored). See chemfp.openeye_toolkit.read_molecules() for details about the other parameters. See chemfp.openeye_toolkit.parse_id_and_molecule() if you want the (id, OEGraphMol) pair instead of just the molecule.

Parameters:
  • content (a string) – the string containing a structure record

  • format (a format name string, or Format object) – the input structure format

  • id_tag (string, or None to use the record title) – SD tag containing the record id

  • reader_args (a dictionary) – reader arguments passed to the underlying toolkit

  • errors (one of "strict", "report", or "ignore") – specify how to handle errors

Returns:

an OEGraphMol molecule

chemfp.openeye_toolkit.parse_molfile(content: str | bytes, *, aromaticity: Literal['none', 'openeye', 'daylight', 'tripos', 'mdl', 'mmff'] | None = None, flavor: int | str | None = 'Default', errors: str = 'strict')

Parse a molfile record using the OEChem toolkit

This is equivalent to calling:

parse_molecule(content, "molfile", reader_args={...}, errors=errors)

Available bit flag flavors are:

  • ‘FixBondMarks’ = 1

  • ‘SuppressEmptyMolSkip’ = 4

  • ‘SuppressImp2ExpENHSTE’ = 2

  • ‘Default’ = 0

Parameters:
  • aromaticity (None, or one of 'none', 'openeye', 'daylight', 'tripos', 'mdl', or 'mmff') – The OEChem aromaticity model

  • flavor (None, integer or string with "|"- or ","-separated terms (default: "Default")) – Input flavor bit flags

  • errors (one of "strict", "ignore", or "log") – specify how to handle errors

Returns:

an OEChem molecule object

chemfp.openeye_toolkit.parse_oez(content: str | bytes, *, flavor: int | str | None = 'Default', errors: str = 'strict')

Parse an OEZ string using the OEChem toolkit

This is equivalent to calling:

parse_molecule(content, "oez", reader_args={...}, errors=errors)

Available bit flag flavors are:

  • ‘Default’ = 0

Parameters:
  • flavor (None, integer or string with "|"- or ","-separated terms (default: "Default")) – Input flavor bit flags

  • errors (one of "strict", "ignore", or "log") – specify how to handle errors

Returns:

an OEChem molecule object

chemfp.openeye_toolkit.parse_pdb(content: str | bytes, *, aromaticity: Literal['none', 'openeye', 'daylight', 'tripos', 'mdl', 'mmff'] | None = None, flavor: int | str | None = 'Default', errors: str = 'strict')

Parse a PDB record using the OEChem toolkit

This is equivalent to calling:

parse_molecule(content, "pdb", reader_args={...}, errors=errors)

Available bit flag flavors are:

  • ‘ALL’ = 8

  • ‘ALTLOC’ = 32526

  • ‘BondOrder’ = 1024 (in default bit flags)

  • ‘CHARGE’ = 32

  • ‘Connect’ = 4096 (in default bit flags)

  • ‘DATA’ = 16

  • ‘DELPHI’ = 96

  • ‘END’ = 2 (in default bit flags)

  • ‘ENDM’ = 4 (in default bit flags)

  • ‘FORMALCHARGE’ = 128

  • ‘FormalCrg’ = 256 (in default bit flags)

  • ‘ImplicitH’ = 512 (in default bit flags)

  • ‘RADIUS’ = 64

  • ‘Rings’ = 2048 (in default bit flags)

  • ‘SecStruct’ = 8192 (in default bit flags)

  • ‘TER’ = 1

  • ‘Default’ = 16134 (in default bit flags)

Parameters:
  • aromaticity (None, or one of 'none', 'openeye', 'daylight', 'tripos', 'mdl', or 'mmff') – The OEChem aromaticity model

  • flavor (None, integer or string with "|"- or ","-separated terms (default: "Default")) – Input flavor bit flags

  • errors (one of "strict", "ignore", or "log") – specify how to handle errors

Returns:

an OEChem molecule object

chemfp.openeye_toolkit.parse_sdf(content: str | bytes, *, aromaticity: Literal['none', 'openeye', 'daylight', 'tripos', 'mdl', 'mmff'] | None = None, flavor: int | str | None = 'Default', errors: str = 'strict')

Parse an SDF record using the OEChem toolkit

This is equivalent to calling:

parse_molecule(content, "sdf", reader_args={...}, errors=errors)

Available bit flag flavors are:

  • ‘FixBondMarks’ = 1

  • ‘SuppressEmptyMolSkip’ = 4

  • ‘SuppressImp2ExpENHSTE’ = 2

  • ‘Default’ = 0

Parameters:
  • aromaticity (None, or one of 'none', 'openeye', 'daylight', 'tripos', 'mdl', or 'mmff') – The OEChem aromaticity model

  • flavor (None, integer or string with "|"- or ","-separated terms (default: "Default")) – Input flavor bit flags

  • errors (one of "strict", "ignore", or "log") – specify how to handle errors

Returns:

an OEChem molecule object

chemfp.openeye_toolkit.parse_sequence(content: str | bytes, *, flavor: int | str | None = 'Default', errors: str = 'strict')

Parse an IUPAC sequence using the OEChem toolkit

This is equivalent to calling:

parse_molecule(content, "sequence", reader_args={...}, errors=errors)

Available bit flag flavors are:

  • ‘CustomResidues’ = 1

  • ‘EmbeddedSMILES’ = 2

  • ‘Default’ = 0

Parameters:
  • flavor (None, integer or string with "|"- or ","-separated terms (default: "Default")) – Input flavor bit flags

  • errors (one of "strict", "ignore", or "log") – specify how to handle errors

Returns:

an OEChem molecule object

chemfp.openeye_toolkit.parse_smi(content: str | bytes, *, aromaticity: Literal['none', 'openeye', 'daylight', 'tripos', 'mdl', 'mmff'] | None = None, flavor: int | str | None = 'Default', has_header: bool = False, delimiter: Literal['to-eol', 'space', 'tab', 'comma', 'whitespace', 'native', ' ', '\t'] | None = None, cxsmiles: bool = True, errors: str = 'strict')

Parse a SMILES string and its id using the OEChem toolkit

This is equivalent to calling:

parse_molecule(content, "smi", reader_args={...}, errors=errors)

Available bit flag flavors are:

  • ‘Canon’ = 2

  • ‘Strict’ = 1

  • ‘Default’ = 0

Parameters:
  • aromaticity (None, or one of 'none', 'openeye', 'daylight', 'tripos', 'mdl', or 'mmff') – The OEChem aromaticity model

  • flavor (None, integer or string with "|"- or ","-separated terms (default: "Default")) – Input flavor bit flags

  • has_header (Boolean (default: False)) – If true, treat the first line of the SMILES file as a header

  • delimiter (One of None, 'to-eol', 'space', 'tab', 'comma', 'whitespace', 'native', or the space or tab characters (default: None)) – The separator between the SMILES and the id

  • cxsmiles (Boolean (default: True)) – If true, look for ChemAxon CXSMILES extensions after the SMILES string

  • errors (one of "strict", "ignore", or "log") – specify how to handle errors

Returns:

an OEChem molecule object

chemfp.openeye_toolkit.parse_smiles(content: str | bytes, *, aromaticity: Literal['none', 'openeye', 'daylight', 'tripos', 'mdl', 'mmff'] | None = None, flavor: int | str | None = 'Default', has_header: bool = False, delimiter: Literal['to-eol', 'space', 'tab', 'comma', 'whitespace', 'native', ' ', '\t'] | None = None, cxsmiles: bool = True, errors: str = 'strict')

Parse a SMILES string and its id using the OEChem toolkit

This is equivalent to calling:

parse_molecule(content, "smi", reader_args={...}, errors=errors)

Available bit flag flavors are:

  • ‘Canon’ = 2

  • ‘Strict’ = 1

  • ‘Default’ = 0

Parameters:
  • aromaticity (None, or one of 'none', 'openeye', 'daylight', 'tripos', 'mdl', or 'mmff') – The OEChem aromaticity model

  • flavor (None, integer or string with "|"- or ","-separated terms (default: "Default")) – Input flavor bit flags

  • has_header (Boolean (default: False)) – If true, treat the first line of the SMILES file as a header

  • delimiter (One of None, 'to-eol', 'space', 'tab', 'comma', 'whitespace', 'native', or the space or tab characters (default: None)) – The separator between the SMILES and the id

  • cxsmiles (Boolean (default: True)) – If true, look for ChemAxon CXSMILES extensions after the SMILES string

  • errors (one of "strict", "ignore", or "log") – specify how to handle errors

Returns:

an OEChem molecule object

chemfp.openeye_toolkit.parse_smistring(content: str | bytes, *, aromaticity: Literal['none', 'openeye', 'daylight', 'tripos', 'mdl', 'mmff'] | None = None, flavor: int | str | None = 'Default', cxsmiles: bool = True, errors: str = 'strict')

Parse a SMILES string using the OEChem toolkit

This is equivalent to calling:

parse_molecule(content, "smistring", reader_args={...}, errors=errors)

Available bit flag flavors are:

  • ‘Canon’ = 2

  • ‘Strict’ = 1

  • ‘Default’ = 0

Parameters:
  • aromaticity (None, or one of 'none', 'openeye', 'daylight', 'tripos', 'mdl', or 'mmff') – The OEChem aromaticity model

  • flavor (None, integer or string with "|"- or ","-separated terms (default: "Default")) – Input flavor bit flags

  • cxsmiles (Boolean (default: True)) – If true, look for ChemAxon CXSMILES extensions after the SMILES string

  • errors (one of "strict", "ignore", or "log") – specify how to handle errors

Returns:

an OEChem molecule object

chemfp.openeye_toolkit.read_csv_ids_and_molecules(source, *, id_column=1, mol_column=2, dialect=None, has_header=True, compression='auto', format='smi', id_tag=None, reader_args=None, errors='report', csv_errors='strict', location=None, encoding='utf8', encoding_errors='strict')

Read ids and molecules from column(s) of a CSV file using OEChem.

Read from source, which may be a filename, a file-like object, or None to read from stdin.

Use id_column and mol_column to specify the columns containing the record identifier and molecule record. By default the identifiers come from column 1 (the first column) and the molecules from column 2 (the second column). Columns can be specified by integer position (starting with 1), or by a string matching the title from the header line. If id_column is None then the molecule id will come from parsing the molecule record.

Use dialect to specify the type of CSV file. The default of None infers the dialect from the filename extension; *.csv for comma-separated, and *.tsv for tab-separated. The dialect can be specified directly as “csv” or “tsv”, as a registered Python csv dialect at https://docs.python.org/3/library/csv.html (though “excel” is the same as “csv” and “excel-tab” is the same as “tsv”), or as a csv.Dialect or a .class:CSVDialect instance.

If has_header is True then the first line/record contains column titles, and if False then there are no column titles.

Use compression to specify how the file compression format. The default “auto” uses the filename extension. Other options are “gz” and “zst”, or the empty string “” to mean no compresssion.

Use format to specify the structure format for how to parse the molecule column. The default of ‘smi’ will parse it as a SMILES string and, if id_column=None, will also parse any identifier.

The id_tag and reader_args arguments contain additional format configuration parameters.

The errors and csv_errors describe how to handle failures in molecule parsing and CSV parsing, respectively. The default is to report molecule parse failures to stderr, and to stop parsing if a CSV row does not contain enough columns.

The location parameter takes a chemfp.io.Location instance. If None then a default Location will be created.

The encoding and encoding_errors are strings describing the input file character encoding, and how to handle decoding errors. See https://docs.python.org/3/library/codecs.html#error-handlers and https://docs.python.org/3/library/codecs.html#error-handlers for details.

Parameters:
  • source (a filename, file object, or None to read from stdin) – the CSV source

  • id_column (integer position (starting from 1), string, or None) – the column position or column title containing the identifier

  • mol_column (integer position (starting from 1), string) – the column position or column title containing the structure record

  • dialect (None, a string name, or a Dialect instance) – the CSV dialect

  • has_header (bool) – True if the first record contains titles, False of it does not

  • compression (string or None) – file compression format

  • format (a format name string, or Format object) – the molecule structure format

  • id_tag (string, or None to use the record title) – SD tag containing the record id

  • reader_args (a dictionary) – reader arguments passed to the underlying toolkit

  • errors (one of "strict", "report", or "ignore") – specify how to handle molecule parse errors

  • csv_errors (one of "strict", "report", or "ignore") – specify how to handle CSV errors

  • location (a chemfp.io.Location object, or None) – object used to track parser state information

  • encoding (string) – the name of the file’s character encoding

  • encoding_errors (string) – the method used handle decoding errors

Returns:

a chemfp.base_toolkit.IdAndMoleculeReader iterating (id, OEGraphMol) pairs

chemfp.openeye_toolkit.read_fasta_ids_and_molecules(source: None | str | BinaryIO, *, flavor: int | str | None = 'Default', errors: str = 'strict')

Read ids and molecules from a FASTA file using the OEChem toolkit

This is mostly equivalent to calling:

read_ids_and_molecules(source, "fasta", reader_args={...}, errors=errors)

along with decompression based on the source filename’s extension.

Available bit flag flavors are:

  • ‘CustomResidues’ = 1

  • ‘EmbeddedSMILES’ = 2

  • ‘Default’ = 0

Parameters:
  • flavor (None, integer or string with "|"- or ","-separated terms (default: "Default")) – Input flavor bit flags

  • errors (one of "strict", "ignore", or "log") – specify how to handle errors

Returns:

a chemfp.base_toolkit.IdAndMoleculeReader iterating OEChem molecules

chemfp.openeye_toolkit.read_fasta_ids_and_molecules_from_string(content: str | bytes, *, flavor: int | str | None = 'Default', errors: str = 'strict')

Read ids and molecules from a string containing a FASTA file using the OEChem toolkit

This is equivalent to calling:

read_ids_and_molecules_from_string(content, "fasta", reader_args={...}, errors=errors)

Use read_ids_and_molecules_from_string() if the content is compressed.

Available bit flag flavors are:

  • ‘CustomResidues’ = 1

  • ‘EmbeddedSMILES’ = 2

  • ‘Default’ = 0

Parameters:
  • flavor (None, integer or string with "|"- or ","-separated terms (default: "Default")) – Input flavor bit flags

  • errors (one of "strict", "ignore", or "log") – specify how to handle errors

Returns:

a chemfp.base_toolkit.IdAndMoleculeReader iterating OEChem molecules

chemfp.openeye_toolkit.read_fasta_molecules(source: None | str | BinaryIO, *, flavor: int | str | None = 'Default', errors: str = 'strict')

Read molecules from a FASTA file using the OEChem toolkit

This is mostly equivalent to calling:

read_molecules(source, "fasta", reader_args={...}, errors=errors)

along with decompression based on the source filename’s extension.

Available bit flag flavors are:

  • ‘CustomResidues’ = 1

  • ‘EmbeddedSMILES’ = 2

  • ‘Default’ = 0

Parameters:
  • flavor (None, integer or string with "|"- or ","-separated terms (default: "Default")) – Input flavor bit flags

  • errors (one of "strict", "ignore", or "log") – specify how to handle errors

Returns:

a chemfp.base_toolkit.MoleculeReader iterating OEChem molecules

chemfp.openeye_toolkit.read_fasta_molecules_from_string(content: str | bytes, *, flavor: int | str | None = 'Default', errors: str = 'strict')

Read molecules from a string containing a FASTA file using the OEChem toolkit

This is equivalent to calling:

read_molecules_from_string(content, "fasta", reader_args={...}, errors=errors)

Use read_molecules_from_string() if the content is compressed.

Available bit flag flavors are:

  • ‘CustomResidues’ = 1

  • ‘EmbeddedSMILES’ = 2

  • ‘Default’ = 0

Parameters:
  • flavor (None, integer or string with "|"- or ","-separated terms (default: "Default")) – Input flavor bit flags

  • errors (one of "strict", "ignore", or "log") – specify how to handle errors

Returns:

a chemfp.base_toolkit.MoleculeReader iterating OEChem molecules

chemfp.openeye_toolkit.read_ids_and_molecules(source=None, format=None, id_tag=None, reader_args=None, errors='strict', location=None, encoding='utf8', encoding_errors='strict')

Return an iterator that reads (id, OEGraphMol molecule) pairs from a structure file

See chemfp.openeye_toolkit.read_molecules() for full parameter details. The major difference is that this returns an iterator of (id, OEGraphMol) pairs instead of just the molecules.

Note: the reader will clear and reuse the OEGraphMol instance. Make a copy if you want to keep the molecule around.

Parameters:
  • source (a filename, file object, or None to read from stdin) – the structure source

  • format (a format name string, or Format object, or None to auto-detect) – the input structure format

  • id_tag (string, or None to use the record title) – SD tag containing the record id

  • reader_args (a dictionary) – reader arguments passed to the underlying toolkit

  • errors (one of "strict", "report", or "ignore") – specify how to handle errors

  • location (a chemfp.io.Location object, or None) – object used to track parser state information

Returns:

a chemfp.base_toolkit.IdAndMoleculeReader iterating (id, OEGraphMol) pairs

chemfp.openeye_toolkit.read_ids_and_molecules_from_string(content, format, id_tag=None, reader_args=None, errors='strict', location=None)

Return an iterator that reads (id, OEGraphMol) pairs from a string containing structure records

content is a string containing 0 or more records in the format format. See chemfp.openeye_toolkit.read_molecules() for details about the other parameters. See chemfp.openeye_toolkit.read_molecules_from_string() if you just want to read the OEGraphMol molecules instead of (id, OEGraphMol) pairs.

Note: the reader will clear and reuse the OEGraphMol instance. Make a copy if you want to keep the molecule around.

Parameters:
  • content (a string) – the string containing structure records

  • format (a format name string, or Format object) – the input structure format

  • id_tag (string, or None to use the record title) – SD tag containing the record id

  • reader_args (a dictionary) – reader arguments passed to the underlying toolkit

  • errors (one of "strict", "report", or "ignore") – specify how to handle errors

  • location (a chemfp.io.Location object, or None) – object used to track parser state information

Returns:

a chemfp.base_toolkit.IdAndMoleculeReader iterating (id, OEGraphMol) pairs

chemfp.openeye_toolkit.read_inchi_ids_and_molecules(source: None | str | BinaryIO, *, aromaticity: Literal['none', 'openeye', 'daylight', 'tripos', 'mdl', 'mmff'] | None = None, flavor: int | str | None = 'Default', has_header: bool = False, delimiter: Literal['to-eol', 'space', 'tab', 'comma', 'whitespace', 'native', ' ', '\t'] | None = None, errors: str = 'strict')

Read ids and molecules from an InChI file (with InChI and optional id) using the OEChem toolkit

This is mostly equivalent to calling:

read_ids_and_molecules(source, "inchi", reader_args={...}, errors=errors)

along with decompression based on the source filename’s extension.

Available bit flag flavors are:

  • ‘Default’ = 0

Parameters:
  • aromaticity (None, or one of 'none', 'openeye', 'daylight', 'tripos', 'mdl', or 'mmff') – The OEChem aromaticity model

  • flavor (None, integer or string with "|"- or ","-separated terms (default: "Default")) – Input flavor bit flags

  • has_header (Boolean (default: False)) – If true, treat the first line of the SMILES file as a header

  • delimiter (One of None, 'to-eol', 'space', 'tab', 'comma', 'whitespace', 'native', or the space or tab characters (default: None)) – The separator between the SMILES and the id

  • errors (one of "strict", "ignore", or "log") – specify how to handle errors

Returns:

a chemfp.base_toolkit.IdAndMoleculeReader iterating OEChem molecules

chemfp.openeye_toolkit.read_inchi_ids_and_molecules_from_string(content: str | bytes, *, aromaticity: Literal['none', 'openeye', 'daylight', 'tripos', 'mdl', 'mmff'] | None = None, flavor: int | str | None = 'Default', has_header: bool = False, delimiter: Literal['to-eol', 'space', 'tab', 'comma', 'whitespace', 'native', ' ', '\t'] | None = None, errors: str = 'strict')

Read ids and molecules from a string containing an InChI file (with InChI and optional id) using the OEChem toolkit

This is equivalent to calling:

read_ids_and_molecules_from_string(content, "inchi", reader_args={...}, errors=errors)

Use read_ids_and_molecules_from_string() if the content is compressed.

Available bit flag flavors are:

  • ‘Default’ = 0

Parameters:
  • aromaticity (None, or one of 'none', 'openeye', 'daylight', 'tripos', 'mdl', or 'mmff') – The OEChem aromaticity model

  • flavor (None, integer or string with "|"- or ","-separated terms (default: "Default")) – Input flavor bit flags

  • has_header (Boolean (default: False)) – If true, treat the first line of the SMILES file as a header

  • delimiter (One of None, 'to-eol', 'space', 'tab', 'comma', 'whitespace', 'native', or the space or tab characters (default: None)) – The separator between the SMILES and the id

  • errors (one of "strict", "ignore", or "log") – specify how to handle errors

Returns:

a chemfp.base_toolkit.IdAndMoleculeReader iterating OEChem molecules

chemfp.openeye_toolkit.read_inchi_molecules(source: None | str | BinaryIO, *, aromaticity: Literal['none', 'openeye', 'daylight', 'tripos', 'mdl', 'mmff'] | None = None, flavor: int | str | None = 'Default', has_header: bool = False, delimiter: Literal['to-eol', 'space', 'tab', 'comma', 'whitespace', 'native', ' ', '\t'] | None = None, errors: str = 'strict')

Read molecules from an InChI file (with InChI and optional id) using the OEChem toolkit

This is mostly equivalent to calling:

read_molecules(source, "inchi", reader_args={...}, errors=errors)

along with decompression based on the source filename’s extension.

Available bit flag flavors are:

  • ‘Default’ = 0

Parameters:
  • aromaticity (None, or one of 'none', 'openeye', 'daylight', 'tripos', 'mdl', or 'mmff') – The OEChem aromaticity model

  • flavor (None, integer or string with "|"- or ","-separated terms (default: "Default")) – Input flavor bit flags

  • has_header (Boolean (default: False)) – If true, treat the first line of the SMILES file as a header

  • delimiter (One of None, 'to-eol', 'space', 'tab', 'comma', 'whitespace', 'native', or the space or tab characters (default: None)) – The separator between the SMILES and the id

  • errors (one of "strict", "ignore", or "log") – specify how to handle errors

Returns:

a chemfp.base_toolkit.MoleculeReader iterating OEChem molecules

chemfp.openeye_toolkit.read_inchi_molecules_from_string(content: str | bytes, *, aromaticity: Literal['none', 'openeye', 'daylight', 'tripos', 'mdl', 'mmff'] | None = None, flavor: int | str | None = 'Default', has_header: bool = False, delimiter: Literal['to-eol', 'space', 'tab', 'comma', 'whitespace', 'native', ' ', '\t'] | None = None, errors: str = 'strict')

Read molecules from a string containing an InChI file (with InChI and optional id) using the OEChem toolkit

This is equivalent to calling:

read_molecules_from_string(content, "inchi", reader_args={...}, errors=errors)

Use read_molecules_from_string() if the content is compressed.

Available bit flag flavors are:

  • ‘Default’ = 0

Parameters:
  • aromaticity (None, or one of 'none', 'openeye', 'daylight', 'tripos', 'mdl', or 'mmff') – The OEChem aromaticity model

  • flavor (None, integer or string with "|"- or ","-separated terms (default: "Default")) – Input flavor bit flags

  • has_header (Boolean (default: False)) – If true, treat the first line of the SMILES file as a header

  • delimiter (One of None, 'to-eol', 'space', 'tab', 'comma', 'whitespace', 'native', or the space or tab characters (default: None)) – The separator between the SMILES and the id

  • errors (one of "strict", "ignore", or "log") – specify how to handle errors

Returns:

a chemfp.base_toolkit.MoleculeReader iterating OEChem molecules

chemfp.openeye_toolkit.read_molecules(source=None, format=None, id_tag=None, reader_args=None, errors='strict', location=None, encoding='utf8', encoding_errors='strict')

Return an iterator that reads OEGraphMol molecules from a structure file

Iterate through the format structure records in source. If format is None then auto-detect the format based on the source. For SD files, use id_tag to get the record id from the given SD tag instead of the title line. (read_molecules() will ignore the id_tag. It exists to make it easier to switch between reader functions.)

Note: the reader will clear and reuse the OEGraphMol instance. Make a copy if you want to keep the molecule around.

The reader_args dictionary parameters depend on the format. Every OEChem format supports:

  • aromaticity - one of “default”, “openeye”, “daylight”, “tripos”, “mdl”, “mmff”, or None

  • flavor - a number, string-encoded number, or flavor string

A “flavor string” is a “|” or “,” separated list of format-specific flavor terms. It can be a simple as “Default”, or a more complex string like “Default|-ENDM|DELPHI” which for the PDB reader starts with the default settings, removes the ENDM flavor, and adds the CHARGE and RADIUS flavors.

The supported input flavor terms for each format are:

  • SMILES - Canon, Strict, Default

  • sdf - Default

  • skc - Default

  • mol2, mol2h - M2H, Default

  • mmod - FormalCrg, Default

  • pdb - ALL, ALTLOC, BondOrder, CHARGE, Connect, DATA, DELPHI, END, ENDM, FORMALCHARGE, FormalCrg, ImplicitH, RADIUS, Rings, SecStruct, TER, TerMask, Default

  • xyz - BondOrder, Connect, FormalCrg, ImplicitH, Rings, Default

  • cdx - SuperAtoms, Default

  • oeb - Default

You can also pass in a numeric value like 123 or a numeric string like “0”.

In addition, the SMILES record readers have limited support for the “delimiter” reader_arg:

  • delimiter - one of “tab”, “space”, “to-eol”, the space or tab characters, or None

Note: the first whitespace after the SMILES string will always be treated as a delimiter.

The errors parameter specifies how to handle errors. “strict” raises an exception, “report” sends a message to stderr and goes to the next record, and “ignore” goes to the next record.

The location parameter takes a chemfp.io.Location instance. If None then a default Location will be created.

See chemfp.openeye_toolkit.read_ids_and_molecules() if you want (id, OEGraphMol) pairs instead of just the molecules.

Parameters:
  • source (a filename, file object, or None to read from stdin) – the structure source

  • format (a format name string, or Format object, or None to auto-detect) – the input structure format

  • id_tag (string, or None to use the record title) – SD tag containing the record id

  • reader_args (a dictionary) – reader parameters passed to the underlying toolkit

  • errors (one of "strict", "report", or "ignore") – specify how to handle errors

  • location (a chemfp.io.Location object, or None) – object used to track parser state information

Returns:

a chemfp.base_toolkit.MoleculeReader iterating OEGraphMol molecules

chemfp.openeye_toolkit.read_molecules_from_string(content, format, id_tag=None, reader_args=None, errors='strict', location=None)

Return an iterator that reads molecules from a string containing structure records

content is a string containing 0 or more records in the format format. See chemfp.openeye_toolkit.read_molecules() for details about the other parameters. See chemfp.openeye_toolkit.read_ids_and_molecules_from_string() if you want to read (id, OEGraphMol) pairs instead of just molecules.

Note: the reader will clear and reuse the OEGraphMol instance. Make a copy if you want to keep the molecule around.

Parameters:
  • content (a string) – the string containing structure records

  • format (a format name string, or Format object) – the input structure format

  • id_tag (string, or None to use the record title) – SD tag containing the record id

  • reader_args (a dictionary) – reader arguments passed to the underlying toolkit

  • errors (one of "strict", "report", or "ignore") – specify how to handle errors

  • location (a chemfp.io.Location object, or None) – object used to track parser state information

Returns:

a chemfp.base_toolkit.MoleculeReader iterating OEGraphMol molecules

chemfp.openeye_toolkit.read_oez_ids_and_molecules(source: None | str | BinaryIO, *, flavor: int | str | None = 'Default', errors: str = 'strict')

Read ids and molecules from an OEX file using the OEChem toolkit

This is mostly equivalent to calling:

read_ids_and_molecules(source, "oez", reader_args={...}, errors=errors)

along with decompression based on the source filename’s extension.

Available bit flag flavors are:

  • ‘Default’ = 0

Parameters:
  • flavor (None, integer or string with "|"- or ","-separated terms (default: "Default")) – Input flavor bit flags

  • errors (one of "strict", "ignore", or "log") – specify how to handle errors

Returns:

a chemfp.base_toolkit.IdAndMoleculeReader iterating OEChem molecules

chemfp.openeye_toolkit.read_oez_ids_and_molecules_from_string(content: str | bytes, *, flavor: int | str | None = 'Default', errors: str = 'strict')

Read ids and molecules from a string containing an OEX file using the OEChem toolkit

This is equivalent to calling:

read_ids_and_molecules_from_string(content, "oez", reader_args={...}, errors=errors)

Use read_ids_and_molecules_from_string() if the content is compressed.

Available bit flag flavors are:

  • ‘Default’ = 0

Parameters:
  • flavor (None, integer or string with "|"- or ","-separated terms (default: "Default")) – Input flavor bit flags

  • errors (one of "strict", "ignore", or "log") – specify how to handle errors

Returns:

a chemfp.base_toolkit.IdAndMoleculeReader iterating OEChem molecules

chemfp.openeye_toolkit.read_oez_molecules(source: None | str | BinaryIO, *, flavor: int | str | None = 'Default', errors: str = 'strict')

Read molecules from an OEX file using the OEChem toolkit

This is mostly equivalent to calling:

read_molecules(source, "oez", reader_args={...}, errors=errors)

along with decompression based on the source filename’s extension.

Available bit flag flavors are:

  • ‘Default’ = 0

Parameters:
  • flavor (None, integer or string with "|"- or ","-separated terms (default: "Default")) – Input flavor bit flags

  • errors (one of "strict", "ignore", or "log") – specify how to handle errors

Returns:

a chemfp.base_toolkit.MoleculeReader iterating OEChem molecules

chemfp.openeye_toolkit.read_oez_molecules_from_string(content: str | bytes, *, flavor: int | str | None = 'Default', errors: str = 'strict')

Read molecules from a string containing an OEX file using the OEChem toolkit

This is equivalent to calling:

read_molecules_from_string(content, "oez", reader_args={...}, errors=errors)

Use read_molecules_from_string() if the content is compressed.

Available bit flag flavors are:

  • ‘Default’ = 0

Parameters:
  • flavor (None, integer or string with "|"- or ","-separated terms (default: "Default")) – Input flavor bit flags

  • errors (one of "strict", "ignore", or "log") – specify how to handle errors

Returns:

a chemfp.base_toolkit.MoleculeReader iterating OEChem molecules

chemfp.openeye_toolkit.read_pdb_ids_and_molecules(source: None | str | BinaryIO, *, aromaticity: Literal['none', 'openeye', 'daylight', 'tripos', 'mdl', 'mmff'] | None = None, flavor: int | str | None = 'Default', errors: str = 'strict')

Read ids and molecules from a PDB file using the OEChem toolkit

This is mostly equivalent to calling:

read_ids_and_molecules(source, "pdb", reader_args={...}, errors=errors)

along with decompression based on the source filename’s extension.

Available bit flag flavors are:

  • ‘ALL’ = 8

  • ‘ALTLOC’ = 32526

  • ‘BondOrder’ = 1024 (in default bit flags)

  • ‘CHARGE’ = 32

  • ‘Connect’ = 4096 (in default bit flags)

  • ‘DATA’ = 16

  • ‘DELPHI’ = 96

  • ‘END’ = 2 (in default bit flags)

  • ‘ENDM’ = 4 (in default bit flags)

  • ‘FORMALCHARGE’ = 128

  • ‘FormalCrg’ = 256 (in default bit flags)

  • ‘ImplicitH’ = 512 (in default bit flags)

  • ‘RADIUS’ = 64

  • ‘Rings’ = 2048 (in default bit flags)

  • ‘SecStruct’ = 8192 (in default bit flags)

  • ‘TER’ = 1

  • ‘Default’ = 16134 (in default bit flags)

Parameters:
  • aromaticity (None, or one of 'none', 'openeye', 'daylight', 'tripos', 'mdl', or 'mmff') – The OEChem aromaticity model

  • flavor (None, integer or string with "|"- or ","-separated terms (default: "Default")) – Input flavor bit flags

  • errors (one of "strict", "ignore", or "log") – specify how to handle errors

Returns:

a chemfp.base_toolkit.IdAndMoleculeReader iterating OEChem molecules

chemfp.openeye_toolkit.read_pdb_ids_and_molecules_from_string(content: str | bytes, *, aromaticity: Literal['none', 'openeye', 'daylight', 'tripos', 'mdl', 'mmff'] | None = None, flavor: int | str | None = 'Default', errors: str = 'strict')

Read ids and molecules from a string containing a PDB file using the OEChem toolkit

This is equivalent to calling:

read_ids_and_molecules_from_string(content, "pdb", reader_args={...}, errors=errors)

Use read_ids_and_molecules_from_string() if the content is compressed.

Available bit flag flavors are:

  • ‘ALL’ = 8

  • ‘ALTLOC’ = 32526

  • ‘BondOrder’ = 1024 (in default bit flags)

  • ‘CHARGE’ = 32

  • ‘Connect’ = 4096 (in default bit flags)

  • ‘DATA’ = 16

  • ‘DELPHI’ = 96

  • ‘END’ = 2 (in default bit flags)

  • ‘ENDM’ = 4 (in default bit flags)

  • ‘FORMALCHARGE’ = 128

  • ‘FormalCrg’ = 256 (in default bit flags)

  • ‘ImplicitH’ = 512 (in default bit flags)

  • ‘RADIUS’ = 64

  • ‘Rings’ = 2048 (in default bit flags)

  • ‘SecStruct’ = 8192 (in default bit flags)

  • ‘TER’ = 1

  • ‘Default’ = 16134 (in default bit flags)

Parameters:
  • aromaticity (None, or one of 'none', 'openeye', 'daylight', 'tripos', 'mdl', or 'mmff') – The OEChem aromaticity model

  • flavor (None, integer or string with "|"- or ","-separated terms (default: "Default")) – Input flavor bit flags

  • errors (one of "strict", "ignore", or "log") – specify how to handle errors

Returns:

a chemfp.base_toolkit.IdAndMoleculeReader iterating OEChem molecules

chemfp.openeye_toolkit.read_pdb_molecules(source: None | str | BinaryIO, *, aromaticity: Literal['none', 'openeye', 'daylight', 'tripos', 'mdl', 'mmff'] | None = None, flavor: int | str | None = 'Default', errors: str = 'strict')

Read molecules from a PDB file using the OEChem toolkit

This is mostly equivalent to calling:

read_molecules(source, "pdb", reader_args={...}, errors=errors)

along with decompression based on the source filename’s extension.

Available bit flag flavors are:

  • ‘ALL’ = 8

  • ‘ALTLOC’ = 32526

  • ‘BondOrder’ = 1024 (in default bit flags)

  • ‘CHARGE’ = 32

  • ‘Connect’ = 4096 (in default bit flags)

  • ‘DATA’ = 16

  • ‘DELPHI’ = 96

  • ‘END’ = 2 (in default bit flags)

  • ‘ENDM’ = 4 (in default bit flags)

  • ‘FORMALCHARGE’ = 128

  • ‘FormalCrg’ = 256 (in default bit flags)

  • ‘ImplicitH’ = 512 (in default bit flags)

  • ‘RADIUS’ = 64

  • ‘Rings’ = 2048 (in default bit flags)

  • ‘SecStruct’ = 8192 (in default bit flags)

  • ‘TER’ = 1

  • ‘Default’ = 16134 (in default bit flags)

Parameters:
  • aromaticity (None, or one of 'none', 'openeye', 'daylight', 'tripos', 'mdl', or 'mmff') – The OEChem aromaticity model

  • flavor (None, integer or string with "|"- or ","-separated terms (default: "Default")) – Input flavor bit flags

  • errors (one of "strict", "ignore", or "log") – specify how to handle errors

Returns:

a chemfp.base_toolkit.MoleculeReader iterating OEChem molecules

chemfp.openeye_toolkit.read_pdb_molecules_from_string(content: str | bytes, *, aromaticity: Literal['none', 'openeye', 'daylight', 'tripos', 'mdl', 'mmff'] | None = None, flavor: int | str | None = 'Default', errors: str = 'strict')

Read molecules from a string containing a PDB file using the OEChem toolkit

This is equivalent to calling:

read_molecules_from_string(content, "pdb", reader_args={...}, errors=errors)

Use read_molecules_from_string() if the content is compressed.

Available bit flag flavors are:

  • ‘ALL’ = 8

  • ‘ALTLOC’ = 32526

  • ‘BondOrder’ = 1024 (in default bit flags)

  • ‘CHARGE’ = 32

  • ‘Connect’ = 4096 (in default bit flags)

  • ‘DATA’ = 16

  • ‘DELPHI’ = 96

  • ‘END’ = 2 (in default bit flags)

  • ‘ENDM’ = 4 (in default bit flags)

  • ‘FORMALCHARGE’ = 128

  • ‘FormalCrg’ = 256 (in default bit flags)

  • ‘ImplicitH’ = 512 (in default bit flags)

  • ‘RADIUS’ = 64

  • ‘Rings’ = 2048 (in default bit flags)

  • ‘SecStruct’ = 8192 (in default bit flags)

  • ‘TER’ = 1

  • ‘Default’ = 16134 (in default bit flags)

Parameters:
  • aromaticity (None, or one of 'none', 'openeye', 'daylight', 'tripos', 'mdl', or 'mmff') – The OEChem aromaticity model

  • flavor (None, integer or string with "|"- or ","-separated terms (default: "Default")) – Input flavor bit flags

  • errors (one of "strict", "ignore", or "log") – specify how to handle errors

Returns:

a chemfp.base_toolkit.MoleculeReader iterating OEChem molecules

chemfp.openeye_toolkit.read_sdf_ids_and_molecules(source: None | str | BinaryIO, *, id_tag: None | str = None, aromaticity: Literal['none', 'openeye', 'daylight', 'tripos', 'mdl', 'mmff'] | None = None, flavor: int | str | None = 'Default', errors: str = 'strict')

Read ids and molecules from an SDF file using the OEChem toolkit

This is mostly equivalent to calling:

read_ids_and_molecules(source, "sdf", id_tag=id_tag, reader_args={...}, errors=errors)

along with decompression based on the source filename’s extension.

Available bit flag flavors are:

  • ‘FixBondMarks’ = 1

  • ‘SuppressEmptyMolSkip’ = 4

  • ‘SuppressImp2ExpENHSTE’ = 2

  • ‘Default’ = 0

Parameters:
  • id_tag (a string, or None to use the title) – get the id from the named data item instead of using the record title

  • aromaticity (None, or one of 'none', 'openeye', 'daylight', 'tripos', 'mdl', or 'mmff') – The OEChem aromaticity model

  • flavor (None, integer or string with "|"- or ","-separated terms (default: "Default")) – Input flavor bit flags

  • errors (one of "strict", "ignore", or "log") – specify how to handle errors

Returns:

a chemfp.base_toolkit.IdAndMoleculeReader iterating OEChem molecules

chemfp.openeye_toolkit.read_sdf_ids_and_molecules_from_string(content: str | bytes, *, id_tag: None | str = None, aromaticity: Literal['none', 'openeye', 'daylight', 'tripos', 'mdl', 'mmff'] | None = None, flavor: int | str | None = 'Default', errors: str = 'strict')

Read ids and molecules from a string containing an SDF file using the OEChem toolkit

This is equivalent to calling:

read_ids_and_molecules_from_string(content, "sdf", id_tag=id_tag, reader_args={...}, errors=errors)

Use read_ids_and_molecules_from_string() if the content is compressed.

Available bit flag flavors are:

  • ‘FixBondMarks’ = 1

  • ‘SuppressEmptyMolSkip’ = 4

  • ‘SuppressImp2ExpENHSTE’ = 2

  • ‘Default’ = 0

Parameters:
  • id_tag (a string, or None to use the title) – get the id from the named data item instead of using the record title

  • aromaticity (None, or one of 'none', 'openeye', 'daylight', 'tripos', 'mdl', or 'mmff') – The OEChem aromaticity model

  • flavor (None, integer or string with "|"- or ","-separated terms (default: "Default")) – Input flavor bit flags

  • errors (one of "strict", "ignore", or "log") – specify how to handle errors

Returns:

a chemfp.base_toolkit.IdAndMoleculeReader iterating OEChem molecules

chemfp.openeye_toolkit.read_sdf_molecules(source: None | str | BinaryIO, *, aromaticity: Literal['none', 'openeye', 'daylight', 'tripos', 'mdl', 'mmff'] | None = None, flavor: int | str | None = 'Default', errors: str = 'strict')

Read molecules from an SDF file using the OEChem toolkit

This is mostly equivalent to calling:

read_molecules(source, "sdf", reader_args={...}, errors=errors)

along with decompression based on the source filename’s extension.

Available bit flag flavors are:

  • ‘FixBondMarks’ = 1

  • ‘SuppressEmptyMolSkip’ = 4

  • ‘SuppressImp2ExpENHSTE’ = 2

  • ‘Default’ = 0

Parameters:
  • aromaticity (None, or one of 'none', 'openeye', 'daylight', 'tripos', 'mdl', or 'mmff') – The OEChem aromaticity model

  • flavor (None, integer or string with "|"- or ","-separated terms (default: "Default")) – Input flavor bit flags

  • errors (one of "strict", "ignore", or "log") – specify how to handle errors

Returns:

a chemfp.base_toolkit.MoleculeReader iterating OEChem molecules

chemfp.openeye_toolkit.read_sdf_molecules_from_string(content: str | bytes, *, aromaticity: Literal['none', 'openeye', 'daylight', 'tripos', 'mdl', 'mmff'] | None = None, flavor: int | str | None = 'Default', errors: str = 'strict')

Read molecules from a string containing an SDF file using the OEChem toolkit

This is equivalent to calling:

read_molecules_from_string(content, "sdf", reader_args={...}, errors=errors)

Use read_molecules_from_string() if the content is compressed.

Available bit flag flavors are:

  • ‘FixBondMarks’ = 1

  • ‘SuppressEmptyMolSkip’ = 4

  • ‘SuppressImp2ExpENHSTE’ = 2

  • ‘Default’ = 0

Parameters:
  • aromaticity (None, or one of 'none', 'openeye', 'daylight', 'tripos', 'mdl', or 'mmff') – The OEChem aromaticity model

  • flavor (None, integer or string with "|"- or ","-separated terms (default: "Default")) – Input flavor bit flags

  • errors (one of "strict", "ignore", or "log") – specify how to handle errors

Returns:

a chemfp.base_toolkit.MoleculeReader iterating OEChem molecules

chemfp.openeye_toolkit.read_smi_ids_and_molecules(source: None | str | BinaryIO, *, aromaticity: Literal['none', 'openeye', 'daylight', 'tripos', 'mdl', 'mmff'] | None = None, flavor: int | str | None = 'Default', has_header: bool = False, delimiter: Literal['to-eol', 'space', 'tab', 'comma', 'whitespace', 'native', ' ', '\t'] | None = None, cxsmiles: bool = True, errors: str = 'strict')

Read ids and molecules from a SMILES file using the OEChem toolkit

This is mostly equivalent to calling:

read_ids_and_molecules(source, "smi", reader_args={...}, errors=errors)

along with decompression based on the source filename’s extension.

Available bit flag flavors are:

  • ‘Canon’ = 2

  • ‘Strict’ = 1

  • ‘Default’ = 0

Parameters:
  • aromaticity (None, or one of 'none', 'openeye', 'daylight', 'tripos', 'mdl', or 'mmff') – The OEChem aromaticity model

  • flavor (None, integer or string with "|"- or ","-separated terms (default: "Default")) – Input flavor bit flags

  • has_header (Boolean (default: False)) – If true, treat the first line of the SMILES file as a header

  • delimiter (One of None, 'to-eol', 'space', 'tab', 'comma', 'whitespace', 'native', or the space or tab characters (default: None)) – The separator between the SMILES and the id

  • cxsmiles (Boolean (default: True)) – If true, look for ChemAxon CXSMILES extensions after the SMILES string

  • errors (one of "strict", "ignore", or "log") – specify how to handle errors

Returns:

a chemfp.base_toolkit.IdAndMoleculeReader iterating OEChem molecules

chemfp.openeye_toolkit.read_smi_ids_and_molecules_from_string(content: str | bytes, *, aromaticity: Literal['none', 'openeye', 'daylight', 'tripos', 'mdl', 'mmff'] | None = None, flavor: int | str | None = 'Default', has_header: bool = False, delimiter: Literal['to-eol', 'space', 'tab', 'comma', 'whitespace', 'native', ' ', '\t'] | None = None, cxsmiles: bool = True, errors: str = 'strict')

Read ids and molecules from a string containing a SMILES file using the OEChem toolkit

This is equivalent to calling:

read_ids_and_molecules_from_string(content, "smi", reader_args={...}, errors=errors)

Use read_ids_and_molecules_from_string() if the content is compressed.

Available bit flag flavors are:

  • ‘Canon’ = 2

  • ‘Strict’ = 1

  • ‘Default’ = 0

Parameters:
  • aromaticity (None, or one of 'none', 'openeye', 'daylight', 'tripos', 'mdl', or 'mmff') – The OEChem aromaticity model

  • flavor (None, integer or string with "|"- or ","-separated terms (default: "Default")) – Input flavor bit flags

  • has_header (Boolean (default: False)) – If true, treat the first line of the SMILES file as a header

  • delimiter (One of None, 'to-eol', 'space', 'tab', 'comma', 'whitespace', 'native', or the space or tab characters (default: None)) – The separator between the SMILES and the id

  • cxsmiles (Boolean (default: True)) – If true, look for ChemAxon CXSMILES extensions after the SMILES string

  • errors (one of "strict", "ignore", or "log") – specify how to handle errors

Returns:

a chemfp.base_toolkit.IdAndMoleculeReader iterating OEChem molecules

chemfp.openeye_toolkit.read_smi_molecules(source: None | str | BinaryIO, *, aromaticity: Literal['none', 'openeye', 'daylight', 'tripos', 'mdl', 'mmff'] | None = None, flavor: int | str | None = 'Default', has_header: bool = False, delimiter: Literal['to-eol', 'space', 'tab', 'comma', 'whitespace', 'native', ' ', '\t'] | None = None, cxsmiles: bool = True, errors: str = 'strict')

Read molecules from a SMILES file using the OEChem toolkit

This is mostly equivalent to calling:

read_molecules(source, "smi", reader_args={...}, errors=errors)

along with decompression based on the source filename’s extension.

Available bit flag flavors are:

  • ‘Canon’ = 2

  • ‘Strict’ = 1

  • ‘Default’ = 0

Parameters:
  • aromaticity (None, or one of 'none', 'openeye', 'daylight', 'tripos', 'mdl', or 'mmff') – The OEChem aromaticity model

  • flavor (None, integer or string with "|"- or ","-separated terms (default: "Default")) – Input flavor bit flags

  • has_header (Boolean (default: False)) – If true, treat the first line of the SMILES file as a header

  • delimiter (One of None, 'to-eol', 'space', 'tab', 'comma', 'whitespace', 'native', or the space or tab characters (default: None)) – The separator between the SMILES and the id

  • cxsmiles (Boolean (default: True)) – If true, look for ChemAxon CXSMILES extensions after the SMILES string

  • errors (one of "strict", "ignore", or "log") – specify how to handle errors

Returns:

a chemfp.base_toolkit.MoleculeReader iterating OEChem molecules

chemfp.openeye_toolkit.read_smi_molecules_from_string(content: str | bytes, *, aromaticity: Literal['none', 'openeye', 'daylight', 'tripos', 'mdl', 'mmff'] | None = None, flavor: int | str | None = 'Default', has_header: bool = False, delimiter: Literal['to-eol', 'space', 'tab', 'comma', 'whitespace', 'native', ' ', '\t'] | None = None, cxsmiles: bool = True, errors: str = 'strict')

Read molecules from a string containing a SMILES file using the OEChem toolkit

This is equivalent to calling:

read_molecules_from_string(content, "smi", reader_args={...}, errors=errors)

Use read_molecules_from_string() if the content is compressed.

Available bit flag flavors are:

  • ‘Canon’ = 2

  • ‘Strict’ = 1

  • ‘Default’ = 0

Parameters:
  • aromaticity (None, or one of 'none', 'openeye', 'daylight', 'tripos', 'mdl', or 'mmff') – The OEChem aromaticity model

  • flavor (None, integer or string with "|"- or ","-separated terms (default: "Default")) – Input flavor bit flags

  • has_header (Boolean (default: False)) – If true, treat the first line of the SMILES file as a header

  • delimiter (One of None, 'to-eol', 'space', 'tab', 'comma', 'whitespace', 'native', or the space or tab characters (default: None)) – The separator between the SMILES and the id

  • cxsmiles (Boolean (default: True)) – If true, look for ChemAxon CXSMILES extensions after the SMILES string

  • errors (one of "strict", "ignore", or "log") – specify how to handle errors

Returns:

a chemfp.base_toolkit.MoleculeReader iterating OEChem molecules

chemfp.openeye_toolkit.set_id(mol, id)

Set the molecule’s id using OEChem’s SetTitle()

Parameters:
  • mol (an OEChem molecule) – the molecule

  • id (string) – the new id

Returns:

None

chemfp.openeye_toolkit.suppress_log_output()

Return a context manager to disable OEChem logging.

One entry, set the OEThrow error level to Quiet. On exit, restore the previous value.

Example:
with suppress_output():

… do something which may log …

The returned context manager is a re-entrant singleton object. The context manager may be entered multiple times. Logging will not be re-enabled until the matching number of exits.

Returns:

a context manager

chemfp.openeye_toolkit.translate_record(content, in_format='smi', out_format='smi', *, id_tag=None, reader_args=None, writer_args=None, id=None, errors='strict')

Translate a molecule record from one format to another

Use the RDKit toolkit to parse the content as format in_format (default: “smi”) and translate it into out_format (default: “smi”). For an SDF record, use id_tag to get the record id from the given SD tag instead of the title line. Use reader_args and writer_args to configure format-specific parameters. Use id to set the id of the output record.

The errors parameter specifies how to handle errors. “strict” raises an exception, “report” sends a message to stderr and goes to the next record, and “ignore” goes to the next record.

Parameters:
  • content (a string) – the string containing a structure record

  • in_format (a format name string, or Format object) – the input structure format

  • out_format (a format name string, or Format object) – the output structure format

  • id_tag (string, or None to use the record title) – SD tag containing the record id

  • reader_args (a dictionary, or None) – reader arguments for the specified in_format

  • writer_args (a dictionary, or None) – writer arguments for the specified out_format

  • id (a string, or None to use the default) – the record id to use for the output record

  • errors (one of "strict", "report", or "ignore") – specify how to handle errors

Returns:

a string