Skip to main content

A SEAMM plug-in to read common formats in computational chemistry

Project description

SEAMM Read Structure Plug-in

GitHub pull requests Build Status Code Coverage Code Quality Documentation Status Updates for Dependencies PyPi VERSION

A SEAMM plug-in to read common formats in computational chemistry

The current version uses Open Babel as an engine to parse various formats such as PDB, Mol2 or XYZ and transform them into the SEAMM structure format for further use in SEAMM flowcharts.

Features

  • Emphasis in ease-of-use and simplicity. The public interface consists of a single function.

  • Automatic file type recognition.

  • OpenBabel as an engine to parse formats, but other engines can be easily implemented.

  • Easily extensible to new formats.

  • Current support for PDB, Mol2 and XYZ files.

Example

import read_structure_step
seamm_structure = read_structure_step.read("spc.xyz")

Acknowledgements

This package was created with Cookiecutter and the molssi-seamm/cookiecutter-seamm-plugin project template.

Developed by the Molecular Sciences Software Institute (MolSSI), which receives funding from the National Science Foundation under award ACI-1547580

History

2023.7.28 – Implemented ranges for reading XYZ and SDF files.

2023.7.27.1 – Removed debug printing.

2023.7.27 – Support for .gz and .bz2 files, and multi-structure .xyz files
  • Handle .gz and .bz2 files for .sdf and .xyz extensions.

  • Handle multi-structure XYZ files with a blank line between records.

2023.7.6 – Bugfixes
  • Fixed output of number of structures written to SDF files, which was off by 1.

  • Cleaned up the output for the write-structure step

2023.1.30 – Fixed issue#43, duplicate systems or configuration created

  • Reading a single structure from e.g. a .sdf file created a second system or configuration, depending on the stucture control options.

2023.1.24 – Added handler for XYZ files and added properties

  • Added a custom handler for XYZ files to cope with some of the variant formats.

    • Supports files with no atom count on the first line

    • Supports the variant used in the Minnesota Solubility database, which has 3 header lines.

  • Add capability to store properties into the database for formats such as MOPAC and SDF that can handle properties. Also can output the properties when writing the files.

  • Fixed bugs if the system name or configuration name is not a string, but rather a number.

2022.10.28 – Fixed bug reading cif and mmcif files

  • There was a bug that caused a crash when reading cif and mmcif files, and potentially some other formats. It has been fixed throughout.

  • The standard error for properties were missing a commma in the property name. The comma is standard elsewhere in SEAMM so add it here: ‘<prop>, stderr’

2022.10.26 – Handling OpenBabel error messages for MOPAC .mop files

Hiding messages about errors Kekulizing structures, which doesn’t seem to be a serious issue, and printing any other messages as warnings.

2021.2.12 (12 February 2021)

  • Updated the README file to give a better description.

  • Updated the short description in setup.py to work with the new installer.

  • Added keywords for better searchability.

2021.2.4 (4 February 2021)

Updated for compatibility with the new system classes in MolSystem 2021.2.2 release.

2020.12.5 (5 December 2020)

Internal: switching CI from TravisCI to GitHub Actions, and in the process moving documentation from ReadTheDocs to GitHub Pages where it is consolidated with the main SEAMM documentation.

2020.8.1 (1 August 2020)

Removed leftover debug print statements.

0.9 (15 April 2020)

  • General bug fixing and code cleanup.

  • Part of release of all modules.

0.7.1 (23 November 2019)

First release on PyPI.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

read_structure_step-2023.7.28.tar.gz (894.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

read_structure_step-2023.7.28-py2.py3-none-any.whl (105.9 kB view details)

Uploaded Python 2Python 3

File details

Details for the file read_structure_step-2023.7.28.tar.gz.

File metadata

  • Download URL: read_structure_step-2023.7.28.tar.gz
  • Upload date:
  • Size: 894.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.4

File hashes

Hashes for read_structure_step-2023.7.28.tar.gz
Algorithm Hash digest
SHA256 92704ef39aee8839c91b4e47fe8f1efff2c9aa7ccc57f02a017b87b9a5113006
MD5 cbe9cc894bd236bd744e329dc0f5ca35
BLAKE2b-256 67fbf95139c6d8f5a5ae3f366de93aa75d70e2699bcc90d7097163e0e63aa35a

See more details on using hashes here.

File details

Details for the file read_structure_step-2023.7.28-py2.py3-none-any.whl.

File metadata

File hashes

Hashes for read_structure_step-2023.7.28-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 4cddd4ac2bf9dfb3cff0f2badaa5e2375e8b3d9dd4e760e6e20406402680b0d1
MD5 7685dd431da24be3db02565325edaa64
BLAKE2b-256 fdc49d0f007e84fc19a600a8767d1cb7a39a4c804a725446c9091bdef4d642a2

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page