grand-cypher-io

File IO routines for reading and writing OpenCypher files.


Why?

  • To enable the use of OpenCypher files as a standard graph interchange format.
  • To simplify moving in-memory Python graphs into and out of a Neo4j or Neptune database.
  • To serialize and deserialize graphs for long-term (e.g., archival) immutable storage.

Compatibilities

  • All routines that expect a graph can be run with Grand Graph.nx objects.
  • You can mock most of a Neo4j database by using this repository for IO in conjunction with Grand-Cypher for query execution.
  • Designed for use with AWS Neptune.

Usage

Export a graph to OpenCypher-readable files

from grand_cypher_io import graph_to_opencypher_buffers

# `graph` is nx.DiGraph or compatible
vert_buffer, edge_buffer = graph_to_opencypher_buffers(graph)

# Write the vertex and edge buffers to separate CSV files
with open('vertices.csv', 'w') as f:
    f.write(vert_buffer.read())
with open('edges.csv', 'w') as f:
    f.write(edge_buffer.read())

Import a graph from OpenCypher-readable files

import io

from grand_cypher_io import opencypher_buffers_to_graph

# Load each CSV file into an in-memory text buffer
with open('vertices.csv', 'r') as f:
    vert_buffer = io.StringIO(f.read())
with open('edges.csv', 'r') as f:
    edge_buffer = io.StringIO(f.read())

# Rebuild the graph from the two buffers
graph = opencypher_buffers_to_graph(vert_buffer, edge_buffer)

Usage Considerations

Edge addition implies vertices

When an edge is added to a graph, its endpoint vertices are added as well, even if they were never declared. This differs from the behavior of Neo4j imports, but it matches the assumptions of the Grand graph library and greatly reduces the inner-loop complexity of the import process.

Because these implicit vertices have no properties, they are easy to detect and filter out of the graph after importing, if desired.

This behavior also means that it is possible to create a full structural graph from a set of edges alone, without any vertices.
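
The filtering step described above can be sketched as follows. This is a minimal sketch, not part of grand-cypher-io: the helper names `find_implicit_vertices` and `drop_implicit_vertices` are hypothetical, and the code assumes a networkx-style graph (such as `nx.DiGraph`) that exposes `nodes(data=True)` and `remove_nodes_from`. It also assumes that an empty attribute dictionary marks an implicit vertex, which would equally match a vertex declared explicitly with no labels and no properties.

```python
def find_implicit_vertices(graph):
    """Return the IDs of nodes whose attribute dict is empty."""
    # Assumption: nodes created only as edge endpoints carry no attributes.
    return [node for node, attrs in graph.nodes(data=True) if not attrs]


def drop_implicit_vertices(graph):
    """Remove attribute-less nodes in place and return their IDs."""
    implicit = find_implicit_vertices(graph)
    # In networkx, removing a node also removes every edge attached to it.
    graph.remove_nodes_from(implicit)
    return implicit
```

Calling `drop_implicit_vertices(graph)` after `opencypher_buffers_to_graph` would leave only the vertices that carry attributes, at the cost of also dropping any edges that touch the removed vertices.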

The __labels__ magic attribute

Following the Grand-Cypher convention, the __labels__ attribute is used to store the labels of a node. This is an iterable of strings. The __labels__ attribute is not required, but if it is present, it will be used to populate the labels attribute of the node for the purposes of writing to an OpenCypher file.

Likewise, the __labels__ attribute is used to populate the labels attribute of a node when reading from an OpenCypher file.
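
As an illustration of this convention, the sketch below separates the labels from the ordinary properties of a single node's attribute dictionary (in networkx, `graph.nodes[node_id]`). The helper `split_labels` is hypothetical and not part of grand-cypher-io; sorting the labels is an arbitrary choice made here for deterministic output.

```python
def split_labels(attrs):
    """Split a node attribute dict into (labels, properties)."""
    # `__labels__` is optional, so fall back to an empty tuple.
    labels = sorted(attrs.get("__labels__", ()))
    # Everything except the magic key is an ordinary property.
    properties = {k: v for k, v in attrs.items() if k != "__labels__"}
    return labels, properties


node_attrs = {"__labels__": {"Person", "Employee"}, "name": "Ada"}
labels, properties = split_labels(node_attrs)
print(labels)      # ['Employee', 'Person']
print(properties)  # {'name': 'Ada'}
```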

Made with 💙 at JHU APL
