
Simple DDL Parser parses SQL & HQL DDL files to a JSON/Python dict with full information about columns: types, defaults, primary keys, etc.


Simple DDL Parser


How to install

pip install simple-ddl-parser

The parser is tested on various DDLs for PostgreSQL & Hive. The types used in your DB do not matter, so the parser should also work successfully with DDL for any SQL database.

If you have samples that cause an error, please open an issue (don't forget to include the DDL example) and I will be glad to fix it.

The parser takes SQL DDL statements or files as input, for example:

create table prod.super_table
(
    data_sync_id bigint not null default 0,
    id_ref_from_another_table int REFERENCES another_table (id),
    sync_count bigint not null REFERENCES count_table (count),
    sync_mark timestamp not null,
    sync_start timestamp not null default now(),
    sync_end timestamp not null,
    message varchar(2000) null,
    primary key (data_sync_id, sync_start)
);

and produces output like this (with information about the table name, schema, columns, types, and properties):

[
    {
        "columns": [
            {
                "name": "data_sync_id", "type": "bigint", "size": None,
                "nullable": False, "default": None, "references": None,
            },
            {
                "name": "id_ref_from_another_table", "type": "int", "size": None,
                "nullable": False, "default": None, "references": {"table": "another_table", "column": "id"},
            },
            {
                "name": "sync_count", "type": "bigint", "size": None,
                "nullable": False, "default": None, "references": {"table": "count_table", "column": "count"},
            },
            {
                "name": "sync_mark", "type": "timestamp", "size": None,
                "nullable": False, "default": None, "references": None,
            },
            {
                "name": "sync_start", "type": "timestamp", "size": None,
                "nullable": False, "default": None, "references": None,
            },
            {
                "name": "sync_end", "type": "timestamp", "size": None,
                "nullable": False, "default": None, "references": None,
            },
            {
                "name": "message", "type": "varchar", "size": 2000,
                "nullable": False, "default": None, "references": None,
            },
        ],
        "primary_key": ["data_sync_id", "sync_start"],
        "table_name": "super_table",
        "schema": "prod",
    }
]

Here is one more example:

CREATE TABLE "paths" (
  "id" int PRIMARY KEY,
  "title" varchar NOT NULL,
  "description" varchar(160),
  "created_at" timestamp,
  "updated_at" timestamp
);

and the result:

[{
'columns': [
    {'name': 'id', 'type': 'int', 'nullable': False, 'size': None, 'default': None, 'references': None},
    {'name': 'title', 'type': 'varchar', 'nullable': False, 'size': None, 'default': None, 'references': None},
    {'name': 'description', 'type': 'varchar', 'nullable': False, 'size': 160, 'default': None, 'references': None},
    {'name': 'created_at', 'type': 'timestamp', 'nullable': False, 'size': None, 'default': None, 'references': None},
    {'name': 'updated_at', 'type': 'timestamp', 'nullable': False, 'size': None, 'default': None, 'references': None}],
'primary_key': ['id'],
'table_name': 'paths', 'schema': None
}]

If you pass a file or text block with more than one CREATE TABLE statement, the result will be a list of such dicts. For example:

Input:

CREATE TABLE "countries" (
  "id" int PRIMARY KEY,
  "code" varchar(4) NOT NULL,
  "name" varchar NOT NULL
);

CREATE TABLE "path_owners" (
  "user_id" int,
  "path_id" int,
  "type" int DEFAULT 1
);

Output:

[
    {'columns': [
        {'name': 'id', 'type': 'int', 'size': None, 'nullable': False, 'default': None, 'references': None},
        {'name': 'code', 'type': 'varchar', 'size': 4, 'nullable': False, 'default': None, 'references': None},
        {'name': 'name', 'type': 'varchar', 'size': None, 'nullable': False, 'default': None, 'references': None}],
     'primary_key': ['id'],
     'table_name': 'countries',
     'schema': None},
    {'columns': [
        {'name': 'user_id', 'type': 'int', 'size': None, 'nullable': False, 'default': None, 'references': None},
        {'name': 'path_id', 'type': 'int', 'size': None, 'nullable': False, 'default': None, 'references': None},
        {'name': 'type', 'type': 'int', 'size': None, 'nullable': False, 'default': 1, 'references': None}],
     'primary_key': [],
     'table_name': 'path_owners',
     'schema': None}
]

How to use

From Python code

from simple_ddl_parser import DDLParser


parse_results = DDLParser("""create table dev.data_sync_history(
    data_sync_id bigint not null,
    sync_count bigint not null,
    sync_mark timestamp  not  null,
    sync_start timestamp  not null,
    sync_end timestamp  not null,
    message varchar(2000) null,
    primary key (data_sync_id, sync_start)
); """).run()

print(parse_results)
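Because .run() returns a plain list of dicts with the structure shown in the examples above, you can post-process the result with ordinary Python. A minimal sketch that prints a short summary of each parsed table (the field names follow the example output above):

# Summarize each parsed table: qualified name, columns, primary key.
for table in parse_results:
    name = f"{table['schema']}.{table['table_name']}" if table["schema"] else table["table_name"]
    print(name)
    for column in table["columns"]:
        size = f"({column['size']})" if column["size"] else ""
        print(f"  {column['name']}: {column['type']}{size}")
    print(f"  primary key: {table['primary_key']}")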

To parse from a file

from simple_ddl_parser import parse_from_file

result = parse_from_file('tests/test_one_statement.sql')
print(result)
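The returned structure contains only dicts, lists, strings, numbers, None, and booleans, so if you need a JSON string rather than Python dicts, the standard json module is enough:

import json

from simple_ddl_parser import parse_from_file

result = parse_from_file('tests/test_one_statement.sql')

# Serialize the parsed schema to a JSON string.
print(json.dumps(result, indent=2))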

From command line

simple-ddl-parser is installed into the environment as the command sdp:

sdp path_to_ddl_file

# for example:

sdp tests/test_two_tables.sql

You will find the output in the schemas folder, in a file named test_two_tables_schema.json.

If you also want output in the console, use the -v (verbose) flag:

sdp tests/test_two_tables.sql -v

If you don't want to dump the schema to a file and just want to print the result to the console, use the --no-dump flag:

sdp tests/test_two_tables.sql --no-dump

More examples & tests

More examples and tests can be found in the tests/functional folder.

Dump result in json

To dump the result to JSON, use the argument .run(dump=True).

You can also provide the path where the schema dumps should be written with the dump_path argument.
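A minimal sketch combining both arguments (the dump_path keyword is taken from the project's examples; treat it as an assumption and check your installed version if it differs):

from simple_ddl_parser import DDLParser

ddl = """CREATE TABLE "countries" (
  "id" int PRIMARY KEY,
  "name" varchar NOT NULL
);"""

# dump=True writes the parsed schema to a JSON file;
# dump_path sets the output folder (assumed keyword --
# check the arguments of .run() in your installed version).
result = DDLParser(ddl).run(dump=True, dump_path="schemas/")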

TODO in next Releases

  1. Support for separate ALTER TABLE statements for Foreign Keys (a workaround using inline REFERENCES is sketched after this list), like:

ALTER TABLE "material_attachments" ADD FOREIGN KEY ("material_id") REFERENCES "materials" ("id");

  2. Support for parsing CREATE INDEX statements

  3. Add command line args: pass a folder with DDLs to convert, and pass a path for the output results

  4. Support ARRAYs
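Until item 1 lands, one workaround is to declare the foreign key inline in the CREATE TABLE statement; inline REFERENCES is already supported, as the references field in the first example shows. A sketch:

from simple_ddl_parser import DDLParser

# A separate ALTER TABLE ... ADD FOREIGN KEY statement is not
# parsed yet, but the same relation can be declared inline,
# which fills the "references" field as in the first example.
ddl = """create table material_attachments (
    material_id int REFERENCES materials (id),
    attachment_id int
);"""
print(DDLParser(ddl).run())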

Historical context

This library is parser code extracted from https://github.com/xnuinside/fakeme (a library for fake relational data generation that I used in several work projects, but did not have time to turn into a proper open source library).

For one of my work projects I needed to automatically convert SQL DDL to Python ORM models, and I tried to use https://github.com/andialbrecht/sqlparse, but it did not work well enough for my case (for example, if the DDL used lower case, nothing worked; primary keys inside the DDL were mapped as column names rather than reserved words; and so on). So I remembered the parser in Fakeme and just extracted and improved it.

How to contribute

Please describe the issue that you want to solve and open a PR; I will review it as soon as possible.

Any questions? Ping me on Telegram: https://t.me/xnuinside
