moj-analytical-services

Username: moj-analytical-services
19 projects

iam-builder

A small Python package to generate IAM policies

pydbtools

A Python package to query data via Amazon Athena and bring it into a pandas DataFrame using aws-wrangler.

splink

Fast probabilistic data linkage at scale

data-linter

data linter

form-tools

data-engineering-pulumi-components

Reusable components for use in Pulumi Python projects

arrow_pd_parser

MoJ arrow-pd-parser

database-testing-tools

A package to test our databases

s3_data_packer

mojap-metadata

A Python package to manage metadata

etl_manager

A Python package to manage ETL processes on AWS

dataengineeringutils3

Data engineering utils, Python 3 version

splink-graph

A small set of graph functions for use from PySpark, built on top of NetworkX and GraphFrames

mojap-airflow-tools

A few wrappers and tools to use Airflow on the Analytical Platform

athena-tools

A set of useful Athena database creation tools

splink-data-generation

Generate synthetic data with a specified data generating process

splink-data-standardisation

gluejobutils

Python 2.7 utils for AWS Glue jobs

pdf2embeddings

An NLP tool for scraping text from a corpus of PDF files, embedding the sentences in the text, and finding sentences semantically similar to a given search query.
