Cereja is a bundle of useful functions that I don't want to rewrite.

These details have not been verified by PyPI

Project links

Homepage

License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language
- Python :: 3

Project description

Cereja 🍒

Python package

Cereja was written only with the Standard Python Library, and it was a great way to improve knowledge in the Language also to avoid the rewriting of code.

Getting Started DEV

Don't be shy \0/ ... Clone the repository and submit a function or module you made or use some function you liked.

See CONTRIBUTING 💻

Setup

Install

pip install --user cereja

or for all users

pip install cereja

Cereja Example usage

See some of the Cereja tools

To access the Cereja's tools you need to import it import cereja as cj.

📝 FileIO

Create new files

import cereja as cj

file_json = cj.FileIO.create('./json_new_file.json', data={'k': 'v', 'k2': 'v2'})

file_txt = cj.FileIO.create('./txt_new_file.txt', ['line1', 'line2', 'line3'])

file_json.save()
file_txt.save()

print(file_json.exists)
# True
print(file_txt.exists)
# True


# see what you can do .txt file
print(cj.can_do(file_txt))

# see what you can do .json file
print(cj.can_do(file_json))

Load and edit files

import cereja as cj

file_json = cj.FileIO.load('./json_new_file.json')

print(file_json.data)
# {'k': 'v', 'k2': 'v2'}

file_json.add(key='new_key', value='value')
print(file_json.data)
# {'k': 'v', 'k2': 'v2', 'new_key': 'value'}

file_txt = cj.FileIO.load('./txt_new_file.txt')

print(file_txt.data)
# ['line1', 'line2', 'line3']

file_txt.add('line4')
print(file_txt.data)
# ['line1', 'line2', 'line3', 'line4']

file_txt.save(exist_ok=True)  # Override
file_json.save(exist_ok=True)  # Override

📍 Path

import cereja as cj

file_path = cj.Path('/my/path/file.ext')
print(cj.can_do(file_path))
# ['change_current_dir', 'cp', 'created_at', 'exists', 'get_current_dir', 'is_dir', 'is_file', 'is_hidden', 'is_link', 'join', 'last_access', 'list_dir', 'list_files', 'mv', 'name', 'parent', 'parent_name', 'parts', 'path', 'rm', 'root', 'rsplit', 'sep', 'split', 'stem', 'suffix', 'updated_at', 'uri']

🆗 HTTP Requests

import cereja as cj

# Change url, headers and data values.
url = 'localhost:8000/example'
headers = {'Authorization': 'TOKEN'} # optional
data = {'q': 'test'} # optional

response = cj.request.post(url, data=data, headers=headers)

if response.code == 200:
    data = response.data
    # have a fun!

⏳ Progress

import cereja as cj
import time

my_iterable = ['Cereja', 'is', 'very', 'easy']

for i in cj.Progress.prog(my_iterable):
    print(f"current: {i}")
    time.sleep(2)

# Output on terminal ...

# 🍒 Sys[out] » current: Cereja 
# 🍒 Sys[out] » current: is 
# 🍒 Cereja Progress » [▰▰▰▰▰▰▰▰▰▰▰▰▰▰▰▱▱▱▱▱▱▱▱▱▱▱▱▱▱] - 50.00% - 🕢 00:00:02 estimated

🧠 Data Preparation

📊 Freq

import cereja as cj

freq = cj.Freq([1, 2, 3, 3, 10, 10, 4, 4, 4, 4])
# Output -> Freq({1: 1, 2: 1, 3: 2, 10: 2, 4: 4})

freq.most_common(2)
# Output -> {4: 4, 3: 2}

freq.least_freq(2)
# Output -> {2: 1, 1: 1}

freq.probability
# Output -> OrderedDict([(4, 0.4), (3, 0.2), (10, 0.2), (1, 0.1), (2, 0.1)])

freq.sample(min_freq=1, max_freq=2)
# Output -> {3: 2, 10: 2, 1: 1, 2: 1}

# Save json file.
freq.to_json('./freq.json')

🧹 Text Preprocess

import cereja as cj

text = "Oi tudo bem?? meu nome é joab!"

text = cj.preprocess.remove_extra_chars(text)
print(text)
# Output -> 'Oi tudo bem? meu nome é joab!'

text = cj.preprocess.separate(text, sep=['?', '!'])
# Output -> 'Oi tudo bem ? meu nome é joab !'

text = cj.preprocess.accent_remove(text)
# Output -> 'Oi tudo bem ? meu nome e joab !'

# and more ..

# You can use class Preprocessor ...
preprocessor = cj.Preprocessor(stop_words=(),
                               punctuation='!?,.', to_lower=True, is_remove_punctuation=False,
                               is_remove_stop_words=False,
                               is_remove_accent=True)

print(preprocessor.preprocess(text))
# Output -> 'oi tudo bem ? meu nome e joab !'

print(preprocessor.preprocess(text, is_destructive=True))
# Output -> 'oi tudo bem meu nome e joab'

🔣 Tokenizer

import cereja as cj

text = ['oi tudo bem meu nome é joab']

tokenizer = cj.Tokenizer(text, use_unk=True)

# tokens 0 to 9 is UNK
# hash_ used to replace UNK
token_sequence, hash_ = tokenizer.encode('meu nome é Neymar Júnior')
# Output -> [([10, 12, 11, 0, 1], 'eeb755960ce70c')]

decoded_sequence = tokenizer.decode(token_sequence)
# Output -> ['meu', 'nome', 'é', '{0}', '{1}']

decoded_sequence = ' '.join(decoded_sequence)

tokenizer.replace_unks(decoded_sequence, hash_)
# Output -> 'meu nome é Neymar Júnior'

⏸ Corpus

Great training and test separator.

import cereja as cj

X = ['how are you?', 'my name is Joab', 'I like coffee', 'how are you joab?', 'how', 'we are the world']
Y = ['como você está?', 'meu nome é Joab', 'Eu gosto de café', 'Como você está joab?', 'como', 'Nós somos o mundo']

corpus = cj.Corpus(source_data=X, target_data=Y, source_name='en', target_name='pt')
print(corpus)  # Corpus(examples: 6 - source_vocab_size: 13 - target_vocab_size:15)
print(corpus.source)  # LanguageData(examples: 6 - vocab_size: 13)
print(corpus.target)  # LanguageData(examples: 6 - vocab_size: 15)

corpus.source.phrases_freq
# Counter({'how are you': 1, 'my name is joab': 1, 'i like coffee': 1, 'how are you joab': 1, 'how': 1, 'we are the world': 1})

corpus.source.word_freq
# Counter({'how': 3, 'are': 3, 'you': 2, 'joab': 2, 'my': 1, 'name': 1, 'is': 1, 'i': 1, 'like': 1, 'coffee': 1, 'we': 1, 'the': 1, 'world': 1})

corpus.target.phrases_freq
# Counter({'como você está': 1, 'meu nome é joab': 1, 'eu gosto de café': 1, 'como você está joab': 1, 'como': 1, 'nós somos o mundo': 1})

corpus.target.words_freq
# Counter({'como': 3, 'você': 2, 'está': 2, 'joab': 2, 'meu': 1, 'nome': 1, 'é': 1, 'eu': 1, 'gosto': 1, 'de': 1, 'café': 1, 'nós': 1, 'somos': 1, 'o': 1, 'mundo': 1})

# split_data function guarantees test data without data identical to training
# and only with vocabulary that exists in training
train, test = corpus.split_data()  # default percent of training is 80%

🔢 Array

import cereja as cj

cj.array.is_empty(data)  # False
cj.array.get_shape(data)  # (2, 3)

data = cj.array.flatten(data)  # [1, 2, 3, 3, 3, 3]
cj.array.prod(data)  # 162
cj.array.sub(data)  # -13
cj.array.div(data)  # 0.006172839506172839

cj.array.rand_n(0.0, 2.0, n=3)  # [0.3001196087729699, 0.639679494102923, 1.060200897124107]
cj.array.rand_n(1, 10)  # 5.086403830031244
cj.array.array_randn((3, 3,
                      3))  # [[[0.015077210355770374, 0.014298110484612511, 0.030410666810216064], [0.029319083335697604, 0.0072365209507707666, 0.010677361074992], [0.010576754075922935, 0.04146379877648334, 0.02188348813336284]], [[0.0451851551098092, 0.037074906805326824, 0.0032484586475421007], [0.025633380630695347, 0.010312669541918484, 0.0373624007621097], [0.047923908102496145, 0.0027939333359724224, 0.05976224377251878]], [[0.046869510719106486, 0.008325638358172866, 0.0038702998343255893], [0.06475268683502387, 0.0035638592537234623, 0.06551037943638163], [0.043317416824708604, 0.06579372884523939, 0.2477564291871006]]]
cj.array.group_items_in_batches(items=[1, 2, 3, 4], items_per_batch=3, fill=0)  # [[1, 2, 3], [4, 0, 0]]
cj.array.remove_duplicate_items(['hi', 'hi', 'ih'])  # ['hi', 'ih'] 
cj.array.get_cols([['line1_col1', 'line1_col2'],
                   ['line2_col1', 'line2_col2']])  # [['line1_col1', 'line2_col1'], ['line1_col2', 'line2_col2']]
cj.array.dotproduct([1, 2], [1, 2])  # 5

a = cj.array.array_gen((3, 3), 1)  # [[1, 1, 1], [1, 1, 1], [1, 1, 1]]
b = cj.array.array_gen((3, 3), 1)  # [[1, 1, 1], [1, 1, 1], [1, 1, 1]]
cj.array.dot(a, b)  # [[3, 3, 3], [3, 3, 3], [3, 3, 3]]
cj.mathtools.theta_angle((2, 2), (0, -2))  # 135.0

🧰 Utils

import cereja as cj

data = {"key1": 'value1', "key2": 'value2', "key3": 'value3', "key4": 'value4'}

cj.utils.chunk(list(range(10)), batch_size=3)
# [[0, 1, 2], [3, 4, 5], [6, 7, 8], [9]]
cj.utils.chunk(list(range(10)), batch_size=3, fill_with=0, is_random=True)
# [[9, 7, 8], [0, 3, 2], [4, 1, 5], [6, 0, 0]]

# Invert Dict
cj.utils.invert_dict(data)
# Output -> {'value1': 'key1', 'value2': 'key2', 'value3': 'key3', 'value4': 'key4'}

# Get sample of large data
cj.utils.sample(data, k=2, is_random=True)
# Output -> {'key1': 'value1', 'key4': 'value4'}

cj.utils.fill([1, 2, 3, 4], max_size=20, with_=0)
# Output -> [1, 2, 3, 4, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0]

cj.utils.rescale_values([1, 2, 3, 4, 5, 6, 7, 8, 9, 10], granularity=4)
# Output -> [1, 3, 5, 7]

cj.utils.import_string('cereja.file._io.FileIO')
# Output -> <class 'cereja.file._io.FileIO'>

cj.utils.list_methods(cj.Path)
# Output -> ['change_current_dir', 'cp', 'get_current_dir', 'join', 'list_dir', 'list_files', 'mv', 'rm', 'rsplit', 'split']


cj.utils.string_to_literal('[1,2,3,4]')
# Output -> [1, 2, 3, 4]

cj.utils.time_format(3600)
# Output -> '01:00:00'

cj.utils.truncate("Cereja is fun.", k=3)
# Output -> 'Cer...'

data = [[1, 2, 3], [3, 3, 3]]
cj.utils.is_iterable(data)  # True
cj.utils.is_sequence(data)  # True
cj.utils.is_numeric_sequence(data)  # True

See Usage - Jupyter Notebook

License

This project is licensed under the MIT License - see the LICENSE file for details

Project details

These details have not been verified by PyPI

Project links

Homepage

License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language
- Python :: 3

Release history Release notifications | RSS feed

2.1.2

Jan 7, 2026

2.1.1

Sep 10, 2025

2.1.0

Jun 11, 2025

2.0.9

Jun 6, 2025

2.0.8

Mar 3, 2025

2.0.7

Feb 11, 2025

2.0.6

Feb 9, 2025

2.0.5

Oct 26, 2024

2.0.4

Oct 14, 2024

2.0.3

Oct 5, 2024

2.0.2

Sep 19, 2024

2.0.1

Sep 3, 2024

2.0.0.post1

Aug 15, 2024

2.0.0

Aug 15, 2024

1.9.9

Apr 24, 2024

1.9.8

Mar 17, 2024

1.9.7

Dec 21, 2023

1.9.6

Dec 21, 2023

1.9.5

Dec 15, 2023

1.9.4

Dec 1, 2023

1.9.3

Nov 22, 2023

1.9.2

Nov 11, 2023

1.9.1

Oct 3, 2023

1.9.0

Sep 30, 2023

1.8.9

Aug 24, 2023

1.8.8

Aug 18, 2023

1.8.7

Aug 11, 2023

1.8.6

Aug 11, 2023

1.8.5

Jul 8, 2023

1.8.4

Mar 12, 2023

1.8.3

Jan 26, 2023

1.8.2

Jan 26, 2023

1.8.1

Jan 26, 2023

1.8.0

Nov 18, 2022

1.7.9

Oct 22, 2022

1.7.8

Sep 25, 2022

1.7.7

Aug 30, 2022

1.7.6

Aug 7, 2022

1.7.5

Jul 20, 2022

1.7.4

Jun 17, 2022

1.7.3

May 19, 2022

1.7.2

May 5, 2022

1.7.1

Apr 27, 2022

1.7.0

Apr 27, 2022

1.6.9

Apr 22, 2022

1.6.8

Apr 9, 2022

1.6.7

Apr 1, 2022

1.6.6

Mar 29, 2022

1.6.5

Mar 25, 2022

1.6.4

Mar 24, 2022

1.6.2

Mar 23, 2022

1.6.1

Feb 14, 2022

1.6.0

Dec 28, 2021

1.5.9

Oct 15, 2021

1.5.8

Sep 11, 2021

1.5.6

Sep 6, 2021

1.5.5

Sep 2, 2021

1.5.4

Aug 13, 2021

1.5.3

Aug 8, 2021

1.5.2

Aug 7, 2021

1.5.1

Aug 2, 2021

1.5.0

Jul 31, 2021

This version

1.4.9

Jul 24, 2021

1.4.8

Jul 1, 2021

1.4.7

Jun 28, 2021

1.4.6

Jun 25, 2021

1.4.5

Jun 25, 2021

1.4.4.post1

Jun 13, 2021

1.4.4

Jun 13, 2021

1.4.3

Jun 13, 2021

1.4.2

May 31, 2021

1.4.1

May 30, 2021

1.4.0

May 29, 2021

1.3.9.post1

May 29, 2021

1.3.9

May 29, 2021

1.3.8

May 24, 2021

1.3.7

May 2, 2021

1.3.6

Apr 22, 2021

1.3.5

Apr 8, 2021

1.3.4.post1

Mar 29, 2021

1.3.4

Mar 28, 2021

1.3.3

Mar 10, 2021

1.3.2

Mar 6, 2021

1.3.1

Mar 1, 2021

1.3.0

Jan 10, 2021

1.3.0b0 pre-release

Dec 22, 2020

1.2.9.post1

Dec 4, 2020

1.2.9

Dec 4, 2020

1.2.8.post1

Nov 15, 2020

1.2.8

Nov 15, 2020

1.2.7.post1

Nov 8, 2020

1.2.7

Nov 8, 2020

1.2.6.post2

Nov 6, 2020

1.2.6.post1

Oct 26, 2020

1.2.6

Oct 25, 2020

1.2.5.post2

Oct 22, 2020

1.2.5.post1

Oct 22, 2020

1.2.5

Oct 22, 2020

1.2.4

Oct 9, 2020

1.2.3

Sep 19, 2020

1.2.2

Aug 11, 2020

1.2.1

Jun 14, 2020

1.2.0

Jun 13, 2020

1.1.6

Jun 6, 2020

1.1.5

Jun 6, 2020

1.1.4.post2

May 29, 2020

1.1.4.post1

May 29, 2020

1.1.4

May 28, 2020

1.1.3

Apr 24, 2020

1.1.2.post1

Apr 6, 2020

1.1.2

Apr 4, 2020

1.1.2rc0 pre-release

Apr 3, 2020

1.1.1

Mar 27, 2020

1.1.0

Mar 12, 2020

1.0.6

Feb 27, 2020

1.0.5

Feb 27, 2020

1.0.4

Feb 27, 2020

1.0.3

Feb 27, 2020

1.0.2

Feb 20, 2020

1.0.2.dev20200202084148 pre-release

Feb 2, 2020

1.0.1.post1

Jan 27, 2020

1.0.1

Jan 26, 2020

1.0.1.dev20200125094908 pre-release

Jan 25, 2020

1.0.1.dev20200125093017 pre-release

Jan 25, 2020

1.0.1.dev20200120192751 pre-release

Jan 20, 2020

1.0.0

Jan 20, 2020

0.3.5

Jan 20, 2020

0.3.4

Jan 19, 2020

0.3.3

Jan 16, 2020

0.3.0

Jan 14, 2020

0.2.6

Jan 12, 2020

0.2.5

Jan 10, 2020

0.2.2

Jan 8, 2020

0.2.1

Dec 30, 2019

0.2.0

Nov 17, 2019

0.1.9

Nov 11, 2019

0.1.8

Nov 11, 2019

0.1.7

Sep 30, 2019

0.1.6

Sep 30, 2019

0.1.5

Sep 28, 2019

0.1.4

Sep 26, 2019

0.1.3

Sep 25, 2019

0.1.2

Sep 25, 2019

0.1.1

Sep 19, 2019

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

cereja-1.4.9.tar.gz (75.4 kB view details)

Uploaded Jul 24, 2021 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

cereja-1.4.9-py3-none-any.whl (103.4 kB view details)

Uploaded Jul 24, 2021 Python 3

File details

Details for the file cereja-1.4.9.tar.gz.

File metadata

Download URL: cereja-1.4.9.tar.gz
Upload date: Jul 24, 2021
Size: 75.4 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/3.4.2 importlib_metadata/4.6.1 pkginfo/1.7.1 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.61.2 CPython/3.9.6

File hashes

Hashes for cereja-1.4.9.tar.gz
Algorithm	Hash digest
SHA256	`2194bb7b3885a92d2e4f2a72b784a7d37ecbbac6d02c521195e4eac5323189cb`
MD5	`6c5ede1be0a1d26a410c9f0cf53ad2b2`
BLAKE2b-256	`e0a0bdf5fe63c0b572cf8ef5e0f3650cfe79a4c8b067a595a3421faa65342408`

See more details on using hashes here.

File details

Details for the file cereja-1.4.9-py3-none-any.whl.

File metadata

Download URL: cereja-1.4.9-py3-none-any.whl
Upload date: Jul 24, 2021
Size: 103.4 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/3.4.2 importlib_metadata/4.6.1 pkginfo/1.7.1 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.61.2 CPython/3.9.6

File hashes

Hashes for cereja-1.4.9-py3-none-any.whl
Algorithm	Hash digest
SHA256	`60ad8cbdc35e6877c9425f4f2bc1ad6bf8b05f19a73c22a7e104765a5fc7ecfe`
MD5	`0fb8b31b9af49555bc57c7172e94f25b`
BLAKE2b-256	`3e9a8d6e82725363c972b915b729c8208063c27f275255697aebd4d4a3c7cfd5`

See more details on using hashes here.

cereja 1.4.9

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Cereja 🍒

Getting Started DEV

Setup

Install

Cereja Example usage

📝 FileIO

Create new files

Load and edit files

📍 Path

🆗 HTTP Requests

⏳ Progress

🧠 Data Preparation

🔢 Array

🧰 Utils

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes