Manga browser/downloader.

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Project description

tankobon

logo

PyPI - License PyPI PyPI - Python Version Lines of code

What?

tankobon is a website scraper for comics and mangas. tankobon relies on stores, which define how to parse a website for chapters and chapters for links to the pages themselves. (somewhat like youtube-dl extractors.) Currently, the following websites are supported:

komi-san.com
m.mangabat.com
mangadex.org
mangakakalot.com

Creating a Store

A store is a regular Python module in the stores/ folder. It should provide a Manga class, which is a subclass of tankobon.base.GenericManga. The following methods below must be implemented:

`get_chapters(self) -> Generator[Tuple[str, Dict[str, str]], None, None]`

Yield a two-tuple of (chapter_number, chapter_info) where chapter_info looks like this:

{
    "title": ...,  # chapter title
    "url": ...,  # chapter url
    "volume": ...,  # volume, i.e '0'
}

Volume may or may not be given; no volume implies volume 0. Example:

def get_chapters(self):
    # use self.soup to access the title page
    for href in self.soup.find_all("a", href=True):
        # validify href here and parse chapter id
        ...
        yield chapter_id, {"title": href.text, "url": href["href"]}

`get_pages(self, chapter_url: str) -> List[str]`

Return a list of urls to a chapter's pages, given its url. The pages must be in order (page 1 is [0], page 2 is [1], etc.) Example:

def get_pages(self, chapter_url):
    pages = []
    # to get the chapter's html, use self.session.get (requests session)
    # or self.get_soup (html already parsed by BeautifulSoup).
    chapter_page = self.get_soup(chapter_url)
    for href in chapter_page.find_all("a", href=True):
        # validify href here
        ...
        pages.append(href["href"])
    return pages

The following methods below may or may not be implemented: generic implementations are provided.

`get_title(self) -> str`

Return the title of the manga. Example:

def get_title(self):
    return self.soup.title

`get_covers(self) -> Dict[str, str]`

Return a dictionary map of volume (i.e '0', '1') to its cover. Example:

def get_covers(self):
    # The website might have a different api to obtain covers,
    # but we'll just fake one here.

    # (And yes, I do know dictionary comprehensions are better.)
    covers = {}
    for cover in self.soup.find_all("li"):
        covers[cover.a.text] = cover.a["href"]
    
    return covers

Index Compatibility

Between version v3.1.0a1 and v3.2.0a0, the location of the index file has moved from site-packages to ~/.tankobon/index.json, specific to each install of tankobon.

Todo

download pre-parsed indexes from a special Github repo (tankobon-index?)
create GUI to make downloading easier (like youtube-DLG)

Usage

tankobon download 'https://komi-san.com'  # download all chapters
tankobon store info 'komi_san/https://komi-san.com'  # and then get info on the chapters

Install

python(3) -m pip install tankobon

Build

All my python projects now use flit to build and publish. To build, do flit build.

License

MIT.

Project details

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Release history Release notifications | RSS feed

2022.2.1

Feb 1, 2022

2022.1.20

Jan 20, 2022

2022.1.18

Jan 18, 2022

2021.10.2

Oct 2, 2021

2021.7.11

Jul 11, 2021

2021.6.8

Jun 22, 2021

2021.6.7

Jun 22, 2021

2021.6.6

Jun 20, 2021

2021.6.5

Jun 18, 2021

2021.6.3

Jun 14, 2021

2021.6.2

Jun 11, 2021

2021.6.1

Jun 10, 2021

6.3.0

May 27, 2021

6.2.0

May 13, 2021

6.1.1

May 11, 2021

6.0.0

Apr 26, 2021

5.0.0b1 pre-release

Feb 20, 2021

5.0.0b0 pre-release

Feb 16, 2021

4.2.0b3 pre-release

Dec 27, 2020

4.2.0b2 pre-release

Dec 21, 2020

This version

4.2.0b1 pre-release

Dec 20, 2020

4.2.0b0 pre-release

Dec 18, 2020

4.1.2b0 pre-release

Dec 5, 2020

4.1.1b0 pre-release

Dec 5, 2020

4.1.0b0 pre-release

Dec 4, 2020

4.0.2b0 pre-release

Nov 30, 2020

4.0.1b0 pre-release

Nov 29, 2020

4.0.0b0 pre-release

Nov 28, 2020

3.2.0a0 pre-release

Nov 21, 2020

3.1.0a1 pre-release

Nov 12, 2020

3.1.0a0 pre-release

Nov 11, 2020

3.0.0a1 pre-release

Nov 10, 2020

3.0.0a0 pre-release

Nov 9, 2020

2.0.0a4 pre-release

Nov 6, 2020

2.0.0a3 pre-release

Nov 6, 2020

2.0.0a2 pre-release

Nov 5, 2020

2.0.0a1 pre-release

Nov 5, 2020

1.1.0a1 pre-release

Nov 5, 2020

1.0.0a1 pre-release

Oct 30, 2020

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

tankobon-4.2.0b1.tar.gz (27.1 kB view hashes)

Uploaded Dec 20, 2020 Source

Built Distribution

tankobon-4.2.0b1-py3-none-any.whl (17.9 kB view hashes)

Uploaded Dec 20, 2020 Python 3

Hashes for tankobon-4.2.0b1.tar.gz

Hashes for tankobon-4.2.0b1.tar.gz
Algorithm	Hash digest
SHA256	`f713ba4f0547969b057489e0a2c7c26a1802f6bfcd7766ad6667ffef1a064aa4`
MD5	`c2ccca4d8df737200e37223a222d05bb`
BLAKE2b-256	`33222bbd9a928b53fd9e499bae70248772ae0733a3cfeaf813f920127d8db2ae`

Hashes for tankobon-4.2.0b1-py3-none-any.whl

Hashes for tankobon-4.2.0b1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`48f6225a4cfac7ca2f7ac78dc5b3c5c37965f2971ca2c026c5f5ad97c1ecc9b9`
MD5	`43eee6f652fb5a5fe03c52e953874f7f`
BLAKE2b-256	`e743ad3ae255423836d982c17b7000082a804bdd490ffbd3c6f51a560c59840f`