edgar

Scrape data from SEC's EDGAR

Project description

A small library to access files from SEC’s edgar.

Installation

pip install edgar

Example

To get a company’s latest 5 10-Ks, run

from edgar import Company
company = Company("Oracle Corp", "0001341439")
tree = company.get_all_filings(filing_type = "10-K")
docs = edgar.get_documents(tree, no_of_documents=5)

from edgar import Company, TXTML

company = Company("INTERNATIONAL BUSINESS MACHINES CORP", "0000051143")
doc = company.get_10K()
text = TXTML.parse_full_10K(doc)

To get all companies and find a specific one, run

from edgar import Edgar
edgar = Edgar()
possible_companies = edgar.find_company_name("Cisco System")

To get XBRL data, run

from edgar import Company, XBRL, XBRLElement

company = Company("Oracle Corp", "0001341439")
results = company.get_data_files_from_10K("EX-101.INS", isxml=True)
xbrl = XBRL(results[0])
XBRLElement(xbrl.relevant_children_parsed[15]).to_dict() // returns a dictionary of name, value, and schemaRef

API

Company

The Company class has two fields:

name (company name)
cik (company CIK number)

get_filings_url

Returns a url to fetch filings data

Input
- filing_type: The type of document you want. i.e. 10-K, S-8, 8-K. If not specified, it’ll return all documents
- prior_to: Time prior which documents are to be retrieved. If not specified, it’ll return all documents
- ownership: defaults to include. Options are include, exclude, only.
- no_of_entries: defaults to 100. Returns the number of entries to be returned. Maximum is 100.

get_all_filings

Returns the HTML in the form of lxml.html

Input
- filing_type: The type of document you want. i.e. 10-K, S-8, 8-K. If not specified, it’ll return all documents
- prior_to: Time prior which documents are to be retrieved. If not specified, it’ll return all documents
- ownership: defaults to include. Options are include, exclude, only.
- no_of_entries: defaults to 100. Returns the number of entries to be returned. Maximum is 100.

get_10Ks

Returns the HTML in the form of lxml.html of concatenation of all the documents in the 10-K

Input
- no_of_documents (default: 1): numer of documents to be retrieved

get_document_type_from_10K

Returns the HTML in the form of lxml.html of the document within 10-K

Input
- document_type: Tye type of document you want, i.e. 10-K, EX-3.2
- no_of_documents (default: 1): numer of documents to be retrieved

get_data_files_from_10K

Returns the HTML in the form of lxml.html of the data file within 10-K

Input
- document_type: Tye type of document you want, i.e. EX-101.INS
- no_of_documents (default: 1): numer of documents to be retrieved
- isxml (default: False): by default, things aren’t case sensitive and is parsed with html in lxml. If this is True, then it is parsed withetree` which is case sensitive

Edgar

Gets all companies from EDGAR

get_cik_by_company_name

Input
- name: name of the company

get_company_name_by_cik

Input
- cik: cik of the company

find_company_name

Input
- words: input words to search the company

get_documents

Returns a list of strings, each string contains the body of the specified document from input

Input
- tree: lxml.html form that is returned from Company.getAllFilings
- no_of_documents: number of document returned. If it is 1, the returned result is just one string, instead of a list of strings. Defaults to 1.

XBRL

Parses data from XBRL

relevant_children
- get children that are not context
relevant_children_parsed
- get children that are not context, unit, schemaRef
- cleans tags

Project details

Release history Release notifications | RSS feed

5.6.3

Oct 13, 2024

5.6.2

Oct 12, 2024

5.6.0

Oct 9, 2024

5.5.2

Oct 9, 2024

5.5.1

Oct 9, 2024

5.5.0

Oct 9, 2024

5.4.3

Nov 15, 2022

5.4.2

Nov 15, 2022

5.4.1

Aug 29, 2020

5.4.0

Aug 29, 2020

5.3.8

Jul 18, 2020

5.3.7

Jul 14, 2020

5.3.6

Jul 14, 2020

5.3.5

Jul 14, 2020

5.3.4

Jul 6, 2020

5.3.3

Jul 6, 2020

5.3.2

Jul 6, 2020

5.3.1

Jul 6, 2020

5.3.0

Jul 6, 2020

5.2.0

Jun 14, 2020

5.1.16

May 31, 2020

5.1.15

Mar 29, 2020

5.1.14

Feb 17, 2020

5.1.13

Feb 17, 2020

5.1.12

Feb 17, 2020

5.1.11

Feb 17, 2020

5.1.10

Feb 1, 2020

5.1.9

Jan 6, 2020

5.1.8

Jan 5, 2020

5.1.7

Dec 29, 2019

5.1.6

Dec 29, 2019

5.1.5

Dec 29, 2019

5.1.4

Dec 29, 2019

5.1.3

Dec 29, 2019

5.1.2

Dec 28, 2019

5.1.1

Dec 28, 2019

5.1.0

Dec 28, 2019

5.0.0

Dec 28, 2019

4.1.12

Dec 17, 2019

This version

4.1.11

Dec 16, 2019

4.1.10

Dec 16, 2019

4.1.9

Dec 15, 2019

4.1.8

Dec 15, 2019

4.1.7

Dec 15, 2019

4.1.6

Dec 15, 2019

4.1.5

Dec 15, 2019

4.1.4

Dec 15, 2019

4.1.3

Dec 15, 2019

4.1.2

Dec 15, 2019

4.1.1

Dec 15, 2019

4.1.0

Dec 15, 2019

4.0.0

Dec 8, 2019

3.0.1

Nov 2, 2019

3.0.0

Oct 26, 2019

2.0.4

Oct 19, 2019

2.0.3

Oct 19, 2019

2.0.2

Oct 19, 2019

2.0.1

Oct 19, 2019

2.0.0

Oct 19, 2019

1.0.0

Jan 7, 2018

0.3.0

Jul 18, 2017

0.2.0

Jul 12, 2017

0.1

Jun 21, 2017

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

edgar-4.1.11.tar.gz (6.1 kB view details)

Uploaded Dec 16, 2019 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

edgar-4.1.11-py3-none-any.whl (7.0 kB view details)

Uploaded Dec 16, 2019 Python 3

File details

Details for the file edgar-4.1.11.tar.gz.

File metadata

Download URL: edgar-4.1.11.tar.gz
Upload date: Dec 16, 2019
Size: 6.1 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.6.0 requests-toolbelt/0.9.1 tqdm/4.40.0 CPython/3.6.9

File hashes

Hashes for edgar-4.1.11.tar.gz
Algorithm	Hash digest
SHA256	`62f093f7032aa37fc083777ce4b35cf3ac687fcb76dff96c499b978ad95fbd91`
MD5	`73a17e80442887e46328590274c0f078`
BLAKE2b-256	`2b94fa5e436cbab85271933d0d9c8dee6f43e45e1bb1ee879d021f40fe60865a`

See more details on using hashes here.

File details

Details for the file edgar-4.1.11-py3-none-any.whl.

File metadata

Download URL: edgar-4.1.11-py3-none-any.whl
Upload date: Dec 16, 2019
Size: 7.0 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.6.0 requests-toolbelt/0.9.1 tqdm/4.40.0 CPython/3.6.9

File hashes

Hashes for edgar-4.1.11-py3-none-any.whl
Algorithm	Hash digest
SHA256	`049887723a2630db38b88ee73c82563b81a316fd70b4bf75724227c8232e3ff6`
MD5	`d457515d9db15f008e16d77465149a63`
BLAKE2b-256	`122735e29490cf311dc03911e5deeb33976e67cc1f2ac692093d46ec1dcee512`

See more details on using hashes here.

edgar 4.1.11

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Project description

Installation

Example

API

Company

get_filings_url

get_all_filings

get_10Ks

get_document_type_from_10K

get_data_files_from_10K

Edgar

get_cik_by_company_name

get_company_name_by_cik

find_company_name

get_documents

XBRL

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes