Library to find URLs and check their validity.
Project description
urlfinderlib
Python library for finding URLs in documents and arbitrary data and checking their validity.
Basic usage
from urlfinderlib import find_urls
with open('/path/to/file', 'rb') as f:
print(find_urls(f.read())
base_url usage
If you are trying to find URLs inside of an HTML file, the paths in the URLs are likely relative to their location on the server hosting the HTML. You can use the base_url parameter in this case to extract these "relative" URLs.
from urlfinderlib import find_urls
with open('/path/to/file', 'rb') as f:
print(find_urls(f.read(), base_url='http://somewebsite.com/')
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file urlfinderlib-0.11.10.tar.gz.
File metadata
- Download URL: urlfinderlib-0.11.10.tar.gz
- Upload date:
- Size: 10.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.0.1 requests-toolbelt/0.9.1 tqdm/4.36.1 CPython/3.6.9
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
c37d0e320e9b64162a5a4cc3aeb2ee835a79cb94db01463c7653790bc286b174
|
|
| MD5 |
588b853ab62434192c334bf536dada78
|
|
| BLAKE2b-256 |
99c13657a6103d8bbebb3042eeab8d3e4191182239d809a25534e0fae5db3208
|
File details
Details for the file urlfinderlib-0.11.10-py3-none-any.whl.
File metadata
- Download URL: urlfinderlib-0.11.10-py3-none-any.whl
- Upload date:
- Size: 14.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.0.1 requests-toolbelt/0.9.1 tqdm/4.36.1 CPython/3.6.9
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
3c789aea28a74afa5aa36a9b316a31514540df70b6da8763db9643ccc168a6fe
|
|
| MD5 |
1eeb9e9fef77302b6b7eb6c405020459
|
|
| BLAKE2b-256 |
8c4d3bc38f5d7c43b831241708b9a129a6a458b7b6a5fdbcc7a551a105c2a694
|