Skip to main content

Library to find URLs and check their validity.

Project description

urlfinderlib

Python library for finding URLs in documents and arbitrary data and checking their validity.

Basic usage

from urlfinderlib import find_urls

with open('/path/to/file', 'rb') as f:
    print(find_urls(f.read())

base_url usage

If you are trying to find URLs inside of an HTML file, the paths in the URLs are likely relative to their location on the server hosting the HTML. You can use the base_url parameter in this case to extract these "relative" URLs.

from urlfinderlib import find_urls

with open('/path/to/file', 'rb') as f:
    print(find_urls(f.read(), base_url='http://somewebsite.com/')

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

urlfinderlib-0.11.10.tar.gz (10.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

urlfinderlib-0.11.10-py3-none-any.whl (14.2 kB view details)

Uploaded Python 3

File details

Details for the file urlfinderlib-0.11.10.tar.gz.

File metadata

  • Download URL: urlfinderlib-0.11.10.tar.gz
  • Upload date:
  • Size: 10.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.0.1 requests-toolbelt/0.9.1 tqdm/4.36.1 CPython/3.6.9

File hashes

Hashes for urlfinderlib-0.11.10.tar.gz
Algorithm Hash digest
SHA256 c37d0e320e9b64162a5a4cc3aeb2ee835a79cb94db01463c7653790bc286b174
MD5 588b853ab62434192c334bf536dada78
BLAKE2b-256 99c13657a6103d8bbebb3042eeab8d3e4191182239d809a25534e0fae5db3208

See more details on using hashes here.

File details

Details for the file urlfinderlib-0.11.10-py3-none-any.whl.

File metadata

  • Download URL: urlfinderlib-0.11.10-py3-none-any.whl
  • Upload date:
  • Size: 14.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.0.1 requests-toolbelt/0.9.1 tqdm/4.36.1 CPython/3.6.9

File hashes

Hashes for urlfinderlib-0.11.10-py3-none-any.whl
Algorithm Hash digest
SHA256 3c789aea28a74afa5aa36a9b316a31514540df70b6da8763db9643ccc168a6fe
MD5 1eeb9e9fef77302b6b7eb6c405020459
BLAKE2b-256 8c4d3bc38f5d7c43b831241708b9a129a6a458b7b6a5fdbcc7a551a105c2a694

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page