A coverage-guided fuzzer for Python and Python extensions.

Atheris: A Coverage-Guided, Native Python Fuzzer

Atheris is a coverage-guided Python fuzzing engine. It supports fuzzing of Python code as well as native extensions written for CPython. Atheris is based on libFuzzer. When fuzzing native code, Atheris can be used in combination with Address Sanitizer or Undefined Behavior Sanitizer to catch extra bugs.

Installation Instructions

Atheris supports Linux (32- and 64-bit) and Mac OS X, Python versions 3.6-3.9.

You can install prebuilt versions of Atheris with pip:

pip3 install atheris

These wheels come with a built-in libFuzzer, which is fine for fuzzing Python code. If you plan to fuzz native extensions, you may need to build from source to ensure the libFuzzer version in Atheris matches your Clang version.

Building from Source

Atheris relies on libFuzzer, which is distributed with Clang. If you have a sufficiently new version of clang on your path, installation from source is as simple as:

# Build latest release from source
pip3 install --no-binary atheris atheris
# Build development code from source
git clone https://github.com/google/atheris.git
cd atheris
pip3 install .

If you don't have clang installed or it's too old, you'll need to download and build the latest version of LLVM. Follow the instructions in Installing Against New LLVM below.

Mac

Apple Clang doesn't come with libFuzzer, so you'll need to install a new version of LLVM from head. Follow the instructions in Installing Against New LLVM below.

Installing Against New LLVM

# Building LLVM
git clone https://github.com/llvm/llvm-project.git
cd llvm-project
mkdir build
cd build
cmake -DLLVM_ENABLE_PROJECTS='clang;compiler-rt' -G "Unix Makefiles" ../llvm
make -j 10  # This step is very slow

# Installing Atheris
CLANG_BIN="$(pwd)/bin/clang" pip3 install <whatever>

Using Atheris

Example:

import atheris

with atheris.instrument_imports():
  import some_library
  import sys

def TestOneInput(data):
  some_library.parse(data)

atheris.Setup(sys.argv, TestOneInput)
atheris.Fuzz()

When fuzzing Python, Atheris will report a failure if the Python code under test raises an uncaught exception.
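
Because any uncaught exception counts as a failure, a fuzz target usually catches the exceptions that the code under test is expected to raise on malformed input, so that only unexpected ones reach Atheris. The following is a minimal sketch under that assumption. It uses the standard-library json module as a stand-in for some_library, and the main() wrapper is an illustrative name, not part of Atheris.

```python
import json
import sys


def TestOneInput(data):
  """Fuzz target: feed arbitrary bytes to json.loads."""
  try:
    json.loads(data.decode("utf-8"))
  except ValueError:
    # Expected for malformed input: UnicodeDecodeError and
    # json.JSONDecodeError are both subclasses of ValueError.
    return
  # Any other exception propagates and is reported as a failure.


def main():
  # Imported here so the target above can be exercised without Atheris.
  import atheris
  # json was imported before this point, so instrument_imports() cannot
  # cover it; instrument_all() goes right before Setup().
  atheris.instrument_all()
  atheris.Setup(sys.argv, TestOneInput)
  atheris.Fuzz()  # Does not return.
```

Calling main() at the bottom of the script would start the fuzzer.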

Python coverage

Atheris collects Python coverage information by instrumenting bytecode. There are 3 options for adding this instrumentation to the bytecode:

  • You can instrument the libraries you import:
    with atheris.instrument_imports():
      import foo
      from bar import baz
    
    This will cause instrumentation to be added to foo and bar, as well as any libraries they import.
  • Or, you can instrument individual functions:
    @atheris.instrument_func
    def my_function(foo, bar):
      print("instrumented")
    
  • Or finally, you can instrument everything:
    atheris.instrument_all()
    
    Put this right before atheris.Setup(). This will find every Python function currently loaded in the interpreter, and instrument it. This might take a while.

Why am I getting "No interesting inputs were found"?

You might see this error:

ERROR: no interesting inputs were found. Is the code instrumented for coverage? Exiting.

You'll get this error if the first 2 calls to TestOneInput didn't produce any coverage events. Even if you have instrumented some Python code, this can happen if the instrumentation isn't reached in those first 2 calls (for example, because you have a nontrivial TestOneInput). You can resolve this by adding an atheris.instrument_func decorator to TestOneInput, using atheris.instrument_all(), or moving your TestOneInput function into an instrumented module.
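
A variation on the first fix, sketched under the assumption that the interesting logic lives in TestOneInput itself: since instrument_func returns the function it instruments, the target can be instrumented at the point where it is handed to Setup(). The magic-prefix check and main() are hypothetical and exist only for illustration.

```python
import sys


def TestOneInput(data):
  # Nontrivial target: short inputs return before reaching any other
  # instrumented code, so early calls may produce no coverage unless
  # this function is instrumented too.
  if len(data) < 4:
    return
  if data[:4] == b"FUZZ":
    raise RuntimeError("reached the hypothetical bug")


def main():
  import atheris
  # instrument_func instruments TestOneInput in place and returns it.
  atheris.Setup(sys.argv, atheris.instrument_func(TestOneInput))
  atheris.Fuzz()  # Does not return.
```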

Fuzzing Native Extensions

In order for fuzzing native extensions to be effective, your native extensions must be instrumented. See Native Extension Fuzzing for instructions.

Integration with OSS-Fuzz

Atheris is fully supported by OSS-Fuzz, Google's continuous fuzzing service for open source projects. For integrating with OSS-Fuzz, please see https://google.github.io/oss-fuzz/getting-started/new-project-guide/python-lang.

API

The atheris module provides three key functions: instrument_imports(), Setup() and Fuzz().

In your source file, import all libraries you wish to fuzz inside a with atheris.instrument_imports(): block, like this:

# library_a will not get instrumented
import library_a

with atheris.instrument_imports():
    # library_b will get instrumented
    import library_b

Generally, it's best to import atheris first and then import all other libraries inside of a with atheris.instrument_imports() block.

Next, define a fuzzer entry point function and pass it to atheris.Setup() along with the fuzzer's arguments (typically sys.argv). Finally, call atheris.Fuzz() to start fuzzing. You must call atheris.Setup() before atheris.Fuzz().

instrument_imports(include=[], exclude=[])

  • include: A list of fully-qualified module names that shall be instrumented.
  • exclude: A list of fully-qualified module names that shall NOT be instrumented.

This should be used together with a with-statement. All modules imported in said statement will be instrumented. However, because Python imports all modules only once, this cannot be used to instrument any previously imported module, including modules required by Atheris. To add coverage to those modules, use instrument_all() instead.

A full list of the modules that are already imported, and therefore cannot be instrumented this way, can be retrieved as follows:

import sys
import atheris
print(sys.modules.keys())
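
A related check, as a small sketch: before entering the instrument_imports() block, you can test whether a module you intend to instrument is already present in sys.modules, in which case the block will not instrument it. The helper name already_imported is hypothetical.

```python
import sys


def already_imported(module_names):
  """Return the names from module_names that are already loaded."""
  return [name for name in module_names if name in sys.modules]


# Any module listed in the result was imported earlier and would need
# instrument_all() rather than instrument_imports().
print(already_imported(["sys", "some_library"]))
```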

instrument_func(func)

  • func: The function to instrument.

This will instrument the specified Python function and then return func. This is typically used as a decorator, but it can also be called directly on an existing function. Note that func is instrumented in place, so this will affect all call points of the function.

This cannot be called on a bound method - call it on the unbound version.
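
For illustration, a bound method exposes its underlying function as __func__, which is the same object as the attribute looked up on the class. A sketch, where Parser and unbound are hypothetical names:

```python
class Parser:
  def parse(self, data):
    return data.split(b",")


def unbound(func):
  """Return the plain function behind a bound method, or func itself."""
  return getattr(func, "__func__", func)


parser = Parser()
target = unbound(parser.parse)  # Same object as Parser.parse.
# target (not parser.parse) is what would be passed to
# atheris.instrument_func.
```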

instrument_all()

This will scan over all objects in the interpreter and call instrument_func on every Python function. This works even on core Python interpreter functions, something which instrument_imports cannot do.

This function is experimental.

Setup(args, test_one_input, internal_libfuzzer=None)

  • args: A list of strings: the process arguments to pass to the fuzzer, typically sys.argv. This argument list may be modified in-place, to remove arguments consumed by the fuzzer. See the LibFuzzer docs for a list of such options.
  • test_one_input: your fuzzer's entry point. Must take a single bytes argument. This will be repeatedly invoked with a single bytes container.
  • internal_libfuzzer: Indicates whether libFuzzer will be provided by Atheris or by an external library (see using_sanitizers.md). If unspecified, Atheris will determine this automatically. If fuzzing pure Python, leave this unspecified or set it to True.

Fuzz()

This starts the fuzzer. You must have called Setup() before calling this function. This function does not return.

In many cases Setup() and Fuzz() could be combined into a single function, but they are separated because you may want the fuzzer to consume the command-line arguments it handles before passing any remaining arguments to another setup function.
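
A sketch of that split, assuming Setup() has removed the arguments it consumed from the list passed to it, as the description of args above allows: the remaining arguments go to the harness's own parser. The --mode flag, parse_harness_args, and main() are hypothetical names used only to show the call order.

```python
import argparse
import sys


def parse_harness_args(argv):
  """Parse harness-specific flags from argv (argv[0] is the program name)."""
  parser = argparse.ArgumentParser()
  parser.add_argument("--mode", default="strict")  # Hypothetical flag.
  # parse_known_args tolerates any arguments this parser does not define.
  options, _unknown = parser.parse_known_args(argv[1:])
  return options


def TestOneInput(data):
  pass  # Placeholder target.


def main():
  import atheris
  argv = list(sys.argv)
  atheris.Setup(argv, TestOneInput)   # Fuzzer arguments are handled here.
  options = parse_harness_args(argv)  # Leftovers go to the harness.
  print("mode:", options.mode)
  atheris.Fuzz()  # Does not return.
```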

FuzzedDataProvider

Often, a bytes object is not convenient input to your code being fuzzed. Similar to libFuzzer, we provide a FuzzedDataProvider to translate these bytes into other input forms.

You can construct the FuzzedDataProvider with:

fdp = atheris.FuzzedDataProvider(input_bytes)

The FuzzedDataProvider then supports the following functions:

def ConsumeBytes(count: int)

Consume count bytes.

def ConsumeUnicode(count: int)

Consume unicode characters. Might contain surrogate pair characters, which according to the specification are invalid in this situation. However, many core software tools (e.g. Windows file paths) support them, so other software often needs to too.

def ConsumeUnicodeNoSurrogates(count: int)

Consume unicode characters, but never generate surrogate pair characters.
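
To illustrate the difference, here is a plain-Python sketch (not Atheris code) showing that a string containing a lone surrogate code point cannot be encoded as strict UTF-8, which is one reason a target may prefer ConsumeUnicodeNoSurrogates. The helper names are hypothetical.

```python
def has_surrogates(text):
  """True if text contains a code point in the surrogate range."""
  return any(0xD800 <= ord(ch) <= 0xDFFF for ch in text)


def encodes_as_utf8(text):
  """True if text can be encoded with the strict UTF-8 codec."""
  try:
    text.encode("utf-8")
    return True
  except UnicodeEncodeError:
    return False


print(has_surrogates("abc"), encodes_as_utf8("abc"))        # False True
print(has_surrogates("\ud800"), encodes_as_utf8("\ud800"))  # True False
```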

def ConsumeString(count: int)

Alias for ConsumeBytes in Python 2, or ConsumeUnicode in Python 3.

def ConsumeInt(bytes: int)

Consume a signed integer of the specified size (when written in two's complement notation).

def ConsumeUInt(bytes: int)

Consume an unsigned integer of the specified size.
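
As a plain-Python illustration of these sizes (not Atheris's implementation; the little-endian byte order is an arbitrary choice for the sketch), the same bytes map to different values depending on whether they are read as two's complement. The helper int_range is a hypothetical name.

```python
def int_range(size_in_bytes, signed):
  """Return (lowest, highest) value representable in size_in_bytes."""
  bits = 8 * size_in_bytes
  if signed:
    return -(1 << (bits - 1)), (1 << (bits - 1)) - 1
  return 0, (1 << bits) - 1


raw = b"\xff\xff"
print(int.from_bytes(raw, "little", signed=True))   # -1
print(int.from_bytes(raw, "little", signed=False))  # 65535
print(int_range(2, signed=True))                    # (-32768, 32767)
print(int_range(2, signed=False))                   # (0, 65535)
```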

def ConsumeIntInRange(min: int, max: int)

Consume an integer in the range [min, max].

def ConsumeIntList(count: int, bytes: int)

Consume a list of count integers of size bytes.

def ConsumeIntListInRange(count: int, min: int, max: int)

Consume a list of count integers in the range [min, max].

def ConsumeFloat()

Consume an arbitrary floating-point value. Might produce weird values like NaN and Inf.

def ConsumeRegularFloat()

Consume an arbitrary numeric floating-point value; never produces a special type like NaN or Inf.

def ConsumeProbability()

Consume a floating-point value in the range [0, 1].

def ConsumeFloatInRange(min: float, max: float)

Consume a floating-point value in the range [min, max].

def ConsumeFloatList(count: int)

Consume a list of count arbitrary floating-point values. Might produce weird values like NaN and Inf.

def ConsumeRegularFloatList(count: int)

Consume a list of count arbitrary numeric floating-point values; never produces special types like NaN or Inf.

def ConsumeProbabilityList(count: int)

Consume a list of count floats in the range [0, 1].

def ConsumeFloatListInRange(count: int, min: float, max: float)

Consume a list of count floats in the range [min, max].

def PickValueInList(l: list)

Given a list, pick a random value from it.

def ConsumeBool()

Consume either True or False.
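
Putting several of these together, here is a sketch of how a target might derive structured arguments from the raw bytes. The function build_arguments, the field names, and the limits are hypothetical. The provider is passed in as a parameter so that the derivation logic stays separate from the Atheris setup.

```python
def build_arguments(fdp):
  """Derive structured inputs from a FuzzedDataProvider."""
  width = fdp.ConsumeIntInRange(1, 64)
  strict = fdp.ConsumeBool()
  mode = fdp.PickValueInList(["fast", "safe"])
  # At most `width` characters, with no surrogate code points.
  name = fdp.ConsumeUnicodeNoSurrogates(width)
  return {"width": width, "strict": strict, "mode": mode, "name": name}
```

Inside TestOneInput, this would be called as build_arguments(atheris.FuzzedDataProvider(data)), and the resulting values passed to the code under test.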
