Skip to main content

HuggingFace runtime for MLServer

Project description

HuggingFace runtime for MLServer

This package provides a MLServer runtime compatible with HuggingFace Transformers.

Usage

You can install the runtime, alongside mlserver, as:

pip install mlserver mlserver-huggingface

For further information on how to use MLServer with HuggingFace, you can check out this worked out example.

Settings

The HuggingFace runtime exposes a couple extra parameters which can be used to customise how the runtime behaves. These settings can be added under the parameters.extra section of your model-settings.json file, e.g.

---
emphasize-lines: 5-8
---
{
  "name": "qa",
  "implementation": "mlserver_huggingface.HuggingFaceRuntime",
  "parameters": {
    "extra": {
      "task": "question-answering",
      "optimum_model": true
    }
  }
}
These settings can also be injected through environment variables prefixed with `MLSERVER_MODEL_HUGGINGFACE_`, e.g.

```bash
MLSERVER_MODEL_HUGGINGFACE_TASK="question-answering"
MLSERVER_MODEL_HUGGINGFACE_OPTIMUM_MODEL=true
```

Reference

You can find the full reference of the accepted extra settings for the HuggingFace runtime below:

.. autopydantic_settings:: mlserver_huggingface.settings.HuggingFaceSettings

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mlserver-huggingface-1.3.2rc1.tar.gz (15.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

mlserver_huggingface-1.3.2rc1-py3-none-any.whl (20.6 kB view details)

Uploaded Python 3

File details

Details for the file mlserver-huggingface-1.3.2rc1.tar.gz.

File metadata

  • Download URL: mlserver-huggingface-1.3.2rc1.tar.gz
  • Upload date:
  • Size: 15.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.8.16

File hashes

Hashes for mlserver-huggingface-1.3.2rc1.tar.gz
Algorithm Hash digest
SHA256 bd56ae7862ea2276b1e0b7752df9840b4875b0e00b2d7d587493a3682990e165
MD5 3414debf0cc0db8d54e4c4c347c3b110
BLAKE2b-256 59d36f1464b9a7e66799cb71ba7621c21ddab7b6506baa1bb81093f2b4d04df8

See more details on using hashes here.

File details

Details for the file mlserver_huggingface-1.3.2rc1-py3-none-any.whl.

File metadata

File hashes

Hashes for mlserver_huggingface-1.3.2rc1-py3-none-any.whl
Algorithm Hash digest
SHA256 aacd838ab75ee72e8ae23e982756ac2022869b0b2b329d42014ab300f707cc78
MD5 2110bc72d62df61d3c00ffabb1cd1495
BLAKE2b-256 986ed4e03e0520d5312aefe98964850acaa10e24aa3412e32e7186220cea47cb

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page