# transformers-keras

Transformer-based models implemented in TensorFlow 2.x (Keras).
## Installation

```bash
pip install -U transformers-keras
```
## Models

- Transformer (deleted)
  - Attention Is All You Need.
  - Here is a tutorial from TensorFlow: Transformer model for language understanding.
- BERT
- ALBERT
## BERT

Supported pretrained models:

- All the BERT models pretrained by google-research/bert
- All the BERT & RoBERTa models pretrained by ymcui/Chinese-BERT-wwm
Use the model to predict directly:

```python
from transformers_keras import Bert

model = Bert.from_pretrained('/path/to/pretrained/bert/model')
# segment_ids and mask inputs are optional
model.predict((input_ids, segment_ids, mask))
# or
model(inputs=(input_ids, segment_ids, mask))
```

Or use it for fine-tuning:

```python
import os

import tensorflow as tf

from transformers_keras import Bert


def build_bert_classify_model(pretrained_model_dir, trainable=True, **kwargs):
    input_ids = tf.keras.layers.Input(shape=(None,), dtype=tf.int32, name='input_ids')
    # segment_ids and mask inputs are optional
    segment_ids = tf.keras.layers.Input(shape=(None,), dtype=tf.int32, name='segment_ids')
    bert = Bert.from_pretrained(pretrained_model_dir, **kwargs)
    bert.trainable = trainable
    sequence_outputs, pooled_output = bert(inputs=(input_ids, segment_ids))
    outputs = tf.keras.layers.Dense(2, name='output')(pooled_output)
    model = tf.keras.Model(inputs=[input_ids, segment_ids], outputs=outputs)
    model.compile(loss='binary_crossentropy', optimizer='adam')
    return model


model = build_bert_classify_model(
    pretrained_model_dir=os.path.join(BASE_DIR, 'chinese_wwm_ext_L-12_H-768_A-12'),
    trainable=True)
model.summary()
```
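The snippets above assume you already have `input_ids`, `segment_ids`, and `mask` arrays. As a rough illustration of what those look like for a BERT-style model, here is a framework-free sketch; the tiny vocabulary and the `encode` helper are hypothetical, not part of transformers-keras (in practice, use the tokenizer and vocab.txt that ship with your pretrained model):

```python
# Hypothetical toy vocabulary; real models ship a vocab.txt with ~20k+ entries.
VOCAB = {'[PAD]': 0, '[CLS]': 101, '[SEP]': 102, 'hello': 7592, 'world': 2088}


def encode(tokens, max_len=8):
    """Wrap tokens with [CLS]/[SEP], then pad ids and mask to max_len."""
    ids = [VOCAB['[CLS]']] + [VOCAB[t] for t in tokens] + [VOCAB['[SEP]']]
    mask = [1] * len(ids) + [0] * (max_len - len(ids))  # 1 = real token, 0 = padding
    ids = ids + [VOCAB['[PAD]']] * (max_len - len(ids))
    segment_ids = [0] * max_len  # all zeros for a single-sentence input
    return ids, segment_ids, mask


input_ids, segment_ids, mask = encode(['hello', 'world'])
print(input_ids)  # [101, 7592, 2088, 102, 0, 0, 0, 0]
print(mask)       # [1, 1, 1, 1, 0, 0, 0, 0]
```

Batch these lists into arrays of shape `(batch_size, max_len)` before feeding them to the model.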
## ALBERT

Supported pretrained models:

- All the ALBERT models pretrained by google-research/albert
Use the model to predict directly:

```python
from transformers_keras import Albert

model = Albert.from_pretrained('/path/to/pretrained/albert/model')
# segment_ids and mask inputs are optional
model.predict((input_ids, segment_ids, mask))
# or
model(inputs=(input_ids, segment_ids, mask))
```

Or use it for fine-tuning:

```python
import os

import tensorflow as tf

from transformers_keras import Albert


def build_albert_classify_model(pretrained_model_dir, trainable=True, **kwargs):
    input_ids = tf.keras.layers.Input(shape=(None,), dtype=tf.int32, name='input_ids')
    # segment_ids and mask inputs are optional
    segment_ids = tf.keras.layers.Input(shape=(None,), dtype=tf.int32, name='segment_ids')
    albert = Albert.from_pretrained(pretrained_model_dir, **kwargs)
    albert.trainable = trainable
    sequence_outputs, pooled_output = albert(inputs=(input_ids, segment_ids))
    outputs = tf.keras.layers.Dense(2, name='output')(pooled_output)
    model = tf.keras.Model(inputs=[input_ids, segment_ids], outputs=outputs)
    model.compile(loss='binary_crossentropy', optimizer='adam')
    return model


model = build_albert_classify_model(
    pretrained_model_dir=os.path.join(BASE_DIR, 'albert_base'),
    trainable=True)
model.summary()
```
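The classifiers above take single sentences, where `segment_ids` can be all zeros. For sentence-pair tasks, the optional `segment_ids` input is what tells the model where one sentence ends and the other begins. A hedged, plain-Python sketch of the usual convention (the `pair_segment_ids` helper is hypothetical, not part of this library):

```python
def pair_segment_ids(len_a, len_b):
    """Segment ids for a [CLS] A [SEP] B [SEP] input: 0 for the first
    sentence and its special tokens, 1 for the second sentence and its [SEP]."""
    # +2 covers [CLS] and the first [SEP]; +1 covers the second [SEP]
    return [0] * (len_a + 2) + [1] * (len_b + 1)


print(pair_segment_ids(3, 2))  # [0, 0, 0, 0, 0, 1, 1, 1]
```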
## Load other pretrained models

If you want to load models pretrained by other implementations, whose configs and trainable weights differ slightly from the above, you can subclass `AbstractAdapter` to adapt these models:
```python
from transformers_keras.adapters import AbstractAdapter
from transformers_keras import Bert, Albert


# load custom bert models
class MyBertAdapter(AbstractAdapter):

    def adapte_config(self, config_file, **kwargs):
        # adapt the model config here
        # you can refer to `transformers_keras.adapters.bert_adapter`
        pass

    def adapte_weights(self, model, config, ckpt, **kwargs):
        # adapt the model weights here
        # you can refer to `transformers_keras.adapters.bert_adapter`
        pass


bert = Bert.from_pretrained('/path/to/your/bert/model', adapter=MyBertAdapter())


# or, load custom albert models
class MyAlbertAdapter(AbstractAdapter):

    def adapte_config(self, config_file, **kwargs):
        # adapt the model config here
        # you can refer to `transformers_keras.adapters.albert_adapter`
        pass

    def adapte_weights(self, model, config, ckpt, **kwargs):
        # adapt the model weights here
        # you can refer to `transformers_keras.adapters.albert_adapter`
        pass


albert = Albert.from_pretrained('/path/to/your/albert/model', adapter=MyAlbertAdapter())
```
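To make the config half of an adapter concrete, here is a hedged, framework-free sketch of the kind of key renaming `adapte_config` typically performs: another implementation names its hyperparameters differently, and the adapter maps them onto the names this library expects. All key names below are hypothetical; refer to `transformers_keras.adapters.bert_adapter` for the real mapping:

```python
# Hypothetical mapping from a foreign config's key names to the target names.
# The actual keys used by transformers_keras may differ.
KEY_MAP = {
    'hidden_dim': 'hidden_size',
    'n_layers': 'num_layers',
    'n_heads': 'num_attention_heads',
}


def adapt_config_dict(foreign_config):
    """Rename known foreign keys to the target names; pass the rest through."""
    return {KEY_MAP.get(k, k): v for k, v in foreign_config.items()}


adapted = adapt_config_dict({'hidden_dim': 768, 'n_layers': 12, 'vocab_size': 21128})
print(adapted)  # {'hidden_size': 768, 'num_layers': 12, 'vocab_size': 21128}
```

`adapte_weights` does the analogous job for checkpoint variables, matching each variable name in the foreign checkpoint to the corresponding weight in the Keras model.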
## File details

Details for the file `transformers_keras-0.2.2.tar.gz`.

### File metadata

- Download URL: transformers_keras-0.2.2.tar.gz
- Upload date:
- Size: 17.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.3.0 pkginfo/1.7.0 requests/2.25.1 setuptools/49.2.1 requests-toolbelt/0.9.1 tqdm/4.56.0 CPython/3.9.1

### File hashes

| Algorithm | Hash digest |
|---|---|
| SHA256 | dd1a5f975d86e0370b88b25132860a07e25569f62e4350d352903b4e8334d2d9 |
| MD5 | 19cc5a6690c8d892bf58e42656c6c0d3 |
| BLAKE2b-256 | b57e3f7fcddaef87bc90cd9155765e3c2ae8a5d7911a597001f09e622b99428a |
## File details

Details for the file `transformers_keras-0.2.2-py3-none-any.whl`.

### File metadata

- Download URL: transformers_keras-0.2.2-py3-none-any.whl
- Upload date:
- Size: 28.8 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.3.0 pkginfo/1.7.0 requests/2.25.1 setuptools/49.2.1 requests-toolbelt/0.9.1 tqdm/4.56.0 CPython/3.9.1

### File hashes

| Algorithm | Hash digest |
|---|---|
| SHA256 | 535fa94d2306239780165bd245b3515c57cb8087588bbe6490a4b8e0210309e1 |
| MD5 | 906ca5cd588e6007d700836b2c9b8832 |
| BLAKE2b-256 | 5516a407820d4ae5bd7b555176feb1d3f3575a93227d6680b7fdacaacc31d077 |