

Project description

transformers-keras


Transformer-based models implemented in TensorFlow 2.x (Keras).

Installation

pip install -U transformers-keras

Models

BERT

Supported pretrained models:

Feature Extraction Examples:

import tensorflow as tf

from transformers_keras import Bert

# Used for direct prediction
model = Bert.from_pretrained('/path/to/pretrained/bert/model')
# segment_ids and attention_mask inputs are optional
input_ids = tf.constant([[1, 2, 3, 4, 5, 6, 7, 8, 9, 10]])
segment_ids, attention_mask = None, None
sequence_outputs, pooled_output = model(input_ids, segment_ids, attention_mask, training=False)
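
If your batch contains padded sequences, you can pass segment_ids and an attention mask explicitly instead of None. A minimal sketch, assuming the usual BERT convention of id 0 for padding and a mask value of 1 for real tokens:

import tensorflow as tf
from transformers_keras import Bert

model = Bert.from_pretrained('/path/to/pretrained/bert/model')

# Two sequences padded to the same length with id 0
input_ids = tf.constant([
    [1, 2, 3, 4, 5, 0, 0, 0],
    [1, 2, 3, 4, 5, 6, 7, 8],
])
segment_ids = tf.zeros_like(input_ids)  # single-segment inputs
attention_mask = tf.cast(tf.not_equal(input_ids, 0), tf.int32)  # 1 for real tokens, 0 for padding

sequence_outputs, pooled_output = model(input_ids, segment_ids, attention_mask, training=False)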

Also, you can optionally get the hidden states and attention weights of each encoder layer:

import tensorflow as tf

from transformers_keras import Bert

# Used for direct prediction
model = Bert.from_pretrained(
    '/path/to/pretrained/bert/model',
    return_states=True,
    return_attention_weights=True)
input_ids = tf.constant([[1, 2, 3, 4, 5, 6, 7, 8, 9, 10]])
segment_ids, attention_mask = None, None
sequence_outputs, pooled_output, states, attn_weights = model(input_ids, segment_ids, attention_mask, training=False)

Fine-tuning Examples

import os

import tensorflow as tf
from transformers_keras import Bert

# Used for fine-tuning
def build_bert_classify_model(pretrained_model_dir, trainable=True, **kwargs):
    input_ids = tf.keras.layers.Input(shape=(None,), dtype=tf.int32, name='input_ids')
    # segment_ids and attention_mask inputs are optional
    segment_ids = tf.keras.layers.Input(shape=(None,), dtype=tf.int32, name='segment_ids')

    bert = Bert.from_pretrained(pretrained_model_dir, **kwargs)
    bert.trainable = trainable

    sequence_outputs, pooled_output = bert(input_ids, segment_ids, None)
    outputs = tf.keras.layers.Dense(2, name='output')(pooled_output)
    model = tf.keras.Model(inputs=[input_ids, segment_ids], outputs=outputs)
    model.compile(loss='binary_crossentropy', optimizer='adam')
    return model

model = build_bert_classify_model(
            pretrained_model_dir=os.path.join(BASE_DIR, 'chinese_wwm_ext_L-12_H-768_A-12'),
            trainable=True)
model.summary()
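
Once built, the classifier trains like any Keras model. A minimal sketch with random toy data (the ids, labels, and sequence length below are placeholders, not real training data):

import numpy as np

# Toy batch: 8 sequences of length 16 with fake token ids, and one-hot labels for the 2 output units
x = {
    'input_ids': np.random.randint(1, 100, size=(8, 16), dtype=np.int32),
    'segment_ids': np.zeros((8, 16), dtype=np.int32),
}
y = np.eye(2)[np.random.randint(0, 2, size=(8,))]

model.fit(x, y, batch_size=4, epochs=1)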

ALBERT

Supported pretrained models:

Feature Extraction Examples

import tensorflow as tf

from transformers_keras import Albert

# Used for direct prediction
model = Albert.from_pretrained('/path/to/pretrained/albert/model')
# segment_ids and attention_mask inputs are optional
input_ids = tf.constant([[1, 2, 3, 4, 5, 6, 7, 8, 9, 10]])
segment_ids, attention_mask = None, None
sequence_outputs, pooled_output = model(input_ids, segment_ids, attention_mask, training=False)

Also, you can optionally get the hidden states and attention weights of each encoder layer:

import tensorflow as tf

from transformers_keras import Albert

# Used for direct prediction
model = Albert.from_pretrained(
    '/path/to/pretrained/albert/model',
    return_states=True,
    return_attention_weights=True)
# segment_ids and attention_mask inputs are optional
input_ids = tf.constant([[1, 2, 3, 4, 5, 6, 7, 8, 9, 10]])
segment_ids, attention_mask = None, None
sequence_outputs, pooled_output, states, attn_weights = model(input_ids, segment_ids, attention_mask, training=False)

Fine-tuning Examples

import os

import tensorflow as tf
from transformers_keras import Albert

# Used for fine-tuning
def build_albert_classify_model(pretrained_model_dir, trainable=True, **kwargs):
    input_ids = tf.keras.layers.Input(shape=(None,), dtype=tf.int32, name='input_ids')
    # segment_ids and attention_mask inputs are optional
    segment_ids = tf.keras.layers.Input(shape=(None,), dtype=tf.int32, name='segment_ids')

    albert = Albert.from_pretrained(pretrained_model_dir, **kwargs)
    albert.trainable = trainable

    sequence_outputs, pooled_output = albert(input_ids, segment_ids, None)
    outputs = tf.keras.layers.Dense(2, name='output')(pooled_output)
    model = tf.keras.Model(inputs=[input_ids, segment_ids], outputs=outputs)
    model.compile(loss='binary_crossentropy', optimizer='adam')
    return model

model = build_albert_classify_model(
            pretrained_model_dir=os.path.join(BASE_DIR, 'albert_base'),
            trainable=True)
model.summary()
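
After fine-tuning, inference works the same way: feed integer ids for both inputs and read the two output logits. A minimal sketch with placeholder ids (not a real tokenized sentence):

import numpy as np

# A single toy sequence of length 16, padded with id 0
input_ids = np.array([[2, 17, 35, 9, 4, 8, 21, 6, 3, 0, 0, 0, 0, 0, 0, 0]], dtype=np.int32)
segment_ids = np.zeros_like(input_ids)

logits = model.predict({'input_ids': input_ids, 'segment_ids': segment_ids})
print(logits.shape)  # (1, 2), one score per class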

Advanced Usage

Here are some advanced usages:

  • Skip loading weights from checkpoint
  • Load other pretrained models

Skip loading weights from checkpoint

You can skip loading some weights from the checkpoint.

Examples:

from transformers_keras import Bert, Albert

ALBERT_MODEL_PATH = '/path/to/albert/model'
albert = Albert.from_pretrained(
    ALBERT_MODEL_PATH,
    # return_states=False,
    # return_attention_weights=False,
    skip_token_embedding=True,
    skip_position_embedding=True,
    skip_segment_embedding=True,
    skip_pooler=True,
    ...
    )

BERT_MODEL_PATH = '/path/to/bert/model'
bert = Bert.from_pretrained(
    BERT_MODEL_PATH,
    # return_states=False,
    # return_attention_weights=False,
    skip_token_embedding=True,
    skip_position_embedding=True,
    skip_segment_embedding=True,
    skip_pooler=True,
    ...
    )

All supported kwargs for skipping weights (a typical use is sketched after this list):

  • skip_token_embedding, skip loading token_embedding weights from ckpt
  • skip_position_embedding, skip loading position_embedding weights from ckpt
  • skip_segment_embedding, skip loading token_type_embedding weights from ckpt
  • skip_embedding_layernorm, skip loading the layer_norm weights of the embedding layer from ckpt
  • skip_pooler, skip loading the pooler weights of the pooler layer from ckpt
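
A typical case: if your checkpoint has no pooler variables, or you only need token-level features, you can skip the pooler. A minimal sketch; note that leaving the skipped weights at their random initialization is an assumption about the library's behavior, not something stated above:

import tensorflow as tf
from transformers_keras import Bert

bert = Bert.from_pretrained('/path/to/bert/model', skip_pooler=True)  # pooler weights are not loaded from ckpt

input_ids = tf.constant([[1, 2, 3, 4, 5, 6, 7, 8, 9, 10]])
sequence_outputs, _ = bert(input_ids, None, None, training=False)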

Load other pretrained models

If you want to load models pretrained by other implementations, whose config and trainable weights differ slightly from the above, you can subclass AbstractAdapter to adapt these models:

from transformers_keras.adapters import AbstractAdapter
from transformers_keras import Bert, Albert

# load custom bert models
class MyBertAdapter(AbstractAdapter):

    def adapte_config(self, config_file, **kwargs):
        # adapt the model config here
        # you can refer to `transformers_keras.adapters.bert_adapter`
        pass

    def adapte_weights(self, model, config, ckpt, **kwargs):
        # adapt the model weights here
        # you can refer to `transformers_keras.adapters.bert_adapter`
        pass

bert = Bert.from_pretrained('/path/to/your/bert/model', adapter=MyBertAdapter())

# or, load custom albert models
class MyAlbertAdapter(AbstractAdapter):

    def adapte_config(self, config_file, **kwargs):
        # adapt the model config here
        # you can refer to `transformers_keras.adapters.albert_adapter`
        pass

    def adapte_weights(self, model, config, ckpt, **kwargs):
        # adapt the model weights here
        # you can refer to `transformers_keras.adapters.albert_adapter`
        pass

albert = Albert.from_pretrained('/path/to/your/albert/model', adapter=MyAlbertAdapter())
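
As a rough illustration of what the two hooks might do, here is a sketch; the config keys, the name-mapping helper, and the assumption that adapte_config returns a plain dict are placeholders rather than the library's canonical adapter implementation:

import json

import tensorflow as tf
from transformers_keras.adapters import AbstractAdapter

def to_checkpoint_name(weight_name):
    # Hypothetical mapping from Keras weight names to checkpoint variable names;
    # the real mapping depends on how the pretrained checkpoint was exported.
    return weight_name.split(':')[0]

class MyCustomBertAdapter(AbstractAdapter):

    def adapte_config(self, config_file, **kwargs):
        # Read the original JSON config and rename its keys to whatever
        # transformers_keras expects (compare `transformers_keras.adapters.bert_adapter`).
        with open(config_file) as f:
            config = json.load(f)
        return {
            'vocab_size': config['vocab_size'],
            'hidden_size': config['hidden_size'],
            # ... map the remaining keys here
        }

    def adapte_weights(self, model, config, ckpt, **kwargs):
        # Read tensors from the TF checkpoint and assign them to the matching Keras weights.
        reader = tf.train.load_checkpoint(ckpt)
        for weight in model.trainable_weights:
            weight.assign(reader.get_tensor(to_checkpoint_name(weight.name)))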

