Skip to content

xlm.utils.text

Text utility functions for xlm.

remove_trailing_pads(text, tokenizer)

Remove trailing pad tokens from decoded text.

Parameters:

Name Type Description Default
text str

Decoded text string that may contain trailing pad tokens

required
tokenizer Tokenizer

Tokenizer instance containing pad_token

required

Returns:

Type Description
str

Text with trailing pad tokens removed