Technique
Byte Pair Encoding (BPE)
A sub-word tokenization algorithm that builds a vocabulary by repeatedly merging the most frequent pair of adjacent tokens in the training data.
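The merge loop described above can be sketched in a few lines of Python. This is a minimal illustration, not a reference implementation: it assumes the training data is split on whitespace and that the initial tokens are single characters. The function names (`count_pairs`, `merge_pair`, `train_bpe`) and the `num_merges` parameter are hypothetical and chosen for this example.

```python
from collections import Counter


def count_pairs(words):
    """Count adjacent token pairs, weighted by how often each word occurs."""
    pairs = Counter()
    for symbols, freq in words.items():
        for a, b in zip(symbols, symbols[1:]):
            pairs[(a, b)] += freq
    return pairs


def merge_pair(words, pair):
    """Replace every occurrence of `pair` with one merged token."""
    merged = {}
    for symbols, freq in words.items():
        out, i = [], 0
        while i < len(symbols):
            if i + 1 < len(symbols) and (symbols[i], symbols[i + 1]) == pair:
                out.append(symbols[i] + symbols[i + 1])
                i += 2
            else:
                out.append(symbols[i])
                i += 1
        key = tuple(out)
        merged[key] = merged.get(key, 0) + freq
    return merged


def train_bpe(corpus, num_merges):
    """Learn up to `num_merges` merge rules from a whitespace-split corpus."""
    # Assumption: each word starts out as a sequence of single characters.
    words = Counter(tuple(word) for word in corpus.split())
    merges = []
    for _ in range(num_merges):
        pairs = count_pairs(words)
        if not pairs:
            break  # every word is already a single token
        best = max(pairs, key=pairs.get)  # ties go to the pair seen first
        merges.append(best)
        words = merge_pair(words, best)
    return merges


merges = train_bpe("low low low lower lowest", 2)
print(merges)
```

Each learned merge adds one new token to the vocabulary, so the number of merges controls the final vocabulary size. Production tokenizers typically start from bytes rather than characters and use more efficient pair counting, which this sketch leaves out.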