A Byte Pair Encoding (BPE) tokenizer implemented from scratch in Python. It features an interactive Streamlit playground to visualize token merging, trace vocabulary merge rules, analyze compression ratios, and verify lossless decoding in real time.
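
To make the ideas in the description concrete, here is a minimal byte-level BPE sketch. It is not taken from this repository: the function names (`train_bpe`, `merge_pair`, `encode`, `decode`), the choice of UTF-8 bytes as the base vocabulary, and the stopping rule (stop when no pair occurs at least twice) are all assumptions made for illustration. It shows learning merge rules, applying them in the order learned, and decoding back to the original text.

```python
from collections import Counter


def merge_pair(ids, pair, new_id):
    """Replace every non-overlapping occurrence of `pair` in `ids` with `new_id`."""
    out = []
    i = 0
    while i < len(ids):
        if i + 1 < len(ids) and (ids[i], ids[i + 1]) == pair:
            out.append(new_id)
            i += 2
        else:
            out.append(ids[i])
            i += 1
    return out


def train_bpe(text, num_merges):
    """Learn up to `num_merges` merge rules over the UTF-8 bytes of `text`.

    Returns a dict mapping (left_id, right_id) -> new_id, in learning order.
    Ids 0..255 are raw bytes; merged tokens start at 256 (an assumption).
    """
    ids = list(text.encode("utf-8"))
    merges = {}
    next_id = 256
    for _ in range(num_merges):
        pairs = Counter(zip(ids, ids[1:]))
        if not pairs:
            break
        pair, count = pairs.most_common(1)[0]
        if count < 2:  # nothing left worth merging
            break
        merges[pair] = next_id
        ids = merge_pair(ids, pair, next_id)
        next_id += 1
    return merges


def encode(text, merges):
    """Tokenize `text` by replaying the merge rules in the order they were learned."""
    ids = list(text.encode("utf-8"))
    for pair, new_id in merges.items():  # dicts preserve insertion order
        ids = merge_pair(ids, pair, new_id)
    return ids


def decode(ids, merges):
    """Expand each token back to its bytes and decode as UTF-8."""
    vocab = {i: bytes([i]) for i in range(256)}
    for (left, right), new_id in merges.items():
        vocab[new_id] = vocab[left] + vocab[right]
    return b"".join(vocab[i] for i in ids).decode("utf-8")


if __name__ == "__main__":
    sample = "low lower lowest low low"
    rules = train_bpe(sample, num_merges=10)
    tokens = encode(sample, rules)
    ratio = len(sample.encode("utf-8")) / len(tokens)
    print("merge rules:", rules)
    print("tokens:", tokens)
    print("compression ratio (bytes per token):", round(ratio, 2))
    print("lossless:", decode(tokens, rules) == sample)
```

In this sketch the compression ratio is simply the number of input bytes divided by the number of output tokens, and the lossless check is a round trip through `encode` and `decode`. Working at the byte level means any UTF-8 string can be encoded without an unknown-token fallback, at the cost of longer sequences for non-ASCII text before merges are learned.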