sipsalabs / ultracompress

Lossless 5-bit transformer compression. 22 architectures shipped (0.6B-405B, including dense, MoE, and SSM), 14 PPL-verified. Hermes-3-405B 1.0066x, Mistral-7B 1.00548x, Mixtral-8x7B 1.00368x. SHA-256-verifiable bit-identical reconstruction. OpenAI-compatible API at api.sipsalabs.com. Install: pip install ultracompress

Topics: python, compression, cuda, inference, pytorch, transformer, lossless, quantization, mlops, deep-tech, openai-api, llm, patent-pending, ai-infrastructure, 405b, consumer-gpu, 5-bit, sipsa-labs, experimental-tech

Stars: 11. Language: Python. Updated May 15, 2026.
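
The description mentions SHA-256-verifiable bit-identical reconstruction. As a rough illustration of what that kind of check means, here is a minimal sketch using only Python's standard `hashlib`. It does not call `ultracompress`, whose API is not shown in this listing. The helper names `sha256_hex` and `is_bit_identical` and the stand-in payload are hypothetical.

```python
import hashlib


def sha256_hex(data: bytes) -> str:
    """Return the SHA-256 digest of a byte string as lowercase hex."""
    return hashlib.sha256(data).hexdigest()


def is_bit_identical(reference: bytes, reconstructed: bytes) -> bool:
    """Treat two buffers as bit-identical when their SHA-256 digests match."""
    return sha256_hex(reference) == sha256_hex(reconstructed)


# Stand-in payload (assumption): real use would hash serialized weight bytes.
original = bytes(range(256)) * 4

# A faithful round trip reproduces every byte.
roundtrip = bytes(original)

# Flipping a single bit in the last byte must change the digest.
corrupted = original[:-1] + bytes([original[-1] ^ 0x01])

print(is_bit_identical(original, roundtrip))
print(is_bit_identical(original, corrupted))
```

Comparing digests instead of raw buffers lets a published hash serve as the reference, so the check can run without access to the original bytes.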