# 405b
Here are 3 public repositories matching this topic...
Lossless 5-bit transformer compression. 22 architectures shipped (0.6B-405B incl. dense + MoE + SSM), 14 PPL-verified. Hermes-3-405B 1.0066x, Mistral-7B 1.00548x, Mixtral-8x7B 1.00368x. SHA-256-verifiable bit-identical reconstruction. OpenAI-compatible API at api.sipsalabs.com. pip install ultracompress
python compression cuda inference pytorch transformer lossless quantization mlops deep-tech openai-api llm patent-pending ai-infrastructure 405b consumer-gpu 5-bit sipsa-labs experimental-tech
Updated May 15, 2026 - Python
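The repository description mentions SHA-256-verifiable bit-identical reconstruction but does not say which files are hashed or how the `ultracompress` package exposes that check. As a minimal sketch, assuming verification amounts to comparing the SHA-256 digests of an original file and its reconstructed copy, the idea can be illustrated with the Python standard library alone. The function names `sha256_of_file` and `is_bit_identical` are hypothetical and are not part of `ultracompress`.

```python
import hashlib


def sha256_of_file(path, chunk_size=1 << 20):
    """Return the hex SHA-256 digest of a file, read in chunks.

    Hypothetical helper for illustration; not an ultracompress API.
    Chunked reading keeps memory use flat for very large weight files.
    """
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        # iter() with a sentinel stops when read() returns empty bytes.
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def is_bit_identical(original_path, reconstructed_path):
    """Assumed check: two files count as identical if their digests match."""
    return sha256_of_file(original_path) == sha256_of_file(reconstructed_path)
```

A caller would pass the path of the reference file and the path of the reconstructed file; a `True` result means the two digests match, which is the property the description calls bit-identical.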