turboquant-py implements the TurboQuant and QJL vector quantization algorithms from Google Research (ICLR 2026 / AISTATS 2026). It compresses high-dimensional floating-point vectors to 1-4 bits per ...
Client │ FastAPI (async, uvicorn) ├─ /ingest POST — embed + index one document ├─ /batch_ingest POST — bulk ingest (syncs to background) ├─ /search POST — query → top-K hits + scores ├─ /stats GET — ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results