NumKong: Mixed Precision for All by ashvardanian
SIMD-accelerated distances, dot products, matrix ops, geospatial & geometric kernels for 16 numeric types, from 6-bit floats to 64-bit complex, across x86, Arm, RISC-V, and WASM, with bindings for Python, Rust, C, C++, Swift, JS, and Go
About NumKong: Mixed Precision for All
From the project's README at github.com/ashvardanian/NumKong. Lightly cleaned for readability; for the full source see the upstream repo.
Portable mixed-precision math, linear-algebra, & retrieval library with 2'000+ SIMD kernels for x86, Arm, RISC-V, LoongArch, Power, & WebAssembly, leveraging rare algebraic transforms with both 1D & 2D registers like AMX & SME, covering 15+ numeric types from 4-bit integers & 6-bit floats to 128-bit complex numbers, validated against 118-bit extended-precision baselines with saturation, casting, & rounding edge-case coverage, in a 5-100x smaller binary than other BLAS-like alternatives, co-designed with Tensor abstractions in C++, Python, Rust, JavaScript, GoLang, & Swift. Latency, Throughput, & Numerical Stability
Most libraries return dot products in the same type as the input, Float16 × Float16 → Float16, Int8 × Int8 → Int8. This leads to quiet overflow: a 2048-dimensional dot product can reach ±10 million, but maxes out at 127. NumKong promotes to wider accumulators, Float16 → Float32, BFloat16 → Float32, Int8 → Int32, Float32 → Float64, so results stay in range.
Health score breakdown
6-dimension composite. See methodology for formula and weights.
Adoption signals
Real-world usage data, pulled from each registry. The bigger the numbers, the more battle-tested the project.
| Signal | Value | Source |
|---|---|---|
| GitHub stars | 1.8k | github.com/ashvardanian/NumKong |
| GitHub forks | 117 | github.com/ashvardanian/NumKong |
Release & maintenance
Is this project actively maintained, or about to die? Check the recency of last commit and last release.
| Project age | 3.2 years | since Mar 2023 |
| Last commit | 2 days ago | May 4, 2026 |
| Releases shipped | 172 | last: 17 days ago |
Self-hosting cost across providers
Detected requirements: 4GB RAM, 40GB disk minimum. Cheapest plan per provider that meets the requirement.
| Provider | Plan | Specs | Monthly | |
|---|---|---|---|---|
| hetzner | CAX11 | 2c · 4GB · 40GB | $4.13 USD | Deploy → |
| vultr | VC2 | 1c · 1GB · 25GB | $5 USD | Deploy → |
| linode | Nanode 1GB | 1c · 1GB · 25GB | $5.12 USD | Deploy → |
| digitalocean | Basic Regular 1GB | 1c · 1GB · 25GB | $6 USD | Deploy → |
What people say on Hacker News
Ready to self-host NumKong: Mixed Precision for All?
Spin up a hetzner CAX11 (4GB RAM, 40GB disk) for $4.13/mo and follow the project's official install docs.
Data last refreshed May 7, 2026.
Similar open-source projects
Projects in our directory that replace the same SaaS or share topics with NumKong: Mixed Precision for All.