NumKong: Mixed Precision for All by ashvardanian

SIMD-accelerated distances, dot products, matrix ops, geospatial & geometric kernels for 16 numeric types, from 6-bit floats to 64-bit complex, across x86, Arm, RISC-V, and WASM, with bindings for Python, Rust, C, C++, Swift, JS, and Go

arm-neonassemblymetricssimdinformation-retrievalnumpyscipyvector-searchblasmatrix-multiplicationrusttensor
Verdict 74/100 health $4.13/mo cheapest, hetzner 2/5 setup difficulty Last release 17 days ago

Self-host NumKong: Mixed Precision for All on hetzner CAX11 for $4.13/mo.

Health score
74 /100
6-dim composite
Self-hosts from
$4.13 /mo
hetzner · CAX11
Difficulty
2 /5
Docker + read README
GitHub stars
1.8k
117 forks

About NumKong: Mixed Precision for All

From the project's README at github.com/ashvardanian/NumKong. Lightly cleaned for readability; for the full source see the upstream repo.

Portable mixed-precision math, linear-algebra, & retrieval library with 2'000+ SIMD kernels for x86, Arm, RISC-V, LoongArch, Power, & WebAssembly, leveraging rare algebraic transforms with both 1D & 2D registers like AMX & SME, covering 15+ numeric types from 4-bit integers & 6-bit floats to 128-bit complex numbers, validated against 118-bit extended-precision baselines with saturation, casting, & rounding edge-case coverage, in a 5-100x smaller binary than other BLAS-like alternatives, co-designed with Tensor abstractions in C++, Python, Rust, JavaScript, GoLang, & Swift. Latency, Throughput, & Numerical Stability

Most libraries return dot products in the same type as the input, Float16 × Float16 → Float16, Int8 × Int8 → Int8. This leads to quiet overflow: a 2048-dimensional dot product can reach ±10 million, but maxes out at 127. NumKong promotes to wider accumulators, Float16 → Float32, BFloat16 → Float32, Int8 → Int32, Float32 → Float64, so results stay in range.

Health score breakdown

6-dimension composite. See methodology for formula and weights.

activity
94
maturity
94
community
92
security
70
sustainability
65
adoption
28

Adoption signals

Real-world usage data, pulled from each registry. The bigger the numbers, the more battle-tested the project.

SignalValueSource
GitHub stars 1.8k github.com/ashvardanian/NumKong
GitHub forks 117 github.com/ashvardanian/NumKong

Release & maintenance

Is this project actively maintained, or about to die? Check the recency of last commit and last release.

Project age3.2 yearssince Mar 2023
Last commit2 days agoMay 4, 2026
Releases shipped172last: 17 days ago

Self-hosting cost across providers

Detected requirements: 4GB RAM, 40GB disk minimum. Cheapest plan per provider that meets the requirement.

ProviderPlanSpecsMonthly
hetzner CAX11 2c · 4GB · 40GB $4.13 USD Deploy →
vultr VC2 1c · 1GB · 25GB $5 USD Deploy →
linode Nanode 1GB 1c · 1GB · 25GB $5.12 USD Deploy →
digitalocean Basic Regular 1GB 1c · 1GB · 25GB $6 USD Deploy →

What people say on Hacker News

Ready to self-host NumKong: Mixed Precision for All?

Spin up a hetzner CAX11 (4GB RAM, 40GB disk) for $4.13/mo and follow the project's official install docs.

Data last refreshed May 7, 2026.

Similar open-source projects

Projects in our directory that replace the same SaaS or share topics with NumKong: Mixed Precision for All.

Frequently asked questions

Last verified . Data refreshes every 30 minutes.