How much does it cost to self-host NumKong: Mixed Precision for All?

NumKong: Mixed Precision for All can be self-hosted starting at $4.13/mo on hetzner CAX11. Detected requirements: 4GB RAM, 40GB disk.

Is NumKong: Mixed Precision for All actively maintained?

NumKong: Mixed Precision for All has a composite health score of 70/100 across activity, maturity, community, security, sustainability, and adoption. See /methodology/ for the formula.

How hard is NumKong: Mixed Precision for All to self-host?

NumKong: Mixed Precision for All has a self-hosting difficulty score of 2/5 (1 = one-click, 5 = complex Kubernetes setup).

NumKong: Mixed Precision for All by ashvardanian

C Apache-2.0

SIMD-accelerated distances, dot products, matrix ops, geospatial & geometric kernels for 16 numeric types, from 6-bit floats to 64-bit complex, across x86, Arm, RISC-V, and WASM, with bindings for Python, Rust, C, C++, Swift, JS, and Go

arm-neonassemblymetricssimdinformation-retrievalnumpyscipyvector-searchblasmatrix-multiplicationrusttensor

Verdict 70/100 health $4.13/mo cheapest, hetzner 2/5 setup difficulty Last release 2 months ago

Deploy NumKong: Mixed Precision for All on hetzner → View on GitHub

Health score
 70 /100 
6-dim composite

Self-hosts from

$4.13 /mo

hetzner · CAX11

Difficulty

2 /5

Docker + read README

GitHub stars

1.8k

117 forks

About NumKong: Mixed Precision for All

From the project's README at github.com/ashvardanian/NumKong. Lightly cleaned for readability; for the full source see the upstream repo.

Portable mixed-precision math, linear-algebra, & retrieval library with 2'000+ SIMD kernels for x86, Arm, RISC-V, LoongArch, Power, & WebAssembly, leveraging rare algebraic transforms with both 1D & 2D registers like AMX & SME, covering 15+ numeric types from 4-bit integers & 6-bit floats to 128-bit complex numbers, validated against 118-bit extended-precision baselines with saturation, casting, & rounding edge-case coverage, in a 5-100x smaller binary than other BLAS-like alternatives, co-designed with Tensor abstractions in C++, Python, Rust, JavaScript, GoLang, & Swift. Latency, Throughput, & Numerical Stability

Most libraries return dot products in the same type as the input, Float16 × Float16 → Float16, Int8 × Int8 → Int8. This leads to quiet overflow: a 2048-dimensional dot product can reach ±10 million, but maxes out at 127. NumKong promotes to wider accumulators, Float16 → Float32, BFloat16 → Float32, Int8 → Int32, Float32 → Float64, so results stay in range.

Health score breakdown

6-dimension composite. See methodology for formula and weights.

activity

maturity

community

security

sustainability

adoption

Adoption signals

Real-world usage data, pulled from each registry. The bigger the numbers, the more battle-tested the project.

Signal	Value	Source
GitHub stars	1.8k	github.com/ashvardanian/NumKong
GitHub forks	117	github.com/ashvardanian/NumKong

Release & maintenance

Is this project actively maintained, or about to die? Check the recency of last commit and last release.

Project age	3.3 years	since Mar 2023
Last commit	2 months ago	May 4, 2026
Releases shipped	172	last: 2 months ago

Self-hosting cost across providers

Detected requirements: 4GB RAM, 40GB disk minimum. Cheapest plan per provider that meets the requirement.

Provider	Plan	Specs	Monthly
hetzner	CAX11	2c · 4GB · 40GB	$4.13 USD	Deploy →
vultr	VC2	1c · 1GB · 25GB	$5 USD	Deploy →
linode	Nanode 1GB	1c · 1GB · 25GB	$5.12 USD	Deploy →
digitalocean	Basic Regular 1GB	1c · 1GB · 25GB	$6 USD	Deploy →