Bigdatagenomics Adam by bigdatagenomics

ADAM is a genomics analysis platform with specialized file formats built using Apache Avro, Apache Spark, and Apache Parquet. Apache 2 licensed.

sparkbig-databioinformaticsgenomicsparquetavroscalajavapythonr
Verdict 70/100 health $4.13/mo cheapest, hetzner 2/5 setup difficulty Last release 4.1 years ago

Self-host Bigdatagenomics Adam on hetzner CAX11 for $4.13/mo.

Health score
70 /100
6-dim composite
Self-hosts from
$4.13 /mo
hetzner · CAX11
Difficulty
2 /5
Docker + read README
GitHub stars
1.0k
314 forks

About Bigdatagenomics Adam

From the project's README at github.com/bigdatagenomics/adam. Lightly cleaned for readability; for the full source see the upstream repo.

[](http://search.maven.org/#search%7Cga%7C1%7Corg.bdgenomics.adam) [](http://javadoc.io/doc/org.bdgenomics.adam/adam-core-spark32.12)

ADAM is a library and command line tool that enables the use of Apache Spark to parallelize genomic data analysis across cluster/cloud computing environments. ADAM uses a set of schemas to describe genomic sequences, reads, variants/genotypes, and features, and can be used with data in legacy genomic file formats such as SAM/BAM/CRAM, BED/GFF3/GTF, and VCF, as well as data stored in the columnar Apache Parquet format. On a single node, ADAM provides competitive performance to optimized multi-threaded tools, while enabling scale out to clusters with more than a thousand cores. ADAM's APIs can be used from Scala, Java, Python, R, and SQL. Why ADAM?

Over the last decade, DNA and RNA sequencing has evolved from an expensive, labor intensive method to a cheap commodity. The consequence of this is generation of massive amounts of genomic and transcriptomic data. Typically, tools to process and interpret these data are developed with a focus on excellence of the results generated, not on scalability and interoperabilit

Health score breakdown

6-dimension composite. See methodology for formula and weights.

activity
80
maturity
85
community
91
security
70
sustainability
65
adoption
29

Adoption signals

Real-world usage data, pulled from each registry. The bigger the numbers, the more battle-tested the project.

SignalValueSource
GitHub stars 1.0k github.com/bigdatagenomics/adam
GitHub forks 314 github.com/bigdatagenomics/adam

Release & maintenance

Is this project actively maintained, or about to die? Check the recency of last commit and last release.

Project age12.6 yearssince Nov 2013
Last commit3 months agoMar 17, 2026
Releases shipped62last: 4.1 years ago

Self-hosting cost across providers

Detected requirements: 4GB RAM, 40GB disk minimum. Cheapest plan per provider that meets the requirement.

ProviderPlanSpecsMonthly
hetzner CAX11 2c · 4GB · 40GB $4.13 USD Deploy →
vultr VC2 1c · 1GB · 25GB $5 USD Deploy →
linode Nanode 1GB 1c · 1GB · 25GB $5.12 USD Deploy →
digitalocean Basic Regular 1GB 1c · 1GB · 25GB $6 USD Deploy →

What people say on Hacker News

Ready to self-host Bigdatagenomics Adam?

Spin up a hetzner CAX11 (4GB RAM, 40GB disk) for $4.13/mo and follow the project's official install docs.

Data last refreshed Jun 21, 2026.

Similar open-source projects

Projects in our directory that replace the same SaaS or share topics with Bigdatagenomics Adam.

Frequently asked questions

Last verified . Health scores and costs are computed from public data.