Amber

Append-only storage for logs and traces. One binary, one directory, HTTP + gRPC API. Think "SQLite for observability".

Features

Append-only segments with zstd compression and per-block min/max stats in segment footer
Write-Ahead Log for crash recovery
Bitmap indexes (Roaring Bitmap) for fast field filtering (service, level, host)
Full-text search index for log body
Ribbon filters for high-cardinality fields (trace_id)
Sparse index for time-based segment pruning (skips 95%+ data without I/O)
OTLP compatible — gRPC (:4317) and HTTP endpoints for logs and traces
Log-trace correlation — trace viewer with span tree and linked logs
Retention policies — max age, max bytes, max segments
Embedded mode — use as a Go library without HTTP server
Built-in Web UI — Svelte-based log explorer and trace viewer

Quick Start

Binary

git clone https://github.com/hnlbs/amber.git
cd amber
make build
cp config.example.yaml config.yaml  # edit as needed
./amber config.yaml

Docker

docker build -t amber .
docker run -p 8080:8080 -p 4317:4317 \
  -v amber-data:/data \
  -v ./config.yaml:/data/config.yaml \
  amber

Embedded (Go library)

import "github.com/hnlbs/amber"

db, err := amber.Open("./data")
defer db.Close()

db.Log(ctx, amber.LogEntry{
    Level:   amber.LevelError,
    Service: "api-gateway",
    Body:    "connection refused",
})

result, err := db.QueryLogs(ctx, &amber.LogQuery{
    Services: []string{"api-gateway"},
    Limit:    100,
})

API

Ingest

# JSON
curl -X POST http://localhost:8080/api/v1/logs \
  -H "Authorization: Bearer <key>" \
  -d '[{"level":"ERROR","service":"api","body":"connection refused"}]'

# OTLP HTTP
curl -X POST http://localhost:8080/v1/logs \
  -H "Authorization: Bearer <key>" \
  -H "Content-Type: application/json" \
  -d @otlp_payload.json

Query

# Logs
curl "http://localhost:8080/api/v1/logs?service=api-gateway&level=ERROR&limit=50" \
  -H "Authorization: Bearer <key>"

# Trace
curl "http://localhost:8080/api/v1/traces/<trace_id>" \
  -H "Authorization: Bearer <key>"

# Services list
curl "http://localhost:8080/api/v1/services" \
  -H "Authorization: Bearer <key>"

Configuration

See config.example.yaml for all options. Key settings:

Setting	Default	Description
`storage.data_dir`	`./data`	Data directory
`storage.segment_max_records`	`1000000`	Records per segment before rotation
`ingest.batch_size`	`1000`	WAL batch size
`ingest.batch_timeout`	`100ms`	Max wait before flushing batch
`api.http_addr`	`:8080`	HTTP listen address
`api.grpc_addr`	`:4317`	gRPC listen address (OTLP)
`api.api_key`	(empty)	Bearer token (empty = auth disabled)
`retention.max_age`	`0s`	Max segment age (0 = disabled)

Benchmarks

100M records, VPS (8 vCPU, 16 GB RAM), obs-bench suite. All numbers are HTTP end-to-end (client-measured), p50 latency.

Query latency (p50, ms)

Query	Amber	Loki	ClickHouse	OpenSearch
R1 — point (service + level)	55	24	224	380
R2 — time range (1h window)	59	8.6	249	51
R3 — full-text search	57	28,941	197	25
R4 — rare token FTS	49	66,404	123	5.1
R5 — trace ID lookup	84	8.1	179	4.0

Amber server-side latency (excluding JSON serialization + network): R1 2.2 ms, R2 0.9 ms, R3 0.9 ms, R5 2.2 ms. The ~50 ms overhead is Go JSON encoding of 100 entries per response — same overhead applies to all systems but varies by serialization library.

Ingest throughput (W1)

System	rec/s
ClickHouse	336K
Loki	224K
OpenSearch	129K
Amber	118K

Storage efficiency (100M records, 30 GB raw)

System	Storage	Ratio	Idle RSS
Amber	5.9 GB	0.20	14.8 MiB
OpenSearch	20.8 GB	0.69	1,410 MiB
Loki	23.7 GB	0.79	96.8 MiB
ClickHouse	27.9 GB	0.93	462.6 MiB

Methodology and notes

Dataset: 100M synthetic log entries (10 services, 6 levels, realistic bodies with UUIDs), pre-generated NDJSON
Loadgen: 8 workers, 500 rec/batch, max throughput (no rate limit)
Queries: 20 qps, 4 workers, 60s per scenario, randomized parameters
VictoriaLogs excluded: bulk ingest via /insert/jsonline silently dropped records (storage = 8 KB after 100M ingest). Single-record inserts work; bulk persistence bug not investigated. Results would be misleading
ClickHouse FTS uses position(body, ?) instead of hasToken because hasToken treats _ as a token separator, rejecting UUIDs. This bypasses the tokenbf_v1 index — R3/R4 numbers reflect a full scan
Loki R3 had 11 errors (timeouts on 100M full-text scan)

License

Apache License 2.0

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
.github/workflows		.github/workflows
benchmarks		benchmarks
cmd		cmd
internal		internal
media		media
ui		ui
.gitignore		.gitignore
.golangci.yml		.golangci.yml
Dockerfile		Dockerfile
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
amber.go		amber.go
config.example.yaml		config.example.yaml
go.mod		go.mod
go.sum		go.sum

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Amber

Features

Quick Start

Binary

Docker

Embedded (Go library)

API

Ingest

Query

Configuration

Benchmarks

Query latency (p50, ms)

Ingest throughput (W1)

Storage efficiency (100M records, 30 GB raw)

License

About

Uh oh!

Releases 1

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Amber

Features

Quick Start

Binary

Docker

Embedded (Go library)

API

Ingest

Query

Configuration

Benchmarks

Query latency (p50, ms)

Ingest throughput (W1)

Storage efficiency (100M records, 30 GB raw)

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 1

Contributors

Uh oh!

Languages