feat: add production-readiness features (CORS, cache, health check, structured errors, request logging, graceful shutdown) by Sai-Prashanth123 · Pull Request #21 · karust/openserp

Sai-Prashanth123 · 2026-03-06T11:53:41Z

Summary

This PR adds 6 production-readiness features to make OpenSERP more robust, observable, and frontend-friendly.

Changes

1. CORS Middleware (`core/middleware.go`)

Configurable Access-Control-Allow-Origin/Methods/Headers headers
Handles preflight OPTIONS requests automatically
Enabled by default, configurable via --cors flag or cors: in config

2. Structured JSON Error Responses (`core/middleware.go`)

All API errors now return consistent JSON: {"error": "...", "code": 503, "message": "..."}
Replaces plain-text error responses for proper API consumption

3. Request Logging Middleware (`core/middleware.go`)

Logs every request with method, path, status code, latency, client IP
Uses WARN for 4xx, ERROR for 5xx status codes
Includes search query text for search endpoints

4. `/health` Endpoint (`core/server.go`)

Returns server status, uptime, engine initialization status, and system metrics (goroutines, memory, Go version)
Returns HTTP 503 if any engine is not initialized (degraded state)
Added HEALTHCHECK directive to Dockerfile for container orchestration

5. In-Memory Response Cache (`core/cache.go`)

Thread-safe cache with configurable TTL (default: 5 min) and max size (default: 1000)
Automatic background eviction of expired entries every 60s
X-Cache: HIT/MISS response header for transparency
GET /cache/stats endpoint for monitoring
Reduces search engine hits, lowering ban/captcha risk

6. Graceful Shutdown (`main.go`)

Handles SIGINT/SIGTERM signals for clean exit
Prevents orphaned browser processes on Ctrl+C or docker stop

New Config Options

Option	CLI Flag	Default	Description
`cache_ttl`	`--cache_ttl`	`300`	Cache TTL in seconds (0 = disabled)
`cache_max_size`	`--cache_max_size`	`1000`	Max cached responses
`cors`	`--cors`	`true`	Enable CORS headers

Files Changed (10 files, +634 lines)

File	Type
`core/middleware.go`	New — CORS, JSON errors, request logging
`core/cache.go`	New — TTL cache with eviction
`core/cache_test.go`	New — 6 cache unit tests
`core/middleware_test.go`	New — Middleware unit tests
`core/server.go`	Modified — Health endpoint, cache integration, middleware wiring
`cmd/root.go`	Modified — New config fields + CLI flags
`cmd/serve.go`	Modified — Pass ServerOptions with cache/CORS settings
`config.yaml`	Modified — New config keys with defaults
`Dockerfile`	Modified — Added HEALTHCHECK
`main.go`	Modified — Signal handling

Backward Compatibility

All changes are fully backward-compatible:

Cache is enabled by default but can be disabled with cache_ttl: 0
CORS is enabled by default but can be disabled with --cors=false
Existing API endpoints and response formats are unchanged
New endpoints (/health, /cache/stats) are additive only

…tructured errors, request logging, graceful shutdown) - Add CORS middleware with configurable origins/methods/headers - Add structured JSON error responses for all API errors - Add request logging middleware with latency, status, IP tracking - Add /health endpoint with engine status, uptime, and system metrics - Add in-memory response cache with TTL and automatic eviction - Add /cache/stats endpoint for cache monitoring - Add graceful shutdown signal handling (SIGINT/SIGTERM) - Add HEALTHCHECK directive to Dockerfile - Add cache_ttl, cache_max_size, cors config options and CLI flags - Add unit tests for cache and middleware

…ngine fallback) Add fault-tolerance and self-healing capabilities to handle the inherent disadvantages of web scraping (blocking, CAPTCHAs, engine downtime): - Exponential backoff retry: retries transient failures with increasing delays (1s->2s->4s...), skips retries on CAPTCHAs (IP-level issue) - Circuit breaker: auto-disables engines after consecutive failures, periodically tests recovery via half-open state - Proxy rotation: round-robin proxy pool with auto-disable after 3 consecutive failures, re-enables all when none are available - Engine fallback: when primary engine fails, automatically tries alternative engines transparently - Resilient megasearch: parallel search with circuit breaker protection - New /resilience/stats endpoint for monitoring breaker states - Health endpoint enhanced with circuit breaker degradation status - Configurable via CLI flags and config.yaml New files: core/retry.go, core/proxy.go, core/circuit_breaker.go, core/resilient.go, and comprehensive unit tests for all three.

karust

Thanks for the PR! I like the direction overall, and there are several useful additions here

Good ideas:

CORS support is useful for browser-based clients
caching identical requests can reduce repeated scraping pressure
healthcheck and stats endpoints are useful for Docker/ops visibility

One product-level question:

Should dedicated engine endpoints like /google/search or /bing/search fall back to a different engine at all? That may be surprising for users who explicitly want results from a specific engine. Fallback feels more appropriate for mega/* or for an explicitly enabled resilient mode.

A few suggestions:

move cache and resilience into separate sections in config.yaml instead of adding many flat app flags
make resilience optional
if resilience stays, let users configure fallback engines explicitly for more control
make cache optional as well, or disable it when cache_max_size=0

There are also a few issues I think should be fixed before merge:

fallback results are cached under the requested engine key, not the actual serving engine. Example: first /google/search fails over to Yandex and returns X-Fallback-Engine: yandex, X-Cache: MISS; second /google/search returns X-Cache: HIT, but now there is no indication the cached result actually came from Yandex
search requests still appear to be allowed during circuit-breaker open / half-open flow in ways that don’t match the intended behavior, and retry_in stats do not seem to reflect new failures correctly
rate limits via engine.GetRateLimiter() do not seem to be applied inside the resilient searcher
proxy rotation is configured, but the proxy pool does not seem to be used in actual resilient searches
/health returns HTTP 503 for a merely degraded state, which could cause unnecessary container restarts if only one engine is failing

I’m going to leave this open for now rather than merge it in the current form.
If you’d like to continue with it, I’d be happy to review another revision.

@Sai-Prashanth123

Based in part on work from PR #21 by @Sai-Prashanth123, adapted and integrated with project-specific fixes.

karust · 2026-04-04T20:18:45Z

Thanks again for PR and the amount of groundwork you put into it.

I went through the ideas and implementation in detail and ported/reworked the useful parts into the current codebase in a way that better matches OpenSERP’s direction and config model. The 0.6.0 release now includes the production-readiness improvements that were explored here, including health checks, cache/resilience work, proxy/runtime improvements, and related server wiring.

I’m closing this PR because the final implementation landed through a different integration/rework path rather than by merging this branch directly, but your work absolutely helped shape the release. Credit is also included in the 0.6.0 changelog.

Thanks for the contribution.

Sai-Prashanth123 added 3 commits March 6, 2026 17:18

fix: add missing fmt import in resilient.go

053c195

karust requested changes Mar 10, 2026

View reviewed changes

karust added a commit that referenced this pull request Mar 25, 2026

Add retry/circuit breaker, configurable fallback and CORS.

f219c78

Based in part on work from PR #21 by @Sai-Prashanth123, adapted and integrated with project-specific fixes.

karust closed this Apr 4, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: add production-readiness features (CORS, cache, health check, structured errors, request logging, graceful shutdown)#21

feat: add production-readiness features (CORS, cache, health check, structured errors, request logging, graceful shutdown)#21
Sai-Prashanth123 wants to merge 3 commits intokarust:mainfrom
Sai-Prashanth123:main

Sai-Prashanth123 commented Mar 6, 2026

Uh oh!

karust left a comment

Uh oh!

karust commented Apr 4, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

Sai-Prashanth123 commented Mar 6, 2026

Summary

Changes

1. CORS Middleware (core/middleware.go)

2. Structured JSON Error Responses (core/middleware.go)

3. Request Logging Middleware (core/middleware.go)

4. /health Endpoint (core/server.go)

5. In-Memory Response Cache (core/cache.go)

6. Graceful Shutdown (main.go)

New Config Options

Files Changed (10 files, +634 lines)

Backward Compatibility

Uh oh!

karust left a comment

Choose a reason for hiding this comment

Uh oh!

karust commented Apr 4, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

1. CORS Middleware (`core/middleware.go`)

2. Structured JSON Error Responses (`core/middleware.go`)

3. Request Logging Middleware (`core/middleware.go`)

4. `/health` Endpoint (`core/server.go`)

5. In-Memory Response Cache (`core/cache.go`)

6. Graceful Shutdown (`main.go`)