🧠 PropIntel

PropIntel is a production-style real estate lead intelligence platform that ingests raw lead datasets, enriches them from external sources, verifies contact quality, and ranks lead readiness for outreach.

Built as a portfolio project to demonstrate practical delivery across backend systems, data workflows, and operator-friendly product UI.

Why clients pick this (quick pitch)

You get a repeatable lead intelligence workflow: upload files, run jobs, monitor progress, and export ranked leads.
It is resilient for long-running processing: batching, partial result persistence, termination controls, and resume support.
It is operator-friendly: dashboard tabs for analytics, job history, exploration, and runtime settings profiles.

🎯 Features

Flexible ingestion: CSV, JSON, and PropFlux-style inputs.
Website enrichment: contact extraction, chatbot detection, freshness signals, website speed scoring.
Google Maps enrichment: business matching, phone/website/location augmentation.
Conflict resolution: source-aware candidate merging with enrichment history.
Contact verification: verified / likely / low quality model.
Lead scoring: configurable scoring engine with explainable lead_reason.
Batch processing: incremental writes to DB, lower memory pressure, partial visibility.
Resumable jobs: failed/terminated jobs can be resumed.
Concurrency + rate limiting: provider-aware limits for Serper and Google Maps.
Responsive dashboard: control panel, analytics, job history, data explorer, engine settings.

📊 Current State

PropIntel currently runs end-to-end as a complete enrichment pipeline + dashboard system with:

strict config schema validation
SQLite-backed job/result persistence
batch lifecycle tracking (pending/processing/completed/failed/terminated)
partial results during processing
stop + resume controls
provider-aware runtime controls for concurrency and request pacing

Ongoing work is focused on deployment and post-MVP extensions (integrations, additional enrichment sources, and operational hardening).

Dashboard Screenshots

📋 Requirements

Python 3.11+
Node.js 20+ (for dashboard build/dev)
API keys for optional external enrichment:
- SERPER_API_KEY
- GOOGLE_MAPS_API_KEY

🚀 Quick Start

1. Setup backend

python3 -m venv .venv
source .venv/bin/activate  # On Windows: .venv\Scripts\activate
pip install -r requirements.txt

2. Configure environment

Create .env from .env.example and fill keys if you want external enrichment:

cp .env.example .env

3. Start API

python runner.py api --host 127.0.0.1 --port 8000 --reload

Health check:

curl http://127.0.0.1:8000/health

4. Start dashboard

cd frontend/dashboard
npm install
npm run dev

Open:

Dashboard: http://127.0.0.1:5173
API: http://127.0.0.1:8000

🖥 CLI Usage

runner.py supports:

api — start FastAPI server
run — execute pipeline locally and export artifacts

Examples:

# Start API
python runner.py api --host 0.0.0.0 --port 8080 --reload --log-level debug

# Run pipeline with CSV input
python runner.py run --input data/leads.csv --input-format csv --config config/sources.yaml --output output --log-level info

# Run pipeline with PropFlux-like JSON
python runner.py run --input data/propflux_export.json --input-format propflux

Output (where results go)

CLI runs create a timestamped folder under output/:

output/<timestamp>/

Artifacts per run:

leads_<timestamp>.json
leads_<timestamp>.csv
rejected_rows_<timestamp>.json
run_summary_<timestamp>.json
logs in logs/propintel_<timestamp>.log

Client-Facing Walkthrough (how this delivers results)

PropIntel is built for repeatable operations, not one-off scripts.

How it works

A dataset is uploaded (POST /jobs) or run through CLI.
Input rows are mapped/validated, normalized, deduplicated.
Job is split into batches and persisted in SQLite (job_batches).
Enrichment runs with configurable concurrency and provider-aware rate limits.
Each completed batch writes leads immediately to DB (partial results available).
Verification, conflict resolution, and scoring produce final lead intelligence fields.
Dashboard/API surfaces telemetry, progress, outputs, and resume/termination controls.

Dashboard Documentation (what to click + what to expect)

The dashboard lives in frontend/dashboard/ and talks to the FastAPI backend.

Main Control Panel

Upload dataset (csv | json | propflux)
Start, terminate, and resume jobs
See live progress (started/completed batches, row progress)
See recent jobs and quick-select active job

Latest Listings

Filter by score, contact quality, chatbot signal, freshness signal
Export active job results (JSON/CSV once completed)
Shows partial results while job is processing

Analytics

Total jobs, completed jobs
Average lead score
Verified contact rate

Job History

Full job listing with statuses and run metadata
Mobile-friendly card layout for non-desktop widths

Data Explorer

Select a job and inspect rows in detail
Live reload + responsive card/table behavior

Engine Settings

Validate and save profile JSON
Activate/delete profiles
Active profile is used by job processing

API Endpoints (used by dashboard + integrations)

Jobs

POST /jobs — create job from uploaded file
GET /jobs — paginated jobs list (limit, offset, optional status)
GET /jobs/{job_id} — status + counts + batch progress
POST /jobs/{job_id}/terminate — stop running job
POST /jobs/{job_id}/resume — resume failed/terminated job
GET /jobs/{job_id}/batches — batch lifecycle rows

Results

GET /jobs/{job_id}/results — returns current rows (partial=true until completed)
GET /jobs/{job_id}/rejected — rejected rows for the job
GET /jobs/{job_id}/export?format=csv|json — export completed results

Settings Profiles

GET /settings
POST /settings/validate
PUT /settings
POST /settings/activate
DELETE /settings/{name}

Runtime Config (high-level)

Configured in config/sources.yaml or via Engine Settings profiles:

input mapping/validation rules
website enrichment controls
google_maps enrichment controls
scoring weights and score behavior
runtime batching + worker concurrency + provider rate limits

Minimal runtime defaults are included; advanced knobs are optional.

Testing

Run backend tests:

python -m unittest discover -s tests -p "test_*.py"

Build dashboard:

cd frontend/dashboard
npm run build

CI workflow is included at .github/workflows/ci.yml for backend tests + frontend build.

Run Locally with Docker

This runs backend and frontend as separate containers with the frontend calling the backend via VITE_API_BASE_URL.

1) Build images

docker build -f deploy/fly/backend.Dockerfile -t propintel-api:local .
docker build -f deploy/fly/frontend.Dockerfile \
  --build-arg VITE_API_BASE_URL=http://localhost:8000 \
  -t propintel-web:local .

2) Run backend (public on localhost:8000)

docker run -d \
  --name propintel-api \
  -p 8000:8000 \
  --env-file .env \
  -v "$(pwd)/data:/app/data" \
  propintel-api:local

3) Run frontend (public on localhost:8080)

docker run -d \
  --name propintel-web \
  -p 8080:8080 \
  propintel-web:local

Open: http://localhost:8080

4) Stop and clean up

docker rm -f propintel-web propintel-api

Notes:

VITE_API_BASE_URL is compile-time for Vite; rebuild frontend image when backend URL changes.
Keep .env local only; do not commit secrets.

Deployment (Fly.io)

Deployment is configured for two public Fly.io apps:

propintel-web (public): serves the React dashboard
propintel-api (public): serves FastAPI + SQLite

Frontend uses VITE_API_BASE_URL at build time to call the public backend API.

Deployment assets:

deploy/fly/backend.Dockerfile
deploy/fly/frontend.Dockerfile
deploy/fly/backend.fly.toml
deploy/fly/frontend.fly.toml

1) Backend app

# Create app (one-time)
fly apps create propintel-api

# Create persistent volume for SQLite/uploads (one-time)
fly volumes create propintel_data --size 1 --region jnb --app propintel-api

# Set backend secrets
fly secrets set SERPER_API_KEY=... GOOGLE_MAPS_API_KEY=... -a propintel-api

# Deploy backend
fly deploy -c deploy/fly/backend.fly.toml

2) Frontend app (public)

# Create app (one-time)
fly apps create propintel-web

# Optional: set explicit backend URL for this frontend app build
# fly secrets set VITE_API_BASE_URL=https://propintel-api.fly.dev -a propintel-web

# Deploy frontend
fly deploy -c deploy/fly/frontend.fly.toml

Important notes

deploy/fly/frontend.fly.toml currently builds with VITE_API_BASE_URL=https://propintel-api.fly.dev.
If backend app domain changes, update build.args.VITE_API_BASE_URL and redeploy frontend.
Restrict backend CORS to your frontend domain(s) in production settings.

Security Notes

Never commit real API keys or .env files.
Use .env.example as the template.
Rotate any development keys before public release.

Built with love for practical lead intelligence and clean, reliable data workflows.

Name		Name	Last commit message	Last commit date
Latest commit History 49 Commits
.github/workflows		.github/workflows
assets		assets
backend		backend
config		config
deploy/fly		deploy/fly
docs		docs
frontend/dashboard		frontend/dashboard
logs		logs
sample_data		sample_data
tests		tests
.env.example		.env.example
.gitignore		.gitignore
PROJECT_NOTE.md		PROJECT_NOTE.md
README.md		README.md
requirements.txt		requirements.txt
runner.py		runner.py

Folders and files

Latest commit

History

Repository files navigation

🧠 PropIntel

Why clients pick this (quick pitch)

🎯 Features

📊 Current State

Dashboard Screenshots

📋 Requirements

🚀 Quick Start

1. Setup backend

2. Configure environment

3. Start API

4. Start dashboard

🖥 CLI Usage

Output (where results go)

Client-Facing Walkthrough (how this delivers results)

How it works

Dashboard Documentation (what to click + what to expect)

Main Control Panel

Latest Listings

Analytics

Job History

Data Explorer

Engine Settings

API Endpoints (used by dashboard + integrations)

Jobs

Results

Settings Profiles

Runtime Config (high-level)

Testing

Run Locally with Docker

1) Build images

2) Run backend (public on localhost:8000)

3) Run frontend (public on localhost:8080)

4) Stop and clean up

Deployment (Fly.io)

1) Backend app

2) Frontend app (public)

Important notes

Security Notes

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages