Skip to content

api-evangelist/parasail-ai

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Parasail (parasail)

Parasail is an AI Supercloud — a pay-per-token GPU inference platform aimed at AI startups and developers. Parasail orchestrates rented GPU capacity across 40+ data centers in 15+ countries to serve open-weight LLMs, vision/multimodal models, embedding models, and TTS/STT models on a serverless, dedicated, or batch basis. The platform exposes OpenAI-compatible /v1 endpoints for chat completions, completions, embeddings, batch, and models, plus a control-plane /api/v1 for managing dedicated GPU deployments of any Hugging Face or custom model. Parasail serves 500B+ tokens per day and is positioned as up to 30x cheaper than legacy cloud providers, with no quotas, no rate-limit penalties, and no long-term contracts. Co-founded by Mike Henry (ex-Mythic) and Tim Harris (ex-Swift Navigation); raised a $32M Series A in April 2026 (Touring Capital and Kindred Ventures) bringing total funding to $42M.

APIs.json: https://raw.githubusercontent.com/api-evangelist/parasail-ai/refs/heads/main/apis.yml

Scope

  • Position: Consuming
  • Access: 3rd-Party

Tags

  • AI
  • Artificial Intelligence
  • GPU
  • Inference
  • Large Language Models
  • Open Source Models
  • Hugging Face
  • Batch
  • Embeddings
  • Tokenmaxxing
  • Supercloud

Timestamps

  • Created: 2026-05-25T00:00:00.000Z
  • Modified: 2026-05-25

APIs

Parasail Inference API

OpenAI-compatible real-time and streaming inference API exposing serverless access to popular open-weight LLMs, embedding models, and the model catalog. Endpoints: /v1/chat/completions, /v1/completions, /v1/embeddings, /v1/models. Bearer-token authentication; pay-per-token billing; supports streaming, tool use, and structured outputs. Compatible with the OpenAI Python and TypeScript clients by overriding base_url.

Tags

  • AI
  • Artificial Intelligence
  • Inference
  • Chat
  • Embeddings
  • Models

Properties

Parasail Batch API

OpenAI-compatible Batch API for asynchronous inference workloads at 50% off serverless pricing (with an additional 30% off cached tokens). Supports /v1/chat/completions and /v1/embeddings in the OpenAI Batch file format (JSONL) with a 24-hour completion window. Includes a Files surface for uploading and downloading input/output/error JSONL files. Ideal for offline enrichment, dataset processing, and large-scale tokenmaxxing.

Tags

  • AI
  • Artificial Intelligence
  • Batch
  • Files

Properties

Parasail Dedicated Deployments API

Control-plane API for managing Parasail Dedicated and Dedicated Serverless deployments. Provision reserved GPU capacity (H100, A100, H200, etc.) running any Hugging Face or custom model, then list, retrieve, update, pause, resume, and delete deployments. Read-only API keys can list and retrieve but cannot mutate. Endpoint: /api/v1/dedicated/deployments.

Tags

  • AI
  • Artificial Intelligence
  • GPU
  • Deployments
  • Dedicated

Properties

Common Properties

Maintainers

FN: Kin Lane Email: info@apievangelist.com URL: https://apievangelist.com

Releases

No releases published

Packages

 
 
 

Contributors