Introduction

Paygentic is infrastructure for consumption-based monetization. Built for software where costs scale with usage - LLMs, data processing, compute resources - not user seats.

Why Paygentic Exists

Software economics changed. Fixed monthly subscriptions break when your costs vary per request. When running GPT-4 costs 0.03 USD per thousand tokens, charging $20/month regardless of usage creates unsustainable unit economics. The technical challenges multiply: Metering at Scale Tracking millions of micro-transactions per second without impacting application performance. Payment Certainty Guaranteeing funds exist before processing expensive operations. No post-facto billing surprises. Pricing Flexibility Supporting per-token, per-query, per-gigabyte, and hybrid models simultaneously across different customer segments. Instant Settlement Moving money in real-time as consumption occurs, not weeks later through invoicing cycles.

Architecture Philosophy

Paygentic collapses the artificial boundary between billing systems and payment processors. Usage tracking and fund movement happen atomically. This design eliminates entire categories of problems:

No reconciliation between usage logs and payment records
No delayed payment collection after service delivery
No manual invoice generation or payment chasing
No separate integration points for billing vs payments

Core Primitives

Products & Metrics Define services and measurable units. An LLM service tracks input tokens, output tokens, and fine-tuning hours as distinct metrics. Plans & Prices Package pricing strategies. Standard rates, dynamic pricing, percentage fees, or custom models per metric. Accounts Hold funds with atomic unit precision. Enable instant transfers between parties as usage occurs. Entitlements Reserve funds before consumption. Lock payment for a specific amount of usage, guaranteeing collection. Usage Events Report consumption in real-time. Each event triggers immediate pricing calculation and fund transfer.

Implementation Patterns

AI Services

Input/output token differentiation
Model-specific pricing tiers
Burst capacity reservations
Fine-tuning compute billing

Data Platforms

Storage per GB-month
Query compute seconds
Egress bandwidth charges
Reserved vs on-demand pricing

Developer Tools

Build minutes
Deployment slots
Bandwidth consumption
Concurrent execution limits

Technical Integration

Single API surface. Define your monetization model programmatically:

Create product and metrics
Configure pricing plans
Onboard customers
Report usage events
Funds move automatically

No webhook gymnastics. No batch processing delays. No payment gateway complexity.

Design Principles

Real-time by default. Every operation happens instantly. No batch processing unless explicitly requested. Precision matters. Atomic unit accounting enables sub-cent transactions at scale. Idempotent everything. Network failures don’t create duplicate charges or lost events. Geographic distribution. Edge processing for sub-10ms authorization globally.

Getting Started

Begin with Organizations to understand resource ownership. Then explore Pricing to model your monetization strategy. For hands-on integration, follow the Quickstart guide.

Getting Started

Authentication

Platform

Legal

Why Paygentic Exists

Architecture Philosophy

Core Primitives

Implementation Patterns

Technical Integration

Design Principles

Getting Started

Getting Started

Authentication

Platform

Legal

​Why Paygentic Exists

​Architecture Philosophy

​Core Primitives

​Implementation Patterns

​Technical Integration

​Design Principles

​Getting Started

Why Paygentic Exists

Architecture Philosophy

Core Primitives

Implementation Patterns

Technical Integration

Design Principles

Getting Started