Active

ZeptoR8R

Intelligent request router

A high-performance request router with AI-aware load balancing. ZeptoR8R distributes traffic intelligently based on model capacity, queue depth, and latency requirements — ensuring optimal AI inference performance.

View on GitHub

Quickstart

terminal

# Install ZeptoR8R
$ npm install -g @aisar/r8r
$ 
# Start with config
$ r8r --config ./r8r.yaml

Features

AI-Aware Routing

Routes requests based on model type, token count, and inference complexity.

Load Balancing

Distributes traffic across inference workers using real-time metrics.

Queue Management

Smart queuing with priority levels and timeout handling.

Low Latency

Sub-millisecond routing overhead with efficient connection pooling.

Architecture

ClientsHTTP/gRPC API consumers

RouterRequest classification and routing logic

Load BalancerWorker selection and health tracking

WorkersAI inference endpoints

Related Projects

Active

ZeptoClaw

Self-healing agent runtime

A lightweight, secure runtime for autonomous agents. ZeptoClaw provides process isolation, resource limits, and automatic recovery — letting you run AI agents with confidence on edge devices, servers, or embedded systems.

Open project briefView details

Experimental

ZeptoPM

Agent package manager

A decentralized package manager for agent components and capabilities. ZeptoPM makes it easy to share, discover, and install reusable agent modules — from tool integrations to complete agent templates.

Open project briefView details

Experimental

ZeptoRT

Real-time agent bus

A high-throughput message bus for real-time agent communication. ZeptoRT enables low-latency event streaming, pub/sub patterns, and request-reply semantics for distributed agent systems.

Open project briefView details