Roadmap · Building in public

From gateway to open inference marketplace.

ForgeRouter is being built step by step: first the OpenAI-compatible gateway and ForgeSoftware-hosted models, then ForgeNode onboarding, founding providers, marketplace routing, provider tiers and advanced policies.

OpenAI-compatible API ForgeNode onboarding Community providers Provider tiers Transparent pricing
In progress
Planned
Later
Research
Not committed
In progress

Phase 1: Gateway foundation

Establishing core infrastructure for secure, low-latency API access.

  • check User accounts & API keys
  • check v1/chat/completions endpoint
  • check Prepaid wallet system
In progress

Phase 2: ForgeSoftware provider

First platform-operated deployment to validate routing metrics.

deployment_id: fs-qwen-fr
Model: Qwen3.6-27B
Hardware: 2x A5000
Region: France (EU)
Planned

Phase 3: ForgeNode alpha

Standardized CLI onboarding for secure node enrollment.

$ pipx install forgenode
$ forgenode enroll
$ forgenode detect
$ forgenode publish
Planned

Phase 4: Founding providers

Inviting trusted community members to host models on the network.

Planned

Phase 5: Marketplace beta

Public deployment cards and advanced routing filters.

{
  "model": "meta-llama/Llama-3-70b-chat-hf",
  "routing_strategy": "latency_optimized",
  "filters": {
    "provider_tier": "founding",
    "max_latency_ms": 150
  }
}
Later

Phase 6: Open access

Any provider can join, subject to automated network verification and SLA monitoring.

Research

Phase 7: Advanced policies

Smart routing based on cost, data privacy regions, or hardware specific requests.

{
  "routing_strategy": "cost_optimized",
  "filters": {
    "region": "EU_ONLY",
    "min_vram_gb": 40
  }
}
Not committed

Phase 8: Decentralized settlement

Exploring trustless payment channels for provider settlements.

What we are not promising yet.

block No fake SLAs
visibility_off No hidden trust downgrades
money_off No guaranteed provider revenue

What we want to validate first.

speed Gateway latency overhead under load
security Secure isolation for ForgeNode hosts
analytics Accuracy of empirical throughput metrics

How to participate

code

For Developers

  • checkAccess multiple providers through a single API.
  • checkControl routing based on empirical metrics.
  • checkUnified billing through a prepaid wallet.
Read API Docs
dns

For Providers

  • checkMonetize idle GPU capacity easily.
  • checkStandardized onboarding via CLI.
  • checkTransparent performance metrics.
Apply as Founder

Build log

Oct 2023

Gateway Alpha launched

Internal testing of core routing layer.

Sep 2023

Initial concept

Architecture design and technical validation.

FAQ

How is routing latency measured? expand_more

We maintain active probing connections from our gateway to provider nodes, measuring Time To First Token (TTFT) continuously to inform our routing tables.

Can I bring my own models? expand_more

During early phases, we restrict the catalog to verified popular open weights. Custom model hosting is planned for Phase 6.

Build the open inference marketplace with us.