The API gateway that lets you control exactly who sees your data. Route to any model. Set your privacy level per request.
Route to the world's best models
Northern Inference infrastructure runs in AWS ca-central-1 (Montreal). Data at rest stays in Canada. We provide transparent chain-of-custody documentation showing exactly which entities handle your data at each privacy tier, so your security team can make informed decisions.
Inference runs in ca-central-1 (Montreal) and Azure Canada East. Data at rest stays in Canadian data centres. PIPEDA jurisdiction applies.
Every API request includes a clear record of which entities processed your data. You choose your privacy tier per request, from customer-hosted hardware (only you see data) to cloud providers with Canadian data residency.
Select your privacy level per request. Customer-hosted hardware for sensitive workloads, Canadian cloud for general use, or provider-default routes when Canadian data residency is not required.
Opt-in automatic PII detection and substitution. Names, emails, phone numbers, and other identifiers are replaced with realistic fakes before leaving your infrastructure, then restored in responses. Full audit log available.
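The substitute-then-restore flow described above can be sketched in a few lines. This is an illustrative toy, not Northern Inference's actual implementation: it handles only email addresses via a simple regex, and the placeholder format is made up.

```python
import re

# Toy sketch of PII substitution: swap emails for realistic fakes before a
# prompt leaves your infrastructure, then restore them in the response.
# Illustrative only -- not Northern Inference's actual implementation.
EMAIL_RE = re.compile(r"[\w.+-]+@[\w-]+\.[\w.]+")

def substitute_pii(text):
    """Replace each email with a fake token; return redacted text and a mapping."""
    mapping = {}
    def repl(match):
        fake = f"user{len(mapping) + 1}@example.com"
        mapping[fake] = match.group(0)
        return fake
    return EMAIL_RE.sub(repl, text), mapping

def restore_pii(text, mapping):
    """Put the original identifiers back into a model response."""
    for fake, real in mapping.items():
        text = text.replace(fake, real)
    return text

redacted, mapping = substitute_pii("Contact alice@corp.ca about the invoice.")
# redacted == "Contact alice@corp.ca".replace(...) -> "Contact user1@example.com about the invoice."
restored = restore_pii(redacted, mapping)
# restored matches the original text
```

A production version would cover names, phone numbers, and other identifier classes, and write the mapping to the audit log rather than keeping it in memory.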
Inherits AWS Canada's Cloud Service Provider attestations (SOC 2 Type II, ISO 27001, PCI-DSS). ITSG-33 PBMM control mapping in progress. Full status and roadmap on our Trust page.
Transparent per-token billing with no hidden costs. Clear invoicing in CAD. Easy to forecast and budget for departmental spending.
Drop-in replacement for existing AI tools and prototypes. Government developers can adopt without rewriting existing integrations.
Priority access and dedicated onboarding for government and public sector organizations.
Northern Inference uses the industry-standard chat completions API format. Point any compatible SDK at our endpoint, add a privacy tier, and you're done. No rewrites needed.
Set your privacy tier on every API call. From zero-knowledge customer-hosted inference to provider-default global access, you control the trade-off.
Your hardware, your premises, open-source models. NI tunnels API requests into your machine; only you ever see prompts and completions. The strongest privacy guarantee we offer. Phase 5 of our roadmap.
AWS Nitro Enclaves with cryptographic isolation. Data encrypted with KMS before entering the enclave; AWS itself cannot read inside. Currently CPU-only (small models); GPU support pending Blackwell GA. Phase 5 of our roadmap.
AWS Bedrock and Azure OpenAI with Canadian data residency. Providers cannot train on your data. Infrastructure hosted by US-headquartered companies in Canadian data centres (ca-central-1, Canada East) under contractual data protection.
Broader model access through NI-routed upstream providers outside the Canadian-residency boundary. This can include direct APIs, Bedrock in US regions, Azure GlobalStandard, Vertex US regions, and other provider-default routes.
Tier 3: Canadian cloud. AWS Bedrock (ca-central-1) or Azure OpenAI (Canada East). Providers cannot train on your data.
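The per-request trade-off across these tiers can be modeled as a simple routing table. The tier names and route strings below are assumptions for illustration, not Northern Inference's actual identifiers.

```python
from enum import Enum

# Illustrative model of the per-request privacy tier choice. Names and
# routes are assumptions for this sketch, not NI's actual API values.
class Tier(Enum):
    CUSTOMER_HOSTED = 1   # your hardware; only you see prompts
    ENCLAVE = 2           # AWS Nitro Enclaves; CPU-only today
    CANADIAN_CLOUD = 3    # Bedrock ca-central-1 / Azure Canada East
    PROVIDER_DEFAULT = 4  # upstream providers, no residency guarantee

ROUTES = {
    Tier.CUSTOMER_HOSTED: "tunnel://on-prem",
    Tier.ENCLAVE: "aws://nitro-enclave/ca-central-1",
    Tier.CANADIAN_CLOUD: "aws://bedrock/ca-central-1",
    Tier.PROVIDER_DEFAULT: "provider://default",
}

def route(tier: Tier) -> str:
    """Pick the upstream route for a request's chosen privacy tier."""
    return ROUTES[tier]
```

The point of the table is that the caller, not the gateway, decides where each individual request is allowed to go.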
All models available through the NI API with transparent per-token pricing. Choose by provider, tier, or data residency.
One API for every model. Switch providers with a single parameter.
Uses the industry-standard chat completions format. Change one URL and your existing code works with 100+ models across providers.
Anthropic, OpenAI, Google, Meta, Mistral, Cohere. Access the best model for each task through one gateway.
See the provider cost and our fee separately on every request. Thinking tokens billed at the rate shown per model. No expiring credits. No surprise overages.
Set your privacy tier on every API call. Route sensitive prompts through your own hardware, casual queries through cloud providers.
Names, emails, and identifiers are replaced with realistic fakes before reaching the model, then restored in responses. Opt-in per API key or per request.
Live dashboard with per-model, per-team cost breakdowns. Budget alerts and hard limits built in.
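A "hard limit" implies the gateway refuses requests once a cap is hit, rather than merely alerting. Here is a minimal sketch of that behavior; the class and its interface are illustrative, not part of NI's API.

```python
# Minimal sketch of a hard budget limit: track spend and reject any
# request that would exceed the cap. Illustrative only.
class BudgetGuard:
    def __init__(self, hard_limit_cad):
        self.hard_limit = hard_limit_cad
        self.spent = 0.0

    def try_charge(self, cost_cad):
        """Record a request's cost, or refuse it if the cap would be exceeded."""
        if self.spent + cost_cad > self.hard_limit:
            return False  # request blocked: over hard limit
        self.spent += cost_cad
        return True

guard = BudgetGuard(hard_limit_cad=100.0)
assert guard.try_charge(60.0)      # allowed: 60 of 100 spent
assert not guard.try_charge(50.0)  # blocked: would reach 110
```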
Infrastructure in Canada (ca-central-1). PIPEDA jurisdiction. Transparent data custody for organizations that need it.
Northern Inference is developer infrastructure, not a consumer chatbot.
| Feature | Northern Inference | Venice.ai | OpenRouter | Direct APIs |
|---|---|---|---|---|
| Per-request privacy control | ✓ | | | |
| Canadian data residency (per request, Tier 3) | ✓ | | | |
| Transparent pricing | ✓ | | | |
| Standard API format | ✓ | | | |
| Per-request custody trail | ✓ | | | |
| PII substitution | ✓ | | | |
| No crypto required | ✓ | | | |
See the provider's cost and our routing fee separately on every request. No monthly minimums, no expiring credits, and thinking tokens are priced openly per model.
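The separated invoice line can be expressed as simple arithmetic: provider token cost plus a routing fee, reported independently. The rates and fee percentage below are made-up numbers for illustration, not Northern Inference pricing.

```python
# Sketch of a per-request cost breakdown: provider token cost and the
# gateway's routing fee shown separately. All rates here are invented
# example numbers, not Northern Inference's actual pricing.
def cost_breakdown(input_tokens, output_tokens,
                   in_rate_per_mtok, out_rate_per_mtok, fee_pct):
    provider = (input_tokens * in_rate_per_mtok
                + output_tokens * out_rate_per_mtok) / 1_000_000
    fee = provider * fee_pct / 100
    return {"provider_cad": round(provider, 6),
            "fee_cad": round(fee, 6),
            "total_cad": round(provider + fee, 6)}

# e.g. 10k input + 2k output tokens at $3 / $15 per million tokens, 5% fee:
# 0.03 + 0.03 = 0.06 provider, 0.003 fee, 0.063 CAD total
breakdown = cost_breakdown(10_000, 2_000, 3.0, 15.0, 5.0)
```

With both numbers on every response, forecasting a month's spend is multiplication, not archaeology.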
Early access members get priority onboarding + founding member pricing
Be among the first to use Northern Inference.
Each referral moves you closer to the front of the line.