Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
16 changes: 8 additions & 8 deletions docs/cloud/models.mdx
Original file line number Diff line number Diff line change
Expand Up @@ -12,7 +12,7 @@ import QwenIcon from '@site/static/img/icons/models/Qwen_logo.svg';
import ZaiIcon from '@site/static/img/icons/models/zai-logo.svg';
import CodeBlock from '@theme/CodeBlock';

NEAR AI Cloud provides access to leading AI models, each optimized for different use cases from advanced reasoning and tool calling to long-context processing and multilingual tasks. All models run in secure TEE environments with transparent, pay-per-use pricing.
NEAR AI Cloud provides access to leading AI models, each optimized for different use cases ranging from advanced reasoning and tool calling to long-context processing and multilingual tasks. All models run in secure TEE environments with transparent, pay-per-use pricing.

## Quick Reference

Expand All @@ -27,6 +27,13 @@ NEAR AI Cloud provides access to leading AI models, each optimized for different
</tr>
</thead>
<tbody>
<tr>
<td><CodeBlock language="text">zai-org/GLM-4.6-FP8</CodeBlock></td>
<td>200K</td>
<td>$1.20/M</td>
<td>$3.60/M</td>
<td>Agentic applications, advanced coding, tool use, refined writing</td>
</tr>
<tr>
<td><CodeBlock language="text">deepseek-ai/DeepSeek-V3.1</CodeBlock></td>
<td>128K</td>
Expand All @@ -48,13 +55,6 @@ NEAR AI Cloud provides access to leading AI models, each optimized for different
<td>$0.45/M</td>
<td>Ultra-long context (262K), reasoning, instruction following, multilingual</td>
</tr>
<tr>
<td><CodeBlock language="text">zai-org/GLM-4.6-FP8</CodeBlock></td>
<td>200K</td>
<td>$0.75/M</td>
<td>$2.00/M</td>
<td>Agentic applications, advanced coding, tool use, refined writing</td>
</tr>
</tbody>
</table>

Expand Down
51 changes: 16 additions & 35 deletions docs/cloud/private-inference.mdx
Original file line number Diff line number Diff line change
Expand Up @@ -13,17 +13,17 @@ import { FeatureCard, FeatureCardGrid } from '@site/src/components/FeatureCard';

# Private Inference

When you use traditional AI services, your data passes through systems controlled by cloud providers and AI companies. Your prompts, the AI's responses, and even the processing of your requests are all visible to these third parties. This creates serious privacy concerns for sensitive applications.
When you use traditional AI services, your data passes through systems controlled by cloud providers and AI companies. Your prompts, the AI's responses, and even the processing of your requests are all visible to these third parties. This creates serious security concerns for sensitive applications.

**Private inference solves this problem.** It ensures that AI computations happen in a completely isolated environment where no one—not the cloud provider, not the model provider, not even NEARcan access your data. At the same time, you can independently verify that your requests were actually processed in this secure environment through cryptographic attestation.
**Private inference solves this problem.** It ensures that AI computations happen in a completely isolated environment where no one—not the cloud provider, not the model provider, not even NEAR AI can access your data. At the same time, you can independently verify that your requests were actually processed in this secure environment through cryptographic attestation.

This guide explains how NEAR AI Cloud implements private inference using Trusted Execution Environments (TEEs), the architecture that protects your data, and the security guarantees you can rely on.

---

## What is Private Inference?

Private inference is a method of running AI models where both your input data and the model's outputs remain completely hidden from everyone except you—even while the computation happens on remote servers you don't control.
Private inference is a method of running AI models where both your input data and the model's outputs remain completely hidden from everyone except the user and client even while the computation happens on remote servers you don't control.

Traditional cloud AI services require you to trust that providers won't access your data. Private inference eliminates this need for trust by using hardware-based security that makes it technically impossible for anyone to see your data, even with physical access to the servers.

Expand Down Expand Up @@ -56,7 +56,7 @@ NEAR AI Cloud's private inference provides three core guarantees:
</div>
<div className="feature-highlight-content">
<h3>Production Performance</h3>
<p>Hardware-accelerated TEEs with NVIDIA H200 GPUs deliver high-throughput inference with minimal latency overhead, making private inference practical for real-world applications.</p>
<p>Hardware-accelerated TEEs with <a href="https://www.nvidia.com/en-us/data-center/solutions/confidential-computing/" target="_blank">NVIDIA Confidential Computing</a> deliver high-throughput inference with minimal latency overhead, making private inference practical for real-world applications.</p>
</div>
</div>
</div>
Expand All @@ -69,13 +69,13 @@ NEAR AI Cloud's private inference provides three core guarantees:

NEAR AI Cloud combines Intel TDX and NVIDIA TEE technologies to create isolated, secure environments for AI computation:

- **Intel TDX (Trust Domain Extensions)**
- **[Intel TDX(Trust Domain Extensions)](https://www.intel.com/content/www/us/en/developer/tools/trust-domain-extensions/overview.html)** :
Creates confidential virtual machines (CVMs) that isolate your AI workloads from the host system, preventing unauthorized access to data in memory.

- **NVIDIA TEE**
- **[NVIDIA TEE](https://www.nvidia.com/en-us/data-center/solutions/confidential-computing/)** :
Provides GPU-level isolation for model inference, ensuring model weights and computations remain completely private during processing.

- **Cryptographic Attestation**
- **[Cryptographic Attestation](/cloud/verification)** :
Each TEE environment generates cryptographic proofs of its integrity and configuration, enabling independent verification of the secure execution environment.

### The Inference Process
Expand Down Expand Up @@ -104,26 +104,7 @@ When you make a request to NEAR AI Cloud, your data flows through a secure pipel

## Architecture Overview

NEAR AI Cloud operates through a distributed architecture consisting of an LLM Gateway and a network of Private LLM Nodes:

```
┌─────────────────┐ ┌─────────────────┐ ┌─────────────────┐
│ Private LLM │ │ Private LLM │ │ Private LLM │
│ (8x H200) │ │ (8x H200) │ │ (8x H200) │
├─────────────────┤ ├─────────────────┤ ├─────────────────┤
│ Intel TDX CVM │ │ Intel TDX CVM │ │ Intel TDX CVM │
│ NVIDIA TEE │ │ NVIDIA TEE │ │ NVIDIA TEE │
│ Private-ML-SDK │ │ Private-ML-SDK │ │ Private-ML-SDK │
└─────────────────┘ └─────────────────┘ └─────────────────┘
│ │ │
└───────────────────────┼───────────────────────┘
┌─────────────────┐
│ LLM Gateway │
│ Key Creation │
│ Usage Tracking │
└─────────────────┘
```
NEAR AI Cloud operates through a distributed architecture consisting of an LLM Gateway and a network of Private LLM Nodes.

### Private LLM Nodes

Expand Down Expand Up @@ -151,32 +132,32 @@ The LLM Gateway serves as the central orchestration layer:

NEAR AI Cloud's private inference implements multiple layers of security to protect your data:

- **Hardware-Level Isolation**
- **Hardware-Level Isolation** :
TEEs create isolated execution environments enforced at the hardware level, preventing unauthorized access to memory and computation even from privileged system administrators or cloud providers.

- **Secure Communication**
- **Secure Communication** :
All communication between your applications and the LLM infrastructure uses end-to-end encryption, protecting data in transit from network-level attacks.

- **Cryptographic Attestation**
- **Cryptographic Attestation** :
Every TEE environment generates cryptographic proofs that verify the integrity of the execution environment, allowing you to independently confirm your computations occurred in a genuine, unmodified TEE.

- **Result Authentication**
- **Result Authentication** :
All AI outputs are cryptographically signed inside the TEE before leaving the secure environment, ensuring the authenticity and integrity of responses.

### Threat Protection

NEAR AI Cloud's architecture protects against common attack vectors:

- **Malicious Infrastructure Providers**
- **Malicious Infrastructure Providers** :
Hardware-enforced TEE isolation prevents cloud infrastructure providers from accessing your prompts, model weights, or inference results, even with physical access to servers.

- **Network-Based Attacks**
- **Network-Based Attacks** :
End-to-end encryption protects your data during transmission, preventing man-in-the-middle attacks and network eavesdropping.

- **Model Extraction Attempts**
- **Model Extraction Attempts** :
Model weights remain encrypted and isolated within the TEE, making extraction computationally infeasible even for attackers with privileged system access.

**Result Tampering**
**Result Tampering** :
Cryptographic signatures generated inside the TEE ensure that responses cannot be modified in transit without detection, maintaining the integrity of AI outputs.

---
Expand Down
4 changes: 2 additions & 2 deletions docs/cloud/quickstart.mdx
Original file line number Diff line number Diff line change
Expand Up @@ -20,8 +20,8 @@ import { FeatureCard, FeatureCardGrid } from '@site/src/components/FeatureCard';
## Setup

1) **Create your account** - Sign up at [cloud.near.ai](https://cloud.near.ai/)
2) **Add Credits** - Goto the "Credits" section and purchase credits based on your needs
3) **Generate API Key** - Goto the "API Keys" section and generate a new key
2) **Add Credits** - Go to the "Credits" section and purchase credits based on your needs
3) **Generate API Key** - Go to the "API Keys" section and generate a new key

:::tip Keep Your API Key Safe
Never share your API key publicly or commit it to version control. If compromised, you can regenerate it anytime from your dashboard.
Expand Down
2 changes: 1 addition & 1 deletion docs/cloud/verification/index.mdx
Original file line number Diff line number Diff line change
Expand Up @@ -10,7 +10,7 @@ import ModelsIcon from '@site/static/img/icons/models-icon.svg';
import ChatIcon from '@site/static/img/icons/chat.svg';
import { FeatureCard, FeatureCardGrid } from '@site/src/components/FeatureCard';

[NEAR AI Cloud](https://cloud.near.ai) operates in Trusted Execution Environments (TEEs) which use cryptographic proofs to verify that your private AI conversations actually happened in secure, isolated environments - not on compromised systems or with unauthorized access.
[NEAR AI Cloud](https://cloud.near.ai) operates in Trusted Execution Environments (TEEs) which use cryptographic proofs to verify that your private AI conversations actually happened in secure, isolated environments that are not on compromised systems or with unauthorized access.

This section will show you step-by-step processes for checking these proofs, validating digital signatures, and confirming that your AI interactions haven't been tampered with.

Expand Down
13 changes: 13 additions & 0 deletions docs/index.mdx
Original file line number Diff line number Diff line change
@@ -0,0 +1,13 @@
---
id: index
title: NEAR AI Cloud - Private, Verifiable AI Platform
sidebar_label: Home
slug: /
description: Deploy and scale powerful language models on TEE-enabled GPUs with on-chain guarantees. Fully private, verifiable, and user-owned AI platform.
hide_title: true
displayed_sidebar: docsSidebar
---

import HomePageContent from '@site/src/components/HomePageContent';

<HomePageContent />
29 changes: 10 additions & 19 deletions src/pages/index.js → src/components/HomePageContent/index.js
Original file line number Diff line number Diff line change
@@ -1,18 +1,12 @@
import clsx from "clsx";
import Link from "@docusaurus/Link";
import useDocusaurusContext from "@docusaurus/useDocusaurusContext";
import Layout from "@theme/Layout";
import HomepageFeatures from "@site/src/components/HomepageFeatures";

import Heading from "@theme/Heading";
import styles from "./index.module.css";
import VerifiedGif from "@site/static/img/verified.gif";
import styles from "./styles.module.css";
import PrivateInferenceIcon from "@site/static/img/icons/private-inference.svg";
import VerificationIcon from "@site/static/img/icons/verification.svg";
import QuickstartIcon from "@site/static/img/icons/quickstart.svg";

function HomepageHeader() {
const { siteConfig } = useDocusaurusContext();
return (
<header className={clsx("hero hero--primary", styles.heroBanner)}>
<div className="container">
Expand All @@ -30,36 +24,33 @@ function HomepageHeader() {
<div className={styles.buttons}>
<Link
className="button button--secondary button--lg"
to="/cloud/quickstart"
to="/cloud/introduction"
>
Start Building
Get Started
</Link>
</div>
</div>
</header>
);
}

export default function Home() {
const { siteConfig } = useDocusaurusContext();
export default function HomePageContent() {
return (
<Layout
title="NEAR AI Cloud - Private, Verifiable AI Platform"
description="Deploy and scale powerful language models on TEE-enabled GPUs with on-chain guarantees. Fully private, verifiable, and user-owned AI platform."
>
<>
<HomepageHeader />
<main>
<div className="container margin-vert--lg">
<div className="row">
<div className="col col--8 col--offset-2">
<div className="col">
<div className="markdown">
<div className={styles.lead}>
<p className={styles.leadCopy}>
NEAR AI runs the models you trust in secure Trusted
Execution Environments (TEEs), ensuring your data stays
completely private. Model providers, cloud providers, and
NEAR cannot access, view, or use your data for training —
your information remains yours alone.
NEAR AI cannot access, view, or use your data for training.
<br />
<center><strong>Your information remains yours.</strong></center>
</p>
</div>
<div className={clsx("row", styles.featureRow)}>
Expand Down Expand Up @@ -134,6 +125,6 @@ export default function Home() {
</div>
</div>
</main>
</Layout>
</>
);
}
Loading