Chat · Reasoning · Tool calling
Language

Qwen3.6 35B-A3B

qwen3.6-35b-a3b-gguf-iq4xs

The default LLM — reasoning, tools, and vision behind an OpenAI-compatible endpoint.

No GPU. No setup. Cancel anytime.

About

Qwen3.6 35B-A3B is the default model of Sogni Intelligence: a mixture-of-experts LLM with reasoning, native tool calling, and vision, served through an OpenAI-compatible endpoint — most SDKs work by swapping the base URL.

It can also drive every creative model in this catalog: enable sogni_tools and the model gains hosted image, video, and music generation as tool calls billed to the same account.
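Because the endpoint is described as OpenAI-compatible, ordinary tool calling presumably follows the OpenAI `tools` / `tool_calls` request and response shape. The sketch below is written under that assumption. `get_weather` is a hypothetical tool invented for illustration, and the snippet does not cover `sogni_tools`, whose request syntax is not described on this page.

```javascript
// Sketch only: assumes the endpoint accepts the OpenAI-style `tools` field
// and returns OpenAI-style `tool_calls`. `get_weather` is a hypothetical tool.
const tools = [
  {
    type: 'function',
    function: {
      name: 'get_weather',
      description: 'Look up the current weather for a city.',
      parameters: {
        type: 'object',
        properties: { city: { type: 'string' } },
        required: ['city'],
      },
    },
  },
];

// Build the JSON body for POST /v1/chat/completions.
function buildToolRequest(userText) {
  return {
    model: 'qwen3.6-35b-a3b-gguf-iq4xs',
    messages: [{ role: 'user', content: userText }],
    tools,
  };
}

// Turn an assistant message into a list of { id, name, args } objects.
// In the OpenAI format, `function.arguments` arrives as a JSON string.
function extractToolCalls(message) {
  return (message.tool_calls ?? []).map((call) => ({
    id: call.id,
    name: call.function.name,
    args: JSON.parse(call.function.arguments),
  }));
}
```

Send the object from `buildToolRequest` as the request body, then pass `choices[0].message` from the response to `extractToolCalls` to see which tools the model asked for.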

Pricing

On the Sogni API you pay per request in Spark (1 Spark = $0.005), priced by input and output tokens. Prefer a flat rate? The Sogni app runs Qwen3.6 35B-A3B and 100+ other models credit-free under fair use.

Configuration                        Spark    USD
1M input tokens                      60       $0.30
1M output tokens                     180      $0.90
10K in + 2K out (typical request)    0.96     $0.0048
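The "typical request" row can be reproduced from the two per-token rates. The sketch below assumes cost scales linearly with token count, with no minimum charge or rounding; the page does not state either way. The function name `estimateCost` is made up for this example.

```javascript
// Rates taken from the pricing table on this page.
const SPARK_PER_MILLION_INPUT = 60;
const SPARK_PER_MILLION_OUTPUT = 180;
const USD_PER_SPARK = 0.005;

// Estimate the cost of one request from its token counts.
// Assumes simple linear pricing (no minimums, no rounding).
function estimateCost(inputTokens, outputTokens) {
  const spark =
    (inputTokens / 1e6) * SPARK_PER_MILLION_INPUT +
    (outputTokens / 1e6) * SPARK_PER_MILLION_OUTPUT;
  return { spark, usd: spark * USD_PER_SPARK };
}

// 10K input tokens + 2K output tokens, as in the table.
const typical = estimateCost(10000, 2000);
console.log(`${typical.spark.toFixed(2)} Spark, $${typical.usd.toFixed(4)}`);
// prints "0.96 Spark, $0.0048"
```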

API

One Sogni API key reaches every model on the Supernet — call Qwen3.6 35B-A3B by id.

const res = await fetch('https://api.sogni.ai/v1/chat/completions', {
  method: 'POST',
  headers: {
    'Content-Type': 'application/json',
    Authorization: `Bearer ${process.env.SOGNI_API_KEY}`,
  },
  body: JSON.stringify({
    model: 'qwen3.6-35b-a3b-gguf-iq4xs',
    messages: [{ role: 'user', content: 'Write a haiku about decentralized GPUs.' }],
  }),
});
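// Fail loudly on HTTP errors (bad key, unknown model id) before parsing.
if (!res.ok) throw new Error(`Sogni API request failed: ${res.status}`);
const { choices } = await res.json();
console.log(choices[0].message.content);

curl https://api.sogni.ai/v1/chat/completions \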
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $SOGNI_API_KEY" \
  -d '{
    "model": "qwen3.6-35b-a3b-gguf-iq4xs",
    "messages": [{ "role": "user", "content": "Write a haiku about decentralized GPUs." }]
  }'

OpenAI-compatible — point any OpenAI client at https://api.sogni.ai/v1. Full reference at docs.sogni.ai.
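As a sketch of the base-URL swap, the snippet below points the official `openai` npm package (assumed to be installed separately) at the Sogni endpoint. The helper name `sogniClientOptions` is invented for this example; `apiKey` and `baseURL` are the SDK's standard constructor options.

```javascript
// Options for the official `openai` npm package, pointed at Sogni.
// `sogniClientOptions` is a helper name invented for this sketch.
function sogniClientOptions(apiKey) {
  return { apiKey, baseURL: 'https://api.sogni.ai/v1' };
}

async function main() {
  // Loaded lazily so the helper above works without the SDK installed.
  const { default: OpenAI } = await import('openai');
  const client = new OpenAI(sogniClientOptions(process.env.SOGNI_API_KEY));

  const completion = await client.chat.completions.create({
    model: 'qwen3.6-35b-a3b-gguf-iq4xs',
    messages: [{ role: 'user', content: 'Write a haiku about decentralized GPUs.' }],
  });
  console.log(completion.choices[0].message.content);
}

// Only touch the network when a key is configured.
if (process.env.SOGNI_API_KEY) {
  main().catch((err) => console.error(err));
}
```

Everything apart from the base URL and the model id is the same code you would write against any other OpenAI-compatible provider.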

Why run it on Sogni

Unlimited, not metered

Most platforms ration credits or rent out cloud GPUs by the hour. Sogni offers a flat monthly price for credit-free, fair-use generation on a creator-owned GPU network, or per-request Spark pricing if you'd rather pay as you go via the API.

No credits, ever

One flat price in the app. Generate under fair use without a per-image meter.

100+ models

Image, video, music, and language models in one workspace and one API key.

Per-request API

Prefer pay-as-you-go? Call Qwen3.6 35B-A3B by id and pay in Spark.

Powered by people

Runs on a decentralized GPU network where workers share subscription revenue.

FAQ

Qwen3.6 35B-A3B on Sogni

What is Qwen3.6 35B-A3B?

Qwen3.6 35B-A3B is the default model of Sogni Intelligence: a mixture-of-experts LLM with reasoning, native tool calling, and vision, served through an OpenAI-compatible endpoint — most SDKs work by swapping the base URL.

How much does Qwen3.6 35B-A3B cost on Sogni?

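On the Sogni API (1 Spark = $0.005), it costs 60 Spark ($0.30) per 1M input tokens and 180 Spark ($0.90) per 1M output tokens, so a typical request of 10K input and 2K output tokens comes to about 0.96 Spark ($0.0048). Alternatively, use it without a meter under fair use on a flat monthly Sogni plan.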

How do I run Qwen3.6 35B-A3B?

Use it in Sogni Chat, or call the OpenAI-compatible API with model id qwen3.6-35b-a3b-gguf-iq4xs.

Do I need a GPU or ComfyUI?

No. Qwen3.6 35B-A3B runs on the Sogni Supernet — a decentralized network of creator GPUs — with no local install or graphics card required.

Start with Qwen3.6 35B-A3B today

Create in the app, or build with the API. Your call.