Novita AI Blog

How to Use Novita AI with Goose: 200+ LLM Models

Configure Novita AI as a native provider in Goose. Access 200+ open-source models at $0.02/M tokens for agentic coding workflows.

By Novita AI / May 18, 2026 / 5 minutes of reading

Best Text-to-Speech APIs in 2026: 8 Providers Compared

Compare the best TTS APIs in 2026 — Fish Audio, ElevenLabs, OpenAI, Azure, Google, and more. Real pricing, code examples, and honest use-case picks.

By Novita AI / May 6, 2026 / 10 minutes of reading

DeepSeek-V4-Pro on Novita AI: 1M Context, #1 LiveCodeBench Score

DeepSeek-V4-Pro is a 1.6T-parameter open-source MoE model delivering 1 LiveCodeBench score (93.5) and 1M-token context. Available now via Novita AI.

By Novita AI / April 26, 2026 / 9 minutes of reading

DeepSeek-V4-Flash on Novita AI: Fast Reasoning at Lower Cost

DeepSeek-V4-Flash is now available via Novita AI. 284B MoE model, 1M token context, selectable reasoning modes. $0.14/M input. OpenAI-compatible API.

By Novita AI / April 26, 2026 / 9 minutes of reading

Ling-2.6-1T on Novita AI: Free API, SWE-Bench SOTA, 1T Param Model

Ling-2.6-1T is Ant Group's trillion-scale model built on MLA + Hybrid Linear Attention — not standard MoE. It achieves open-source SOTA on agent benchmarks (SWE-bench, BFCLv4, TAU2

By Novita AI / April 24, 2026 / 9 minutes of reading

Which Inference Provider Is Right for AI Agents

AI agents have different infrastructure needs than chatbots. Learn the 5 criteria — tool calling, context, burst traffic, cold start, concurrency

By Novita AI / April 22, 2026 / 7 minutes of reading

Ling-2.6-flash on Novita AI: 340 Tokens/s, ~7x Token Efficiency

Ling-2.6-flash is a 104B MoE model (7.4B active) delivering 340 tokens/s and 7x better token efficiency than Nemotron-3-Super on agent benchmarks. Available now via OpenRouter with

By Novita AI / April 21, 2026 / 7 minutes of reading

Top Inference API Providers for Open-Source Models in 2026

Compare top inference API providers for open-source models: pricing, model coverage, and output quality across Novita AI, Together AI, Fireworks, DeepInfra, and Groq.

By Novita AI / April 21, 2026 / 8 minutes of reading

Kimi K2.6 on Novita AI: API Pricing ($0.95/$4.00), SWE-Bench & Agentic Coding

Kimi K2.6 is now on Novita AI. 1T MoE open-source model, 256K context, 58.6% SWE-Bench Pro — built for long-horizon agentic coding. Try free via OpenAI-compatible API.

By Novita AI / April 21, 2026 / 10 minutes of reading

Qwen 3.5 Medium Series VRAM Requirements: 27B, 35B, 122B GPU Deployment Guide

Master Qwen 3.5 Medium deployment: VRAM needs, quantization options & GPU setup on Novita AI—start in minutes

By Novita AI / April 20, 2026 / 5 minutes of reading

Kling 3.0 Now on Novita AI: Cinematic AI Video Generation at Scale

Kling v3.0 is now live on Novita AI. Generate 3-15s AI videos with native audio, multi-shot composition, and transparent per-second pricing. Standard from $0.168/s, Pro from $0.224

By Novita AI / April 19, 2026 / 5 minutes of reading

Top 8 AI Inference Platforms in 2026

Discover the top 8 AI inference platforms in 2026. Compare features, pricing, and performance of leading providers like Novita AI, Together AI, and Groq.

By Novita AI / April 18, 2026 / 8 minutes of reading

5 Best Sora Alternatives for AI Video Generation

Sora is shutting down. Discover 5 production-ready AI video alternatives on Novita AI — one API, native audio, up to 1080P.

By Novita AI / April 17, 2026 / 8 minutes of reading

How to Access Kimi K2.5: Web, API, Claude Code, Self-Host

Discover how to access Kimi K2.5 through web playground, API, or local deployment with minimal setup time.

By Novita AI / April 16, 2026 / 8 minutes of reading