LLM Archives

How to Build a Long-Context Code Review Workflow with Novita AI API

Learn how to use MiniMax M3 through the Novita AI API to review large code changes and turn model output into pull request comments developers can trust.

By Novita AI / June 5, 2026 / 12 minutes of reading

Best LLM API Providers in 2026: Novita AI vs Open Model Inference Platforms

A practical 2026 comparison of Novita AI, Together AI, Fireworks AI, DeepInfra, Baseten, and Friendli AI for model APIs, GPU scaling, agent infrastructure, and inference deployment.

By Novita AI / June 4, 2026 / 11 minutes of reading

DeepSeek V4 Pro Long-Context Reasoning: Developer Guide

A developer guide for DeepSeek V4 Pro on Novita AI, covering long-context reasoning, model variables, pricing, limits, and API examples.

By Novita AI / June 4, 2026 / 8 minutes of reading

MiniMax M3 API Quick Start with Novita AI

A developer quick start for calling MiniMax M3 through Novita AI, including the model ID, endpoint, tiered pricing, context limits, and starter code.

By Novita AI / June 4, 2026 / 7 minutes of reading

MiniMax M3 on Novita AI: 1M-Context Coding and Agentic AI for Developers

Use MiniMax M3 on Novita AI for coding, agentic workflows, 1M-token context, and multimodal input with OpenAI-compatible APIs.

By Novita AI / June 1, 2026 / 8 minutes of reading

Qwen3.6-27B on Novita AI: 262K Context for Agentic Coding

Use Qwen3.6-27B on Novita AI via OpenAI-compatible API. See model ID, pricing, 262K context, coding use cases, and gotchas.

By Novita AI / May 28, 2026 / 9 minutes of reading

Qwen3.7-Max on Novita AI: Agentic Coding for Long-Context Workflows

Qwen3.7-Max is available on Novita AI for agentic coding and long-context workflows. Review API access, pricing, limits, and use cases.

By Novita AI / May 22, 2026 / 6 minutes of reading

PegaFlow External KV Cache for vLLM

PegaFlow external KV cache helps vLLM serving teams preserve and share KV cache across restarts, instances, and RDMA nodes.

By Novita AI / May 20, 2026 / 6 minutes of reading

How to Use Novita AI with Goose: 200+ LLM Models

Configure Novita AI as a native provider in Goose. Access 200+ open-source models at $0.02/M tokens for agentic coding workflows.

By Novita AI / May 18, 2026 / 5 minutes of reading

DeepSeek-V4-Pro on Novita AI: 1M Context, #1 LiveCodeBench Score

DeepSeek-V4-Pro is a 1.6T-parameter open-source MoE model with a #1 LiveCodeBench score (93.5) and a 1M-token context window. Available now via Novita AI.

By Novita AI / April 26, 2026 / 8 minutes of reading

DeepSeek-V4-Flash on Novita AI: Fast Reasoning at Lower Cost

DeepSeek-V4-Flash is now available via Novita AI. 284B MoE model, 1M token context, selectable reasoning modes. $0.14/M input. OpenAI-compatible API.

By Novita AI / April 26, 2026 / 8 minutes of reading

Ling-2.6-1T on Novita AI: Free API, SWE-Bench SOTA, 1T Param Model

Ling-2.6-1T is Ant Group’s trillion-scale model built on MLA + Hybrid Linear Attention rather than standard MoE. It achieves open-source SOTA on agent benchmarks (SWE-bench, BFCLv4, TAU2-Bench) with minimal token overhead and is now backed exclusively by Novita AI.

By Novita AI / April 24, 2026 / 8 minutes of reading