Tag: Updates - Novita

Macaron V1 Tall on Novita AI: Lightweight MoL Model for Chat, Agent, Coding, and GenUI

Macaron V1 Tall is available on Novita AI with 262K context, function calling, reasoning, and time-limited free pricing.

By Novita AI / July 29, 2026 / 5 minutes of reading

Ling-3.0-flash on Novita AI: Free API for Agentic Inference

Use Ling-3.0-flash free on Novita AI: a 124B MoE model with 5.1B active parameters, 262K context, reasoning, and function calling.

By Novita AI / July 27, 2026 / 5 minutes of reading

Macaron V1 Venti on Novita AI: 748B Mixture-of-LoRA Flagship for Coding, Agents, and GenUI

Macaron V1 Venti is a 748B Mixture-of-LoRA flagship for coding, agents, and GenUI. See its specs, Novita AI availability, pricing, and best-fit use cases.

By Novita AI / July 27, 2026 / 5 minutes of reading

Hy3 on Novita AI: Launch, Pricing, and Developer Fit

Hy3 is available free on Novita AI via serverless API. 295B MoE, 21B active parameters, 256K context, three reasoning modes, and $0 per token.

By Novita AI / July 6, 2026 / 5 minutes of reading

How to Use CoBuddy in Claude Code via Novita AI

Step-by-step guide to configure CoBuddy (baidu/cobuddy) in Claude Code using Novita AI's OpenAI-compatible endpoint. API setup, pricing, and coding workflow tips.

By Novita AI / June 29, 2026 / 5 minutes of reading

How to Use DeepSeek V4 Flash API in Claude Code via Novita AI

Use DeepSeek in Claude Code via the V4 Flash API on Novita AI. Set four env vars, get 1M-token context, and cut costs 20x vs Claude Sonnet.

By Novita AI / June 29, 2026 / 6 minutes of reading

GLM 5.2 on Novita AI: Long-Context Launch, Pricing, and Developer Fit

GLM 5.2 is available on Novita AI with 1M context, 128K max output, function calling, structured outputs, and serverless API access.

By Novita AI / June 22, 2026 / 5 minutes of reading

Kimi K2.7 Code on Novita AI: Agentic Coding API

Kimi K2.7 Code is live on Novita AI with OpenAI-compatible chat API access, 256K context, tool calling, and multimodal inputs.

By Novita AI / June 13, 2026 / 5 minutes of reading

Nemotron 3 Nano 30B A3B on Novita AI: Launch, Pricing, and Quick Start

Nemotron 3 Nano 30B A3B is available on Novita AI as a Serverless LLM with OpenAI-compatible chat completions, 256K context, and pay-as-you-go token pricing.

By Novita AI / June 11, 2026 / 5 minutes of reading