Claude Code CLI Documentation: Setup, Slash Commands, and LLM API Integration
Complete Claude Code CLI documentation covering installation, CLI flags, slash commands, custom commands, and routing to Novita AI's LLM API.
Complete Claude Code CLI documentation covering installation, CLI flags, slash commands, custom commands, and routing to Novita AI's LLM API.
The best open source LLMs in 2026, how to choose between self-hosting and API inference, and how to run open-source coding agents without managing infrastructure.
Compare inference platforms for private generative AI endpoint deployment: dedicated capacity, network isolation, data residency, compliance posture, and Novita AI options.
A practical guide to adding MCP servers in Claude Code and Claude Desktop — covering claude mcp add, JSON config, tool routing, and sandbox execution.
Learn how to use the Vercel AI SDK to build AI-powered apps with streaming, tool calls, and agent loops. Includes Novita AI integration with code examples.
Call Kimi K2.7 Code on Novita AI using the OpenAI-compatible chat API. Includes model ID, pricing, context limits, vision input, function calling, and runnable examples.
Step-by-step guide to configure CoBuddy (baidu/cobuddy) in Claude Code using Novita AI's OpenAI-compatible endpoint. API setup, pricing, and coding workflow tips.
Configure DeepSeek V4 Flash in Claude Code via Novita AI. Set env vars, use the Anthropic-compatible endpoint, and cut costs vs Claude Sonnet.
Configure Kimi K2.7 Code in Claude Code via Novita AI's Anthropic endpoint. API key setup, model string, cost comparison, and coding workflow tips.
Compare developer services for operating many LLM APIs at team scale: SDK consistency, auth, billing consolidation, model lifecycle, governance, and observability.
How to operate a multi-provider LLM service that meets its uptime SLO: SLO design, provider health monitoring, alerting, incident playbooks, and fallback governance for production teams.
Choose the right serverless model inference platform by comparing cold starts, autoscaling, concurrency controls, GPU options, and when dedicated endpoints fit better.
See how to choose a full-service AI platform for open-model deployment, endpoint lifecycle, GPU backing, scaling, and ops handoff.
Access the GLM-4.6V API on Novita AI for vision tool calling, image understanding, and multimodal agents. OpenAI-compatible, $0.30/1M input tokens.