Novita AI now offers Baichuan-M2-32B, the world’s leading open-source medical AI model. This powerful model from Baichuan AI is designed for real-world medical reasoning tasks.
Built on Qwen2.5-32B with an innovative Large Verifier System, it delivers breakthrough medical performance while keeping strong general capabilities.
For businesses and developers who want to add medical AI to their apps, Novita AI makes it simple with easy-to-use APIs. Transform your healthcare applications with AI that outperforms all open-source models and many proprietary ones.
Current pricing on Novita AI: $0.07 / M input tokens, $0.07 / M output tokens
What is Baichuan-M2-32B
Baichuan-M2-32B is Baichuan AI’s medical-enhanced reasoning model, the second medical model released by Baichuan. Designed for real-world medical reasoning tasks, this model builds upon Qwen2.5-32B with an innovative Large Verifier System.
Through domain-specific fine-tuning on real-world medical questions, it achieves breakthrough medical performance while maintaining strong general capabilities.
Model Features
Baichuan-M2 incorporates three core technical innovations:
First, through the Large Verifier System, it combines medical scenario characteristics to design a comprehensive medical verification framework, including patient simulators and multi-dimensional verification mechanisms.
Second, through medical domain adaptation enhancement via Mid-Training, it achieves lightweight and efficient medical domain adaptation while preserving general capabilities.
Finally, it employs a multi-stage reinforcement learning strategy, decomposing complex RL tasks into hierarchical training stages to progressively enhance the model’s medical knowledge, reasoning, and patient interaction capabilities.

Core Highlights
🏆 World’s Leading Open-Source Medical Model: Outperforms all open-source models and many proprietary models on HealthBench, achieving medical capabilities closest to GPT-5
đź§ Doctor-Thinking Alignment: Trained on real clinical cases and patient simulators, with clinical diagnostic thinking and robust patient interaction capabilities
⚡ Efficient Deployment: Supports 4-bit quantization for single-RTX4090 deployment, with 58.5% higher token throughput in MTP version for single-user scenarios
Performance Results


HealthBench Scores
Baichuan-M2-32B leads all open-source models in medical performance:
| Model Name | HealthBench | HealthBench-Hard | HealthBench-Consensus |
|---|---|---|---|
| Baichuan-M2 | 60.1 | 34.7 | 91.5 |
| gpt-oss-120b | 57.6 | 30 | 90 |
| Qwen3-235B-A22B-Thinking-2507 | 55.2 | 25.9 | 90.6 |
| Deepseek-R1-0528 | 53.6 | 22.6 | 91.5 |
| GLM-4.5 | 47.8 | 18.7 | 85.3 |
| Kimi-K2 | 43 | 10.7 | 90.9 |
| gpt-oss-20b | 42.5 | 10.8 | 82.6 |
General Performance
The model maintains strong general capabilities:
| Benchmark | Baichuan-M2-32B | Qwen3-32B (Thinking) |
|---|---|---|
| AIME24 | 83.4 | 81.4 |
| AIME25 | 72.9 | 72.9 |
| Arena-Hard-v2.0 | 45.8 | 44.5 |
| CFBench | 77.6 | 75.7 |
| WritingBench | 8.56 | 7.90 |
Note: AIME uses max_tokens=64k, others use 32k; temperature=0.6 for all tests.
Getting Started Guide
Accessing Baichuan-M2-32B through Novita AI offers multiple pathways tailored to different technical expertise levels and use cases. Whether you’re a business user exploring medical AI capabilities or a developer building production healthcare applications, Novita AI provides the tools you need.
Use the Playground (Available Now – No Coding Required)
- Instant Access: Sign up and start experimenting with Baichuan-M2-32B in seconds
- Interactive Interface: Test medical reasoning prompts and visualize thinking outputs in real-time
- Model Comparison: Compare Baichuan-M2-32B with other leading medical models for your specific use case
The playground enables you to input medical queries directly, test various prompts, and see immediate results without any technical setup. Perfect for prototyping, testing medical AI ideas, and understanding model capabilities before full implementation.
Integrate via API (Live and Ready – For Developers)

Connect Baichuan-M2-32B to your healthcare applications with Novita AI’s unified REST API.
Option 1: Direct API Integration (Python Example):
from openai import OpenAI
client = OpenAI(
base_url="https://api.novita.ai/openai",
api_key="session_uTxJgPYNxWdB7G5Cpzzy3-HeBd7Hl5S_wnQAZGCflKDb5ElvYxSNN_yGTMUGb0bYIKHg3fqnQ3mrSBOUw7OD1A==",
)
model = "baichuan/baichuan-m2-32b"
stream = True # or False
max_tokens = 65536
system_content = "Be a helpful assistant"
temperature = 1
top_p = 1
min_p = 0
top_k = 50
presence_penalty = 0
frequency_penalty = 0
repetition_penalty = 1
response_format = { "type": "text" }
chat_completion_res = client.chat.completions.create(
model=model,
messages=[
{
"role": "system",
"content": system_content,
},
{
"role": "user",
"content": "Hi there!",
}
],
stream=stream,
max_tokens=max_tokens,
temperature=temperature,
top_p=top_p,
presence_penalty=presence_penalty,
frequency_penalty=frequency_penalty,
response_format=response_format,
extra_body={
"top_k": top_k,
"repetition_penalty": repetition_penalty,
"min_p": min_p
}
)
if stream:
for chunk in chat_completion_res:
print(chunk.choices[0].delta.content or "", end="")
else:
print(chat_completion_res.choices[0].message.content)
Option 2: Multi-Agent Workflows with OpenAI Agents SDK
Build sophisticated multi-agent healthcare systems using Baichuan-M2-32B:
- Plug-and-Play Integration: Use Baichuan-M2-32B in any OpenAI Agents workflow
- Advanced Agent Capabilities: Support for handoffs, routing, and tool integration with superior medical reasoning performance
- Scalable Architecture: Design agents that leverage Baichuan-M2-32B’s unified medical knowledge, clinical reasoning, and patient interaction capabilities
Key Features:
- OpenAI-Compatible API for seamless integration
- Flexible parameter control for fine-tuning medical responses
- Streaming support for real-time medical consultations
- Thinking mode access to understand medical reasoning process
Connect with Third-Party Platforms
Development Tools: Seamlessly integrate with popular IDEs and development environments like Cursor, Trae, Qwen Code and Cline through OpenAI-compatible APIs.
Orchestration Frameworks: Connect with LangChain, Dify, CrewAI, Langflow, and other AI orchestration platforms using official connectors.
Hugging Face Integration: Novita AI serves as an official inference provider of Hugging Face, ensuring broad ecosystem compatibility.
Novita AI handles all infrastructure, scaling, and optimization, letting you focus on building great healthcare applications with Baichuan-M2-32B’s powerful medical reasoning capabilities.
Healthcare Use Cases
Clinical Decision Support Systems
Baichuan-M2-32B’s doctor-thinking alignment and training on real clinical cases make it ideal for building sophisticated clinical decision support tools. Healthcare professionals can leverage the model’s medical reasoning capabilities to:
- Diagnostic assistance: Analyze patient symptoms and medical history to suggest potential diagnoses
- Treatment recommendations: Provide evidence-based treatment options based on clinical guidelines
- Drug interaction checking: Identify potential medication conflicts and contraindications
- Clinical workflow optimization: Streamline documentation and decision-making processes
Medical Education and Training
The model’s comprehensive medical knowledge and thinking transparency features enable advanced educational applications:
- Interactive case studies: Generate realistic patient scenarios for medical students and residents
- Clinical reasoning training: Demonstrate diagnostic thinking processes through the model’s thinking mode
- Medical simulation platforms: Create virtual patient interactions for hands-on learning experiences
- Continuing education tools: Provide up-to-date medical information and best practices
Patient Interaction and Health Consultation
With its patient simulator training and empathetic communication capabilities, the model excels in patient-facing applications:
- Virtual health assistants: Provide preliminary health guidance and symptom assessment
- Patient education platforms: Explain medical conditions and treatments in accessible language
- Telemedicine support: Assist healthcare providers during remote consultations
- Health literacy improvement: Help patients understand their medical information and care plans
Medical Research and Documentation
Researchers and healthcare institutions can utilize the model’s comprehensive medical knowledge for:
- Literature review automation: Analyze and summarize medical research papers and studies
- Clinical trial support: Assist with protocol development and patient screening criteria
- Medical documentation: Generate accurate clinical notes and patient summaries
- Healthcare analytics: Process large volumes of medical data for insights and trends
Important Usage Notice
Medical Disclaimer: For research and reference only; cannot replace professional medical diagnosis or treatment
Intended Use Cases: Medical education, health consultation, clinical decision support
Safe Use: Recommended under guidance of medical professionals
Conclusion
Baichuan-M2-32B represents a major breakthrough in open-source medical AI, now easily accessible through Novita AI’s platform. With world-leading medical performance and strong general capabilities, it’s perfect for businesses and developers building healthcare applications.
The model’s efficient deployment options and Apache License 2.0 make it suitable for both research and commercial use. Whether you’re creating medical education tools, health consultation apps, or clinical support systems, Baichuan-M2-32B on Novita AI provides the medical intelligence you need.
Start building with Baichuan-M2-32B on Novita AI today and transform your healthcare applications with this powerful medical AI model.
Novita AIÂ is an AI cloud platform that offers developers an easy way to deploy AI models using our simple API, while also providing the affordable and reliable GPU cloud for building and scaling.
Discover more from Novita
Subscribe to get the latest posts sent to your email.





