Two new models are now available in the Ollama library: Mistral Small 3.1 and DeepCoder-14B-Preview. Mistral Small 3.1 brings significant improvements in text performance and multimodal understanding, along with a context window expanded to 128,000 tokens. It outperforms Gemma 3 and GPT-4o Mini while sustaining an output speed of 150 tokens per second. The model is released under the Apache 2.0 license and runs on a single RTX 4090 or a Mac with 32 GB of RAM.
DeepCoder-14B-Preview stands out for its accuracy in code reasoning, achieving a Pass@1 rate of 60.6% on LiveCodeBench, an 8% improvement over previous versions. The model was trained with distributed reinforcement learning on top of DeepSeek-R1-Distill-Qwen-14B and uses just 14 billion parameters.
Users can install these models for use in AI chat applications; for testing, the Ollama platform also offers the option of renting a server by the hour.
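For a local setup, the models can be fetched and started with the Ollama CLI. A minimal sketch follows; the model tags shown are assumptions based on typical Ollama library naming, so check the library pages for the exact tags before pulling.

```shell
# Download the models from the Ollama library
# (tags are assumed; verify them on the model's library page)
ollama pull mistral-small3.1
ollama pull deepcoder

# Start an interactive chat session with a model
ollama run mistral-small3.1

# List locally installed models to confirm the downloads
ollama list
```

The same models can then be served to chat front-ends through Ollama's local API endpoint without any further configuration.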