OpenAI’s open-weight models designed for powerful reasoning and agentic tasks
Pulls
100K+
Stars
37
Last Updated
3 months
Solid Llama 3 update, reliable for coding, chat, and Q&A tasks
Pulls
100K+
Stars
23
Last Updated
9 months
Tiny LLM built for speed, edge devices, and local development
Pulls
100K+
Stars
32
Last Updated
5 months
Distilled Llama by DeepSeek, fast and optimized for real-world tasks
Pulls
100K+
Stars
76
Last Updated
9 months
Efficient multimodal AI for text, image, audio, and video on low-resource devices.
Pulls
50K+
Stars
10
Last Updated
7 months
The most advanced Qwen model yet, with major gains in text, vision, video, and reasoning.
Pulls
50K+
Stars
6
Last Updated
3 months
Google’s latest Gemma, in its QAT (quantization-aware trained) variant
Pulls
50K+
Stars
20
Last Updated
4 months
Versatile Qwen update with better language skills and wider support
Pulls
50K+
Stars
8
Last Updated
9 months
Qwen3-Coder is Qwen’s new series of coding agent models.
Pulls
50K+
Stars
17
Last Updated
17 days
Newest Llama 3 release with improved reasoning and generation quality
Pulls
50K+
Stars
17
Last Updated
9 months
Efficient open model with top-tier performance and fast inference
Pulls
50K+
Stars
20
Last Updated
9 months
Ministral 3: compact vision-enabled model with near-24B performance, optimized for local edge use
Pulls
10K+
Stars
0
Last Updated
about 2 months
DeepCoder-14B-Preview is a code reasoning LLM fine-tuned to scale up to long context lengths
Pulls
10K+
Stars
13
Last Updated
10 months
SmolLM3 is a 3.1B model for efficient on-device use, with strong performance in chat
Pulls
10K+
Stars
7
Last Updated
7 months
Kimi K2 Thinking: open-source agent with deep reasoning, stable tool use, fast INT4, 256k context.
Pulls
10K+
Stars
0
Last Updated
about 2 months
Google’s latest Gemma, small yet strong for chat and generation
Pulls
10K+
Stars
1
Last Updated
3 months
Meta’s Llama 3.1: chat-focused, benchmark-strong, multilingual-ready.
Pulls
10K+
Stars
6
Last Updated
10 months
DeepSeek-V3.2 boosts efficiency and reasoning with DSA, scalable RL, and agentic data, with IMO/IOI wins.
Pulls
10K+
Stars
5
Last Updated
about 2 months
Granite Docling is a multimodal model for efficient document conversion.
Pulls
10K+
Stars
2
Last Updated
4 months
Qwen3 Embedding: multilingual models for advanced text/ranking tasks like retrieval & clustering.
Pulls
10K+
Stars
0
Last Updated
3 months
Ministral 3: compact vision-enabled model with near-24B performance, optimized for local edge use
Pulls
10K+
Stars
2
Last Updated
about 2 months
Safety reasoning models for policy-based text classification and foundational safety tasks.
Pulls
10K+
Stars
1
Last Updated
3 months
Embedding Gemma is a state-of-the-art text embedding model from Google DeepMind
Pulls
10K+
Stars
3
Last Updated
5 months
Qwen3 is the latest Qwen LLM, built for top-tier coding, math, reasoning, and language tasks.
Pulls
10K+
Stars
0
Last Updated
3 months
Kimi K2 Thinking: open-source agent with deep reasoning, stable tool use, fast INT4, 256k context.
Pulls
10K+
Stars
0
Last Updated
about 2 months
Qwen3 Embedding: multilingual models for advanced text/ranking tasks like retrieval & clustering.
Pulls
10K+
Stars
0
Last Updated
3 months
Multilingual reranking model for text retrieval, scoring document relevance across 119 languages.
Pulls
10K+
Stars
0
Last Updated
2 months