Open Source AI

Llama, Mistral, Qwen, DeepSeek, and the open-source models that are reshaping AI economics. Self-hosting, fine-tuning, and breaking free from vendor lock-in.

18 articles

4 stories

Last 30 days

2 sources

AI News

6/29/2026

Qwen Unveils New Robotic Models

Qwen-RobotManip Technical Report: Alignment Unlocks Scale for Robotic Manipulation Foundation Models

Institution: Qwen | Authors: Haoqi Yuan, Zhixuan Liang, Anzhe Chen, Ye Wang, Haoyang Li arXiv Links arXiv | PDF AI summary Abstract A Vision-Language-Action foundation model for robotic manipulation achieves generalization through unified alignment across representation, motion, and behavior dimensions, enabling large-scale training on diverse data sources. Generated by Qwen/Qwen2.5-Coder-32B-Instruct 摘要用于机器人操作的视觉-语言-动作基础模型通过跨表示、运动和行为维度的统一对齐实现泛化，从而实现对不同数据源的大规模训练。由 Qwen/Qwen2.5-Coder-32B-Instru...

📰Hugging Face Daily Papers

2 sources

Anthropic

6/26/2026

Anthropic Accuses Alibaba Of Claude Theft

Anthropic accuses Alibaba of stealing Claude AI model using 25,000 fake accounts in massive cyber attack - Swarajya

Anthropic has formally accused Alibaba of organizing an unprecedented campaign to extract the capabilities of its generative AI model, Claude. The US AI company claims Alibaba's AI lab, Qwen, used 25,000 illicit accounts and commercial proxy services to bypass geo-restrictions and interact with Claude nearly 28.8 million times over six weeks. This incident highlights growing concerns over AI intellectual property protection and international AI competition risks.

📰Anthropic News Coverage · Ars Technica

2 sources

LLM Agents Struggle With Planning

Plans Don't Persist: Why Context Management Is Load Bearing for LLM Agents

Institution: Snowflake | Authors: Aman Mehta, Anupam Datta arXiv Links arXiv | PDF AI summary Abstract Standard LLM agents rely on plan content remaining in context rather than maintaining it as persistent state, with evidence shown through replay pairing diagnostics and compression stress tests. Generated by Qwen/Qwen2.5-Coder-32B-Instruct 摘要标准 LLM 代理依赖于保留在上下文中的计划内容，而不是将其维持为持久状态，并通过重放配对诊断和压缩压力测试显示证据。由 Qwen/Qwen2.5-Coder-32B-Instruct 生成 Abstract Generated by Qwen/Qwen2.5-Coder-32B-Instruct Lon...

📰Hugging Face Daily Papers

2 sources

Mistral

6/24/2026

Mistral Launches OCR 4

Mistral launches OCR 4, turning document extraction into a full enterprise AI play

Mistral AI on Tuesday released OCR 4, a document intelligence model that moves beyond raw text extraction to return structured representations of entire documents — complete with bounding boxes, block-type classification, and per-word confidence scores. The release marks Mistral's fourth generation of optical character recognition technology in roughly 15 months and lands at a moment when the company's pitch for European AI sovereignty has never been more commercially relevant. The model support...

📰MarkTechPost · VentureBeat

Anthropic

6/11/2026

Xiaomi's new open source, agentic AI coding harness MiMo Code beats Claude Code at ultra-long, 200+ step tasks

Xiaomi's MiMo AI team has open-sourced MiMo Code V0.1.0, a terminal-native AI coding assistant that the Chinese electronics giant says outperforms Anthropic's Claude Code on key agentic coding benchmarks, especially on long-horizon, multi-step tasks (200+ steps) — at least, according to its own internal beta release and survey of 576 developers. It's also bundling limited-time free access to MiMo-V2.5, its multimodal flagship model with a million-token context window, requiring no registration t...

Hot

AI News

6/23/2026

GitHub joins coalition advocating for fixes to California AI Transparency Act to protect open source

We’re calling for targeted amendments to resolve conflicts with open source licensing and align with international transparency frameworks while preserving regulatory intent. The post GitHub joins coalition advocating for fixes to California AI Transparency Act to protect open source appeared first on The GitHub Blog.

Trending

OpenAI

6/8/2026

Researchers trained an open source AI search agent, Harness-1, that outperforms GPT-5.4 on recalling relevant information

A joint research collaboration between researchers at the University of Illinois at Urbana-Champaign (UIUC), UC Berkeley, and the open source AI-native vector database platform Chroma unveiled Harness-1, a 20-billion parameter open-source search agent built atop OpenAI's gpt-oss-20B open source model that fundamentally redesigns how AI executes complex retrieval tasks. Harness-1 achieves a massive leap in performance, scoring 73% average on its ability to recall relevant information correctly fr...

Trending

OpenAI

6/15/2026

Z.ai Launches GLM-5.2 With a Usable 1M-Token Context, Two Thinking-Effort Levels, and No Benchmarks at Launch

Z.ai launched GLM-5.2 on June 13, 2026, across every GLM Coding Plan tier. The headline is a usable 1-million-token context window plus High and Max effort levels. It drops into Claude Code, Cline, and OpenClaw through an Anthropic-compatible endpoint. No benchmarks shipped at launch, and MIT open weights are promised next week. The post Z.ai Launches GLM-5.2 With a Usable 1M-Token Context, Two Thinking-Effort Levels, and No Benchmarks at Launch appeared first on MarkTechPost.

Trending

Anthropic

6/29/2026

OpenClaw Releases iOS and Android Companion Node Apps That Connect a Phone to a Self-Hosted AI Agent Gateway

OpenClaw's iOS and Android apps are companion nodes, not standalone chatbots. Each phone pairs to a self-hosted Gateway over WebSocket. This adds device hardware — camera, location, voice, and Canvas — to a local-first AI agent. Here is the architecture, the capabilities, and the trade-offs for builders. The post OpenClaw Releases iOS and Android Companion Node Apps That Connect a Phone to a Self-Hosted AI Agent Gateway appeared first on MarkTechPost.

Trending

OpenAI

6/12/2026

Kimi K2.7-Code cuts thinking tokens 30% — but practitioners say the benchmarks don't check out

Moonshot AI released Kimi K2.7-Code this week, an open-source update to its K2 coding model family, claiming leaner reasoning and double-digit performance gains. K2.7-Code is built on the same trillion-parameter mixture-of-experts architecture as its predecessor K2.6, and drops in via an OpenAI-compatible API — which matters for teams already running K2.6 in production gateways. When K2.6 launched in April, it topped OpenRouter's weekly LLM leaderboard — a ranking based on actual API routing dec...

Trending

DeepSeek

6/29/2026

DeepSeek open sources DSpark, a new framework to speed up LLM inference by up to 85%

Even as the geopolitical conversation around AI continues to grow more fraught following the U.S. government's actions to limit the new models from Anthropic and OpenAI, Chinese open source darling DeepSeek is back with yet another open release that could once again change AI development around the globe. Over the weekend, the firm released DSpark, a new, MIT-Licensed system designed to make large language models answer faster without changing what the underlying model is trying to say. The easi...

Trending

Mistral

Mistral AI Releases Leanstral 1.5: An Apache-2.0 Lean 4 Code Agent Model Solving 587 of 672 PutnamBench Problems

Mistral AI released Leanstral 1.5, a free Apache-2.0 code agent model for Lean 4. It saturates miniF2F and solves 587 of 672 PutnamBench problems. The 119B mixture-of-experts activates 6.5B parameters per token. We break down its architecture, benchmarks, real bug-finding case studies, and deployment code. The post Mistral AI Releases Leanstral 1.5: An Apache-2.0 Lean 4 Code Agent Model Solving 587 of 672 PutnamBench Problems appeared first on MarkTechPost.

Notable

Anthropic

6/14/2026

Databricks Open-Sources Omnigent: A Meta-Harness That Composes, Governs, and Shares AI Agents Across Claude Code, Codex, and Pi

Databricks has open-sourced Omnigent, a meta-harness that sits above coding agents like Claude Code, Codex, and Pi. It adds composition, contextual policies, and live session sharing under one interface, on terminal, web, desktop, and mobile. The Apache 2.0 project is in alpha. The post Databricks Open-Sources Omnigent: A Meta-Harness That Composes, Governs, and Shares AI Agents Across Claude Code, Codex, and Pi appeared first on MarkTechPost.

Notable

Liquid AI Ships LFM2.5-230M with llama.cpp, MLX, vLLM, SGLang, and ONNX Support for On-Device Inference

Liquid AI released LFM2.5-230M, its smallest model yet. The 230M-parameter, open-weight model runs on-device at 213 tok/s on a Galaxy S25 Ultra and 42 on a Raspberry Pi 5. Built on the LFM2 architecture, it targets tool use and data extraction, beating larger models like Qwen3.5-0.8B and Gemma 3 1B on instruction following. The post Liquid AI Ships LFM2.5-230M with llama.cpp, MLX, vLLM, SGLang, and ONNX Support for On-Device Inference appeared first on MarkTechPost.

Notable