Dera News

Open Source AI

Llama, Mistral, Qwen, DeepSeek, and the open-source models that are reshaping AI economics. Self-hosting, fine-tuning, and breaking free from vendor lock-in.

13 articles
1 stories
Last 30 days
(25 total matching)
Mistral article thumbnail
2 sources
Mistral logo
Mistral

Mistral Launches Remote Agents in Vibe

Latest coverage: Mistral AI Launches Remote Agents in Vibe and Mistral Medium 3.5 with 77.6% SWE-Bench Verified Score

Mistral AI has launched its new 'Mistral Medium 3.5' model, integrated into a remote coding agent called 'Vibe' and an AI chatbot 'Le Chat.' These tools now feature a 'Work mode' that allows for extended, autonomous coding and various business tasks using natural language, promising significant improvements in development and operational efficiency for SMBs.

📰AI Business · MarkTechPost
OpenAI article thumbnail
OpenAI logo
OpenAI

OpenAI launches Privacy Filter, an open source, on-device data sanitization model that removes personal information from enterprise datasets

In a significant shift toward local-first privacy infrastructure, OpenAI has released Privacy Filter, a specialized open-source model designed to detect and redact personally identifiable information (PII) before it ever reaches a cloud-based server. Launched today on AI code sharing community Hugging Face under a permissive Apache 2.0 license, the tool addresses a growing industry bottleneck: the risk of sensitive data "leaking" into training sets or being exposed during high-throughput inferen...

Hot
NVIDIA article thumbnail
NVIDIA and 1 others
NVIDIA

MISA: Mixture of Indexer Sparse Attention for Long-Context LLM Inference

Institution: Peking University | Authors: Ruijie Zhou, Fanxu Meng, Yufei Xu, Tongxuan Liu, Guangming Lu arXiv Links arXiv | PDF AI summary Abstract MISA replaces the dense token-wise indexing in sparse attention with a routed mixture-of-experts approach that reduces computational cost while maintaining performance and handling long contexts effectively. AI-generated summary 摘要 MISA 用路由专家混合方法取代了稀疏注意力中的密集标记索引,该方法降低了计算成本,同时保持了性能并有效处理长上下文。 AI 生成的摘要 Abstract DeepSeek Sparse Attention (DSA) sets the s...

Trending
AI News article thumbnail
AI News

Hugging Face Releases ml-intern: An Open-Source AI Agent that Automates the LLM Post-Training Workflow

Hugging Face just launched 'ml-intern,' an open-source AI agent built on their smolagents framework. This tool automates the entire post-training workflow for large language models, tackling everything from literature reviews and dataset discovery to training execution and iterative evaluation. It's a game-changer for SMBs looking to fine-tune LLMs, potentially slashing development costs and time.

Notable
Google article thumbnail
Google and 3 others
Google

Meet GitHub Spec-Kit: An Open Source Toolkit for Spec-Driven Development with AI Coding Agents

If you have spent time using AI coding agents — GitHub Copilot, Claude Code, Gemini CLI — you have probably run into this situation: you describe what you want, the agent generates a block of code that looks correct, compiles, and then subtly misses the actual intent. This “vibe-coding” approach can work for quick prototypes The post Meet GitHub Spec-Kit: An Open Source Toolkit for Spec-Driven Development with AI Coding Agents appeared first on MarkTechPost.

Notable
DeepSeek article thumbnail
DeepSeek logo
DeepSeek

Three reasons why DeepSeek’s new model V4 matters

DeepSeek, a Chinese AI company, has launched its new V4 model, promising high performance at a significantly lower cost than models from OpenAI or Anthropic. This release includes two versions, V4-Pro and V4-Flash, both designed to handle complex tasks and coding efficiently. The competitive pricing makes advanced AI capabilities a powerful and affordable option for small and medium-sized businesses.

Notable
NVIDIA article thumbnail
Anthropic and 1 others
NVIDIA

LightSeek Foundation Releases TokenSpeed, an Open-Source LLM Inference Engine Targeting TensorRT-LLM-Level Performance for Agentic Workloads

Inference efficiency has quietly become one of the most consequential bottlenecks in AI deployment. As agentic coding systems such as Claude Code, Codex, and Cursor scale from developer tools to infrastructure powering software development at large, the underlying inference engines serving those requests are under increasing strain. The LightSeek Foundation researchers have released TokenSpeed, an The post LightSeek Foundation Releases TokenSpeed, an Open-Source LLM Inference Engine Targeting T...

Notable
DeepSeek article thumbnail
OpenAI and 2 others
DeepSeek

China’s DeepSeek previews new AI model a year after jolting US rivals

Chinese AI company DeepSeek has released a preview of its V4 model, aiming to compete with major US AI systems. The company claims V4 offers significant improvements, especially in programming capabilities crucial for AI agents. This launch follows DeepSeek's previous model, R1, which garnered attention for its low training cost, and highlights China's advancements in AI despite ongoing scrutiny over hardware and data use.

AI News article thumbnail
AI News

MiniMax Just Open Sourced MiniMax M2.7: A Self-Evolving Agent Model that Scores 56.22% on SWE-Pro and 57.0% on Terminal Bench 2

MiniMax just open-sourced M2.7, an advanced AI model with self-evolution capabilities. This Mixture-of-Experts (MoE) model is designed to boost productivity in software engineering and office tasks, offering faster processing and lower operating costs. It performs well on complex benchmarks, suggesting potential for significant automation in SMB operations.

DeepSeek article thumbnail
OpenAI and 2 others
DeepSeek

DeepSeek could hit $45B valuation from its first investment round

Chinese AI lab DeepSeek is making waves by developing powerful large language models with significantly less computing power and cost than its U.S. counterparts. This efficiency has positioned the company for a potential $45 billion valuation in its initial funding round, highlighting a shift in the global AI landscape and the increasing competitiveness of non-U.S. innovators.

Meta article thumbnail
Meta logo
Meta

A Coding Implementation on Document Parsing Benchmarking with LlamaIndex ParseBench Using Python, Hugging Face, and Evaluation Metrics

A new guide has been released, showing how to leverage the ParseBench data set for evaluating document parsing systems. It provides specific instructions on loading the data set from Hugging Face and analyzing it with Python. This offers SMBs a structured way to benchmark their AI pipelines and improve information extraction from various document types.