Blog
Articles & insights
Software development, artificial intelligence, open source and experience reports.

Simplify AI Training with 🤗 Accelerate: The Complete Guide
Discover 🤗 Accelerate, the Hugging Face library that simplifies training your AI models on any hardware (GPU, TPU) without modifying your PyTorch code.

Habana Gaudi2 vs Nvidia A100: AI Performance Comparison
Discover the performance of Habana Gaudi2 processors against Nvidia A100 GPUs. A technical analysis to optimize the training and inference of your AI models.
Jun 11, 2026
Hugging Face and Pollen Robotics: Towards AI-Driven Robotics
Hugging Face acquires Pollen Robotics to democratize open-source, AI-driven robotics. Discover how this union will transform Embodied AI and LLM-based robot control.
Jun 10, 2026
Convert your models to ONNX with Optimum: The complete guide
Learn how to convert your Transformers models to ONNX format with the 🤗 Optimum library. Optimize your models for production, reduce latency, and simplify your deployments.
Jun 10, 2026
Implementing MCP Servers in Python: Create a Shopping Assistant with Gradio
Discover how to implement an MCP server in Python to build a shopping assistant with Virtual Try-On (VTON) and Gradio, leveraging the power of the Model Context Protocol.
Jun 9, 2026
Simplicity at the Heart of High-Performance Neural Networks
Bigger is not always better. Back to basics: why simplicity and rigor in building neural networks are the true keys to AI performance.
May 29, 2026
The 20 Best AI Chatbots in 2026
Discover the 20 best AI chatbots in 2026 to boost your sales, automate marketing, and improve productivity. A comprehensive comparison by use case.
May 29, 2026
AI and Biology: The Arc Virtual Cell Challenge Explained
The Arc Virtual Cell Challenge uses foundation models to simulate cellular behavior. Discover how AI is redefining research in molecular biology.
May 29, 2026
Xet on Hugging Face: Optimize Your Dataset Versioning
The integration of Xet into the Hugging Face Hub revolutionizes massive dataset versioning. Discover how this solution optimizes storage, speed, and collaboration.
May 29, 2026
Gemini 3.5 Flash: Efficiency at the Heart of AI Agents
Google unveils Gemini 3.5 Flash, an ultra-high-performance and efficient model designed to accelerate the large-scale deployment of autonomous AI agents.
May 29, 2026
Reeboot Fleet: we built the console we were missing to run our Raspberry Pis
Reeboot Fleet, our Raspberry Pi fleet management SaaS, is now in public access. One-command install, Tailscale terminal, targeted deployments — from €1.40 / device / month.
May 27, 2026
Hugging Face TGI Now Supports vLLM and TensorRT-LLM
Hugging Face announces multi-backend support for TGI, now enabling the use of vLLM and TensorRT-LLM for LLM inference in production. Increased flexibility for performance.
May 26, 2026
How to become a prompt engineer?
Becoming a prompt engineer is an achievable goal for curious tech profiles. Discover the key skills, the current market, and how to train yourself to master generative AI.
May 26, 2026
Hugging Face Introduces DOIs for Datasets and Models
Hugging Face now offers DOI (Digital Object Identifier) assignment for datasets and models, facilitating their citation and traceability in scientific research.
May 26, 2026
Personal Copilot: How to train your own coding assistant
Discover how to train your own personalized coding assistant using compact open-source models for increased security and better business relevance.
May 25, 2026
GLM-5.1: 754B parameters — Z.ai's agentic engineering flagship
Z.ai's GLM-5.1 is a 754B MoE model built for agentic engineering. It leads on SWE-Bench Pro (58.4%), scores 95.3% on AIME 2026, and is MIT licensed.
Apr 8, 2026
Gemma 4 31B: Google's multimodal model with 256K context and thinking mode
Google's Gemma 4 31B is a dense 30.7B multimodal model supporting text, images, and video with a 256K context window, native thinking mode, function calling, and 140+ languages — released under Apache
Apr 3, 2026
Chroma Context-1: the 20B agentic search model that edits its own context
Chroma's Context-1 is a 20B MoE agentic search model trained for multi-hop retrieval. It decomposes queries, calls tools in parallel, prunes irrelevant documents mid-search, and runs at up to 10x the
Apr 3, 2026
Cohere Transcribe: a 2B ASR model that tops the English leaderboard
Cohere Labs' Transcribe 03-2026 is a 2B Conformer-based ASR model ranked #1 on the English ASR leaderboard with a 5.42 average WER, supporting 14 languages at 524x real-time speed — faster and more ac
Apr 3, 2026
GLM-5: 744B parameters, 40B active — Z.ai's open-source frontier model
Z.ai's GLM-5 is a 744B MoE model with 40B active parameters, trained on 28.5T tokens. It scores 92.7% on AIME 2026, 77.8% on SWE-bench Verified, and is the best open-source model on HMMT Nov 2025.
Mar 30, 2026Voxtral-4B: Mistral's open-weights TTS model that speaks 9 languages in real time
Mistral just released Voxtral-4B-TTS, an open-weights text-to-speech model with 20 preset voices, 9 languages, and 70 ms latency — built for real-time voice agents and production deployment.
Mar 30, 2026
Qianfan-OCR: Baidu's 4B model that beats Gemini on document parsing
Baidu's Qianfan-OCR is a 4B end-to-end document understanding model that ranks #1 on OmniDocBench v1.5 — beating Gemini 3 Pro and DeepSeek-OCR-v2 on tables, formulas, layout, and key information extra
Mar 30, 2026
Qwen3.5-27B Distilled by Claude 4.6 Opus: A Local Reasoning Powerhouse
Discover how Jackrong distilled Claude 4.6 Opus reasoning into Qwen3.5-27B — a 28B open-source model that thinks for 9+ minutes autonomously, runs on a single GPU, and rivals frontier AI for coding an
Mar 24, 2026
Nemotron Cascade 2: NVIDIA's 30B model that won the math and coding Olympics
NVIDIA's Nemotron Cascade 2 is a 30B MoE model with only 3B activated parameters — and it just won gold medals at the 2025 International Mathematical Olympiad and International Olympiad in Informatics
Mar 24, 2026
NVIDIA Nemotron-3 Super: a 120B MoE model that runs on a single GPU
A deep dive into NVIDIA Nemotron-3 Super, a 120B-parameter Mixture of Experts model with only 12B active parameters, 1 million token context, and configurable reasoning — deployable on a single B200 G
Mar 20, 2026
Mistral Small 4: One Unified Model to Rule Reasoning, Code, and Vision
A deep dive into Mistral Small 4, the new model from Mistral AI that merges advanced reasoning, code generation, and multimodal capabilities into a 119-billion parameter Mixture of Experts architectur
Mar 18, 2026
Protect Yourself from the Enedis Scam
How to identify and avoid fraudulent calls impersonating Enedis.
Feb 21, 2026
The Photo Revolution on iPhone
Google launches a Snapseed camera app for iPhone with professional tools.
Feb 20, 2026Today's Tech News
ROG Strix SCAR 18, VPN and health: what you need to know.
Feb 19, 2026
Protect Your AI Conversations
How malicious extensions steal your ChatGPT history.
Jan 8, 2026Windows Ecosystem Embraces Android
A seamless fusion between your smartphone and your computer.
Jan 8, 2026
AI Comes to Life with Project Ava
A virtual companion that reacts to your emotions and your gameplay.
Jan 8, 2026
Mistral's Devstral 2: The Return of Sovereign Code AI
Reviewing Mistral's latest on-device coding models and their place in the Mistral 2 ecosystem.
Dec 17, 2025
Agentic AI Smartphones: The Next Frontier for Enterprise
Analyzing the impact of the ByteDance prototype and the future of agentic AI in the corporate world.
Dec 8, 2025
Claude Opus 4.5: The Next Generation of AI
Discover the latest features and improvements in Claude Opus 4.5
Nov 25, 2025
Innovation Tax Credit
Reeboot obtains CII approval
Nov 20, 2024Technology and Ecology
Discover our eco-responsible approach to technology
Nov 19, 2024