Community Blog & Articles

NEW You can also read the Chinese version of this blog

Community Articles

How we OCR'ed 30,000 papers using Codex, open OCR models and Jobs

about 21 hours ago

KV Caching Explained: Optimizing Transformer Inference Efficiency

BidirLM: Turning Generative LLMs into the Best Open-Source Omnimodal Encoders

about 14 hours ago

Uncensor any LLM with abliteration

How I contributed a new model to the Transformers library using Codex

YC-Bench: Can Your AI Agent Run a Startup Without Going Bankrupt?

Run Gemma 4 on Intel® Arc™ GPUs Out-Of-the-Box

Darwin V6: Diagnostic-Guided Evolutionary Model Merging

about 8 hours ago

Mastering Tensor Dimensions in Transformers

Nemotron 3 Nano 4B: A Compact Hybrid Model for Efficient Local AI

"The Child That Surpassed Both Parents Through MRI-Guided Evolutionary Merge"

ArmBench-LLM 1.0: Benchmarking LLMs on Armenian Language Tasks

Speculative Decoding in Practice: How EAGLE3 Makes LLMs Faster Without Changing Their Outputs

Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

Small Language Models (SLM): A Comprehensive Overview

From GRPO to DAPO and GSPO: What, Why, and How

How I Trained Action Chunking Transformer (ACT) on SO-101: My Journey, Gotchas, and Lessons

The First Healthcare Robotics Dataset and Foundational Physical AI Models for Healthcare Robotics

SynthVision: Building a 110K Synthetic Medical VQA Dataset with Cross-Model Validation

multimodal, on-device, gemma4

Welcome Gemma 4: Frontier multimodal intelligence on device

Holo3: Breaking the Computer Use Frontier

Falcon Perception

gradio, server, open-source

Any Custom Frontend with Gradio's Backend

Granite 4.0 3B Vision: Compact Multimodal Intelligence for Enterprise Documents

Training mRNA Language Models Across 25 Species for $165

trl, reinforcement-learning, announcement

TRL v1.0: Post-Training Library Built to Move with the Field

guide, agents, inference-providers

Liberate your OpenClaw

A New Framework for Evaluating Voice Agents (EVA)

Build a Domain-Specific Embedding Model in Under a Day

State of Open Source on Hugging Face: Spring 2026

Holotron-12B - High Throughput Computer Use Agent

hub, storage, announcement

Introducing Storage Buckets on the Hugging Face Hub

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries

Community Articles

NEW Articles from Team or Enterprise organizations will get promoted to the main section.
