Community Blog & Articles

Community Articles

Party is over: regularizing ColBERT models to fix efficient ANN methods

Intel XPU Kernel Skill: LLM-driven Triton kernel optimization for the Hugging Face Kernel Hub

Introducing North Mini Code: Cohere’s First Model For Developers

How to Fine-Tune Nemotron 3.5 ASR for Your Language, Domain, or Accent

QLORA SFT Distillation Effects on Qwen3.6 27B Agentic Coding Harness Fluency

Introducing Serge: GitHub-Native AI Code Review

No Photoshop, No Blender: Multimedia by Agent

V-Zero

about 22 hours ago

Code a simple RAG from scratch

KV Caching Explained: Optimizing Transformer Inference Efficiency

How We Built OpenMythos: A Cybersecurity LLM Trained from Scratch

The Office Meets Silicon Valley

build-small-hackathon

Enterprise AI benchmarks: head-to-head comparison of Falconer, Notion, Atlassian Rovo, Claude Code, and Codex

From GRPO to DAPO and GSPO: What, Why, and How

A Guide to Reinforcement Learning Post-Training for LLMs: PPO, DPO, GRPO, and Beyond

karina-zadorozhny

Welcome NVIDIA Cosmos 3: The First Open Omni-model for Physical AI Reasoning and Action

PitchFight AI: Practice the Pitch Before the Real Room

Closet Twin: Your AI-Powered Personal Stylist Built for the Build Small Hackathon

build-small-hackathon

🧬 Carbon-VEPor: Efficient Variant Effect Prediction with Carbon

build-small-hackathon

Continuous batching for GRPO, now in TRL

NEW Articles from Team or Enterprise organizations will get promoted to the main section.
