MrezaPRZ (Mohammadreza Pourreza)

upvoted 2 articles 4 months ago

Article

Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers

+5

Sep 11, 2025

•

176

Article

From Zero to GPU: A Guide to Building and Scaling Production-Ready CUDA Kernels

Aug 18, 2025

•

88

upvoted an article 5 months ago

Article

Accelerate ND-Parallel: A guide to Efficient Multi-GPU Training

+3

Aug 8, 2025

•

89

upvoted a paper 8 months ago

100 Days After DeepSeek-R1: A Survey on Replication Studies and More Directions for Reasoning Language Models

Paper • 2505.00551 • Published May 1, 2025 • 36

upvoted a paper 9 months ago

Reasoning-SQL: Reinforcement Learning with SQL Tailored Partial Rewards for Reasoning-Enhanced Text-to-SQL

Paper • 2503.23157 • Published Mar 29, 2025 • 10

upvoted an article 10 months ago

Article

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

+2

Mar 12, 2025

•

480

upvoted 2 articles about 1 year ago

Article

Hugging Face Welcomes the Qwen2.5-Coder Series

Nov 12, 2024

•

7

Article

Faster Assisted Generation with Dynamic Speculation

+5

Oct 8, 2024

•

49

upvoted a paper over 1 year ago

CHESS: Contextual Harnessing for Efficient SQL Synthesis

Paper • 2405.16755 • Published May 27, 2024 • 2

upvoted 5 articles over 1 year ago

Article

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

+4

Sep 18, 2024

•

272

Article

Improving Hugging Face Training Efficiency Through Packing with Flash Attention 2

+4

Aug 21, 2024

•

41

Article

Training and Finetuning Embedding Models with Sentence Transformers v3

May 28, 2024

•

262

Article

StarCoder2-Instruct: Fully Transparent and Permissive Self-Alignment for Code Generation

+7

Apr 29, 2024

•

79

Article

Text2SQL using Hugging Face Dataset Viewer API and Motherduck DuckDB-NSQL-7B

+2

Apr 4, 2024

•

29

Mohammadreza Pourreza

AI & ML interests

Organizations

Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers

From Zero to GPU: A Guide to Building and Scaling Production-Ready CUDA Kernels

Accelerate ND-Parallel: A guide to Efficient Multi-GPU Training

100 Days After DeepSeek-R1: A Survey on Replication Studies and More Directions for Reasoning Language Models

Reasoning-SQL: Reinforcement Learning with SQL Tailored Partial Rewards for Reasoning-Enhanced Text-to-SQL

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

Hugging Face Welcomes the Qwen2.5-Coder Series

Faster Assisted Generation with Dynamic Speculation

CHESS: Contextual Harnessing for Efficient SQL Synthesis

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

Improving Hugging Face Training Efficiency Through Packing with Flash Attention 2

Training and Finetuning Embedding Models with Sentence Transformers v3

StarCoder2-Instruct: Fully Transparent and Permissive Self-Alignment for Code Generation

Text2SQL using Hugging Face Dataset Viewer API and Motherduck DuckDB-NSQL-7B

Mohammadreza Pourreza

AI & ML interests

Organizations

MrezaPRZ's activity

Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers

From Zero to GPU: A Guide to Building and Scaling Production-Ready CUDA Kernels

Accelerate ND-Parallel: A guide to Efficient Multi-GPU Training

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

Hugging Face Welcomes the Qwen2.5-Coder Series

Faster Assisted Generation with Dynamic Speculation

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

Improving Hugging Face Training Efficiency Through Packing with Flash Attention 2

Training and Finetuning Embedding Models with Sentence Transformers v3

StarCoder2-Instruct: Fully Transparent and Permissive Self-Alignment for Code Generation

Text2SQL using Hugging Face Dataset Viewer API and Motherduck DuckDB-NSQL-7B