Harry Soteriou's picture

40 60

Harry Soteriou PRO

HarrySoteriou

·

HarrySoteriou

AI & ML interests

LLMs, Deep Reinforcement Learning, TinyML, Computer Vision

Recent Activity

upvoted a paper 17 days ago

DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AI

liked a Space about 1 month ago

HuggingFaceTB/smol-training-playbook

liked a Space about 1 month ago

OpenEvals/evaluation-guidebook

View all activity

Organizations

upvoted a paper 17 days ago

DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AI

Paper • 2512.16676 • Published Dec 18, 2025 • 217

upvoted a paper 5 months ago

Group Sequence Policy Optimization

Paper • 2507.18071 • Published Jul 24, 2025 • 316

upvoted 4 papers 7 months ago

Kwai Keye-VL Technical Report

Paper • 2507.01949 • Published Jul 2, 2025 • 130

Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning

Paper • 2505.24726 • Published May 30, 2025 • 277

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9, 2025 • 263

GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

Paper • 2507.01006 • Published Jul 1, 2025 • 251

upvoted a paper 8 months ago

MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention

Paper • 2506.13585 • Published Jun 16, 2025 • 273

upvoted 5 papers 9 months ago

AdaCoT: Pareto-Optimal Adaptive Chain-of-Thought Triggering via Reinforcement Learning

Paper • 2505.11896 • Published May 17, 2025 • 58

Sailing AI by the Stars: A Survey of Learning from Rewards in Post-Training and Test-Time Scaling of Large Language Models

Paper • 2505.02686 • Published May 5, 2025 • 16

Insights into DeepSeek-V3: Scaling Challenges and Reflections on Hardware for AI Architectures

Paper • 2505.09343 • Published May 14, 2025 • 76

System Prompt Optimization with Meta-Learning

Paper • 2505.09666 • Published May 14, 2025 • 71

LLMs Get Lost In Multi-Turn Conversation

Paper • 2505.06120 • Published May 9, 2025 • 7

upvoted an article 9 months ago

Article

All LLMs Will Be Sparse BitNet Hybrids

May 14, 2025

•

16

upvoted 7 papers 9 months ago

Phi-4 Technical Report

Paper • 2412.08905 • Published Dec 12, 2024 • 122

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22, 2025 • 438

MiniMax-01: Scaling Foundation Models with Lightning Attention

Paper • 2501.08313 • Published Jan 14, 2025 • 300

Perception, Reason, Think, and Plan: A Survey on Large Multimodal Reasoning Models

Paper • 2505.04921 • Published May 8, 2025 • 185

Absolute Zero: Reinforced Self-play Reasoning with Zero Data

Paper • 2505.03335 • Published May 6, 2025 • 189

TTRL: Test-Time Reinforcement Learning

Paper • 2504.16084 • Published Apr 22, 2025 • 120

PRIMA.CPP: Speeding Up 70B-Scale LLM Inference on Low-Resource Everyday Home Clusters

Paper • 2504.08791 • Published Apr 7, 2025 • 139