Skander Moalla's picture

6 2

Skander Moalla

skandermoalla

·

https://skandermoalla.com/

AI & ML interests

DeepRL, RL finetuning

Recent Activity

upvoted a paper 26 days ago

Apertus: Democratizing Open and Compliant LLMs for Global Language Environments

upvoted a paper 26 days ago

Quantile Reward Policy Optimization: Alignment with Pointwise Regression and Exact Partition Functions

liked a dataset 26 days ago

LukeBailey181Pub/D_3k

View all activity

Organizations

upvoted 2 papers 26 days ago

Apertus: Democratizing Open and Compliant LLMs for Global Language Environments

Paper • 2509.14233 • Published Sep 17, 2025 • 20

Quantile Reward Policy Optimization: Alignment with Pointwise Regression and Exact Partition Functions

Paper • 2507.08068 • Published Jul 10, 2025 • 1

upvoted a paper about 2 months ago

Efficient RL Training for LLMs with Experience Replay

Paper • 2604.08706 • Published Apr 9 • 22

upvoted a paper 6 months ago

Investigating Low-Rank Training in Transformer Language Models: Efficiency and Scaling Analysis

Paper • 2407.09835 • Published Jul 13, 2024 • 1

upvoted a collection over 1 year ago

Tulu V1 Suite

The set of models associated with the paper "How Far Can Camels Go? Exploring the State of Instruction Tuning on Open Resources". • 34 items • Updated Mar 4, 2025 • 3