OmniDirector: General Multi-Shot Camera Cloning without Cross-Paired Data Paper • 2606.13432 • Published 10 days ago • 102
stefanocarrera/autophagycode_D_he_train-mercury_Qwen3-4B_strategy_surplexity_t1_g3_run2_metrics Viewer • Updated 18 days ago • 164 • 60 • 2
LatentUMM: Dual Latent Alignment for Unified Multimodal Models Paper • 2605.17766 • Published May 18 • 9
Video2GUI: Synthesizing Large-Scale Interaction Trajectories for Generalized GUI Agent Pretraining Paper • 2605.14747 • Published May 14 • 146
kairawal/Llama-3.2-3B-Instruct-TL-SynthDolly-r16alpha32-E3-S73 Text Generation • 3B • Updated May 14 • 86 • 1
QEIL v2: Heterogeneous Computing for Edge Intelligence via Roofline-Derived Pareto-Optimal Energy Modeling and Multi-Objective Orchestration Paper • 2602.06057 • Published Apr 5 • 5
GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning Paper • 2604.02721 • Published Apr 3 • 633