Ming Li

limingcv

·

https://liming-ai.github.io

liming-ai

AI & ML interests

Computer Vision, AIGC, VLM/LLM

Recent Activity

upvoted a paper 4 days ago

ViQ: Text-Aligned Visual Quantized Representations at Any Resolution

upvoted a paper 4 days ago

DanceOPD: On-Policy Generative Field Distillation

upvoted a paper 4 days ago

Qwen-Image-Agent: Bridging the Context Gap in Real-World Image Generation

View all activity

Organizations

upvoted 3 papers 4 days ago

ViQ: Text-Aligned Visual Quantized Representations at Any Resolution

Paper • 2606.27313 • Published 5 days ago • 38

DanceOPD: On-Policy Generative Field Distillation

Paper • 2606.27377 • Published 5 days ago • 75

Qwen-Image-Agent: Bridging the Context Gap in Real-World Image Generation

Paper • 2606.26907 • Published 5 days ago • 47

upvoted a paper 26 days ago

Cosmos 3: Omnimodal World Models for Physical AI

Paper • 2606.02800 • Published 29 days ago • 137

upvoted 3 papers about 2 months ago

RAVEN: Real-time Autoregressive Video Extrapolation with Consistency-model GRPO

Paper • 2605.15190 • Published May 14 • 13

ViPO: Visual Preference Optimization at Scale

Paper • 2604.24953 • Published Apr 29 • 3

Learning from Noisy Preferences: A Semi-Supervised Learning Approach to Direct Preference Optimization

Paper • 2604.24952 • Published Apr 27 • 6

upvoted 3 papers 3 months ago

Seedance 2.0: Advancing Video Generation for World Complexity

Paper • 2604.14148 • Published Apr 15 • 167

MSA: Memory Sparse Attention for Efficient End-to-End Memory Model Scaling to 100M Tokens

Paper • 2603.23516 • Published Mar 6 • 53

UniGRPO: Unified Policy Optimization for Reasoning-Driven Visual Generation

Paper • 2603.23500 • Published Mar 24 • 37

upvoted 2 papers 5 months ago

LoL: Longer than Longer, Scaling Video Generation to Hour

Paper • 2601.16914 • Published Jan 23 • 23

Rethinking Video Generation Model for the Embodied World

Paper • 2601.15282 • Published Jan 21 • 46

upvoted 3 papers 6 months ago

SemanticGen: Video Generation in Semantic Space

Paper • 2512.20619 • Published Dec 23, 2025 • 95

Seedance 1.5 pro: A Native Audio-Visual Joint Generation Foundation Model

Paper • 2512.13507 • Published Dec 15, 2025 • 41

Kling-Omni Technical Report

Paper • 2512.16776 • Published Dec 18, 2025 • 174

upvoted a paper 7 months ago

Does Hearing Help Seeing? Investigating Audio-Video Joint Denoising for Video Generation

Paper • 2512.02457 • Published Dec 2, 2025 • 14

upvoted 3 papers 8 months ago

Emu3.5: Native Multimodal Models are World Learners

Paper • 2510.26583 • Published Oct 30, 2025 • 117

LightBagel: A Light-weighted, Double Fusion Framework for Unified Multimodal Understanding and Generation

Paper • 2510.22946 • Published Oct 27, 2025 • 18

LongCat-Video Technical Report

Paper • 2510.22200 • Published Oct 25, 2025 • 38

upvoted a paper 9 months ago

Self-Forcing++: Towards Minute-Scale High-Quality Video Generation

Paper • 2510.02283 • Published Oct 2, 2025 • 98