Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Zhi Zheng's picture
1 1 1

Zhi Zheng

zz1358m
Papercold's profile picture
·
https://zz1358m.github.io/zhizheng.github.io/

AI & ML interests

LLM reasoning, Trustworthy LLM, LLM application, Neural combinatorial optimization.

Recent Activity

liked a model about 2 months ago
zz1358m/SofT-GRPO-master
updated a model 2 months ago
zz1358m/SofT-GRPO-master
authored a paper 2 months ago
SofT-GRPO: Surpassing Discrete-Token LLM Reinforcement Learning via Gumbel-Reparameterized Soft-Thinking Policy Optimization
View all activity

Organizations

National University of Singapore's profile picture

Papers 4

arxiv:2511.06411
arxiv:2505.12348
arxiv:2501.08603
arxiv:2407.00312

models 2

zz1358m/SofT-GRPO-master

Updated Nov 13, 2025 • 8

zz1358m/Reasoning-CV

Updated Sep 10, 2025

datasets 0

None public yet
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs