Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
1
8
18
Hao Sun
Holarissun
Follow
Ray2333's profile picture
Shailx's profile picture
2 followers
·
2 following
https://holarissun.github.io/
HolarisSun
holarissun
AI & ML interests
PhD@Uni.Cambridge. Deep RL, RL x LLM, RLHF.
Organizations
None yet
Holarissun
's models
356
Sort: Recently updated
Holarissun/gptj6b-aisft-hh-seqsampler-subset60000
Updated
Mar 11, 2024
Holarissun/gptj6b-aisft-hh-randsampler-subset2000
Updated
Mar 11, 2024
Holarissun/gptj6b-aisft-hh-seqsampler-subset2000
Updated
Mar 11, 2024
•
1
Holarissun/phi2-aisft-synhh-seqsampler-subset30000
Updated
Mar 11, 2024
Holarissun/phi2-aisft-synhh-randsampler-subset30000
Updated
Mar 10, 2024
Holarissun/phi2-aisft-hh-seqsampler-subset10000
Updated
Mar 10, 2024
Holarissun/phi2-aisft-hh-randsampler-subset10000
Updated
Mar 10, 2024
Holarissun/phi2-airl_sft-imdb-randsampler
Updated
Mar 10, 2024
Holarissun/phi2-airl_sft-imdb-seqsampler
Updated
Mar 10, 2024
Holarissun/gpt2full-airl_sft-imdb-seqsampler
Text Generation
•
0.1B
•
Updated
Mar 10, 2024
Holarissun/gpt2full-airl_sft-imdb-randsampler
Text Generation
•
0.1B
•
Updated
Mar 10, 2024
Holarissun/gpt2-airl_sft-imdb-randsampler
Updated
Mar 10, 2024
Holarissun/gpt2-airl_sft-imdb-seqsampler
Updated
Mar 10, 2024
•
3
Holarissun/zephyr3b-airl_sft-tldr-randsampler
Updated
Mar 10, 2024
Holarissun/zephyr3b-airl_sft-tldr-seqsampler
Updated
Mar 10, 2024
•
1
Holarissun/gemma2b-airl_sft-tldr-randsampler
Updated
Mar 9, 2024
Holarissun/gemma2b-airl_sft-tldr-seqsampler
Updated
Mar 9, 2024
Holarissun/gptj6b-airl_sft-tldr-randsampler
Updated
Mar 9, 2024
Holarissun/gptj6b-airl_sft-tldr-seqsampler
Updated
Mar 9, 2024
Holarissun/gpt2-airl_sft-tldr-seqsampler
Updated
Mar 8, 2024
Holarissun/gpt2-airl_sft-tldr-randsampler
Updated
Mar 8, 2024
Holarissun/phi2-airl_sft-tldr-seqsampler
Updated
Mar 8, 2024
Holarissun/phi2-sft-tldr
Updated
Mar 2, 2024
Holarissun/gpt2-sft-tldr
Text Generation
•
0.1B
•
Updated
Jan 17, 2024
•
1
Holarissun/trl_rm_tldr_gpt2
Text Classification
•
0.1B
•
Updated
Jan 10, 2024
Holarissun/gpt2-rm-tldr
Text Classification
•
0.1B
•
Updated
Jan 8, 2024
Previous
1
...
10
11
12
Next