RaushanTurganbay/GPT2_sft_and_dpo_tuned Text Generation • 0.4B • Updated Dec 4, 2023 • 10 • 1
RaushanTurganbay/reward_model_deberta_large_Anthropic_hh Text Classification • 0.4B • Updated Dec 2, 2023 • 13 • 1