Models
Datasets
Spaces
Docs
Enterprise
Pricing
Log In
Sign Up

Laura O'Mahony's picture

10 1 3

Laura O'Mahony

lomahony

Felladrin's profile picture

tahamajs's profile picture

·

_lauraaisling
lauraaisling

AI & ML interests

PhD student

Organizations

None yet

lomahony 's collections 4

Pythia-hh-all-sft-dpo

Pythia models supervised finetuned and DPO finetuned with all of Anthropic-hh-rlhf dataset for 1 epoch.

lomahony/eleuther-pythia160m-hh-sft

Text Generation • 0.2B • Updated Aug 12, 2023 • 34
lomahony/eleuther-pythia2.8b-hh-sft

Text Generation • Updated Aug 12, 2023 • 55 • 1
lomahony/eleuther-pythia410m-hh-sft

Text Generation • Updated Aug 12, 2023 • 88
lomahony/eleuther-pythia6.9b-hh-dpo

Text Generation • Updated Aug 12, 2023 • 151

pythia-helpful-epoch2

Pythia-2.8b supervised finetuned and DPO finetuned with the helpful subset of Anthropic-hh-rlhf dataset for a second epoch.

lomahony/pythia-2.8b-helpful-sft-epoch2

Text Generation • 3B • Updated Mar 6, 2024 • 6
lomahony/pythia-1b-helpful-sft-epoch2

Text Generation • 1B • Updated Mar 6, 2024 • 6
lomahony/pythia-1.4b-helpful-sft-epoch2

Text Generation • 1B • Updated Mar 6, 2024 • 6
lomahony/pythia-410m-helpful-sft-epoch2

Text Generation • 0.4B • Updated Mar 6, 2024 • 6

pythia-helpful-1epoch

Pythia-2.8b supervised finetuned and DPO finetuned with the helpful subset of Anthropic-hh-rlhf dataset for 1 epoch.

lomahony/pythia-410m-helpful-dpo

Text Generation • Updated May 14, 2024 • 25
lomahony/pythia-2.8b-helpful-sft

Text Generation • 3B • Updated May 14, 2024 • 128
lomahony/pythia-160m-helpful-sft

Text Generation • 0.2B • Updated Nov 13, 2024 • 20
lomahony/pythia-70m-helpful-sft

Text Generation • 70.4M • Updated Jan 20 • 17

Pythia-helpful 3 epochs

lomahony/pythia-2.8b-helpful-sft-3epochs

Text Generation • 3B • Updated Mar 14, 2024 • 9
lomahony/pythia-2.8b-helpful-sfted2-dpo-3epochs

Updated Mar 19, 2024
lomahony/pythia-2.8b-helpful-sfted1-dpo-3epochs

Updated Mar 19, 2024
lomahony/pythia-2.8b-helpful-sfted0-dpo-3epochs

Updated Mar 19, 2024

Pythia-hh-all-sft-dpo

Pythia models supervised finetuned and DPO finetuned with all of Anthropic-hh-rlhf dataset for 1 epoch.

lomahony/eleuther-pythia160m-hh-sft

Text Generation • 0.2B • Updated Aug 12, 2023 • 34
lomahony/eleuther-pythia2.8b-hh-sft

Text Generation • Updated Aug 12, 2023 • 55 • 1
lomahony/eleuther-pythia410m-hh-sft

Text Generation • Updated Aug 12, 2023 • 88
lomahony/eleuther-pythia6.9b-hh-dpo

Text Generation • Updated Aug 12, 2023 • 151

pythia-helpful-1epoch

Pythia-2.8b supervised finetuned and DPO finetuned with the helpful subset of Anthropic-hh-rlhf dataset for 1 epoch.

lomahony/pythia-410m-helpful-dpo

Text Generation • Updated May 14, 2024 • 25
lomahony/pythia-2.8b-helpful-sft

Text Generation • 3B • Updated May 14, 2024 • 128
lomahony/pythia-160m-helpful-sft

Text Generation • 0.2B • Updated Nov 13, 2024 • 20
lomahony/pythia-70m-helpful-sft

Text Generation • 70.4M • Updated Jan 20 • 17

pythia-helpful-epoch2

Pythia-2.8b supervised finetuned and DPO finetuned with the helpful subset of Anthropic-hh-rlhf dataset for a second epoch.

lomahony/pythia-2.8b-helpful-sft-epoch2

Text Generation • 3B • Updated Mar 6, 2024 • 6
lomahony/pythia-1b-helpful-sft-epoch2

Text Generation • 1B • Updated Mar 6, 2024 • 6
lomahony/pythia-1.4b-helpful-sft-epoch2

Text Generation • 1B • Updated Mar 6, 2024 • 6
lomahony/pythia-410m-helpful-sft-epoch2

Text Generation • 0.4B • Updated Mar 6, 2024 • 6

Pythia-helpful 3 epochs

lomahony/pythia-2.8b-helpful-sft-3epochs

Text Generation • 3B • Updated Mar 14, 2024 • 9
lomahony/pythia-2.8b-helpful-sfted2-dpo-3epochs

Updated Mar 19, 2024
lomahony/pythia-2.8b-helpful-sfted1-dpo-3epochs

Updated Mar 19, 2024
lomahony/pythia-2.8b-helpful-sfted0-dpo-3epochs

Updated Mar 19, 2024

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs