Merlina-ORPO-12B
This is the same training run as schneewolflabs/A0l-12B but with a custom ORPO implementation and beta=0.1.
- Downloads last month
- 23
This is the same training run as schneewolflabs/A0l-12B but with a custom ORPO implementation and beta=0.1.