Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

AngelRaychev
/
policy_iteration_1

Text Generation
Transformers
Safetensors
llama
Generated from Trainer
trl
sft
conversational
text-generation-inference
Model card Files Files and versions
xet
Community

You need to agree to share your contact information to access this model

This repository is publicly accessible, but you have to accept the conditions to access its files and content.

Log in or Sign Up to review the conditions and access this model content.

Gated model
You can list files but not access them

Preview of files found in this repository
  • .gitattributes
    1.52 kB
    initial commit 10 months ago
  • README.md
    1.5 kB
    End of training 9 months ago
  • config.json
    878 Bytes
    End of training 10 months ago
  • generation_config.json
    132 Bytes
    End of training 10 months ago
  • loss_plot.png
    28.1 kB
    End of training 9 months ago
  • merges.txt
    466 kB
    End of training 10 months ago
  • model.safetensors
    538 MB
    xet
    End of training 9 months ago
  • special_tokens_map.json
    655 Bytes
    End of training 10 months ago
  • tokenizer.json
    3.52 MB
    End of training 10 months ago
  • tokenizer_config.json
    3.79 kB
    End of training 10 months ago
  • training_args.bin
    5.62 kB
    xet
    End of training 9 months ago
  • vocab.json
    801 kB
    End of training 10 months ago