License compatibility

#17
by Schilder - opened

Hi, thanks for sharing SuperNova-Medius — the architecture looks impressive! I noticed in the README that the model leverages outputs from LLaMA-3.1-405B-Instruct as part of the distillation process. I was just wondering — given that the LLaMA 3.1 license explicitly restricts using its outputs to train or improve other large language models, how did you navigate that in this case?

No criticism at all — just genuinely curious how licensing considerations played into your pipeline!

Looking forward to your response!

I don't believe they used outputs, but rather weights.
