Tiiny
/

TurboSparse-Mixtral-Instruct

Feature Extraction

turbosparsemixtral

Model card Files Files and versions

TurboSparse-Mixtral-Instruct / README.md

Yixin Song

Update README.md

1743888 verified over 1 year ago

|

956 Bytes

	---
	license: apache-2.0
	language:
	- en
	---

	# Model Card for TurboSparse-Mixtral
	The TurboSparse-Mixtral Large Language Model (LLM) is an sparsified version of the Mixtral.

	<img src="takeaway.pdf" alt="avatar" width="300" height="200"/>

	The average performance is evaluated using benchmarks from the OpenLLM Leaderboard.

	## Inference

	Our code for accelerating TurboSparse-Mixtral is currently being refined. Stay tuned! Now you can run this model like dense model.

	## Chat-Template

	During sparsification, we also utilize some SFT datasets.
	We take ChatML as our chat template:
	```
	<\|im_start\|>user\n{{content}}<\|im_end\|>\n<\|im_start\|>assistant\n
	```

	## Allow Finetuning

	As we merged the predictors for FFN neurons in models, you can finetune TurboSparse-Mixtral with any framework and algorithm.

	## License

	The model is licensed under Apache-2.0, while model weights are fully open for academic research and also allow free commercial usage.