| license: apache-2.0 | |
| datasets: | |
| - imagenet-1k | |
| - ade20k | |
| metrics: | |
| - accuracy | |
| - mIoU | |
| pipeline_tag: image-classification | |
| # Pyramid-VisionLLaMA-S | |
| # How to Use | |
| Please refer the [Github](https://github.com/Meituan-AutoML/VisionLLaMA) page for usage. | |
| # Citation | |
| ``` | |
| @article{chu2024visionllama, | |
| title={VisionLLaMA: A Unified LLaMA Interface for Vision Tasks}, | |
| author={Chu, Xiangxiang and Su, Jianlin and Zhang, Bo and Shen, Chunhua}, | |
| journal={arXiv preprint arXiv:2403.00522}, | |
| year={2024} | |
| } | |
| ``` |