Upload README.md with huggingface_hub
README.md
CHANGED
@@ -21,7 +21,7 @@ model-index:
 
 # X-CLIP (base-sized model)
 
-X-CLIP model (base-sized, patch resolution of
+X-CLIP model (base-sized, patch resolution of 16) trained fully-supervised on [Kinetics-400](https://www.deepmind.com/open-source/kinetics). It was introduced in the paper [Expanding Language-Image Pretrained Models for General Video Recognition](https://arxiv.org/abs/2208.02816) by Ni et al. and first released in [this repository](https://github.com/microsoft/VideoX/tree/master/X-CLIP).
 
 This model was trained using 8 frames per video, at a resolution of 224x224.
 
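
The model card states that the model was trained on 8 frames per video at 224x224, which means a clip has to be reduced to 8 frames before it reaches the model. The card does not say how those frames are chosen, so the following is only a minimal sketch that assumes evenly spaced sampling. `sample_frame_indices`, `NUM_FRAMES`, and `FRAME_SIZE` are hypothetical names introduced for illustration and are not part of any library.

```python
from typing import List

# Values taken from the model card; the names themselves are illustrative.
NUM_FRAMES = 8            # frames per video
FRAME_SIZE = (224, 224)   # target height and width of each frame


def sample_frame_indices(total_frames: int, num_frames: int = NUM_FRAMES) -> List[int]:
    """Return `num_frames` indices spread evenly across a clip.

    Assumption: uniform sampling. The clip is split into `num_frames`
    equal segments and the midpoint of each segment is taken. Clips
    shorter than `num_frames` end up with repeated indices.
    """
    if total_frames <= 0:
        raise ValueError("total_frames must be positive")
    if num_frames <= 0:
        raise ValueError("num_frames must be positive")
    segment = total_frames / num_frames
    return [
        min(total_frames - 1, int((i + 0.5) * segment))
        for i in range(num_frames)
    ]


if __name__ == "__main__":
    # An 80-frame clip is reduced to 8 indices, one per 10-frame segment.
    print(sample_frame_indices(80))
```

The selected frames would then be resized to `FRAME_SIZE` by whatever preprocessing pipeline is in use; that step is left out here because it depends on the video-decoding library.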