bilalfaye/OneEncoder-text-image

bilalfaye committed on Feb 21

Commit 50f445f · verified · 1 Parent(s): b51d555

Update README.md

Files changed (1)

README.md +6 -7

@@ -10,7 +10,6 @@ base_model:
 - google-bert/bert-base-uncased
 ---
 # 🖼️📝 OneEncoder: A Unified Text & Image Model
 **OneEncoder** is a lightweight framework for cross-modal alignment, focusing on efficiently integrating **text and images** (with future extensions to other modalities). Unlike traditional methods relying on massive modality-specific encoders, OneEncoder progressively aligns different data types, making it cost-effective and performant even on small paired datasets.
@@ -25,14 +24,14 @@ base_model:
 - **Image-Text Retrieval**
 - **Multimodal Content Understanding**
+## 📝 Authors
+📌 **Bilal FAYE**, Hanane AZZAG, Mustapha LEBBAH, Djamel BOUCHAFFRA
+## 📄 Research Paper
+📜 **arXiv**: [OneEncoder: Progressive Cross-Modal Alignment](https://arxiv.org/abs/2409.11059)
 ## 📌 Resources
 🔗 **GitHub Repo**: [OneEncoder](https://github.com/b-faye/OneEncoder)
 🚀 **Hugging Face Demo**: [OneEncoder Retriever](https://huggingface.co/spaces/bilalfaye/OneEncoder-retriever)
 📓 **Demo Notebook**: [OneEncoder Demos](https://github.com/b-faye/OneEncoder/tree/main/demo)
 🔊 **OneEncoder for Text, Image & Audio**: [HF Model](https://huggingface.co/bilalfaye/OneEncoder-text-image-audio)
-# Authors:
-Bilal FAYE, Hanane AZZAG, Mustapha LEBBAH, Djamel BOUCHAFFRA