--- title: DeepSeek-OCR Studio emoji: 🔍 colorFrom: blue colorTo: purple sdk: gradio sdk_version: 4.44.0 app_file: app.py pinned: false license: mit --- # 🔍 DeepSeek-OCR Studio Advanced OCR system based on [DeepSeek-OCR](https://github.com/deepseek-ai/DeepSeek-OCR), providing powerful document parsing capabilities. ## ✨ Features - **Multi-language OCR**: Support for Chinese, English, and many other languages - **Table Extraction**: Intelligent table recognition and markdown conversion - **Chart Analysis**: Extract data from charts and graphs - **Professional Drawings**: Semantic recognition of CAD drawings, flowcharts, etc. - **Layout Analysis**: Preserve document structure and formatting - **PDF Support**: Process PDF documents page by page ## 🚀 Quick Start 1. **Upload an Image or PDF**: Click to upload your document 2. **Optional Prompt**: Customize the OCR task (e.g., "Extract tables", "Analyze chart") 3. **Extract Text**: Click the button and wait for results ## 📝 Prompt Examples ### Basic OCR ``` Free OCR. ``` ### Table Extraction ``` Extract all tables and convert to markdown format. ``` ### Chart Analysis ``` Analyze this chart and extract data in table format. ``` ### Multi-language Documents ``` Extract all text in multiple languages. ``` ### Technical Drawings ``` Analyze this CAD drawing and describe its components. ``` ## ⚙️ Deployment Information - **Platform**: Hugging Face Spaces with ZeroGPU - **Model**: [deepseek-ai/DeepSeek-OCR](https://huggingface.co/deepseek-ai/DeepSeek-OCR) - **Processing Time**: 30-120 seconds per image/page - **PDF Limitation**: First 3 pages only (ZeroGPU constraint) ## 🔧 Local Deployment For full functionality with unlimited pages and faster processing: ```bash # Clone the repository git clone https://github.com/fufankeji/DeepSeek-OCR-Web.git cd DeepSeek-OCR-Web # Install dependencies bash install.sh # Start services bash start.sh ``` **Requirements**: - Linux OS - GPU with ≥7GB VRAM (16-24GB recommended) - Python 3.10-3.12 - CUDA 11.8 or 12.1/12.2 ## 📚 Documentation - [Official DeepSeek-OCR](https://github.com/deepseek-ai/DeepSeek-OCR) - [Web Interface Repository](https://github.com/fufankeji/DeepSeek-OCR-Web) - [Model on Hugging Face](https://huggingface.co/deepseek-ai/DeepSeek-OCR) ## 🙏 Acknowledgments - **DeepSeek AI**: For the amazing OCR model - **Hugging Face**: For providing ZeroGPU infrastructure - Original project: [DeepSeek-OCR-Web](https://github.com/fufankeji/DeepSeek-OCR-Web) ## 📄 License MIT License ## 🐛 Known Limitations on Spaces - ZeroGPU has time limits (120 seconds per request) - PDF processing limited to first 3 pages - First request takes longer (model loading) - Large images may timeout For production use, please deploy locally with dedicated GPU.