alanzhuly commited on
Commit
0d81aac
·
verified ·
1 Parent(s): 1be6088

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +42 -0
README.md ADDED
@@ -0,0 +1,42 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # Llama-3.2-3B
2
+ Run **Llama-3.2-3B** optimized for **Intel NPUs** with [nexaSDK](https://sdk.nexa.ai).
3
+
4
+ ## Quickstart
5
+
6
+ 1. **Install nexaSDK** and create a free account at [sdk.nexa.ai](https://sdk.nexa.ai)
7
+ 2. **Activate your device** with your access token:
8
+
9
+ ```bash
10
+ nexa config set license '<access_token>'
11
+ ```
12
+ 3. Run the model on Qualcomm NPU in one line:
13
+
14
+ ```bash
15
+ nexa infer NexaAI/llama3.2-3B-intel-npu
16
+ ```
17
+
18
+ ## Model Description
19
+ **Llama-3.2-3B** is a compact member of the Llama 3.2 family, designed to provide strong general-purpose language modeling in a lightweight 3B parameter footprint.
20
+ It balances efficiency with capability, making it well-suited for edge devices, prototyping, and applications where latency and resource constraints are critical.
21
+
22
+ ## Features
23
+ - **Lightweight architecture**: 3B parameters optimized for fast inference and low memory usage.
24
+ - **Instruction-following**: Tuned for prompts, Q&A, and step-by-step reasoning.
25
+ - **Multilingual capabilities**: Covers a wide range of global languages at smaller scale.
26
+ - **Deployment flexibility**: Runs efficiently on consumer hardware and server environments.
27
+
28
+ ## Use Cases
29
+ - Conversational assistants and chatbots.
30
+ - Educational tools and lightweight tutoring systems.
31
+ - Prototyping and experimentation with large language models on limited resources.
32
+ - Applications where cost or latency is a priority over sheer scale.
33
+
34
+ ## Inputs and Outputs
35
+ **Input**: Text prompts—questions, commands, or code snippets.
36
+ **Output**: Natural language responses including answers, explanations, or structured outputs.
37
+
38
+ ## License
39
+ - Licensed under **Meta Llama 3.2 Community License**
40
+
41
+ ## References
42
+ - Model card: [https://huggingface.co/meta-llama/Llama-3.2-3B](https://huggingface.co/meta-llama/Llama-3.2-3B)