LLM Course

0. ติดตั้งโปรแกรม

1. โมเดล Transformer

บทนำ การประมวลผลภาษาธรรมชาติ Transformers ชื่อนี้มีดียังไง?Transformers ทำงานยังไง?โมเดล Encoder โมเดล Decoder โมเดล sequence-to-sequence ข้อจำกัดจากอคติของข้อมูล สรุป คำถามท้ายบท

2. การใช้งาน 🤗 Transformers

3. การ fine-tune โมเดลที่ผ่านการเทรนมาแล้ว (pretrained model)

4. การแบ่งปันโมเดลและ tokenizers

6. ตัวตัดคำจาก 🤗 Tokenizers library

Join the Hugging Face community

and get access to the augmented documentation experience

Collaborate on models, datasets and Spaces

Faster examples with accelerated inference

Switch between documentation themes

to get started

โมเดล Decoder

โมเดล decoder ใช้เพียงส่วน decoder จากโมเดล Transformer เท่านั้น ในแต่ละชั้น attention layer สามารถเข้าถึงคำที่อยู่ตำแหน่งก่อนหน้าในประโยคได้เท่านั้น โมเดลเหล่านี้เรียกว่า โมเดล auto-regressive

โมเดล pretrain ในกลุ่มนี้ใช้ในการทำนายคำต่อไปในประโยค เหมาะสำหรับงานในการสร้างข้อความ

ตัวแทนโมเดลในกลุ่มนี้ได้แก่:

Update on GitHub

←โมเดล Encoder โมเดล sequence-to-sequence→