Bui Van Hop
hllj
AI & ML interests
Computer Vision, Deep Learning, NLP
Organizations
PEFT
- GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
  Paper • 2403.03507 • Published • 189
- QA-LoRA: Quantization-Aware Low-Rank Adaptation of Large Language Models
  Paper • 2309.14717 • Published • 46
- ReFT: Representation Finetuning for Language Models
  Paper • 2404.03592 • Published • 101
Pruning
- ShortGPT: Layers in Large Language Models are More Redundant Than You Expect
  Paper • 2403.03853 • Published • 66
- SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot
  Paper • 2301.00774 • Published • 4
- The LLM Surgeon
  Paper • 2312.17244 • Published • 9
- SliceGPT: Compress Large Language Models by Deleting Rows and Columns
  Paper • 2401.15024 • Published • 73
Models 27
hllj/paligemma-3b-mix-224-vi-llava-checkpoint-10000
Updated • 3
hllj/paligemma-3b-mix-224-vi-llava
Image-Text-to-Text • 3B • Updated
hllj/mistral-instruct-v0.2-awq-marlin
Text Generation • 7B • Updated • 1
hllj/BloomZ-7B1-Vi-Math
Text Generation • 7B • Updated • 3
hllj/Qwen-7B-Vi-Math
Text Generation • 8B • Updated • 1
hllj/Zephyr-beta-7B-Vi-Math
Text Generation • 7B • Updated • 3
hllj/Llama2-7B-Vi-Math
Text Generation • 7B • Updated • 2
hllj/Mistral-7B-Vi-Math
Text Generation • 7B • Updated • 2
hllj/sft-mistral-v1-clean-valid
Updated
hllj/sft-mistral-v2-clean-valid
Text Generation • Updated
Datasets 7
hllj/medscape
Viewer • Updated • 508 • 14 • 1
hllj/quesmed
Viewer • Updated • 4.83k • 14
hllj/synthetic-text-embedding
Viewer • Updated • 10k • 40
hllj/Vi-VLM
Updated • 5
hllj/vi_grade_school_math_mcq
Viewer • Updated • 2.73k • 27 • 3
hllj/vi_math_problem_crawl
Viewer • Updated • 10.2k • 26 • 1
hllj/vi_gsm8k
Viewer • Updated • 8.79k • 53 • 2