Embedding Labs

LoRA: Low-Rank Adaptation of Large Language Models

Hu et al. • 2021

Fine-tuningEfficiencyPEFT

Abstract

LoRA proposes freezing pre-trained model weights and injecting trainable rank decomposition matrices into each layer. This approach reduces trainable parameters significantly, making fine-tuning of large models feasible on limited hardware.

Perché È Importante

Made fine-tuning practical on constrained hardware
Standard approach for adapters and model personalization
Major reduction in storage and compute requirements

Vedi su arXiv Scarica PDF

Chiedi su questo articolo

Loading chat...