Embedding Labs

LoRA: Low-Rank Adaptation of Large Language Models

Hu et al., 2021

Fine-tuning, Efficiency, PEFT

Abstract

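LoRA freezes the pre-trained model weights and injects trainable rank decomposition matrices into each layer of the Transformer architecture. This greatly reduces the number of trainable parameters for downstream tasks, making fine-tuning of large models feasible on limited hardware. The paper reports that, compared to GPT-3 175B fine-tuned with Adam, LoRA can reduce the number of trainable parameters by 10,000 times and the GPU memory requirement by 3 times.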
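
As a rough illustration of the idea in the abstract, the sketch below adds a low-rank update B·A, scaled by alpha/r, to a frozen weight matrix W0. It is a minimal pure-Python toy, not the authors' implementation: the class name `LoRALinear`, the helpers `matvec` and `matmul`, and the tiny matrix sizes are all assumptions made for this example. A starts as small Gaussian noise and B starts at zero, so the layer initially behaves exactly like the frozen one.

```python
import random

def matvec(M, x):
    """Multiply matrix M (a list of rows) by vector x."""
    return [sum(m * v for m, v in zip(row, x)) for row in M]

def matmul(P, Q):
    """Multiply matrix P (n x k) by matrix Q (k x m)."""
    cols = list(zip(*Q))
    return [[sum(p * q for p, q in zip(row, col)) for col in cols] for row in P]

class LoRALinear:
    """Hypothetical toy layer: frozen W0 (d_out x d_in) plus update B @ A."""

    def __init__(self, W0, r, alpha=1.0, seed=0):
        rng = random.Random(seed)
        d_out, d_in = len(W0), len(W0[0])
        self.W0 = W0  # frozen: never updated during fine-tuning
        # Trainable factor A (r x d_in), small Gaussian init.
        self.A = [[rng.gauss(0.0, 0.01) for _ in range(d_in)] for _ in range(r)]
        # Trainable factor B (d_out x r), zero init so B @ A starts at zero.
        self.B = [[0.0] * r for _ in range(d_out)]
        self.scale = alpha / r

    def forward(self, x):
        base = matvec(self.W0, x)                   # frozen path
        update = matvec(self.B, matvec(self.A, x))  # low-rank path
        return [b + self.scale * u for b, u in zip(base, update)]

    def merged_weight(self):
        """Fold the update into one matrix: W0 + scale * (B @ A)."""
        BA = matmul(self.B, self.A)
        return [[w + self.scale * d for w, d in zip(w_row, d_row)]
                for w_row, d_row in zip(self.W0, BA)]

W0 = [[1.0, 2.0, 3.0], [4.0, 5.0, 6.0]]
layer = LoRALinear(W0, r=1, alpha=1.0)
x = [1.0, 0.0, -1.0]
print(layer.forward(x))  # B is zero at init, so this equals W0 x: [-2.0, -2.0]
```

Only A and B would receive gradient updates during fine-tuning. Because the update can be folded back into a single matrix, as `merged_weight` does, the adapted layer need not add extra work at inference time.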

Why It Matters

  • Made fine-tuning practical on constrained hardware
  • Became a standard approach for adapters and model personalization
  • Sharply reduced storage and compute requirements, since only the small adapter matrices are trained and saved per task
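
To make the storage point concrete, the arithmetic below compares the trainable parameters of one fully fine-tuned weight matrix with those of its rank-r adapter. The function name `lora_param_counts` and the dimensions (a 4096 x 4096 matrix with r = 8) are illustrative assumptions, not figures from the source.

```python
def lora_param_counts(d_out, d_in, r):
    """Return (full, lora) trainable parameter counts for one weight matrix."""
    full = d_out * d_in        # every entry of W is trainable
    lora = r * (d_out + d_in)  # B is d_out x r, A is r x d_in
    return full, lora

# Illustrative sizes only (assumed, not taken from the source).
full, lora = lora_param_counts(4096, 4096, 8)
print(full, lora, full // lora)  # 16777216 65536 256
```

Under these assumed sizes the adapter holds 256 times fewer trainable values than the full matrix, and a per-task checkpoint only needs to store A and B.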
