Full Fine-Tuning

Full parameter fine-tuning strategies for LLMs

Overview

Unlike PEFT methods, full fine-tuning updates all parameters in the LLM. It requires massive compute and memory but is necessary for deep domain adaptation or continued pre-training.

Challenges

  • Memory bottlenecks (optimizer states, gradients).
  • Catastrophic forgetting.

TODO: Add details on strategies (FSDP, DeepSpeed).

Additional Resources: