rohit.vision

Fine-Tuning

Model fine-tuning techniques

1. Full Fine-Tuning (WIP)
   Full parameter fine-tuning strategies for LLMs
2. LoRA & QLoRA (WIP)
   Parameter-efficient fine-tuning (PEFT) using Low-Rank Adaptation and Quantized LoRA
3. TRL (Transformer Reinforcement Learning) (WIP)
   Hugging Face library for RLHF, SFT, DPO, and post-training AI agents
4. Unsloth (WIP)
   High-speed and memory-efficient LLM fine-tuning library

© 2026 Rohit Kumar. rohit.vision