4 related articles

Deep dive into LlamaFactory, an open-source unified fine-tuning framework supporting 100+ LLMs and VLMs with LoRA, QLoRA, RLHF methods, Web UI, 71K+ GitHub Stars, accepted at ACL 2024.
Deep DivesComplete guide to the three core LLM training stages: pre-training, supervised fine-tuning (SFT), and preference alignment (DPO/PPO), covering LoRA, distillation, quantization, and pruning.
TutorialsA systematic guide to LLM engineer core skills covering RAG, Agent app development and SFT, RLHF fine-tuning, with clear learning paths for different backgrounds.
TutorialsDeep dive into Hermes Agent's four progressive cases: terminal ReAct loop, Feishu AI assistant, four-layer persistent memory, and three-stage Skill evolution with DeepSeek support.