4 related articles

From linear regression and logistic regression to gradient descent, this guide derives the core mechanisms of neural networks step by step, covering Sigmoid, cross-entropy, activation functions, and backpropagation.

How can non-CS graduate students use AI tools like Cursor to efficiently complete their thesis? A complete guide covering data sourcing, code adaptation, and AI-assisted modifications.
Deep DivesAnalyzing the "worse is better" philosophy in large model architecture: why DeepSeek V4 dropped N-gram, why Transformer dominates AI, and three iron laws of simple, efficient model design.
Deep DivesComplete guide to the three core LLM training stages: pre-training, supervised fine-tuning (SFT), and preference alignment (DPO/PPO), covering LoRA, distillation, quantization, and pruning.