12 articles
New Species Discovered in New York's C…
Scientists set up insect traps in NYC's Central Park and Prospect Park to discover unknown species. With 90% of Earth's species still unnamed, urban biodiversity research is becoming a new trend in ecology.
The Full Story of the Higgs Boson Disc…
A Fermilab physicist's insider account of the Higgs boson discovery: the transatlantic race with CERN, behind-the-scenes details of the 2012 announcement, 14 years of verification, and the true origin of the 'God Particle' name.
Research: Yale and other institutions introduce SciMDR, a two-stage data synthesis pipeline that enables a 7B model to match GPT-5-level performance in scientific literature comprehension.
Research: Deep analysis of Claude Code's open-source architecture: dual-loop design, 7-step tool pipeline, 4-layer token compression, memory systems, and multi-agent collaboration patterns.
Research: MementoGUI is a plugin-style multimodal memory management framework that solves GUI agent forgetting in long-horizon tasks through dual time-scale memory and four memory control operators, boosting long-task completion without fine-tuning.
Research: Deep dive into how the Humanize framework transforms LLM tokens into engineering productivity via Agent Loops. Covers KDA winning CUDA kernel contests, virtual hardware optimization, and 50% research cost reduction.
Research: Deep dive into the AISTATS 2024 paper MixupMP, which reveals fundamental UQ flaws in Deep Ensembles and fixes them via Mixup augmentation and the Martingale Posterior framework for better calibration and OOD detection.
AI Gaming Showdown: O3 Pro Demonstrate…
Researchers tested major AI models with Tetris, Super Mario, and Sokoban. O3 Pro showed unprecedented planning ability, becoming the only model to clear all levels. Game testing reveals AI's evolution from pattern matching to strategic thinking.
Optimize Anything: One API to Unify Op…
UC Berkeley and Stanford propose Optimize Anything, a universal text optimization framework that unifies optimization of CUDA kernels, agent architectures, and prompts through one declarative API.
Research: Meta reveals Muse Spark technical details: three-dimensional scaling across pre-training, RL, and test-time inference achieves over 10x compute reduction versus Llama 4 Maverick.
Research: GitHub is building a general-purpose accessibility AI Agent to automatically detect and fix software accessibility issues. Explore the technical challenges, human-AI collaboration, and industry impact.
Research: Empirical study of 110K open-source PRs comparing 5 AI coding agents (including GitHub Copilot, Claude Code, and Devin) on merge rates, code survival, and long-term maintainability, revealing a 50% one-year survival rate for AI-written code.