The Difference Between Real-Time Retrieval and Training Data in LLM Citations
Every major LLM citation comes from one of two knowledge pathways: parametric knowledge learned during pre-training on static datasets, or retrieval-augmented generation that pulls live web content at query time. The ratio of parametric to retrieved citations varies dramatically by platform. ChatGPT answers approximately 60% of queries from parametric knowledge alone. Perplexity triggers live retrieval for nearly every query. Google AI Overviews use Google's continuously updated index. Treating all AI platforms as equivalent citation targets ignores the foundational architectural difference that determines what optimization works where.
How Real-Time…