Discussed on HN. When you ask Claude, GPT-4, and Gemini the same factual question, they often disagree — sometimes strongly. This is a real problem for anyone building AI-powered products that need to be reliable. Consensus isn't a guarantee of correctness.