Why do different AI models give different answers to the same factual question?