AI, LLMReasoning vs. Recitation: The Limits of LLMs in New WorldsLLMs struggle with tasks like base-9 arithmetic & unconventional chess. Read about counterfactual evaluation.musing.oneJuly 16, 2024