Assessing AI’s problem solving in physics: Analyzing reasoning, false positives and negatives through the force concept inventory
This study investigates the performance of GPT-4, an advanced AI model developed by OpenAI, on the force concept inventory (FCI) to evaluate its accuracy, reasoning patterns, and the occurrence of false positives and false negatives. GPT-4 was tasked with answering the FCI questions across multiple sessions. Key findings include GPT-4’s proficiency in several FCI items, particularly those related to Newton’s third law, achieving perfect scores on many items. However, it struggled significantly with questions involving the interpretation of figures and spatial reasoning, resulting in a higher occurrence of false negatives where the reasoning was correct, but the answers were incorrect. Additionally, GPT-4 displayed several conceptual errors, such as misunderstanding the effect of friction and retaining the outdated impetus theory of motion. The study’s findings emphasize the importance of refining AI-driven tools to make them more effective in educational settings. Addressing both AI limitations and common misconceptions in physics can lead to improved educational outcomes.
Aldazharova, S., Issayeva, G., Maxutov, S., & Balta, N. (2024). Assessing AI’s problem solving in physics: Analyzing reasoning, false positives and negatives through the force concept inventory.
Contemporary Educational Technology, 16(4), ep538.
https://doi.org/10.30935/cedtech/15592
Aldazharova, S., Issayeva, G., Maxutov, S., and Balta, N. (2024). Assessing AI’s problem solving in physics: Analyzing reasoning, false positives and negatives through the force concept inventory.
Contemporary Educational Technology, 16(4), ep538.
https://doi.org/10.30935/cedtech/15592
Aldazharova S, Issayeva G, Maxutov S, Balta N. Assessing AI’s problem solving in physics: Analyzing reasoning, false positives and negatives through the force concept inventory.
CONT ED TECHNOLOGY. 2024;16(4), ep538.
https://doi.org/10.30935/cedtech/15592
Aldazharova, Salima, Gulnara Issayeva, Samat Maxutov, and Nuri Balta. "Assessing AI’s problem solving in physics: Analyzing reasoning, false positives and negatives through the force concept inventory".
Contemporary Educational Technology 2024 16 no. 4 (2024): ep538.
https://doi.org/10.30935/cedtech/15592
Aldazharova, Salima et al. "Assessing AI’s problem solving in physics: Analyzing reasoning, false positives and negatives through the force concept inventory".
Contemporary Educational Technology, vol. 16, no. 4, 2024, ep538.
https://doi.org/10.30935/cedtech/15592
Aldazharova S, Issayeva G, Maxutov S, Balta N. Assessing AI’s problem solving in physics: Analyzing reasoning, false positives and negatives through the force concept inventory. CONT ED TECHNOLOGY. 2024;16(4):ep538.
https://doi.org/10.30935/cedtech/15592