Automatic Short Answer Grading in Health Sciences with ChatGPT

LIBRO COMPLETO: The Education Revolution through Artificial Intelligence

CAPÍTULO 15

Automatic Short Answer Grading in Health Sciences with ChatGPT

FICHA TÉCNICA

Fecha de publicación:
04/11/2024

Doi del capítulo:

10.36006/09651-1-15

Título del libro: The Education Revolution through Artificial Intelligence

URL del libro:

https://octaedro.com/libro/the-education-revolution-through-artificial-intelligence/

ISBN del libro: 9788410282582

DOI del libro:

10.36006/09651-1

Automatic Short Answer Grading in Health Sciences with ChatGPT

Enlace de descarga

Abstract

Artificial Intelligence (AI) has emerged as a transformative tool in education, notably in facilitating automated exam grading. This study focuses on Automatic Short Answer Grading (ASAG) via ChatGPT-4, a widely accessible and versatile general-purpose generative AI model. We compare the grading outcomes from ChatGPT with those adjudicated by human evaluators within the health science domain. An evaluative framework was deployed to gauge the GPT-4 model’s concordance with an expert educator’s scoring. Human scores were compared to those offered by ChatGPT with different versions of prompts, specifically with 10 examples, 25 examples, and a grading rubric, employing a scoring metric that spans from 0 to 10 points, allowing for decimal values, without any model fine-tuning or parameter modulation. Our findings show that rubrics markedly enhance score alignment with an educator’s evaluative benchmarks, registering intraclass correlation coefficients surpassing 0.8, thus nearly mirroring human judgment. These results suggest that there is ample scope for increasing the effectiveness of ASAG using Large Language Models (LLM) such as ChatGPT. However, it is imperative to recognize that the operability of these systems is not yet fully reliable and stable, making human supervision necessary. The integration of expert supervision ensures both the accuracy and pedagogical validity of these automated tools.

Palabras clave

AI, ASAG, ChatGPT, health sciences, large language models

Autores

PhD Nuria Padros-Flores
Universidad Miguel Hernández, Spain
npadros@umh.es
https://orcid.org/0000-0001-5206-8857

Ivan Gadea Sáez
Universidad de Alicante, Spain
ivan.gadea@ua.es
https://orcid.org/0009-0009-2290-3220

PhD Carolina Alonso-Montero
Universidad Miguel Hernández, Spain
c.alonso@umh.es
https://orcid.org/0000-0002-8856-1907

Abstract

Palabras clave

Autores

Cómo citar

Referencias bibliográficas

Recibe nuestras novedades en tu email