Inspecting state of the art performance and NLP metrics in image-based medical report generation | Zendy

Pablo Pino | Zendy; Denis Parra | Zendy; Cecilia Besa | Zendy; Sergio Uribe | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

Inspecting state of the art performance and NLP metrics in image-based medical report generation

Author(s) -

Pablo Pino,

Denis Parra,

Cecilia Besa,

Sergio Uribe

Publication year - 2020

Language(s) - English

Resource type - Conference proceedings

DOI - 10.52591/lxai202012128

Subject(s) - computer science , task (project management) , artificial intelligence , natural language processing , contrast (vision) , deep learning , simple (philosophy) , machine learning , philosophy , management , epistemology , economics

Several deep learning architectures have been proposed over the last years to deal with the task of generating a written report given an imaging exam as input. Most works evaluate the generated reports using standard Natural Language Processing (NLP) metrics (e.g. BLEU, ROUGE), reporting significant progress. This article contrast this progress by comparing state of the art (SOTA) models against weak baselines. We show that simple and even naive approaches yield near SOTA performance on most traditional NLP metrics. We conclude that evaluation methods in this task should be further studied towards correctly measuring clinical accuracy, involving physicians to contribute to this end.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research