TRECVID Task on Medical Video Question Answering

MedVidQA 2023

Introduction

The recent surge in the availability of online videos has changed the way of acquiring information and knowledge. Many people prefer instructional videos to teach or learn how to accomplish a particular task with a series of step-by-step procedures in an effective and efficient manner. In a similar way, medical instructional videos are more suitable and beneficial for delivering key information through visual and verbal communication to consumers' healthcare questions that demand instruction. With an aim to provide visual instructional answers to consumers' first aid, medical emergency, and medical educational questions, this TRECVID task on medical video question answering will introduce a new challenge to foster research toward designing systems that can understand medical videos to provide visual answers to natural language questions and equipped with the multimodal capability to generate instructional questions from the medical video. Following the success of the 1st MedVidQA shared task in the BioNLP workshop at ACL 2022, MedVidQA 2023 at TRECVID expanded the tasks and introduced a new track considering language-video understanding and generation. This track is comprised of two main tasks Video Corpus Visual Answer Localization (VCVAL) and Medical Instructional Question Generation (MIQG).

News

Important Dates

Join our Google Group for important updates! If you have any questions, ask in our Google Group or email us.

Registration and Submission

Tasks

For more details, please see our data description paper.

Datasets

Evaluation Metrics

Organizers

Deepak Gupta NLM, NIH
Dina Demner-Fushman NLM, NIH

References

[1] Deepak Gupta, Kush Attal, and Dina Demner-Fushman. A Dataset for Medical Instructional Video Classification and Question Answering, Sci Data 10, 158 (2023)
[2] Sparck Jones. Report on the need for and provision of an" ideal" information retrieval test collection. 1975.
[3] Chris Buckley, Darrin Dimmick, Ian Soboroff, and Ellen Voorhees. Bias and the limits of pooling for large collections. Information retrieval, 10(6):491–508, 2007
[4] Deepak Gupta, and Dina Demner-Fushman. "Overview of the MedVidQA 2022 Shared Task on Medical Video Question-Answering." BioNLP 2022@ ACL 2022 (2022): 264.
[5] Papineni, Kishore, et al. "Bleu: a method for automatic evaluation of machine translation." Proceedings of the 40th annual meeting of the Association for Computational Linguistics. 2002. Vision, pages 2630–2640, 2019.
[6] Lin, Chin-Yew. "Rouge: A package for automatic evaluation of summaries." Text summarization branches out. 2004.
[7] Zhang, Tianyi, et al. "BERTScore: Evaluating Text Generation with BERT." International Conference on Learning Representations. 2019.