RAG
MMClima: A Framework for Multimodal Climate Science Data and Evaluation
MMClima is a new multimodal climate question answering framework that includes over 104,000 expert-validated question-answer pairs derived from articles, video transcriptions, and figures across five climate science domains. It features automated claim extraction and human validation, benchmarked against state-of-the-art multimodal language models. The release includes the dataset, evaluation pipeline, and fine-tuned model weights (mmclima-70b-txt), which surpass existing models in textual QA, providing practitioners with essential resources for developing AI systems that reason across diverse climate-related content.
climatemultimodalqa