MAI@Home

1st Multimodal AI Reasoning Challenge @Home


News

2024/9/10 This year's website is now available. This year's challenge is called "Multimodal AI Reasoning Challenge @Home (MAI@Home) 2025. For details, please see the application guidelines.

This website is underconstruction. Please refer the paper for the further information on dataset and rules of this challenge.

Abstract

As Embodied AI continues to evolve, understanding the spatiotemporal context of human actions in daily life is crucial for enabling robots to interact effectively with their environment. Datasets and benchmarks have been developed to facilitate research in this area, but many existing datasets suffer from limitations in data quality and annotation precision.

To address these challenges, we have created the Multimodal Dataset of Daily Life (MMDL) using a 3D VirtualHome-AIST simulator. MMDL offers a rich set of multimodal data, including:

  • High-quality video data: Realistic simulations of everyday life activities.
  • Event-centric spatiotemporal Knowledge Graphs: Detailed descriptions of events, locations, and objects in the environment.
  • Consistent annotations: Mechanically generated annotations using a standardized vocabulary, ensuring consistency and reducing human error.

To further evaluate the capabilities of Embodied AI systems, we have also developed the Multimodal Question Answering Dataset of Daily Life (MMQADL). This dataset contains a variety of questions designed to test a system's ability to understand and reason about human behavior in complex real-world scenarios.

The goal of this challenge is to encourage the development of Embodied AI systems that can effectively comprehend and respond to human interactions in everyday settings. By participating in this challenge, researchers can benchmark their models against state-of-the-art techniques and contribute to advancing the field of Embodied AI.

Schedule (Tentative)

2024/9/13 Challenge Website now open
2025/5/1 0:00 Challenge Application Deadline
2025/5/19 (Hybrid) Final Presentation and Awards Ceremony colocate with ICRA 2025

X Posts

Related Link

LOD Challenge2022

Acknowledgement

This work is based on results obtained from a project, JPNP20006 and JPNP180013, commissioned by the New Energy and Industrial Technology Development Organization (NEDO). And supported by JSPS Grant for Scientific Research 19H04168

Contact

kgrc@knowledge-graph.jp

Special Event Committee Member, Semantic Web and Ontology Special Interest Group, Japanese Society for Artificial Intelligence)