Visually Grounded Models of Spoken Language: A Survey of Datasets, Architectures and Evaluation Techniques

From MaRDI portal

Publication:5076318

Jump to:navigation, search

DOI10.1613/JAIR.1.12967OpenAlexW3157861865WikidataQ130888494 ScholiaQ130888494MaRDI QIDQ5076318

Grzegorz Chrupała

Publication date: 16 May 2022

Published in: Journal of Artificial Intelligence Research (Search for Journal in Brave)

Full work available at URL: https://arxiv.org/abs/2104.13225

zbMATH Keywords

natural language speech processing machine learning vision

Mathematics Subject Classification ID

Artificial intelligence (68Txx)

Uses Software

Cites Work

RxR

This page was built for publication: Visually Grounded Models of Spoken Language: A Survey of Datasets, Architectures and Evaluation Techniques

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:5076318&oldid=19573421"