Visually Grounded Models of Spoken Language: A Survey of Datasets, Architectures and Evaluation Techniques

From MaRDI portal
Publication:5076318