Visually Grounded Models of Spoken Language: A Survey of Datasets, Architectures and Evaluation Techniques (Q5076318)

From MaRDI portal
Revision as of 15:50, 30 December 2024 by Import241228121245 (talk | contribs) (Normalize DOI.)

scientific article; zbMATH DE number 7527537
Language: English
Label: Visually Grounded Models of Spoken Language: A Survey of Datasets, Architectures and Evaluation Techniques
Description: scientific article; zbMATH DE number 7527537

    Statements

    Visually Grounded Models of Spoken Language: A Survey of Datasets, Architectures and Evaluation Techniques (English)
    16 May 2022
    natural language
    vision
    speech processing
    machine learning

    Identifiers