Visual Genome
Cited in (17)
- LMMS reloaded: transformer-based sense embeddings for disambiguation and beyond
- Global-affine and local-specific generative adversarial network for semantic-guided image generation
- Natural language guided object retrieval in images
- Learning 3D semantic scene graphs with instance embeddings
- OCNet: object context for semantic segmentation
- Handwritten mathematical expression recognition via paired adversarial learning
- CNN-RNN
- BiggerPicture
- AutoExtend
- CLEVR dataset
- VQA
- RGCNN
- YFCC100M
- Robust Plackett-Luce model for \(k\)-ary crowdsourced preferences
- SketchyGAN
- LXMERT
- Supervised Visual Attention for Simultaneous Multimodal Machine Translation
This page was built for software: Visual Genome