Flickr30K
From MaRDI portal
Cited in (33)
- Boost image captioning with knowledge reasoning
- Supervised Visual Attention for Simultaneous Multimodal Machine Translation
- Computer vision. Algorithms and applications
- scientific article; zbMATH DE number 7370635
- A review of recurrent neural networks: LSTM cells and network architectures
- CIDEr
- DenseCap
- MorphoN
- MultiWOZ
- Im2Text
- I2T
- VQA
- Pixel-BERT
- DIME
- Habitat
- BiT
- DehazeNet
- VQ-Diffusion
- AOD-Net
- CLEVR
- COVAREP
- MDETR
- LXMERT
- MultiBench
- MultiViz
- RUBi
- UNITER
- VideoBERT
- VL-InterpreT
- ViLT
- VisualBERT
- ViLBERT
- Image restoration by learning morphological opening-closing network
This page was built for software: Flickr30K