4437 |
|
4319 |
Defoiling Foiled Image Captions
|
4243 |
Object Counts! Bringing Explicit Detections Back into Image Captioning
|
3973 |
Probing the Need for Visual Context in Multimodal Machine Translation
|
2150 |
Learning Simplifications for Specific Target Audiences
|
1826 |
EASSE: Easier Automatic Sentence Simplification Evaluation
|
1238 |
Deep Copycat Networks for Text-to-Text Generation
|
800 |
VIFIDEL: Evaluating the Visual Fidelity of Image Descriptions
|
795 |
Distilling Translations with Visual Awareness
|