Cider: Consensus-based image description evaluation. Vedantam R, Lawrence C. R Vedantam, C Lawrence Zitnick… - Proceedings of the …, 2015 - openaccess.thecvf.com GSID: nV0tTEuFsiEJ
From image descriptions to visual denotations: New similarity metrics for semantic inference over event descriptions. Lai A, Hodosh M. P Young, A Lai, M Hodosh… - Transactions of the …, 2014 - direct.mit.edu GSID: yosklvQV--8J
Deep visual-semantic alignments for generating image descriptions. [No authors listed] GSID: e_amWi47aREJ
Grad-cam: Visual explanations from deep networks via gradient-based localization. [No authors listed] GSID: Ad045yn4zKMJ