1. Aditya, S., Yang, Y., Baral, C., Aloimonos, Y., 2016. Answering image riddles using vision and reasoning through probabilistic soft logic. arXiv preprint arXiv:1611.05896.
2. Active vision;Aloimonos;Int. J. Comput. Vis.,1988
3. Spice: semantic propositional image caption evaluation;Anderson,2016
4. Vqa: Visual question answering;Antol,2015
5. Hinge-loss markov random fields: convex inference for structured prediction;Bach,2013