Explaining Digital Humanities by Aligning Images and Textual Descriptions