Creating a word vocabulary for the captions