The application is a working Google Chrome extension that generates a caption describing the contents of an image. The captioning is done by a deep learning network running on a cloud server (Heroku). The generated caption is then converted into speech with JavaScript’s Web Speech API (SpeechSynthesisUtterance).
Project link: https://github.com/ArjunKrishnak/image-caption-generator
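
The caption-to-speech step described above could look roughly like the following. This is only a minimal sketch, not the project's actual code: the function name `speakCaption`, the default language `en-US`, and the local `UtteranceLike` and `SynthLike` interfaces are assumptions made for illustration. The only browser features it relies on are `speechSynthesis` and `SpeechSynthesisUtterance` from the Web Speech API.

```typescript
// Minimal shapes of the browser objects used below, declared locally so the
// sketch type-checks even without DOM typings (illustrative names, not real API types).
interface UtteranceLike {
  text: string;
  lang: string;
  rate: number;
}

interface SynthLike {
  speak(utterance: UtteranceLike): void;
  cancel(): void;
}

// Hypothetical helper: reads a caption aloud with the Web Speech API.
// Returns true if speech was queued, false if there is nothing to say
// or the environment has no speech synthesis support.
function speakCaption(caption: string, lang: string = "en-US"): boolean {
  const text = caption.trim();
  if (text.length === 0) {
    return false;
  }

  const g = globalThis as any;
  if (
    typeof g.speechSynthesis === "undefined" ||
    typeof g.SpeechSynthesisUtterance === "undefined"
  ) {
    return false;
  }

  const synth = g.speechSynthesis as SynthLike;
  const utterance = new g.SpeechSynthesisUtterance(text) as UtteranceLike;
  utterance.lang = lang; // assumed default; the source does not name a language
  utterance.rate = 1.0;  // normal speaking rate

  synth.cancel();        // stop any caption that is still being read
  synth.speak(utterance);
  return true;
}
```

In an extension, a function like this would presumably be called with the caption string returned by the server, for example `speakCaption("a dog running on grass")`. Returning a boolean instead of throwing lets the caller fall back to showing the caption as text when speech is unavailable.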