Emmanuel gives a step by step tutorial how to build a semantic search engine for text and images, with some coding example. This approach presented extend naturally to other applications such as image and video captioning, reading text from videos, selecting optimal thumbnails and generating code from sketches of websites and so on. There's sort of expressive models in both computer vision and NLP but now it is the time to bring those two worlds together and search for images using text and vice versa.Emmanuel is a Machine Learning Engineer at Stripe, and previous Head of AI at Insight Data Science. He has years of experience going from product ideation to effective implementations. At Insight, he has led over a hundred AI projects from ideation to finished product in a variety of domains including Computer Vision, Natural Language Processing, and Speech Processing.