) model that allows us not only to interact with the vision data via a language interface (visual QA), tackle... is also expected to make fundamental algorithmic contributions in building foundational models, contribute to open-source software...
King's College London