University College London

Recent video-language models' (VidLMs) performance on various video-language tasks has been outstanding. Such multimodal models only come with drawbacks. For example, it is shown that vision-language models have difficulty understanding compositional and order relations in images,...
Language gives humans an extraordinary level of general intellect and sets them apart from all other creatures. Importantly, language not only helps people interact with others better, but it also improves our capacity to think. Before...

Facebook AI Open-Sources CO3D (Common Objects in 3D) Data Set For 3D Reconstruction in Computer Vision Research

3D object reconstruction is a significant computer vision problem with AR/VR technology applications, such as telepresence and the generation of 3D models for gaming....

NVIDIA And King’s College London Uses Cambridge-1 To build AI Models To Generate Synthetic Brain Images

NVIDIA and King's College London have revealed new information about one of the first projects to be run on Cambridge-1, the UK's most powerful...

DeepMind and University College London Introduce Alchemy, A Novel Open-Source Benchmark For Meta-Reinforcement learning (RL) Research

Alchemy, a novel open-source benchmark for meta Reinforcement learning (RL) in the recent decade, has garnered much attention in the ML field. The RL...

Recent articles

Be the first to know the latest AI research breakthroughs.

X