Machine Learning

Recent video-language models' (VidLMs) performance on various video-language tasks has been outstanding. Such multimodal models only come with drawbacks. For example, it is shown that vision-language models have difficulty understanding compositional and order relations in images,...
Language gives humans an extraordinary level of general intellect and sets them apart from all other creatures. Importantly, language not only helps people interact with others better, but it also improves our capacity to think. Before...

Exploring AVFormer: Google AI’s Innovative Approach to Augment Audio-Only Models with Visual Information & Streamlined Domain Adaptation

One of the biggest obstacles facing automated speech recognition (ASR) systems is their inability to adapt to novel, unbounded domains. Audiovisual ASR (AV-ASR) is...

Meet STEVE-1: An Instructable Generative AI Model For Minecraft That Follows Both Text And Visual Instructions And Only Costs $60 To Train

Powerful AI models may now be operated and interacted with via language commands, making them widely available and adaptable. Stable Diffusion, which transforms natural...

This AI Paper Proposes A Self-Supervised Music Understanding Model Called MERT That Attains Overall SOTA Performance on 14 MIR Tasks

Self-supervised learning is being prominently used in Artificial Intelligence to develop intelligent systems. The transformer models like BERT and T5 have recently got popular...

Meet mmT5: A Modular Multilingual Sequence-To-Sequence Model That Outperforms mT5

Pre-trained models that speak many languages have performed excellently on natural language interpretation challenges. Large volumes of unlabeled data in hundreds of languages are...

Model Collapse: The Hidden Threat to LLMs and How to Keep AI Rea

With the craze of LLMs, such as widely popular GPT engines, every company, big or small, is in the race to either develop a...

50+ New Cutting-Edge AI Tools (2023)

AI tools are rapidly increasing in development, with new ones being introduced regularly. Check out some AI tools below that can enhance your daily...

Can (Very) Simple Math Informs RLHF For Large Language Models LLMs? This AI Paper Says Yes!

Incorporating human input is a key component of the recent impressive improvements in large language model (LLM) capacities, such as ChatGPT and GPT-4. To...

Meet CREATOR: A Novel AI Framework That Empowers LLMs To Create Their Own Tools Through Documentation And Code Realization

Large language models (LLMs) have made significant strides in recent years, such as GPT-3, Codex, PaLM, LLaMA, ChatGPT, and the more current GPT4. The...

Using An Artificial Intelligence Algorithm, Researchers at MIT and McMaster University have identified a new Antibiotic that can Kill a Type of Bacteria that...

MIT and McMaster University researchers have utilized artificial intelligence (AI) to discover a new antibiotic that effectively kills drug-resistant bacteria, particularly Acinetobacter baumannii, a...

Salesforce AI Research Introduces CodeTF: A One-Stop Transformer Library For Code Large Language Models (CodeLLM)

Over the past few years, AI has caused seismic shifts in the software engineering industry. Basic source code analysis is at the heart of...

Hey AI-Pa! Draw Me a Story: TaleCrafter is an AI Method that can Generate Interactive Visuals for Stories

Generative AI has come a long way recently. We are all familiar with ChatGPT, diffusion models, and more at this point. These tools are...

Google AI Introduces DIDACT For Training Machine Learning ML Models For Software Engineering Activities

Creating software does not happen in one giant leap. Step by step, it becomes better until it's ready to be merged into a code...

Recent articles

Be the first to know the latest AI research breakthroughs.

X