Language Model

Researchers from Datategy SAS in France and Math & AI Institute in Turkey propose one potential direction for the recently emerging multi-modal architectures. The central idea of their study is that well-studied Named Entity Recognition (NER)...

Large Language Models (LLMs) are at the forefront of Artificial Intelligence (AI) and show great promise to surpass human skills in this quickly changing field. But when these models get closer to superhuman capabilities, assessing them...

What Should You Choose Between Retrieval Augmented Generation (RAG) And Fine-Tuning?

Recent months have seen a significant rise in the popularity of Large Language Models (LLMs). Based on the strengths of Natural Language Processing, Natural...

Researchers from Microsoft Research and Georgia Tech Unveil Statistical Boundaries of Hallucinations in Language Models

A key issue that has recently surfaced is the high rate at which Language Models (LMs) provide erroneous information, including references...

Alibaba AI Open-Sources Qwen Series that Includes Qwen-1.8B, Qwen-7B, Qwen-14B, and Qwen-72B along with Qwen-Chat Series

With the most recent models in its Qwen series of open-source AI models, Alibaba Cloud is pushing the boundaries of AI technology even further....

Meet Ego-Exo4D: A Foundational Dataset and Benchmark Suite to Support Research on Video Learning and Multimodal Perception

Today, AI finds its application in almost every field imaginable. It has definitely transformed our lives, streamlining processes and enhancing efficiency in ways we...

Tencent AI Lab Introduces GPT4Video: A Unified Multimodal Large Language Model for Instruction-Followed Understanding and Safety-Aware Generation

The problem of video understanding and generation scenarios has been addressed by researchers of Tencent AI Lab and The University of Sydney by presenting...

Google AI Research Presents Translatotron 3: A Novel Unsupervised Speech-to-Speech Translation Architecture

Speech-to-speech translation (S2ST) has been a transformative technology in breaking down language barriers, but the scarcity of parallel speech data has hindered its progress....

Researchers from Shanghai Artificial Intelligence Laboratory and MIT Unveil Hierarchically Gated Recurrent Neural Network (HGRN): A New Frontier in Efficient Long-Term Dependency Modeling

The Hierarchically Gated Recurrent Neural Network (HGRN) technique developed by researchers from the Shanghai Artificial Intelligence Laboratory and MIT CSAIL addresses the challenge of...

Meet MMMU: A New AI Benchmark for Expert-Level Multimodal Challenges Paving the Path to Artificial General Intelligence

Multimodal pre-training advancements address diverse tasks, exemplified by models like LXMERT, UNITER, VinVL, Oscar, ViLBERT, and VLP. Models such as FLAN-T5, Vicuna, LLaVA, and...

This AI Research Case Study from Microsoft Reveals How Medprompt Enhances GPT-4’s Specialist Capabilities in Medicine and Beyond Without Domain-Specific Training

Microsoft researchers address the challenge of improving GPT-4's ability to answer medical questions without domain-specific training. They introduce Medprompt, which employs different prompting strategies...

UC Berkeley Researchers Introduce Starling-7B: An Open Large Language Model (LLM) Trained by Reinforcement Learning from AI Feedback (RLAIF)

Large Language Models (LLMs) are artificial intelligence models for natural language processing tasks. These models are trained on massive datasets and can understand and...

Meta AI Introduces Seamless: A Publicly Available AI System that Unlocks Expressive Cross-Lingual Communication in Real-Time

New features and improvements in automatic voice translation have made it possible to accomplish much more, cover more languages, and work with more input...

DeepSeek Open-Sources DeepSeek-67B Model: The Latest ChatGPT Rival from China

Chinese AI startup DeepSeek AI has ushered in a new era in large language models (LLMs) by debuting the DeepSeek LLM family. Comprising the...
