Author: Adnan Hassan

Adnan Hassan
321 POSTS0 COMMENTS
Hello, My name is Adnan Hassan. I am a consulting intern at Marktechpost and soon to be a management trainee at American Express. I am currently pursuing a dual degree at the Indian Institute of Technology, Kharagpur. I am passionate about technology and want to create new products that make a difference.

Researchers from China Introduce ControlLLM: An Artificial Intelligence Framework that Enables Large Language Models (LLMs) to Utilize Multi-Modal Tools for Solving Complex Real-World Task

The performance of LLMs in handling complex real-world tasks is impressive. However, there are cases where they may require assistance in using tools correctly...

Robots Get a ‘Gripping’ Upgrade: AO-Grasp Teaches Bots the Art of Not Dropping Your Stuff!

In recent years, robots have found increased usage in various industries, from manufacturing to healthcare. However, their effectiveness in carrying out tasks largely depends...

Researchers from the University of Michigan Chart New Territory in AI’s Theory of Mind: Unveiling a Taxonomy and Rigorous Protocols for Evaluation

A team of researchers from the University of Michigan advocates developing new benchmarks and evaluation protocols to assess the Theory of Mind (ToM) capability...

Meet FANToM: A Benchmark for Stress-testing Machine Theory of Mind in Interactions

In conversational AI, evaluating the Theory of Mind (ToM) through question-answering has become an essential benchmark. However, passive narratives need to improve in assessing...

Meet FreeNoise: A New Artificial Intelligence Method that can Generate Longer Videos with up to 512 Frames from Multiple Text Prompts

FreeNoise is introduced by researchers as a method to generate longer videos conditioned on multiple texts, overcoming limitations in existing video generation models. It...

Bridging AI and IMO Challenges: A Breakthrough in Formal Plane Geometry Systems

Through diligent effort and unwavering commitment, researchers embark on a multi-year journey to create a comprehensive formal planar geometry system to bridge the gap...

Beyond Fact or Fiction: Evaluating the Advanced Fact-Checking Capabilities of Large Language Models like GPT-4

Researchers from the University of Zurich focus on the role of Large Language Models (LLMs) like GPT-4 in autonomous fact-checking, evaluating their ability to...

Enhancing Factuality in AI: This AI Research Introduces Self-RAG for More Accurate and Reflective Language Models

Self-Reflective Retrieval-Augmented Generation (SELF-RAG) is a framework that enhances large language models (LLMs) by dynamically retrieving relevant information and reflecting on its generations. This...

Meet Davidsonian Scene Graph: A Revolutionary AI Framework for Assessing Text-to-Image AI with Precision

Text-to-image (T2I) models are difficult to evaluate and often rely on question generation and answering (QG/A) methods to assess text-image faithfulness. However, current QG/A...

Deciphering the Math in Images: How the New MathVista Benchmark is Pushing AI Boundaries in Visual and Mathematical Reasoning

MATHVISTA is introduced as a benchmark to assess the mathematical reasoning abilities of Large Language Models (LLMs) and Large Multimodal Models (LMMs) within visual...

This AI Paper Unlocks the Secret of In-Context Learning: How Language Models Encode Functions into Vector Magic

In autoregressive transformer language models, a neural mechanism is identified that represents an input-output function as a compact vector known as a function vector...

Researchers from China Propose ALCUNA: A Groundbreaking Artificial Intelligence Benchmark for Evaluating Large-Scale Language Models on New Knowledge Integration

Evaluating large-scale language models (LLMs) in handling new knowledge is challenging. Researchers from Peking University introduced KnowGen, a method to generate new knowledge by...

🐝 FREE AI Courses on RAG + Deployment of an Healthcare AI App + LangChain Colab Notebook all included

X