LG

FlashAttention-3, the latest release in the FlashAttention series, has been designed to address the inherent bottlenecks of the attention layer in Transformer architectures. These bottlenecks are crucial for the performance of large language models (LLMs) and...
One of the emerging challenges in artificial intelligence is whether next-token prediction can truly model human intelligence, particularly in planning and reasoning. Despite its extensive application in modern language models, this method might be inherently limited...

Researchers From the University of Toronto and LG AI Research Develop ‘Explainable’ Artificial Intelligence (AI) Algorithm

A team of researchers from the University of Toronto and LG AI Research have developed an explainable artificial intelligence (XAI) algorithm. The algorithm can...

NuminaMath 7B TIR Released: Transforming Mathematical Problem-Solving with Advanced Tool-Integrated Reasoning and Python REPL...

0
Numina has announced the release of its latest model, NuminaMath 7B TIR. This advanced language model is designed specifically for solving mathematical problems. The...

Tsinghua University Open Sources CodeGeeX4-ALL-9B: A Groundbreaking Multilingual Code Generation Model Outperforming Major Competitors...

0
In a significant leap forward for the field of code generation, the Knowledge Engineering Group (KEG) and Data Mining team at Tsinghua University have...

InternLM2.5-7B-Chat: Open Sourcing Large Language Models with Unmatched Reasoning, Long-Context Handling, and Enhanced Tool...

0
InternLM has unveiled its latest advancement in open large language models, the InternLM2.5-7B-Chat, available in GGUF format. This model is compatible with llama.cpp, an...

Jina AI Releases Jina Reranker v2: A Multilingual Model for RAG and Retrieval with...

0
Jina AI has released the Jina Reranker v2 (jina-reranker-v2-base-multilingual), an advanced transformer-based model fine-tuned for text reranking tasks. This model is designed to significantly...

Google Releases Gemma 2 Series Models: Advanced LLM Models in 9B and 27B Sizes...

0
Google has unveiled two new models in its Gemma 2 series: the 27B and 9B. These models showcase significant advancements in AI language processing,...

Recent articles

🐝 FREE AI Courses on RAG + Deployment of an Healthcare AI App + LangChain Colab Notebook all included

X