Microsoft

FlashAttention-3, the latest release in the FlashAttention series, has been designed to address the inherent bottlenecks of the attention layer in Transformer architectures. These bottlenecks are crucial for the performance of large language models (LLMs) and...

One of the emerging challenges in artificial intelligence is whether next-token prediction can truly model human intelligence, particularly in planning and reasoning. Despite its extensive application in modern language models, this method might be inherently limited...

Microsoft AI Research Proposes eXtensible Prompt (X-Prompt) for Prompting a Large Language Model (LLM) Beyond Natural Language (NL)

Due to their capacity to produce text comparable to human-written material and their versatility in various natural language processing (NLP) applications, large language models...

Microsoft AI Releases NTREX-128: A New Data Set for Machine Translation (MT) Evaluation from English into a Total of 128 Target Languages

Multilingual Neural Machine Translation (MNMT) reduces deployment costs by allowing a single system to translate sentences between several source and target languages. To gauge the...

This Artificial Intelligence (AI) Research Improves both the Lip-Sync and Rendering Quality of Talking Face Generation by Alleviating the one-to-many Mapping Challenge with Memories

Using talking face creation, it is possible to create lifelike video portraits of a target individual that correspond to the speech content. Given that...

Microsoft’s New AI Model, VALL-E, Can Generate Speech From Text Using Only A Three-Second Audio Sample

Through the advancement of neural networks and end-to-end modeling, the field of voice synthesis has made significant strides during the past ten years. Currently,...

Meet DeepLSD: A Generic Line Detector that Combines the Robustness of Deep Learning with the Accuracy of Handcrafted Detectors

In surroundings humans have created, line segments are common and efficiently convey the underlying picture structure. They complement feature points nicely because of their...

Meet ReCo: An AI Extension for Diffusion Models to Enable Region Control

Large-scale text-to-image models, looking at you Stable Diffusion, have dominated the machine learning space in recent months. They have shown extraordinary generation performance in...

Microsoft AI Research Introduces E5 Model Trained in a Contrastive Manner with Weak Supervision Signals

In the latest research, Microsoft researchers developed an E5 model designed for general-purpose text embeddings. Text embeddings, which are arbitrary-length text representations in the...

IOM Releases Its Second Synthetic Dataset From Trafficking Victim Case Records Generated With Differential Privacy And AI From Microsoft

Researchers at Microsoft are committed to researching ways technology may help the world's most marginalized peoples improve their human rights situations. Their expertise spans...

Researchers From Stanford And Microsoft Have Proposed An Artificial Intelligence (AI) Approach That Uses Declarative Statements As Corrective Feedback For Neural Models With Bugs

The methods currently used to correct systematic issues in NLP models are either fragile or time-consuming and prone to shortcuts. Humans, on the other...

Meet Diffusion-GAN: A Novel GAN Framework That Leverages A Forward Diffusion Chain To Generate Gaussian-Mixture Distributed Instance Noise

Generative Adversarial Networks (or just GANs) have been widely used to generate synthetic data for different applications in recent years. The most commonly considered...

Researchers From Microsoft and TUDelft Propose An Artificial Intelligence (AI) Based Approach That Creates Synthetic Expression-Based Face Wrinkles

Synthetic data has frequently been used for a range of computer vision tasks, such as object identification, scene comprehension, eye tracking, hand tracking, and...

Microsoft AI Proposes ‘FocalNets’ Where Self-Attention is Completely Replaced by a Focal Modulation Module, Enabling To Build New Computer Vision Systems For high-Resolution Visual...

Human eyes let us see objects at both fine and coarse levels by quickly adjusting their focal points, allowing us to observe our surroundings from...

NuminaMath 7B TIR Released: Transforming Mathematical Problem-Solving with Advanced Tool-Integrated Reasoning and Python REPL...

Numina has announced the release of its latest model, NuminaMath 7B TIR. This advanced language model is designed specifically for solving mathematical problems. The...

Tsinghua University Open Sources CodeGeeX4-ALL-9B: A Groundbreaking Multilingual Code Generation Model Outperforming Major Competitors...

In a significant leap forward for the field of code generation, the Knowledge Engineering Group (KEG) and Data Mining team at Tsinghua University have...

InternLM2.5-7B-Chat: Open Sourcing Large Language Models with Unmatched Reasoning, Long-Context Handling, and Enhanced Tool...

InternLM has unveiled its latest advancement in open large language models, the InternLM2.5-7B-Chat, available in GGUF format. This model is compatible with llama.cpp, an...

Jina AI Releases Jina Reranker v2: A Multilingual Model for RAG and Retrieval with...

Jina AI has released the Jina Reranker v2 (jina-reranker-v2-base-multilingual), an advanced transformer-based model fine-tuned for text reranking tasks. This model is designed to significantly...

Google Releases Gemma 2 Series Models: Advanced LLM Models in 9B and 27B Sizes...

Google has unveiled two new models in its Gemma 2 series: the 27B and 9B. These models showcase significant advancements in AI language processing,...


šŸ FREE AI Courses on RAG + Deployment of an Healthcare AI App + LangChain Colab Notebook all included

X