Carnegie Mellon University

Researchers From Microsoft and CMU Introduce ‘COMPASS’: A General-Purpose Large-Scale Pretraining Pipeline For Perception-Action Loops in Autonomous Systems

Humans have the essential cognitive capacity to comprehend the world via multimodal sensory inputs and use this ability to perform a wide range of...

A New Study from CMU and Bosch Center for AI Demonstrated a New Transformer Paradigm in Computer Vision

After leveraging Convolutional Neural Network (CNN) for many years, since the advent of Transformers in Natural Language Processing (NLP), the computer vision community has...

CMU’s Latest Machine Learning Research Analyzes and Improves Spectral Normalization In GANs

GANs (generative adversarial networks) are cutting-edge deep generative models that are best known for producing high-resolution, photorealistic photographs. The goal of GANs is to...

Latest CMU Research Improves Reinforcement Learning With Lookahead Policy: Learning Off-Policy with Online Planning

Reinforcement learning (RL) is a technique that allows artificial agents to learn new tasks by interacting with their surroundings. Because of their capacity to...

CMU Researchers Propose A Computer Vision-Based Approach With Data-Frugal Deep Learning To Optimize Microstructure Imaging

Materials processing is the process of turning raw materials into final items through a sequence of phases or "unit operations." The activities entail a...

Meta AI and CMU Researchers Present ‘BANMo’: A New Neural Network-Based Method To Build Animatable 3D Models From Videos

Previous work on articulated 3D shape reconstruction has frequently relied on specialized sensors (e.g., synchronized multi-camera systems) or pre-built 3D deformable models (e.g., SMAL...

AI Researchers Propose ‘GANgealing’: A GAN-Supervised Algorithm That Learns Transformations of Input Images to Bring Them into Better Joint Alignment

The correspondence problem of visual alignment is one that computer vision algorithms must solve for many different applications.It's considered a critical element in Optical...

Researchers Develop A Unified Framework For Evaluating Natural Language Generation (NLG)

Natural language generation (NLG) is a broad term that encompasses a variety of tasks that generate fluent text from input data and other contextual...

CMU AI Researchers Present A New Study To Achieve Fairness and Accuracy in Machine Learning Systems For Public Policy

The rapid rise in machine learning applications in criminal justice, hiring, healthcare, and social service intentions substantially impacts society. These wide applications have heightened...

CMU Researchers Introduce ‘CatGym’, A Deep Reinforcement Learning (DRL) Environment For Predicting Kinetic Pathways To Surface Reconstruction in a Ternary Alloy

It isn't an easy task to design efficient new catalysts. In the case of multiple element mixtures, for example - researchers must take into...

CMU and MIT AI Researchers Present A New Method To Sketch Your Own GAN With A Pencil

Sketching is the most universally accessible way to convey a visual concept. In contrast, creating GAN models has traditionally required knowledge in deep learning...

Researchers at Facebook AI, UC Berkeley, and Carnegie Mellon University Announced Rapid Motor Adaptation (RMA), An Artificial Intelligence (AI) Technique Which Enables Legged Robots...

To achieve success in the real world, walking robots must adapt to whatever surfaces they encounter, objects they carry, and conditions they are in,...

NuminaMath 7B TIR Released: Transforming Mathematical Problem-Solving with Advanced Tool-Integrated Reasoning and Python REPL...

0
Numina has announced the release of its latest model, NuminaMath 7B TIR. This advanced language model is designed specifically for solving mathematical problems. The...

Tsinghua University Open Sources CodeGeeX4-ALL-9B: A Groundbreaking Multilingual Code Generation Model Outperforming Major Competitors...

0
In a significant leap forward for the field of code generation, the Knowledge Engineering Group (KEG) and Data Mining team at Tsinghua University have...

InternLM2.5-7B-Chat: Open Sourcing Large Language Models with Unmatched Reasoning, Long-Context Handling, and Enhanced Tool...

0
InternLM has unveiled its latest advancement in open large language models, the InternLM2.5-7B-Chat, available in GGUF format. This model is compatible with llama.cpp, an...

Jina AI Releases Jina Reranker v2: A Multilingual Model for RAG and Retrieval with...

0
Jina AI has released the Jina Reranker v2 (jina-reranker-v2-base-multilingual), an advanced transformer-based model fine-tuned for text reranking tasks. This model is designed to significantly...

Google Releases Gemma 2 Series Models: Advanced LLM Models in 9B and 27B Sizes...

0
Google has unveiled two new models in its Gemma 2 series: the 27B and 9B. These models showcase significant advancements in AI language processing,...

Recent articles

🐝 FREE AI Courses on RAG + Deployment of an Healthcare AI App + LangChain Colab Notebook all included

X