Computer Vision

Google AI Introduces ‘LIMoE’: One Of The First Large-Scale Architecture That Processes Both Images And Text Using A Sparse Mixture Of Experts

This Article is written as a summay by Marktechpost Staff based on the paper 'Multimodal Contrastive Learning with LIMoE: the Language-Image Mixture of Experts'....

Chung-Ang University Researchers Developed SphereGAN: A Simple And Effective Generative Adversarial Network For Unsupervised Image Generation

This Article is written as a summay by Marktechpost Staff based on the paper 'SphereGAN: Sphere Generative Adversarial Network Based on Geometric Moment Matching...

Meet ‘VALHALLA’, a Machine Learning Method That can Hallucinate an Image of Written Words, and Then Use It to Help Translate The Text into...

This Article is written as a summay by Marktechpost Staff based on the paper 'VALHALLA: Visual Hallucination for Machine Translation'. All Credit For This...

Researchers From China Introduce Vision GNN (ViG): A Graph Neural Network For Computer Vision Systems

This Article is written as a summay by Marktechpost Staff based on the research paper 'Vision GNN: An Image is Worth Graph of Nodes'....

Researchers Introduce ruDALL-E For Generating Images from Text In Russia

This Article is written as a summay by Marktechpost Staff based on the Research article 'ruDALL-E: Generating Images from Text. Facing down the biggest...

Salesforce AI Research Propose ‘ALPRO’: A New Video-And-Language Representation Learning (Pre-Training) Framework

This Article is written as a summay by Marktechpost Staff based on the Research Paper 'Align and Prompt: Video-and-Language Pre-training with Entity Prompts'. All...

Deepfake AI Projects Are No Longer Permitted On Google Colab

On its Google Colaboratory platform, Google has restricted the training of AI systems that can produce deepfakes. Deepfakes-related work is included on the forbidden...

AI Researchers From PRIOR@AI2 Release ‘GRIT’: A General Robust Image Task Benchmark For Evaluating Computer Vision Model’s Performace

Most computer vision (CV) models are trained and assessed on a small number of concepts and with a strong assumption that the images and...

NVIDIA’s Cambridge-1 Supercomputer And MONAI Were Utilized By Researchers At King’s College London To Develop Open-Source Synthetic Brain Pictures, Which Will Help To Speed...

This Article is written as a summay by Marktechpost Staff based on the Research Article 'The Man With 100,000 Brains: AI’s Big Donation to...

Researchers Propose a Novel Framework ‘FaceMAE’, Where the Face Privacy and Recognition Performance are Considered Simultaneously

This Article is written as a summay by Marktechpost Staff based on the Research Paper 'FaceMAE: Privacy-Preserving Face Recognition via Masked Autoencoders'. All Credit For...

Google AI Proposes Contrastive Captioner (CoCa): A Novel Encoder-Decoder Model That Simultaneously Produces Aligned Unimodal Image And Text Embeddings

Machine learning (ML) model developers frequently start with a basic backbone model that has been trained at scale and can be applied to a...

This Machine Learning Startup, ‘SiMa.ai’, is Developing a System-on-Chip Platform for AI Applications

The demand for energy-efficient AI acceleration hardware with cheap capital costs is growing as AI advances in every discipline. Adding ML to existing products...

Recent articles

Check Out Our Super Cool AI Research Newsletter While It's Still Free

X