Apple Publishes Details About New 'MM1' AI Model

Apple researchers have developed a new method for training large language models (LLMs) that seamlessly integrates both text and visual information.

The company's findings, detailed in a research paper titled "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training," showcase a new approach to creating more intelligent and flexible AI systems. Apple claims that, by training on a diverse dataset comprising image-caption pairs, interleaved image-text documents, and text-only data, the MM1 model sets a new standard in AI's ability to perform tasks such as image captioning, visual question answering, and natural language inference with a high level of accuracy.

Apple's research focuses on the combination of different types of training data and model architectures, which enables the AI to understand and generate language based on a mix of visual and linguistic cues. This capability is vital for tasks that require a nuanced comprehension of the world, such as interpreting complex images or answering questions that involve visual elements.

The paper also highlights the MM1 model's exceptional in-context learning abilities, particularly in the largest, 30-billion-parameter configuration of the model. This version apparently exhibits remarkable capabilities for multi-step reasoning over multiple images using few-shot "chain-of-thought" prompting, a technique that allows the AI to perform complex, open-ended problem solving based on minimal examples.

This research emerges as part of Apple's broader initiative to enhance its AI capabilities amid growing competition. Earlier today, Bloomberg's Mark Gurman reported that Apple is in discussions with Google to license Google's Gemini generative large language models to power new features coming to the iPhone as part of iOS 18.
