Saturday, November 29, 2025

Apple Machine Learning Research at NeurIPS 2025


Apple researchers advance AI and ML through fundamental research, and to support the broader research community and help accelerate progress in this field, we share much of this work through publications and engagement at conferences.

Next month, the thirty-ninth annual Conference on Neural Information Processing Systems (NeurIPS) will be held in San Diego, California, with a satellite event also taking place in Mexico City, Mexico. Apple is proud to once again participate in this important event for the community and to support it with our sponsorship.

At the main conference and associated workshops, Apple researchers will present many papers across a variety of topics in ML. As highlighted below, this includes new work advancing privacy-preserving ML, understanding the strengths and limitations of reasoning models, sharing innovative approaches to generative AI, and detailing a principled approach to determining training data mixtures.

NeurIPS attendees will be able to experience demonstrations of Apple's ML research in our booth #1103 during exhibition hours. Apple is also sponsoring and participating in a number of affinity group-hosted events that support underrepresented groups in the ML community. A comprehensive overview of Apple's participation in and contributions to NeurIPS 2025 can be found here, and a number of highlights follow below.

Advancing Privacy-Preserving ML

At Apple, we believe privacy is a fundamental human right, and advancing privacy-preserving techniques in AI and ML is an important area of ongoing research. The work Apple researchers will present at NeurIPS this year includes several papers sharing progress in this area.

Accurately estimating a discrete distribution from samples is a fundamental task in statistical ML. Measuring accuracy by the Kullback-Leibler (KL) divergence error is useful for promoting diversity and smoothness in the estimated distribution, and is important in a range of contexts, including data compression, speech recognition, and language modeling. In the Spotlight paper, Instance-Optimality for Private KL Distribution Estimation, Apple researchers explore how to estimate probability distributions accurately while protecting privacy. The work focuses on instance-optimality – designing algorithms that adapt to each specific dataset and perform nearly as well as the best method for that case. The paper shares new algorithms that achieve this balance both with and without differential privacy, showing that distributions can be estimated accurately under KL error, while mathematically guaranteeing that no single individual's data can be inferred.
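To make the KL-error metric concrete, here is a minimal sketch (not the paper's instance-optimal or private algorithm) of the classical baseline it improves on: estimating a discrete distribution with add-constant smoothing, which keeps every probability positive so the KL divergence stays finite.

```python
import math
import random
from collections import Counter

def smoothed_estimate(samples, k, alpha=1.0):
    """Estimate a k-symbol discrete distribution with add-alpha smoothing.

    Smoothing keeps every estimated probability strictly positive,
    which is what makes the KL error to the true distribution finite.
    """
    counts = Counter(samples)
    n = len(samples)
    return [(counts.get(x, 0) + alpha) / (n + alpha * k) for x in range(k)]

def kl_divergence(p, q):
    """KL(p || q) in nats; assumes q[x] > 0 wherever p[x] > 0."""
    return sum(px * math.log(px / qx) for px, qx in zip(p, q) if px > 0)

# Toy example: estimate a 4-symbol distribution from 2,000 samples.
random.seed(0)
p_true = [0.5, 0.25, 0.15, 0.1]
samples = random.choices(range(4), weights=p_true, k=2000)
p_hat = smoothed_estimate(samples, k=4)
print(kl_divergence(p_true, p_hat))  # prints a small KL error
```

Instance-optimal algorithms aim to adapt the estimator to the dataset at hand rather than using one fixed smoothing constant for all inputs.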

In differential privacy, randomizing which data points are used in computations can amplify privacy, making it harder to link data to an individual. In the Spotlight paper, Privacy Amplification by Random Allocation, Apple researchers analyze a sampling scheme called random allocation, in which an individual's data is used in k steps chosen randomly and uniformly from a sequence (or set) of t steps. The paper provides the first theoretical guarantees and numerical estimation algorithms for this scheme. This allows for better privacy analyses (and hence better privacy-utility tradeoffs) for several important algorithms, such as popular variants of differentially private SGD and algorithms for efficient secure aggregation, such as those presented in PREAMBLE: Private and Efficient Aggregation via Block Sparse Vectors, another paper that Apple researchers will present at NeurIPS this year.
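The sampling scheme itself is simple to state in code. The sketch below shows only the allocation step – choosing, independently per individual, the k of t steps in which that individual's record participates – not the amplification analysis, which is the paper's contribution:

```python
import random

def random_allocation(num_steps: int, k: int, rng: random.Random):
    """Pick the k (out of num_steps) steps in which one individual's
    record participates, uniformly at random without replacement."""
    return sorted(rng.sample(range(num_steps), k))

# Each individual is assigned independently; the randomness of these
# assignments is what amplifies the per-step privacy guarantee.
rng = random.Random(42)
t, k = 10, 3
assignments = {user: random_allocation(t, k, rng) for user in ["u1", "u2", "u3"]}
print(assignments)
```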

Understanding the Strengths and Limitations of Reasoning Models

Reasoning is a critical capability for AI, enabling systems to accomplish complex objectives that require planning and multiple steps – such as solving math and coding problems, as well as tasks for robots and virtual assistants. While the field has made significant progress in developing reasoning models, fundamental research that rigorously investigates the strengths and limitations of current approaches is essential to further advancing this capability in the future.

At NeurIPS, Apple researchers will present The Illusion of Thinking: Understanding the Strengths and Limitations of Reasoning Models via the Lens of Problem Complexity, which explores how current AI models handle complex reasoning tasks. Using controllable puzzle environments, the work systematically tests how these models' performance changes as problems increase in complexity (see Figure 1). The paper shows that the accuracy of frontier Large Reasoning Models (LRMs) collapses beyond certain complexities, and finds that LRMs' reasoning effort increases with problem complexity – up to a point – and then declines, despite an adequate token budget. The work also compares the performance of LRMs and LLMs at equal inference compute, finding that LLMs outperform LRMs on low-complexity tasks, LRMs show an advantage on medium-complexity tasks, and both types fail on high-complexity tasks. The paper provides insight into LRMs' strengths and limitations, raising important questions about these models' reasoning capabilities today, which may ultimately illuminate opportunities to make LRMs more capable in the future.
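One of the puzzle environments used in the paper is Tower of Hanoi. A minimal sketch of why such puzzles make a good complexity knob: the number of disks n controls difficulty smoothly, and the unique optimal solution makes grading a model's answer exact.

```python
def hanoi_moves(n: int, src="A", aux="B", dst="C"):
    """Generate the optimal move sequence for an n-disk Tower of Hanoi.

    The ground-truth solution length grows as 2**n - 1, so increasing
    n sweeps the task from trivial to far beyond any token budget.
    """
    if n == 0:
        return []
    return (hanoi_moves(n - 1, src, dst, aux)   # move n-1 disks out of the way
            + [(src, dst)]                      # move the largest disk
            + hanoi_moves(n - 1, aux, src, dst))  # stack the rest on top

# Solution length roughly doubles with each added disk: 1, 3, 7, 15, 31, ...
for n in range(1, 6):
    print(n, len(hanoi_moves(n)))
```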

One of the authors of the above paper will also deliver an Expo Talk on the topic of reasoning on Tuesday, December 2, at 8:30am PST in the Upper Level Ballroom 20AB. The talk will provide a critical review of reasoning in language models, highlight why current evaluations can be misleading, and emphasize that reasoning is not just about “what” models answer, but “how” they solve problems.

Innovative Approaches to Generative AI

The industry has made impressive progress in high-resolution image generation models, but the dominant approaches also have undesirable characteristics. Diffusion models are computationally expensive in both training and inference; autoregressive generative models can be expensive at inference and require quantization that can adversely affect their output's fidelity; and hybrid models that apply autoregressive techniques directly in continuous space are complex.

In the NeurIPS Spotlight paper, STARFlow: Scaling Latent Normalizing Flows for High-resolution Image Synthesis, Apple researchers share a scalable approach that generates comparable quality high-resolution images (see Figure 2), without the computational cost and complexity of prior methods. The method builds on the Transformer Autoregressive Flow (TARFlow), which combines normalizing flows (NF) and the autoregressive transformer architecture. STARFlow produces images at resolutions and quality levels previously thought unreachable for NF models, rivaling top diffusion and autoregressive methods while maintaining exact likelihood modeling and faster inference. This work is the first successful demonstration of normalizing flows at this scale and resolution, and it shows that normalizing flows are a strong alternative to diffusion models for AI image generation.

As generative AI models become increasingly widely used, efficient methods to control their generations – for example, to ensure they produce safe content or to give users the ability to explore style changes – are becoming increasingly important. Ideally, these methods should maintain output quality and not require a large amount of data or computational cost at training or inference time.

Apple researchers have previously demonstrated that an effective and efficient approach to this challenge is intervening exclusively on model activations, with the goal of correcting distributional differences between activations seen when using prompts from a source vs. a target set (e.g., toxic and non-toxic sentences). At NeurIPS, Apple researchers will present LinEAS: End-to-end Learning of Activation Steering with a Distributional Loss, which describes linear end-to-end activation steering (LinEAS), an approach trained with a global loss that simultaneously accounts for all layer-wise distributional shifts (see Figure 3). LinEAS requires only a handful of unpaired samples to be effective, and beats comparable baselines on toxicity mitigation in language models. Its global optimization allows the inclusion of a sparsity regularization, resulting in more precise and targeted interventions that are effective while preserving base-model fluency. The method is modality-agnostic and is shown to outperform existing activation-steering methods at mitigating and including new concepts in the output of single-step text-to-image generation models.
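To illustrate the general idea of a linear activation intervention (not the LinEAS algorithm itself, which trains all layers jointly against a distributional loss), the hypothetical sketch below fits a per-feature affine map that matches the mean and standard deviation of source-prompt activations to those of target-prompt activations:

```python
import numpy as np

class LinearSteer:
    """A single linear intervention a -> gamma * a + beta on activations.

    Stand-in for illustration only: here gamma and beta are set in closed
    form to match per-feature means and standard deviations, whereas
    LinEAS learns such parameters for all layers end-to-end.
    """
    def __init__(self, src_acts: np.ndarray, tgt_acts: np.ndarray):
        src_mu, src_sd = src_acts.mean(0), src_acts.std(0) + 1e-6
        tgt_mu, tgt_sd = tgt_acts.mean(0), tgt_acts.std(0)
        self.gamma = tgt_sd / src_sd          # rescale each feature
        self.beta = tgt_mu - self.gamma * src_mu  # then shift it

    def __call__(self, acts: np.ndarray) -> np.ndarray:
        return self.gamma * acts + self.beta

# Unpaired activation samples from "source" and "target" prompt sets.
rng = np.random.default_rng(0)
src = rng.normal(0.0, 1.0, size=(64, 8))
tgt = rng.normal(2.0, 0.5, size=(64, 8))
steer = LinearSteer(src, tgt)
steered = steer(src)
print(steered.mean(0).round(1))  # per-feature means near the target's ~2.0
```

Note that only a modest number of unpaired samples from each set is needed to estimate these statistics, which mirrors the data efficiency the paper reports.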

A Principled Approach to Determining Training Data Mixtures

Large foundation models are typically trained on data from multiple domains, and the data mixture – the proportion of each domain used in training – plays a critical role in model performance. The standard approach to selecting this mixture relies on trial and error, which becomes impractical for large-scale pretraining.

At NeurIPS, Apple researchers will present Scaling Laws for Optimal Data Mixtures, which provides a better approach to this fundamental challenge. The paper shares a systematic method to determine the optimal data mixture for any target domain using scaling laws (see Figure 4). The scaling laws predict the loss of a model of size N trained with D tokens on a mixture h. The paper shows that these scaling laws are universal, and demonstrates their predictive power for large-scale pretraining of large language models (LLMs), native multimodal models (NMMs), and large vision models (LVMs). It also shows that these scaling laws can extrapolate to new data mixtures and across scales: their parameters can be accurately estimated using a few small-scale training runs, and then used to estimate the performance at larger scales and for unseen domain weights. The scaling laws allow practitioners to derive the optimal domain weights for any target domain under a given training budget (N, D), providing a principled alternative to costly trial-and-error methods.
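The workflow above can be sketched with a toy mixture-aware scaling law. The functional form and every constant below are invented for illustration (the paper's fitted law differs); the point is only the usage pattern: once a law L(N, D, h) is fitted on small runs, the optimal domain weight for a budget (N, D) falls out of a cheap minimization instead of repeated large training runs.

```python
def predicted_loss(N: float, D: float, h: float,
                   E=1.7, A=400.0, alpha=0.34, B=410.0, beta=0.28) -> float:
    """Toy mixture-aware scaling law: a Chinchilla-style power law in
    N and D plus an assumed quadratic penalty in the domain weight h.
    All constants here are illustrative, not the paper's fitted values.
    """
    mixture_penalty = 0.05 * (h - 0.6) ** 2  # assumed dependence on h
    return E + mixture_penalty + A / N ** alpha + B / D ** beta

def best_weight(N: float, D: float, grid: int = 101) -> float:
    """Grid-search the domain weight in [0, 1] that minimizes the
    predicted loss under a fixed training budget (N, D)."""
    hs = [i / (grid - 1) for i in range(grid)]
    return min(hs, key=lambda h: predicted_loss(N, D, h))

# For this toy law the optimum is the penalty's minimum, h = 0.6.
print(best_weight(N=1e9, D=2e10))
```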

Demonstrating ML Research in the Apple Booth

During exhibition hours, NeurIPS attendees will be able to interact with live demos of Apple ML research in booth #1103. These include:

  • MLX – an open source array framework designed for Apple silicon that enables fast and flexible ML and scientific computing on Apple hardware. The framework is optimized for Apple silicon's unified memory architecture and leverages both the CPU and GPU. Visitors will be able to experience two MLX demos:

    • Image generation with a large diffusion model on an iPad Pro with M5 chip
    • Distributed compute with MLX and Apple silicon: Visitors will be able to explore text and code generation with a 1 trillion-parameter model running in Xcode on a cluster of four Mac Studios equipped with M3 Ultra chips, each running with 512 GB of unified memory.
  • FastVLM – a family of mobile-friendly vision language models, built using MLX. These models use a combination of CNN and Transformer architectures for vision encoding, designed specifically for processing high-resolution images. Together, they demonstrate a strong approach that achieves an optimal balance between accuracy and speed. Visitors will get to experience a real-time visual question-and-answer demo on iPhone 17 Pro Max.

Supporting the ML Research Community

Apple is committed to supporting underrepresented groups in the ML community, and we are proud to again sponsor several affinity groups hosting events onsite at NeurIPS 2025 in San Diego, including Women in Machine Learning (WiML) (workshop on December 2), LatinX in AI (workshop on December 2), and Queer in AI (workshop and evening social on December 4). In addition to supporting these workshops with sponsorship, Apple employees will also be participating in each of these, as well as other events taking place throughout the conference.

Learn More about Apple ML Research at NeurIPS 2025

This post highlights just a handful of the works Apple ML researchers will present at NeurIPS 2025; a comprehensive overview and schedule of our participation can be found here.
