A briefing for Disney Experiences

Hybrid Data + AI Foundation

How Disney Streaming runs Databricks as the foundational data layer — with Snowflake as a first-class analyst surface and AI/ML built on top.

Prepared for Chris Taylor · The Databricks Disney team · May 2026
Begin →

Hybrid by design.

One foundation. Two consumption surfaces. AI on top.

Layer 03

Personalization, AI, ML & security

Powering Disney Streaming's full AI/ML estate — model serving, agentic workflows, fraud detection, and the security capabilities behind them.

Layer 02

Analyst consumption

Snowflake stays first-class. Same 360 Gold data, no copy, no ETL — read directly via Iceberg.

Layer 01

Foundational data + Unity Catalog

Databricks owns the catalog of record. One copy of every 360 table. Governed end-to-end.

One catalog.
One set of metrics.
One copy of data.

A Databricks proposal is underway to unify data and business logic within Disney Streaming — Experiences in early would shape Wave 1 scope.

Iceberg Interoperability

Physical layer

360 Gold data stored once, readable from Databricks and Snowflake. No copies, no 7 PB/month round-trips.

Unified Semantic Layer

Meaning layer

One YAML repo defines every metric. CI/CD fans out to UC Metric Views, Snowflake Semantic Views, Looker, Tableau.

Why it matters for Experiences: subscriber churn, LTV, DAU, attendance, dwell — defined once, not five different ways. Finance, BI, and AI agents all read the same number.

One architecture.
Every engine. Every team.

Iceberg and a unified semantic layer power the complete ecosystem — no lock-in, no rewrites, every tool a first-class consumer.

What Streaming delivers today.

Last 12 months on the hybrid pattern.

200M+
Subscribers on a unified data foundation
$300M+
Revenue opportunity for Ad Platforms (incremental, once ML models are fully deployed)
$13M
Annual savings from chargeback fraud ML on Disney+
5%
Engagement uplift on Disney+ through personalization
2,000+
Active users on the platform, 83% YoY growth
400+
Active teams developing on Databricks across Disney Streaming

Across every part of Streaming.

Active and in-flight Databricks AI/ML workloads at Disney Streaming, organized by business outcome.

Know Your Audience

Active

Identity & Customer 360 · Household & Device Graph · Audience Profiling · Audience Segmentation

Grow Your Audience

Active

Campaign Planning & Optimization · Media Mix Modeling · Market Research & Insights · Lookalike Audience Creation · Subscriber Lifecycle (churn, win-backs) · Lifecycle Marketing · Next Best Action

Audience Experience

Active

Search & Discovery · Content Recommendation · Real-time Personalization · Audience Activation · Engagement Tracking

Subscriptions & LTV

Active

Subscription Pricing & Packaging · Upsell / Cross-sell & Bundles · Subscriber Acquisition Cost & Playback · Subscriber LTV & Margin Optimization

Advertising Sales

Building

Ad Sales Pipeline & Deal Management · Inventory Forecasting & Packaging · Rate Cards, Discounts & Deal Governance · Direct / Programmatic Channel Mix · AI Agent-Assisted Media Selling

Ad Yield & Optimization

Building

Pricing & Yield Management · Ad Decisioning & Pacing · Contextual & Audience Targeting · AI Ad Creative Generation · Ad Experience Personalization · Measurement & Attribution · Incrementality Testing & Lift Measurement · Brand Safety & Suitability Controls

What the analyst layer alone can't deliver.

Real-time inference. Governed feature store. On-data AI.

Chargeback fraud

Real-time ML · online tables · UC governance

Streaming inference flags chargeback risk in-flight. Cut fraud rate from 18 bps to 10 bps — well below industry average.

Personalized search

Custom models · Vector Search

Custom models developed on Databricks powering personalized search with relevant context and recommendations.

Payment processor routing

Online feature store · streaming inference

ML-routed retry paths on payment failures. Reduces involuntary churn at the subscription gateway.

Customer support

Custom models · Guest 360

Custom models surface guest-specific details at every customer service touchpoint.

Subscriber data

Subscriber 360 · ML scoring · Unity Catalog

Unified subscriber data powers churn, LTV, and segmentation models — lakehouse-native, no out-of-band copies.

Personalization model training

Serverless GPU · MLR · Unity Catalog

Recommendation and ranking models trained on Databricks serverless GPU, directly on lakehouse data — no extra ETL to a separate training cluster.

The same foundation, your use cases.

Topics that map directly to what your team has already flagged.

Iceberg interop

Same hybrid pattern. Snowflake stays in the loop. No rip-and-replace.

Cost optimization

UC governance, automated jobs, and one-copy-of-data as the cost lever.

Guest-scale AI/ML

Personalization, dynamic pricing, ops forecasting — at park scale.

Semantic layer — Wave 1

Experiences in early. Define park-domain metrics alongside D2C in the same repo.

Next step — a working session to map one specific Experiences use case onto this foundation.

Let's talk →