Skip to content

Glossary

Precise definitions for terms that are commonly confused, conflated, or misused in enterprise data architecture. Each term includes what it is, what it is not, and where it fits in the target-state architecture.


Bronze / Silver / Gold

Data refinement stages within the EDP: raw ingestion (bronze), cleansed and conformed (silver), business-ready (gold). Not: business process stages or a workflow model. Architecture layer: Layer 4 (EDP).

Data Fabric

A vendor-marketed architectural approach emphasizing metadata-driven data integration across distributed systems. Not: a single product or platform. Not the same as data mesh. Included here for disambiguation -- this guide does not prescribe data fabric as a pattern.

Data Lake

A storage system for raw, unprocessed data in native formats (Parquet, JSON, CSV). Not: a data warehouse. Not governed by default. Architecture layer: Layer 4 (EDP), typically the bronze/raw tier.

Data Mesh

An organizational model for decentralized data ownership where domain teams own and publish data products. Not: a technology platform. Not a replacement for centralized infrastructure. See Data Mesh positioning (Phase 2) for detailed treatment. Architecture layer: organizational overlay across Layers 4--5.

Data Products

Governed, documented, discoverable datasets with defined ownership, schema, SLA, and quality contract. Not: raw tables, API endpoints, or "any data someone published." Architecture layer: Layer 5 (Semantic / Data Product Layer).

Data Warehouse

A structured analytical data store optimized for SQL queries, aggregations, and reporting. Not: an operational database. Not a data lake (though lakehouses blur this line). Architecture layer: Layer 4 (EDP).

Enterprise Data Platform (EDP)

An integrated analytical platform that ingests, historizes, governs, and serves data across business domains for analytics, AI, and regulatory reporting. Not: an operational platform, a transaction processing system, or a workflow engine. Architecture layer: Layer 4.

Feature Store

An ML-specific data system that manages the computation, storage, and serving of features for model training (offline) and inference (online). Not: a general-purpose data store. Bridges the analytical and operational worlds. Architecture layer: between Layer 5 and Layer 6.

Lakehouse

A platform combining data lake storage flexibility with data warehouse query capabilities, typically using open table formats (Delta Lake, Apache Iceberg). Not: a replacement for operational databases. Architecture layer: Layer 4 (EDP).

Operational Data Store (ODS)

A purpose-built store for current-state, operational access patterns with low latency and transactional consistency. Holds denormalized, access-optimized data for operational use. Not: the EDP. Not a data warehouse. Architecture layer: Layer 3a.

Raw / Curated / Consumption

Alternative naming for bronze/silver/gold. Same concept: data refinement stages within the EDP. Some organizations prefer this naming to avoid the "gold = final product" misconception. Architecture layer: Layer 4 (EDP).

Serving Layer

Infrastructure that sits between data products and operational consumers, optimized for low-latency, high-concurrency access. APIs, caches, materialized views, feature stores. Not: the EDP. Not the data product itself. Architecture layer: between Layer 5 and operational consumers.

Transaction Processing Platform

Systems designed for ACID-compliant, low-latency business transactions: payments, orders, bookings, settlements. Not: the EDP. Not analytical. Architecture layer: Layer 3.

Workflow Platform

Systems for orchestrating multi-step business processes with state management, human tasks, exception handling, and SLA tracking. Temporal, Camunda, or purpose-built BPM. Not: dbt. Not Airflow (which is a pipeline orchestrator, not a business workflow engine). Architecture layer: Layer 3.


Terms with dedicated pages in this guide (e.g., Data Mesh) are defined briefly here with a link to the full treatment.