Glossary/OLTP, OLAP & Workload Types

Grain

Grain is the level of detail represented by a single row in a table—the precise answer to the question "what does one record mean?"—and it is the foundational decision in dimensional modeling because every dimension and measure in a table must be consistent with it.

The grain of a fact table is the business event or state that one row captures. "One row per order line item," "one row per customer per day," and "one row per sensor reading" are all grain declarations. The grain fixes the meaning of the table: once you state it, you can test every proposed column by asking whether it is true at that level of detail. A column that varies more finely than the grain doesn't belong; a column that is constant across the grain is a candidate dimension attribute.

Grain is usually described in one of two ways. Transactional grain captures an event at its most atomic level (a single sale, a single click). Periodic or snapshot grain captures the state of something at regular intervals (account balance at end of day, inventory on hand each night). Atomic grain—the finest level the source data supports—is generally preferred because it can be rolled up to any coarser level later, whereas pre-aggregated grain throws away detail you can never recover.

Grain also governs how dimensions attach. A dimension can only be joined to a fact table if it makes sense at that grain. Choosing a finer grain gives you more analytical flexibility at the cost of storage and volume; choosing a coarser grain saves space but locks in the questions you can ask.

Key Characteristics

▶Defines exactly what a single row represents—one order line, one customer-day, one sensor reading
▶Declared explicitly before any dimension or measure is added to the table
▶Transactional grain captures atomic events; periodic/snapshot grain captures state at intervals
▶Atomic (finest) grain is preferred because it rolls up to any coarser level
▶Every column in the table must be true at the declared grain
▶Determines which dimensions can attach and how aggregations behave

Why It Matters

▶Mixed or undeclared grain is the root cause of a large share of analytics bugs—double-counting, meaningless averages, and unpredictable join fan-out
▶A single, explicit grain per table keeps aggregations correct and the model legible to everyone downstream
▶Finer grain buys analytical flexibility at the cost of storage; coarser grain saves space but forecloses future questions
▶Stapling header-level attributes (like order shipping cost) onto line-level rows multiplies them when summed
▶Prematurely aggregating to "save space" discards detail that no later question can recover

Example

Consider retail sales. A fact table at the grain "one row per order line" can answer questions about individual products, quantities, and discounts, and can still be summed up to order, customer, day, or region totals. A table pre-aggregated to "one row per store per day" cannot be decomposed back into product-level detail—that information is gone. Starting at the atomic line grain and aggregating on demand preserves every downstream question.

Coginiti Perspective

Coginiti encourages defining grain explicitly and early, then letting it propagate through the modeling layer. In CoginitiScript, each materialized block produces a recordset with a known, documented grain; the #+meta {} metadata and #+test blocks let teams assert and continuously verify grain assumptions—for example, a uniqueness test on the declared key catches accidental fan-out before it reaches a report. Because Coginiti favors ELT patterns over data movement, atomic source data stays in place on the warehouse or lakehouse and is remodeled as needed, so teams can land data at the finest grain and derive coarser aggregates with publication targets rather than discarding detail. In the semantic layer (SMDL), entities carry their grain implicitly through their primary keys and relationships, and measures are defined with explicit aggregation types so that a sum or average is always evaluated at the correct level regardless of how an analyst slices it in Semantic SQL.

Related Concepts

Slowly Changing Dimensions Conformed Dimensions Semi-Additive Measures Fan & Chasm Traps Fact Table Dimension Table

See Semantic Intelligence in Action

Coginiti operationalizes business meaning across your entire data estate.

Request a Demo

Grain

Key Characteristics

Why It Matters

Example

Coginiti Perspective

Related Concepts

More in OLTP, OLAP & Workload Types

Analytical Workload

Conformed Dimensions

Dimension Table

Fact Table

Fan and Chasm Traps

HTAP (Hybrid Transactional/Analytical Processing)

See Semantic Intelligence in Action