Taxonomy
A Taxonomy is a hierarchical classification system that organizes concepts, entities, or objects into categories and subcategories, establishing systematic relationships for organization and navigation.
Taxonomies are hierarchical ontologies: they organize information into parent-child relationships forming tree-like structures. The Linnaean biological taxonomy organizes life into Kingdom, Phylum, Class, Order, Family, Genus, and Species. In e-commerce, product taxonomies organize products into categories: Electronics > Computers > Laptops. Each level specifies a classification principle: biology classifies by evolutionary relationship, products by functionality. Taxonomies provide a structured way to organize information that users can navigate intuitively.
Taxonomies differ from flat tag systems (any tags applied to items) and from graphs (which allow multiple relationships per entity). Taxonomies enforce hierarchical structure: each entity has one parent in the hierarchy. This constraint makes taxonomies easier to understand and navigate compared to complex knowledge graphs. However, this simplicity comes at a cost: some relationships don't fit neatly into hierarchies. A product might belong to multiple categories (winter clothing and water-resistant items), requiring multi-hierarchical taxonomies or cross-cutting categorizations.
Taxonomies are foundational to information architecture in enterprises. Master data governance often establishes hierarchies (organization hierarchies, product hierarchies, account type hierarchies) that serve as references across systems. Taxonomies enable drill-down analytics: analyzing sales by product category, then drilling down to product subcategory.
Key Characteristics
- ▶Organizes concepts or entities in hierarchical parent-child relationships
- ▶Establishes a single primary classification path for each entity
- ▶Specifies a classification principle at each level (e.g., product type)
- ▶Enables intuitive navigation and browsing of categorized information
- ▶Facilitates drill-down analysis from broad to specific categories
- ▶Serves as authoritative reference for category definitions and relationships
Why It Matters
- ▶Provides intuitive organization enabling users to navigate complex information spaces
- ▶Facilitates consistent categorization across systems and teams
- ▶Enables dimensional analysis and drill-down capabilities in analytics
- ▶Reduces complexity compared to general knowledge graphs while providing structure
- ▶Supports search and discovery by providing navigation paths
- ▶Establishes master data reference standards for categories and hierarchies
Example
A product taxonomy for an e-commerce site: Electronics > Computers > Laptops > Gaming Laptops. This hierarchy enables customers to browse from broad (Electronics) to specific (Gaming Laptops). Analytics can report sales by category at each level, and when customers browse the hierarchy, they naturally refine their search.
Coginiti Perspective
SMDL dimensions can encode hierarchical taxonomies through relationship definitions and property organization, enabling drill-down analytics on category hierarchies defined in the semantic model. By formalizing taxonomies in SMDL, organizations maintain a single source of truth for category definitions that propagates consistently across all Semantic SQL queries and ODBC-connected tools like Power BI and Excel.
Related Concepts
More in Knowledge Representation
Concept Modeling
Concept Modeling is the process of defining and structuring the fundamental ideas, entities, and relationships within a domain to create a shared understanding that can be used for analytics, integration, and AI reasoning.
Entity
An Entity is a distinct object or concept that can be uniquely identified and described using properties and relationships, serving as a fundamental unit in knowledge representation and data modeling.
Entity Resolution
Entity Resolution is the process of identifying and matching records that represent the same real-world entity across databases, data sources, or versions, enabling unified views and accurate analytics.
Graph Database
A Graph Database is a specialized data system that stores and retrieves data organized as networks of connected entities and relationships, optimizing for traversal and pattern-matching queries over relational structure.
Knowledge Graph
A Knowledge Graph is a structured representation of information where entities (people, places, concepts) are nodes and relationships between them are edges, enabling semantic understanding and traversal of complex data.
Linked Data
Linked Data is a method of publishing structured information on the web using standard formats and linking that data to external sources, enabling automatic discovery and integration across diverse systems.
See Semantic Intelligence in Action
Coginiti operationalizes business meaning across your entire data estate.