Taxonomic vs. inferential skills intelligence

Why labeling skills is not the same as understanding them, and how inference changes the game

7 min read Foundations of Agentic HR

Two Fundamentally Different Architectures

When enterprise software vendors talk about “skills intelligence,” they are usually describing one of two fundamentally different architectures. The difference between them is not a matter of degree. It is a difference in kind, comparable to the difference between a paper card catalog and a search engine. Both help you find books. One does it by maintaining a fixed organizational scheme. The other does it by understanding what you are actually looking for.

The first architecture is taxonomic. It starts with a predefined list of skills, organized into hierarchies, and uses rules or manual curation to assign those labels to people, roles, and learning content. Workday Skills Cloud is the most prominent example of this approach, though most legacy HCM platforms follow the same basic pattern.

The second architecture is inferential. It starts with raw data (job descriptions, project histories, performance reviews, learning completions, collaboration patterns) and uses machine learning models, particularly semantic embeddings, to infer skills, their relationships, and their context without requiring a predefined label set.

The architecture you choose determines the ceiling of what your skills data can do for you.

How Taxonomic Systems Work

A taxonomic skills platform operates on a straightforward principle: define the universe of skills, then classify everything against that universe.

The process typically works as follows. A base library of skills is licensed or created, often containing 20,000 to 50,000 entries. Each skill has a canonical name, a definition, and a position in a hierarchy. Rules are written to map job titles, job descriptions, learning content, and employee profiles to specific skills in the library. Employees may also self-report skills by searching the library and selecting matching entries.
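The classification step described above can be pictured as a lookup against a fixed library. A deliberately minimal sketch (the skill IDs, names, and rule phrases are invented for illustration, not drawn from any vendor's actual library):

```python
# Toy taxonomic classifier: a fixed skill library plus exact-match rules.
# All skill names and mappings here are illustrative, not any real library.
SKILL_LIBRARY = {
    "python-programming": "Python Programming",
    "people-analytics": "People Analytics",
    "ml": "Machine Learning",
}

# Rules map phrases found in job descriptions to canonical skill IDs.
RULES = {
    "python": "python-programming",
    "people analytics": "people-analytics",
    "machine learning": "ml",
}

def classify(job_description: str) -> list[str]:
    """Return canonical skill labels whose rule phrase appears in the text."""
    text = job_description.lower()
    return [SKILL_LIBRARY[skill_id]
            for phrase, skill_id in RULES.items()
            if phrase in text]

print(classify("Seeking analyst with People Analytics and Python experience"))
# Matches both mapped skills via their rule phrases...
print(classify("Seeking a Workforce Data Science specialist"))
# ...but returns an empty list for the unmapped synonym: label dependence.
```

The second call fails precisely because classification is anchored to the label set rather than to meaning, which is the structural weakness the next section examines.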

Reiko Tanaka, who spent four years managing a taxonomic skills platform at a global financial services firm, described the operational reality: “Every month we had a governance meeting. New skills to add, old skills to deprecate, duplicates to merge. We had 200 open requests in the queue at any given time. By the time we approved a skill like ‘Generative AI Prompt Engineering,’ half the organization was already doing it without any formal skill tag.”

The strengths of this approach are clarity and auditability. Every skill has a definition. Every relationship is explicit. For compliance-heavy industries, this transparency has real value.

But the weaknesses are structural:

  • Static by nature. The taxonomy only knows what has been explicitly added. New skills, emerging combinations, and context-dependent variations are invisible until a curator adds them.
  • Label-dependent. Two identical capabilities described differently (“People Analytics” vs. “Workforce Data Science”) appear as unrelated unless someone manually creates a synonym mapping.
  • Context-blind. “Python” means something different for a data scientist, a DevOps engineer, and a quantitative researcher. A taxonomic system assigns the same label to all three.
  • Maintenance-intensive. The taxonomy degrades without constant human attention, creating an ongoing operational cost that scales with organizational complexity.

How Inferential Systems Work

An inferential skills platform starts from a different premise: instead of defining the universe of skills and classifying against it, learn the structure of skills from the data itself.

The core technology is the semantic embedding. Rather than representing a skill as a label in a hierarchy, an inferential system represents it as a vector, a point in a high-dimensional mathematical space where proximity corresponds to meaning. “Machine Learning” and “Statistical Modeling” end up near each other not because a curator placed them in the same category, but because they appear in similar contexts, are held by similar people, and are applied to similar work.
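Proximity in an embedding space is typically measured with cosine similarity. A toy sketch with hand-set three-dimensional vectors (real systems learn vectors with hundreds of dimensions from workforce data; these numbers are invented to make the geometry visible):

```python
import math

# Hand-set toy vectors; production systems learn high-dimensional
# embeddings from job descriptions, profiles, and work artifacts.
skills = {
    "Machine Learning":     [0.9, 0.8, 0.1],
    "Statistical Modeling": [0.8, 0.9, 0.2],
    "Graphic Design":       [0.1, 0.2, 0.9],
}

def cosine(a: list[float], b: list[float]) -> float:
    """Cosine similarity: near 1.0 means similar meaning, near 0.0 unrelated."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

ml = skills["Machine Learning"]
print(cosine(ml, skills["Statistical Modeling"]))  # high: close in meaning
print(cosine(ml, skills["Graphic Design"]))        # low: distant in meaning
```

No curator declared "Machine Learning" and "Statistical Modeling" related; their similarity falls out of where the vectors sit in the space.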

This representation enables several capabilities that taxonomic systems cannot match:

  • Automatic relationship discovery. The system identifies that “Kubernetes orchestration” and “Docker container management” are closely related without anyone telling it so, because the data shows they co-occur, they appear in similar job descriptions, and people who have one frequently have the other.
  • Contextual disambiguation. “Python” for a data scientist and “Python” for a DevOps engineer occupy different regions of the embedding space because they appear in different contexts, alongside different co-occurring skills, and in different types of work.
  • Emergent skill detection. When a new capability begins appearing in job postings, project descriptions, and learning completions, the system detects it as a cluster of related activity before any curator names it.
  • Continuous evolution. The embedding space updates as new data arrives. There is no governance queue. The model adapts to reflect what is actually happening in the workforce.
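Emergent skill detection, the third capability above, can be sketched as spotting unlabeled terms that start co-occurring across documents. A deliberately simplified version (the postings, terms, and threshold are invented; real systems cluster in embedding space rather than counting raw pairs):

```python
from collections import Counter
from itertools import combinations

# Terms already in the known skill set; anything else is "unlabeled".
KNOWN = {"python", "sql", "statistics"}

# Tokenized job postings (illustrative data).
postings = [
    {"python", "prompt-engineering", "rag-pipelines"},
    {"sql", "prompt-engineering", "rag-pipelines"},
    {"statistics", "prompt-engineering", "rag-pipelines"},
    {"python", "sql"},
]

# Count how often pairs of unlabeled terms appear together.
pair_counts = Counter()
for posting in postings:
    unlabeled = sorted(posting - KNOWN)
    pair_counts.update(combinations(unlabeled, 2))

# Pairs crossing a co-occurrence threshold form a candidate emergent
# cluster before any curator has named or approved the skill.
emergent = [pair for pair, n in pair_counts.items() if n >= 3]
print(emergent)  # [('prompt-engineering', 'rag-pipelines')]
```

The cluster surfaces from activity alone, which is exactly the governance-queue problem Reiko Tanaka described: the capability exists in the workforce before it exists in the taxonomy.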

David Okonkwo, who leads AI research at a workforce intelligence company, explained the difference using an analogy: “A taxonomy is like a filing cabinet. Everything has a drawer, a folder, and a label. An embedding space is like a map. Things that are close together are related, and you can discover neighborhoods, routes, and distances that nobody planned in advance.”

Head-to-Head Comparison

The following table compares the two approaches across the dimensions that matter most to enterprise buyers:

Dimension | Taxonomic Approach | Inferential Approach
Data model | Fixed hierarchy of labeled skills | Continuous vector space of semantic embeddings
How skills are identified | Manual curation or rules-based matching | Machine learning inference from contextual data
Handling new skills | Requires human addition to the taxonomy | Detected automatically as patterns emerge in data
Synonym resolution | Manual synonym mapping tables | Automatic, based on semantic proximity
Context sensitivity | None; one label per skill regardless of context | High; same skill represented differently by context
Maintenance burden | High; dedicated curation team required | Low; model updates continuously from data
Auditability | High; every classification is explicit | Moderate; requires explainability tooling
Cold start capability | Good; taxonomy provides structure immediately | Requires initial data to train models
Scale ceiling | Degrades as taxonomy grows beyond 5,000-10,000 active skills | Improves as more data becomes available
Cross-language support | Requires separate taxonomy per language | Multilingual embeddings handle translation natively
Skill adjacency and pathways | Manually curated career path mappings | Computed automatically from workforce transition data
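The last row of the table, pathways computed from workforce transition data, can be sketched as simple transition counting. The roles and moves below are invented for illustration; a real system would draw on internal mobility records at scale:

```python
from collections import Counter

# Observed role transitions (from_role, to_role); illustrative data,
# not drawn from any real workforce.
transitions = [
    ("Data Analyst", "Data Scientist"),
    ("Data Analyst", "Data Scientist"),
    ("Data Analyst", "Analytics Engineer"),
    ("Data Scientist", "ML Engineer"),
    ("Data Analyst", "Data Scientist"),
]

counts = Counter(transitions)
outgoing = Counter(src for src, _ in transitions)

def transition_probability(src: str, dst: str) -> float:
    """Fraction of observed moves out of src that land in dst."""
    return counts[(src, dst)] / outgoing[src] if outgoing[src] else 0.0

print(transition_probability("Data Analyst", "Data Scientist"))  # 0.75
```

Pathways emerge from what people actually do, rather than from a curated career-path map that must be revised by hand whenever roles change.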

The Workday Skills Cloud Example

Workday Skills Cloud is the most widely deployed taxonomic skills platform in the enterprise market. It maintains a library of over 50,000 skills, uses machine learning to suggest skill assignments, and provides a governance framework for organizations to customize and extend the base library.

To be fair, Workday has invested in ML-assisted features. The platform can suggest skills based on job profiles and can identify potential duplicates. But the underlying architecture remains taxonomic: there is a canonical list of skills, and the system’s job is to classify workforce data against that list.

Kenji Watanabe, an HR technology analyst, noted: “Workday Skills Cloud is excellent at governing a large skills taxonomy inside a system of record. The challenge is that governing a taxonomy and generating intelligence from skills data are two different problems. One is about data hygiene. The other is about insight generation.”

The practical implications show up in specific scenarios:

Scenario | Taxonomic System Response | Inferential System Response
“Which employees could transition into machine learning roles?” | Returns people tagged with ML-adjacent skills from the taxonomy | Returns people whose work patterns, learning trajectories, and skill combinations position them near ML capability in the embedding space, including people who have never been tagged with an ML skill
“What skills will we need for our autonomous vehicle initiative?” | Returns skills from the taxonomy that match keywords in the initiative description | Analyzes job postings, patents, and research papers in the autonomous vehicle domain to identify required capability clusters, including emergent skills not yet in any taxonomy
“How does our AI capability compare to the external market?” | Counts employees with AI-related skill tags and compares to external job posting data | Maps internal capability embeddings against external labor market embeddings to identify precise areas of strength and deficit at any level of granularity
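The first scenario, finding transition candidates without relying on tags, amounts to a nearest-neighbor query in the embedding space. A toy sketch in two dimensions (all names, vectors, and the target point are invented; real systems use learned high-dimensional embeddings):

```python
import math

# Toy employee profiles as points in a 2-D capability space.
# A real system would embed each person's skills, work, and learning history.
employees = {
    "Priya": [0.85, 0.80],   # statistics-heavy analyst, never tagged with ML
    "Tomás": [0.20, 0.15],   # facilities coordinator
    "Lena":  [0.90, 0.88],   # research engineer
}
ML_TARGET = [0.9, 0.9]       # centroid of the ML-capability region

def distance(a: list[float], b: list[float]) -> float:
    """Euclidean distance: smaller means closer to the target capability."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

# Rank everyone by proximity to the ML region; no skill tags required.
ranked = sorted(employees, key=lambda name: distance(employees[name], ML_TARGET))
print(ranked)  # ['Lena', 'Priya', 'Tomás']
```

Priya surfaces as a strong candidate even though no curator ever tagged her with an ML skill, which is the behavior the taxonomic column cannot reproduce.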

The Convergence Question

A reasonable question is whether taxonomic platforms will simply add inferential capabilities over time. Some convergence is inevitable. Workday and other HCM vendors are investing in ML, and their platforms will get smarter.

But architecture constrains evolution. A system built around a fixed label set can add ML on top, but the label set remains the organizing principle. Sofia Lindberg, CTO of a workforce intelligence startup, made the point directly: “You can add inference to a taxonomy. But you cannot make a taxonomy think. The underlying data model determines what questions the system can answer, and a hierarchical label set will always be limited to questions that can be framed in terms of those labels.”

What This Means for Buyers

For organizations evaluating skills technology, the taxonomic vs. inferential distinction should be the first architectural question you ask. Not “how many skills are in your library” or “do you use AI,” but “what is your underlying data model, and what does it allow you to do that a different model would not?”

The answer will determine whether your skills platform can do what the business actually needs: not just label capabilities, but understand them, connect them to work, predict how they will evolve, and surface insights that no human curator could maintain at scale. That is the difference between a skills taxonomy and skills intelligence.

Key insight

A taxonomy tells you what label to put on a skill. An inference engine tells you what that skill actually means in context, how it relates to other capabilities, and where it is headed.

Key terms

Taxonomic Skills Intelligence
An approach to skills management that relies on predefined labels, hierarchies, and classification rules maintained by human curators or rules-based systems.
Inferential Skills Intelligence
An approach that uses machine learning, semantic embeddings, and contextual analysis to understand, relate, and reason about skills without depending on a fixed label set.
Semantic Embedding
A mathematical representation of a concept (such as a skill) as a vector in high-dimensional space, capturing meaning and relationships rather than just a label.
Skill Adjacency
The measurable proximity between two skills in an embedding space, indicating how closely related they are in terms of the underlying capability they represent.
Cold Start Problem
The challenge of making useful recommendations or inferences for new employees, new roles, or new skills that have little or no historical data in the system.
The bottom line

Taxonomic systems are brittle, slow to update, and disconnected from real work. Inferential systems are adaptive, contextual, and capable of surfacing insights that no human curator could maintain manually. The shift from taxonomy to inference is not incremental improvement. It is a category change.