Bitmask DP applies when the DP state involves a subset of elements. Instead of storing sets explicitly, the subset is encoded as a binary integer, which makes the state a plain number between 0 and 2^n - 1. Problems like "assign n people to n tasks" or "visit all n cities" become DPs over these integer-encoded subsets.

It only works when n is small (typically n <= 20), because the number of subsets grows exponentially. When the constraint fits, bitmask DP turns an exponential brute force over orderings into a polynomial sweep over subsets.

What Is Bitmask DP?

Consider a set of 4 elements: {A, B, C, D}. Each element is either included or not, which we can represent with a bit. The bitmask 1011 (which equals 11 in decimal) means elements at indices 0, 1, and 3 are included, while element 2 is not.

Every subset maps to exactly one integer between 0 and 2^n - 1. The bitmask compresses a subset into a single number, and that number becomes the DP state.

The following diagram shows the subset lattice for n = 3 elements {a, b, c}. Each node is a bitmask, and edges connect subsets that differ by exactly one element.

Each level represents subsets of a particular size. The DP transitions move upward through this lattice, building larger subsets from smaller ones.

Bit Manipulation Essentials

The following bit operations appear throughout bitmask DP solutions.

Operation	Code (Java)	Example (mask = 1010, i = 1)	Result
Check if i-th bit is set	`(mask >> i) & 1`	`(1010 >> 1) & 1`	1 (set)
Set the i-th bit	`mask \| (1 << i)`	`1010 \| 0010`	1010 (no change, already set)
Clear the i-th bit	`mask & ~(1 << i)`	`1010 & 1101`	1000
Toggle the i-th bit	`mask ^ (1 << i)`	`1010 ^ 0010`	1000
Count set bits	`Integer.bitCount(mask)`	`bitCount(1010)`	2
Check if mask is 0	`mask == 0`	`1010 == 0`	false
Full mask (all n bits set)	`(1 << n) - 1`	`(1 << 4) - 1`	1111 (15)

The common transition pattern iterates over elements and tries to add each unused one to the current subset:

How to Identify Bitmask DP Problems

Two signals matter: small n (typically n <= 20, sometimes n <= 15) and subset selection (choosing, assigning, or visiting elements). When both are present, bitmask DP is a candidate. The problem families that fit:

Problem Type	Signal	Examples
Assignment/Matching	Assign n items to n slots optimally	Assignment Problem, Smallest Sufficient Team (LC 1125)
Travelling Salesman	Visit all n nodes exactly once	Shortest Hamiltonian Path
Subset Partitioning	Partition n items into groups	Partition to K Equal Sum Subsets
Set Cover	Cover all elements using fewest subsets	Minimum Number of Work Sessions
Counting	Count valid arrangements of n items	Number of Ways to Wear Hats

The constraint to look for is n <= 20. If n can be up to 1000, bitmask DP does not apply because 2^1000 states are far too many. But 2^20 is about one million, which is manageable.

A second signal is when the problem asks to track "which elements have been used." A boolean visited array carried across recursive calls can usually be replaced with a bitmask.

The Core Idea

The general framework for bitmask DP is:

State: dp[mask] represents the answer (minimum cost, number of ways, boolean feasibility, etc.) when the set of elements included so far is described by mask.
Base case: Usually dp[0] = initial value (empty subset).
Transition: For each mask, try adding each element i that is not yet in the mask. Compute dp[mask | (1 << i)] from dp[mask].
Answer: Typically dp[(1 << n) - 1], the state where all elements are included.

The time complexity is *O(2^n n) in most cases: there are 2^n possible masks, and for each mask you iterate over n elements. The space complexity is O(2^n)** for the DP table.

Each transition adds one element, moving upward through the subset lattice shown earlier. It is structurally a BFS through subsets, where each level adds one more element.

Example Walkthrough: Partition to K Equal Sum Subsets (LeetCode 698)

Problem Statement

Given an integer array nums and an integer k, return true if it is possible to divide the array into k non-empty subsets whose sums are all equal.

Example: nums = [4, 3, 2, 3, 5, 2, 1], k = 4 Total sum = 20, target sum per subset = 20 / 4 = 5. One valid partition: {5}, {4, 1}, {3, 2}, {3, 2}. Return true.

Constraints: 1 <= k <= nums.length <= 16

The constraint n <= 16 enables a bitmask DP solution (2^16 = 65536 states).

Intuition

The brute force tries every way to partition n elements into k groups, giving k^n combinations. The problem can be reframed.

Instead of asking "which group does each element go to," process elements one by one, adding each to the current bucket. When a bucket reaches the target sum, start a new bucket.

Define dp[mask] = the current sum within the bucket being filled, given that the elements indicated by mask have already been placed. When the sum reaches target, the bucket closes and the value resets to 0 (stored as sum_of_used_elements % target). If dp[mask] + nums[i] would exceed the target, element i is skipped for this bucket.

The answer is true if dp[(1 << n) - 1] is well-defined (every element was placed).

Step-by-Step Trace

Trace a few key transitions with nums = [4, 3, 2, 3, 5, 2, 1], k = 4, target = 5.

The value -1 means "this state is unreachable."

Every time the current bucket sum hits the target, the value resets to 0. Reaching the full mask with a non-negative dp value means every element was placed into a bucket of exactly target sum.

Implementation

Complexity Analysis

Time Complexity: O(2^n n). We iterate over all 2^n masks, and for each mask we try all n elements. With n = 16, this is about 16 65536 = roughly 1 million operations.
Space Complexity: O(2^n) for the DP array. With n = 16, this is 65536 integers, well within memory limits.

A Second Example: Travelling Salesman Problem

K-Subsets used dp[mask] with a single dimension. The Travelling Salesman Problem (TSP) is the canonical bitmask DP, and it needs a second dimension. The reason it needs one explains the general rule for choosing the state shape.

Problem Statement

Given an n x n matrix dist[i][j] representing the cost of travelling from city i to city j, find the minimum-cost tour that starts at city 0, visits every city exactly once, and returns to city 0. Typical interview constraint: n <= 20.

Why `dp[mask]` Is Not Enough

In K-Subsets, the value stored at a state (current bucket sum) depended only on which elements had been placed, not on the order. Two execution paths that arrived at the same mask with the same bucket sum were interchangeable.

TSP is different. The cost of extending a partial tour depends on which city you are currently at. Two paths that have visited the exact same set of cities can be sitting at different last cities, and the cheapest next edge depends on which one. A single-dimensional dp[mask] cannot distinguish those situations.

The fix is to add the current position to the state:

State: dp[mask][i] = minimum cost of a path that starts at city 0, visits exactly the cities indicated by mask, and ends at city i. The bit for i must be set in mask; otherwise the state is meaningless.

Recurrence

To be at (mask, i), the previous step must have been at some city j that is also in mask, with j != i, and the predecessor mask must be mask with bit i cleared. The transition:

dp[mask][i] = min over j in mask, j != i of ( dp[mask ^ (1 << i)][j] + dist[j][i] )

Base case: dp[1][0] = 0. Only city 0 has been visited; the tour starts there with zero cost.
Final answer: After every city has been visited, return to city 0: min over i != 0 of ( dp[(1 << n) - 1][i] + dist[i][0] ).

Implementation

Complexity Analysis

Time Complexity: O(2^n n^2). There are 2^n n states, and each state extends to up to n successor states. At n = 20 this is about 4 * 10^8 operations, borderline for a 1-second time limit. In practice n = 15-18 is the comfortable range.
Space Complexity: O(2^n * n). At n = 20, that is 20 million integers (around 80 MB) which can exceed memory limits. Many TSP variants on n = 20 require recursion-with-memoization that only materializes reachable states, not the full table.

The General Rule

The state needs whatever the future cost depends on. K-Subsets only cared about the running bucket sum, which is a function of mask alone. TSP cares about the running cost plus the current city, because the next edge cost depends on it. The same rule appears in:

Counting Hamiltonian paths that end at a specific node: dp[mask][i], same shape.
Shortest path visiting K specified waypoints: dp[mask][i] where mask ranges over subsets of the waypoint set.
Assignment with ordering constraints (e.g., job j must run after job i): again, dp[mask][last_job].

If the answer at a partial state depends on something more than "what's been used," that something belongs in the state. The shape of the bitmask DP follows from this requirement, not from the bitmask itself.

Enumerating All Submasks Of A Mask

K-Subsets and TSP both transition by adding one element at a time: mask -> mask | (1 << i). A different family of bitmask problems transitions by partitioning a mask into two submasks: pick a subset sub of mask, do something with sub and mask ^ sub separately, then combine. The classic example is Set Cover and any partition-style DP.

The transition needs to iterate over every subset of mask (not every subset of {0, ..., n - 1}). The standard idiom for this is the submask enumeration trick:

Why The Idiom Works

The expression (sub - 1) & mask produces the next smaller submask of mask, in strictly decreasing order. Two ideas combine:

sub - 1 flips the lowest set bit of sub to 0 and turns every bit below it to 1. Without the & mask, this would walk through every integer below sub.
& mask clears any bits that are not part of mask, so we only see numbers whose bit pattern is a subset of mask.

Starting from sub = mask, the iteration visits every submask of mask in decreasing order and stops just before sub = 0. The loop guard sub > 0 is correct: the loop body runs for every non-zero submask, and the post-condition sub = (sub - 1) & mask happens to compute mask itself when sub = 0, which would cause infinite recursion if the loop did not exit first. If the algorithm requires the sub = 0 case (the empty subset), handle it before or after the loop.

Why The Total Cost Is O(3^n), Not O(4^n)

A naive analysis says: there are 2^n possible masks, each with up to 2^n possible submasks, so the total work is 2^n * 2^n = 4^n. The actual bound is tighter: O(3^n).

The reason: instead of counting (mask, sub) pairs by mask, count them by the bit. Each bit can be in one of three states across the pair:

In mask and in sub.
In mask but not in sub.
In neither mask nor sub.

(The case "in sub but not in mask" cannot occur because sub is a submask of mask.)

So the total number of valid (mask, sub) pairs is 3^n. The submask enumeration idiom visits exactly these pairs and no others. For n = 16, 3^16 is about 43 million, still feasible; 4^16 would be 4 billion, infeasible.

Worked Example: Smallest Sufficient Team (LC 1125)

You are given req_skills, a list of skills (size m, up to 16), and people, a list of skill sets each person has. Return the indices of a smallest subset of people whose combined skill set covers all req_skills.

Treat each req_skills as a bit position. Each person's skill set is a bitmask peopleMask[i]. The DP state is dp[mask] = the indices of a smallest team whose combined skills are exactly mask. The transition: for each person p, dp[mask | peopleMask[p]] can be reached from dp[mask] by adding p.

For the smaller, partition-style variant ("split the required skills into two halves and solve each independently"), submask enumeration is the natural fit:

For LC 1125 specifically, the forward "add one person at a time" formulation is simpler and faster, but split-based variants (especially when subproblems have a known structure) need the submask enumeration.

When To Reach For Submask Enumeration

Use it when the recurrence at state mask reads from dp[sub] and dp[mask ^ sub] for some submask sub, not just from dp[mask without one element]. This shows up in:

Set cover with a partition structure ("split the universe into two and cover each half").
Optimal binary search tree when the state is a contiguous interval, generalized to a subset.
Assignment problems where the cost of assigning a subset to a sub-team is precomputed.

For the simpler "add one element" recurrences, the standard nested loop for (int i = 0; i < n; i++) if (!(mask & (1 << i))) is enough and cheaper. The submask idiom is a tool you reach for only when the recurrence demands it.

Quiz

Bitmask DP Quiz

10 quizzes

Introduction to Bitmask DP

What Is Bitmask DP?

Bit Manipulation Essentials

How to Identify Bitmask DP Problems

The Core Idea

Example Walkthrough: Partition to K Equal Sum Subsets (LeetCode 698)

Problem Statement

Intuition

Step-by-Step Trace

Implementation

Complexity Analysis

A Second Example: Travelling Salesman Problem

Problem Statement

Why `dp[mask]` Is Not Enough

Recurrence

Implementation

Complexity Analysis

The General Rule

Enumerating All Submasks Of A Mask

Why The Idiom Works

Why The Total Cost Is O(3^n), Not O(4^n)

Worked Example: Smallest Sufficient Team (LC 1125)

When To Reach For Submask Enumeration

Quiz

Bitmask DP Quiz

Get Premium

What Is Bitmask DP?

Bit Manipulation Essentials

How to Identify Bitmask DP Problems

The Core Idea

Example Walkthrough: Partition to K Equal Sum Subsets (LeetCode 698)

Problem Statement

Intuition

Step-by-Step Trace

Implementation

Complexity Analysis

A Second Example: Travelling Salesman Problem

Problem Statement

Why dp[mask] Is Not Enough

Recurrence

Implementation

Complexity Analysis

The General Rule

Enumerating All Submasks Of A Mask

Why The Idiom Works

Why The Total Cost Is O(3^n), Not O(4^n)

Worked Example: Smallest Sufficient Team (LC 1125)

When To Reach For Submask Enumeration

Quiz

Bitmask DP Quiz

Get Premium

Why `dp[mask]` Is Not Enough