Problem Description

Question

Given two strings s and t, return true if s is a subsequence of t, or false otherwise.

A subsequence of a string is a new string that is formed from the original string by deleting some (can be none) of the characters without disturbing the relative positions of the remaining characters. (i.e., "ace" is a subsequence of "abcde" while "aec" is not).

Example 1:

Input: s = "abc", t = "ahbgdc"

Output: true

Example 2:

Input: s = "axc", t = "ahbgdc"

Output: false

Constraints:

0 <= s.length <= 100
0 <= t.length <= 10⁴
s and t consist only of lowercase English letters.

Follow up: Suppose there are lots of incoming s, say s₁, s₂, ..., s_k where k >= 10⁹, and you want to check one by one to see if t has its subsequence. In this scenario, how would you change your code?

Solve it on LeetCode

Understanding the Problem

Before diving into solutions, let's make sure we understand what a subsequence is and how it differs from a substring.

Subsequence vs Substring:

A substring is a contiguous sequence of characters within a string. For "abcde", substrings include "abc", "bcd", "cde", "ab", etc.
A subsequence is a sequence that can be derived by deleting zero or more elements from the original sequence without changing the order of remaining elements. For "abcde", subsequences include "ace", "abd", "ae", "bd", "e", and even the empty string.

The key difference: in a substring, characters must be adjacent; in a subsequence, they can be scattered but must maintain their relative order.

Let's visualize this with an example:

Another way to think about it: imagine you are reading through t from left to right, and you are trying to "check off" each character of s in order. If you can check off all characters of s by the time you finish reading t, then s is a subsequence of t.

Approaches

1. Two Pointers (Optimal for Single Query)

Intuition

The most natural way to solve this problem is to simulate the process of matching characters. We use two pointers: one to track our current position in s (the pattern we are looking for) and another to scan through t (the string we are searching in).

As we scan through t, whenever we find a character that matches the current character we are looking for in s, we advance our pointer in s. If we manage to match all characters in s before (or exactly when) we finish scanning t, then s is a subsequence.

Algorithm

Initialize two pointers: i = 0 (for string s) and j = 0 (for string t)
While both pointers are within bounds:

If s[i] == t[j], we found a match, so increment i
Always increment j to continue scanning t

After the loop, if i == s.length(), we matched all characters in s, so return true
Otherwise, return false

Code

Complexity Analysis

Time Complexity: O(n) where n is the length of t

We traverse t at most once
Each character comparison is O(1)
In the worst case, we scan all of t (when s is not a subsequence or matches at the very end)

Space Complexity: O(1)

We only use two pointer variables
No additional data structures needed

This is the optimal solution for a single query. But what happens when we have many queries?

2. Binary Search (Optimized for Multiple Queries)

Intuition

The follow-up question changes everything. If we have billions of strings s1, s2, ..., sk to check against a single t, the two-pointer approach gives us O(k * n) time where n is the length of t. Can we do better?

The key insight is that we are repeatedly traversing the same string t. If we preprocess t, we can answer each query faster.

Here is the idea: for each character in the alphabet, precompute all the positions where that character appears in t. Store these positions in sorted lists. Then, for each query string s, we use binary search to find valid positions.

For example, if t = "ahbgdc":

'a' appears at positions [0]
'b' appears at positions [2]
'c' appears at positions [5]
'd' appears at positions [4]
'g' appears at positions [3]
'h' appears at positions [1]

When checking if s = "abc" is a subsequence:

Find 'a': binary search in [0] for position >= 0. Found at index 0. Next search starts from position > 0.
Find 'b': binary search in [2] for position > 0. Found at index 2. Next search starts from position > 2.
Find 'c': binary search in [5] for position > 2. Found at index 5. All characters matched!

Why Binary Search?

For each character in s, we need to find the first occurrence in t that comes after our current position. Since the position lists are sorted, binary search gives us this in O(log n) time instead of O(n) with linear search.

Algorithm

Preprocessing: Create a map where each key is a character and each value is a list of indices where that character appears in t
Query Processing: For each string s:

Initialize currentPos = -1 (we start before the string)
For each character c in s:

Get the list of positions for c
Binary search for the smallest position > currentPos
If found, update currentPos to this position
If not found, return false

If all characters are matched, return true

Code

Complexity Analysis

Preprocessing Time: O(n) where n is the length of t

We scan through t once to build the position map
Each insertion into the map is O(1) amortized

Preprocessing Space: O(n)

We store at most n positions total (each character in t contributes one position)
The map has at most 26 keys (lowercase English letters)

Query Time: O(m * log n) where m is the length of query string s

For each of the m characters in s, we perform one binary search
Each binary search is O(log n) in the worst case (when all characters in t are the same)

Total Time for k Queries: O(n + k m log n)

This is significantly better than O(k * n) from the two-pointer approach when k is large
The crossover point depends on the lengths: binary search wins when k is large or m is small relative to n

Is Subsequence

Ashish Pratap Singh

Problem Description

Example 1:

Example 2:

Constraints:

Solve it on LeetCode

Understanding the Problem

Subsequence vs Substring:

Approaches

1. Two Pointers (Optimal for Single Query)

Intuition

Algorithm

Code

2. Binary Search (Optimized for Multiple Queries)

Intuition

Algorithm

Code

Get Premium