We use cookies (including Google cookies) to personalize ads and analyze traffic. By continuing to use our site, you accept our Privacy Policy.

Distinct Subsequences II

Difficulty: Hard


Problem Description

Given a string s, return the number of distinct non-empty subsequences of s. Since the answer may be very large, return it modulo 10^9 + 7. A subsequence of a string is a new string that is formed from the original string by deleting some (can be none) of the characters without disturbing the relative positions of the remaining characters.


Key Insights

  • A subsequence can be formed by choosing to include or exclude each character in the string.
  • The number of distinct subsequences can be derived using dynamic programming.
  • Care must be taken to avoid counting duplicate subsequences caused by repeating characters.

Space and Time Complexity

Time Complexity: O(n)
Space Complexity: O(n)


Solution

To solve the problem, we will use dynamic programming to keep track of the number of distinct subsequences up to each character in the string. We will maintain an array dp where dp[i] represents the number of distinct subsequences that can be formed from the substring s[0:i].

  1. Initialize a variable to keep track of the total number of distinct subsequences.
  2. Use a map to remember the last seen index of each character to handle duplicates.
  3. For each character in the string, calculate the number of distinct subsequences by considering both including and excluding the character.
  4. Update the dp array and the last seen index of the character.
  5. Return the total count of distinct subsequences modulo 10^9 + 7.

Code Solutions

def distinctSubseqII(s: str) -> int:
    mod = 10**9 + 7
    n = len(s)
    dp = [0] * (n + 1)
    dp[0] = 1  # Base case: empty subsequence
    
    last_seen = {}
    
    for i in range(1, n + 1):
        dp[i] = (2 * dp[i - 1]) % mod  # Include or exclude current character
        if s[i - 1] in last_seen:
            dp[i] = (dp[i] - dp[last_seen[s[i - 1]] - 1]) % mod  # Subtract duplicates
        last_seen[s[i - 1]] = i  # Update the last seen index

    return (dp[n] - 1) % mod  # Exclude the empty subsequence
← Back to All Questions