Problem Description
Given two strings str1
and str2
, return the shortest string that has both str1
and str2
as subsequences. If there are multiple valid strings, return any of them.
Key Insights
- A subsequence is formed by deleting some characters from a string without rearranging the remaining characters.
- The shortest common supersequence (SCS) can be derived using the concept of the longest common subsequence (LCS).
- The length of the SCS can be calculated as:
length(SCS) = len(str1) + len(str2) - len(LCS)
. - Dynamic programming can be used to efficiently compute the LCS which will then help in constructing the SCS.
Space and Time Complexity
Time Complexity: O(m * n) where m and n are the lengths of str1 and str2, respectively.
Space Complexity: O(m * n) for the DP table used to store LCS values.
Solution
To solve the problem, we can use dynamic programming to find the longest common subsequence (LCS) between the two strings. Once we have the LCS, we can construct the shortest common supersequence (SCS) by merging the characters from both strings while maintaining the order of characters according to the LCS.
- Create a 2D array
dp
wheredp[i][j]
will hold the length of the LCS ofstr1[0...i-1]
andstr2[0...j-1]
. - Fill this array using the rules of LCS:
- If characters match, increment the count from the previous indices.
- If not, take the maximum count from either the previous character of
str1
orstr2
.
- Once we have the length of the LCS, we can backtrack through the
dp
array to construct the SCS by adding characters from both strings, ensuring the LCS characters are preserved.