Problem Description
Given an array of strings words
, return the smallest string that contains each string in words
as a substring. If there are multiple valid strings of the smallest length, return any of them. You may assume that no string in words
is a substring of another string in words
.
Key Insights
- The problem requires constructing the shortest superstring that contains all given strings.
- A brute force approach would involve trying all permutations of the strings, which is inefficient.
- The optimal approach involves using a greedy algorithm combined with bit manipulation, where we keep track of used words and calculate overlaps.
- Overlapping is crucial to minimize the length of the resultant string.
Space and Time Complexity
Time Complexity: O(n^2 * 2^n) - where n is the number of strings, due to the combination of substring overlaps and permutations. Space Complexity: O(n * 2^n) - for storing the dynamic programming states.
Solution
To solve the problem, we can use a bitmask dynamic programming approach. We define a DP table where dp[mask]
represents the shortest superstring that can be formed using the words represented by the mask
. The mask
is a bit representation of which words have been used. We also maintain a function to calculate the overlap between two strings to minimize their combined length when concatenating them.
- Calculate the overlap between each pair of strings.
- Use a bitmask to represent the state of used strings.
- Update the DP table by trying to add each unused string to the current combination.
- The final result will be the shortest superstring found in the DP table.