Problem Description
Write a bash script to calculate the frequency of each word in a text file words.txt
. For simplicity sake, you may assume that words.txt
contains only lowercase characters and space characters. Each word must consist of lowercase characters only, and words are separated by one or more whitespace characters.
Key Insights
- The input file contains only lowercase letters and spaces.
- Each word is separated by one or more whitespace characters.
- The output should be sorted by descending frequency of the words.
- Each word's frequency count is guaranteed to be unique.
Space and Time Complexity
Time Complexity: O(n log n) - where n is the number of distinct words, due to the sorting step. Space Complexity: O(n) - to store the word counts.
Solution
To solve the problem, we will read the contents of the words.txt
file, split the text into individual words, count the occurrences of each word using a hash map (or dictionary), and then sort the result by the frequency of the words. The final output will be formatted to display each word along with its frequency count.