We use cookies (including Google cookies) to personalize ads and analyze traffic. By continuing to use our site, you accept our Privacy Policy.

Word Frequency

Difficulty: Medium


Problem Description

Write a bash script to calculate the frequency of each word in a text file words.txt. For simplicity sake, you may assume that words.txt contains only lowercase characters and space characters. Each word must consist of lowercase characters only, and words are separated by one or more whitespace characters.

Key Insights

  • The input file contains only lowercase letters and spaces.
  • Each word is separated by one or more whitespace characters.
  • The output should be sorted by descending frequency of the words.
  • Each word's frequency count is guaranteed to be unique.

Space and Time Complexity

Time Complexity: O(n log n) - where n is the number of distinct words, due to the sorting step. Space Complexity: O(n) - to store the word counts.

Solution

To solve the problem, we will read the contents of the words.txt file, split the text into individual words, count the occurrences of each word using a hash map (or dictionary), and then sort the result by the frequency of the words. The final output will be formatted to display each word along with its frequency count.

Code Solutions

from collections import Counter

# Read the contents of the file
with open('words.txt', 'r') as file:
    words = file.read().split()

# Count the frequency of each word
word_count = Counter(words)

# Sort the words by frequency in descending order and print
for word, count in word_count.most_common():
    print(word, count)
← Back to All Questions