Step 4: Unveiling the Duplicates
Identifying Duplicates Made Easy
After painstakingly counting the frequency of each word, finding duplicates becomes a cakewalk. All we have to do is sift through our occurrences HashMap and pinpoint the words that have a frequency of 2 or more. These are our coveted duplicates.
Code Snippet for Identifying Duplicates
Here's how you can extract the duplicates from our occurrences HashMap:
duplicates = []
for word, count in occurrences.items():
    if count > 1:
        duplicates.append(word)
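As a side note, Python's standard library offers collections.Counter, which collapses the counting step into a single call. A minimal sketch (the sample sentence here is purely illustrative, not part of the lesson's input):

```python
from collections import Counter

# Counter builds the word -> frequency map in one pass
occurrences = Counter("the cat sat on the mat the end".split())

# Keep only the words that appear more than once
duplicates = [word for word, count in occurrences.items() if count > 1]
print(duplicates)  # → ['the']
```

This is functionally equivalent to the manual loop above, and Counter preserves first-seen insertion order just like a plain dict.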
Analyzing the Complexity of Our Final Solution
Time Complexity Revisited
Let n be the number of words in our input string s.

- Populating the occurrences HashMap takes O(n) time.
- Traversing the occurrences HashMap to identify duplicates also takes O(n) time.

Summing these up, our algorithm runs in linear O(n) time, which is remarkably efficient.
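To make the two O(n) passes concrete, here is a small sketch using a hypothetical count_operations helper (not part of the solution itself) that tallies one unit of work per loop iteration:

```python
def count_operations(words):
    # First pass: one unit of work per word to build the frequency map
    occurrences = {}
    ops = 0
    for word in words:
        ops += 1
        occurrences[word] = occurrences.get(word, 0) + 1
    # Second pass: one unit per distinct word to scan for duplicates
    for _ in occurrences:
        ops += 1
    return ops

# 4 words, 3 distinct -> 4 + 3 = 7 units; the work grows linearly with input size
print(count_operations(["a", "b", "a", "c"]))  # → 7
```

Doubling the number of words at most doubles the total, which is exactly what linear O(n) growth means.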
Space Complexity Revisited
We've used a HashMap (occurrences) to store the frequency of each word, and a list (duplicates) to store the duplicate words. In the worst-case scenario, each word in the sentence is unique, making our space complexity linear, O(n).
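A quick sketch of that worst case (the sentence here is chosen purely for illustration): when no word repeats, the HashMap grows to exactly n entries.

```python
# Worst case: every word is unique, so the map stores one entry per word
s = "each word here is different"
occurrences = {}
for word in s.lower().split(' '):
    occurrences[word] = occurrences.get(word, 0) + 1

n = len(s.split(' '))
print(len(occurrences) == n)  # all n words get their own entry → True
```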
# Putting it all together
s = "Original String Original String"
split_s = s.lower().split(' ')

# Count how many times each word appears
occurrences = {}
for word in split_s:
    if word not in occurrences:
        occurrences[word] = 1
    else:
        occurrences[word] += 1

# Collect every word that appears more than once
dupes = []
for k in occurrences:
    if occurrences[k] > 1:
        dupes.append(k)

print(dupes)
OUTPUT
['original', 'string']