Word Summary #5
Open
Description
- Project Name: Word Summary
- Base Description: Generate a summary of the words that appear in a given string including the total word count and the number of occurrences of each word.
- Extra Description: The summary should give the total word count as well as a word occurrence summary that ties each word to the number of times that word occurs. The summary should be output to a file in a standardized format such as JSON, YAML, etc. You will have to consider how to deal with punctuation and other special characters. A hyphenated word is a single word, but a dash (-) in another context may need to be removed while computing the summary. What about contractions and quoted areas of text? You will also need to choose how to handle case, whether you want your summary to be case-sensitive or -insensitive.
- Sample Input/Output:
// input
"This string is the summarized string."
// output
summary: {
  total: 6,
  words: {
    "This": 1,
    "string": 2,
    "is": 1,
    "the": 1,
    "summarized": 1
  }
}
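The sample above can be reproduced with a short sketch like the following. It is one possible reading of the requirements, not a reference solution: the names `WORD_RE`, `summarize`, and `write_summary` are made up for illustration, JSON is picked arbitrarily as the output format, and the tokenization rule (letters and digits, optionally joined by internal hyphens or apostrophes) is an assumption about how hyphenated words, contractions, stray dashes, and quotes should be treated.

```python
import json
import re
from collections import Counter

# Assumed word rule: runs of letters/digits, optionally joined by an
# internal hyphen or apostrophe ("well-known", "don't"). A free-standing
# dash or a surrounding quote mark never matches, so it is dropped.
WORD_RE = re.compile(r"[^\W_]+(?:['\u2019-][^\W_]+)*")


def summarize(text, case_sensitive=True):
    """Return the total word count and per-word occurrence counts."""
    if not case_sensitive:
        text = text.lower()
    words = WORD_RE.findall(text)
    return {"summary": {"total": len(words), "words": dict(Counter(words))}}


def write_summary(text, path, case_sensitive=True):
    """Write the summary to `path` as JSON (format chosen for this sketch)."""
    with open(path, "w", encoding="utf-8") as fh:
        json.dump(summarize(text, case_sensitive), fh, indent=2)


if __name__ == "__main__":
    print(json.dumps(summarize("This string is the summarized string."), indent=2))
```

With `case_sensitive=False`, "This" and "this" would be counted as the same word; which default is right depends on the choice the description leaves open.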
- Extensions:
- Produce a word summary of a file.
- Enhance your program so that it can give a proper word count for a markdown file.
- Programmatically compare your output with the output of a word count feature of an NLP library.
- Categories: Text, Words, NLP
- Resources:
- Sources:
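For the file and markdown extensions, one rough approach is to strip markdown syntax with a few regular expressions before counting. The sketch below is only an approximation under stated assumptions: `strip_markdown`, `count_words`, and `summarize_markdown_file` are hypothetical names, fenced code blocks are assumed not to count as prose, counting is case-insensitive, and the regexes cover only common constructs (they are not a full markdown parser, and removing `_` also splits identifiers such as snake_case names).

```python
import re
from collections import Counter


def strip_markdown(md):
    """Remove common markdown syntax, keeping the readable text."""
    # Assumption: fenced code blocks are not prose, so drop them entirely.
    md = re.sub(r"```.*?```", " ", md, flags=re.DOTALL)
    # Inline code: keep the content, drop the backticks.
    md = re.sub(r"`([^`]*)`", r"\1", md)
    # Images and links: keep the alt/link text, drop the target.
    md = re.sub(r"!?\[([^\]]*)\]\([^)]*\)", r"\1", md)
    # Heading, blockquote, and list markers at the start of a line.
    md = re.sub(r"^[ \t]{0,3}(?:#{1,6}|>|[-*+]|\d+\.)[ \t]+", "", md,
                flags=re.MULTILINE)
    # Emphasis and strikethrough markers.
    md = re.sub(r"[*_~]{1,3}", "", md)
    return md


def count_words(text):
    """Case-insensitive count; hyphenated words and contractions stay whole."""
    words = re.findall(r"[^\W_]+(?:['-][^\W_]+)*", text.lower())
    return len(words), Counter(words)


def summarize_markdown_file(path):
    """Read a markdown file and return (total, per-word counts)."""
    with open(path, encoding="utf-8") as fh:
        return count_words(strip_markdown(fh.read()))
```

The totals from a sketch like this could then be compared against the word count reported by an NLP library's tokenizer; differences would mostly come from how each side treats hyphens, contractions, and code.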