490 B
490 B
P1
DONE Create a table with information of all documents
CLOSED: [2020-10-25 Sun 19:58]
filename | type | encoding | language |
DONE Extract all URLs
CLOSED: [2020-10-25 Sun 22:14]
DONE Write to a file all word occurrences and frequencies
CLOSED: [2020-10-25 Sun 23:40] Sorted in a decreasing manner
DONE Plot word frequencies
CLOSED: [2020-10-29 Thu 13:11] With gnuplot, with documents of at least 3 different languages. We'll fit this to the Booth and Federowicz equation