* P1
** DONE Create a table with information of all documents
CLOSED: [2020-10-25 Sun 19:58]
| filename | type | encoding | language |
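
A minimal sketch of how rows for this table could be produced, using only the Python standard library. The helper names (=guess_encoding=, =describe=, =org_row=) and the list of candidate encodings are assumptions made for illustration, not the method actually used for this task. The language column is left as a placeholder, since detecting it would need a dedicated tool.

#+begin_src python
import mimetypes
from pathlib import Path

# Assumption: encodings are tried in this order; latin-1 accepts any byte
# sequence, so it acts as a catch-all rather than a confirmed detection.
CANDIDATE_ENCODINGS = ("ascii", "utf-8", "latin-1")


def guess_encoding(data):
    """Return the first candidate encoding that decodes the bytes cleanly."""
    for enc in CANDIDATE_ENCODINGS:
        try:
            data.decode(enc)
            return enc
        except UnicodeDecodeError:
            continue
    return "unknown"


def describe(path):
    """Return (filename, type, encoding, language) for one document."""
    path = Path(path)
    mime, _ = mimetypes.guess_type(path.name)  # guessed from the extension only
    encoding = guess_encoding(path.read_bytes())
    # Hypothetical placeholder: language detection is not attempted here.
    return (path.name, mime or "unknown", encoding, "unknown")


def org_row(fields):
    """Format a sequence of strings as one Org table row."""
    return "| " + " | ".join(fields) + " |"
#+end_src

Calling =org_row(describe(p))= for each document path =p= would then print one row per file under the header above.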
** DONE Extract all URLs
CLOSED: [2020-10-25 Sun 22:14]
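
One possible way to do the extraction is a regular expression over the document text. The sketch below is an assumption about the approach, not the code used here: the pattern only covers =http= and =https= URLs, and =extract_urls= is a hypothetical helper name.

#+begin_src python
import re

# Match http(s) URLs up to the next whitespace, quote, angle bracket,
# closing parenthesis or closing square bracket.
URL_RE = re.compile(r"https?://[^\s<>\"')\]]+")


def extract_urls(text):
    """Return all URLs found in text, in order of appearance."""
    # Trailing sentence punctuation is usually not part of the URL.
    return [match.rstrip(".,;:!?") for match in URL_RE.findall(text)]
#+end_src

A simple pattern like this will miss URLs without a scheme (for example =www.example.org=) and may keep or drop characters that a full URL parser would treat differently.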
** DONE Write to a file all word occurrences and frequencies
CLOSED: [2020-10-25 Sun 23:40]
Sorted in decreasing order of frequency.
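
The counting and sorting step can be sketched as follows. The tokenization rule (lowercase, split on =\w+=), the tab-separated output format, and the function names are all assumptions for illustration. Ties are broken alphabetically so the output is deterministic.

#+begin_src python
import re
from collections import Counter


def word_frequencies(text):
    """Return (word, count) pairs, most frequent first."""
    # Assumption: a "word" is a run of letters, digits or underscores.
    words = re.findall(r"\w+", text.lower())
    return sorted(Counter(words).items(), key=lambda kv: (-kv[1], kv[0]))


def write_frequencies(pairs, path):
    """Write one 'word<TAB>count' line per pair to path."""
    with open(path, "w", encoding="utf-8") as fh:
        for word, count in pairs:
            fh.write(f"{word}\t{count}\n")
#+end_src

For example, =word_frequencies("b a b c a b")= returns =[('b', 3), ('a', 2), ('c', 1)]=.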
** DONE Plot word frequencies
CLOSED: [2020-10-29 Thu 13:11]
With gnuplot, using documents in at least 3 different languages.
We'll fit the word frequencies to the Booth and Federowicz equation.
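
These notes do not spell out the equation, so the sketch below does not claim to implement it. As a stand-in it assumes a Zipf-style power law \(f(r) = k\, r^{-\beta}\), where \(r\) is the rank of a word and \(f\) its frequency, and estimates \(k\) and \(\beta\) by least squares on the log-log data. The function name =fit_power_law= and the functional form are assumptions. gnuplot's own =fit= command could be used instead.

#+begin_src python
import math


def fit_power_law(frequencies):
    """Fit f(r) = k * r**(-beta) to frequencies sorted in decreasing order.

    Ranks start at 1. Needs at least two strictly positive frequencies.
    Returns the pair (k, beta).
    """
    # Linearize: log f = log k - beta * log r, then do ordinary least squares.
    xs = [math.log(rank) for rank in range(1, len(frequencies) + 1)]
    ys = [math.log(freq) for freq in frequencies]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    k = math.exp(mean_y - slope * mean_x)
    return k, -slope
#+end_src

On data that follows the assumed law exactly, such as frequencies of =100 / r= for ranks 1 to 5, the fit recovers =k= close to 100 and =beta= close to 1.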