Python word cloud from a html page: Difference between revisions
From wikiluntti
Line 13: | Line 13: | ||
==== References ==== | ==== References ==== | ||
https://data.solita.fi/finnish-stemming-and-lemmatization-in-python/ | |||
See [https://www.tarmo.fi/perusmuotoon/ Tarmo perusmuodoistaja] | See [https://www.tarmo.fi/perusmuotoon/ Tarmo perusmuodoistaja] |
Revision as of 20:50, 18 August 2021
Introduction
Analyze html tables using word clouds.
Theory
Fetching the table
Linguistic analyzation
The Finnish language is used, thus Voikko morphological analyzer is used to lemmatize the words into the base format.
References
https://data.solita.fi/finnish-stemming-and-lemmatization-in-python/