Python word cloud from a html page: Difference between revisions

From wikiluntti
(Created page with "== Introduction == Analyze html tables using word clouds. == Theory == === Fetching the table === === Linguistic analyzation === === The word cloud === === Save in svg f...")
 
Line 8: Line 8:


=== Linguistic analyzation ===
=== Linguistic analyzation ===
The Finnish language is used, thus [https://voikko.puimula.org/ Voikko] morphological analyzer is used to lemmatize the words into the base format.
==== References ====
https://data.solita.fi/finnish-stemming-and-lemmatization-in-python/


=== The word cloud ===
=== The word cloud ===

Revision as of 20:48, 18 August 2021

Introduction

Analyze html tables using word clouds.

Theory

Fetching the table

Linguistic analyzation

The Finnish language is used, thus Voikko morphological analyzer is used to lemmatize the words into the base format.

References

https://data.solita.fi/finnish-stemming-and-lemmatization-in-python/

The word cloud

Save in svg format

Exercises