[Python] Wordclouds for your book library

Previously. Wolfram Mathematica is a great tool, but a proprietary one. Here is a solution using Python library.

% pip3 install wordcloud

create_wordcloud.sh:

OUT=$1.wordcloud.png
if [ ! -f "$OUT" ]; then
        echo processing $1
        wordcloud_cli --text "$1" --imagefile "$OUT" --width 1000 --height 1000
fi

create_wordclouds.sh:

find . -size +4096c -name '*.my-txt' -type f -exec ./create_wordcloud.sh {} \;

(Too short (erroneous) my-txt files to be skipped.)

And no coding required! Additional info about wordcloud_cli options.

But one problem is that it has a hard time handing Russian language. Probably, stopwords should be added. Maybe this problem is present for other non-English texts.

Examples:

(the post first published at 20221120.)


List of my other blog posts.

Subscribe to my news feed

Yes, I know about these lousy Disqus ads. Please use adblocker. I would consider to subscribe to 'pro' version of Disqus if the signal/noise ratio in comments would be good enough.