Enumerating words can be helpful. With a simple command (tokens_ngrams), in R we also got lists of word pairs and triples and so on. But usually we want to know which words appear within the same documents. W hat about co-occurences within the same song? Which words appear in the same songs? With the fcm() command in R we obtain a neat little table, more or less like the following . ich du gut liebe kalt bitte sonne ich 2775 1289 474 550 352 25 143 du 0 920 209 183 279 25 19 gut 0 0 91 19 0 2 20 lust 627 1 2 0 280 0 1 deutschl 209 532 0 76 152 0 171 liebe 140 55 41 11 kalt 84 28
Rammstein Read by the Machine
The literary quality of these texts is evident. If you have studied German literature, you cannot ignore the allusions not only to GDR songs and brothers Grimm, but also to Trakl and French Symbolism. But how can we analyze Rammstein texts? I want to make them read by the machine. The programming languages R and Python offer a lot of packages with interesting methods of getting into the texts. I am curious about how we can, by machine Learning, find something out.