
The Rammstein Mystery (0)

In Italy, in Germany, in Poland. Teaching German can suddenly seem exciting when we listen to Rammstein in class. Some students, as always, do not care, but there are only a few of them. Some will be repelled, but a large group will start nodding and shouting as they begin to understand the lyrics of songs they have listened to for years, or at least during a certain period of their lives.

When "du hasst" turns into "du hast mich gefragt", and back again, a light goes on in the classroom. You may not love the industrial hammering of the music, you may be shocked by hearing aggressive German shouting, but the literary quality of these texts is evident. If you have studied German literature, you cannot ignore the allusions not only to GDR songs and the Brothers Grimm, but also to Trakl and French Symbolism.

But how can we analyze Rammstein texts? Isolate motifs and topics? See how they work? I want to try having the machine read them. The programming languages R and Python offer many packages with interesting methods for getting into texts. I am curious about what we can find out through machine learning.

I built a corpus of 92 songs that I copied one by one from the Textemania website (not scraped, as we need to be sure about the purity of the material). The spelling had to be corrected, as the German characters ä, ü, ö and ß are not always rendered correctly there. The result is 92 files, each named with the year of the album and the song title.
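Part of this spelling check could be automated. The following base R sketch assumes that the damage is the common kind where UTF-8 text was mis-read as Windows-1252 (so that "ß" shows up as "ÃŸ"). That is only an assumption, since the errors on the website may be of a different kind. The helper names `has_mojibake` and `fix_umlauts` are hypothetical, and only the lowercase umlauts and ß are covered.

```r
# Assumption: broken characters are UTF-8 bytes mis-read as Windows-1252.
# Left: the garbled two-character sequences; right: the intended letters.
mojibake <- c("\u00c3\u00a4", "\u00c3\u00b6", "\u00c3\u00bc", "\u00c3\u0178")
repaired <- c("\u00e4", "\u00f6", "\u00fc", "\u00df")  # ä, ö, ü, ß

# Hypothetical helper: flag strings containing "Ã", which hardly ever
# occurs in German text and is typical for this kind of damage.
has_mojibake <- function(x) {
  grepl("\u00c3", x, fixed = TRUE)
}

# Hypothetical helper: replace each garbled sequence by the intended letter.
fix_umlauts <- function(x) {
  for (i in seq_along(mojibake)) {
    x <- gsub(mojibake[i], repaired[i], x, fixed = TRUE)
  }
  x
}

# Two song titles, the first one deliberately garbled for the demonstration
titles <- c("Wei\u00c3\u0178es Fleisch", "Laichzeit")
flagged <- has_mojibake(titles)
cleaned <- fix_umlauts(titles)
```

Flagged strings should still be proofread by hand, as the sketch only catches this one pattern.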

Over the next few weeks, I will give it a try.


The technical side

The 92 files have names like

R1995 Laichzeit.txt

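R script:

library("quanteda")
library("readtext")
library("quanteda.textstats")
library("quanteda.textplots")
library("rtweet")  # not used in the steps shown here

# Read all song files; the file names become the document ids
basis <- readtext("/Users/Achim/Desktop/Rammstein/*.txt",
                  docvarsfrom = "filenames")

# Strip the leading "R" and the ".txt" extension from the ids
arbname <- basis$doc_id
arbname <- gsub("^R", "", arbname)
arbname <- gsub("\\.txt$", "", arbname)

# Characters 1 to 4 are the album year; the title starts at character 6
jahr <- substring(arbname, 1, 4)
titel <- substring(arbname, 6, 40)

# Build the corpus and attach year and title as document variables
basis <- corpus(basis)
docvars(basis, "year") <- jahr
docvars(basis, "title") <- titel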
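The file-name parsing can be tried out in isolation, without quanteda or the song files. This is a minimal base R sketch, assuming that every file follows the pattern "R", four-digit year, space, title, ".txt", as in the example name above. The data frame `meta` is only there for inspection and is not part of the original script.

```r
# Example file name from the post; further names would follow the same pattern
doc_id <- c("R1995 Laichzeit.txt")

# Drop the leading "R" and the file extension
arbname <- sub("^R", "", doc_id)
arbname <- sub("\\.txt$", "", arbname)

# R counts characters from 1: positions 1 to 4 hold the year,
# position 5 is the space, the title starts at position 6
jahr <- substring(arbname, 1, 4)
titel <- substring(arbname, 6)

# Collect the results for a quick visual check
meta <- data.frame(doc_id = doc_id, year = jahr, title = titel,
                   stringsAsFactors = FALSE)
print(meta)
```

If the extension is not removed first, the title ends up as "Laichzeit.txt", which would then show up in every plot label.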


