In my previous blog post (Are there any language detection tools for assigning language to music data?), I descibed my failed attempts att concatenating Artist Origin (or, to be more precise, the artists origin with respect to the language sung in) to a dataset created from Spotify's Web developer API. This information used to be available... Continue Reading →
Are there any language detection tools for assigning language to music data?
Music is a matter of taste and some of us have....how should I put it? different ideas of what is good music and what is trash that should never have seen the day of light. I am, since a few years back, a huge fan of Chinese Hip Hop and Rap (哈狗帮,龙井说唱 and 龍胆紫 )... Continue Reading →
Working with large csv-files in pandas? Create a SQL-database by reading files in chunks
It is not uncommon to have to deal with for instance csv-files containing millions of rows. Searching, filtering and slicing can therefore be time-consuming tasks. So, the question is then: Are there any ways to speed up the process? If so, this could save a considerable amount of time for any data scientist needing to... Continue Reading →