Last week on Wednesday, I gave a seminar at LARGE (Learning Agents Research Group at Erasmus). Thanks to Wolfgang Ketter for both inviting me and more importantly the excellent discussion. The nice thing about seminars is that they often give you an opportunity to do something different. In this case, I did an expanded version of the October WAI talk I gave introducing myself to my new colleagues in the VU’s AI department. When I moved to the VU, I started trying to think about a better organization or perspective on my research. Something that encompassed what I’ve already done but also something that pointed towards the future. The thing that kept coming to my mind is that what we really need is tools that make remixing data as easy as it is to remix music or video. Essentially, why don’t we have Data DJs? (or should it just be DJ – Data Jockey??)
Indeed, we can see that a lot of what scientists (i.e. data analysis pros) do is remix data. In a happy coincidence, this idea was reinforced to me last night when I watched the documentary RiP: A Remix Manifesto. It’s both an entertaining and important documentary about the impact of strong intellectual property laws on creative freedom. Most importantly for this post, it describes the analogy between a music DJ and the practice of science in vivid terms. See the clip below starting at about 4 minutes in.
The other thing about the documentary to me was it showed how accessible the tools for working with music and video were to people. I think we can make tools that are just as good or better for data sitting in files, spreadsheets and databases. I think this DJ to data-analysis analogy is a powerful framework to think about how we can make such tools. I summarize the analogy as follows:
- records = data in one format (linked data?)
- turntable and mixers = end-user programming (workflows)
- recording equipment = capturing what goes on during data analysis (provenance)
The slides at the end of this post are from the talk I gave at LARGE, explaining how my research fits into this framework.
I want to be a Data DJ, do you?