Cowriting an Album With AI
Why a neural net is like ‘a chain of guitar pedals’

These days, artificial intelligence is showing off its creative chops.
GPT-2 and GPT-3 generate text so well that people are using them to author text-adventure games and books of poetry. Visual artists are using image-generation AI to create neural-net paintings. You can create utterly photorealistic pictures of people who don’t exist.
But in the world of music, things have lagged a bit. Certainly, there are some cool tools out there — like Google’s Magenta music-creation AI, which you can use to autocomplete MIDI melodies or have a piano plink out a tune while you keysmash. The most ambitious is probably OpenAI’s “Jukebox”, which generates entire new songs in the style of well-known musical artists, including lyrics and (crude) singing.
But the truth is, the music tools are less polished and less complete than text or image generation. The tunes these tools produce are usually only parts of songs — or with Jukebox, the songs are complete but very lo-fi, and they don’t have clear verse-chorus structures. With music AI, you can’t just set-it-and-forget-it, push a button and have a creation come out.
Why? I suspect it’s because music is so deeply multidimensional, more so even than textual or visual art. Timing, timbre, human voicing, structure, mood, a welter of instrument voices and styles: there’s just so much going on in music that AI has more trouble cranking it out all on its own.
I’m sure technologists will eventually get there! But for now, AI needs help.
More precisely, it needs collaboration — collaboration with humans.
Recently, Robin Sloan decided to explore this idea, by creating an album using Jukebox as a writing partner. Sloan is a friend of mine from the early, antediluvian days of blogging, and also a phenomenal author; he’s written books like Mr. Penumbra’s 24-Hour Bookstore and novellas like Annabel Scheme. But he’s also a talented programmer who, a few years back, got interested in using neural nets to generate…