Skip to main content
An image of a fake cover to a score generated by Midjourney modelled after Schubert's Trout Quintet title page, with nonsense letterings and the title embedded into the image subtly.
An image of a fake cover to a score generated by Midjourney modelled after Schubert's Trout Quintet title page, with nonsense letterings and the title embedded into the image subtly.

Der Verstiegenheit Quartett

17:44
8:50
4:54
11:13
4:51

Der Verstiegenheit Quartett (2022)

I was struck by this passage in Bloom's analysis of poetry, and in particular the "outside help" that he discusses as the rescue from extravagance. These concepts seemed to be speaking to a similar and related process in the creation of this music, which I undertook with the aid of a machine learning model trained on the totality of music for strings that I have written and recorded, measuring ~3.5 hours, and using RAVE (Realtime Audio Variational autoEncoder) developed by IRCAM. It uses both Representation Learning (using spectral distance to build a perceptually relevant latent space) and Adversarial Fine-tuning (where the encoder is frozen, and uses a multiscale discriminator to increase the synthesized audio quality). RAVE represents a tradeoff between quality and synthesis speed in Neural Audio Synthesis, and for this reason I chose it to explore the potentialities embedded in a model trained on my own work. The training relied on a Google Colab notebook executing Python code blocks (PyTorch) and took roughly 90 hours to complete. The end-result was a model object that parsed and identified various features of my music condensed into a black-box, an n-dimensional space of my creative output.

The first three movements (Landscape, Inner-self, Glance of another) were created using Unconditional Generation, where a 'prior' model is trained on trajectories yielded by the encoder and proposes new ones in an autoregressive fashion. Essentially the object generated output continuously, without any reference input to guide it. This yielded inconsistent results at times, and at other times was capable of generating the haunting, beautiful, and ethereal results that we hear.

The final three movements encode an input audio stream to latent representation, that the model then decodes and synthesizes using the data it was trained on to find a 'match' or isomorphic relationship within its own dynamic latent space dimensionality estimation. For these movements I used as input three contrasting works that I composed: a piece for solo piano (Estrangement); an electronic piece (Solipsism); an ambient electronic piece (Imagined glance of the precursor).

Music composed by Realtime Variational autoEncoder trained on 3.5 hours of recordings of music for various string ensembles by Gavin Gamboa

Recorded and Mixed using the nn~ externals for Max/MSP provided by IRCAM
Los Angeles, California, United States
September-November 2022

Special Thanks
Polina Powers

Artwork
Stable Diffusion prompt: "the extravagance string quartet album artwork, created by machine learning algorithms for a contemporary classical music avant-garde musical work" with CLIP guidance based on cover for Die Forelle by Franz Schubert (Published by Anton Diabelli, Vienna)

View Der Verstiegenheit Quartett in the digital garden.

/