A Dataset of String Ensemble Recordings and Onset Annotations for Timing Analysis
Maciek Tomczak, Susan Min Li, and Massimiliano Di Luca
Extended Abstracts for the Late-Breaking Demo Session of the International Society of Music Information Retrieval Conference (ISMIR), Milan, Italy, 2023
In this paper, we present Virtuoso Strings, a dataset for timing analysis and automatic music transcription (AMT) tasks requiring note onset annotations. This dataset takes advantage of real-world recordings in multitrack format and is curated as a component of the Augmented Reality Music Ensemble (ARME) project, which investigates musician synchronisation and multimodal music analysis. The dataset comprises repeated recordings of quartet, trio, duet and solo ensemble performances. Each performance showcases varying temporal expressions and leadership role assignments, providing new possibilities for developing and evaluating AMT models across diverse musical styles. To reduce the cost of the labour-intensive manual annotation, a semi-automatic method was utilised for both annotation and quality control. The dataset features 746 tracks, totalling 68,728 onsets. Each track includes onset annotations for a single string instrument. This design facilitates the generation of audio files with varied instrument combinations for use in the AMT evaluation process.
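The per-instrument tracks and onset lists lend themselves to programmatic construction of evaluation mixes. Below is a minimal Python sketch of that idea; the file names, the mono stems, and the one-onset-time-per-line annotation format are illustrative assumptions rather than the released dataset's actual layout.

```python
import numpy as np
import soundfile as sf

def load_onsets(path):
    # Onset times in seconds, one value per line (assumed annotation format).
    with open(path) as f:
        return np.array([float(line.split()[0]) for line in f if line.strip()])

def mix_stems(stem_paths):
    # Sum time-aligned mono stems into one ensemble mix, padding to the longest stem.
    stems = [sf.read(p) for p in stem_paths]
    sr = stems[0][1]
    n = max(len(audio) for audio, _ in stems)
    mix = sum(np.pad(audio, (0, n - len(audio))) for audio, _ in stems)
    return mix / max(1e-9, np.max(np.abs(mix))), sr

# Hypothetical duo mix: violin and cello stems plus their per-instrument onset lists.
stem_files = ["violin_1.wav", "cello.wav"]
mix, sr = mix_stems(stem_files)
onsets = np.sort(np.concatenate([load_onsets(p.replace(".wav", "_onsets.txt"))
                                 for p in stem_files]))
sf.write("duo_mix.wav", mix, sr)
```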
AudioMostly
Onset Detection for String Instruments Using Bidirectional Temporal and Convolutional Recurrent Networks
Maciek Tomczak and Jason Hockman
Proceedings of the Audio Mostly Conference, Edinburgh, United Kingdom, ACM, New York, NY, USA, 2023
Recent work in note onset detection has centred on deep learning models such as recurrent neural networks (RNNs), convolutional neural networks (CNNs) and, more recently, temporal convolutional networks (TCNs), which achieve high evaluation accuracies for onsets characterised by clear, well-defined transients, as found in percussive instruments. However, onsets with less pronounced transients, as found in string instrument recordings, remain a difficult challenge for state-of-the-art algorithms. This challenge is further exacerbated by a paucity of string instrument data containing expert annotations. In this paper, we propose two new models for onset detection using bidirectional temporal and convolutional recurrent networks, which generalise to polyphonic signals and string instruments. We evaluate the proposed methods alongside state-of-the-art onset detection algorithms on a benchmark dataset from the MIR community, as well as on a test set from a newly proposed dataset of string instrument recordings with note onset annotations, comprising approximately 40 minutes of audio and over 8,000 annotated onsets with varied expressive playing styles. The results demonstrate the effectiveness of both presented models, which outperform the state-of-the-art algorithms on string recordings while maintaining comparable performance on other types of music.
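For readers unfamiliar with this family of models, the sketch below shows a generic convolutional recurrent onset detector in PyTorch: convolutional layers over a log-mel spectrogram, a bidirectional GRU over time, a frame-wise sigmoid onset activation, and naive peak picking. The layer sizes and the peak-picking rule are illustrative assumptions, not the architectures proposed in the paper.

```python
import torch
import torch.nn as nn

class ConvRecurrentOnsetNet(nn.Module):
    """Frame-wise onset activation from a log-mel spectrogram (batch, 1, n_mels, n_frames)."""
    def __init__(self, n_mels=80, hidden=64):
        super().__init__()
        self.conv = nn.Sequential(
            nn.Conv2d(1, 16, kernel_size=3, padding=1), nn.ReLU(),
            nn.MaxPool2d(kernel_size=(2, 1)),            # pool frequency only, keep time resolution
            nn.Conv2d(16, 32, kernel_size=3, padding=1), nn.ReLU(),
            nn.MaxPool2d(kernel_size=(2, 1)),
        )
        self.rnn = nn.GRU(32 * (n_mels // 4), hidden, batch_first=True, bidirectional=True)
        self.out = nn.Linear(2 * hidden, 1)

    def forward(self, spec):
        x = self.conv(spec)                               # (B, C, F', T)
        b, c, f, t = x.shape
        x = x.permute(0, 3, 1, 2).reshape(b, t, c * f)    # time-major feature sequence
        x, _ = self.rnn(x)
        return torch.sigmoid(self.out(x)).squeeze(-1)     # (B, T) onset activation in [0, 1]

def pick_onsets(activation, threshold=0.5):
    # Naive peak picking: frames above threshold that are local maxima.
    a = activation.detach().cpu().numpy()
    return [i for i in range(1, len(a) - 1)
            if a[i] > threshold and a[i] >= a[i - 1] and a[i] >= a[i + 1]]

# Example: a random spectrogram stands in for a real log-mel input.
net = ConvRecurrentOnsetNet()
act = net(torch.randn(1, 1, 80, 200))
print(pick_onsets(act[0]))
```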
RPPW
Adaptive Metronome: A MIDI Plug-in for Modelling Cooperative Timing in Music Ensembles
Sean Enderby, Ryan Stables, Jason Hockman, Maciek Tomczak, Alan Wing, Mark Elliott, and Massimiliano Di Luca
Rhythm Production and Perception Workshop (RPPW), Birmingham, UK, 2023
Creative rhythmic transformations of musical audio refer to automated methods for the manipulation of temporally relevant sounds in time. This paper presents a method for joint synthesis and rhythm transformation of drum sounds through the use of adversarial autoencoders (AAE). Users may navigate both the timbre and rhythm of drum patterns in audio recordings through expressive control over a low-dimensional latent space. The model is based on an AAE with Gaussian mixture latent distributions that introduce rhythmic pattern conditioning to represent a wide variety of drum performances. The AAE is trained on a dataset of bar-length segments of percussion recordings, along with their clustered rhythmic pattern labels. The decoder is conditioned during adversarial training to mix data-driven rhythmic and timbral properties. The system is trained on over 500,000 bars from 5,418 tracks in popular datasets covering various musical genres. In an evaluation using real percussion recordings, reconstruction accuracy and latent space interpolation between drum performances are investigated for audio generation conditioned on target rhythmic patterns.
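The conditioning idea can be illustrated with a toy adversarial autoencoder in PyTorch, where the decoder receives the latent code concatenated with a one-hot rhythmic-pattern label. The dense layers, dimensions, random placeholder inputs and single-Gaussian prior below are simplifying assumptions (the paper describes a Gaussian mixture prior), so this is a sketch of the training signal rather than the published model.

```python
import torch
import torch.nn as nn

feat_dim, latent_dim, n_patterns = 512, 16, 8

encoder = nn.Sequential(nn.Linear(feat_dim, 256), nn.ReLU(), nn.Linear(256, latent_dim))
decoder = nn.Sequential(nn.Linear(latent_dim + n_patterns, 256), nn.ReLU(),
                        nn.Linear(256, feat_dim))
discriminator = nn.Sequential(nn.Linear(latent_dim, 64), nn.ReLU(), nn.Linear(64, 1))

opt_ae = torch.optim.Adam(list(encoder.parameters()) + list(decoder.parameters()), lr=1e-3)
opt_d = torch.optim.Adam(discriminator.parameters(), lr=1e-3)
bce = nn.BCEWithLogitsLoss()

x = torch.randn(32, feat_dim)                     # bar-length segment features (placeholder)
pattern = torch.eye(n_patterns)[torch.randint(0, n_patterns, (32,))]  # one-hot rhythm labels

# 1) Reconstruction: the decoder sees the latent code plus the rhythmic-pattern condition.
z = encoder(x)
x_hat = decoder(torch.cat([z, pattern], dim=1))
rec_loss = nn.functional.mse_loss(x_hat, x)

# 2) Adversarial regularisation: push encoder outputs towards the prior
#    (a single Gaussian here for simplicity).
z_prior = torch.randn_like(z)
d_loss = (bce(discriminator(z_prior), torch.ones(32, 1)) +
          bce(discriminator(z.detach()), torch.zeros(32, 1)))
opt_d.zero_grad()
d_loss.backward()
opt_d.step()

g_loss = rec_loss + 0.1 * bce(discriminator(z), torch.ones(32, 1))
opt_ae.zero_grad()
g_loss.backward()
opt_ae.step()
```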
DAFx
Adversarial Synthesis of Drum Sounds
Jake Drysdale, Maciek Tomczak, and Jason Hockman
Proceedings of the International Conference on Digital Audio Effects (DAFx), Vienna, Austria, 2020
Many recent approaches to creative transformations of musical audio have been motivated by the success of raw audio generation models such as WaveNet, in which audio samples are modelled by generative neural networks. This paper describes a generative audio synthesis model for multi-drum translation based on a WaveNet denoising autoencoder architecture. The timbre of an arbitrary source audio input is transformed to sound as if it were played by various percussive instruments while preserving its rhythmic structure. Two evaluations of the transformations are conducted, based on the capacity of the model to preserve the rhythmic patterns of the input and on the audio quality as it relates to the timbre of the target drum domain. The first evaluation measures the rhythmic similarities between the source audio and the corresponding drum translations, and the second provides a numerical analysis of the quality of the synthesised audio. Additionally, a semi- and fully-automatic audio effect is proposed, in which the user may assist the system by manually labelling source audio segments or by using a state-of-the-art automatic drum transcription system prior to drum translation.
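The first evaluation idea can be approximated with a short script that compares onset strength envelopes of the source audio and a translated rendering. The use of librosa, the cosine-similarity measure and the file names below are assumptions for illustration, not the paper's exact metric.

```python
import numpy as np
import librosa

def onset_envelope(path, sr=22050, hop_length=512):
    # Normalised onset strength envelope of a recording.
    y, _ = librosa.load(path, sr=sr, mono=True)
    env = librosa.onset.onset_strength(y=y, sr=sr, hop_length=hop_length)
    return env / (np.max(env) + 1e-9)

def rhythmic_similarity(source_path, translated_path):
    a, b = onset_envelope(source_path), onset_envelope(translated_path)
    n = min(len(a), len(b))
    a, b = a[:n], b[:n]
    # Cosine similarity between the two envelopes (1.0 = identical rhythmic profile).
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b) + 1e-9))

# Hypothetical file names for a source loop and its snare-domain translation.
print(rhythmic_similarity("source_loop.wav", "translated_snare.wav"))
```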
2018
DAFx
Audio Style Transfer with Rhythmic Constraints
Maciek Tomczak, Carl Southall, and Jason Hockman
Proceedings of the International Conference on Digital Audio Effects (DAFx), Aveiro, Portugal, 2018
This project describes an approach to semantic recognition using Mel-frequency cepstral coefficients (MFCCs) extracted from equalised electric guitar recordings. Feature scaling is applied prior to training and testing on semantically processed samples with k-nearest neighbour (kNN) and support vector machine (SVM) classifiers. Using a dataset of 400 semantic trials collected from 20 experiment participants, the kNN and SVM classifiers were successfully trained to distinguish between warm and bright features. The results presented in this study show that a kNN model with k = 5 classifies the warm and bright descriptors most accurately, achieving 0.04% error on the test set.
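A minimal version of this pipeline can be sketched as follows: MFCC extraction, feature scaling, then kNN (k = 5) and SVM classifiers. The mean-MFCC summary statistic, the placeholder random features standing in for the 400 labelled trials, and the warm/bright label encoding are assumptions for illustration, not the study's exact setup.

```python
import numpy as np
import librosa
from sklearn.preprocessing import StandardScaler
from sklearn.neighbors import KNeighborsClassifier
from sklearn.svm import SVC
from sklearn.model_selection import train_test_split

def mfcc_vector(path, n_mfcc=13):
    # Mean MFCC vector per recording; the summary statistic is an assumption.
    y, sr = librosa.load(path, sr=None, mono=True)
    return librosa.feature.mfcc(y=y, sr=sr, n_mfcc=n_mfcc).mean(axis=1)

def evaluate(X, y):
    # Feature scaling followed by kNN (k = 5) and SVM classification.
    X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.25,
                                              random_state=0, stratify=y)
    scaler = StandardScaler().fit(X_tr)
    X_tr, X_te = scaler.transform(X_tr), scaler.transform(X_te)
    for name, clf in [("kNN (k=5)", KNeighborsClassifier(n_neighbors=5)),
                      ("SVM", SVC(kernel="rbf"))]:
        clf.fit(X_tr, y_tr)
        print(name, "test accuracy:", clf.score(X_te, y_te))

if __name__ == "__main__":
    # Placeholder random features stand in for MFCCs of the 400 labelled trials.
    rng = np.random.default_rng(0)
    X = rng.normal(size=(400, 13))
    y = rng.integers(0, 2, size=400)  # 0 = warm, 1 = bright
    evaluate(X, y)
```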