The International Society of Music Information Retrieval


Conferences
Transactions of ISMIR
Women in MIR
Resources


About the Society
Membership
Community Statistics
Contact

Conferences / ISMIR 2023

Full Proceedings

Proceedings of the 24th International Society for Music Information Retrieval Conference, Milan, Italy, November 5-9, 2023 (ISBN: 978-1-7327299-3-3) [pdf]

Papers
Shreyas Nadkarni, Sujoy Roychowdhury, Preeti Rao, Martin Clayton
Exploring the Correspondence of Melodic Contour With Gesture in Raga Alap Singing 21-28[pdf]
Miguel Perez, Holger Kirchhoff, Xavier Serra
TriAD: Capturing Harmonics With 3D Convolutions 29-36[pdf]
Fabio Morreale, Megha Sharma, I-Chieh Wei
Data Collection in Music Generation Training Sets: A Critical Analysis 37-46[pdf]
Bob L. T. Sturm, Arthur Flexer
A Review of Validity and Its Relationship to Music Information Research 47-55[pdf]
Gowriprasad R, Srikrishnan Sridharan, R Aravind, Hema A. Murthy
Segmentation and Analysis of Taniavartanam in Carnatic Music Concerts 56-63[pdf]
Changhong Wang, Gaël Richard, Brian McFee
Transfer Learning and Bias Correction With Pre-Trained Audio Embeddings 64-70[pdf]
Michèle Duguay, Kate Mancey, Johanna Devaney
Collaborative Song Dataset (CoSoD): An Annotated Dataset of Multi-Artist Collaborations in Popular Music 71-79[pdf]
Michele Newman, Lidia Morris, Jin Ha Lee
Human-AI Music Creation: Understanding the Perceptions and Experiences of Music Creators for Ethical and Productive Collaboration 80-88[pdf]
Nathan Fradet, Nicolas Gutowski, Fabien Chhel, Jean-Pierre Briot
Impact of Time and Note Duration Tokenizations on Deep Learning Symbolic Music Modeling 89-97[pdf]
Max Johnson, Mark R. H. Gotham
Musical Micro-Timing for Live Coding 98-105[pdf]
Francisco J. Castellanos, Antonio Javier Gallego, Ichiro Fujinaga
A Few-Shot Neural Approach for Layout Analysis of Music Score Images 106-113[pdf]
Behzad Haki, Błażej Kotowski, Cheuk Lun Isaac Lee, Sergi Jordà
TapTamDrum: A Dataset for Dualized Drum Patterns 114-120[pdf]
Andrea Martelloni, Andrew P. McPherson, Mathieu Barthet
Real-Time Percussive Technique Recognition and Embedding Learning for the Acoustic Guitar 121-128[pdf]
Hiromu Yakura, Masataka Goto
IteraTTA: An Interface for Exploring Both Text Prompts and Audio Priors in Generating Music With Text-to-Audio Models 129-137[pdf]
Mirco Pezzoli, Raffaele Malvermi, Fabio Antonacci, Augusto Sarti
Similarity Evaluation of Violin Directivity Patterns for Musical Instrument Retrieval 138-145[pdf]
George Sioros
Polyrhythmic Modelling of Non-Isochronous and Microtiming Patterns 146-153[pdf]
Shangda Wu, Dingyao Yu, Xu Tan, Maosong Sun
CLaMP: Contrastive Language-Music Pre-Training for Cross-Modal Symbolic Music Information Retrieval 157-165[pdf]
Luca Marinelli, György Fazekas, Charalampos Saitis
Gender-Coded Sound: Analysing the Gendering of Music in Toy Commercials via Multi-Task Learning 166-173[pdf]
Li-Yang Tseng, Tzu-Ling Lin, Hong-Han Shuai, Jen-Wei Huang, Wen-Whei Chang
A Dataset and Baselines for Measuring and Predicting the Music Piece Memorability 174-181[pdf]
Carlos Peñarrubia, Carlos Garrido-Munoz, Jose J. Valero-Mas, Jorge Calvo-Zaragoza
Efficient Notation Assembly in Optical Music Recognition 182-189[pdf]
Yuting Yang, Zeyu Jin, Connelly Barnes, Adam Finkelstein
White Box Search Over Audio Synthesizer Parameters 190-196[pdf]
Vincent K. M. Cheung, Lana Okuma, Kazuhisa Shibata, Kosetsu Tsukuda, Masataka Goto, Shinichi Furuya
Decoding Drums, Instrumentals, Vocals, and Mixed Sources in Music Using Human Brain Activity With fMRI 197-206[pdf]
Liyue Zhang, Xinyu Yang, Yichi Zhang, Jing Luo
Dual Attention-Based Multi-Scale Feature Fusion Approach for Dynamic Music Emotion Recognition 207-214[pdf]
Keisuke Toyama, Taketo Akama, Yukara Ikemiya, Yuhta Takida, Wei-Hsiang Liao, Yuki Mitsufuji
Automatic Piano Transcription With Hierarchical Frequency-Time Transformer 215-222[pdf]
Nazif Can Tamer, Yigitcan Özer, Meinard Müller, Xavier Serra
High-Resolution Violin Transcription Using Weak Labels 223-230[pdf]
Lejun Min, Junyan Jiang, Gus Xia, Jingwei Zhao
Polyffusion: A Diffusion Model for Polyphonic Score Generation With Internal and External Controls 231-238[pdf]
Claire Arthur, Nathaniel Condit-Schultz
The Coordinated Corpus of Popular Musics (CoCoPops): A Meta-Corpus of Melodic and Harmonic Transcriptions 239-246[pdf]
Anja Volk, Tinka Veldhuis, Katrien Foubert, Jos De Backer
Towards Computational Music Analysis for Music Therapy 247-256[pdf]
Luca Comanducci, Fabio Antonacci, Augusto Sarti
Timbre Transfer Using Image-to-Image Denoising Diffusion Implicit Models 257-263[pdf]
Neha Rajagopalan, Blair Kaneshiro
Correlation of EEG Responses Reflects Structural Similarity of Choruses in Popular Music 264-271[pdf]
Mark R. H. Gotham
Chromatic Chords in Theory and Practice 272-278[pdf]
Yo-Wei Hsiao, Tzu-Yun Hung, Tsung-Ping Chen, Li Su
BPS-Motif: A Dataset for Repeated Pattern Discovery of Polyphonic Symbolic Music 281-288[pdf]
Michael Krause, Sebastian Strahl, Meinard Müller
Weakly Supervised Multi-Pitch Estimation Using Cross-Version Alignment 289-296[pdf]
Patricia Hu, Gerhard Widmer
The Batik-Plays-Mozart Corpus: Linking Performance to Score to Musicological Annotations 297-303[pdf]
Joan Serrà, Davide Scaini, Santiago Pascual, Daniel Arteaga, Jordi Pons, Jeroen Breebaart, Giulio Cengarle
Mono-to-Stereo Through Parametric Stereo Generation 304-310[pdf]
Charilaos Papaioannou, Emmanouil Benetos, Alexandros Potamianos
From West to East: Who Can Understand the Music of the Others Better? 311-318[pdf]
Juan C. Martinez-Sevilla, Adrián Roselló, David Rizo, Jorge Calvo-Zaragoza
On the Performance of Optical Music Recognition in the Absence of Specific Training Data 319-326[pdf]
Martin E. Malandro
Composer’s Assistant: An Interactive Transformer for Multi-Track MIDI Infilling 327-334[pdf]
Ethan Lustig, David Temperley
The FAV Corpus: An Audio Dataset of Favorite Pieces and Excerpts, With Formal Analyses and Music Theory Descriptors 335-342[pdf]
Le Zhuo, Ruibin Yuan, Jiahao Pan, Yinghao Ma, Yizhi Li, Ge Zhang, Si Liu, Roger B. Dannenberg, Jie Fu, Chenghua Lin, Emmanouil Benetos, Wenhu Chen, Wei Xue, Yike Guo
LyricWhiz: Robust Multilingual Zero-Shot Lyrics Transcription by Whispering to ChatGPT 343-351[pdf]
Alia Morsi, Kana Tatsumi, Akira Maezawa, Takuya Fujishima, Xavier Serra
Sounds Out of Pläce? Score-Independent Detection of Conspicuous Mistakes in Piano Performances 352-358[pdf]
Hugo Flores García, Prem Seetharaman, Rithesh Kumar, Bryan Pardo
VampNet: Music Generation via Masked Acoustic Token Modeling 359-366[pdf]
Yucong Jiang
Expert and Novice Evaluations of Piano Performances: Criteria for Computer-Aided Feedback 367-374[pdf]
Andres Ferraro, Jaehun Kim, Sergio Oramas, Andreas Ehmann, Fabien Gouyon
Contrastive Learning for Cross-Modal Artist Retrieval 375-382[pdf]
Christoph Finkensiep, Matthieu Haeberle, Friedrich Eisenbrand, Markus Neuwirth, Martin Rohrmeier
Repetition-Structure Inference With Formal Prototypes 383-390[pdf]
Peter van Kranenburg, Eoin J. Kearns
Algorithmic Harmonization of Tonal Melodies Using Weighted Pitch Context Vectors 391-397[pdf]
Kento Watanabe, Masataka Goto
Text-to-Lyrics Generation With Image-Based Semantics and Reduced Risk of Plagiarism 398-406[pdf]
SeungHeon Doh, Keunwoo Choi, Jongpil Lee, Juhan Nam
LP-MusicCaps: LLM-Based Pseudo Music Captioning 409-416[pdf]
Morgan Buisson, Brian McFee, Slim Essid, Helene C. Crayencour
A Repetition-Based Triplet Mining Approach for Music Segmentation 417-424[pdf]
Francesco Foscarin, Daniel Harasim, Gerhard Widmer
Predicting Music Hierarchies With a Graph-Based Neural Decoder 425-432[pdf]
Johannes Zeitler, Simon Deniffel, Michael Krause, Meinard Müller
Stabilizing Training With Soft Dynamic Time Warping: A Case Study for Pitch Class Estimation With Weakly Aligned Targets 433-439[pdf]
Danbinaerin Han, Rafael Caro Repetto, Dasaem Jeong
Finding Tori: Self-Supervised Learning for Analyzing Korean Folk Song 440-447[pdf]
Bernardo Torres, Stefan Lattner, Gaël Richard
Singer Identity Representation Learning Using Self-Supervised Techniques 448-456[pdf]
Yinghao Ma, Ruibin Yuan, Yizhi Li, Ge Zhang, Chenghua Lin, Xingran Chen, Anton Ragni, Hanzhi Yin, Emmanouil Benetos, Norbert Gyenge, Ruibo Liu, Gus Xia, Roger B. Dannenberg, Yike Guo, Jie Fu
On the Effectiveness of Speech Self-Supervised Learning for Music 457-465[pdf]
Tian Cheng, Masataka Goto
Transformer-Based Beat Tracking With Low-Resolution Encoder and High-Resolution Decoder 466-473[pdf]
Vanessa Nina Borsan, Mathieu Giraud, Richard Groult, Thierry Lecroq
Adding Descriptors to Melodies Improves Pattern Matching: A Study on Slovenian Folk Songs 474-481[pdf]
Karlijn Dinnissen, Christine Bauer
How Control and Transparency for Users Could Improve Artist Fairness in Music Recommender Systems 482-491[pdf]
Ahyeon Choi, Eunsik Shin, Haesun Joung, Joongseek Lee, Kyogu Lee
Towards a New Interface for Music Listening: A User Experience Study on YouTube 492-499[pdf]
Xavier Riley, Simon Dixon
FiloBass: A Dataset and Corpus Based Study of Jazz Basslines 500-507[pdf]
Louis Couturier, Louis Bigo, Florence Levé
Comparing Texture in Piano Scores 508-515[pdf]
Johannes Hentschel, Andrew McLeod, Yannis Rammos, Martin Rohrmeier
Introducing DiMCAT for Processing and Analyzing Notated Music on a Very Large Scale 516-523[pdf]
Sehun Kim, Kazuya Takeda, Tomoki Toda
Sequence-to-Sequence Network Training Methods for Automatic Guitar Transcription With Tokenized Outputs 524-531[pdf]
Alain Riou, Stefan Lattner, Gaëtan Hadjeres, Geoffroy Peeters
PESTO: Pitch Estimation With Self-Supervised Transposition-Equivariant Objective 535-544[pdf]
Vanessa Nina Borsan, Mathieu Giraud, Richard Groult
The Games We Play: Exploring the Impact of ISMIR on Musicology 545-552[pdf]
Genís Plaja-Roglans, Marius Miron, Adithi Shankar, Xavier Serra
Carnatic Singing Voice Separation Using Cold Diffusion on Training Data With Bleeding 553-560[pdf]
Kosetsu Tsukuda, Tomoyasu Nakano, Masahiro Hamasaki, Masataka Goto
Unveiling the Impact of Musical Factors in Judging a Song on First Listen: Insights From a User Survey 561-570[pdf]
Jan Hajič jr., Gustavo A. Ballen, Klára Hedvika Mühlová, Hana Vlhová-Wörner
Towards Building a Phylogeny of Gregorian Chant Melodies 571-578[pdf]
Yiwei Ding, Alexander Lerch
Audio Embeddings as Teachers for Music Classification 579-587[pdf]
Ilya Borovik, Vladimir Viro
ScorePerformer: Expressive Piano Performance Rendering With Fine-Grained Control 588-596[pdf]
Emmanouil Karystinaios, Gerhard Widmer
Roman Numeral Analysis With Graph Neural Networks: Onset-Wise Predictions From Note-Wise Features 597-604[pdf]
Brian Regan, Desislava Hristova, Mariano Beguerisse-Díaz
Semi-Automated Music Catalog Curation Using Audio and Metadata 605-611[pdf]
Ioannis Petros Samiotis, Christoph Lofi, Alessandro Bozzon
Crowd’s Performance on Temporal Activity Detection of Musical Instruments in Polyphonic Music 612-618[pdf]
Igor Pereira, Felipe Araújo, Filip Korzeniowski, Richard Vogl
MoisesDB: A Dataset for Source Separation Beyond 4-Stems 619-626[pdf]
Zeng Ren, Wulfram Gerstner, Martin Rohrmeier
Music as Flow: A Formal Representation of Hierarchical Processes in Music 627-633[pdf]
Silvan David Peter
Online Symbolic Music Alignment With Offline Reinforcement Learning 634-641[pdf]
Oren Barkan, Shlomi Shvartzman, Noy Uzrad, Moshe Laufer, Almog Elharar, Noam Koenigstein
Inversynth II: Sound Matching via Self-Supervised Synthesizer-Proxy and Inference-Time Finetuning 642-648[pdf]
Amantur Amatov, Dmitry Lamanov, Maksim Titov, Ivan Vovk, Ilya Makarov, Mikhail Kudinov
A Semi-Supervised Deep Learning Approach to Dataset Collection for Query-by-Humming Task 649-656[pdf]
Keren Shao, Ke Chen, Taylor Berg-Kirkpatrick, Shlomo Dubnov
Towards Improving Harmonic Sensitivity and Prediction Stability for Singing Melody Extraction 657-663[pdf]
Chin-Yun Yu, György Fazekas
Singing Voice Synthesis Using Differentiable LPC and Glottal-Flow-Inspired Wavetables 667-675[pdf]
Qiaoyu Yang, Frank Cwitkowitz, Zhiyao Duan
Harmonic Analysis With Neural Semi-CRF 676-683[pdf]
Alberto Acquilino, Ninad Puranik, Ichiro Fujinaga, Gary Scavone
A Dataset and Baseline for Automated Assessment of Timbre Quality in Trumpet Sound 684-691[pdf]
Frank Heyen, Quynh Quang Ngo, Michael Sedlmair
Visual Overviews for Sheet Music Structure 692-699[pdf]
Luís Carvalho, Gerhard Widmer
Passage Summarization With Recurrent Models for Audio – Sheet Music Retrieval 700-707[pdf]
Pedro Ramoneda, Jose J. Valero-Mas, Dasaem Jeong, Xavier Serra
Predicting Performance Difficulty From Piano Sheet Music Images 708-715[pdf]
Junghyun Koo, Yunkee Chae, Chang-Bin Jeon, Kyogu Lee
Self-Refining of Pseudo Labels for Music Source Separation With Noisy Labeled Data 716-724[pdf]
Marcel A. Vélez Vásquez, Mariëlle Baelemans, Jonathan Driedger, Willem Zuidema, John Ashley Burgoyne
Quantifying the Ease of Playing Song Chords on the Guitar 725-732[pdf]
Irmak Bükey, Jason Zhang, TJ Tsai
FlexDTW: Dynamic Time Warping With Flexible Boundary Conditions 733-740[pdf]
Alexandre D’Hooge, Louis Bigo, Ken Déguernel
Modeling Bends in Popular Music Guitar Tablatures 741-748[pdf]
Geoffroy Peeters
Self-Similarity-Based and Novelty-Based Loss for Music Structure Analysis 749-756[pdf]
Carey Bunks, Tillman Weyde, Simon Dixon, Bruno Di Giorgi
Modeling Harmonic Similarity for Jazz Using Co-occurrence Vectors and the Membrane Area 757-764[pdf]
Shuqi Dai, Yuxuan Wu, Siqi Chen, Roy Huang, Roger B. Dannenberg
SingStyle111: A Multilingual Singing Dataset With Style Transfer 765-773[pdf]
Haven Kim, Kento Watanabe, Masataka Goto, Juhan Nam
A Computational Evaluation Framework for Singable Lyric Translation 774-781[pdf]
Kosetsu Tsukuda, Masahiro Hamasaki, Masataka Goto
Chorus-Playlist: Exploring the Impact of Listening to Only Choruses in a Playlist 782-792[pdf]
David Lewis, Elisabete Shibata, Andrew Hankinson, Johannes Kepper, Kevin R. Page, Lisa Rosendahl, Mark Saccomano, Christine Siegert
Supporting Musicological Investigations With Information Retrieval Tools: An Iterative Approach to Data Collection 795-801[pdf]
Federico Simonetta, Ana Llorens, Martín Serrano, Eduardo García-Portugués, Álvaro Torrente
Optimizing Feature Extraction for Symbolic Music 802-809[pdf]
Mathias Rose Bjare, Stefan Lattner, Gerhard Widmer
Exploring Sampling Techniques for Generating Melodies With a Transformer Language Model 810-816[pdf]
John Ashley Burgoyne, Janne Spijkervet, David John Baker
Measuring the Eurovision Song Contest: A Living Dataset for Real-World MIR 817-823[pdf]
Pablo Alonso-Jiménez, Xavier Serra, Dmitry Bogdanov
Efficient Supervised Training of Audio Transformers for Music Representation Learning 824-831[pdf]
Michael Krause, Christof Weiß, Meinard Müller
A Cross-Version Approach to Audio Representation Learning for Orchestral Music 832-839[pdf]
Tomoyasu Nakano, Masataka Goto
Music Source Separation With MLP Mixing of Time, Frequency, and Channel 840-847[pdf]
Huan Zhang, Emmanouil Karystinaios, Simon Dixon, Gerhard Widmer, Carlos Eduardo Cancino-Chacón
Symbolic Music Representations for Classification Tasks: A Systematic Evaluation 848-858[pdf]
Jacopo de Berardinis, Valentina Anita Carriero, Albert Meroño-Peñuela, Andrea Poltronieri, Valentina Presutti
The Music Meta Ontology: A Flexible Semantic Model for the Interoperability of Music Metadata 859-867[pdf]
Jeff Miller, Johan Pauwels, Mark Sandler
Polar Manhattan Displacement: Measuring Tonal Distances Between Chords Based on Intervallic Content 868-874[pdf]