publications
publications by categories in reversed chronological order. generated by jekyll-scholar.
2025
- CrossMuSim: A cross-modal framework for music similarity retrieval with LLM-powered text description sourcing and miningMay 2025arXiv:2503.23128 [cs] Summary: This paper introduces a dual-source data acquisition approach combining online scraping and LLM-based prompting, where carefully designed prompts leverage LLMs’ comprehensive music knowledge to generate contextually rich descriptions.
2024
- Multi-View Midivae: Fusing Track- and Bar-View Representations for Long Multi-Track Symbolic Music GenerationIn ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Apr 2024Summary: Object and subjective experimental results demonstrate that, compared to the baseline, Multi-view MidiVAE exhibits significant improvements in terms of modeling long multi-track symbolic music.
- Cycle Frequency-Harmonic-Time Transformer for Note-Level Singing Voice TranscriptionIn 2024 IEEE International Conference on Multimedia and Expo (ICME), Jul 2024Summary: A novel 3D Cycle Frequency-Harmonic-Time Transformer (CFT) is proposed to explicitly capture the harmonic series of singing voices, where a tokenization scheme is defined that captures harmonics across multiple octaves, then the harmonic features are aggregated into the frequency-harmonic-time representations via a cyclic architecture.
- Efficient adapter tuning for joint singing voice beat and downbeat tracking with self-supervised learning featuresIn Proceedings of the 25th International Society for Music Information Retrieval Conference, Nov 2024Summary: A novel temporal convolutional network-based beat-tracking approach featuring self-supervised learning representations and adapter tuning is proposed to track the beat and downbeat of singing voices jointly.
2023
- Improving Automatic Singing Skill Evaluation with Timbral Features, Attention, and Singing Voice SeparationIn 2023 IEEE International Conference on Multimedia and Expo (ICME), Jul 2023Summary: This paper proposes a more general ASSE model which applies to both solo singing and singing with accompaniment, and employs an existing singing voice separation tool for accompaniment removal and compares ASSE models trained with and without accompaniment.
2022
- VocEmb4SVS: Improving Singing Voice Separation with Vocal EmbeddingsIn 2022 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), Nov 2022Summary: VocEmb4SVS is proposed, an SVS framework to utilize vocal embeddings of the singer as auxiliary knowledge for SVS conditioning and achieves state-of-the-art performance on the MUSDB18 dataset.
- AnimeTAB: A new guitar tablature dataset of anime and game musicOct 2022arXiv:2210.03027 [cs] Summary: This paper presents AnimeTAB, a fingerstyle Guitar tablature dataset in MusicXML format, which provides more high-quality guitar tablature for both researchers and guitar players and an accompanying analysis toolkit, TABprocessor, is included to further facilitate its use.
2021
- Addressing ambiguity in supervised machine learning: A case study on automatic chord labellingMcGill University, Oct 2021
2020
- Automatic Chord Labelling: A Figured Bass ApproachIn Proceedings of the 7th International Conference on Digital Libraries for Musicology, Oct 2020Summary: This paper proposes a series of four rule-based algorithms that automatically generate chord labels for homorhythmic Baroque chorales based on both figured bass annotations and the musical surface, which are applied to the existing Bach Chorales Figured Bass dataset.
- Automatic Figured Bass Annotation Using the New Bach Chorales Figured Bass DatasetIn Proceedings of the 21th International Society for Music Information Retrieval Conference, Oct 2020
- Figured Bass Encodings for Bach Chorales in Various Symbolic Formats: A Case StudyIn Proceedings of the Music Encoding Conference, Oct 2020
- Data Quality Matters: Iterative Corrections on a Corpus of Mendelssohn String Quartets and Implications for MIR AnalysisIn International Society for Music Information Retrieval Conference (ISMIR 2020), Oct 2020
2019
- An Interactive Workflow for Generating Chord Labels for Homorhythmic Music in Symbolic FormatsIn Proceedings of the 20th International Society for Music Information Retrieval Conference, Oct 2019
2018
- A Flexible Approach to Automated Harmonic Analysis: Multiple Annotations of Chorales by Bach and PrætoriusIn Proceedings of the 19th International Society of Music Information Retrieval Conference, Oct 2018
2017
- Non-chord Tone Identification Using Deep Neural NetworksIn Proceedings of the 4th International Workshop on Digital Libraries for Musicology - DLfM ’17, Oct 2017Summary: The results suggest that DNNs offer an innovative and promising approach to tackling the problem of non-chord tone identification, as well as harmonic analysis.
2012
- K-means initial clustering center optimal algorithm based on KruskalJ. Inf. Comput. Sci, Oct 2012