TY - GEN
T1 - Searching musical audio datasets by a batch of multi-variant tracks
AU - Yu, Yi
AU - Downie, J. Stephen
AU - Chen, Lei
AU - Oria, Vincent
AU - Joe, Kazuki
PY - 2008
Y1 - 2008
N2 - Multi-variant music tracks are those audio tracks of a particular song which are sung and recorded by different people (i.e., cover songs). As music social clubs grow on the Internet, more and more people like to upload music recordings onto such music social sites to share their own homeproduced albums and participate in Internet singing contests. Therefore it is very important to explore a computerassisted evaluation tool to detect these audio-based multivariant tracks. In this paper we investigate such a task: the original track of a song is embedded in datasets, with a batch of multi-variant audio tracks of this song as input, our retrieval system returns an ordered list by similarity and indicates the position of relevant audio track. To help process multi-variant audio tracks, we suggest a semantic indexing framework and propose the Federated Features (FF) scheme to generate the semantic summarization of audio feature sequences. The conjunction of federated features with three typical similarity searching schemes, K-Nearest Neighbor (KNN), Locality Sensitive Hashing (LSH), and Exact Euclidian LSH (E2LSH), is evaluated. From these findings, a computer-assisted evaluation tool for searching multi-variant audio tracks was developed to search over large musical audio datasets.
AB - Multi-variant music tracks are those audio tracks of a particular song which are sung and recorded by different people (i.e., cover songs). As music social clubs grow on the Internet, more and more people like to upload music recordings onto such music social sites to share their own homeproduced albums and participate in Internet singing contests. Therefore it is very important to explore a computerassisted evaluation tool to detect these audio-based multivariant tracks. In this paper we investigate such a task: the original track of a song is embedded in datasets, with a batch of multi-variant audio tracks of this song as input, our retrieval system returns an ordered list by similarity and indicates the position of relevant audio track. To help process multi-variant audio tracks, we suggest a semantic indexing framework and propose the Federated Features (FF) scheme to generate the semantic summarization of audio feature sequences. The conjunction of federated features with three typical similarity searching schemes, K-Nearest Neighbor (KNN), Locality Sensitive Hashing (LSH), and Exact Euclidian LSH (E2LSH), is evaluated. From these findings, a computer-assisted evaluation tool for searching multi-variant audio tracks was developed to search over large musical audio datasets.
KW - Content-based audio retrieval
KW - Cover songs
KW - Hash-based
KW - Indexing
KW - Musical audio sequences summarization
UR - http://www.scopus.com/inward/record.url?scp=70450242881&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=70450242881&partnerID=8YFLogxK
U2 - 10.1145/1460096.1460117
DO - 10.1145/1460096.1460117
M3 - Conference contribution
AN - SCOPUS:70450242881
SN - 9781605583129
T3 - Proceedings of the 1st International ACM Conference on Multimedia Information Retrieval, MIR2008, Co-located with the 2008 ACM International Conference on Multimedia, MM'08
SP - 121
EP - 127
BT - Proceedings of the 1st International ACM Conference on Multimedia Information Retrieval, MIR2008, Co-located with the 2008 ACM International Conference on Multimedia, MM'08
T2 - 1st International ACM Conference on Multimedia Information Retrieval, MIR2008, Co-located with the 2008 ACM International Conference on Multimedia, MM'08
Y2 - 30 August 2008 through 31 August 2008
ER -