DogSpeak: A Canine Vocalization Classification Dataset
Hridayesh Lekhak, Theron S. Wang, Tuan M. Dang, and 1 more author
In Proceedings of the 33rd ACM International Conference on Multimedia, Dublin, Ireland, Jul 2025
Progress in understanding real-world canine vocal communication is constrained by datasets lacking scale and ’in-the-wild’ diversity. We introduce DogSpeak, a large-scale public dataset of 77,202 Barkseqs (33.162 hours) from 156 dogs (5 breeds), uniquely sourced from online social media with accurate dog ID, sex, and breed labels. DogSpeak, one of the largest of its kind, addresses prior limitations. Benchmark tasks (sex, breed, individual dog recognition) demonstrate its utility and highlight how its inherent real-world challenges necessitate and foster research into more robust bioacoustic models, preprocessing, and feature representation.