Nexora-Music-PD v1-mini is a curated public-domain music dataset composed of historical audio recordings sourced from the Library of Congress Citizen DJ collections. The dataset is designed for open research, audio analysis, music information retrieval, remixing, and AI/ML experimentation.
This is a mini release (v1) intended as a lightweight, easy-to-use subset for testing pipelines, educational use, and small-scale experiments.
- Dataset Name: nexora-music-pd-v1-mini
- Pretty Name: Nexora Music PD v1 Mini
- Version: v1-mini
- License: MIT (metadata & curation)
- Audio License: Public Domain
- Language: English
- Size Category: n < 1K audio files
- Task Category: Text-to-Audio
The audio recordings in this dataset are sourced from the Library of Congress Citizen DJ – National Jukebox (Popular Music) collections.
These recordings are identified as public domain and are free to use, reuse, remix, and redistribute without restriction.
Source homepage: https://citizen-dj.labs.loc.gov/
Nexora-Music-PD-v1-mini/
├── audio/
│ └── nexora-music-pd-v1-mini.zip
├── samples/
│ ├── Sobre-las-olas_jukebox-120145_001_00-00-30.mp3
│ ├── Some-boy_jukebox-132913_001_00-00-58.mp3
│ └── Some-of-these-days_jukebox-254467_001_00-01-16.mp3
└── README.md
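The sample file names listed above appear to follow a regular pattern: a hyphenated title, a `jukebox-<number>` identifier, a three-digit index, and an `HH-MM-SS` timestamp. The following is a minimal sketch for splitting a file name into those parts. The pattern is inferred from the three sample names only and is an assumption, as are the field names (`title`, `item_id`, `part`, `timestamp_seconds`) and the helper `parse_sample_name`, which is hypothetical and not part of the dataset. Whether the timestamp is a clip offset or a duration is not documented here.

```python
import re
from typing import NamedTuple, Optional

# Assumed pattern, inferred from the sample file names only:
#   <Title-with-hyphens>_jukebox-<digits>_<index>_<HH-MM-SS>.mp3
_NAME_RE = re.compile(
    r"^(?P<title>.+)_jukebox-(?P<item_id>\d+)_(?P<part>\d+)_"
    r"(?P<h>\d{2})-(?P<m>\d{2})-(?P<s>\d{2})\.mp3$"
)


class SampleName(NamedTuple):
    """Fields recovered from a file name (names are illustrative)."""
    title: str
    item_id: int
    part: int
    timestamp_seconds: int


def parse_sample_name(filename: str) -> Optional[SampleName]:
    """Return the parsed fields, or None if the name does not match."""
    match = _NAME_RE.match(filename)
    if match is None:
        return None
    # Convert the HH-MM-SS group into a total number of seconds.
    seconds = int(match["h"]) * 3600 + int(match["m"]) * 60 + int(match["s"])
    return SampleName(
        # Hyphens in the title are treated as word separators (an assumption).
        title=match["title"].replace("-", " "),
        item_id=int(match["item_id"]),
        part=int(match["part"]),
        timestamp_seconds=seconds,
    )


if __name__ == "__main__":
    print(parse_sample_name("Some-boy_jukebox-132913_001_00-00-58.mp3"))
```

Files inside the archive may not all follow this pattern, so the function returns `None` for names that do not match rather than raising an error.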
This dataset may be used for:
- Audio & music research
- AI / ML model training and evaluation
- Text-to-audio experimentation
- Music information retrieval (MIR)
- Educational and academic projects
- Creative remixing and sound design
This dataset must not be used for:
- Identifying or profiling individuals
- Any use that violates applicable laws or ethical research standards
All audio recordings included in this dataset are in the public domain. No copyright restrictions apply to the audio itself.
The dataset structure, metadata files, and documentation are released under the MIT License.
See LICENSE.txt for details.
If you use this dataset in academic or research work, please cite it as:
@dataset{nexora_music_pd_v1_mini,
title = {Nexora-Music-PD v1-mini},
author = {Nexora},
year = {2026},
url = {https://citizen-dj.labs.loc.gov/}
}
Special thanks to the Library of Congress and the Citizen DJ initiative for making historical audio recordings openly accessible to the public.
Project maintained by JackMa.
For issues, suggestions, or contributions, please open an issue or pull request.