This repository supports work related to the National Secure Data Service (NSDS) project, led by the National Science Foundation's National Center for Science and Engineering Statistics (NCSES).
The NSDS is mandated under the 2022 CHIPS and Science Act to lead a government-wide effort on strengthening data linkage and data access infrastructure, facilitating statistical activities that support increased evidence building for the American public.
The concept of a National Secure Data Service was first introduced in the bipartisan Commission on Evidence-Based Policymaking's Final Report in 2017, and further developed through recommendations from the Advisory Committee on Data for Evidence Building in 2022, established under the Foundations for Evidence-Based Policymaking Act of 2018.
The NSDS project aims to:
- Develop a shared services model to streamline and innovate data sharing and linking
- Enable evidence-based decision making at all levels of government and across all sectors
- Deploy services, technologies, techniques, and shared service models in support of the NSDS mission
- Advance novel research collaborations, data linkage methodologies, and privacy preserving technologies
The NSDS focuses on building robust infrastructure to serve researchers, agencies, and policymakers across all sectors. Core focus areas include:
- Data linkage infrastructure and methodology
- Privacy-preserving technologies and techniques
- Shared services model development
- Cross-agency and cross-sector data access frameworks
This repository serves as the home for a collection of toolkits designed to enhance the NSDS mission and accelerate data discovery through the application of artificial intelligence.
These tools aim to push beyond traditional data linkage methods by leveraging AI to surface hidden relationships, patterns, and insights across complex, multi-source datasets. The toolkit initiative is built around the following objectives:
- Augmenting NSDS capabilities by developing reusable tools and frameworks that extend and strengthen the core services of the NSDS
- AI-driven data discovery through machine learning and intelligent search techniques to identify and connect relevant data assets across disparate sources
- Accelerating evidence building by reducing the friction researchers and analysts face when navigating large, distributed government datasets
- Interoperability and openness through modular, well-documented tools that can be adopted and extended by partner agencies and research communities
Contributions, feedback, and collaboration from the broader data and research community are welcome as this toolkit evolves.
- NSDS Initiative Page
- Vision for a Future NSDS
- Authorizing Legislation
- Privacy and Confidentiality
- NSDS Projects
- Two-Year Congressional Report
For questions related to this repository, please open an issue or contact the project team directly.
This project is conducted in support of the National Science Foundation / NCSES NSDS initiative.