NIST Atomic Spectra Log-Cosine Spectral Scan

This repository contains a reproducible log-cosine spectral-density scan over public NIST Atomic Spectra Database line lists for transition metals (Fe, Ni, Co, Cr, Mn, Ti). The primary result is a stable Fe II log-cosine spectral-density mode in logarithmic wavenumber coordinates.

This project uses public CSV exports from the NIST Atomic Spectra Database. It is not "NIST certified", endorsed, or validated by NIST. NIST is the data provider only.

What the scanner does

For each line list (e.g. data/Fe_lines.csv) the scanner reads wavenumber wn(cm-1) (handling NIST/Excel-style cells such as ="49998.80"), optionally filters by ionization stage sp_num, converts to the logarithmic coordinate

ell = ln(wavenumber_cm^-1)

bins line counts on ell, and fits a one-mode Poisson log-linear model:

log mu(ell; k, a, b, c) = log mu0(ell) + c + a cos(k ell) + b sin(k ell)

against a Gaussian-smoothed baseline mu0. The scan statistic is the deviance improvement

DeltaChi^2_star = max_k ( D[y || mu0] - D[y || mu(k)] )

over a k grid. The active-domain winding is

n = k * Delta_ell / (2 pi)

where Delta_ell = ell_max - ell_min. The null is a parametric Poisson bootstrap from mu0, scanned with the same k grid; the empirical p-value is

p_hat = (1 + tail_count_ge) / (1 + null_n)
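
The steps above (clean cells, take ell, bin, smooth a baseline, fit one mode per k, keep the largest deviance improvement) can be sketched compactly in NumPy. This is a minimal illustration under stated assumptions, not the code in scripts/nist_wct_log_spectral_scan_FIXED.py: the names parse_cell, gaussian_baseline, fit_mode and scan, the smoothing width sigma_bins, and the caller-supplied k grid are all hypothetical, and the Newton iteration stands in for whatever solver the canonical scanner uses.

```python
import numpy as np


def parse_cell(cell):
    """Parse a wavenumber cell, tolerating Excel-style values such as ="49998.80"."""
    text = str(cell).strip().lstrip("=").strip('"').strip()
    try:
        return float(text)
    except ValueError:
        return float("nan")


def gaussian_baseline(y, sigma_bins):
    """Gaussian-smoothed counts (assumed baseline form), floored to stay positive."""
    half = int(np.ceil(4 * sigma_bins))
    x = np.arange(-half, half + 1)
    kernel = np.exp(-0.5 * (x / sigma_bins) ** 2)
    kernel /= kernel.sum()
    padded = np.pad(y, half, mode="edge")
    return np.maximum(np.convolve(padded, kernel, mode="valid"), 1e-9)


def poisson_deviance(y, mu):
    """D[y || mu] = 2 * sum(y log(y/mu) - (y - mu)), with 0 log 0 taken as 0."""
    ratio = np.where(y > 0, y / mu, 1.0)
    return 2.0 * np.sum(y * np.log(ratio) - (y - mu))


def fit_mode(y, mu0, ell, k, n_iter=50):
    """Newton fit of log mu = log mu0 + c + a cos(k ell) + b sin(k ell)."""
    X = np.column_stack([np.ones_like(ell), np.cos(k * ell), np.sin(k * ell)])
    beta = np.zeros(3)  # (c, a, b); beta = 0 reproduces the baseline mu0
    for _ in range(n_iter):
        mu = mu0 * np.exp(X @ beta)
        step = np.linalg.solve(X.T @ (mu[:, None] * X), X.T @ (y - mu))
        beta += step
        if np.max(np.abs(step)) < 1e-10:
            break
    return mu0 * np.exp(X @ beta)


def scan(wavenumbers, k_grid, bins=160, sigma_bins=8.0):
    """Return (k_best, deviance improvement at k_best, winding n)."""
    wn = np.asarray(wavenumbers, dtype=float)
    ell = np.log(wn[np.isfinite(wn) & (wn > 0)])
    counts, edges = np.histogram(ell, bins=bins)
    centers = 0.5 * (edges[:-1] + edges[1:])
    y = counts.astype(float)
    mu0 = gaussian_baseline(y, sigma_bins)
    d0 = poisson_deviance(y, mu0)
    gains = np.array(
        [d0 - poisson_deviance(y, fit_mode(y, mu0, centers, k)) for k in k_grid]
    )
    best = int(np.argmax(gains))
    delta_ell = ell.max() - ell.min()
    return k_grid[best], gains[best], k_grid[best] * delta_ell / (2 * np.pi)
```

For the Poisson log-linear model the Newton step and the IRLS step coincide, so the sketch is one way to realise the fit; the real grid, baseline width, and convergence rules should be read from the canonical script.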

Primary result (Fe II, ion = 2, n_unique_lines = 9447)

Bins	k_best	n_obs	DeltaD	null (tail/N)	p_hat
120	31.3265306122449	10.717134	355.7443	0 / 5000	1/5001 ≈ 0.00019996
160	31.3265306122449	10.739649	315.7277	0 / 5000	1/5001 ≈ 0.00019996
200	31.3265306122449	10.753158	259.1788	0 / 5000	1/5001 ≈ 0.00019996

The headline observation is bin stability: k_best = 31.3265306122449 is identical across the 120, 160, and 200-bin histograms, n_obs stays near 10.7, and none of the 5000 null scans at each bin setting reaches the observed statistic. The PASS/FAIL gate encoded in scripts/make_verdict.py enforces this.
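
As a quick arithmetic check on the table, the snippet below recomputes p_hat from the tail counts and inverts the winding formula n = k * Delta_ell / (2 pi) to recover the Delta_ell implied by each row. The helper names are illustrative only; the numbers are copied from the table above.

```python
import math

K_BEST = 31.3265306122449

# (bins, n_obs, tail_count_ge, null_n), copied from the table above
ROWS = [
    (120, 10.717134, 0, 5000),
    (160, 10.739649, 0, 5000),
    (200, 10.753158, 0, 5000),
]


def p_hat(tail_count_ge, null_n):
    """Empirical p-value with the add-one correction."""
    return (1 + tail_count_ge) / (1 + null_n)


def implied_delta_ell(n_obs, k):
    """Invert n = k * Delta_ell / (2 pi) for Delta_ell."""
    return 2.0 * math.pi * n_obs / k


for bins, n_obs, tail, null_n in ROWS:
    print(bins, f"{p_hat(tail, null_n):.8f}", f"{implied_delta_ell(n_obs, K_BEST):.4f}")
```

With zero exceedances, p_hat equals 1/(1 + null_n), the smallest value this estimator can report for 5000 nulls, so it is a resolution limit rather than a measured tail probability. The implied Delta_ell comes out close to 2.15 for all three rows.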

Repository layout

README.md
RESULTS.md
requirements.txt
data/
  Fe_lines.csv  Ni_lines.csv  Co_lines.csv
  Cr_lines.csv  Mn_lines.csv  Ti_lines.csv
scripts/
  nist_wct_log_spectral_scan_FIXED.py    # canonical CPU scanner
  nist_wct_log_spectral_scan.py          # legacy one-shot Fe splitter
  nist_wct_log_spectral_scan_CUPY.py     # optional GPU mirror (CuPy)
  nist_batch_run_neighbors.py            # neighbor-ion batch runner
  run_feii_bin_stability.py              # Fe II 120/160/200 ladder + master CSV
  make_verdict.py                        # PASS/FAIL from master CSV
outputs/
  fe_ion2_120/  fe_ion2_160/  fe_ion2_200/   # canonical Fe II runs
  batch_neighbors/                            # canonical neighbor batch (written on demand)
tables/
  nist_master_results.csv
  nist_verdict.csv
tests/
  test_nist_scanner.py

CPU canonical script vs optional CuPy script

scripts/nist_wct_log_spectral_scan_FIXED.py is the canonical scanner. It is pure NumPy/SciPy and produces the values in the table above. The PASS/FAIL verdict is built off this script's output.
scripts/nist_wct_log_spectral_scan_CUPY.py is an optional GPU mirror using CuPy. It is faster for large null_n but is not required for reproduction. If CuPy is not installed, ignore this script.
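
One common way to keep a GPU mirror optional is to import CuPy when it is usable and fall back to NumPy otherwise. The sketch below shows that pattern only; it is an assumption about how such a mirror could be organised, not a description of nist_wct_log_spectral_scan_CUPY.py, and the helper names to_host and modulation are hypothetical.

```python
import numpy as np

try:
    import cupy as xp  # optional GPU backend
    HAVE_CUPY = True
except Exception:  # package missing, or installed without a usable CUDA runtime
    xp = np
    HAVE_CUPY = False


def to_host(arr):
    """Return a NumPy array whether arr lives on the GPU or on the CPU."""
    return xp.asnumpy(arr) if HAVE_CUPY else np.asarray(arr)


def modulation(ell, k, a, b):
    """a cos(k ell) + b sin(k ell), evaluated on whichever backend is active."""
    ell = xp.asarray(ell)
    return a * xp.cos(k * ell) + b * xp.sin(k * ell)


print("backend:", "cupy" if HAVE_CUPY else "numpy")
print(to_host(modulation([0.0, 0.1], k=31.3265306122449, a=1.0, b=0.0)))
```

Because CuPy follows the NumPy array API for these calls, the same function body runs on either backend; results should still be compared against the CPU script before they are trusted.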

Commands

Quick single-bin run (Fe II, bins=160, 500-null preview):

python scripts/nist_wct_log_spectral_scan_FIXED.py \
    --csv data/Fe_lines.csv --ion 2 --bins 160 \
    --null-n 500 --min-lines 100 \
    --out-dir outputs/fe_ion2_160

Bin-stability ladder (quick, 500 nulls per bin):

python scripts/run_feii_bin_stability.py --null-n 500

Full run (5000 nulls per bin) plus verdict:

python scripts/run_feii_bin_stability.py --null-n 5000
python scripts/make_verdict.py

Tests:

pytest -q

Neighbor batch (Ni / Co / Cr / Mn / Ti, ion=2) — exploratory:

python scripts/nist_batch_run_neighbors.py --preview-null 500 --no-promote

Canonical vs legacy outputs

The canonical Fe II outputs live in outputs/fe_ion2_{120,160,200}/. Only these (or whatever rows are listed in tables/nist_master_results.csv) feed the PASS/FAIL verdict.
Older / exploratory runs at the repo root (for example outputs_fe_ion2_120_cupy_full/, outputs_ni_ion2_160_preview/, outputs_nist_wct/, etc.) are preserved as provenance. They are legacy, non-canonical, and are intentionally not used by the verdict unless they are explicitly referenced in tables/nist_master_results.csv. Do not delete them.

Limitations

This is a line-density scan, not a flux-spectrum scan.
The data are public NIST Atomic Spectra Database CSV exports; line completeness varies by species and is not uniform across ell.
The null is a parametric Poisson bootstrap from a Gaussian-smoothed baseline. Baseline-sigma sensitivity is documented and should be audited for any extension beyond Fe II.
This repository makes no claim of NIST endorsement, certification, or validation.
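
The parametric Poisson bootstrap named in the limitations above can be sketched as follows. This is a simplified illustration: it holds the baseline mu0 fixed for every null draw (whether the repository re-smooths each draw is not stated here), and the statistic argument is a placeholder where the real scanner would run the full k-grid scan. The function names and the Pearson stand-in statistic are hypothetical.

```python
import numpy as np


def bootstrap_p_value(y, mu0, statistic, null_n=500, seed=0):
    """Draw null_n Poisson replicates from mu0 and count statistics >= observed."""
    rng = np.random.default_rng(seed)
    observed = statistic(y, mu0)
    tail_count_ge = 0
    for _ in range(null_n):
        y_null = rng.poisson(mu0).astype(float)
        if statistic(y_null, mu0) >= observed:
            tail_count_ge += 1
    return (1 + tail_count_ge) / (1 + null_n), tail_count_ge


def pearson_stat(y, mu):
    """Stand-in statistic for the demo; not the deviance scan itself."""
    return float(np.sum((y - mu) ** 2 / mu))


mu0 = np.full(50, 20.0)
y = mu0.copy()
y[0] = 200.0  # one grossly discrepant bin, so no null draw should match it
p, tail = bootstrap_p_value(y, mu0, pearson_stat, null_n=500)
print(p, tail)
```

Rerunning the same function with baselines built at several smoothing widths is one simple way to audit the baseline-sigma sensitivity mentioned above.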

R implementation

An independent R computational reproduction of the canonical Python scanner lives under R/. It mirrors the scientific definitions, scan grid, output schema, and PASS/FAIL gate. The deterministic pipeline (read → clean → bin → Gaussian baseline → IRLS Poisson fit → deviance scan) reproduces the Python reference to within about 1e-11 relative difference; see R/README.md for the numerical conventions and the one documented Python/R difference, which is the RNG used by the parametric Poisson bootstrap.

Status: Python remains the canonical implementation. The committed R outputs pass the parity checks below (deterministic quantities agree with the Python reference to ≤ ~1e-11 relative). The R port is an independent computational reproduction; it is not an independent dataset and not an independent scientific confirmation.
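
A parity check of this kind reduces to a maximum relative difference over the deterministic quantities. The sketch below illustrates the arithmetic only: the dictionary keys are hypothetical, the Python-side numbers are taken from the 160-bin row of the primary-result table, and the R-side numbers are synthetic perturbations made up for the demo, not real R outputs. R/compare_python_r.R remains the actual comparison tool.

```python
def max_relative_difference(py_values, r_values):
    """Largest |py - r| / max(|py|, |r|) over the shared keys."""
    worst = 0.0
    for key, py in py_values.items():
        r = r_values[key]
        scale = max(abs(py), abs(r))
        rel = 0.0 if scale == 0.0 else abs(py - r) / scale
        worst = max(worst, rel)
    return worst


# Python-side values from the 160-bin table row; key names are hypothetical.
py_values = {"k_best": 31.3265306122449, "n_obs": 10.739649, "delta_d": 315.7277}

# Synthetic stand-in for R-side values: a 1e-13 relative perturbation.
r_values = {key: value * (1.0 + 1e-13) for key, value in py_values.items()}

worst = max_relative_difference(py_values, r_values)
print(worst, worst <= 1e-11)
```

Bootstrap-dependent quantities such as tail counts are excluded from a check like this, since the two languages use different random number generators.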

Required R version and packages

R ≥ 4.1.0.
Packages: jsonlite, gmp (the gmp package needs system libgmp), and testthat for the test suite.

# system dependency for gmp (Debian/Ubuntu)
sudo apt-get install -y libgmp-dev

# R packages
Rscript -e 'install.packages(c("jsonlite", "gmp", "testthat"), repos="https://cloud.r-project.org")'

(DESCRIPTION in the repo root lists the same dependencies.)

Single-bin example (Fe II, bins = 160, 500-null preview)

Rscript R/nist_wct_log_spectral_scan.R \
    --csv data/Fe_lines.csv --ion 2 --bins 160 \
    --null-n 500 --min-lines 100 \
    --out-dir outputs_r/fe_ion2_160

Full 5000-null example

Rscript R/nist_wct_log_spectral_scan.R \
    --csv data/Fe_lines.csv --ion 2 --bins 160 \
    --null-n 5000 --min-lines 100 \
    --out-dir outputs_r/fe_ion2_160

Bin-stability ladder (Fe II 120 / 160 / 200)

Rscript R/run_feii_bin_stability.R --null-n 500
# aggregates into tables_r/nist_master_results_r.csv

Verdict

Rscript R/make_verdict.R
# writes tables_r/nist_verdict_r.csv (same gate as scripts/make_verdict.py)

Tests

Rscript tests/testthat.R
# or: Rscript -e 'testthat::test_dir("tests/testthat")'

Python-versus-R comparison

Rscript R/compare_python_r.R \
    --py outputs/fe_ion2_160 \
    --r  outputs_r/fe_ion2_160

R outputs are written under outputs_r/ and tables_r/ so the canonical Python results under outputs/ and tables/ are never modified.
