Kmeans++

A minimal C++20 implementation of the K-Means++ clustering algorithm with OpenMP parallelism.

What's in the box

kmeans_pp.h: public API
kmeans_pp.cpp: implementation
main.cpp: example usage (2D synthetic dataset, evaluation, CSV export)
plot.py: matplotlib script to visualize results

There are no precompiled libraries to link against. Just drop kmeans_pp.h and kmeans_pp.cpp into your project and compile them alongside your own code.

Building

Requires a C++20 compiler (or later) and OpenMP. CMake 3.8+.

cmake --preset linux-debug   # or x64-release, etc.
cmake --build out/build/linux-debug

Windows presets (x64-debug, x64-release, x86-*) use MSVC + Ninja.

Usage

The entire API is a single function:

#include "kmeans_pp.h"

// data: flat array of N points, each with D coordinates (row-major)
// Returns centroids and per-point cluster assignments.
auto result = kmeans::kmeans_pp(N, D, K, data, max_iterations, seed);

// result.centroids: std::vector<double>, size K*D
// result.assignments: std::vector<int>, size N

seed = 0 uses std::random_device for non-deterministic initialization.

Example

main.cpp loads a 2D dataset in ARFF format, runs K-Means++, evaluates accuracy against ground-truth labels via greedy cluster-to-class matching, and exports results to CSV. It's just a demo, not part of the library.

The dataset used is 2d-10c (2990 points, 10 classes).

./kmeans data.arff        # default seed=42
./kmeans data.arff 123    # custom seed

To plot:

python plot.py datasets/kmeans_result.csv

Dependencies

OpenMP (linked via CMake's OpenMP::OpenMP_CXX)
C++20 or later
plot.py needs pandas and matplotlib.

License

Do whatever you want with it.

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
datasets		datasets
.gitignore		.gitignore
CMakeLists.txt		CMakeLists.txt
CMakePresets.json		CMakePresets.json
README.md		README.md
kmeans_pp.cpp		kmeans_pp.cpp
kmeans_pp.h		kmeans_pp.h
main.cpp		main.cpp
plot.py		plot.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Kmeans++

What's in the box

Building

Usage

Example

Dependencies

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Kmeans++

What's in the box

Building

Usage

Example

Dependencies

License

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages