Computer Vision project: ChessVision (Chess Game Reconstruction from Video)

Overview
Installation
Usage
Evaluation

This project combines chesscog by Georg Wölflein with my own work, which consists of fine-tuning the model, creating a small dataset, and adding some chess game logic to remove inconsistencies in the board state.

This work is still in progress. The final objective is to parse a full game from video; at present, the project parses a game from a sequence of images taken by a user. A pipeline that selects the video frames and processes them in real time is still to be implemented.

Morphy's Opera Game

Overview

The model consists of a pretrained occupancy classifier (ResNet) and a pretrained piece classifier (InceptionV3). Both were trained by the authors of chesscog on a dataset of ~5,000 synthetically generated images (3D renderings of chess positions from different angles and under varying lighting). After investigating the limits of this approach, I photographed 3 chess games move by move, from both players' perspectives. The resulting dataset consists of 358 images, 179 for each perspective. Some additional images were taken to test the model on chess boards and chess sets different from those used in the training set.
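As a rough illustration of how the two classifiers' outputs could be combined into a position, the sketch below builds the piece-placement field of a FEN string from 64 per-square predictions. It assumes the piece label is only trusted on squares the occupancy classifier flags as occupied, and that squares are listed in FEN order (a8 to h8, then a7 to h7, down to h1). The function name `to_fen_placement` and the input layout are hypothetical and are not taken from chesscog.

```python
from typing import List, Optional


def to_fen_placement(occupied: List[bool], pieces: List[Optional[str]]) -> str:
    """Build the FEN piece-placement field from per-square predictions.

    Both lists hold 64 entries in FEN order: a8..h8, a7..h7, ..., a1..h1.
    `pieces` uses FEN symbols ('P', 'n', 'K', ...) or None.
    A piece label on a square marked unoccupied is ignored (assumption).
    """
    if len(occupied) != 64 or len(pieces) != 64:
        raise ValueError("expected 64 squares")
    ranks = []
    for rank in range(8):
        row, empty = "", 0
        for file in range(8):
            i = rank * 8 + file
            if occupied[i] and pieces[i] is not None:
                if empty:
                    # Flush the run of empty squares before the piece.
                    row += str(empty)
                    empty = 0
                row += pieces[i]
            else:
                empty += 1
        if empty:
            row += str(empty)
        ranks.append(row)
    return "/".join(ranks)
```

Gating the piece label on the occupancy prediction means a spurious piece prediction on an empty square cannot leak into the reconstructed position.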

Please note that the system's main weakness is the RANSAC board localisation. To address the problem of too many lines appearing in the image, I use background removal with rembg. If the system fails, try retaking the picture of the board from a different angle. A clean and reliable background leads to faster inference.

ChessVision

The purpose of this project is to parse a complete chess game from a video. To tackle this problem I addressed the simpler issue of chess position recognition by exploiting a pre-trained model by Georg Wölflein et al. and fine-tuning it.

The repo suggests taking a picture of the starting position from both sides. I found the results unsatisfactory, so I repeated the process with a larger pool of images. To make it easier to obtain labels (FENs) for the positions, I took pictures of chess games and parsed the FENs directly from the PGN.
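The labelling idea can be sketched as follows: replay a PGN and record the FEN after every move, so that the n-th photograph of a game can be paired with the n-th FEN. This is a minimal sketch that assumes the third-party python-chess package is installed; the helper name `fens_from_pgn` is hypothetical and is not the script used in this repository.

```python
import io

import chess.pgn  # third-party: python-chess (assumed to be installed)


def fens_from_pgn(pgn_text: str) -> list:
    """Return the FEN of the starting position and of every position
    reached along the main line of the first game in `pgn_text`."""
    game = chess.pgn.read_game(io.StringIO(pgn_text))
    if game is None:
        raise ValueError("no game found in PGN text")
    board = game.board()
    fens = [board.fen()]
    for move in game.mainline_moves():
        board.push(move)  # play the move, then record the new position
        fens.append(board.fen())
    return fens


if __name__ == "__main__":
    for index, fen in enumerate(fens_from_pgn("1. e4 e5 2. Nf3 *")):
        print(index, fen)
```

A game with n half-moves yields n + 1 FENs, so a move-by-move photo series should have one more image than the game has half-moves if the starting position is photographed too.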

The dataset will be made publicly available. It consists of pictures taken move by move from both perspectives (white and black) of famous chess games, including:

Morphy's Opera Game
Alekhine - Nimzowitsch (1930)
Tal - Hjartarson (1987)

Installation

To run the project I suggest using Python 3.8.

conda create -n chessvision python=3.8
conda activate chessvision
pip install -r requirements.txt
pip install cairosvg

If you run into installation issues with cairosvg, run

conda install cairo pango gdk-pixbuf libffi cairosvg

Usage

Dataset

If you have your own set of images to evaluate, numbered in any ascending order, you can use the create_dataset.py script to move them to the dataset folder. The script takes a custom folder, which may contain subfolders, preprocesses the images, and saves them in the output folder. Pass the --rembg flag to remove the background from your images; this may help if you have noisy backgrounds with a lot of lines.

python data_processing/create_dataset.py <source_dir> <dest_dir> # optionally append --rembg
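To make the expected input concrete, here is a minimal sketch of the collection step described above: walk a source folder (including subfolders), sort the images in natural numeric order, and copy them into the destination folder under sequential names. It is not the actual create_dataset.py; the names `collect_images`, `natural_key`, and `IMAGE_SUFFIXES`, the accepted extensions, and the output naming scheme are all assumptions, and the preprocessing and background-removal steps are left out.

```python
import re
import shutil
from pathlib import Path

IMAGE_SUFFIXES = {".jpg", ".jpeg", ".png"}  # assumed set of accepted formats


def natural_key(path):
    """Sort key that orders 'img2' before 'img10'."""
    return [int(tok) if tok.isdigit() else tok.lower()
            for tok in re.split(r"(\d+)", str(path))]


def collect_images(source_dir, dest_dir):
    """Copy all images under source_dir into dest_dir as 0000.ext, 0001.ext, ..."""
    source, dest = Path(source_dir), Path(dest_dir)
    dest.mkdir(parents=True, exist_ok=True)
    images = sorted(
        (p for p in source.rglob("*")
         if p.is_file() and p.suffix.lower() in IMAGE_SUFFIXES),
        key=lambda p: natural_key(p.relative_to(source)),
    )
    copied = []
    for index, src in enumerate(images):
        target = dest / f"{index:04d}{src.suffix.lower()}"
        shutil.copy2(src, target)  # copy rather than move, so the source stays intact
        copied.append(target)
    return copied
```

Natural sorting matters here because the order of the images is the order of the moves: with plain string sorting, `img10.jpg` would come before `img2.jpg`. The destination folder should sit outside the source folder, otherwise earlier outputs would be picked up on a second run.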

For training, evaluating and testing the model, or for corner detection, please refer to the chesscog repository. After populating the train partition of the dataset (data://transfer_learning/train) with images, run the following:

python -m chesscog.transfer_learning.create_dataset
python -m chesscog.transfer_learning.train
python -m chesscog.transfer_learning.evaluate 
# this will return some basic statistics for the model

If you find the results satisfactory, copy the model from runs/ to models://transfer_learning/ and run the main script. It works on a sequence of images: run main.py with the path to the folder containing the images as an argument.

python main.py path/to/folder

Future Work

The project is still in progress. The following steps are to be implemented:

Parsing from video

The final goal of this project is to parse a chess game from a video. This work is still in progress and yet to be developed.

Testing and Evaluation

The model's ability to generalise needs further testing. I plan to train it on different chess boards and chess sets and measure how many shots of the game board are needed to parse the game successfully. For example, if the game is played on a green-and-white board, how many shots of that board are needed to parse occupancy reliably? The same kind of experiment will be repeated with different chess sets.

Inference Results

The inference process is still faulty, but the occupancy classifier detects nearly all pieces and rarely produces false negatives. This means that illogical board states can be filtered out.

As the confusion matrix shows, the piece classifier mostly has trouble discriminating between queens, bishops and pawns. This is due to their similarity from a top-view perspective. The model also has more trouble classifying the black pieces, as they are too matte.
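For readers who want to reproduce this kind of analysis on their own images, the sketch below counts (true, predicted) label pairs and lists the most frequent confusions. The function names are hypothetical, the labels are plain FEN-style piece symbols, and no results from this project are embedded in the code.

```python
from collections import Counter


def confusion_counts(true_labels, predicted_labels):
    """Count how often each (true, predicted) label pair occurs."""
    if len(true_labels) != len(predicted_labels):
        raise ValueError("label lists must have the same length")
    return Counter(zip(true_labels, predicted_labels))


def most_confused(counts, top=3):
    """Return the `top` most frequent misclassifications as
    ((true, predicted), count) tuples, most frequent first."""
    errors = {pair: n for pair, n in counts.items() if pair[0] != pair[1]}
    return sorted(errors.items(), key=lambda kv: (-kv[1], kv[0]))[:top]
```

Feeding it the ground-truth pieces parsed from the PGN and the classifier's predictions for the same squares gives a quick view of which piece pairs deserve more training images.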

This implementation can filter out the following:

A piece that moves from one square to another but is misclassified in the second board state.
Pawns appearing on the first or last rank.
Pieces appearing on squares that are unreachable via legal moves.
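Two of the checks listed above can be sketched in a few lines of plain Python. This is an illustrative sketch, not the code in this repository: the board is assumed to be a dictionary mapping square names such as 'e4' to FEN piece symbols, and all function names are hypothetical. The transition check is only a necessary condition: a single legal move changes the contents of 2 squares (3 for en passant, 4 for castling), so any other count signals a classifier error or a skipped move.

```python
def drop_impossible_pawns(board):
    """Remove pawns standing on rank 1 or rank 8, which cannot occur in a legal game.

    `board` maps squares like 'e4' to piece symbols ('P', 'p', 'N', ...).
    """
    return {sq: pc for sq, pc in board.items()
            if not (pc in ("P", "p") and sq[1] in ("1", "8"))}


def changed_squares(before, after):
    """Squares whose contents differ between two board states, sorted by name."""
    squares = set(before) | set(after)
    return sorted(sq for sq in squares if before.get(sq) != after.get(sq))


def is_plausible_transition(before, after):
    """True if the number of changed squares is compatible with one legal move.

    Necessary but not sufficient: it does not check that the move itself is legal.
    """
    return 2 <= len(changed_squares(before, after)) <= 4
```

When a transition is flagged as implausible, the changed squares give a short list of candidates to re-check against the legal moves available in the previous position.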

Sample initial position
Example of misclassified piece corrected with logic
Chess move recognised despite two mistakes by the classifier
A beautiful checkmating sacrifice
The disappearing and reappearing f2-pawn
Same issue
