Experiments in NLP

The code in this repository is for a course in Experiments in NLP at the Vrije Universiteit Amsterdam (2025).

The experiments performed in this repository are designed to analyze the learning dynamics of input and contextual embeddings of BERT-like models.

Instructions

This repository contains several notebooks, for:

Training an LTG-BERT model on the WikiText-103 dataset.
Training a Wiki2Vec model on the WikiText-103 dataset.
Create a dataset of target words for embedding analysis.
Analyze the embeddings of the LTG-BERT model.
Analyze the embeddings of the Wiki2Vec model.

All notebooks were written to be ran in Google Colab; file paths point to Google Drive and requirements are resolved automatically by the Colab runtime.

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
.DS_Store		.DS_Store
ENLP_WordNet (1).ipynb		ENLP_WordNet (1).ipynb
README.md		README.md
compute_metrics_BERT.ipynb		compute_metrics_BERT.ipynb
target_words_dataset.txt		target_words_dataset.txt
train_LTGBERT.ipynb		train_LTGBERT.ipynb
train_Word2Vec_model_and_compute_ metrics.ipynb		train_Word2Vec_model_and_compute_ metrics.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Experiments in NLP

Instructions

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Experiments in NLP

Instructions

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages