kaiser-factorial/d3_slang

Slang InfoViz Final Project

Observable notebook: Slang visualization prototype

This project is ongoing, and both the analysis pipeline and visualization work are still being refined.

This project compares slang usage across two datasets:

  • a 2020-2025 Gen Z slang dataset
  • a 2010 Twitter slang dataset

The repository combines data cleaning, annotation, filtering, sentiment analysis, and exploratory summary generation for an information visualization project.

Repository Layout

Key Data Files

2010 Twitter Workflow

The 2010 Twitter dataset was built in stages:

  1. clean the raw source rows
  2. create a term-level annotation table
  3. merge term annotations back onto tweet rows
  4. annotate tweet-level usage context and irony
  5. filter out rows where the word is not actually being used as slang
  6. export an analysis-ready dataset
  7. optionally add sentiment scores
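As a rough illustration of steps 3 and 5, here is a minimal sketch using plain Python dictionaries. The column names (`term`, `definition`, `is_slang_use`) and the sample rows are hypothetical and made up for this example; the repository's actual scripts and CSV schemas may differ.

```python
# Hypothetical column names and sample rows; not taken from the real datasets.

def merge_term_annotations(tweets, term_table):
    """Step 3: attach term-level annotations to each tweet row by its slang term."""
    by_term = {row["term"]: row for row in term_table}
    merged = []
    for tweet in tweets:
        annotation = by_term.get(tweet["term"], {})
        # Copy the tweet row and add the term-level field (empty if no match).
        merged.append({**tweet, "definition": annotation.get("definition", "")})
    return merged


def keep_slang_usages(rows):
    """Step 5: drop rows where the word is not actually being used as slang."""
    # Assumes step 4 produced a yes/no usage label in "is_slang_use".
    return [row for row in rows if row.get("is_slang_use") == "yes"]


tweets = [
    {"tweet_id": "1", "term": "salty", "text": "he is so salty about the game", "is_slang_use": "yes"},
    {"tweet_id": "2", "term": "salty", "text": "these fries are too salty", "is_slang_use": "no"},
]
term_table = [{"term": "salty", "definition": "bitter or upset"}]

analysis_ready = keep_slang_usages(merge_term_annotations(tweets, term_table))
```

In this sketch only the first tweet survives the filter, because the second uses the word in its literal sense.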

Important note: several intermediate CSVs are still present because the scripts expect those filenames and use them as pipeline checkpoints. Some of them are logically redundant with later outputs, but they are useful if you want to rerun only one section of the workflow instead of rebuilding everything from scratch.
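One way to picture the checkpoint idea is a helper that reuses an existing intermediate CSV instead of rebuilding it. This is only a sketch under assumptions: `run_stage`, its parameters, and the `force` flag are hypothetical names, not functions from this repository.

```python
import csv
from pathlib import Path


def run_stage(output_path, build_rows, fieldnames, force=False):
    """Write a CSV checkpoint from build_rows(), or reuse it if it already exists."""
    output_path = Path(output_path)
    if output_path.exists() and not force:
        # Checkpoint found: load it instead of recomputing the stage.
        with output_path.open(newline="", encoding="utf-8") as handle:
            return list(csv.DictReader(handle))
    rows = build_rows()
    with output_path.open("w", newline="", encoding="utf-8") as handle:
        writer = csv.DictWriter(handle, fieldnames=fieldnames)
        writer.writeheader()
        writer.writerows(rows)
    return rows
```

Passing `force=True` rebuilds a single stage, which mirrors rerunning one section of the workflow. Note that rows loaded from a checkpoint come back with string values, since CSV does not preserve types.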

Most Important Scripts
