Skip to content

6shun/clickbait-annotation

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Annotation Project on Click-baiting YouTube titles

Annotation Project for course INFO-259 at UC Berkeley with Prof. David Bamman, Spring 2022

This repo is structured as following:

.
├── data                            
│    ├── 1_raw_data.csv                 # Raw data
│    ├── 4_annotated_data.csv           # Annotated data
│    ├── 5_adjucated.txt                # Annotated data in .txt format
│    ├── 5_individual_annotation.txt    # Annotated data with individual annotation recorded   
│    └── ap_data                        # Final data
│           └── 1 
│                ├── train.txt
│                ├── dev.txt
│                └── test.txt                  
├── docs                                 
│    ├── guidelines                     # Annotation guidelines
│    ├── guidelines_feedback.md         # Feedback: peers & instructor
│    ├── Model_analysis                 # Results and analysis
│    └── DATASHEET.md                   # Datasheet questions filled
├── notebook_data_process               # Notesbooks for preprocessing data
│    ├── AP1_preprocess.ipynb           # Cleaning raw data for annotation
│    ├── AP2_annotate_data_to_txt.ipynb # Annotated data to txt files
│    └── Data Validator.ipynb           # Validating Data for classifiers
├── notebook_classifiers                
│    ├── LogReg.ipynb                   # Logistic Regression
│    └── BERT.ipynb                     # BERT
└── README.md

About

Info259 | NLP Annotation Project

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published