Resume_parser/
│
├── README.md # Project documentation
├── Resume_parser_final.ipynb # Main notebook for development and experimentation
│
├── base_config.cfg # Base configuration for NLP model
├── config.cfg # Final training configuration file
│
├── jz_skill_patterns.jsonl # Custom skill patterns for entity recognition
├── train_data.pk # Serialized training dataset
├── train_new.spacy # Trained spaCy model data
Contains project overview, setup instructions, and usage guide.
Jupyter Notebook used for:
- Data preprocessing
- Model training
- Testing and evaluation
Initial configuration file used to define the NLP pipeline structure before training.
Final configuration file used for training the spaCy model with optimized parameters.
Contains predefined skill patterns used for:
- Named Entity Recognition (NER)
- Skill extraction from resumes
Pickle file containing the processed training dataset.
Trained spaCy model file used for:
- Extracting entities
- Running inference on resumes
- The project uses spaCy for building and training the NLP model
- Custom NER is implemented for better skill extraction
- Training data can be extended to improve model accuracy