This project scrapes news articles from BBC Indonesia, analyzes entities, and visualizes relationships using NetworkX.
Kaggle Notebook Link: https://www.kaggle.com/code/invigi/bbcscrape
- Web scraping to gather news articles from BBC Indonesia.
- Entity recognition using spaCy.
- Network visualization of entity relationships.
- Python 3.x
- Libraries:
beautifulsoup4,requests,spacy,networkx
-
Clone the repository:
git clone https://github.com/GitAJov/BBCScrape.git cd repository-name -
Install the required packages:
pip install beautifulsoup4 requests spacy networkx python -m spacy download en_core_web_sm