PriyaGandhi311/AI-webScraper

AI Web Scraper

Scrape and parse webpages with AI: extract the information you need from HTML content using LLM-powered analysis.

Demo Snapshot

image

Features

  • LLM-powered parsing using an Ollama model (llama3) through LangChain
  • Intelligent HTML DOM chunking for cleaner context
  • Structured summaries from unstructured web content
  • Interactive Streamlit UI for quick experimentation
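
As a rough illustration of the DOM chunking feature, the sketch below strips script and style content from a page and groups the remaining text lines into size-limited chunks. It is not the repository's code: the names `clean_html` and `split_into_chunks` and the 6000-character default are assumptions, and the standard-library `html.parser` stands in for BeautifulSoup so the example runs without extra dependencies.

```python
from html.parser import HTMLParser
from typing import List


class VisibleTextParser(HTMLParser):
    """Collects text nodes, ignoring anything inside <script> or <style>."""

    SKIPPED_TAGS = {"script", "style"}

    def __init__(self) -> None:
        super().__init__()
        self._skip_depth = 0
        self.lines: List[str] = []

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIPPED_TAGS:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIPPED_TAGS and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        text = data.strip()
        if text and self._skip_depth == 0:
            self.lines.append(text)


def clean_html(html: str) -> str:
    """Return the page's text content, one non-empty text node per line."""
    parser = VisibleTextParser()
    parser.feed(html)
    parser.close()
    return "\n".join(parser.lines)


def split_into_chunks(text: str, max_chars: int = 6000) -> List[str]:
    """Group whole lines into chunks of at most max_chars characters.

    A single line longer than max_chars is hard-split.
    """
    chunks: List[str] = []
    current = ""
    for line in text.splitlines():
        # Hard-split any line that cannot fit in one chunk on its own.
        while len(line) > max_chars:
            if current:
                chunks.append(current)
                current = ""
            chunks.append(line[:max_chars])
            line = line[max_chars:]
        candidate = f"{current}\n{line}" if current else line
        if len(candidate) > max_chars:
            chunks.append(current)
            current = line
        else:
            current = candidate
    if current:
        chunks.append(current)
    return chunks
```

Splitting on line boundaries keeps each text node intact where possible, which tends to give the model more coherent context than cutting at arbitrary character offsets.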

How It Works

  1. Enter a webpage URL.
  2. The scraper fetches and parses the DOM structure.
  3. The page content is split into chunks, and each chunk is sent to the LLM for interpretation.
  4. The AI returns a structured summary to the user.
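
The steps above could be wired together roughly as follows. This is a minimal sketch under stated assumptions, not the project's implementation: `fetch_html`, `parse_chunks`, and the prompt wording are hypothetical, the fetch uses the standard library instead of a browser driver, and the model is passed in as a plain callable so the example stays independent of any particular LLM client.

```python
from typing import Callable, Iterable, List
from urllib.request import Request, urlopen

# Hypothetical prompt; the repository's actual wording is not shown in the README.
PROMPT_TEMPLATE = (
    "Extract only the information that matches this description: {instruction}\n"
    "If nothing matches, return an empty string.\n\n"
    "Content:\n{content}"
)


def fetch_html(url: str, timeout: float = 30.0) -> str:
    """Download a page as text (no JavaScript rendering, no retries)."""
    request = Request(url, headers={"User-Agent": "Mozilla/5.0"})
    with urlopen(request, timeout=timeout) as response:
        charset = response.headers.get_content_charset() or "utf-8"
        return response.read().decode(charset, errors="replace")


def parse_chunks(
    chunks: Iterable[str],
    instruction: str,
    llm: Callable[[str], str],
) -> str:
    """Send each chunk to the model and join the non-empty answers."""
    answers: List[str] = []
    for chunk in chunks:
        prompt = PROMPT_TEMPLATE.format(instruction=instruction, content=chunk)
        answer = llm(prompt).strip()
        if answer:
            answers.append(answer)
    return "\n".join(answers)
```

With the `langchain-ollama` package installed and a local Ollama server running, something like `OllamaLLM(model="llama3").invoke` could be passed as `llm`; treat that wiring as an assumption to verify against the LangChain documentation.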

Built With

  • LangChain with an Ollama LLM (llama3)
  • Python
  • Streamlit
  • BeautifulSoup

About

Uses an Ollama model to scrape websites and extract the required information.
