HCKSCRP - Hacker News Scraper

A comprehensive command-line tool for scraping various sections of Hacker News, built with Go and the Colly web scraping framework.

Features

Home Page Scrapers

Front Page - Scrape the main Hacker News front page
News - Scrape the latest news stories
Ask HN - Scrape Ask Hacker News posts
Show HN - Scrape Show Hacker News posts
Jobs - Scrape job postings
New Comments - Scrape the latest comments across the site
Fetch Ask Comments - Get detailed comments for specific Ask HN posts

User-Specific Scrapers

User Submissions - View all posts submitted by a specific user
User Threads - View all comments made by a specific user
User Favorites - View posts favorited by a specific user
User Info - Get detailed profile information for any user

Installation

Clone the repository:

git clone https://github.com/pimatis/hckscrp.git
cd hckscrp

Install dependencies:

go mod tidy

Build the application:

go build -o hckscrp main.go

Usage

Run the application:

./hckscrp

Alternatively, run it without building first by using go run:

go run main.go

You'll be presented with an interactive menu:

Welcome to HCKSCRP - Hacker News Scraper
Available commands:
1. Front Page
2. News
3. Ask
4. Show
5. Jobs
6. New Comments
7. Fetch Ask Comments
8. User Submissions
9. User Threads
10. User Favorites
11. User Info
12. Exit
-----------------------------------------
Enter command (1-12):

Examples

Scraping Home Pages

Select any of options 1-6 and enter a page number when prompted
Page numbers start at 1, which is also the default
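
The page prompt described above can be sketched as follows. This is a minimal illustration, not the project's actual code: the function names parsePage and listingURL are hypothetical, and the "p" query parameter is an assumption about how later pages are requested.

```go
package main

import (
	"fmt"
	"net/url"
	"strconv"
	"strings"
)

// parsePage converts the text typed at the prompt into a page number.
// Empty, non-numeric, or non-positive input falls back to page 1.
func parsePage(input string) int {
	n, err := strconv.Atoi(strings.TrimSpace(input))
	if err != nil || n < 1 {
		return 1
	}
	return n
}

// listingURL builds a paginated listing URL for a Hacker News path.
// Assumption: the page is passed in a "p" query parameter.
func listingURL(path string, page int) string {
	u := url.URL{Scheme: "https", Host: "news.ycombinator.com", Path: "/" + path}
	q := url.Values{}
	q.Set("p", strconv.Itoa(page))
	u.RawQuery = q.Encode()
	return u.String()
}

func main() {
	fmt.Println(listingURL("news", parsePage(""))) // no input: page 1
	fmt.Println(listingURL("ask", parsePage("3")))
}
```

Falling back to page 1 on invalid input keeps the prompt forgiving; a stricter tool could instead re-prompt.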

Fetching Comments

Select option 7 and enter a specific item ID
Example: 44178902 for a specific Ask HN post
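
As a rough sketch of what happens with the item ID, the snippet below validates the input and builds the matching Hacker News item URL. The helper name itemURL is hypothetical and is not taken from the project's source.

```go
package main

import (
	"fmt"
	"strconv"
	"strings"
)

// itemURL validates a numeric item ID and returns the URL of that item
// on Hacker News. Non-numeric or zero IDs are rejected.
func itemURL(input string) (string, error) {
	id, err := strconv.ParseUint(strings.TrimSpace(input), 10, 64)
	if err != nil || id == 0 {
		return "", fmt.Errorf("invalid item ID %q", input)
	}
	return "https://news.ycombinator.com/item?id=" + strconv.FormatUint(id, 10), nil
}

func main() {
	u, err := itemURL("44178902")
	if err != nil {
		fmt.Println("error:", err)
		return
	}
	fmt.Println(u)
}
```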

User-Specific Operations

Select options 8-11 and enter a username when prompted
For submissions, threads, and favorites, you can also specify a page number
Example username: queaxtra
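
The mapping from menu options 8-11 to Hacker News user pages might look like the sketch below. The names userPaths and userURL are hypothetical. Pagination is left out because the README does not say which query parameter the tool uses for it.

```go
package main

import (
	"fmt"
	"net/url"
)

const baseURL = "https://news.ycombinator.com/"

// userPaths maps the user-specific menu options to Hacker News paths.
var userPaths = map[int]string{
	8:  "submitted", // User Submissions
	9:  "threads",   // User Threads
	10: "favorites", // User Favorites
	11: "user",      // User Info
}

// userURL returns the page URL for a menu option and username.
func userURL(option int, username string) (string, error) {
	path, ok := userPaths[option]
	if !ok {
		return "", fmt.Errorf("option %d is not a user command", option)
	}
	if username == "" {
		return "", fmt.Errorf("username must not be empty")
	}
	q := url.Values{}
	q.Set("id", username) // Encode escapes any unusual characters
	return baseURL + path + "?" + q.Encode(), nil
}

func main() {
	for option := 8; option <= 11; option++ {
		u, err := userURL(option, "queaxtra")
		if err != nil {
			fmt.Println("error:", err)
			continue
		}
		fmt.Println(u)
	}
}
```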

Output Format

All scraped data is displayed in formatted tables with relevant columns:

Story Tables Include:

Rank
Title (truncated to 50 characters)
Domain/URL
Score (when available)
Author
Time posted
Comment count
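
Title truncation to a fixed width can be sketched as below. The helper truncate is hypothetical, and the choice to end shortened titles with "..." inside the 50-character limit is an assumption; the README only states the limit itself.

```go
package main

import "fmt"

// truncate shortens s to at most max characters. It counts runes rather
// than bytes so multi-byte characters are never split.
// Assumption: shortened text ends in "..." and still fits within max.
func truncate(s string, max int) string {
	r := []rune(s)
	if len(r) <= max {
		return s
	}
	if max <= 3 {
		return string(r[:max])
	}
	return string(r[:max-3]) + "..."
}

func main() {
	title := "Show HN: A command-line tool for scraping Hacker News with Go"
	fmt.Println(truncate(title, 50))
}
```

Counting runes instead of bytes matters for titles containing non-ASCII characters, which would otherwise be cut mid-character.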

Comment Tables Include:

Comment ID
Author
Time posted
Content (truncated for readability)
Story context

User Info Table Includes:

Username
Account creation date
Karma score
About section (if available)

Dependencies

Colly - Web scraping framework
go-pretty - Table formatting

Additional Features

Pagination Support: The home page scrapers and the user submissions, threads, and favorites scrapers accept a page number
Automatic URL Handling: Properly formats both external and internal Hacker News links
Error Handling: Graceful handling of network errors and missing data
Clean Output: Formatted tables with appropriate column widths and text truncation
Interactive Menu: Easy-to-use command-line interface
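
To illustrate the URL handling item above: Hacker News uses relative links such as item?id=... for internal posts and absolute links for external stories. One way to normalize both, shown here as a sketch with the hypothetical helper resolveLink, is to resolve every link against the site's base URL using Go's standard net/url package.

```go
package main

import (
	"fmt"
	"net/url"
)

// resolveLink turns a link found on a Hacker News page into an absolute
// URL. Relative links are resolved against the site root; absolute links
// are returned unchanged.
func resolveLink(href string) (string, error) {
	base, err := url.Parse("https://news.ycombinator.com/")
	if err != nil {
		return "", err
	}
	ref, err := url.Parse(href)
	if err != nil {
		return "", err
	}
	return base.ResolveReference(ref).String(), nil
}

func main() {
	for _, href := range []string{"item?id=44178902", "https://example.com/post"} {
		abs, err := resolveLink(href)
		if err != nil {
			fmt.Println("error:", err)
			continue
		}
		fmt.Println(abs)
	}
}
```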

Rate Limiting

Please be respectful when using this scraper:

Don't make too many rapid requests
Consider adding delays between requests for heavy usage
Follow Hacker News' robots.txt and terms of service

Contributing

Feel free to submit issues and enhancement requests!

License

This project is open source and available under the MIT License.

Created by Pimatis Labs
