Scraper for entry and mid level sw jobs
Go to file
hook-lord 8a5e665f77
All checks were successful
Build and Push Docker Image / build-and-push (push) Successful in 28s
updated cache path
2024-12-06 22:13:38 +01:00
.gitea/workflows hmmmm.... 2024-12-06 19:42:16 +01:00
.gitignore added scraper for it-jobbank 2024-06-11 11:38:05 +02:00
Dockerfile named build stage 2024-12-06 19:30:08 +01:00
go.mod initial commit 2024-06-08 16:11:40 +02:00
go.sum initial commit 2024-06-08 16:11:40 +02:00
main.go updated cache path 2024-12-06 22:13:38 +01:00
readme.md Update readme.md 2024-07-30 13:12:34 +00:00
run-scrapers.sh added run script 2024-06-10 11:53:27 +02:00

IT jobs scraper

This is a simple scraper that extracts job details from the The Hub website and itjobbank.

Filtering

The scraper filters out a list of keywords like senior, architect etc. as I wrote it for entry and mid level roles. It also filters out unpaid form the hub, and keyword based from itjobbank.

Usage

To run the scraper, simply execute the following command:

go run scraper.go

The scraper will create a thehub.json and itjobbank.json file in the current directory, which contains a list of job details in JSON format. It caches the pages, so very light on resources and requests.