jobTakt: German Job Market Intelligence
A data engineering project processing 18K+ jobs daily.
Data Insight
View InsightsA Glimpse of the Pipeline
Data flows from multiple sources, is processed and stored, then served to the frontend.
Core Project Features
This project demonstrates a wide range of data engineering principles and practices.
All job data is stored in a single, unified PostgreSQL database, acting as the single source of truth.
The data pipeline runs automatically multiple times a day, ensuring the job board is always up-to-date.
Jobs from various sources are transformed and loaded into a standardized format for consistency.
The frontend allows users to filter jobs by location, keywords, and other criteria in real-time.
The architecture is built to handle a growing number of data sources and increasing data volume with ease.
The system logs recent activities and key metrics, providing a clear view of the pipeline's health and performance.
Recent Pipeline Activity
A live look at the most recent data harvesting and processing events.
Scraped DHL Group: Found 4135 jobs (2357 new).
13 minutes ago
Scraped DeepL: Found 64 jobs (0 new).
17 minutes ago
Scraped vivid: Found 15 jobs (0 new).
17 minutes ago
Scraped Auxmoney: Found 11 jobs (0 new).
17 minutes ago
Scraped Bundesagentur für Arbeit: Found 0 jobs (0 new).
17 minutes ago