jobTakt: German Job Market Intelligence

A data engineering project processing 18K+ jobs daily.

Data Insight

View Insights

A Glimpse of the Pipeline

Data flows from multiple sources, is processed and stored, then served to the frontend.

Data Sources
Smart Scripts
Validation
Database
API
Web UI
Click to explore the full architecture

Core Project Features

This project demonstrates a wide range of data engineering principles and practices.

Centralized Database

All job data is stored in a single, unified PostgreSQL database, acting as the single source of truth.

Automated Refresh

The data pipeline runs automatically multiple times a day, ensuring the job board is always up-to-date.

Data Unification

Jobs from various sources are transformed and loaded into a standardized format for consistency.

Dynamic Filtering

The frontend allows users to filter jobs by location, keywords, and other criteria in real-time.

Scalable Design

The architecture is built to handle a growing number of data sources and increasing data volume with ease.

Pipeline Monitoring

The system logs recent activities and key metrics, providing a clear view of the pipeline's health and performance.

Recent Pipeline Activity

A live look at the most recent data harvesting and processing events.

  • Scraped Bundesagentur für Arbeit: Found 14356 jobs (0 new).

    1 hours ago

  • Scraped DHL Group: Found 4135 jobs (2357 new).

    2 hours ago

  • Scraped DeepL: Found 64 jobs (0 new).

    2 hours ago

  • Scraped vivid: Found 15 jobs (0 new).

    2 hours ago

  • Scraped Auxmoney: Found 11 jobs (0 new).

    2 hours ago