Commit History
Technical log of changes made to the project, ordered newest first.
March 2026
docs: restructure nav, add Mermaid click-to-enlarge
- Renamed doc files to logical names (introduction, implementation, classes, database)
- Added click-to-enlarge overlay for all Mermaid diagrams
- Flattened nav structure, split into developer/management docs
docs: add MkDocs site, GitHub Actions deploy workflow
- Set up MkDocs 1.6.x with Material theme
- Added Cloudflare Pages deployment for private repo hosting
- Created `requirements-docs.txt` with pinned versions
docs: add README and JobsAustria pipeline documentation
- Added `README.md` with project overview
- Created data flow sequence diagram
- Created Stage 2 class diagram for `JobsAustriaCacheSynchronizer`
- Created ERD with real schema and data lifecycle table
- Created implementation details doc
pipeline: JobsAustria cache synchronizer and detail enrichment
- Built `JobsAustriaCacheSynchronizer`: two-queue design, links scrape_cache → jobs via url_hash
- Built `JobsAustriaDetailsETL` + `PortalRouter`: fires Apify detail actors in batches of 100, max 3 concurrent
- Confirmed `jobs_austria_cache_key_sync.py` is a duplicate and safe to delete
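The "batches of 100, max 3 concurrent" pattern noted for `JobsAustriaDetailsETL` can be sketched roughly as follows. This is a minimal illustration, not the project's code: `chunk`, `run_batches`, and `fake_actor_run` are hypothetical names, and the real Apify actor call is replaced by a stand-in coroutine.

```python
import asyncio

BATCH_SIZE = 100     # from the log entry: batches of 100
MAX_CONCURRENT = 3   # from the log entry: max 3 concurrent


def chunk(items, size):
    """Split a list into consecutive batches of at most `size` items."""
    return [items[i:i + size] for i in range(0, len(items), size)]


async def run_batches(urls, run_batch,
                      batch_size=BATCH_SIZE, max_concurrent=MAX_CONCURRENT):
    """Call `run_batch` on every batch, with at most `max_concurrent` in flight."""
    semaphore = asyncio.Semaphore(max_concurrent)

    async def guarded(batch):
        # The semaphore blocks a fourth batch until one of the three finishes.
        async with semaphore:
            return await run_batch(batch)

    # gather returns results in the same order as the batches were submitted.
    return await asyncio.gather(*(guarded(b) for b in chunk(urls, batch_size)))


async def fake_actor_run(batch):
    """Hypothetical stand-in for an Apify detail actor run (assumption)."""
    await asyncio.sleep(0)
    return len(batch)
```

With 250 URLs, `asyncio.run(run_batches(urls, fake_actor_run))` would process three batches (100, 100, and 50 items), all eligible to run at once under the limit of 3.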
pipeline: JobsAustria ETL cache import and payload processing
- Built `JobsAustriaETLCache`: async producer/consumer, full refresh scraper using asyncio.gather
- Built `JobsAustriaCacheProcess`: seeds jobs table from scrape_cache payload JSON
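For readers unfamiliar with the async producer/consumer shape mentioned for `JobsAustriaETLCache`, a generic sketch using `asyncio.Queue` and `asyncio.gather` is shown here. All names (`producer`, `consumer`, `full_refresh`, `fake_store`) are hypothetical, and the actual class may be organized differently.

```python
import asyncio

SENTINEL = object()  # unique marker telling a consumer to stop


async def producer(queue, pages, n_consumers):
    """Put every scraped page on the queue, then one stop marker per consumer."""
    for page in pages:
        await queue.put(page)
    for _ in range(n_consumers):
        await queue.put(SENTINEL)


async def consumer(queue, handle, results):
    """Take pages off the queue and process them until a stop marker arrives."""
    while True:
        item = await queue.get()
        if item is SENTINEL:
            break
        results.append(await handle(item))


async def full_refresh(pages, handle, n_consumers=2, maxsize=10):
    """Run one producer and several consumers concurrently via asyncio.gather."""
    queue = asyncio.Queue(maxsize=maxsize)  # bounded queue applies backpressure
    results = []
    await asyncio.gather(
        producer(queue, pages, n_consumers),
        *(consumer(queue, handle, results) for _ in range(n_consumers)),
    )
    return results


async def fake_store(page):
    """Hypothetical stand-in for writing a page to the cache table (assumption)."""
    await asyncio.sleep(0)
    return page * 2
```

Result order depends on how the consumers interleave, so callers that need a stable order should sort or key the results themselves.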