Skip to content

JobsAustriaDetailsETL

File: jobs_austria_details_scraping.py
Inherits:

Fetches job URLs from the DB that still need detail extraction, fires them at the ams_details Apify actor in batches of URL_BATCH_SIZE, then streams results back and persists them.

Class Diagram

classDiagram
    class JobsAustriaDetailsETL {
        +__init__()
        -_create_engine()
        +async connect()
        +async disconnect()
        -async _fetch_pending_urls() list
        -_build_configs() list
        -_start_actors() list
        -async _drain_generator()
        +async produce_apify_results()
        +async consume_apify_results()
        +async produce_and_consume()
        +async run()
    }
Hold "Ctrl" to enable pan & zoom

Methods

Method Parameters Returns
__init__()
_create_engine()
connect()
disconnect()
_fetch_pending_urls() list
_build_configs() urls list
_start_actors() configs list
_drain_generator() generator: ApifyGenerator
produce_apify_results()
consume_apify_results()
produce_and_consume()
run()

Attributes

No class-level attributes.