_fetch_fk_pending_batch()

Class: JobsAustriaCacheSynchronizer
File: jobs_austria_cache_synchronizer.py · line 42

Fetches one batch of scrape_cache rows that are not yet linked to a job (fk_job_id IS NULL) and therefore still need synchronize_fk_id.

Signature

Parameters none
Returns pd.DataFrame
Async No
Visibility Private

Implementation

def _fetch_fk_pending_batch(self) -> pd.DataFrame:
    """Rows not yet linked to a job — need synchronize_fk_id."""
    query = (
        "SELECT id AS scrape_cache_id, url_hash, data_payload "
        f"FROM scrape_cache WHERE fk_job_id IS NULL LIMIT {BATCH_SIZE}"
    )
    with self.engine.connect() as conn:
        return pd.read_sql(query, conn)
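The selection logic can be tried in isolation. The following is a minimal sketch, not the project's code: it swaps the engine and pandas for the standard-library sqlite3 module, assumes a hypothetical four-column scrape_cache schema, and uses a made-up BATCH_SIZE of 2. It also departs from the listing in two ways, both labeled in the comments: the limit is passed as a bound parameter instead of being interpolated with an f-string, and an ORDER BY id is added so that repeated calls return a predictable batch.

```python
import sqlite3

BATCH_SIZE = 2  # hypothetical value; the real constant is defined elsewhere


def fetch_fk_pending_batch(conn, batch_size=BATCH_SIZE):
    """Return up to batch_size rows whose fk_job_id is still NULL."""
    query = (
        "SELECT id AS scrape_cache_id, url_hash, data_payload "
        "FROM scrape_cache WHERE fk_job_id IS NULL "
        # Assumption: ORDER BY and the bound LIMIT are not in the original query.
        "ORDER BY id LIMIT ?"
    )
    return conn.execute(query, (batch_size,)).fetchall()


# Hypothetical schema, reduced to the columns the query touches.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE scrape_cache ("
    "id INTEGER PRIMARY KEY, url_hash TEXT, data_payload TEXT, fk_job_id INTEGER)"
)
conn.executemany(
    "INSERT INTO scrape_cache (url_hash, data_payload, fk_job_id) VALUES (?, ?, ?)",
    [("a1", "{}", None), ("b2", "{}", 7), ("c3", "{}", None), ("d4", "{}", None)],
)

# Only unlinked rows are returned, capped at BATCH_SIZE.
pending = fetch_fk_pending_batch(conn)
```

Without an ORDER BY, the database is free to return any BATCH_SIZE unlinked rows, which is acceptable as long as each processed row gets its fk_job_id set before the next fetch.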