Skip to content

SPR-946: Nightly failures - Deadlock with wait_map, partition table out of order sync #1104

@garthgoodson

Description

@garthgoodson
field value
url https://linear.app/springtail/issue/SPR-946/nightly-failures-deadlock-with-wait-map-partition-table-out-of-order
author @
date Wed Aug 13 22:58:11 PDT 2025
state Done
project
priority No priority
assignee @
labels
related SPR-850
parent
children
PRs
attachments #587 #600
  1. Deadlock:
log_reader internal worker waits at: issue_resync_and_wait and blocks log reader 
because of fixed size internal queue and the copy thread waits at wait_for_state:

      │   LOG_DEBUG(LOG_PG_LOG_MGR, "Wait for state syncing, db: {}", _db_id);
      │   // ensure the pipeline was stalled before we complete
      │   _internal_state.wait_for_state(STATE_SYNCING);
  1. Partition table sync issue:
    Also when the parent table and child table are resynced in parallel, there are times these tables' ddl go out of order causing parent table to lose partition.

cc: @garth @craig @Arun

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions