Holesky Incident Debrief, February 26, 2025

Holesky Incident Debrief Notes

State of Holesky

  • The chain is operating with limited block production on the “canonical” minority chain
    • Currently seeing 4-8 blocks per epoch (12-25% of expected blocks)
    • Goal is to reach 75% block production before coordinating mass slashings
    • More peers on the correct chain improves synchronization capabilities
    • Stable block production is critical for maintaining chain liveness

Immediate Action Items (Before ACD Meeting)

  1. Priority: Restore Validator Operations

    • Focus on getting Holesky validators and full nodes back online on the correct chain
    • Client teams to share their latest valid versions on the ACD agenda
    • Instructions for validators:
  2. Sepolia Fork Planning

    • So far, consensus is to proceed with Sepolia fork as scheduled, to confirm on ACDE tomorrow
    • Client teams to confirm their releases for the fork
    • Any concerns about proceeding should be raised immediately
  3. Testing Infrastructure

    • EthPandaOps to begin preparations for pectra-devnet-7, which will:
      • Support approximately 1 million validators (comparable to mainnet)
      • Allow validator handoffs to staking pools, infrastructure providers, and DVT clusters
      • Include a faucet for manual validator deposits
      • Provide a platform for testing consolidations that can’t be tested on Sepolia

ACD Follow Ups

  1. Holesky Recovery Assessment

    • If missed slot rate is <25%, begin coordinating controlled slashings of client team validators
    • If above 25%, determine steps to improve block production
  2. Sepolia Fork Confirmation

    • Final decision on fork timing
    • Client teams to confirm release readiness
  3. Mainnet Preparation Requirements

    • Discuss criteria and testing needed before mainnet fork

Slashing Strategy

  • Once more validators are back online, estimate how many are slasheable
  • Coordinate slashings in controlled batches to avoid overwhelming the network
  • Begin with core developer validators and Genesis validators
  • Process will take several weeks to reach finalization (limited by exit rate of 8 per epoch)
  • Validators reaching 0 balance through slashing penalties will eventually exit

Node Operator Guidance

  • For synced nodes: Enable validators with slashing protection ON
  • For unsynced nodes: Update to latest client release and resync
  • Important note: If validators can attest without disabling slashing protection, they likely didn’t attest to the invalid chain

Next Steps for ACDC (Following Week)

  • Teams to share retrospectives on the incident
  • Discuss long-term mitigations for similar issues
  • Evaluate the effectiveness of the recovery process
  • Plan for comprehensive testing of consolidations on devnet-7
4 Likes