Owner: @Mehmet

Reviewers: 🟢@Álvaro Castro-Castilla 🟢@Daniel Sanchez Quiros

Introduction

In data availability sampling, the goal is to ensure that light nodes—which are resource-constrained and cannot store or download the entire dataset—can reliably verify data availability by downloading only small, random portions of the dataset. This verification process prevents malicious actors from making data unavailable while evading detection.

This document examines the setup for NomosDA, a data availability protocol that uses erasure coding to protect against data withholding attacks. We evaluate two sampling strategies—chunk sampling and column sampling—and derive formulas for detecting unavailable shares with specified confidence levels. We then analyze how effectively each strategy performs under practical system parameters.

Overview

This document analyzes the NomosDA data availability protocol, focusing on two sampling strategies—chunk sampling and column sampling—to ensure reliable data verification for light nodes. The key findings are summarized below:

  1. Efficiency Comparison:
  2. Robustness Against Adversaries:
  3. Practical Implications:

These results highlight the practical advantages of Column Sampling in terms of both efficiency and simplicity, making it the preferred strategy for NomosDA in most scenarios.

Data Encoding and Distribution

Sampling Strategies

Two types of data availability sampling strategies are analyzed:

  1. Chunk Sampling: The light node samples small chunks (individual elements) from random row-column coordinates.
  2. Column Sampling: The light node samples entire columns of the matrix.

Adversary Model