As an initial trial, I performed calculations for encoding a 1MB dataset as a $256\times256$ matrix over a finite field. This was done to establish a baseline for comparison and to analyze how ZODA behaves under these conditions.
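To make the setup concrete, a quick back-of-the-envelope calculation gives the size of each matrix element. This assumes the full 1MB payload maps exactly onto the $256\times256$ grid, which is my assumption rather than something stated above; the actual field element size depends on the chosen field.

```python
# Rough layout check for the baseline experiment.
# Assumption (not stated above): the 1MB payload exactly fills the 256x256 grid.
DATA_BYTES = 1 * 1024 * 1024      # 1MB dataset
ROWS, COLS = 256, 256             # original (pre-encoding) matrix dimensions

bytes_per_element = DATA_BYTES // (ROWS * COLS)
print(f"elements: {ROWS * COLS}, bytes per element: {bytes_per_element}")
# -> elements: 65536, bytes per element: 16
```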
In Nomos DA v1, one of the primary reasons for working with smaller data sizes was the increase in the number of columns as the dataset grows. This leads to a rise in the polynomial degree, which in turn makes proof generation more complex and computationally expensive. However, ZODA proves to be more efficient for encoding larger datasets, which is an important aspect to consider for future optimizations.
The ZODA protocol processes data by selecting a predefined number of rows and columns for sampling. The verification step in ZODA consists of two primary checks on the sampled rows and columns.
In our current subnet structure, each subnet will receive an encoded row and an encoded column, and these row–column pairs will be distinct for each subnet. Once a sufficient number of subnets have performed their checks, the sampling process is considered complete.
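As a rough illustration of this per-subnet flow (not the actual Nomos implementation), the sketch below assigns each subnet a distinct encoded row–column pair, runs a simplified consistency check (just the intersection equality between the received row and column, standing in for the full ZODA checks), and declares sampling complete once a hypothetical threshold of subnets has reported success. All names and the threshold are assumptions for illustration.

```python
# Simplified sketch of the per-subnet sampling flow; names and threshold are hypothetical.
from typing import List, Tuple

def assign_pairs(num_subnets: int, num_rows: int, num_cols: int) -> List[Tuple[int, int]]:
    """Give each subnet a distinct (row index, column index) pair."""
    assert num_subnets <= min(num_rows, num_cols), "not enough distinct pairs"
    return [(i, i) for i in range(num_subnets)]  # placeholder assignment

def subnet_check(row: List[int], col: List[int], row_idx: int, col_idx: int) -> bool:
    """Stand-in consistency check: the received encoded row and encoded column
    must agree at their intersection. The real ZODA verification is stronger
    (commitment openings plus an encoding-consistency check)."""
    return row[col_idx] == col[row_idx]

def sampling_complete(successes: int, threshold: int) -> bool:
    """Sampling is considered complete once enough subnets report a successful check."""
    return successes >= threshold
```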
For an error probability of $2^{-7}$, approximately 18 samples are required. If a lower error probability is required, the number of samples increases accordingly.
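The sample count can be sanity-checked with the usual sampling bound: if a withholding adversary can pass any single row/column sample with probability at most $q$, then $n$ samples miss the attack with probability $q^n$. Assuming $q = 3/4$ (the figure commonly used for a rate-1/2 two-dimensional extension; this is my assumption, not stated above), roughly 17–18 samples push the failure probability below $2^{-7}$:

```python
import math

def samples_needed(target_error_bits: int, per_sample_pass_prob: float) -> int:
    """Smallest n such that per_sample_pass_prob**n <= 2**(-target_error_bits)."""
    return math.ceil(-target_error_bits * math.log(2) / math.log(per_sample_pass_prob))

# Assumption: an adversary withholding enough data to block reconstruction
# still passes a single sample with probability at most 3/4.
print(samples_needed(7, 0.75))   # -> 17, in line with the ~18 samples quoted above
```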
For each subnet, the transmitted data consists of its assigned encoded row and encoded column.
Thus, the total data size per node is estimated to be 16.5KB.
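For reference, the 16.5KB figure is consistent with a node holding one encoded row and one encoded column. The sketch below reproduces the estimate under assumptions that are mine, not stated above: 16-byte elements (1MB spread over the $256\times256$ grid), a 2x extension in each dimension, and a small allowance for commitments/openings.

```python
# Rough per-node bandwidth estimate; the extension factor, element size and
# proof overhead below are assumptions for illustration.
ELEMENT_BYTES = 16          # 1MB / (256 * 256) elements
EXTENDED_LEN = 2 * 256      # row/column length after an assumed 2x extension
PROOF_OVERHEAD = 512        # assumed allowance for commitments / openings (bytes)

row_bytes = EXTENDED_LEN * ELEMENT_BYTES      # 8192 bytes
col_bytes = EXTENDED_LEN * ELEMENT_BYTES      # 8192 bytes
total_kb = (row_bytes + col_bytes + PROOF_OVERHEAD) / 1024
print(f"~{total_kb:.1f} KB per node")         # -> ~16.5 KB
```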
Encoding matrices should be treated as global parameters. These matrices can be accessed on-chain, eliminating the need for additional transmission.