I've mentioned a couple of times that I don't think parallel block scanning will give a noticeable performance improvement, but I've never really explained why. I made some graphs trying to illustrate my point. Here's a bunch of blocks, from $1$ to $n$:
Assume that every block has $m$ transactions for simplicity.
For scanning silent payments, the main computational burden is calculating the candidate scriptpubkey. This is an operation that has to be done for every transaction, meaning that for $n$ blocks of $m$ transactions, we need to perform $n \times m$ operations.
However, since transactions are independent of one another, these operations can be done in parallel. Currently we parallelize this process on the block level, meaning we parallelize the transactions in a single block. This looks like this:
All transactions in the green square can be done in parallel.
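The block-level approach can be sketched as follows. This is a toy model, not the actual scanning code: `candidate_spk` is a hypothetical placeholder (a plain hash) standing in for the real candidate-scriptpubkey computation, and blocks are just lists of dummy transaction bytes.

```python
from concurrent.futures import ThreadPoolExecutor
import hashlib

def candidate_spk(tx: bytes) -> bytes:
    # Placeholder for the expensive per-transaction step
    # (computing the candidate scriptpubkey).
    return hashlib.sha256(tx).digest()

def scan_block(block_txs, pool):
    # Block-level parallelism: all transactions of ONE block
    # are dispatched to the pool at once.
    return list(pool.map(candidate_spk, block_txs))

# 3 toy blocks of 4 dummy transactions each.
blocks = [[bytes([b, t]) for t in range(4)] for b in range(3)]

with ThreadPoolExecutor(max_workers=3) as pool:  # k = 3 "cores"
    # Blocks are processed sequentially; only the inner loop is parallel.
    results = [scan_block(block, pool) for block in blocks]
```

The outer loop over blocks stays sequential; moving it into the pool as well is the "parallel block scanning" alternative discussed below.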
Another option is to scan blocks in parallel as well. This way, all transactions can be done in parallel:
However, in practice, I think it will look more like this:
This is because the target device is limited in how many actions it can perform in parallel. In other words: there is a maximum size to the green square. If a computer has $k$ cores available, it can process $k$ transactions in parallel and the square is of size $k$.
Here's another illustration of the two approaches, taking the limit on parallelization into account ($k=3$):
To summarize, it does not make a difference whether the parallelized transactions are all part of the same block $[1,1] \dots [1,k]$ or of different blocks $[1,1] \dots [k,1]$. Ultimately we need to process every block, meaning we need to process every transaction in the $n \times m$ grid.
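The argument above can be checked with a back-of-the-envelope model. Assuming every transaction costs one unit of time and at most $k$ run concurrently, the wall-clock time is just the number of parallel "rounds", $\lceil nm/k \rceil$, regardless of how the $k$ slots are distributed across blocks. The function name `scan_rounds` is my own for this sketch:

```python
import math

def scan_rounds(n_blocks: int, txs_per_block: int, k_cores: int) -> int:
    """Idealized number of parallel rounds to process the whole
    n x m grid of transactions, k at a time. The partitioning of
    work into blocks doesn't appear anywhere: only the total does."""
    total_txs = n_blocks * txs_per_block
    return math.ceil(total_txs / k_cores)

# 10 blocks of 300 txs on 8 cores: 3000 / 8 -> 375 rounds,
# whether the 8 concurrent txs come from one block or from eight.
print(scan_rounds(10, 300, 8))  # 375
```

Only if blocks were smaller than $k$ (so block-level parallelism leaves cores idle at block boundaries) would scanning blocks in parallel help, and real blocks contain far more transactions than a typical device has cores.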