Ledger Chunk download API #7550

achamayou · 2026-01-06T11:15:57Z

Adds an API that lets a ledger backup process without access to the ledger storage directories efficiently fetch committed ledger chunks for archival/retention purposes.

HEAD/GET /node/ledger-chunk
HEAD/GET /node/ledger-chunk/{chunk_name}

Typical Scenario

sequenceDiagram
  Note over Client: Client asks for chunk starting at index
  Client->>+Backup: GET /node/ledger-chunk?since=index
  Backup->>-Client: 308 Location: /node/ledger-chunk/ledger_startIndex_endIndex.committed
  Note over Backup: Backup node has that chunk
  Client->>+Backup: GET /node/ledger-chunk/ledger_startIndex_endIndex.committed
  Backup->>-Client: 200 <Chunk Contents>
  Client->>+Backup: GET /node/ledger-chunk?since=endIndex+1
  Note over Backup: Backup node does not yet have a committed chunk starting at endIndex+1
  Backup->>-Client: 308 Location: https://primary/node/ledger-chunk?since=endIndex+1
  Client->>+Primary: GET /node/ledger-chunk?since=endIndex+1
  Primary->>-Client: 308 Location: /node/ledger-chunk/ledger_endIndex+1_nextEndIndex.committed
  Client->>+Primary: GET /node/ledger-chunk/ledger_endIndex+1_nextEndIndex.committed
  Note over Primary: But the Primary node has the most recent chunk already
  Primary->>-Client: 200 <Chunk Contents>
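
A client-side view of the flow above can be sketched as follows. This is a minimal illustration, not the PR's test code: `download_chunk` and the injected `get` transport are hypothetical names, and `get` is assumed to be a thin wrapper around an HTTP client with automatic redirects disabled, returning `(status, headers, body)`.

```python
from typing import Callable, Dict, Tuple
from urllib.parse import urljoin

# (status, headers, body) as returned by the hypothetical `get` transport.
Response = Tuple[int, Dict[str, str], bytes]


def download_chunk(
    get: Callable[[str], Response], url: str, max_hops: int = 8
) -> Tuple[str, bytes]:
    """Follow 308 redirects from `url` until a node answers 200 with a chunk."""
    for _ in range(max_hops):
        status, headers, body = get(url)
        if status == 200:
            # The final URL names the chunk that was actually served.
            return url, body
        if status == 308:
            # Location may be relative (same node) or absolute (another node).
            url = urljoin(url, headers["Location"])
            continue
        raise RuntimeError(f"unexpected status {status} for {url}")
    raise RuntimeError("too many redirects")
```

The hop limit is an assumption added for safety: two nodes that each lack the requested chunk could otherwise redirect to each other indefinitely.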

Alternative Scenario

The initial node that the client hits (Primary in this case) has started from a snapshot, and does not have some past chunks. To make this more readable, let's say that Primary started from snapshot_100.committed and locally has:

ledger_1-50.committed
ledger_101-150.committed

Backup has:

ledger_1-50.committed
ledger_51-100.committed

sequenceDiagram
  Client->>+Primary: GET /node/ledger-chunk?since=51
  Primary->>-Client: 308 Location: https://backup/node/ledger-chunk?since=51
  Client->>+Backup: GET /node/ledger-chunk?since=51
  Backup->>-Client: 308 Location: /node/ledger-chunk/ledger_51-100.committed
  Client->>+Backup: GET /node/ledger-chunk/ledger_51-100.committed
  Backup->>-Client: 200 <Chunk Contents>
  Client->>+Backup: GET /node/ledger-chunk?since=101
  Note over Backup: Backup node does not have 101-150
  Backup->>-Client: 308 Location: https://primary/node/ledger-chunk?since=101
  Client->>+Primary: GET /node/ledger-chunk?since=101
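
The redirect decision each node makes in this scenario can be sketched as below. This is an illustration only and not the logic in file_serving_handlers.h: the helper names are hypothetical, chunk names are assumed to follow the `ledger_<start>-<end>.committed` form used above, and a chunk is assumed to match when `since` falls inside its index range.

```python
import re
from typing import Iterable, Optional, Tuple

# Assumed naming convention, taken from the example file names above.
CHUNK_RE = re.compile(r"^ledger_(\d+)-(\d+)\.committed$")


def parse_chunk_name(name: str) -> Optional[Tuple[int, int]]:
    """Return (start, end) for a committed chunk name, or None if it does not match."""
    m = CHUNK_RE.match(name)
    if m is None:
        return None
    return int(m.group(1)), int(m.group(2))


def chunk_containing(names: Iterable[str], since: int) -> Optional[str]:
    """Return the locally held chunk whose index range includes `since`, if any."""
    for name in names:
        rng = parse_chunk_name(name)
        if rng is not None and rng[0] <= since <= rng[1]:
            return name
    return None


def redirect_location(local_chunks: Iterable[str], since: int, peer_base: str) -> str:
    """Pick the Location value for a 308: a local chunk if held, else a peer node."""
    name = chunk_containing(local_chunks, since)
    if name is not None:
        return f"/node/ledger-chunk/{name}"
    return f"{peer_base}/node/ledger-chunk?since={since}"
```

With Primary holding `ledger_1-50.committed` and `ledger_101-150.committed`, `redirect_location(..., 51, "https://backup")` yields the cross-node redirect from the first step of the diagram, while the same call on Backup's chunk list yields the relative path to `ledger_51-100.committed`.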

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

Co-authored-by: Eddy Ashton <edashton@microsoft.com>

achamayou · 2026-01-16T16:55:42Z

This is blocked on #7576 and #7578 being merged and backported.

Copilot

Pull request overview

This PR introduces a new API for downloading committed ledger chunks, designed to enable ledger backup processes that don't have direct access to ledger storage directories. The implementation adds HTTP endpoints /node/ledger-chunk and /node/ledger-chunk/{chunk_name} with intelligent redirection logic to handle scenarios where nodes may not have all historical chunks (e.g., when started from a snapshot).

Changes:

New ledger chunk download API with HEAD/GET endpoints supporting byte-range requests
Serialization of ledger access using mutexes to ensure thread-safe concurrent access
fsync of committed chunks before closing to ensure file integrity for remote serving
New ReadLedgerSubsystem interface for accessing ledger metadata
Comprehensive end-to-end tests covering chunk access, redirection, and gap scenarios
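
The fsync item in the list above can be illustrated with a short sketch. The actual change lives in the C++ host code (src/host/ledger.h); the Python below only shows the general pattern of flushing a file to stable storage before it is closed and exposed under its committed name. The `commit_chunk` helper and the rename step are assumptions made for illustration.

```python
import os


def commit_chunk(path: str, data: bytes) -> str:
    """Write `data`, fsync it, then expose it under a `.committed` name."""
    committed = path + ".committed"
    with open(path, "wb") as f:
        f.write(data)
        # Flush Python's userspace buffer, then ask the OS to persist to disk,
        # so that a reader serving the file remotely never sees a short chunk.
        f.flush()
        os.fsync(f.fileno())
    # Rename only after the contents are durable (illustrative assumption).
    os.replace(path, committed)
    return committed
```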

Reviewed changes

Copilot reviewed 17 out of 17 changed files in this pull request and generated 4 comments.

File	Description
tests/schema.py	Added test registration for ledger chunk download
tests/infra/remote.py	Added helper to get main ledger directory
tests/infra/node.py	Added version check for LedgerChunkRead feature and helper method
tests/infra/interfaces.py	Enabled LedgerChunkRead operator feature on file serving interface
tests/e2e_operations.py	Added comprehensive tests for chunk access, redirection scenarios, and gap handling
src/node/rpc/node_frontend.h	Updated API version to 5.0.1
src/node/rpc/ledger_subsystem.h	New subsystem implementation for ledger read operations
src/node/rpc/ledger_interface.h	New interface definition for ledger subsystem
src/node/rpc/file_serving_handlers.h	Implemented ledger chunk endpoints with redirection logic and byte-range support
src/host/test/ledger.cpp	Updated ledger test capture to include file sizes
src/host/run.cpp	Added ledger parameter to enclave creation
src/host/ledger.h	Added state_lock for thread safety, fsync for committed files, new query methods
src/enclave/main.cpp	Updated to pass ledger reference to enclave
src/enclave/entry_points.h	Updated signature to accept ledger reference
src/enclave/enclave.h	Installed ReadLedgerSubsystem in node context
include/ccf/http_consts.h	Added CCF_LEDGER_CHUNK_NAME header constant
doc/schemas/node_openapi.json	Added OpenAPI schema for new endpoints, updated version

eddyashton

LGTM!

The consolidation of helper functions in file_serving_handlers.h is great. I think we could go slightly further and factor out the ?since= param lookup, and possibly standardise the pattern for subsystem lookup, but these can be deferred to a future PR. I'm a little suspicious of the concept of init_idx, but it looks like it's populated and tested correctly so I'm happy.

achamayou · 2026-01-28T17:20:23Z

I think we could go slightly further and factor out the ?since= param lookup, and possibly standardise the pattern for subsystem lookup, but these can be deferred to a future PR.

I had a go, but it's mandatory in one case and optional in the other, and so it's a little awkward. I'll try again.
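
One possible shape for a shared `?since=` lookup that covers both the mandatory and the optional case is sketched here. The PR's handlers are C++, so this Python helper (`get_since`, a hypothetical name) only illustrates the idea of parameterising the lookup on whether the value is required.

```python
from typing import Optional
from urllib.parse import parse_qs


def get_since(query: str, required: bool) -> Optional[int]:
    """Parse `since` from a raw query string; None if absent and not required."""
    values = parse_qs(query).get("since")
    if not values:
        if required:
            raise ValueError("missing required 'since' query parameter")
        return None
    # int() raises ValueError for non-numeric input, which callers can map to a 400.
    return int(values[0])
```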

Co-authored-by: Eddy Ashton <edashton@microsoft.com>

achamayou · 2026-01-28T17:35:52Z

I'm a little suspicious of the concept of init_idx, but it looks like it's populated and tested correctly so I'm happy.

I am also unhappy with that because LedgerFiles clearly has too much state already, and this is more state. But on the other hand, this is the right cutoff, and there is no other obvious way to get it.

Add state lock

ad87404

achamayou added the bench-ab label Jan 6, 2026

achamayou and others added 4 commits January 6, 2026 16:27

ledger

2743a79

subsystem

5ab5677

Merge branch 'main' into thread_safe_ledger_iface

2853106

ledger-chunk

91b7248

achamayou removed the bench-ab label Jan 8, 2026

achamayou and others added 5 commits January 8, 2026 16:19

wip

e5ba6ec

Better CBOR wrappers (#7549)

d673acc

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

EverCBOR wrapper in cbor test for cose headers (#7564)

5076c85

download

bfd4ce6

chunk download

998253c

achamayou changed the title ~~[Draft] Thread-safe ledger file access interface~~ [Draft] Ledger Chunk download API Jan 12, 2026

achamayou and others added 5 commits January 12, 2026 16:41

Merge branch 'main' into thread_safe_ledger_iface

919d7b0

test all the chunks

8f97131

redirect

b3282fe

Test failover to primary

c503805

re-enable other operations tests

c5efe64

andpiccione reviewed Jan 13, 2026

eddyashton reviewed Jan 13, 2026

achamayou and others added 8 commits January 13, 2026 10:47

Update src/node/rpc/file_serving_handlers.h

3b1e636

Co-authored-by: Eddy Ashton <edashton@microsoft.com>

LedgerChunkRead

a60ca7a

Merge branch 'main' into thread_safe_ledger_iface

a369b71

tweaks

e4e051c

Factor out the node configuration getting

5998a88

Do not set flags on builds that don't support them

c996518

Merge branch 'main' into thread_safe_ledger_iface

cd7c909

Merge branch 'main' into thread_safe_ledger_iface

2523a94

Merge branch 'main' into thread_safe_ledger_iface

f3d9644

achamayou requested a review from a team as a code owner January 27, 2026 21:12

Copilot AI review requested due to automatic review settings January 27, 2026 21:12

Copilot started reviewing on behalf of achamayou January 27, 2026 21:12

Copilot AI reviewed Jan 27, 2026

achamayou removed the bench-ab label Jan 28, 2026

achamayou changed the title ~~[Draft] Ledger Chunk download API~~ Ledger Chunk download API Jan 28, 2026

achamayou mentioned this pull request Jan 28, 2026

The range handling is inconsistent with HTTP Range semantics #7626

Open

achamayou and others added 2 commits January 28, 2026 14:41

fixes

b3131f4

Merge branch 'main' into thread_safe_ledger_iface

d51c3ed

eddyashton reviewed Jan 28, 2026

eddyashton reviewed Jan 28, 2026

eddyashton reviewed Jan 28, 2026

eddyashton reviewed Jan 28, 2026

eddyashton approved these changes Jan 28, 2026

achamayou and others added 4 commits January 28, 2026 17:32

get_ledger_main_dir

2074a5c

No other nodes

9fc6595

Update src/node/rpc/file_serving_handlers.h

1fca344

Co-authored-by: Eddy Ashton <edashton@microsoft.com>

clarity

1b7bcd1

achamayou and others added 7 commits January 28, 2026 17:42

schema

8f8ed24

fmt

24bd782

doc

4f73661

doc

8a260e9

fmt

50d3a62

mermaid

9e29391

Merge branch 'main' into thread_safe_ledger_iface

14395f4

achamayou enabled auto-merge (squash) January 29, 2026 18:24

achamayou merged commit 0e2a429 into main Jan 29, 2026
17 checks passed

achamayou deleted the thread_safe_ledger_iface branch January 29, 2026 18:53
