Logo ROOT  
Reference Guide
 
Loading...
Searching...
No Matches
ROOT::Experimental::Internal::RPageSinkBuf Class Reference

Wrapper sink that coalesces cluster column page writes.

Definition at line 41 of file RPageSinkBuf.hxx.

Classes

class  RColumnBuf
 A buffered column. More...
 
struct  RCounters
 I/O performance counters that get registered in fMetrics. More...
 

Public Member Functions

 RPageSinkBuf (const RPageSinkBuf &)=delete
 
 RPageSinkBuf (RPageSinkBuf &&)=default
 
 RPageSinkBuf (std::unique_ptr< RPageSink > inner)
 
 ~RPageSinkBuf () override
 
ColumnHandle_t AddColumn (DescriptorId_t fieldId, RColumn &column) final
 Register a new column.
 
std::uint64_t CommitCluster (NTupleSize_t nNewEntries) final
 Finalize the current cluster and create a new one for the following data.
 
void CommitClusterGroup () final
 Write out the page locations (page list envelope) for all the committed clusters since the last call of CommitClusterGroup (or the beginning of writing).
 
void CommitDatasetImpl () final
 
void CommitPage (ColumnHandle_t columnHandle, const RPage &page) final
 Write a page to the storage. The column must have been added before.
 
void CommitSealedPage (DescriptorId_t physicalColumnId, const RSealedPage &sealedPage) final
 Write a preprocessed page to storage. The column must have been added before.
 
void CommitSealedPageV (std::span< RPageStorage::RSealedPageGroup > ranges) final
 Write a vector of preprocessed pages to storage. The corresponding columns must have been added before.
 
void CommitStagedClusters (std::span< RStagedCluster > clusters) final
 Commit staged clusters, logically appending them to the ntuple descriptor.
 
void CommitSuppressedColumn (ColumnHandle_t columnHandle) final
 Commits a suppressed column for the current cluster.
 
const RNTupleDescriptorGetDescriptor () const final
 Return the RNTupleDescriptor being constructed.
 
void InitImpl (RNTupleModel &model) final
 
RPageSinkBufoperator= (const RPageSinkBuf &)=delete
 
RPageSinkBufoperator= (RPageSinkBuf &&)=default
 
RPage ReservePage (ColumnHandle_t columnHandle, std::size_t nElements) final
 Get a new, empty page for the given column that can be filled with up to nElements; nElements must be larger than zero.
 
RStagedCluster StageCluster (NTupleSize_t nNewEntries) final
 Stage the current cluster and create a new one for the following data.
 
void UpdateExtraTypeInfo (const RExtraTypeInfoDescriptor &extraTypeInfo) final
 Adds an extra type information record to schema.
 
void UpdateSchema (const RNTupleModelChangeset &changeset, NTupleSize_t firstEntry) final
 Incorporate incremental changes to the model into the ntuple descriptor.
 
- Public Member Functions inherited from ROOT::Experimental::Internal::RPageSink
 RPageSink (const RPageSink &)=delete
 
 RPageSink (RPageSink &&)=default
 
 RPageSink (std::string_view ntupleName, const RNTupleWriteOptions &options)
 
 ~RPageSink () override
 
void CommitDataset ()
 Run the registered callbacks and finalize the current cluster and the entrire data set.
 
void DropColumn (ColumnHandle_t) final
 Unregisters a column.
 
virtual RSinkGuard GetSinkGuard ()
 
EPageStorageType GetType () final
 Whether the concrete implementation is a sink or a source.
 
const RNTupleWriteOptionsGetWriteOptions () const
 Returns the sink's write options.
 
void Init (RNTupleModel &model)
 Physically creates the storage container to hold the ntuple (e.g., a keys a TFile or an S3 bucket) Init() associates column handles to the columns referenced by the model.
 
bool IsInitialized () const
 
RPageSinkoperator= (const RPageSink &)=delete
 
RPageSinkoperator= (RPageSink &&)=default
 
void RegisterOnCommitDatasetCallback (Callback_t callback)
 The registered callback is executed at the beginning of CommitDataset();.
 
- Public Member Functions inherited from ROOT::Experimental::Internal::RPageStorage
 RPageStorage (const RPageStorage &other)=delete
 
 RPageStorage (RPageStorage &&other)=default
 
 RPageStorage (std::string_view name)
 
virtual ~RPageStorage ()
 
ColumnId_t GetColumnId (ColumnHandle_t columnHandle) const
 
virtual Detail::RNTupleMetricsGetMetrics ()
 Returns the default metrics object.
 
const std::string & GetNTupleName () const
 Returns the NTuple name.
 
RPageStorageoperator= (const RPageStorage &other)=delete
 
RPageStorageoperator= (RPageStorage &&other)=default
 
void SetTaskScheduler (RTaskScheduler *taskScheduler)
 

Private Member Functions

void ConnectFields (const std::vector< RFieldBase * > &fields, NTupleSize_t firstEntry)
 
void FlushClusterImpl (std::function< void(void)> FlushClusterFn)
 

Private Attributes

std::vector< RColumnBuffBufferedColumns
 Vector of buffered column pages. Indexed by column id.
 
std::unique_ptr< RCountersfCounters
 
std::unique_ptr< RNTupleModelfInnerModel
 The buffered page sink maintains a copy of the RNTupleModel for the inner sink.
 
std::unique_ptr< RPageSinkfInnerSink
 The inner sink, responsible for actually performing I/O.
 
DescriptorId_t fNColumns = 0
 
DescriptorId_t fNFields = 0
 
std::vector< ColumnHandle_tfSuppressedColumns
 Columns committed as suppressed are stored and passed to the inner sink at cluster commit.
 

Additional Inherited Members

- Public Types inherited from ROOT::Experimental::Internal::RPageSink
using Callback_t = std::function< void(RPageSink &)>
 
- Public Types inherited from ROOT::Experimental::Internal::RPageStorage
using ColumnHandle_t = RColumnHandle
 The column handle identifies a column with the current open page storage.
 
using SealedPageSequence_t = std::deque< RSealedPage >
 
- Static Public Member Functions inherited from ROOT::Experimental::Internal::RPageSink
static RSealedPage SealPage (const RSealPageConfig &config)
 Seal a page using the provided info.
 
- Static Public Attributes inherited from ROOT::Experimental::Internal::RPageStorage
static constexpr std::size_t kNBytesPageChecksum = sizeof(std::uint64_t)
 The page checksum is a 64bit xxhash3.
 
- Protected Member Functions inherited from ROOT::Experimental::Internal::RPageSink
RSealedPage SealPage (const RPage &page, const RColumnElementBase &element)
 Helper for streaming a page.
 
- Protected Member Functions inherited from ROOT::Experimental::Internal::RPageStorage
void WaitForAllTasks ()
 
- Protected Attributes inherited from ROOT::Experimental::Internal::RPageSink
std::unique_ptr< RNTupleCompressorfCompressor
 Helper to zip pages and header/footer; includes a 16MB (kMAXZIPBUF) zip buffer.
 
std::unique_ptr< RNTupleWriteOptionsfOptions
 
- Protected Attributes inherited from ROOT::Experimental::Internal::RPageStorage
Detail::RNTupleMetrics fMetrics
 
std::string fNTupleName
 
std::unique_ptr< RPageAllocatorfPageAllocator
 For the time being, we will use the heap allocator for all sources and sinks. This may change in the future.
 
RTaskSchedulerfTaskScheduler = nullptr
 

#include <ROOT/RPageSinkBuf.hxx>

Inheritance diagram for ROOT::Experimental::Internal::RPageSinkBuf:
[legend]

Constructor & Destructor Documentation

◆ RPageSinkBuf() [1/3]

ROOT::Experimental::Internal::RPageSinkBuf::RPageSinkBuf ( std::unique_ptr< RPageSink inner)
explicit

Definition at line 34 of file RPageSinkBuf.cxx.

◆ RPageSinkBuf() [2/3]

ROOT::Experimental::Internal::RPageSinkBuf::RPageSinkBuf ( const RPageSinkBuf )
delete

◆ RPageSinkBuf() [3/3]

ROOT::Experimental::Internal::RPageSinkBuf::RPageSinkBuf ( RPageSinkBuf &&  )
default

◆ ~RPageSinkBuf()

ROOT::Experimental::Internal::RPageSinkBuf::~RPageSinkBuf ( )
override

Definition at line 47 of file RPageSinkBuf.cxx.

Member Function Documentation

◆ AddColumn()

ROOT::Experimental::Internal::RPageStorage::ColumnHandle_t ROOT::Experimental::Internal::RPageSinkBuf::AddColumn ( DescriptorId_t  fieldId,
RColumn column 
)
finalvirtual

Register a new column.

When reading, the column must exist in the ntuple on disk corresponding to the meta-data. When writing, every column can only be attached once.

Implements ROOT::Experimental::Internal::RPageStorage.

Definition at line 56 of file RPageSinkBuf.cxx.

◆ CommitCluster()

std::uint64_t ROOT::Experimental::Internal::RPageSinkBuf::CommitCluster ( NTupleSize_t  nNewEntries)
finalvirtual

Finalize the current cluster and create a new one for the following data.

Returns the number of bytes written to storage (excluding meta-data).

Reimplemented from ROOT::Experimental::Internal::RPageSink.

Definition at line 253 of file RPageSinkBuf.cxx.

◆ CommitClusterGroup()

void ROOT::Experimental::Internal::RPageSinkBuf::CommitClusterGroup ( )
finalvirtual

Write out the page locations (page list envelope) for all the committed clusters since the last call of CommitClusterGroup (or the beginning of writing).

Implements ROOT::Experimental::Internal::RPageSink.

Definition at line 275 of file RPageSinkBuf.cxx.

◆ CommitDatasetImpl()

void ROOT::Experimental::Internal::RPageSinkBuf::CommitDatasetImpl ( )
finalvirtual

Implements ROOT::Experimental::Internal::RPageSink.

Definition at line 282 of file RPageSinkBuf.cxx.

◆ CommitPage()

void ROOT::Experimental::Internal::RPageSinkBuf::CommitPage ( ColumnHandle_t  columnHandle,
const RPage page 
)
finalvirtual

Write a page to the storage. The column must have been added before.

Implements ROOT::Experimental::Internal::RPageSink.

Definition at line 140 of file RPageSinkBuf.cxx.

◆ CommitSealedPage()

void ROOT::Experimental::Internal::RPageSinkBuf::CommitSealedPage ( DescriptorId_t  physicalColumnId,
const RSealedPage sealedPage 
)
finalvirtual

Write a preprocessed page to storage. The column must have been added before.

Implements ROOT::Experimental::Internal::RPageSink.

Definition at line 211 of file RPageSinkBuf.cxx.

◆ CommitSealedPageV()

void ROOT::Experimental::Internal::RPageSinkBuf::CommitSealedPageV ( std::span< RPageStorage::RSealedPageGroup ranges)
finalvirtual

Write a vector of preprocessed pages to storage. The corresponding columns must have been added before.

Implements ROOT::Experimental::Internal::RPageSink.

Definition at line 217 of file RPageSinkBuf.cxx.

◆ CommitStagedClusters()

void ROOT::Experimental::Internal::RPageSinkBuf::CommitStagedClusters ( std::span< RStagedCluster clusters)
finalvirtual

Commit staged clusters, logically appending them to the ntuple descriptor.

Implements ROOT::Experimental::Internal::RPageSink.

Definition at line 268 of file RPageSinkBuf.cxx.

◆ CommitSuppressedColumn()

void ROOT::Experimental::Internal::RPageSinkBuf::CommitSuppressedColumn ( ColumnHandle_t  columnHandle)
finalvirtual

Commits a suppressed column for the current cluster.

Can be called anytime before CommitCluster(). For any given column and cluster, there must be no calls to both CommitSuppressedColumn() and page commits.

Implements ROOT::Experimental::Internal::RPageSink.

Definition at line 135 of file RPageSinkBuf.cxx.

◆ ConnectFields()

void ROOT::Experimental::Internal::RPageSinkBuf::ConnectFields ( const std::vector< RFieldBase * > &  fields,
NTupleSize_t  firstEntry 
)
private

Definition at line 61 of file RPageSinkBuf.cxx.

◆ FlushClusterImpl()

void ROOT::Experimental::Internal::RPageSinkBuf::FlushClusterImpl ( std::function< void(void)>  FlushClusterFn)
private

Definition at line 225 of file RPageSinkBuf.cxx.

◆ GetDescriptor()

const ROOT::Experimental::RNTupleDescriptor & ROOT::Experimental::Internal::RPageSinkBuf::GetDescriptor ( ) const
finalvirtual

Return the RNTupleDescriptor being constructed.

Implements ROOT::Experimental::Internal::RPageSink.

Definition at line 79 of file RPageSinkBuf.cxx.

◆ InitImpl()

void ROOT::Experimental::Internal::RPageSinkBuf::InitImpl ( RNTupleModel model)
finalvirtual

Implements ROOT::Experimental::Internal::RPageSink.

Definition at line 84 of file RPageSinkBuf.cxx.

◆ operator=() [1/2]

RPageSinkBuf & ROOT::Experimental::Internal::RPageSinkBuf::operator= ( const RPageSinkBuf )
delete

◆ operator=() [2/2]

RPageSinkBuf & ROOT::Experimental::Internal::RPageSinkBuf::operator= ( RPageSinkBuf &&  )
default

◆ ReservePage()

ROOT::Experimental::Internal::RPage ROOT::Experimental::Internal::RPageSinkBuf::ReservePage ( ColumnHandle_t  columnHandle,
std::size_t  nElements 
)
finalvirtual

Get a new, empty page for the given column that can be filled with up to nElements; nElements must be larger than zero.

Reimplemented from ROOT::Experimental::Internal::RPageSink.

Definition at line 290 of file RPageSinkBuf.cxx.

◆ StageCluster()

ROOT::Experimental::Internal::RPageSink::RStagedCluster ROOT::Experimental::Internal::RPageSinkBuf::StageCluster ( NTupleSize_t  nNewEntries)
finalvirtual

Stage the current cluster and create a new one for the following data.

Returns the object that must be passed to CommitStagedClusters to logically append the staged cluster to the ntuple descriptor.

Implements ROOT::Experimental::Internal::RPageSink.

Definition at line 261 of file RPageSinkBuf.cxx.

◆ UpdateExtraTypeInfo()

void ROOT::Experimental::Internal::RPageSinkBuf::UpdateExtraTypeInfo ( const RExtraTypeInfoDescriptor extraTypeInfo)
finalvirtual

Adds an extra type information record to schema.

The extra type information will be written to the extension header. The information in the record will be merged with the existing information, e.g. duplicate streamer info records will be removed. This method is called by the "on commit dataset" callback registered by specific fields (e.g., streamer field) and during merging.

Implements ROOT::Experimental::Internal::RPageSink.

Definition at line 128 of file RPageSinkBuf.cxx.

◆ UpdateSchema()

void ROOT::Experimental::Internal::RPageSinkBuf::UpdateSchema ( const RNTupleModelChangeset changeset,
NTupleSize_t  firstEntry 
)
finalvirtual

Incorporate incremental changes to the model into the ntuple descriptor.

This happens, e.g. if new fields were added after the initial call to RPageSink::Init(RNTupleModel &). firstEntry specifies the global index for the first stored element in the added columns.

Implements ROOT::Experimental::Internal::RPageSink.

Definition at line 92 of file RPageSinkBuf.cxx.

Member Data Documentation

◆ fBufferedColumns

std::vector<RColumnBuf> ROOT::Experimental::Internal::RPageSinkBuf::fBufferedColumns
private

Vector of buffered column pages. Indexed by column id.

Definition at line 113 of file RPageSinkBuf.hxx.

◆ fCounters

std::unique_ptr<RCounters> ROOT::Experimental::Internal::RPageSinkBuf::fCounters
private

Definition at line 106 of file RPageSinkBuf.hxx.

◆ fInnerModel

std::unique_ptr<RNTupleModel> ROOT::Experimental::Internal::RPageSinkBuf::fInnerModel
private

The buffered page sink maintains a copy of the RNTupleModel for the inner sink.

For the unbuffered case, the RNTupleModel is instead managed by a RNTupleWriter.

Definition at line 111 of file RPageSinkBuf.hxx.

◆ fInnerSink

std::unique_ptr<RPageSink> ROOT::Experimental::Internal::RPageSinkBuf::fInnerSink
private

The inner sink, responsible for actually performing I/O.

Definition at line 108 of file RPageSinkBuf.hxx.

◆ fNColumns

DescriptorId_t ROOT::Experimental::Internal::RPageSinkBuf::fNColumns = 0
private

Definition at line 117 of file RPageSinkBuf.hxx.

◆ fNFields

DescriptorId_t ROOT::Experimental::Internal::RPageSinkBuf::fNFields = 0
private

Definition at line 116 of file RPageSinkBuf.hxx.

◆ fSuppressedColumns

std::vector<ColumnHandle_t> ROOT::Experimental::Internal::RPageSinkBuf::fSuppressedColumns
private

Columns committed as suppressed are stored and passed to the inner sink at cluster commit.

Definition at line 115 of file RPageSinkBuf.hxx.

Libraries for ROOT::Experimental::Internal::RPageSinkBuf:

The documentation for this class was generated from the following files: