# rf401_importttreethx
Data and categories: advanced options for importing data from ROOT TTree and THx histograms

Basic import options are demonstrated in rf102_dataimport.C.

**Author:** Wouter Verkerke  
<i><small>This notebook tutorial was automatically generated with <a href= "https://github.com/root-project/root/blob/master/documentation/doxygen/converttonotebook.py">ROOTBOOK-izer</a> from the macro found in the ROOT repository  on Wednesday, May 15, 2024 at 09:51 AM.</small></i>

In [1]:
%%cpp -d
#include "RooRealVar.h"
#include "RooDataSet.h"
#include "RooDataHist.h"
#include "RooCategory.h"
#include "RooGaussian.h"
#include "TCanvas.h"
#include "TAxis.h"
#include "RooPlot.h"
#include "TH1.h"
#include "TTree.h"
#include "TRandom.h"
#include <map>

using namespace RooFit;

TH1 *makeTH1(const char *name, double mean, double sigma);
TTree *makeTTree();

Helper function that creates a ROOT TH1 filled with a Gaussian distribution:

In [2]:
%%cpp -d
TH1 *makeTH1(const char *name, double mean, double sigma)
{
   // Create ROOT TH1 filled with a Gaussian distribution

   TH1D *hh = new TH1D(name, name, 100, -10, 10);
   for (int i = 0; i < 1000; i++) {
      hh->Fill(gRandom->Gaus(mean, sigma));
   }
   return hh;
}

Helper function that creates a ROOT TTree with double branches x, y, z and an integer branch i:

In [3]:
%%cpp -d
TTree *makeTTree()
{
   // Create ROOT TTree filled with Gaussian distributions in x and z, a uniform
   // distribution in y, and an integer index i that cycles through 0, 1, 2

   TTree *tree = new TTree("tree", "tree");
   double *px = new double;
   double *py = new double;
   double *pz = new double;
   Int_t *pi = new Int_t;
   tree->Branch("x", px, "x/D");
   tree->Branch("y", py, "y/D");
   tree->Branch("z", pz, "z/D");
   tree->Branch("i", pi, "i/I");
   for (int i = 0; i < 100; i++) {
      *px = gRandom->Gaus(0, 3);
      *py = gRandom->Uniform() * 30 - 15;
      *pz = gRandom->Gaus(0, 5);
      *pi = i % 3;
      tree->Fill();
   }
   return tree;
}

Import multiple TH1 into a RooDataHist
--------------------------------------------------------------------------

Create three ROOT TH1 histograms

In [4]:
TH1 *hh_1 = makeTH1("hh1", 0, 3);
TH1 *hh_2 = makeTH1("hh2", -3, 1);
TH1 *hh_3 = makeTH1("hh3", +3, 4);

Declare observable x

In [5]:
RooRealVar x("x", "x", -10, 10);

Create category observable c that serves as index for the ROOT histograms

In [6]:
RooCategory c("c", "c", {{"SampleA",0}, {"SampleB",1}, {"SampleC",2}});

Create a binned dataset that imports contents of all TH1 mapped by index category c

In [7]:
RooDataHist *dh = new RooDataHist("dh", "dh", x, Index(c), Import("SampleA", *hh_1), Import("SampleB", *hh_2),
                                  Import("SampleC", *hh_3));
dh->Print();

RooDataHist::dh[c,x] = 300 bins (2964 weights)
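
The printed summary can be read as 3 category states times 100 bins in x, giving 300 bins, while the weight sum stays below the 3000 generated values, presumably because fills outside [-10, 10] land in the under/overflow bins of the TH1 and are not imported. The following is a minimal plain C++ sketch of that bookkeeping. `SimpleHist`, `Summary` and `summarize` are hypothetical names introduced for illustration only; they are not ROOT or RooFit classes, and the sketch does not reproduce RooDataHist internals.

```cpp
#include <algorithm>
#include <cstddef>
#include <map>
#include <numeric>
#include <string>
#include <vector>

// Hypothetical fixed-binning histogram; a stand-in for TH1D, not a ROOT class.
struct SimpleHist {
   double lo;
   double hi;
   std::vector<double> bins;

   SimpleHist(std::size_t nBins, double lo_, double hi_) : lo(lo_), hi(hi_), bins(nBins, 0.0) {}

   // Values outside [lo, hi) are dropped in this model.
   // (A TH1 would store them in its under/overflow bins instead.)
   void fill(double v)
   {
      if (v < lo || v >= hi)
         return;
      std::size_t idx = static_cast<std::size_t>((v - lo) / (hi - lo) * bins.size());
      idx = std::min(idx, bins.size() - 1); // guard against rounding at the upper edge
      bins[idx] += 1.0;
   }
};

// Total number of bins and sum of bin contents over all samples.
struct Summary {
   std::size_t nBins;
   double sumWeights;
};

// Combine per-sample histograms keyed by category label, as the index category does.
inline Summary summarize(const std::map<std::string, SimpleHist> &samples)
{
   Summary s{0, 0.0};
   for (const auto &entry : samples) {
      s.nBins += entry.second.bins.size();
      s.sumWeights += std::accumulate(entry.second.bins.begin(), entry.second.bins.end(), 0.0);
   }
   return s;
}
```

With three 100-bin samples in the map, `summarize` reports 300 bins regardless of how many values were filled, and only in-range fills contribute to the weight sum.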


Alternative constructor form for importing multiple histograms

In [8]:
std::map<std::string, TH1 *> hmap;
hmap["SampleA"] = hh_1;
hmap["SampleB"] = hh_2;
hmap["SampleC"] = hh_3;
RooDataHist *dh2 = new RooDataHist("dh", "dh", x, c, hmap);
dh2->Print();

RooDataHist::dh[c,x] = 300 bins (2964 weights)


Importing a TTree into a RooDataSet with cuts
-----------------------------------------------------------------------------------------

In [9]:
TTree *tree = makeTTree();

Define observables y,z

In [10]:
RooRealVar y("y", "y", -10, 10);
RooRealVar z("z", "z", -10, 10);

Import only observables (x,y)

In [11]:
RooDataSet ds("ds", "ds", RooArgSet(x, y), Import(*tree));
ds.Print();

[#1] INFO:DataHandling -- RooTreeDataStore::loadValues(ds) Skipping event #7 because y cannot accommodate the value 13.3845
[#1] INFO:DataHandling -- RooTreeDataStore::loadValues(ds) Skipping event #8 because y cannot accommodate the value 11.1861
[#1] INFO:DataHandling -- RooTreeDataStore::loadValues(ds) Skipping event #12 because y cannot accommodate the value 13.7009
[#1] INFO:DataHandling -- RooTreeDataStore::loadValues(ds) Skipping event #14 because y cannot accommodate the value -10.6852
[#1] INFO:DataHandling -- RooTreeDataStore::loadValues(ds) Skipping ...
RooDataSet::ds[x,y] = 65 entries
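
The INFO messages above show that an entry is skipped when one of its values does not fit the range of the corresponding observable. Since `makeTTree()` draws y uniformly from [-15, 15] while the RooRealVar y covers [-10, 10], roughly two thirds of the 100 entries are expected to survive, which is consistent with the 65 entries printed. The sketch below models that check in plain C++. `Range`, `accommodates` and `countAccepted` are hypothetical names, and treating the range as a closed interval is an assumption of this model, not a statement about RooFit internals.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical stand-in for the allowed range of an observable (not a RooFit type).
struct Range {
   double lo;
   double hi;
};

// True if the value lies inside [lo, hi].
// Assumption: both boundaries are treated as inclusive in this model.
inline bool accommodates(const Range &r, double value)
{
   return value >= r.lo && value <= r.hi;
}

// Count the entries whose y value fits the range; the others would be skipped on import.
inline std::size_t countAccepted(const std::vector<double> &yValues, const Range &yRange)
{
   std::size_t n = 0;
   for (double y : yValues) {
      if (accommodates(yRange, y))
         ++n;
   }
   return n;
}
```

For example, with `Range{-10.0, 10.0}` the logged values 13.3845, 11.1861, 13.7009 and -10.6852 are all rejected.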


Import observables (x,y,z), but only events for which (y+z<0) is true

In [12]:
RooDataSet ds2("ds2", "ds2", RooArgSet(x, y, z), Import(*tree), Cut("y+z<0"));
ds2.Print();

[#1] INFO:DataHandling -- RooTreeDataStore::loadValues(ds2) Skipping event #7 because y cannot accommodate the value 13.3845
[#1] INFO:DataHandling -- RooTreeDataStore::loadValues(ds2) Skipping event #8 because y cannot accommodate the value 11.1861
[#1] INFO:DataHandling -- RooTreeDataStore::loadValues(ds2) Skipping event #12 because y cannot accommodate the value 13.7009
[#1] INFO:DataHandling -- RooTreeDataStore::loadValues(ds2) Skipping event #14 because y cannot accommodate the value -10.6852
[#1] INFO:DataHandling -- RooTreeDataStore::loadValues(ds2) Skipping ...
RooDataSet::ds2[x,y,z] = 26 entries


Importing integer TTree branches
---------------------------------------------------------------

Import integer tree branch as RooRealVar

In [13]:
RooRealVar i("i", "i", 0, 5);
RooDataSet ds3("ds3", "ds3", RooArgSet(i, x), Import(*tree));
ds3.Print();

[#1] INFO:DataHandling -- RooAbsReal::attachToTree(i) TTree Int_t branch i will be converted to double precision.
RooDataSet::ds3[i,x] = 100 entries


Define category i

In [14]:
RooCategory icat("i", "i");
icat.defineType("State0", 0);
icat.defineType("State1", 1);

Import the integer tree branch as a RooCategory. Only events with i==0 or i==1
are imported, since those are the only defined states.

In [15]:
RooDataSet ds4("ds4", "ds4", RooArgSet(icat, x), Import(*tree));
ds4.Print();

[#1] INFO:DataHandling -- RooTreeDataStore::loadValues(ds4) Skipping event #2 because i cannot accommodate the value 0
[#1] INFO:DataHandling -- RooTreeDataStore::loadValues(ds4) Skipping event #5 because i cannot accommodate the value 0
[#1] INFO:DataHandling -- RooTreeDataStore::loadValues(ds4) Skipping event #8 because i cannot accommodate the value 0
[#1] INFO:DataHandling -- RooTreeDataStore::loadValues(ds4) Skipping event #11 because i cannot accommodate the value 0
[#1] INFO:DataHandling -- RooTreeDataStore::loadValues(ds4) Skipping ...
RooDataSet::ds4[i,x] = 67 entries
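
The entry count follows from how the branch is filled: `makeTTree()` sets i to the event number modulo 3, so among 100 events the value 0 occurs 34 times, 1 occurs 33 times and 2 occurs 33 times. With only states 0 and 1 defined, 34 + 33 = 67 entries remain. A minimal plain C++ sketch of that count follows. `countImported` is a hypothetical helper, and the `std::map` of states only stands in for the category definition; neither is part of RooFit.

```cpp
#include <cstddef>
#include <map>
#include <string>

// Count how many of nEvents entries carry an index value that matches a defined state.
inline std::size_t countImported(int nEvents, const std::map<int, std::string> &states)
{
   std::size_t n = 0;
   for (int event = 0; event < nEvents; ++event) {
      const int i = event % 3; // same filling rule as in makeTTree()
      if (states.count(i) > 0)
         ++n;
   }
   return n;
}
```

Calling `countImported(100, {{0, "State0"}, {1, "State1"}})` gives 67, and adding a third state for the value 2 would keep all 100 entries.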


Import multiple RooDataSets into a RooDataSet
----------------------------------------------------------------------------------------

Create three RooDataSets in (x,y), selected from ds2 by cuts on z

In [16]:
std::unique_ptr<RooAbsData> dsA{ds2.reduce({x, y}, "z<-5")};
std::unique_ptr<RooAbsData> dsB{ds2.reduce({x, y}, "abs(z)<5")};
std::unique_ptr<RooAbsData> dsC{ds2.reduce({x, y}, "z>5")};

Create a dataset that imports contents of all the above datasets mapped by index category c

In [17]:
RooDataSet dsABC{"dsABC", "dsABC", RooArgSet(x, y), Index(c), Import("SampleA", *dsA),
                 Import("SampleB", *dsB), Import("SampleC", *dsC)};

dsABC.Print();

RooDataSet::dsABC[x,y,c] = 26 entries
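
The three cuts used to build dsA, dsB and dsC are mutually exclusive and cover every z except exactly -5 and +5, which would explain why dsABC ends up with the same 26 entries as ds2. The sketch below expresses the three selections as one plain C++ function. `sampleFor` is a hypothetical helper written for illustration and is not part of RooFit.

```cpp
#include <cmath>
#include <string>

// Return the sample label selected by the cuts on z, or an empty string if no cut matches.
inline std::string sampleFor(double z)
{
   if (z < -5.0)
      return "SampleA"; // cut "z<-5"
   if (std::abs(z) < 5.0)
      return "SampleB"; // cut "abs(z)<5"
   if (z > 5.0)
      return "SampleC"; // cut "z>5"
   return ""; // reached only for z == -5 or z == +5
}
```

Each z value maps to at most one label, so no entry of ds2 can appear in more than one of the three subsets.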
