doc/master/portfolio_8C_source.html

/// \file

/// \ingroup tutorial_quadp

/// \notebook

/// This macro shows in detail the use of the quadratic programming package quadp .

/// Running this macro :

///

/// ~~~{.cpp}

///     .x portfolio.C+

/// ~~~

///

/// or

///

/// ~~~{.cpp}

/// gSystem->Load("libQuadp");

/// .L portFolio.C+; portfolio()

/// ~~~

///

/// Let's first review what we exactly mean by "quadratic programming" :

///

/// We want to minimize the following objective function :

///

///   \f$ c^T x + ( 1/2 ) x^T Q x \f$  wrt. the vector \f$ x \f$

///

///  \f$ c \f$ is a vector and \f$ Q \f$ a symmetric positive definite matrix

///

/// You might wonder what is so special about this objective which is quadratic in

/// the unknowns, that can not be done by Minuit/Fumili . Well, we have in addition

/// the following boundary conditions on \f$ x \f$:

///

///  \f[

///          A x =  b \\

///  clo \le  C x \le cup \\

///  xlo \le    x \le xup

///  \f]  where A and C are arbitrary matrices and the rest are vectors

///

/// Not all these constraints have to be defined . Our example will only use \f$ xlo \f$,

/// \f$ A \f$ and \f$ b \f$

/// Still, this could be handled by a general non-linear minimizer like Minuit by introducing

/// so-called "slack" variables . However, quadp is tailored to objective functions not more

/// complex than being quadratic . This allows usage of solving techniques which are even

/// stable for problems involving for instance 500 variables, 100 inequality conditions

/// and 50 equality conditions .

///

/// Enough said about quadratic programming, let's return to our example .

/// Suppose, after a long day of doing physics, you have a look at your investments and

/// realize that an early retirement is not possible, given the returns of your stocks .

/// So what now ? ROOT to the rescue ...

///

/// In 1990 Harry Markowitz was awarded the Nobel prize for economics: " his work provided new tools

/// for weighing the risks and rewards of different investments and for valuing corporate stocks and bonds" .

/// In plain English, he developed the tools to balance greed and fear, we want the maximum return

/// with the minimum amount of risk. Our stock portfolio should be at the

/// ["Efficient Frontier"](see http://www.riskglossary.com/articles/efficient_frontier.htm).

/// To quantify better the risk we are willing to take, we define a utility function \f$ U(x) \f$. It describes

/// as a function of our total assets \f$ x \f$, our "satisfaction" . A common choice is \f$ 1-exp(-k*x) \f$ (the reason for

/// the exponent will be clear later)  . The parameter \f$ k \f$ is the risk-aversion factor . For small values of \f$ k \f$

/// the satisfaction is small for small values of \f$ x \f$; by increasing \f$ x \f$ the satisfaction can still be increased

/// significantly . For large values of \f$ k \f$, \f$ U(x) \f$ increases rapidly to 1, there is no increase in satisfaction

/// for additional dollars earned .

///

/// In summary :

///  - small \f$ k \f$ ==> risk-loving investor

///  - large \f$ k \f$ ==> risk-averse investor

///

/// Suppose we have for nrStocks the historical daily returns \f$ r = closing_price(n) - closing_price(n-1) \f$.

/// Define a vector \f$ x \f$ of length of \f$ nrStocks \f$, which contains the fraction of our money invested in

/// each stock . We can calculate the average daily return \f$ z \f$ of our portfolio and its variance using

/// the portfolio covariance Covar :

///

///  \f$ z = r^T x \f$  and \f$ var = x^T Covar x \f$

///

/// Assuming that the daily returns have a Normal distribution, \f$ N(x) \f$, so will \f$ z \f$ with mean \f$ r^T x \f$

/// and variance \f$ x^T Covar x \f$

///

/// The expected value of the utility function is :

///

///  \f[

/// E(u(x)) = Int (1-exp(-k*x) N(x) dx \\

///         = 1-exp(-k (r^T x - 0.5 k x^T Covar x) ) \\

///  \f]

///

/// Its value is maximised by maximising \f$ r^T x -0.5 k x^T Covar x \f$

/// under the condition \f$ sum (x_i) = 1 \f$, meaning we want all our money invested and

/// \f$ x_i \ge 0 \f$, we can not "short" a stock

///

/// For 10 stocks we got the historical daily data for Sep-2000 to Jun-2004:

///

///  - GE   : General Electric Co

///  - SUNW : Sun Microsystems Inc

///  - QCOM : Qualcomm Inc

///  - BRCM : Broadcom Corp

///  - TYC  : Tyco International Ltd

///  - IBM  : International Business Machines Corp

///  - AMAT : Applied Materials Inc

///  - C    : Citigroup Inc

///  - PFE  : Pfizer Inc

///  - HD   : Home Depot Inc

///

/// We calculate the optimal portfolio for 2.0 and 10.0 .

///

/// Food for thought :

/// - We assumed that the stock returns have a Normal distribution . Check this assumption by

///   histogramming the stock returns !

/// - We used for the expected return in the objective function, the flat average over a time

///   period . Investment firms will put significant resources in improving the return prediction .

/// - If you want to trade significant number of shares, several other considerations have

///   to be taken into account :

///    +  If you are going to buy, you will drive the price up (so-called "slippage") .

///       This can be taken into account by adding terms to the objective

///       (Google for "slippage optimization")

///    +  FTC regulations might have to be added to the inequality constraints

/// - Investment firms do not want to be exposed to the "market" as defined by a broad

///   index like the S&P and "hedge" this exposure away . A perfect hedge this can be added

///    as an equality constrain, otherwise add an inequality constrain .

///

/// \macro_image

/// \macro_output

/// \macro_code

///

/// \author Eddy Offermann


#include "Riostream.h"

#include "TCanvas.h"

#include "TFile.h"

#include "TMath.h"

#include "TTree.h"

#include "TArrayF.h"

#include "TH1.h"

#include "TF1.h"

#include "TLegend.h"

#include "TSystem.h"


#include "TMatrixD.h"

#include "TMatrixDSym.h"

#include "TVectorD.h"

#include "TQpProbDens.h"

#include "TGondzioSolver.h"


const Int_t nrStocks = 10;

static const Char_t *stocks[] =

     {"GE","SUNW","QCOM","BRCM","TYC","IBM","AMAT","C","PFE","HD"};


class TStockDaily  {

public:

  Int_t fDate;

  Int_t fOpen;     // 100*open_price

  Int_t fHigh;     // 100*high_price

  Int_t fLow;      // 100*low_price

  Int_t fClose;    // 100*close_price

  Int_t fVol;

  Int_t fCloseAdj; // 100*close_price adjusted for splits and dividend


  TStockDaily() {

     fDate = fVol = fOpen = fHigh = fLow = fClose = fCloseAdj = 0;

  }

  virtual ~TStockDaily() {}


  ClassDef(TStockDaily,1)

};


//---------------------------------------------------------------------------

Double_t RiskProfile(Double_t *x, Double_t *par) {

  Double_t riskFactor = par[0];

  return 1-TMath::Exp(-riskFactor*x[0]);

}


//---------------------------------------------------------------------------

TArrayF &StockReturn(TFile *f,const TString &name,Int_t sDay,Int_t eDay)

{

  TTree *tDaily = (TTree*)f->Get(name);

  TStockDaily *data = 0;

  tDaily->SetBranchAddress("daily",&data);

  TBranch *b_closeAdj = tDaily->GetBranch("fCloseAdj");

  TBranch *b_date     = tDaily->GetBranch("fDate");


  //read only the "adjusted close" branch for all entries

  const Int_t nrEntries = (Int_t)tDaily->GetEntries();

  TArrayF closeAdj(nrEntries);

  for (Int_t i = 0; i < nrEntries; i++) {

    b_date->GetEntry(i);

    b_closeAdj->GetEntry(i);

    if (data->fDate >= sDay && data->fDate <= eDay)

      closeAdj[i] = data->fCloseAdj/100.;

  }


  TArrayF *r = new TArrayF(nrEntries-1);

  for (Int_t i = 1; i < nrEntries; i++)

//    (*r)[i-1] = closeAdj[i]-closeAdj[i-1];

    (*r)[i-1] = closeAdj[i]/closeAdj[i-1];


  return *r;

}


#ifndef __MAKECINT__

//---------------------------------------------------------------------------

TVectorD OptimalInvest(Double_t riskFactor,TVectorD r,TMatrixDSym Covar)

{

// what the quadratic programming package will do:

//

//  minimize    c^T x + ( 1/2 ) x^T Q x

//  subject to                A x  = b

//                    clo <=  C x <= cup

//                    xlo <=    x <= xup

// what we want :

//

//  maximize    c^T x - k ( 1/2 ) x^T Q x

//  subject to        sum_x x_i = 1

//                    0 <= x_i


  // We have nrStocks weights to determine,

  // 1 equality- and 0 inequality- equations (the simple square boundary

  // condition (xlo <= x <= xup) does not count)


  const Int_t nrVar     = nrStocks;

  const Int_t nrEqual   = 1;

  const Int_t nrInEqual = 0;


  // flip the sign of the objective function because we want to maximize

  TVectorD    c = -1.*r;

  TMatrixDSym Q = riskFactor*Covar;


  // equality equation

  TMatrixD A(nrEqual,nrVar); A = 1;

  TVectorD b(nrEqual); b = 1;


  // inequality equation

  //

  // - although not applicable in the current situation since nrInEqual = 0, one

  //   has to specify not only clo and cup but also an index vector iclo and icup,

  //   whose values are either 0 or 1 . If iclo[j] = 1, the lower boundary condition

  //   is active on x[j], etc. ...


  TMatrixD C   (nrInEqual,nrVar);

  TVectorD clo (nrInEqual);

  TVectorD cup (nrInEqual);

  TVectorD iclo(nrInEqual);

  TVectorD icup(nrInEqual);


  // simple square boundary condition : 0 <= x_i, so only xlo is relevant .

  // Like for clo and cup above, we have to define an index vector ixlo and ixup .

  // Since each variable has the lower boundary, we can set the whole vector

  // ixlo = 1


  TVectorD xlo (nrVar); xlo  = 0;

  TVectorD xup (nrVar); xup  = 0;

  TVectorD ixlo(nrVar); ixlo = 1;

  TVectorD ixup(nrVar); ixup = 0;


  // setup the quadratic programming problem . Since a small number of variables are

  // involved and "Q" has everywhere entries, we chose the dense version "TQpProbDens" .

  // In case of a sparse formulation, simply replace all "Dens" by "Sparse" below and

  // use TMatrixDSparse instead of TMatrixDSym and TMatrixD


  TQpProbDens *qp = new TQpProbDens(nrVar,nrEqual,nrInEqual);


  // stuff all the matrices/vectors defined above in the proper places


  TQpDataDens *prob = (TQpDataDens *)qp->MakeData(c,Q,xlo,ixlo,xup,ixup,A,b,C,clo,iclo,cup,icup);


  // setup the nrStock variables, vars->fX will contain the final solution


  TQpVar      *vars  = qp->MakeVariables(prob);

  TQpResidual *resid = qp->MakeResiduals(prob);


  // Now we have to choose the method of solving, either TGondzioSolver or TMehrotraSolver

  // The Gondzio method is more sophisticated and therefore numerically more involved

  // If one want the Mehrotra method, simply replace "Gondzio" by "Mehrotra" .


  TGondzioSolver *s = new TGondzioSolver(qp,prob);

  const Int_t status = s->Solve(prob,vars,resid);


  const TVectorD weight = vars->fX;


  delete qp; delete prob; delete vars; delete resid; delete s;

  if (status != 0) {

    cout << "Could not solve this problem." <<endl;

    return TVectorD(nrStocks);

  }


  return weight;

}

#endif


 //---------------------------------------------------------------------------

void portfolio()

{

   const Int_t sDay = 20000809;

   const Int_t eDay = 20040602;


   const char *fname = "stock.root";

   TFile *f = 0;

   if (!gSystem->AccessPathName(fname)) {

      f = TFile::Open(fname);

   } else if (!gSystem->AccessPathName(Form("%s/quadp/%s", TROOT::GetTutorialDir().Data(), fname))) {

      f = TFile::Open(Form("%s/quadp/%s", TROOT::GetTutorialDir().Data(), fname));

   } else {

      printf("accessing %s file from http://root.cern/files\n",fname);

      f = TFile::Open(Form("http://root.cern/files/%s",fname));

   }

   if (!f) return;


   TArrayF *data = new TArrayF[nrStocks];

   for (Int_t i = 0; i < nrStocks; i++) {

      const TString symbol = stocks[i];

      data[i] = StockReturn(f,symbol,sDay,eDay);

   }


   const Int_t nrData = data[0].GetSize();


   TVectorD r(nrStocks);

   for (Int_t i = 0; i < nrStocks; i++)

      r[i] = data[i].GetSum()/nrData;


   TMatrixDSym Covar(nrStocks);

   for (Int_t i = 0; i < nrStocks; i++) {

      for (Int_t j = 0; j <= i; j++) {

         Double_t sum = 0.;

         for (Int_t k = 0; k < nrData; k++) {

            sum += (data[i][k] - r[i]) * (data[j][k] - r[j]);

         }

         Covar(i,j) = Covar(j,i) = sum/nrData;

      }

   }


   const TVectorD weight1 = OptimalInvest(2.0,r,Covar);

   const TVectorD weight2 = OptimalInvest(10.,r,Covar);


   cout << "stock     daily  daily   w1     w2" <<endl;

   cout << "symb      return sdv              " <<endl;

   for (Int_t i = 0; i < nrStocks; i++)

      printf("%s\t: %.3f  %.3f  %.3f  %.3f\n",stocks[i],r[i],TMath::Sqrt(Covar[i][i]),weight1[i],weight2[i]);


   TCanvas *c1 = new TCanvas("c1","Portfolio Optimizations",10,10,800,900);

   c1->Divide(1,2);


   // utility function / risk profile


   c1->cd(1);

   gPad->SetGridx();

   gPad->SetGridy();


   TF1 *f1 = new TF1("f1",RiskProfile,0,2.5,1);

   f1->SetParameter(0,2.0);

   f1->SetLineColor(49);

   f1->Draw("AC");

   f1->GetHistogram()->SetXTitle("dollar");

   f1->GetHistogram()->SetYTitle("utility");

   f1->GetHistogram()->SetMinimum(0.0);

   f1->GetHistogram()->SetMaximum(1.0);

   TF1 *f2 = new TF1("f2",RiskProfile,0,2.5,1);

   f2->SetParameter(0,10.);

   f2->SetLineColor(50);

   f2->Draw("CSAME");


   TLegend *legend1 = new TLegend(0.50,0.65,0.70,0.82);

   legend1->AddEntry(f1,"1-exp(-2.0*x)","l");

   legend1->AddEntry(f2,"1-exp(-10.*x)","l");

   legend1->Draw();


   // vertical bar chart of portfolio distribution


   c1->cd(2);

   TH1F *h1 = new TH1F("h1","Portfolio Distribution",nrStocks,0,0);

   TH1F *h2 = new TH1F("h2","Portfolio Distribution",nrStocks,0,0);

   h1->SetStats(0);

   h1->SetFillColor(49);

   h2->SetFillColor(50);

   h1->SetBarWidth(0.45);

   h1->SetBarOffset(0.1);

   h2->SetBarWidth(0.4);

   h2->SetBarOffset(0.55);

   for (Int_t i = 0; i < nrStocks; i++) {

      h1->Fill(stocks[i],weight1[i]);

      h2->Fill(stocks[i],weight2[i]);

   }


   h1->Draw("BAR2 HIST");

   h2->Draw("BAR2SAME HIST");


   TLegend *legend2 = new TLegend(0.50,0.65,0.70,0.82);

   legend2->AddEntry(h1,"high risk","f");

   legend2->AddEntry(h2,"low  risk","f");

   legend2->Draw();

}

b
#define b(i)
Definition RSha256.hxx:100

f
#define f(i)
Definition RSha256.hxx:104

c
#define c(i)
Definition RSha256.hxx:101

Riostream.h

Int_t
int Int_t
Definition RtypesCore.h:45

Char_t
char Char_t
Definition RtypesCore.h:37

Double_t
double Double_t
Definition RtypesCore.h:59

ClassDef
#define ClassDef(name, id)
Definition Rtypes.h:342

TArrayF.h

TCanvas.h

TRangeDynCast
ROOT::Detail::TRangeCast< T, true > TRangeDynCast
TRangeDynCast is an adapter class that allows the typed iteration through a TCollection.
Definition TCollection.h:358

TF1.h

TFile.h

data
Option_t Option_t TPoint TPoint const char GetTextMagnitude GetFillStyle GetLineColor GetLineWidth GetMarkerStyle GetTextAlign GetTextColor GetTextSize void data
Definition TGWin32VirtualXProxy.cxx:104

r
Option_t Option_t TPoint TPoint const char GetTextMagnitude GetFillStyle GetLineColor GetLineWidth GetMarkerStyle GetTextAlign GetTextColor GetTextSize void char Point_t Rectangle_t WindowAttributes_t Float_t r
Definition TGWin32VirtualXProxy.cxx:168

name
char name[80]
Definition TGX11.cxx:110

TGondzioSolver.h

TH1.h

TLegend.h

TMath.h

TMatrixDSym.h

TMatrixD.h

TQpProbDens.h

Form
char * Form(const char *fmt,...)
Formats a string in a circular formatting buffer.
Definition TString.cxx:2489

TSystem.h

gSystem
R__EXTERN TSystem * gSystem
Definition TSystem.h:561

TTree.h

TVectorD.h

TVectorD
TVectorT< Double_t > TVectorD
Definition TVectorDfwd.h:23

gPad
#define gPad
Definition TVirtualPad.h:308

ROOT::Detail::TRangeCast
Definition TCollection.h:311

TArrayF
Array of floats (32 bits per element).
Definition TArrayF.h:27

TAttFill::SetFillColor
virtual void SetFillColor(Color_t fcolor)
Set the fill area color.
Definition TAttFill.h:38

TAttLine::SetLineColor
virtual void SetLineColor(Color_t lcolor)
Set the line color.
Definition TAttLine.h:42

TBranch
A TTree is a list of TBranches.
Definition TBranch.h:93

TCanvas
The Canvas class.
Definition TCanvas.h:23

TF1
1-Dim function class
Definition TF1.h:233

TF1::GetHistogram
virtual TH1 * GetHistogram() const
Return a pointer to the histogram used to visualise the function Note that this histogram is managed ...
Definition TF1.cxx:1584

TF1::Draw
void Draw(Option_t *option="") override
Draw this function with its current attributes.
Definition TF1.cxx:1333

TF1::SetParameter
virtual void SetParameter(Int_t param, Double_t value)
Definition TF1.h:667

TFile
A ROOT file is an on-disk file, usually with extension .root, that stores objects in a file-system-li...
Definition TFile.h:53

TFile::Open
static TFile * Open(const char *name, Option_t *option="", const char *ftitle="", Int_t compress=ROOT::RCompressionSetting::EDefaults::kUseCompiledDefault, Int_t netopt=0)
Create / open a file.
Definition TFile.cxx:4088

TGondzioSolver
Derived class of TQpSolverBase implementing Gondzio-correction version of Mehrotra's original predict...
Definition TGondzioSolver.h:57

TGondzioSolver::Solve
Int_t Solve(TQpDataBase *prob, TQpVar *iterate, TQpResidual *resid) override
Solve the quadratic programming problem as formulated through prob, store the final solution in itera...
Definition TGondzioSolver.cxx:129

TH1F
1-D histogram with a float per channel (see TH1 documentation)
Definition TH1.h:645

TH1::SetBarOffset
virtual void SetBarOffset(Float_t offset=0.25)
Set the bar offset as fraction of the bin width for drawing mode "B".
Definition TH1.h:380

TH1::Fill
virtual Int_t Fill(Double_t x)
Increment bin with abscissa X by 1.
Definition TH1.cxx:3315

TH1::Draw
void Draw(Option_t *option="") override
Draw this histogram with options.
Definition TH1.cxx:3037

TH1::SetBarWidth
virtual void SetBarWidth(Float_t width=0.5)
Set the width of bars as fraction of the bin width for drawing mode "B".
Definition TH1.h:381

TH1::SetStats
virtual void SetStats(Bool_t stats=kTRUE)
Set statistics option on/off.
Definition TH1.cxx:8992

TLegend
This class displays a legend box (TPaveText) containing several legend entries.
Definition TLegend.h:23

TMatrixTSym< Double_t >

TMatrixT< Double_t >

TQpDataDens
Data for the dense QP formulation.
Definition TQpDataDens.h:63

TQpProbDens
dense matrix problem formulation
Definition TQpProbDens.h:61

TQpResidual
The Residuals class calculates and stores the quantities that appear on the right-hand side of the li...
Definition TQpResidual.h:62

TQpVar
Class containing the variables for the general QP formulation.
Definition TQpVar.h:60

TQpVar::fX
TVectorD fX
Definition TQpVar.h:91

TROOT::GetTutorialDir
static const TString & GetTutorialDir()
Get the tutorials directory in the installation. Static utility function.
Definition TROOT.cxx:3119

TString
Basic string class.
Definition TString.h:139

TSystem::AccessPathName
virtual Bool_t AccessPathName(const char *path, EAccessMode mode=kFileExists)
Returns FALSE if one can access a file using the specified access mode.
Definition TSystem.cxx:1296

TTree
A TTree represents a columnar dataset.
Definition TTree.h:79

TVectorT< Double_t >

c1
return c1
Definition legend1.C:41

x
Double_t x[n]
Definition legend1.C:17

h1
TH1F * h1
Definition legend1.C:5

f1
TF1 * f1
Definition legend1.C:11

TMVA_SOFIE_GNN_Parser.h2
h2
Definition TMVA_SOFIE_GNN_Parser.py:188

TMath::C
constexpr Double_t C()
Velocity of light in .
Definition TMath.h:114

TMath::Exp
Double_t Exp(Double_t x)
Returns the base-e exponential function of x, which is e raised to the power x.
Definition TMath.h:713

TMath::Sqrt
Double_t Sqrt(Double_t x)
Returns the square root of x.
Definition TMath.h:666

sum
static uint64_t sum(uint64_t i)
Definition Factory.cxx:2345