Histogram Factory

The number of arguments passed to hist() is large and usually a source of code repetition. The HistogramFactory is a way to define default arguments that can be overridden when creating a histogram.
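The idea of stored defaults with per-call overrides can be illustrated without the library. The following is a minimal sketch, not the implementation of HistogramFactory: the class DefaultArgsFactory and the function describe_hist are hypothetical names made up for this illustration, and the only assumption is that keyword arguments given at call time take precedence over the stored defaults.

```python
from typing import Any, Callable


class DefaultArgsFactory:
    """Hypothetical stand-in that stores keyword defaults for a plotting function."""

    def __init__(self, func: Callable[..., Any], **defaults: Any) -> None:
        self.func = func
        self.defaults = defaults

    def __call__(self, *args: Any, **overrides: Any) -> Any:
        # Per-call keyword arguments take precedence over the stored defaults.
        kwargs = {**self.defaults, **overrides}
        return self.func(*args, **kwargs)


def describe_hist(variable: str, bins: int = 10, range: tuple = (0, 1)) -> str:
    """Hypothetical placeholder for a plotting call; it only reports its arguments."""
    return f"{variable}: {bins} bins in {range}"


factory = DefaultArgsFactory(describe_hist, bins=20, range=(0, 200))

# Uses the stored defaults for bins and range.
default_call = factory("higgs_m")

# Overrides both defaults for this call only; the factory itself is unchanged.
override_call = factory("tau_pt", bins=12, range=(0, 120))

print(default_call)
print(override_call)
```

Merging the two dictionaries on every call keeps the stored defaults untouched, so one override does not leak into later calls.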

import pandas as pd
import seaborn as sns

from freeforestml import Variable, Process, Cut, hist, HistogramFactory, McStack, DataStack
from freeforestml import toydata, example_style

Load or generate the toy dataset.

df = toydata.get()

Define processes included in the histogram.

p_ztt = Process(r"$Z\rightarrow\tau\tau$", range=(0, 0))
p_sig = Process(r"Signal", range=(1, 1))
p_asimov = Process(r"Asimov", selection=lambda d: d.fpid >= 0)

Define stacks. Data is its own stack and should not be stacked on top of the MC prediction.

s_bkg = McStack(p_ztt, p_sig)
s_data = DataStack(p_asimov)


Create a default plotting method that has default values for the dataframe, the stacks and the binning.

hist_factory = HistogramFactory(df, stacks=[s_bkg, s_data], bins=20, range=(0, 200), selection=None,

Create a plot for the mass variable. Note that we pass a single argument to the plotting method.

v_mmc = Variable(r"$m^H$", "higgs_m", "GeV")
hist_factory(v_mmc)

Create plots for other variables, this time overriding the default binning.

v_tau_pT = Variable(r"$p_\mathrm{T}{\tau}$", "tau_pt", "GeV")
hist_factory(v_tau_pT, bins=12, range=(0, 120))
v_lep_pT = Variable(r"$p_\mathrm{T}{\ell}$", "lep_pt", "GeV")
hist_factory(v_lep_pT, bins=12, range=(0, 120))
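To see what the overridden binning means in practice, the bin edges can be computed by hand. This is a small sketch that assumes hist() follows the usual NumPy/matplotlib convention, where an integer bins together with range yields equal-width bins; the helper bin_edges is a hypothetical name and not part of freeforestml.

```python
def bin_edges(bins: int, value_range: tuple) -> list:
    """Return the bins + 1 edges of equal-width bins spanning value_range."""
    low, high = value_range
    width = (high - low) / bins
    return [low + i * width for i in range(bins + 1)]


# Factory default: 20 bins between 0 and 200 GeV.
default_edges = bin_edges(20, (0, 200))

# Per-call override: 12 bins between 0 and 120 GeV.
override_edges = bin_edges(12, (0, 120))

# Both settings give the same bin width; only the covered range differs.
print(default_edges[1] - default_edges[0])
print(override_edges[1] - override_edges[0])
print(len(override_edges))
```

Under this assumption the override keeps the bin width of the default binning and only shortens the axis, which suits the transverse-momentum variables.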