ROAS

A compound poisson like process?

Author

Matthew Reda

Published

December 1, 2022

What is ROAS?

ROAS or return on ad spend is a ratio of revenue driven by an advertisement campaign and the cost of that campaign \(\frac{R}{C}\). How would ROAS be distributed? We have the revenue which is roughly the number of conversions driven by an ad campaign \(N\) times the revenue driven per conversion \(P\): \(R=NP\).

How is ROAS distributed?

Let’s take the number of conversions (\(N\)) from a given number of impressions as poisson distributed (potentially with some liberties taken with the measure of impressions). \[N \sim Pois(\lambda)\]

The revenue driven per conversion (\(P\)) and ad expenditures (\(C\)) are drawn from some positive non-zero distribution I choose a lognormal distribution. \[\begin{aligned}P &\sim Lognormal(\mu_p, \sigma_p^2) \\ C &\sim Lognormal(\mu_c, \sigma_c^2)\end{aligned}\]

We can now think of ROAS as \(N\frac{P}{C}\). We can make some further simplifying assumptions:

  • The revenue driven per conversion and the average ad expenditure do not depend on the content of the creative.
  • The conversion rate is independent of the revenue driven per conversion and ad spend.
  • The only influence the content of the creative \(X\) has is on the rate of conversions \(\lambda\).

Under these assumptions: \[\begin{aligned}\mu(creative)&=E[ROAS|creative]\\&=E[N\frac{P}{C}|creative]\\&\approx E[N|creative]E[\frac{P}{C}]\\ if \, P \perp \!\!\! \perp C \, \& \, E[C]\gg1 \implies &\approx E[N|creative]\frac{E[P]}{E[C]}\end{aligned}\]

These assumptions may not be realistic. For instance the conversion rates may be lower for more expensive products meaning \(cov(N, P) \not= 0\). Or a creative can influence a person to purchase a more expensive laptop over the one they were originally going to purchase \(P|X \not= P\).

Figure 1: Log Revenue distribution for 5 geographies generated randomly from the above model.

Figure 2: Log ROAS distribution for 5 geographies generated randomly from the above model.

The Model

A linear model will be used to predict ROAS based on the content of the creative. This type of model is relatively easy to implement and given certain assumptions can produce results with nice statistical properties.

OLS

OLS requires: \[\begin{aligned} ROAS|X &= X\beta + \varepsilon \\ \varepsilon &\sim N(0, \sigma^2) \\ Var(\varepsilon|X) &= \sigma^2 \end{aligned}\]

from the above generative model this is very unlikely to be true. But we can still use OLS to get a sense of the relationship between ROAS and the creative. Lets see how well OLS performs at uncovering the relationship between ROAS and the creative.

FR:
    Accuracy: 0.64  F1_score: 0.391
    TP: 232 FP: 553
    FN: 170 TN: 1056
US:
    Accuracy: 0.709 F1_score: 0.463
    TP: 251 FP: 433
    FN: 150 TN: 1172
CA:
    Accuracy: 0.88  F1_score: 0.725
    TP: 306 FP: 150
    FN: 82  TN: 1401
DE:
    Accuracy: 0.509 F1_score: 0.366
    TP: 293 FP: 893
    FN: 120 TN: 758
UK:
    Accuracy: 0.929 F1_score: 0.824
    TP: 329 FP: 74
    FN: 67  TN: 1510
true_betas pred_betas pvalues
features
feature_0 -0.117 -0.223 0.000
feature_1 0.124 0.147 0.002
feature_2 -0.061 -0.004 0.935
feature_3 0.066 0.134 0.005
feature_4 0.135 0.213 0.000
feature_5 0.003 0.032 0.506
feature_6 0.006 0.055 0.248
feature_7 -0.000 0.002 0.970
feature_8 -0.141 -0.315 0.000
feature_9 0.017 -0.015 0.753
Table 1: True betas for the 10 features in the creative used to generate the data compared to the ols estimates and their p-values.

Figure 3: OLS predictions for the 5 geographies generated randomly from the above model.

Figure 4: OLS residuals for the 5 geographies generated randomly from the above model.