Join the newly launched Discord community for real-time discussions, peer support, and direct interaction with the Meridian team!

meridian.model.model.Meridian

View source on GitHub

Contains the main functionality for fitting the Meridian MMM model.

  meridian 
 . 
 model 
 . 
 model 
 . 
 Meridian 
 ( 
 input_data 
 : 
   meridian 
 . 
 data 
 . 
 input_data 
 . 
 InputData 
 
 
 , 
 model_spec 
 : 
 ( 
   meridian 
 . 
 model 
 . 
 spec 
 . 
 ModelSpec 
 
 
 | 
 None 
 ) 
 = 
 None 
 , 
 inference_data 
 : 
 ( 
 az 
 . 
 InferenceData 
 | 
 None 
 ) 
 = 
 None 
 )

Attributes

input_data

An InputData object containing the input data for the model.

model_spec

A ModelSpec object containing the model specification.

inference_data

A mutable arviz.InferenceData object containing the resulting data from fitting the model.

n_geos

Number of geos in the data.

n_media_channels

Number of media channels in the data.

n_rf_channels

Number of reach and frequency (RF) channels in the data.

n_organic_media_channels

Number of organic media channels in the data.

n_organic_rf_channels

Number of organic reach and frequency (RF) channels in the data.

n_controls

Number of control variables in the data.

n_non_media_channels

Number of non-media treatment channels in the data.

n_times

Number of time periods in the KPI or spend data.

n_media_times

Number of time periods in the media data.

is_national

A boolean indicating whether the data is national (single geo) or not (multiple geos).

knot_info

A KnotInfo derived from input data and model spec.

kpi

A tensor constructed from input_data.kpi .

revenue_per_kpi

A tensor constructed from input_data.revenue_per_kpi . If input_data.revenue_per_kpi is None, then this is also None.

controls

A tensor constructed from input_data.controls .

non_media_treatments

A tensor constructed from input_data.non_media_treatments .

population

A tensor constructed from input_data.population .

media_tensors

A collection of media tensors derived from input_data .

rf_tensors

A collection of Reach & Frequency (RF) media tensors.

organic_media_tensors

A collection of organic media tensors.

organic_rf_tensors

A collection of organic Reach & Frequency (RF) media tensors.

total_spend

A tensor containing total spend, including media_tensors.media_spend and rf_tensors.rf_spend .

total_outcome

A tensor containing the total outcome, aggregated over geos and times.

controls_transformer

A ControlsTransformer to scale controls tensors using the model's controls data.

non_media_transformer

A CenteringAndScalingTransformer to scale non-media treatmenttensors using the model's non-media treatment data.

kpi_transformer

A KpiTransformer to scale KPI tensors using the model's KPI data.

controls_scaled

The controls tensor normalized by population and by the median value.

non_media_treatments_scaled

The non-media treatment tensor normalized by population and by the median value.

kpi_scaled

The KPI tensor normalized by population and by the median value.

media_effects_dist

A string to specify the distribution of media random effects across geos.

unique_sigma_for_each_geo

A boolean indicating whether to use a unique residual variance for each geo.

prior_broadcast

A PriorDistribution object containing broadcasted distributions.

baseline_geo_idx

The index of the baseline geo.

holdout_id

A tensor containing the holdout id, if present.

adstock_decay_spec

Returns AdstockDecaySpec object with correctly mapped channels.

non_media_treatments_normalized

Normalized non-media treatments.

The non-media treatments values are scaled by population (for channels where non_media_population_scaling_id is True ) and normalized by centering and scaling with means and standard deviations.

posterior_sampler_callable

A PosteriorMCMCSampler callable bound to this model.

prior_sampler_callable

A PriorDistributionSampler callable bound to this model.

Methods

`adstock_hill_media`

View source

  adstock_hill_media 
 ( 
 media 
 : 
   meridian 
 . 
 backend 
 . 
 Tensor 
 
 
 , 
 alpha 
 : 
   meridian 
 . 
 backend 
 . 
 Tensor 
 
 
 , 
 ec 
 : 
   meridian 
 . 
 backend 
 . 
 Tensor 
 
 
 , 
 slope 
 : 
   meridian 
 . 
 backend 
 . 
 Tensor 
 
 
 , 
 decay_functions 
 : 
 ( 
 str 
 | 
 Sequence 
 [ 
 str 
 ]) 
 = 
 constants 
 . 
 GEOMETRIC_DECAY 
 , 
 n_times_output 
 : 
 ( 
 int 
 | 
 None 
 ) 
 = 
 None 
 ) 
 -> 
   meridian 
 . 
 backend 
 . 
 Tensor

Transforms media or using Adstock and Hill functions in the desired order.

Args

media

Tensor of dimensions (n_geos, n_media_times, n_media_channels) containing non-negative media execution values. Typically this is impressions, but it can be any metric, such as media_spend . Clicks are often used for paid search ads.

alpha

Uniform distribution for Adstock and Hill calculations.

ec

Shifted half-normal distribution for Adstock and Hill calculations.

slope

Deterministic distribution for Adstock and Hill calculations.

decay_functions

String or sequence of strings denoting the adstock decay function(s) for each channel. Default: 'geometric'.

n_times_output

Number of time periods to output. This argument is optional when the number of time periods in media equals self.n_media_times , in which case n_times_output defaults to self.n_times .

Returns

Tensor with dimensions [..., n_geos, n_times, n_media_channels] representing Adstock and Hill-transformed media.

`adstock_hill_rf`

View source

  adstock_hill_rf 
 ( 
 reach 
 : 
   meridian 
 . 
 backend 
 . 
 Tensor 
 
 
 , 
 frequency 
 : 
   meridian 
 . 
 backend 
 . 
 Tensor 
 
 
 , 
 alpha 
 : 
   meridian 
 . 
 backend 
 . 
 Tensor 
 
 
 , 
 ec 
 : 
   meridian 
 . 
 backend 
 . 
 Tensor 
 
 
 , 
 slope 
 : 
   meridian 
 . 
 backend 
 . 
 Tensor 
 
 
 , 
 decay_functions 
 : 
 ( 
 str 
 | 
 Sequence 
 [ 
 str 
 ]) 
 = 
 constants 
 . 
 GEOMETRIC_DECAY 
 , 
 n_times_output 
 : 
 ( 
 int 
 | 
 None 
 ) 
 = 
 None 
 ) 
 -> 
   meridian 
 . 
 backend 
 . 
 Tensor

Transforms reach and frequency (RF) using Hill and Adstock functions.

Args

reach

Tensor of dimensions (n_geos, n_media_times, n_rf_channels) containing non-negative media for reach.

frequency

Tensor of dimensions (n_geos, n_media_times, n_rf_channels) containing non-negative media for frequency.

alpha

Uniform distribution for Adstock and Hill calculations.

ec

Shifted half-normal distribution for Adstock and Hill calculations.

slope

Deterministic distribution for Adstock and Hill calculations.

decay_functions

String or sequence of strings denoting the adstock decay function(s) for each channel. Default: 'geometric'.

n_times_output

Number of time periods to output. This argument is optional when the number of time periods in reach equals self.n_media_times , in which case n_times_output defaults to self.n_times .

Returns

Tensor with dimensions [..., n_geos, n_times, n_rf_channels] representing Hill and Adstock-transformed RF.

`calculate_beta_x`

View source

  calculate_beta_x 
 ( 
 is_non_media 
 : 
 bool 
 , 
 incremental_outcome_x 
 : 
   meridian 
 . 
 backend 
 . 
 Tensor 
 
 
 , 
 linear_predictor_counterfactual_difference 
 : 
   meridian 
 . 
 backend 
 . 
 Tensor 
 
 
 , 
 eta_x 
 : 
   meridian 
 . 
 backend 
 . 
 Tensor 
 
 
 , 
 beta_gx_dev 
 : 
   meridian 
 . 
 backend 
 . 
 Tensor 
 
 
 ) 
 -> 
   meridian 
 . 
 backend 
 . 
 Tensor

Calculates coefficient mean parameter for any treatment variable type.

The "beta_x" in the function name refers to the coefficient mean parameter of any treatment variable. The "x" can represent "m", "rf", "om", or "orf". This function can also be used to calculate "gamma_n" for any non-media treatments.

Args

is_non_media

Boolean indicating whether the treatment variable is a non-media treatment. This argument is used to determine whether the coefficient random effects are normal or log-normal. If True , then random effects are assumed to be normal. Otherwise, the distribution is inferred from self.media_effects_dist .

incremental_outcome_x

The incremental outcome of the treatment variable, which depends on the parameter values of a particular prior or posterior draw. The "_x" indicates that this is a tensor with length equal to the dimension of the treatment variable.

linear_predictor_counterfactual_difference

The difference between the treatment variable and its counterfactual on the linear predictor scale. "Linear predictor" refers to the quantity that is multiplied by the geo-level coefficient. For media variables, this is the output of the hill/adstock transformation function. For non-media treatments, this is simply the treatment variable after centering/scaling transformations. This tensor has dimensions for geo, time, and channel.

eta_x

The random effect standard deviation parameter values. For media variables, the "x" represents "m", "rf", "om", or "orf". For non-media treatments, this argument should be set to xi_n , which is analogous to "eta".

beta_gx_dev

The latent standard normal parameter values of the geo-level coefficients. For media variables, the "x" represents "m", "rf", "om", or "orf". For non-media treatments, this argument should be set to gamma_gn_dev , which is analogous to "beta_gx_dev".

Returns

The coefficient mean parameter of the treatment variable, which has dimension equal to the number of treatment channels..

`compute_non_media_treatments_baseline`

View source

  compute_non_media_treatments_baseline 
 ( 
 non_media_baseline_values 
 : 
 ( 
 Sequence 
 [ 
 str 
 | 
 float 
 ] 
 | 
 None 
 ) 
 = 
 None 
 ) 
 -> 
   meridian 
 . 
 backend 
 . 
 Tensor

Computes the baseline for each non-media treatment channel.

Args

non_media_baseline_values

Optional list of shape (n_non_media_channels,) . Each element is either a float (which means that the fixed value will be used as baseline for the given channel) or one of the strings "min" or "max" (which mean that the global minimum or maximum value will be used as baseline for the values of the given non_media treatment channel). If float values are provided, it is expected that they are scaled by population for the channels where model_spec.non_media_population_scaling_id is True . If None , the model_spec.non_media_baseline_values is used, which defaults to the minimum value for each non_media treatment channel.

Returns

A tensor of shape (n_non_media_channels,) containing the baseline values for each non-media treatment channel.

`create_inference_data_coords`

View source

  create_inference_data_coords 
 ( 
 n_chains 
 : 
 int 
 , 
 n_draws 
 : 
 int 
 ) 
 -> 
 Mapping 
 [ 
 str 
 , 
 np 
 . 
 ndarray 
 | 
 Sequence 
 [ 
 str 
 ]]

Creates data coordinates for inference data.

`create_inference_data_dims`

View source

  create_inference_data_dims 
 () 
 -> 
 Mapping 
 [ 
 str 
 , 
 Sequence 
 [ 
 str 
 ]]

`expand_selected_time_dims`

View source

  expand_selected_time_dims 
 ( 
 start_date 
 : 
   meridian 
 . 
 data 
 . 
 time_coordinates 
 . 
 Date 
 
 
 = 
 None 
 , 
 end_date 
 : 
   meridian 
 . 
 data 
 . 
 time_coordinates 
 . 
 Date 
 
 
 = 
 None 
 ) 
 -> 
 ( 
 list 
 [ 
 str 
 ] 
 | 
 None 
 )

Validates and returns time dimension values based on the selected times.

If both start_date and end_date are None, returns None. If specified, both start_date and end_date are inclusive, and must be present in the time coordinates of the input data.

Args

start_date

Start date of the selected time period. If None, implies the earliest time dimension value in the input data.

end_date

End date of the selected time period. If None, implies the latest time dimension value in the input data.

Returns

A list of time dimension values (as Meridian-formatted strings) in the input data within the selected time period, or do nothing and pass through None if both arguments are Nones, or if start_date and end_date correspond to the entire time range in the input data.

Raises

ValueError if start_date or end_date is not in the input data time dimensions.

`linear_predictor_counterfactual_difference_media`

View source

  linear_predictor_counterfactual_difference_media 
 ( 
 media_transformed 
 : 
   meridian 
 . 
 backend 
 . 
 Tensor 
 
 
 , 
 alpha_m 
 : 
   meridian 
 . 
 backend 
 . 
 Tensor 
 
 
 , 
 ec_m 
 : 
   meridian 
 . 
 backend 
 . 
 Tensor 
 
 
 , 
 slope_m 
 : 
   meridian 
 . 
 backend 
 . 
 Tensor 
 
 
 ) 
 -> 
   meridian 
 . 
 backend 
 . 
 Tensor

Calculates linear predictor counterfactual difference for non-RF media.

For non-RF media variables (paid or organic), this function calculates the linear predictor difference between the treatment variable and its counterfactual. "Linear predictor" refers to the output of the hill/adstock function, which is multiplied by the geo-level coefficient.

This function does the calculation efficiently by only calculating calling the hill/adstock function if the prior counterfactual is not all zeros.

Args

media_transformed

The output of the hill/adstock function for actual historical media data.

alpha_m

The adstock alpha parameter values.

ec_m

The adstock ec parameter values.

slope_m

The adstock hill slope parameter values.

Returns

The linear predictor difference between the treatment variable and its counterfactual.

`linear_predictor_counterfactual_difference_rf`

View source

  linear_predictor_counterfactual_difference_rf 
 ( 
 rf_transformed 
 : 
   meridian 
 . 
 backend 
 . 
 Tensor 
 
 
 , 
 alpha_rf 
 : 
   meridian 
 . 
 backend 
 . 
 Tensor 
 
 
 , 
 ec_rf 
 : 
   meridian 
 . 
 backend 
 . 
 Tensor 
 
 
 , 
 slope_rf 
 : 
   meridian 
 . 
 backend 
 . 
 Tensor 
 
 
 ) 
 -> 
   meridian 
 . 
 backend 
 . 
 Tensor

Calculates linear predictor counterfactual difference for RF media.

For RF media variables (paid or organic), this function calculates the linear predictor difference between the treatment variable and its counterfactual. "Linear predictor" refers to the output of the hill/adstock function, which is multiplied by the geo-level coefficient.

This function does the calculation efficiently by only calculating calling the hill/adstock function if the prior counterfactual is not all zeros.

Args

rf_transformed

The output of the hill/adstock function for actual historical media data.

alpha_rf

The adstock alpha parameter values.

ec_rf

The adstock ec parameter values.

slope_rf

The adstock hill slope parameter values.

Returns

The linear predictor difference between the treatment variable and its counterfactual.

`populate_cached_properties`

View source

  populate_cached_properties 
 ()

Eagerly activates all cached properties.

This is useful for creating a tf.function computation graph with this Meridian object as part of a captured closure. Within the computation graph, internal state mutations are problematic, and so this method freezes the object's states before the computation graph is created.

`sample_posterior`

View source

  sample_posterior 
 ( 
 n_chains 
 : 
 ( 
 Sequence 
 [ 
 int 
 ] 
 | 
 int 
 ), 
 n_adapt 
 : 
 int 
 , 
 n_burnin 
 : 
 int 
 , 
 n_keep 
 : 
 int 
 , 
 current_state 
 : 
 ( 
 Mapping 
 [ 
 str 
 , 
 backend 
 . 
 Tensor 
 ] 
 | 
 None 
 ) 
 = 
 None 
 , 
 init_step_size 
 : 
 ( 
 int 
 | 
 None 
 ) 
 = 
 None 
 , 
 dual_averaging_kwargs 
 : 
 ( 
 Mapping 
 [ 
 str 
 , 
 int 
 ] 
 | 
 None 
 ) 
 = 
 None 
 , 
 max_tree_depth 
 : 
 int 
 = 
 10 
 , 
 max_energy_diff 
 : 
 float 
 = 
 500.0 
 , 
 unrolled_leapfrog_steps 
 : 
 int 
 = 
 1 
 , 
 parallel_iterations 
 : 
 int 
 = 
 10 
 , 
 seed 
 : 
 ( 
 Sequence 
 [ 
 int 
 ] 
 | 
 int 
 | 
 None 
 ) 
 = 
 None 
 , 
 ** 
 pins 
 )

Runs Markov Chain Monte Carlo (MCMC) sampling of posterior distributions.

For more information about the arguments, see windowed_adaptive_nuts .

Drawn samples are merged into this model's Arviz inference_data property.

Args

n_chains

Number of MCMC chains. Given a sequence of integers, windowed_adaptive_nuts will be called once for each element. The n_chains argument of each windowed_adaptive_nuts call will be equal to the respective integer element. Using a list of integers, one can split the chains of a windowed_adaptive_nuts call into multiple calls with fewer chains per call. This can reduce memory usage. This might require an increased number of adaptation steps for convergence, as the optimization is occurring across fewer chains per sampling call.

n_adapt

Number of adaptation draws per chain.

n_burnin

Number of burn-in draws per chain. Burn-in draws occur after adaptation draws and before the kept draws.

n_keep

Integer number of draws per chain to keep for inference.

current_state

Optional structure of tensors at which to initialize sampling. Use the same shape and structure as model.experimental_pin(**pins).sample(n_chains) .

init_step_size

Optional integer determining where to initialize the step size for the leapfrog integrator. The structure must broadcast with current_state . For example, if the initial state is:

{ 'a':
tf.zeros(n_chains), 'b': tf.zeros([n_chains, n_features]), }

then any of 1. , {'a': 1., 'b': 1.} , or

{'a': tf.ones(n_chains), 'b':
tf.ones([n_chains, n_features])}

will work. Defaults to the dimension of the log density to the ¼ power.

dual_averaging_kwargs

Optional dict keyword arguments to pass to tfp.mcmc.DualAveragingStepSizeAdaptation . By default, a target_accept_prob of 0.85 is set, acceptance probabilities across chains are reduced using a harmonic mean, and the class defaults are used otherwise.

max_tree_depth

Maximum depth of the tree implicitly built by NUTS. The maximum number of leapfrog steps is bounded by 2**max_tree_depth , for example, the number of nodes in a binary tree max_tree_depth nodes deep. The default setting of 10 takes up to 1024 leapfrog steps.

max_energy_diff

Scalar threshold of energy differences at each leapfrog, divergence samples are defined as leapfrog steps that exceed this threshold. Default is 1000 .

unrolled_leapfrog_steps

The number of leapfrogs to unroll per tree expansion step. Applies a direct linear multiplier to the maximum trajectory length implied by max_tree_depth . Defaults is 1 .

parallel_iterations

Number of iterations allowed to run in parallel. Must be a positive integer. For more information, see tf.while_loop .

seed

An int32[2] Tensor or a Python list or tuple of 2 int s, which will be treated as stateless seeds; or a Python int or None , which will be treated as stateful seeds. See tfp.random.sanitize_seed .

**pins

These are used to condition the provided joint distribution, and are passed directly to joint_dist.experimental_pin(**pins) .

Throws

MCMCOOMError

If the model is out of memory. Try reducing n_keep or pass a list of integers as n_chains to sample chains serially. For more information, see ResourceExhaustedError when running Meridian.sample_posterior .

`sample_prior`

View source

  sample_prior 
 ( 
 n_draws 
 : 
 int 
 , 
 seed 
 : 
 ( 
 int 
 | 
 None 
 ) 
 = 
 None 
 )

Draws samples from the prior distributions.

Drawn samples are merged into this model's Arviz inference_data property.

Args

n_draws

Number of samples drawn from the prior distribution.

seed

Used to set the seed for reproducible results. For more information, see PRNGS and seeds .

meridian.model.model.Meridian Stay organized with collections Save and categorize content based on your preferences.

Attributes

Methods

adstock_hill_media

adstock_hill_rf

calculate_beta_x

compute_non_media_treatments_baseline

create_inference_data_coords

create_inference_data_dims

expand_selected_time_dims

linear_predictor_counterfactual_difference_media

linear_predictor_counterfactual_difference_rf

populate_cached_properties

sample_posterior

sample_prior

meridian.model.model.Meridian

`adstock_hill_media`

`adstock_hill_rf`

`calculate_beta_x`

`compute_non_media_treatments_baseline`

`create_inference_data_coords`

`create_inference_data_dims`

`expand_selected_time_dims`

`linear_predictor_counterfactual_difference_media`

`linear_predictor_counterfactual_difference_rf`

`populate_cached_properties`

`sample_posterior`

`sample_prior`