Deriving the Conditional PMF of the Pareto/NBD Model

Σχετικά έγγραφα
Additional Results for the Pareto/NBD Model

Deriving an Expression for P (X(t) = x) Under the Pareto/NBD Model

Section 8.3 Trigonometric Equations

Solutions to Exercise Sheet 5

CHAPTER 25 SOLVING EQUATIONS BY ITERATIVE METHODS

2 Composition. Invertible Mappings

Example Sheet 3 Solutions

Problem Set 3: Solutions

Math221: HW# 1 solutions

Solution Series 9. i=1 x i and i=1 x i.

Problem Set 9 Solutions. θ + 1. θ 2 + cotθ ( ) sinθ e iφ is an eigenfunction of the ˆ L 2 operator. / θ 2. φ 2. sin 2 θ φ 2. ( ) = e iφ. = e iφ cosθ.

derivation of the Laplacian from rectangular to spherical coordinates

Homework 3 Solutions

forms This gives Remark 1. How to remember the above formulas: Substituting these into the equation we obtain with

Homework 8 Model Solution Section

Matrices and Determinants

ST5224: Advanced Statistical Theory II

Phys460.nb Solution for the t-dependent Schrodinger s equation How did we find the solution? (not required)

Jesse Maassen and Mark Lundstrom Purdue University November 25, 2013

The Simply Typed Lambda Calculus

HOMEWORK 4 = G. In order to plot the stress versus the stretch we define a normalized stretch:

SOLUTIONS TO MATH38181 EXTREME VALUES AND FINANCIAL RISK EXAM

4.6 Autoregressive Moving Average Model ARMA(1,1)

C.S. 430 Assignment 6, Sample Solutions

Other Test Constructions: Likelihood Ratio & Bayes Tests

Concrete Mathematics Exercises from 30 September 2016

Second Order Partial Differential Equations

Απόκριση σε Μοναδιαία Ωστική Δύναμη (Unit Impulse) Απόκριση σε Δυνάμεις Αυθαίρετα Μεταβαλλόμενες με το Χρόνο. Απόστολος Σ.

Chapter 6: Systems of Linear Differential. be continuous functions on the interval

Areas and Lengths in Polar Coordinates

A Note on Deriving the Pareto/NBD Model and Related Expressions

Inverse trigonometric functions & General Solution of Trigonometric Equations

DESIGN OF MACHINERY SOLUTION MANUAL h in h 4 0.

EE512: Error Control Coding

6.3 Forecasting ARMA processes

3.4 SUM AND DIFFERENCE FORMULAS. NOTE: cos(α+β) cos α + cos β cos(α-β) cos α -cos β

Statistical Inference I Locally most powerful tests

6.1. Dirac Equation. Hamiltonian. Dirac Eq.

Areas and Lengths in Polar Coordinates

Exercises 10. Find a fundamental matrix of the given system of equations. Also find the fundamental matrix Φ(t) satisfying Φ(0) = I. 1.

Bayesian statistics. DS GA 1002 Probability and Statistics for Data Science.

SCHOOL OF MATHEMATICAL SCIENCES G11LMA Linear Mathematics Examination Solutions

Chapter 6: Systems of Linear Differential. be continuous functions on the interval

CHAPTER 48 APPLICATIONS OF MATRICES AND DETERMINANTS

Approximation of distance between locations on earth given by latitude and longitude

PARTIAL NOTES for 6.1 Trigonometric Identities

SOLUTIONS TO MATH38181 EXTREME VALUES AND FINANCIAL RISK EXAM

Practice Exam 2. Conceptual Questions. 1. State a Basic identity and then verify it. (a) Identity: Solution: One identity is csc(θ) = 1

Section 9.2 Polar Equations and Graphs

Section 7.6 Double and Half Angle Formulas

Econ 2110: Fall 2008 Suggested Solutions to Problem Set 8 questions or comments to Dan Fetter 1

Srednicki Chapter 55

Second Order RLC Filters

b. Use the parametrization from (a) to compute the area of S a as S a ds. Be sure to substitute for ds!

Exercises to Statistics of Material Fatigue No. 5

Fourier Series. MATH 211, Calculus II. J. Robert Buchanan. Spring Department of Mathematics

A Note on Intuitionistic Fuzzy. Equivalence Relation

Uniform Convergence of Fourier Series Michael Taylor

Every set of first-order formulas is equivalent to an independent set

Higher Derivative Gravity Theories

( y) Partial Differential Equations

D Alembert s Solution to the Wave Equation

Lecture 2: Dirac notation and a review of linear algebra Read Sakurai chapter 1, Baym chatper 3

SCITECH Volume 13, Issue 2 RESEARCH ORGANISATION Published online: March 29, 2018

Trigonometry 1.TRIGONOMETRIC RATIOS

A Two-Sided Laplace Inversion Algorithm with Computable Error Bounds and Its Applications in Financial Engineering

Quadratic Expressions

Appendix to On the stability of a compressible axisymmetric rotating flow in a pipe. By Z. Rusak & J. H. Lee

An Introduction to Signal Detection and Estimation - Second Edition Chapter II: Selected Solutions

Reminders: linear functions

( ) 2 and compare to M.

5.4 The Poisson Distribution.

Homomorphism in Intuitionistic Fuzzy Automata

Partial Differential Equations in Biology The boundary element method. March 26, 2013

HOMEWORK#1. t E(x) = 1 λ = (b) Find the median lifetime of a randomly selected light bulb. Answer:

ANSWERSHEET (TOPIC = DIFFERENTIAL CALCULUS) COLLECTION #2. h 0 h h 0 h h 0 ( ) g k = g 0 + g 1 + g g 2009 =?

1 String with massive end-points

Testing for Indeterminacy: An Application to U.S. Monetary Policy. Technical Appendix

CRASH COURSE IN PRECALCULUS

Ordinal Arithmetic: Addition, Multiplication, Exponentiation and Limit

Derivation of Optical-Bloch Equations

Mock Exam 7. 1 Hong Kong Educational Publishing Company. Section A 1. Reference: HKDSE Math M Q2 (a) (1 + kx) n 1M + 1A = (1) =

Tridiagonal matrices. Gérard MEURANT. October, 2008

MATH423 String Theory Solutions 4. = 0 τ = f(s). (1) dτ ds = dxµ dτ f (s) (2) dτ 2 [f (s)] 2 + dxµ. dτ f (s) (3)

6. MAXIMUM LIKELIHOOD ESTIMATION

k A = [k, k]( )[a 1, a 2 ] = [ka 1,ka 2 ] 4For the division of two intervals of confidence in R +

Differential equations

Finite Field Problems: Solutions

Μηχανική Μάθηση Hypothesis Testing

상대론적고에너지중이온충돌에서 제트입자와관련된제동복사 박가영 인하대학교 윤진희교수님, 권민정교수님

SOLVING CUBICS AND QUARTICS BY RADICALS

CHAPTER 101 FOURIER SERIES FOR PERIODIC FUNCTIONS OF PERIOD

Η ΨΥΧΙΑΤΡΙΚΗ - ΨΥΧΟΛΟΓΙΚΗ ΠΡΑΓΜΑΤΟΓΝΩΜΟΣΥΝΗ ΣΤΗΝ ΠΟΙΝΙΚΗ ΔΙΚΗ

A summation formula ramified with hypergeometric function and involving recurrence relation

Dynamic types, Lambda calculus machines Section and Practice Problems Apr 21 22, 2016

2. THEORY OF EQUATIONS. PREVIOUS EAMCET Bits.

HW 3 Solutions 1. a) I use the auto.arima R function to search over models using AIC and decide on an ARMA(3,1)

SPECIAL FUNCTIONS and POLYNOMIALS

EE101: Resonance in RLC circuits

Lecture 2. Soundness and completeness of propositional logic

Transcript:

Deriving the Conditional PMF of the Pareto/NBD Model Peter S. Fader www.petefader.com Bruce G. S. Hardie www.brucehardie.com January 24 Abstract This note presents the derivation of an expression for the Pareto/NBD model conditional PMF, PXT, T t = x x, t x, T. Preliminaries Recall the basic Pareto/NBD model results Fader and Hardie 25: i The individual-level likelihood function for someone with purchase history x, t x, T is Lλ, µ x, t x, T = λ x e λµt λx µ λ µ e λµtx λx µ λ µ e λµt = λx µ λ µ e λµtx λx λ µ e λµt. 2 ii The likelihood function for a randomly chosen individual with purchase history x, t x, T is Lr, α, s, β x, t x, T = Γr xαr β s { s r x A A 2 3 Γr where A = 2F, s ; ; αt x α t x rsx if α β 2F, r x; ; βt x β t x rsx if α < β c 24 Peter S. Fader and Bruce G.S. Hardie. This document can be found at <http://brucehardie.com/notes/28/>.

and A 2 = 2F, s; ; αt α T rsx if α β 2F, r x ; ; βt β T rsx if α < β iii The joint posterior distribution of Λ and M is where gλ, µ r,α,s, β;x,t x, T = Lλ, µ x, t x, Tgλ r, αgµ s,β, 4 Lr, α, s, β x, t x, T gλ r,α = αr λ r e λα Γr is the gamma distribution that captures heterogeneity in transaction rates across customers, and gµ s, β = βs µ s e µβ 6 Γs is the gamma distribution that captures heterogeneity in death rates across customers. iv The probability that a customer with purchase history x, t x, T is alive at time T is the probability that the unobserved time at which he dies ω occurs after T. Conditional on λ and µ, this is PΩ > T λ, µ; x, t x, T = 5 λx e λµt Lλ, µ x,t x, T. 7 It follows from that the probability that a customer with purchase history x, t x, T is dead at time T is PΩ T λ, µ, x, t x, T = Removing the conditioning on λ and µ gives us λ x µ λ µ e λµtx λx µ λ µ e λµt Lλ, µ x,t x, T. 8 PΩ > T r, α, s,β; x,t x, T Γr xα r β s / = Γrα T rx β T s Lr, α, s, β x, t x, T. 9 As we proceed with the derivation, we will need evaluate a double integral of the following form several times: A = λ a µ b λ µ ce λµd gλ r,αgµ s,β dλdµ. Consider the transformation Y = M/Λ M and Z = Λ M. Noting that the Jacobian of this transformation is λ λ y z J = µ µ = z, y 2 z

it follows from the standard transformation of random variables method Casella and Berger 22, Section 4.3, pp. 56 62; Mood et al. 974, Section 6.2, p. 24ff that the joint distribution of Y and Z is gy, z α, β, r, s = We solve in the following manner: A = = αr β s ΓrΓs = αr β s ΓrΓs αr β s ΓrΓs ys y r z rs e zα y. y b y a z ab c e zd gy, z α, β, r,s dzdy which for r s a b > c y sb y ra z rsab c e zαd y dz dy y sb y ra { = αr β s Γr s a b c ΓrΓs α d rsab c z rsab c e zαd y dz dy y sb y ra [ αd y ] rsab c dz which, recalling Euler s integral for the Gaussian hypergeometric function, = αr β s Γr s a b c ΓrΓs α d rsab c Br a, s b 2 F r s a b c, s b; r s a b; αd. Looking closely at, we see that the argument z of the Gaussian hypergeometric function, αd, is guaranteed to be bounded between and when α β since β >, thus ensuring convergence of the series representation of the function. However, when α < β we can be faced with the situation where αd <, in which case the series is divergent. Applying the linear transformation Abramowitz and Stegun 972, equation 5.3.4 gives us A = 2F a, b; c; z = z a 2F a, c b; c; z z αr β s Γr s a b c ΓrΓs β d rsab c Br a, s b 2 F r s a b c, r a; r s a b; βd. 2 We note that the argument z of the above Gaussian hypergeometric function is bounded between and when α < β. We therefore present and 2 as solutions to : we use when α β and 2 when α < β. 2F a, b; c; z = R Bb,c b tb t c b zt a dt, c > b. 3

2 Derivation Suppose we know an individual s unobserved latent characteristics λ and µ. If the customer is alive at T i.e., ω > T, it follows from the derivation of PXt = x λ, µ Fader and Hardie 26 that When x >, PXT, T t = x λ, µ, ω > T = λtx e λµt x! PXT, T t = x λ, µ; x,t x, T λ x µ x [ e λµt λ µ x i= [ λ µt ] i = PXT, T t = x λ, µ, ω > TPΩ > T λ, µ; x,t x, T. i! ]. 3 This also holds when x =. However, in this second case we also have to account for the possibly that no purchases are observed in T, T t] because the customer died at or before T. Therefore, PXT, T t = x [ λ, µ, x, t x, T = δ x = PΩ > T λ, µ; x, tx, T ] PXT, T t = x λ, µ, ω > TPΩ > T λ, µ; x, t x, T. 4 Therefore, PXT, T t = x λ, µ; x, t x, T = δ x = { λx e λµt { t x Lλ, µ x,t x, T x! λxx e λµtt x i= t i i! Lλ, µ x, t x, T λxx µ λ µ x e λµt λ xx µ λ µ x i e λµtt. 5 We remove the conditioning on λ and µ by taking the expectation of this over the joint posterior distribution of Λ and M, 4, giving us PXT, T t = x { r, α, s, β;x,t x, T = δ x = PΩ > T r, α, s, β;x,tx, T Lr, α, s, β x, t x, T { t x x! B B 2 x i= t i i! B 3i 6 where B = λ xx e λµtt gλ r,αgµ s,β dλdµ { { = λ xx e λtt gλ r,αdλ e µtt gµ s, β dµ = Γr x x Γr α r β s α T t rxx β T t s 7 4

B 2 = B 3i = λ xx µ λ µ x e λµt gλ r,αgµ s, β dλdµ = αr β s ΓrΓs ΓBr x x, s 2F, s ; x ; αt α T rsx if α β 2F, r x x ; x ; βt β T rsx if α < β λ xx µ λ µ x i e λµtt gλ r, αgµ s,β dλdµ = αr β s ΓrΓs Γ ibr x x, s 2F i, s ; x ; α T t rsxi if α β 2F i, r x x ; x ; β T t rsxi if α < β 8 9 3 The Special Case of x = We can derive a simpler expression for the special case of x = i.e., no purchasing in the interval T, T t]. Conditional on λ and µ, there are three ways in which a customer with purchase history x, t x, T could make no purchases in the interval T, T t]: The customer is dead at T, which occurs with probability PΩ T λ, µ, x,t x, T, 8, or The customer is alive at T, which occurs with probability PΩ > T λ, µ, x,t x, T, 7, and he remains alive through the interval T, T t] with probability e µt and makes no purchases in that interval the Poisson probability of which is e λt, or he dies at time ω in the interval T, Tt] and makes no purchases in the interval T, ω]: Tt Combining terms and simplifying gives us T t e λω T µe µω T dω = µ e λµs ds = µ e λµt. λ µ PXT, T t = λ, µ; x,t x, T = Lλ, µ x, t x, T { λ x µ λ µ e λµtx λx λ µ e λµtt. 2 5

Taking the expectation of this over the joint posterior distribution of λ and µ, 4, it follows from 2 that where PXT, T t = r, α,s, β; x,t x, T = Γr xαr β s A 3 = { Lr, α, s, β x, t x, T Given 3, this can be rewritten as s Γr A r x A 3, 2 2F, s; ; α T t rsx if α β 2F, r x ; ; β T t rsx if α < β PXT, T t = r, α,s, β; x,t x, T = sa r xa 3 sa r xa 2. 22 Setting x to in 6 does not give us 2. Are these two equations equivalent? We first note that evaluating 5 at x = and using 8 as the expression for PΩ > T λ, µ; x,t x, T and simplifying gives us 2. Therefore 2 must be equivalent to 6 at x =. To prove this, we first note that leads to an alternative expression for the Pareto/NBD likelihood function Fader and Hardie 25: Lr, α, s, β x, t x, T = Γr xαr β s { where for α β while for α < β Lr, α, s, β x, t x, T Γr α T rx β T s A = 2 F, s ; ; αt x α t x rsx 2 F, s ; ; αt α T rsx A = 2 F, r x; ; βt x β t x rsx Γr x Γr s 2 F, r x; ; βt β T rsx. α r β s A, 23 Let us first consider the case of α β. Substituting 9 and 23 in 6 for x = gives us { s 2 F, s ; ; αt x α T t rx β T t s s 2F α t x rsx, s ; ; α T t rsx. 6

Equivalence to 2 implies α T t s s 2 F, s ; ; β T t = r x 2 F, s; ;. Noting that 2 F a, b; b;z = z a, 2F, s; ; α T t s =, β T t equivalence implies r x 2 F, s; ; s 2 F, s ; ; 2 F, s; ; =. 24 One of the so-called Gauss relations for contiguous functions states Abramowitz and Stegun 972, equation 5.2.24 is c b 2 F a, b; c; z b 2 F a, b ; c; z c 2 F a, b; c ; z =. 25 Therefore 24 is true, which means 6 evaluated at x = and 2 are equivalent when α β. Turning to the case of α < β, substituting 9 and 23 in 6 for x = gives us Lr, α, s, β x, t x, T Γr x Γr α r β s α T t rx β T t s s 2F { s 2 F, r x; ; βt x β t x rsx, r x; ; β T t rsx. Equivalence to 2 implies β T t rx s 2 F, r x; ; α T t = r x 2 F, r x ; ;. Using the result 2 F a, b; b;z = z a, we get 2F, r x; ; β T t rx =, α T t meaning equivalence implies s 2 F, r x; ; r x 2 F, r x ; ; 2 F, r x; ; =, 26 which we see from 25 is true. Therefore 6 evaluated at x = and 2 are also equivalent when α < β. 7

References Abramowitz, Milton and Irene A. Stegun eds. 972, Handbook of Mathematical Functions, New York: Dover Publications. Fader, Peter S. and Bruce G.S. Hardie 25, A Note on Deriving the Pareto/NBD Model and Related Expressions, <http://brucehardie.com/notes/9/>. Fader, Peter S. and Bruce G.S. Hardie 26, Deriving an Expression for PXt = x Under the Pareto/NBD Model, <http://brucehardie.com/notes/2/>. 8