Slate, EBay: Are Organic Search Results 100% Effective Substitutes for SEM?

This is, of course, a standard Free Rider question:  can we just dispense with paying Search Engines money and live off the Organics?

The question has surfaced again at Slate, in this article profiling E-Bay’s claim that pausing their paid search campaign has resulted in no revenue loss:

http://www.slate.com/articles/business/harvard_business_review/2013/03/paid_search_ads_did_ebay_just_prove_they_don_t_work.html

Companies spend enormous sums on marketing their products. Yet it’s notoriously difficult to measure the impact of ad expenditures. Companies advertise heavily at times when they hope to sell a lot—like Christmas Eve and Boxing Day—and in areas where they expect to see their sales grow. So a naïve examination of the relationship between ad expenditures and revenues will of course find they move in sync, even if customers don’t pay the ads any mind.

Advertising has also traditionally produced a lot of waste—I see ads for Brioni suits when I open up the morning paper, even though the last time I wore a suit was on my wedding day. The study’s authors quote 19th-century retailer John Wannamaker: “I know half the money I spend on advertising is wasted, but I can never find out which half”

But what do companies actually get for the billions they now spend on search marketing? The eBay team began by examining whether there’s any benefit to buying search ads that contain the word “ebay.” In these cases, it’s possible that in the absence of paid listings, customers would simply click on the unpaid—or “natural”—listing, which would appear at the top of the search anyway.So in March 2012, eBay conducted a controlled trial to see what would happen if they shut off this “branded keyword advertising” by halting their purchases of search ads containing the word “ebay” on Microsoft and Yahoo search engines, while continuing to purchase search ads on Google as a control. There was no change in eBay sales via Yahoo and Bing, relative to those that came through Google—consumers simply substituted clicks on the unpaid search listing for the now-absent paid ones.

 

Search Engine Land has picked up the story

AdWords “Ineffective” Says eBay, Google “Meta-Pause Analysis” Contradicts Those Findings

Google: Ads Offer Incremental Traffic not Replaced by Organic

Not surprisingly Google has research (.pdf) that says the exact opposite of what eBay found.

In early 2012 Google published the results of a “meta-analysis” of “six months of Search Ads Pause studies” where advertisers had reduced AdWords spending “at least 95 percent.” According to Google, “these amounted to 390 studies between April, 2011 and October, 2011.”

These studies were conducted in the US, UK, France and Germany. They looked broadly at search marketing and not just AdWords.

The conclusion of that analysis was that SEM offered a major lift to advertisers and that organic rankings and traffic did not compensate when search campaigns were paused:

 

[O]n average, 81% of ad impressions and 66% of ad clicks occur without an associated organic result . . . On average, 50% of the ad clicks that occurred with a top rank organic result are incremental, i.e., they would not be recovered organically if the ad campaign is paused. For ad clicks with an associated organic result in rank 2 – ­5, on average, 82% of the ad clicks are incremental. Finally, for ad clicks with an associated organic result in rank 5­ – n, on average, 96% of the ad clicks are incremental.How can this meta-analysis of “pause studies” be reconciled with eBay’s research? Wordstream’s Larry Kim has a theory: eBay ad creative, bidding and keyword practices are poor. He actually used a much stronger word.

 

Yes, the paper cited is the one we extensively analyzed when it came out (see below).

From the conclusion of the first blog in the series:

That paper gives great insight into how Google is calculating what Hal Varian terms VPC (value per click).  The very last equation of the paper informs us that advertisers halt their ad campaigns unless:

 

(v – c) / v > r*(1-IAC)

 

where IAC is estimated in the paper, by Bayesian methods (Gibbs Sampling and Slice Sampling), to be about 0.89 (mean expectation).  The LHS isn’t exactly innocent:  rewriting it as 1 – c/v, it is a function of the fraction of marginal revenue Google gets to keep.  That is, if a click has marginal value to an advertiser of v and Google gets to keep marginal CPC c, then c/v is Google’s (percentage) take.

 

A few worked examples will make the point: we can re-arrange c/v < 1 – r*(1-IAC) ['Google's take will be no more than the RHS, and with proper tuning can be that amount without causing an advertiser to pause a campaign, behaviourally]. ‘r‘ is the relative (conversion) value of an organic click to a paid search click.  Since IAC is *estimated* in the article, we can eliminate it from the equation by substitution, and for any value of r, learn Google’s percent take.  For example, if r = 1, then Google keeps about 90%.  If r is only 0.1 however, Google could keep 99%.  When r is as much as 10, they keep nothing at all.

Nice Article on the Bing’s Version of Mechanical Turk

A web page that definitively satisfies a searcher’s intent is “Perfect,” and should appear at the top of Bing’s search results. On the other end of the scale, spammy web pages and pages that almost no searcher would find useful are deemed “Bad.”

That’s a bit of how Bing instructs the people in its Human Relevance System (HRS) project to grade web pages. It’s explained in a 52-page document that Bing calls the “HRS Judging Guidelines.”

http://searchengineland.com/bing-search-quality-rating-guidelines-130592

Actually, Adcenter has a similar system to HRS, which is very nicely described in this presentation by Ty Liu (note: slow link to China):

http://tcci.ccf.org.cn/summersch/classm/Introduction-to-Online-Advertising.pdf [PDF!]

In fact, the Ad Center equivalent of the HRS system, as it applies to Ad Landing pages, is described in this conference proceeding hosted by him, and with a paper :

http://research.microsoft.com/en-us/um/beijing/events/mload-2010/mload2010.pdf [PDF!]

Look for the paper by Yih and Jiang, entitled ‘Similarity Models for Ad Relevance Measures’.

 

 

Google’s Point Kinematics Model of the Marketplace

Continuing my analysis from the last blog.  The term, ‘point kinematics’ is mine — it reminds me of models of nuclear reactors.  In the simplest model of a reactor, the various control parameters are elaborated in a way that makes it clear how ‘inputs’ and ‘outputs’ and various ‘efficiencies’ or ratios all work together.  This is called kinematics because it describes how the reactor operates.  Dynamics describes how the control variables evolve in time.  The simplest kinematic model is a point model if it doesn’t elaborate any structure — where the control rods and fuel rods are located relative to each other.

The Google researchers, Chan, Yuan, Koehler, and Kumar (CYKK) casually employ a rather complex and non-linear model, in their Bayesian analysis, without reference or further explanation.  The model relates organic and paid clicks, in an advertising campaign, to total spend on that campaign, S, and total page impressions received, I:

O.clicks = (I + alpha1)*[kappa1 + (kappa2 - kappa1)*exp(- beta1*S/I)]

P.clicks = beta0*(I+alpha2)*[1 - exp(- beta2*S/I)]

This is the model that the authors estimate using Bayesian techniques, meaning they determine for their data of Clicks, Spend, and Search Impressions, the seven parameters, beta0, beta1, beta2, kappa1, kappa2, and alpha1 and alpha2, are assigned distributions — which means we don’t get told what those distributions look like nor even what the expectation values of those parameters are.  In a normal ‘Frequentist’ (non-Bayesian) model, we would get estimates of the parameters that are a single number, not a distribution.

Be that as it may, there is a good reason we are not told the expected value of the parameters — they would be very revealing of how the Google paid search marketplace scales.  The model estimated succinctly describes the response function (Clicks), as a function of two stimuli — how advertisers spend, and how often users choose to visit Google in a way that causes them to view either an organic or paid listing for that advertiser.  The model appears to be over the space of *all* advertisers, who receive different shares of clicks, impressions, and have different spends.  Let’s see if we can give an interpretation to the functional form of the model.

S/I (spend per search page impression), is an important metric to Google, close enough to RPM (revenue per mille).  Using the physicist’s approach of dimensional analysis, and noting that the argument of an exponential must be dimensionless, the coefficients beta1 and beta2 have dimensions of reciprocal RPM.  Another way of saying this is that we can rewrite beta1*S/I as R/R1, where R is RPM share of that data point (ad campaign), and R1 is a scaling factor, R1 = 1/beta1.  That is, Google’s RPM has a natural scaling length, and that is the meaning if beta1 (and beta2).  It is interesting that there are two different scale lengths, one for paid search and one for organic listings.

The role of the exponentials is to introduce a sigmoidal, or S-shaped, ‘switch’ between the two limiting cases of no ad spend, and infinite (saturated) ad spend.  In the ‘zero spend’ limit, for example, the complex factor in the first equation simplifies the RHS to (I + alpha1)*kappa2 — kappa1 drops out and the exponential has value unity. Paid search clicks goes to zero in this limiting case — of course, since there are no ads to click on!

The simple form of

O.clicks = (I + alpha1)*kappa2

suggests that kappa2 (and kappa1) have dimensions of Clicks per Search Impressions, that is, they are click yields.  In fact kappa2 is the organic click yield in the limit there are no ad impressions, and kappa1 is limiting click yield in which ad impressions are saturated.  kappa2 – kappa1 is the delta (‘drop in click yield due to ads cannibalising organic clicks’) — in the limit of a saturated ad campaign.

Likewise, beta0 in the P.clicks equation is the saturation (high page impression) limit of click yield for clicks on paid search ads.

What about alpha1 and alpha2?  A physicist would probably think of those as virial coefficients in a virial expansion — a correction to a product relation, like the ideal gas law.  The limiting, or ad-saturated, equations are ‘linear’, but adding in an extra degree of freedom, an intercept, likely helps the fit.

In effect, what we have here is a simple two ‘feature’ click prediction model for all of Google advertising. (!)  If you know your spend and search page impressions (and you do), then you can predict your clicks.  Would that it were so easy!  Of course, that is what the Bayesian methods are about — to make it so that those equations have a bit of give, and allow different ad campaigns to ‘be distributed’ the way they do end up being — different.

In fact, there is more information to be wrung from the two equations — more about Google than individual ad campaigns, though perhaps interesting if you have to scale campaigns from the size of a small pilot project to a large spend.  Notice that neither equation presents the response variables (organic and paid search) clicks, as simple linear responses to search page impressions.  Stimulus-Response scaling laws have a long history in neurobiology, where a linear scaling law for light adaptation in the eye was proposed by Weber, and a square root law proposed by de Vries-Rose.  Transition between a non-linear and linear regime of response to a visual stimulus (such as a search page) is very common in nature.

To get a feel for this, let’s explore the ‘low RPM’ limit of the above equation, and temporarily hold ad spend, S, fixed, while varying search page impressions, I, and observing paid ad clicks (we will concentrate on the second equation now).  Expanding the exponential exp ( – beta2*S/I) = (1 – beta2*S/I) we obtain:

P.clicks = beta0*( Imp + alpha2)*(beta2*S/Imp) = beta0*beta2*S*( 1 + alpha2/Imp )

I spelled out Imp her so you could see it. ;)   The product beta0*beta2 now has dimensions of click yield per RPM (!) and when multiplied by Spend gives clicks.  Alpha2 has to have the same dimensions as search page impressions.  If, in addition to our ‘low RPM’ limit that we used to expand the exponential, we take a ‘high traffic’ limit, large Imp, we find the alpha2/Imp term ‘decays away’ at the 1/Impressions power.  That is, it is some sort of ‘finite impressions’ correction that goes way in the ‘big data’ limit of lots of searches.  In that limit, Google says it gets linear scaling of clicks.

That is the half-full way to look at things.  The half-empty way is to walk in the other direction, and say the ‘empirical marginal click yield’ (d P.clicks / d Imp) does not equal the ‘empirical average click yield’ ( P.clicks / Imp).  However, what an economist would call ‘unit elasticity’ and ‘constant return to scale’ (CRTS), is approached, in the *high traffic* limit (for a given spend).

But wait, there’s more!  We have been exploring a ‘linearised’ version of the epxonential.  What happens if we keep the quadratic term in the expansion, exp ( – beta2*S/I) = 1 – beta2*S/I  + 0.5*(beta2*S/I)^2 + … ?  Let’s rewrite the argument as -I2/Imp, where we interpret I2 = beta2*S as a scaling factor for Impressions, rather than beta2 along for revenue per search:

P.clicks = beta0*(Imp + alpha2)*[ I2/Imp - 0.5*(I2/Imp)^2 ]

Here we seen the next order correction to ‘diminishing returns’ implicit in the rather rich set of equations the Google researchers chose.

Incremental Clicks Impact Of Search Advertising

SearchEngineLand has a very interesting post on how Google researchers have attempted to measure the economic tradeoff of organic listings and paid search advertisements.  The real jewel of the article is the link to this paper (PDF) however.

That paper gives great insight into how Google is calculating what Hal Varian terms VPC (value per click).  The very last equation of the paper informs us that advertisers halt their ad campaigns unless:

(v – c) / v > r*(1-IAC)

where IAC is estimated in the paper, by Bayesian methods (Gibbs Sampling and Slice Sampling), to be about 0.89 (mean expectation).  The LHS isn’t exactly innocent:  rewriting it as 1 – c/v, it is a function of the fraction of marginal revenue Google gets to keep.  That is, if a click has marginal value to an advertiser of v and Google gets to keep marginal CPC c, then c/v is Google’s (percentage) take.

A few worked examples will make the point: we can re-arrange c/v < 1 – r*(1-IAC) ['Google's take will be no more than the RHS, and with proper tuning can be that amount without causing an advertiser to pause a campaign, behaviourally]. ‘r‘ is the relative (conversion) value of an organic click to a paid search click.  Since IAC is *estimated* in the article, we can eliminate it from the equation by substitution, and for any value of r, learn Google’s percent take.  For example, if r = 1, then Google keeps about 90%.  If r is only 0.1 however, Google could keep 99%.  When r is as much as 10, they keep nothing at all.

Thus, Google’s interest in mining its logs to estimate IAC is quite understandable!  They operate a bazaar, and charge rent on the booths.  They can count the number of *transactions* that occur in their mart (up to uncertainty about how many clicks result in actual conversions), but they cannot see the amount of money that changes hands.  If they could learn what their customers are making (marginal VPC), they could price the stalls in the mart so that they receive an optimal fraction of the money changing hands there.  Hence, the extreme inetrest in researching the two variables that determine that — r, the marginal rate of substitution of an organic click’s value for a paid ad’s one, and 1 – IAC, the ‘cannibalism factor.’

Of course, telling your advertisers that IAC is high (and 1 – IAC 10%-ish, though quite skewed to higher values) helps retain their business.  That, however, is not Google’s only motive here.  VPC may be unknowable — but what is revealed here is that Google thinks the (maximal) CPC / VPC ratio is, in fact, only determined by two factors: r and IAC.

They have estimated the latter — IAC — and revealed its median value is very high (98%) and because of skew its expected value quite a bit less (90%).  Naturally, the marginal rate of substitution will vary from advertiser to advertiser.  I would guess most clicks leading to an advertiser’s site have similar value, though organic clicks may be worth less, since they will contain information queries that may not ‘convert’ as well as someone clicking on an ad.

In any event, Google has told us about their two key revenue drivers, and how they relate to optimising their business.  I suppose the next order of business, then, is to understand what factors affect the one they *haven’t* told us about quantitatively.  Clearly, the relative importance of organic listings to paid search will depend on the relevance and quality of those listings.  The interesting point we see here, is how Google conceives of the tradeoff of revenue and those relevance factors — namely, in terms of marginal rate of substitution between competing pathways to a website obtaining a click.

 

News Roundup: 3/26/2012

‘The Free Lunch is over’ (not exactly news… just a nice article on the implications of concurrency)

http://www.gotw.ca/publications/concurrency-ddj.htm

 

Access the InfoChimps API from R

http://blog.revolutionanalytics.com/2010/11/access-the-infochimps-api-from-r.html

InfoChimps.com is mainly known as a clearinghouse for finding large data sets, for free or for sale. But they have also released (in beta, at least) an API that lets you find some pretty useful information on-demand. Normally, you’d have you use RESTful calls to access the API, but now Drew Conway has created an R package (and released gist sources) that lets you query the API using simple R commands. With the infochimps package, you can:

Information, Physics, and Computation…

… is a textbook on just those subjects (and the topic of this blog) by Marc Mezard and Andrea Montanari.

http://www.amazon.com/Information-Physics-Computation-Oxford-Graduate/dp/019857083X/

The start of their preface makes a fine introduction to this topic:

Over the last few years, several research areas have witnessed important progress through the unexpected collaboration of statistical physicists, computer scientists, and information theorists.  The dialogue between scientific disciplines has not been without difficulties, as each field has its own objectives and rules of behaviour.  Nonetheless, there is increasing consensus that a common ground exists and that it can be fruitful.  This book aims at making this common ground more widely accessible, through a unified approach to a selection of important problems that have benefited from this convergence.

Indeed, the points of intersection between Statistical Physics and Computation Theory, discovered in the last 5 years or so, are both surprising and indeed shocking.  It is the purpose of this blog, as well, to elucidate the common ground between these subjects, and to make the growing consensus concerting the unity of the topics discussed more widely available.  Above all, we will be seeing how to *apply* the ‘unified approach’ mentioned in the text cited to problems in Online Advertising.