The content’s there but the data often isn’t

Neil Heyside’s Oct. 18 story on PBS MediaShift about how newspapers should analyze their content by source type – staff-produced, wire/syndicated or free from citizen journalists – got me thinking about other ways content should be analyzed to craft audience-driven hyper-local and paid-content strategies.


Most news sites have navigation that mimics traditional media products – News, Sports, Business, Opinion, and Entertainment.   However, those types of broad titles don’t work well with digital content because people consume news and information in bits and pieces rather than in nicely packaged 56-page print products and 30-minute TV programs.


Each chunk, each piece of content – story, photo, video, audio, whatever – should be tagged or classified with a geographic area and a niche topic so a news org can determine how much content it has for each of its highest priority audience segments – and how much traffic each type of content is getting.

By geographic area I mean hyper-local.   East Cyberton, not Cyberton.  Maybe even more hyper – East Cyberton north of Main Street, for example, or wherever there’s a distinct audience segment that has different characteristics and thus different news needs and news consumption patterns.


Similarly, news orgs need hyper-topic codes, especially for hyper-local topics.  The Cyberton community orchestra – not Classical Music, Music, or Entertainment.  If a news org is looking at web traffic data for “Music” it should know whether that traffic is for rock music or classical, and whether the content was about a local, regional, national or international group.


Oh, and there’s one more aspect to this hyper-coding.  Content should be coded across the site.  Ruthlessly.  For example, to really understand whether it needs to add or cut coverage in East Cyberton, a news org needs to add up those East Cyberton stories in Local News, plus those East Cyberton Main St. retail development stories in Business, and those editorials and op-eds in Opinion about how ineffective the East Cyberton neighborhood council is, and….
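
If you want to see how simple this rollup really is, here's a minimal sketch in TypeScript – the item shape and the sample stories are invented for illustration, not a real CMS schema:

```typescript
interface ContentItem {
  section: string; // the legacy section the story ran under
  geo: string;     // hyper-local geographic code
  topic: string;   // hyper-topic code
}

// Illustrative sample data, not a real content inventory.
const content: ContentItem[] = [
  { section: "Local News", geo: "East Cyberton", topic: "neighborhood council" },
  { section: "Business",   geo: "East Cyberton", topic: "Main St. retail" },
  { section: "Opinion",    geo: "East Cyberton", topic: "neighborhood council" },
  { section: "Sports",     geo: "Cyberton",      topic: "high school football" },
];

// Roll up by geography across every section, not within one section.
const byGeo = new Map<string, number>();
for (const item of content) {
  byGeo.set(item.geo, (byGeo.get(item.geo) ?? 0) + 1);
}
console.log(byGeo); // Map { 'East Cyberton' => 3, 'Cyberton' => 1 }
```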


Sometimes these hyper-codes are in content management systems but not in web analytics systems like Omniture or Google Analytics.  Knowing what you’ve got is great – but knowing how much traffic each hyper-coded chunk of content is getting is equally if not more important.
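
One way to close that gap is to have the page templates pass the hyper-codes along with each page view. A sketch using Google Analytics' ga.js custom variables – the slot numbers and variable names here are my own illustrations, and Omniture's props/eVars would play the same role:

```typescript
// The ga.js snippet defines a global command queue named _gaq.
declare const _gaq: any[][];

// Page-level custom variables (scope 3) must be pushed before the
// pageview so the hyper-codes ride along with the traffic data.
_gaq.push(['_setCustomVar', 1, 'Geo', 'East Cyberton', 3]);
_gaq.push(['_setCustomVar', 2, 'Topic', 'community orchestra', 3]);
_gaq.push(['_trackPageview']);
```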


The hyper-codes – and thus the data – only make a difference if a news org is willing to take a hard, nontraditional look at itself.   The data may suggest it needs to radically change what it covers and the way it allocates its news resources so it can produce “relevant, non-commodity local news that differentiates” it, as Neil Heyside’s PBS MediaShift story points out.


Heyside’s study of four U.K. newspaper chains has some interesting ideas about how a news org can cut costs but still maintain quality by changing the ways it uses staff-produced, wire, and free citizen-journalist content.  The news orgs in the study “had already undergone extensive staff reductions.  In the conventional sense, all the costs had been wrung out.  But newspapers have to change the way they think in order to survive.  If you’ve wrung out all the costs you can from the existing content creation model, then it’s time to change the model itself.”


Decision-making
If a news org doesn’t know, in incredibly painful detail, what type of content it has and how much traffic each type is getting, then it’s not arming itself with everything it can to mitigate the risks of making radical changes such as investing what it takes to succeed in hyper-local news and setting up pay walls.  Both are pretty scary, and it’s going to take a lot of bold experimentation – and data – to get it right.


Two million users? Not enough…

Xmarks had lots of users but little engagement.  Apparently it forgot to figure out what its users needed or wanted.

From WebNewser:

“For the popular bookmark syncing service Xmarks, 2 million users was apparently not popular enough. Co-founder and CTO Todd Agulnick announced on the company blog Tuesday that, despite growing by 3,000 users each day, the startup was floundering and would shut down its service in 90 days….

Unfortunately, users who tried the system were looking for answers to questions rather than topical lists of sites.”

Believing in time-on-site

A Sept. 12 Pew Center study found that “Americans are spending more time following the news than over the past decade.”  Great news – or is this yet another misleading key performance indicator, as my previous blog post about time spent on site might suggest?

No – I like the Pew Center study.  It’s a study of attitudes and feelings.  Good old-fashioned survey research (with all of its mind-numbing statistical sampling) is an essential component of web analytics.  Web site traffic data is audience behavior – the “what.”  News orgs have to have attitudinal research to understand the “why” so they can attract audiences that aren’t coming to their sites.

The data you get from Google Analytics or Omniture is enticing, isn’t it?  (Work with me, here….)  Oh wow, we can track every click!  Ah, yes, we can track every click – but we can ONLY track clicks on OUR site, not on anyone else’s.

A time-on-site calculation can only be made for you if someone clicks on a page in your site and generates a page view that’s counted by your Google Analytics/Omniture account.  Time-on-site is the time between the first page clicked on YOUR site and the last page clicked on YOUR site.

This means:

1.   If someone clicks on YOUR site and then immediately goes to another site (a bounce), it’s not included by Google Analytics/Omniture in the time-on-site calculation.  It’s like it never existed, time-on-site-wise.

It IS counted as one visit and as one page view.  So that means that all of those people who come to your site regularly (you know, the ones we really like) just to get the latest on a story aren’t counted in time-on-site – and they should be.

2.    If someone is on his/her third page in your site and opens another tab and goes to another site for twenty minutes before returning to your site and clicking on another two pages, those twenty minutes are included in time-on-site – and they shouldn’t be.

3.    The time a person spends on the last page of your site isn’t counted.  If someone clicks through a few pages on your site and spends 15 minutes utterly absorbed in a story before leaving your site to go pay bills online, those 15 minutes aren’t included in time-on-site – and they should be.

So, time-spent-on-site is always either over- or under-counted – and you’ll never know which.  That makes this metric utterly unreliable as an indicator of success or failure.  You can’t make a decision with this data because you can’t know whether your action – a section added, the number of long-form videos reduced – caused time spent to go up or down.
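
The three cases above fall straight out of the arithmetic. Here's a toy reconstruction – my own simplification of how page-tag tools behave, not Google Analytics' or Omniture's actual code:

```typescript
// Timestamps are seconds since the visit began; values are invented.
function timeOnSite(pageviewTimes: number[]): number | null {
  if (pageviewTimes.length < 2) {
    return null; // a bounce: no second click, so no measurable duration
  }
  // Only the span between the first and last click on YOUR site counts.
  return pageviewTimes[pageviewTimes.length - 1] - pageviewTimes[0];
}

console.log(timeOnSite([0]));           // null – 15 minutes of reading, invisible
console.log(timeOnSite([0, 30, 1260])); // 1260 – includes a 20-minute detour elsewhere
// And whatever happens after the last pageview is never in the sum at all.
```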

More importantly, these days it really doesn’t matter how much time people actually spend on a news site.  What matters much, much more is whether people are engaged with the news, whether they believe news sites are an essential component of their lives, so much so that they come back repeatedly, rate a story with five stars, leave comments, click on an ad, and otherwise use the site.  It doesn’t matter whether they spend three seconds or three hours.

That’s what makes the Pew Center finding so exciting (surely you’re still with me on how great web analytics is?).  People actually said they’re spending more time with the news now than they did a decade ago.  It doesn’t matter whether they actually are (!) – they believe they are.

I wish every news org could afford its own Pew Center-like attitudinal research study so it could track how engaged its own targeted audiences are (or aren’t), and understand how to get and retain new audiences.  The information wouldn’t always be pretty, but at least it would be data that could make a difference.

Decision-making with data

Wow – a web analytics story in the New York Times!  And one that didn't spark mass hysteria by claiming that looking at web traffic data is the end of journalism as we know it.

Jeremy Peters' September 5 story, "Some Newspapers, Tracking Readers Online, Shift Coverage," nicely framed the issues about using web analytics to make decisions about coverage.  At one end you have journalists who completely ignore audiences and report whatever they want.  On the other there are those who "pander to the most base reader interests." 

This was one of the few mainstream media stories I've seen that recognized the new role that audiences have in informing – not dictating – news decisions. 

Actually, what was most interesting to me was that Raju Narisetti, the Washington Post's managing editor of online operations, said that he used "reader metrics as a tool to help him better determine how to use online resources….the data has proved highly useful in today's world of shrinking newsroom budgets."

In other words, Narisetti didn't just look at overall site traffic numbers like total visits or unique visitors and say "Oh, that's nice," or "Whatever! I've got a paper to put out."  He had to cut his budget, he saw that long-form videos had little traffic compared to other types of content, and then he decided to cut "a couple of people" from that department.

Narisetti made the decision – the data didn't.  He used the data.  In my rather narrowly focused web analytics world that is a really, really big deal.  Data just wants to be useful.  Seriously.  If it could talk, it would say that it doesn't want to be any part of a daily e-mail that "about 120 people in The Post's newsroom" get, one that lays "out how the Web site performed in the closely watched metrics – 46 in all."

Forty-six! 

For data to be useful you have to ask it specific questions that you know it can answer.  A metrics report won't tell you what department to cut, and by how much.  However, it can tell you what types of stories attract the largest and smallest audiences, and which story types appear to have growing or declining audiences.  You can also get indicators – such as the percent of long-form videos people watched to the end, rated and/or commented on – of whether you're engaging audiences.
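
The video-completion indicator, for example, is nothing more than a ratio of two event counts (the numbers below are invented):

```typescript
// Hypothetical event counts pulled from an analytics report.
const longFormVideoStarts = 1800;
const longFormVideoCompletions = 270;

const completionRate = longFormVideoCompletions / longFormVideoStarts;
console.log(`${(completionRate * 100).toFixed(1)}% watched to the end`); // "15.0% watched to the end"
```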

Then the hard part – the decision – is up to you.


Number wars

What does the war in Iraq have in common with the war news organizations are fighting for their survival?

Nothing at all, unequivocally.

However, I was struck by Owen Bennett-Jones’ closing of his August 27 BBC story about the disparities in the reports of the numbers of civilians killed in Iraq since the war started in 2003:

“It remains true that people tend to cite a number that reflects not their view of the quality of the research but rather their view of the war.”

In other words, getting someone to use your numbers – your definition of success – is just as hard as getting them to change their thinking.

In the world of web analytics, people “tend to present the metric that’s most likely to work in their favor.  They’re tracking the wrong way or they don’t want to look at a particular set of data. They are wrongly using analytics as what [Robert Rose of Big Blue Moose] calls a Weapon of Mass Delusion….Worst of all, they are not learning to apply insight to action.”

— from “A Web Analytics Trap,” by Jim Ericson, Information Management, August 13, 2010


Page views: bad metric #3

Why do news organizations persist in using total page views as a measure of success?  Perhaps because if you're afraid of numbers then you're even more afraid of bad numbers, or numbers that tell you that your site isn't as successful as you want.

As with unique visitors and time on site, page views is a deeply flawed metric for understanding how a news organization is growing and retaining audiences.  

If the number of page views goes up, it could be a good thing.  Or, it could be bad.

If page views go down, it could be a bad thing.  Or – you guessed it – it could be good.

We would all like to think that a soaring number of page views means lots of people are eagerly pawing through our sites reading everything that's written.  However, how many times have you gone to a site and clicked on, say, 12 pages, fruitlessly looking for something? 

This is counted as:

1.    One unique visitor
2.    One visit
3.    12 page views

And one dissatisfied person who may not come back.

The page views metric rewards the bad design and navigation that many news sites have (sorry).  Most news sites persist in using section titles that are the same as their legacy media product (e.g., "Local News," "Life"), leaving audiences – if they're so inclined – to have to click numerous times before landing on a story about a particular city or activity like gardening.   

Or, a site breaks up a story into multiple pages, which can be annoying to a reader and reduces the possibility the reader will read the entire story and rate it, e-mail it or leave a comment.  What could be counted as one page view with a comment is counted as, say, five page views. 

If a site is redesigned and readers can find what they want with fewer clicks, total page views will – should – go down. 

And dynamic content – content that changes on the page without the reader clicking anything – isn't counted at all.  Streaming stories, videos, widgets, Flash applications, podcasts?  One page view.
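
Those interactions can be made countable, though. A sketch using ga.js event tracking – the category, action and label strings are my own placeholders:

```typescript
// The ga.js snippet defines a global command queue named _gaq.
declare const _gaq: any[][];

// Called by the video player's play handler; no pageview is generated,
// but the interaction is now counted as an event in the traffic reports.
function onVideoPlay(storyTitle: string): void {
  _gaq.push(['_trackEvent', 'Video', 'play', storyTitle]);
}

onVideoPlay('Cyberton community orchestra fall concert');
```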

To truly grow, a news org must understand every action its audiences are taking on its sites.  These are challenging times that require news sites to experiment and try many different things.  Not all things will work, which means sometimes the numbers will be bad.  But – you guessed it – that's a good thing.  We have to know if something's not working so we can fix it. 


“A trend is a friend”

Someday I’ll give up web analytics and move on to something real, like pottery, but until then I’ll keep fighting the good fight to get news orgs to stop using monthly unique visitors as an indicator of success.

It’s tempting, I know, to count UVs because the number of monthly subscribers is the standard for print, and the number of people Nielsen says watched a program is the standard for TV.

But technology has made everything different.  Strategy is more important than ever, and understanding audiences, not just counting them, is essential.

UVs are counted by counting the number of cookies, or computers, that go to a site.  This means UVs are always significantly overcounted or undercounted.

If one person uses three computers, it’s counted as three unique visitors.

Conversely, a number of people going to one computer – for example, at a school or library – means that UVs will be undercounted.
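
A toy illustration of both distortions at once, with invented readers and cookies – counting cookies is not counting people:

```typescript
// Each record is one visit: which person, on which browser cookie.
const visits = [
  { person: 'Ann', cookie: 'home-pc' },
  { person: 'Ann', cookie: 'work-pc' },
  { person: 'Ann', cookie: 'phone' },     // one reader = three "unique visitors"
  { person: 'Bob', cookie: 'library-1' },
  { person: 'Cal', cookie: 'library-1' }, // two readers = one "unique visitor"
];

const uniqueCookies = new Set(visits.map(v => v.cookie)).size; // reported UVs
const uniquePeople  = new Set(visits.map(v => v.person)).size; // actual readers
console.log({ uniqueCookies, uniquePeople }); // { uniqueCookies: 4, uniquePeople: 3 }
```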

Neil Mason, a web analytics guru who does some pretty thorough research for his clients, noted in a recent ClickZ column that UVs are usually overcounted.

(So that’s the reason why news orgs use total UVs – would they use this number if it were consistently undercounted?  Don’t think so…)

Mason notes that while the UV metric is “particularly important for those sites that are dependent on advertising revenues as a major source of income,” it “must always be treated with caution and never taken at face value.”

Believe me, it’s rare to hear a web analytics expert use the word “never.”

The advice from the ever-pragmatic Mason:  “A trend is a friend.”   Analyzing significant increases or decreases over time will give news orgs the information needed to build audiences.
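
In practice that means reporting the month-over-month change instead of quoting one month's raw count. A minimal sketch, with invented numbers:

```typescript
// Four months of UVs; the absolute values matter less than the movement.
const monthlyUVs = [120_000, 126_000, 131_000, 118_000];

const momChange = monthlyUVs.slice(1).map((uv, i) =>
  (((uv - monthlyUVs[i]) / monthlyUVs[i]) * 100).toFixed(1) + '%'
);
console.log(momChange); // [ '5.0%', '4.0%', '-9.9%' ] – the drop is the story
```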

(Trends are indeed friends, but don’t even think about using counts of Facebook friends and Twitter followers!)


An ounce of Prevention

I exercise regularly and don’t smoke, but I avoid reading Prevention.com. It’s just too annoying to be reminded of all of the other “smart ways to live well.”  When I see “by the editors of Prevention.com,” I have this picture in my mind of a bunch of really healthy people popping out cheerful stories like “5 Vitamins Your Bones Love” and “10 Reasons You’re Always Exhausted.”

Now I have another (annoying) vision, thanks to MinOnline‘s story about how well Prevention is using web analytics.  Of course a staff that is so pragmatic and probably always mentally alert would resist “going for the cheap link grab and traffic spike” – the junk food of web analytics.

I haven’t been sleeping well because I think too much (the top reason people don’t get enough sleep and are therefore exhausted), so I’ll just plop in these two paragraphs verbatim from Steve Smith’s MinOnline story, “At the Building Prevention.Com, Only The Abs Are Flat.”

Prevention stays “on its own brand message and [courts] the kinds of audiences that it and its advertisers really want. ‘We got back to engaging with our customer in the ways we knew they wanted us to engage with them,’ says [vp/digital Bill] Stump. Fishing for any and all eyeballs and courting simple traffic spikes in the search-driven universe doesn’t pay off in the end. ‘You get waves of traffic, but the tide goes back out and what are you left with?’ Instead, by keeping to the needs of the ‘core customer’ in everything that goes out to syndication or into the e-mail newsletters, prevention.com is courting the people who tend to stay.

“Now, each big wave raises the sea level for all of prevention.com’s metrics, says Stump. In the last two years, overall page views climbed 60%. In the last year, the number of visits per user went up 12%. But it is the engagement metrics of which Stump is proudest. ‘The number that warms my heart,’ he says, ‘is page views per visitor that are up 49%.’ That means the new visitors are sticking with the site and drilling much deeper than they ever have before. ‘In general, advertisers want an engaged audience. They want the metrics that show that people value your brand and come to you for something that is unique. We own natural health and fitness and beauty. We are the authentic voice.'”


Wasting time

Time spent on a site or a visit ranks right up there with total page views and monthly unique visitors as widely quoted metrics masquerading as indicators of success for news organizations.

No, it's not a crime to misuse a metric, but isn't it a shame to waste your time on something that's not absolutely essential to your site's success?

Plus, the way that time spent is calculated is flawed.  All web metrics are flawed somewhat, but time spent is really misleading.

More on the ugly methodology later – let's tackle time spent's uselessness first.  In other words, if the methodology were acceptable would time spent still be a key performance indicator?

Advertisers have always made decisions based on the level of engagement a news org's audiences have with its brand and content.  But both content and the ways people use and interact with content are different – and thus the way engagement is measured is different, too. 

In the past, time spent was an important measure of engagement for news orgs and advertisers.  People spent whole chunks of time with one medium or another.  Readership surveys measured time spent per day or per week.

Because these were surveys, time spent was based on self-reported information.  It was what people said they did vs. what they actually did.

But it didn't matter whether what people said matched with what they did.  What mattered was how engaged people felt.  People who reported they spent an hour a day with Monday's newspaper but actually only spent twenty minutes believed they spent a large chunk of time and attention with a news org.

In stark contrast, web advertising decisions depend on knowing actual behavior as reported via rows upon rows of numbers ruthlessly pouring out every second.   Among many other things, advertisers track the number of times their ads come up and are clicked upon. Sites and audiences are more niche and are highly segmented.   The algorithms for and definitions of "engagement" vary for every site and every company.  

Time spent just isn't a good indicator of engagement.  Someone who spends five minutes a day on a site, goes to five different stories each visit and adds comments twice a week is clearly more engaged than someone who comes onto a site for 30 minutes a week and clicks idly on a few pages while talking on the phone.    

How many times have you spent 30 minutes or so on a site, flipping and flapping through what seems like a million page views in a fruitless attempt to find something?  Maybe you spent 30 minutes in such a visit once – and never went back.

A news org's success in the long-term will be based not on how much time people spend on a site but what they do once they're there.


AOL analytics

In the late 1990s, “America Online” was the shiny new company everyone watched, feared and tried to copy.  Just “AOL” now, it’s hardly as fresh or inspiring. With its new CEO, logos and use of web analytics to select the stories it covers and evaluate its reporters, has AOL once again become a news organization to watch?

AOL’s announcement that it will employ “judicious use of Web-analytics software” sparked the expected flutter of coverage.  It’s admitted to using data to inform (dictate?) news decisions, so you could be led to believe that AOL is adopting a true audience-based approach.  However, after reading the Feb. 22 story in BusinessWeek and the reactions gathered by Media Post News, it seems like AOL is still using a traditional advertising-based mass media strategy.  It’s still trying to be all things to all people.  It’s just using web analytics to decide what those things are.

“Audience growth and audience engagement have to be the things that we judge the most off of our journalist investments,” AOL CEO Tim Armstrong is quoted as saying.  So far, so good.

Armstrong also said that “brand ads should be a lot bigger on the Internet today,” talking about how online advertising revenue should pick up.  But there was no mention of AOL’s own brand strategy, something that would answer the question of “What is AOL?” for audiences and advertisers once and for all.  On which niches will it focus?  How much of its content will be unique and compelling enough to those niche audiences so that they’ll come back regularly?

Patricia Handschiegel, who blogs as Daily Patricia, sums it up nicely:

“The right approach to the content business is to KNOW YOUR AUDIENCE, or the people that come to your site, and create a product for THEM. AOL’s approach is clearly not centered on this….it’ll drive up page views and therefore, revenue but that’s not likely to last as the industry becomes more analytics savvy. Today, a million uniques with zero session times, high bounce rate and no repeat visitors isn’t seen as a sign of a lack of audience but in the not too distant future it will.”

I haven’t researched AOL myself, so I don’t know if all of the details in the BusinessWeek and Media Post News stories really reflect what AOL is doing.  So I’ll just note some things news orgs should think about when using web analytics to inform news decisions and evaluate journalists.

  • Evaluating success (either a site’s or a journalist’s) by total page views doesn’t work. A large number of page views may just indicate visitors got there by mistake or clicked around trying to find something.   Plus, dynamic content (Flash, etc.) will not be counted as page views. Page views can be a useful metric, but only when combined with other metrics – such as ratios – that give context.
  • Engagement can’t be determined by web traffic or behavioral data alone.  Attitudinal research is essential to find out why people do or don’t come to a site regularly, what they want and what they’re not finding.
  • If journalists are going to be held accountable for web traffic and audience engagement will they also have control over the factors that drive traffic, such as design, navigation and marketing? Or will they just submit their stories and hope for the best?
  • “AOL is even considering sharing a portion of quarterly profits with staffers whose work fetches the most page views.”  BusinessWeek

How will traffic goals be set?  If journalists will be rewarded for generating “traffic” (however it’s defined), will they be fired if they don’t? Will the benchmarks or starting points – and the time journalists have to reach the required traffic levels – be based on whether a topic is already established or whether it’s one a news org wants to nurture and grow because the topic is essential to achieving its strategy?

  • “Tacked to the newsroom walls in AOL’s downtown Manhattan headquarters are pages and pages of Web traffic data.” BusinessWeek  Uh, this would cause even me to shut down.  It’s definitely not “judicious use of Web-analytics software.”  Does AOL have a few key performance indicators that everyone understands and on which they can focus as a team?

    Software and reports don’t make decisions; people do. Successful use of web analytics depends on the decision-makers understanding and using the information correctly.  If news orgs believe the success of their websites depends on being truly audience-focused then they must also ensure the analytical resources and processes are there as well.

AOL may stumble again but at least it’s trying something different.  I look forward to learning from AOL whether it succeeds or fails.