The FTC Privacy report “Do Not Track” – a missed opportunity

GDPR & Privacy

As readers of this blog will know, I am a strong advocate of online privacy… That may sound strange coming from a web analytics evangelist. However, if we as an industry do not sort these privacy issues out, there is a real danger that web analytics as we know it today will disappear completely.

So, following the recent excellent post from Phil Kemelor on The FTC Privacy Report, “Do Not Track” Options and Web Analytics, I wanted to also add my take here…

Disappointment

I was disappointed with the FTC Privacy report for only tackling the issue of Personally Identifiable Information (PII). To my knowledge, all developed countries have good data protection laws on this already. Essentially, this means, “you” can only store PII data with the explicit person’s permission, and you must reveal this to the person concerned should they request it. See for example the UK Data Protection Act.

What I was hoping for from the FTC, was a position on non-PII data collection. That is, collecting data that does not DIRECTLY identify the individual. I emphasise directly, because with so many web data points available from an anonymous user, it is possible for an organisation to “triangulate” non-PII data and build up a pretty sophisticated profile of the person – ultimately identifying them.

A classic case of this happening was the AOL data scandal of 2006. This involved the release of a large volume of “anonymised” search query data, intended for research purposes, that NYT journalists (and others) were able to analyse and subsequently identify people with.

Track in Aggregate

I think (hope!) web users are pretty savvy when it comes to sharing their PII data on the web – in the same way you wouldn’t share your PII with a stranger in the street.

Tracking individuals as “individuals” on the web (as opposed to in aggregate), even when anonymous, poses a greater privacy threat – as it is unregulated. As more people realise this, we could reach a critical mass of people blocking vendor tools such as Omniture, Google Analytics, Coremetrics etc., to the point where the data is so unrepresentative that it is meaningless.

The answer is to only* track your web visitors in aggregate – that is, looking at metrics that represent a segment/group, rather than an individual. In that way, an individual can never be identified. Yes, this is a compromise. Individual data is much more interesting to marketers – “we could target a potential customer with laser-like precision“. But in reality this rarely happens – after all, the visitor is still anonymous, which means there is still a lot of guess work to be done with your laser.

IF (and its a big if), all web analytics reporting was conducted in aggregate, I feel the privacy fear many people have with web analytics, would all but disappear – safe guarding our industry long into the future. That’s a very large up-side compared to the very small down-side of not having individual visitor level data.

As always, I would be interested in your comments on this subject…

*If a visitor is an existing customer, subscriber or previously given you their PII, then of course tracking them as an individual makes sense – so long as they “identify” themselves each session i.e. log in.

Related from arrticle from Vicky Brock of the WAA:

http://waablog.webanalyticsassociation.com/2011/02/privacy-debate-not-just-about-advertising-marketing.html

← Prev: Improving a website *without* Web Analytics – a case study Next: Google Analytics customisations you cannot live without #1 →

Looking for a keynote speaker, or wish to hire Brian…?

If you are an organisation wishing to hire me and my team, please view the Contact page. I am based in Sweden and advise organisations in Europe as well as North America.

4 Comments

Brian Clifton on March 8, 2011 at 1:49 pm

Related to this:

New rules set to make cookies crumble http://bbc.in/hfpTNL – the problem is that no one is differentiating 1st and 3rd party cookies
Reply
Richard on March 7, 2011 at 1:36 am

I’d like to say this is the first post I read from you Brian, I am seeking to educate a group of people on the benefits of anyaltics, split testing, and understanding conversions as it pertains to their web business activity.

You brought to light a good point with PII, I run across many sites that are setup for affiliate marketing that do not even have TOS or Privcay Statement information on them.

I am willing to bet most of them do not even protect their physical systems well enough.
Reply
Brian Clifton on February 16, 2011 at 9:37 pm

Stephen: you make a very good point. No one bothers to trawl through “individual” data even if they have it. You would be surprised as the number of vendors that list it as a USP though…
Reply
Stephan on February 11, 2011 at 11:36 am

Absolutely. Indeed there are the *other* tools that want to track each individual’s behavior throughout the website, even let you popup a window to invite them to a 1-to-1 personalized chat. How scalable is that? I mean, how much time can you devote to each individual visitor? And how significant is the data for one visitor? What decisions can you take? What actions?
Only aggregates can scale. Individual data is fun to play with for a mom&pop ecommerce site (and also obtrusive to begin with), but for large scale websites, I just don’t get it.
Reply