Privacy & Ethics Reactions & Foresights

Are we doing very precise things with very imprecise (secondary) data?

A fascinating ESOMAR Webinar – The (Still) Unfulfilled Promise of Secondary Data? – took place on September 9, 2020, arranged and moderated by Reg Baker, Regional Ambassador to North America and Consultant to ESOMAR’s Professional Standards Committee. So why a different title for this event?

In setting up the session, Reg noted:

  • The technologies and tools required to access, combine, and analyse these (secondary) data already exist, and marketers and advertisers are using them on a broad scale. But within the market research sector the promise of so-called “big data” remains unfulfilled.
  • We will consider how two values that comprise the foundation of market research – validity and respect for the privacy of those whose data we process – need to be rethought in this new context.

The speakers are two of the most experienced practitioners in the secondary data arena, albeit with very different backgrounds. Rex Briggs, Founder and Executive Chairman of the Board at Marketing Evolution, has for many years led a major vendor of ROI measurement and optimization based primarily on secondary data sources. Dr Sara Jordan serves as Policy Counsel, Artificial Intelligence and Ethics at The Future of Privacy Forum and has immersed herself in the data/analytics “ethics” arena at the highest levels throughout her career.

The appropriate context?

Before highlighting each speaker’s key points, I suggest there is a fundamental and crucial context to consider when addressing this secondary data usage arena. There is certainly no one in the research business who can ignore the Cambridge Analytica/Facebook data “arrangement”, which many data/marketing experts concluded was instrumental in the election of the current occupant of the White House and the departure of Great Britain from the EU. Nor do I believe you have to be a career media researcher, or a Brit living in the US, to appreciate this underpinning, especially as the ESOMAR Professional Standards Committee is planning to establish “Secondary Data Usage Guidelines”.

The content

Rex Briggs recalled the 1998 promise of a personal relationship between marketers and consumers in the internet age, as reflected by Wired in its May issue that year – The NEW You. No Google or Facebook then. He noted that this data and intelligence industry has grown to ~$5 trillion today, along with massive tracking capabilities that are generally unchecked, “with walled gardens still collecting and abusing individuals’ data and using it as competitive barriers to new entrants.” Walled garden data monopolies are clearly used to “personally target advertising to fuel their revenues.”

After reminding the audience of the (somewhat scary) means available to check their “collected” personal online data, Rex shared that Marketing Evolution, as a major secondary data user and analytics company, has a fundamental concern: “Across the various secondary data sources and subsequent mashups, is an individual’s data actually reflecting the person, the household, the street or even the neighbourhood?” He noted that, even with today’s computing technology, hashing together the array of data and executing the analytics in real time remains impossible.
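For illustration only, here is a minimal Python sketch of the kind of identifier hashing this alludes to, in which a normalized identifier such as an email address is hashed so records can be matched across secondary sources without exchanging raw PII. Everything in it – the data, the field names, the choice of SHA-256 – is an assumption for the example, not Marketing Evolution’s actual pipeline.

```python
import hashlib

def hashed_id(email: str) -> str:
    """Illustrative only: normalize an email and hash it so two data
    sources can join on the same opaque key without sharing raw PII."""
    normalized = email.strip().lower()
    return hashlib.sha256(normalized.encode("utf-8")).hexdigest()

# Two hypothetical secondary sources keyed by differently formatted emails.
source_a = {"Jane.Doe@Example.com": {"segment": "auto-intender"}}
source_b = {"jane.doe@example.com ": {"household_size": 4}}

# The "mashup": join records on hashed keys rather than raw identifiers.
joined = {}
for email, attrs in list(source_a.items()) + list(source_b.items()):
    joined.setdefault(hashed_id(email), {}).update(attrs)

print(joined)  # one merged record under a single opaque key
```

It is worth noting that such hashing is pseudonymisation rather than anonymisation: anyone holding a list of known emails can hash them and re-identify the records, which is precisely why the consent and purpose-limitation questions discussed later matter.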

He did acknowledge the significant compound risks, not only to any individual’s data but, critically, to the insights and findings drawn from the analysis of potentially imprecise data. To the question, “Are we doing very precise things with very imprecise (secondary) data?”, Rex posited that analytics based on typical data mashups were more precise than propensity scores derived from quality primary data, but admittedly still had a long way to go. This raises the parallel concern of the validity of the analytic models themselves.
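For readers unfamiliar with the benchmark Rex invokes: a propensity score is simply a modelled probability of a behaviour, classically estimated from quality primary data. A minimal sketch, assuming scikit-learn is available and using wholly invented survey data:

```python
# A minimal sketch of a propensity score: the modelled probability of a
# behaviour (e.g. purchase) given observed traits, fitted on primary data.
# The data below is invented purely for illustration.
import numpy as np
from sklearn.linear_model import LogisticRegression

# Hypothetical primary (survey) data: [age, income_band] -> did_purchase
X = np.array([[25, 1], [34, 2], [45, 3], [52, 3], [29, 1], [61, 2]])
y = np.array([0, 0, 1, 1, 0, 1])

model = LogisticRegression().fit(X, y)

# Propensity score for a new respondent: P(purchase | traits)
print(model.predict_proba([[40, 2]])[0, 1])
```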

In concluding, he posed a central question – “What will data look like eight years from now, in 2028?” – and offered two scenarios:

  1. Consumers will have direct control of all their data and be compensated for it.
  2. Big data media companies will be even bigger and control even more consumer data.

Dr Sara Jordan reminded us of the risks versus the benefits of research, notably in view of privacy trends and their effects on ethical data usage, and of the consequent importance of professional bodies establishing Ethical Guidelines for the use of any personal data. She underlined that benefits will only accrue from research if it is:

  • Performed well
  • Presented for review and accepted by peers
  • Translated into policy and/or action

And, when using human data, only if the research:

  • Is reviewed by appropriate review boards when required
  • Abides by methodological conventions and reporting requirements
  • Is written with clarity in accessible formats
  • Abides by the norms of publication ethics

Sara emphasized the special considerations required when using secondary data. She believes the cornerstones of this dimension are that the research: “Abides by terms of consent and purpose limitations”; and “Respects the limits of the data sharing or data use agreements, including transfer, limitation, and destruction of data.”

All of which raises the difficulty of developing Guidelines – in her terms, “risk assessments” – that assure objectivity and independence in the evaluation of any project, whether of the data itself, the data use, or the analytic models employed.

Key considerations moving forward

Further points raised for consideration by the ESOMAR Secondary Data Usage Guidelines Committee initiative included:

  • Ethical Standards must account for the scale of secondary data and can potentially adapt principles from primary research.
  • Has the GDPR made the behemoths stronger, and what is its role in any Guidelines?
  • Do the ends justify the means?
  • Any Standards or legal agenda needs to encourage an ecosystem of mutual trust between all parties involved. 
  • Data/analytics users need to be more thoughtful about leaving people out of marketing campaigns.
  • Guidelines must have serious teeth with substantial consequences to be of any value to any of the parties involved.

The catch-22?

If we believe it is in the interests of neither advertisers nor consumers to be involved whatsoever with the often toxic environments of some of the social media sites that provide secondary data to research vendors (whether anonymized or not), should researchers be using the data from such sites?

As Charlie Warzel wrote in The New York Times on September 3, 2020, in his article “Mark Zuckerberg Is the Most Powerful Unelected Man in America”:

“Facebook’s news dominance and mercurial distribution algorithms led to a rise of hyper-partisan pages and websites to fill the gaps and capitalize on the platform’s ability to monetize engagement, which in turn led to a glut of viral misinformation and disinformation that Facebook has been unable (or perhaps unwilling) to adequately police.” (Source – ed.)

I would respectfully suggest that ‘Secondary Data Source Environment Acceptability’ must be included as a third dimension of the “values” component posited by Reg for any ESOMAR Secondary Data Usage Guidelines. 

ESOMAR’s challenge?

We are chasing the holy grail of understanding the elements that drive ROI and/or key decisions for any endeavour, and tweaking them in real time, especially in a digital media world that provides a tsunami of secondary data to potentially help drive those decisions. However, the validity and ethical elements of using and analyzing that data, along with the plethora and influence of misinformation surrounding the potential audience flowing through any digital data source, become even more concerning.

To watch the webinar on demand, just click here and fill in the registration form (ed.).
