<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title> &#187; data masking methods</title>
	<atom:link href="http://www.grid-tools.com/blog/tag/data-masking-methods/feed" rel="self" type="application/rss+xml" />
	<link>http://www.grid-tools.com/blog</link>
	<description></description>
	<lastBuildDate>Tue, 20 Apr 2010 08:44:04 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.4</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Using data masking to create &#8216;anonymous&#8217; datasets</title>
		<link>http://www.grid-tools.com/blog/uncategorized/using-data-masking-to-create-anonymous-datasets</link>
		<comments>http://www.grid-tools.com/blog/uncategorized/using-data-masking-to-create-anonymous-datasets#comments</comments>
		<pubDate>Mon, 25 Jan 2010 10:57:40 +0000</pubDate>
		<dc:creator>Jess3589</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[Data masking]]></category>
		<category><![CDATA[data masking methods]]></category>
		<category><![CDATA[data masking techniques]]></category>
		<category><![CDATA[de-identify sensitive data]]></category>

		<guid isPermaLink="false">http://www.grid-tools.com/blog/uncategorized/using-data-masking-to-create-anonymous-datasets</guid>
		<description><![CDATA[Yesterday morning I woke to find an interesting article in &#8216;The Observer&#8217; about anonymizing or masking personal data records. This turned out to be somewhat ironic considering I wrote two blogs on this very topic just last week!
The article, written by Anushka Asthana (Policy Editor), discussed the concerns around using data masking or &#8220;anonymization techniques&#8221; [...]]]></description>
			<content:encoded><![CDATA[<p>Yesterday morning I woke to find an interesting article in &#8216;The Observer&#8217; about anonymizing or masking personal data records. This turned out to be somewhat ironic considering I wrote two blogs on this very topic just last week!</p>
<p>The article, written by Anushka Asthana (Policy Editor), discussed the concerns around using data masking or &#8220;anonymization techniques&#8221; to de-identify sensitive and personal information. Many large and well-known multi-national organizations and government agencies are using data masking methods to keep their production data secure (or anonymous) when they use it in development and test.</p>
<p>Anushka&#8217;s article, however, states that computer scientists in the US have discovered ways to &#8220;re-identify&#8221; the personal information of individuals who were included in anonymous datasets. How?, through using a statistical &#8220;de-anonymization&#8221; technique or, as my last blog suggested, re-engineering of masked test data.</p>
<p>So, as Anushka&#8217;s article asks, just how safe is it to share personal and sensitive information even if it is masked or de-identified? The answer, once again, is not very.</p>
<p>Organizations should start looking into other methods to secure their production data when using it outside of their &#8220;live&#8221; environment; whether this be for testing, development, training, QA or even presenting statistical information. My last two blogs discuss the option of using &#8216;data creation&#8217; techniques. No, this isn&#8217;t the process of &#8220;creating&#8221; or making-up some data based on whatever fake names or addresses come into your head. It&#8217;s quite a sophisticated process, and the end-product is secure test data that can never be re-engineered. It&#8217;s based on a model of your production environment, so the data maintains referential integrity and is exactly like &#8220;live&#8221; data, but it isn&#8217;t. Read my last two blogs to find out more.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.grid-tools.com/blog/uncategorized/using-data-masking-to-create-anonymous-datasets/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>So, why are you still using data masking?</title>
		<link>http://www.grid-tools.com/blog/uncategorized/so-why-are-you-still-using-data-masking</link>
		<comments>http://www.grid-tools.com/blog/uncategorized/so-why-are-you-still-using-data-masking#comments</comments>
		<pubDate>Thu, 21 Jan 2010 11:25:08 +0000</pubDate>
		<dc:creator>Jess3589</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[data creation]]></category>
		<category><![CDATA[Data masking]]></category>
		<category><![CDATA[data masking methods]]></category>
		<category><![CDATA[Test Data Creation]]></category>
		<category><![CDATA[Test Data Management]]></category>

		<guid isPermaLink="false">http://www.grid-tools.com/blog/uncategorized/so-why-are-you-still-using-data-masking</guid>
		<description><![CDATA[Most of us have no idea when it comes to figuring out ways to acquire the right kind of data we need for any type of test or development project.  We’re lost.  We’ve been taking copies of our live environments for years.  It’s the only method we know.  
There is only [...]]]></description>
			<content:encoded><![CDATA[<p>Most of us have no idea when it comes to figuring out ways to acquire the right kind of data we need for any type of test or development project.  We’re lost.  We’ve been taking copies of our live environments for years.  It’s the only method we know.  </p>
<p>There is only one issue changing our IT infrastructures and threatening the comfortable and reliant procedures we know good and well; compliance.  Yes, we’re now in the data protection era.  Regulating bodies would rather personal information be kept safe and secure inside the production database where it belongs, thank-you-very-much.  Also, ignoring compliance measures isn’t really the best idea.  We’ve learnt this firsthand from our competitors and their very public data breaches whilst secretly smiling to ourselves, relieved it didn’t happen to us.  </p>
<p>However, let us consider for a moment the potential implications of your organization’s sensitive data being leaked into the public domain by some unfortunate soul’s mindless mistake:</p>
<p>•	Potential law suit – check<br />
•	Damage to corporate brand and reputation – check<br />
•	Large fines imposed by regulating bodies – check<br />
•	Loss of integrity and potential loss of customers &#8211; check    </p>
<p>So, what about data masking – you ask? It’s a quick and easy way to solve the problem, right?  Let’s get a bit of production data, scramble it up, de-identify some names and &#8211; there you go.  You’ve got your test data and the perfect solution to the thorn digging into your side.  </p>
<p>Oh, but wait.  We now hear data masking isn’t actually that secure.  Bugger. </p>
<p>Then what alternative do we have? Well, using &#8217;synthetic&#8217; or &#8216;fake&#8217; test data seems to be the topic of the day – the new method, the new ‘fad’ if you will.</p>
<p>Now, for those of you who don’t know, test data creation is a bi-product of modeling and sampling production data.  What is modeled is then turned into data objects or templates which are based on the entire production environment.  The templates can be edited or enhanced based on project needs, or moved into different data formats like flat files or CSV files.  </p>
<p>This may sound impossible to you.  You may be thinking about the referential integrity of your production database; the value of every table, every cell, every format, every name, how each table is ever so consequentially connected to one another.  But it&#8217;s rather easy, actually &#8211; very easy to be frank. </p>
<p>I’ve started looking into this as part of my consultancy.  It came to light when I was contacted by a government agency needing an alternative to data masking.  I’m impressed.  It is true – using test data creation is the way forward.  On top of this, less data is actually being used and stored, since the data is generated from the model into a small template.  Also, believe it or not, synthetic data gives you better code and functional coverage.  How?  Well, it’s easier than you think.  You simply point your test data creation tool toward its data editing, enhancing and manipulation functions.  </p>
<p>Oh, and yes, I nearly forgot about the most important point.  More secure than data masking, you say?! Well, it is.  This is because data creation tools never actually access or manipulate ‘live’ production data.  They can, in fact, create data without data. Confusing?  Most data masking tools anonymize your test and development data by moving live production data into a separate staging environment so it can then be masked. How is this secure?  Well, it’s not really.  Likewise, I bet you didn’t know that masked data can be reengineered back into its original format fairly easily.  Synthetic data can never be reengineered because it’s not actually ‘real’ data.</p>
<p>So, why are you still using data masking?  My guess is because you don’t know enough about data creation &#8211; start reading!  It’s the way forward.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.grid-tools.com/blog/uncategorized/so-why-are-you-still-using-data-masking/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
<a class="a2a_dd" href="http://www.addtoany.com/share_save?linkname=&amp;linkurl=http%3A%2F%2Flocalhost%2Fwordpress%2F"><img src="http://static.addtoany.com/buttons/share_save_171_16.png" width="171" height="16" border="0" alt="Share/Bookmark"/></a><script type="text/javascript">a2a_linkname=document.title;a2a_linkurl="http://localhost/wordpress/";</script><script type="text/javascript" src="http://static.addtoany.com/menu/page.js"></script>