<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Dachis Group&#187; Filter Failure</title>
	<atom:link href="http://www.dachisgroup.com/tag/filter-failure/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.dachisgroup.com</link>
	<description>Social Business, Brand Engagement, Powerful Insights</description>
	<lastBuildDate>Fri, 10 Feb 2012 22:07:10 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
		<item>
		<title>The Structured vs. Unstructured Data Dilemma</title>
		<link>http://www.dachisgroup.com/2010/01/the-structured-vs-unstructured-data-dilemma/</link>
		<comments>http://www.dachisgroup.com/2010/01/the-structured-vs-unstructured-data-dilemma/#comments</comments>
		<pubDate>Thu, 28 Jan 2010 15:37:09 +0000</pubDate>
		<dc:creator>Lee Provoost</dc:creator>
				<category><![CDATA[Blog Post]]></category>
		<category><![CDATA[Data Ubiquity]]></category>
		<category><![CDATA[enterprise]]></category>
		<category><![CDATA[Filter Failure]]></category>
		<category><![CDATA[Future Trends]]></category>
		<category><![CDATA[Workforce Collaboration]]></category>

		<guid isPermaLink="false">http://www.dachisgroup.com/?p=25883</guid>
		<description><![CDATA[One of the first things you learn at university in your first year of computer science is data normalisation. I don't know about the other people out there, but I found it such an utterly boring course. Mankind has such an obsession with categorising every single piece of data that this behaviour is crammed into the minds of naïve and unknowing computer science students, just fresh from high school.]]></description>
			<content:encoded><![CDATA[<p>One of the first things you learn at university in your first year of computer science is data normalisation. I don&#8217;t know about the other people out there, but I found it such an utterly boring course. Mankind has such an obsession with categorising every single piece of data that this behaviour is crammed into the minds of naïve and unknowing computer science students, just fresh from high school.</p>
<p>Why is that? Well it does make it easer for us mortals to grasp the vast amount of data that is around us. Think about the early days of Yahoo! that used a directory approach and manual work to build a whole categorised database of web links.</p>
<p>But then cracks start appearing in our approach. Familiar with the &#8220;in which folder am I going to put this email?&#8221;-problem? First of all you start to have a gazillion folders and then the problem pops up that an email can belong to two different folders. After a while you start to wonder &#8220;where the hell is that message that I filed?&#8221;.</p>
<h4>Social Networks and Cloud driving the change</h4>
<p>As a technologist, I also understand the underlying technical reasons why we want to normalise and categorise data. You first start with &#8220;objectifying&#8221; your view of the world in an object-oriented language like Java so it is easier to program and maintain those systems. This is persisted in classic row-column structure in a database where you create different tables for different data concepts. You have the student object with its properties stored in the Student table and course object with its properties in the Course table. Now you can create meaningful relations between the two and develop your software accordingly.</p>
<p>When your application starts to get a huge load, you will either spend an awful lot of money on expensive Oracle cluster software or you will use concepts like <a href="http://en.wikipedia.org/wiki/Shard_%28database_architecture%29">sharding</a> to split databases to keep it performing. But still, you stick to your normalized Student and Course table structures.</p>
<p>An approach that started to get traction in some of the successful internet startups was the concept of key/value pairs. While not a novel concept at all, it got more popular because we started to get more and more websites that couldn&#8217;t cope with the amount of users. The idea is that you don&#8217;t normalise data in Course and Student tables, but you just have a key and an associated value combination in one huge table. This approach scales extremely well because it can be easily spread over a cluster of machines. Google&#8217;s <a href="http://labs.google.com/papers/bigtable.html">BigTable</a> uses this as well as Facebook&#8217;s <a href="http://incubator.apache.org/cassandra/">Cassandra</a>. It became more &#8220;mainstream&#8221; with Amazon&#8217;s introduction of their <a href="http://aws.amazon.com/simpledb/">SimpleDB</a> that has this key/value concept, as well as Microsoft&#8217;s key/value storage in <a href="http://www.microsoft.com/windowsazure/http://www.microsoft.com/windowsazure/developers/sqlazure/">Azure</a>.</p>
<h4>SSD discs to the rescue!</h4>
<p>The only problem is&#8230; that developers have been so brainwashed with the relational row/column concept that the average corporate developer had a hard time grasping this key/value concept and building meaningful apps on top of it.</p>
<p>Both Amazon and Microsoft have realized that and have added a relational layer on top of their key/value data storage engine. This means that the vendors can use key/value for achieving extreme scalability, but they let developers interact through a relational model interface. (It does create a performance penalty, but negligible for most users I think).</p>
<p>But still, even this key/value approach requires that you are modelling your apps and data in a particular way. What if we didn&#8217;t have to do ANY effort at all? What if we could just take the vast amount of data we have in our company, dump it on a disc and just being able to find meaningful information AS IS? Yes, we do have the Google Appliance for that, or Microsoft Enterprise Search, but they are merely indexing in a batch process and storing the results in a cache. What if we want a real-time extremely fast search that gives you on the spot information regardless of the data structures?</p>
<p>That&#8217;s exactly the gap Oracle wants to fill with their <a href="http://www.oracle.com/us/products/database/exadata/index.htm">Exadata</a> v2 server. Consider it as a beast of a server, crammed with more than a terabyte of Solid-State Drive (flash) discs that has your data fully stored, it&#8217;s fairly comparable with having ALL your data hot in memory, ready to be queried.</p>
<p>Can you grasp the opportunity this presents us? We can achieve real-time search inside our enterprise on vast amounts of unstructured data!</p>
<h4>Be like the kitteh<a href="http://www.headshift.com/blog/funny-pictures-cat-faxes-himself.jpg"><img class="alignright" src="http://www.headshift.com/blog/funny-pictures-cat-faxes-himself-thumb-300x201.jpg" alt="funny-pictures-cat-faxes-himself.jpg" width="300" height="201" /></a></h4>
<p>This is again an example that we shouldn&#8217;t limit ourselves by the current state of technology. In the past 10 years I&#8217;ve seen again and again that eventually, technology will catch up to make things possible that we couldn&#8217;t do before.</p>
<p>Most people will quote Einstein and say &#8220;imagination is more important than knowledge&#8221;, I&#8217;d rather tell you to be more like the <a href="http://www.urbandictionary.com/define.php?term=kitteh">kitteh</a> at the right. I&#8217;m pretty sure that one day, he/she will be able to fax itself to the Burger King to get some delicious &#8220;<a href="http://icanhascheezburger.com/2010/01/16/funny-pictures-to-burger-king/">cheezburgerz</a>&#8220;.</p>
<h4>Impact on Enterprise 2.0</h4>
<p>So how does this relate to Enterprise 2.0 I hear you asking. Well, I argue that we shouldn&#8217;t lose too much time and effort on meticulously categorising and tagging information and knowledge across all the different systems we have. This is because I believe that it is almost a lost cause anyway: it doesn&#8217;t scale and it&#8217;s often not the quality you want.</p>
<p>My point is that the tools and technology we get to our disposal are advancing at such a rapid speed, that we slowly start to reach the point where we can rely on software and hardware to do this boring heavy lifting for us.</p>
<p>I shared my view in a previous blog <a href="http://www.headshift.com/blog/2010/01/data-ubiquity-threatening-usef.php">post</a> that knowledge should be a happy by-product of your daily work, however this creates such a vast amount of information that this overload creates a problem on itself. Having intelligent software that can make sense of this, automatically tag it and categorise it, automatically make relationships and assumptions, would dramatically increase our efficiency.</p>
<p>At Headshift we&#8217;re doing a lot of research on helping companies staying ahead of the pack and we&#8217;ve been looking at software solutions to more intelligently and automatically manage your knowledge as described. It seems that now we finally have the right hardware innovations to make it even juicier&#8230;</p>
<p><em>This post originally appeared <a href="http://www.headshift.com/blog/2010/01/the-structured-vs-unstructured.php" target="_blank">on the Headshift blog</a>.</em></p>
]]></content:encoded>
			<wfw:commentRss>http://www.dachisgroup.com/2010/01/the-structured-vs-unstructured-data-dilemma/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Data Ubiquity Threatening Usefulness of Enterprise 2.0</title>
		<link>http://www.dachisgroup.com/2010/01/data-ubiquity-threatening-usefulness-of-enterprise-2-0/</link>
		<comments>http://www.dachisgroup.com/2010/01/data-ubiquity-threatening-usefulness-of-enterprise-2-0/#comments</comments>
		<pubDate>Thu, 14 Jan 2010 15:50:50 +0000</pubDate>
		<dc:creator>Lee Provoost</dc:creator>
				<category><![CDATA[Blog Post]]></category>
		<category><![CDATA[Data Integration]]></category>
		<category><![CDATA[Data Ubiquity]]></category>
		<category><![CDATA[Filter Failure]]></category>

		<guid isPermaLink="false">http://www.dachisgroup.com/?p=23887</guid>
		<description><![CDATA[With large corporations storing more and more data in their Enterprise 2.0 (and overall IT) systems, we have the danger of getting big data silos or disparate solutions. To make matters worse, they are often stored locally in systems that are owned by different business units with different purposes.]]></description>
			<content:encoded><![CDATA[<p><em>(This blog post is co-written with Nigel Walsh from the Enterprise Mashups vendor Corizon and started over a bowl of porridge. It originally appeared on the <a href="http://www.headshift.com/blog/2010/01/data-ubiquity-threatening-usef.php" target="_blank">Headshift blog</a>.)</em></p>
<blockquote><p>&#8220;Content and data are everywhere. People are creating and curating content like never before. As data storage becomes cheaper, <strong>businesses are storing, archiving, and mining more data than previously possible. The increasing openness of APIs and data portability make more enterprise data available for both consumers and employees to consume.</strong> Free flow of data also allows business partner relationships to be readily analyzed and optimized.&#8221; (<a href="../social-business-design/emerging-opportunities/">Emerging Opportunities</a> in Social Business Design)</p></blockquote>
<h2><strong>Filter Failure</strong></h2>
<p>With large corporations storing more and more data (be it for compliance, regulatory or internal mining purposes) in their Enterprise 2.0 (and overall IT) systems, we have the danger of getting big data silos or disparate solutions. To make matters worse, they are often stored locally in systems that are owned by different business units with different purposes. So, imagine that you have invested a lot of money and effort in a knowledge management system, just to realize after 3 years that it does not suit your needs anymore and you need something else? If you have a couple of thousands of files, it&#8217;s still quite manageable. However, if you work in a very knowledge-intensive organisation, three years of data might have accumulated into several hundreds of gigabytes of data. Good luck with that migration.</p>
<p>Then go through a merger with your competitor or launch a whole load of new products or services and try to gain consensus and consistency across these disparate solutions.</p>
<p>With the increasing importance (and increasing amount) of data floating around your organisation, it becomes more and more important to think about open standards for data interoperability. Accept the reality of the day that a lot of your data is stored in silos. What we need to think of now is how we are going to make this data step-by-step accessible so that we don&#8217;t need to do tedious and error-prone data migrations when the system doesn&#8217;t cope with our demands anymore.</p>
<p>Perhaps to your surprise, I&#8217;d argue that the data silo lock-in is not your biggest problem. No, the inability to intelligently manage and reuse this volume of content in a meaningful way is a much bigger danger that has a direct impact on your business. Filter failure arises when individuals are unable to synthesize and understand the vast amounts of information being generated by an organisation.</p>
<p><strong>Where the problem used to be getting enough information, now it&#8217;s being able to make sense of it all.</strong> So in addition to filtering the underlying plethora of data and subsequent applications, you also have to be an inline translator. For anyone dealing with end users directly (e.g. a front line customer agent dealing with lots of applications), they will always speak in their own language and never that of your systems, applications or processes. More importantly, they have no reason to.</p>
<h2><strong>The interface is the product</strong><span style="display: inline;"><img class="alignright" src="http://www.headshift.com/blog/simplicity.png" alt="simplicity.png" width="179" height="347" /></span></h2>
<p>But what exactly are we trying to solve here? Why would we even care about this problem? <a href="http://stuffthathappens.com/blog/2008/03/05/simplicity/">Just a Bunch of Stuff That Happens</a> perfectly coined it in the following cartoon:</p>
<p>Even this is being kind &#8211; the average knowledge worker will use between 6 and 15 of these apps, we have experienced people using upwards of 30 because of these data silos. The typical enterprise application looks much like &#8220;Your company&#8217;s app&#8221; as shown in the cartoon. There is such a vast amount of data flowing around your company that you often end up with these kind of user interfaces. Instead of achieving the goal of bringing powerful information to the fingertips of the business end-user, it just confuses people. It just makes people unhappy and unproductive. For every new channel, (email, web, social channels, &#8230;) and for every new product, the quick answer is often to bring in a new additional application. This all adds to the complexity and mess on the knowledge workers desktop.</p>
<p>And just in case you would forget, an IBM Design <a href="http://twitter.com/ibmdesign/status/6172214532">tweet</a> nailed it:</p>
<p style="text-align: center;"><span style="display: inline;"><img class="aligncenter" src="http://www.headshift.com/blog/twitter_ibmdesign_mash.jpg" alt="twitter_ibmdesign_mash.jpg" width="300" height="*" /></span></p>
<p>The end-user of your product doesn&#8217;t care what kind of data silos are laying underneath your IT system. They just want the information they need to do their work, but very importantly: taking in account the context of the work! Put yourself in the consumer&#8217;s shoes, your customer doesn&#8217;t talk to you in silos and certainly doesn&#8217;t want to be treated in silos. Have you ever called your bank, only to be passed to several different departments? We know already that Interactions aren&#8217;t Connected and ultimately we are still running processes as if we are in the industrial age &#8211; an assembly line of handoff after handoff.</p>
<p>This is the same as going to a McDonalds and asking for a &#8216;Happy Meal&#8217; but being told to get a drink from one counter, a sandwich from another, fries from a third and the toy will be sent directly from Mattel &#8211; oh and you want a straw, napkins and sauce &#8211; there&#8217;s self service for that. More of a Meal than Happy! Not quite so fast or convenient food. What happens when the meal then changes, there&#8217;s a new toy, you add something else to the box &#8211; how is the existing process able to cope with changes easily?</p>
<h2><strong>Impact of data ubiquity</strong></h2>
<p>We have now identified the impact of data ubiquity:</p>
<ol>
<li>the corporate IT department being challenged by huge data silos (lock-in) and disparate solutions with complex processes that rely on humans to be the integration layer</li>
<li>the business end-user dealing with too many different and complex applications and not being able to make sense of all the data (filter failure), subsequently not being able to deliver the process</li>
</ol>
<p>In many businesses the delivery of an end to end business process relies on users accessing multiple software applications that combine to deliver the complete process. The result of this can be disjointed processes, mistakes, slow access to required information, no single customer view and ultimately a dysfunctional customer experience that the business users can&#8217;t impact.</p>
<p>So, the goal we&#8217;re trying to achieve is to <strong>provide business end-users Enterprise 2.0 systems with meaningful contextual information in a simple and elegant way</strong> &#8211; we like to call this fit for purpose!</p>
<h2><strong>Mashing it together</strong></h2>
<p>As a technologist, the first reaction would be to try to solve this data silo problem. (Let&#8217;s ignore for a second the old approach to create a monolithic repository, called the black hole, where we dump everything in.) How can we make it more accessible, can we wrap a web services around it, can we apply on a large scale the principles of Service-Oriented Architecture and Model-View-Controller, can we add an open data API interface to it, etc. That will most likely keep us busy for the next coming years. Wake up call: your customers are not going to wait two years till you have your internal issues solved. This approach also often instigates new shadow projects that proclaim to deliver tactical, quick win solutions whilst waiting for the &#8216;nirvana&#8217;.</p>
<p>As a business end-user, you&#8217;re faced with a proliferation of applications. A lot of time and money have been invested in building these, so&#8230; why not starting by reusing the useful bits of the existing apps to get immediate value?</p>
<p>This is the approach pioneered by <a href="http://www.corizon.com/">Corizon</a>. A user-centered focus approach, starting with the end users and working down &#8211; understanding the process in which they go through to complete a task, be it solve a customer enquiry in the front office or manage work in the back office &#8211; referred to as the user process. All too often, technology&#8217;s answer to a problem is upgrade to the latest version as it has all these new features. The problem with this is that this doesn&#8217;t necessarily resolve the original problems, complex process, too many applications.</p>
<p>Once the user process is clearly defined, we then (or can in parallel) look at working up &#8211; understanding what data &amp; applications we need access to effectively and efficiently complete the defined user processes. We will now understand the pattern, what gets the most use, by who, how. This firmly puts the cross hairs on which applications to enable for reuse. Unlike traditional data re-use approaches, this approach enables the useful &amp; required bits of applications, but also defines reusable UI and stores these in a library of reusable services.</p>
<p>Now we have two key elements defined, a clear user process and a set of reusable UI services. Corizon&#8217;s solution then allows you to mashup these to create the optimal interface, be it a standalone UI or consumed as a widget in your Enterprise 2.0 application. Adding new, or removing legacy applications becomes a much less complex task &#8211; the UI Services approach allows you to interchange these without affecting the interface.</p>
<h2><strong>Good is good enough</strong></h2>
<p>At first sight, this approach might sound a bit unconventional, but we&#8217;d like to invite you to an excellent post by Peter Evans-Greenwood (@<a href="http://www.twitter.com/pevansgreenwood">pevansgreenwood</a>) that talks about &#8220;<a href="http://peter.evans-greenwood.com/2009/11/26/the-price-of-regret/">The Price of Regret</a>&#8220;.</p>
<blockquote><p>Building the big, scalable perfect solution in the first place might be more efficient from an engineering point of view. However, if we make the delivery effort so large that we miss the window of opportunity, then we&#8217;ve just killed any chance of helping the business to capitalise on the opportunity. &#8230; Size the solution to match the business opportunity, and accept that there may need to be some rework in the future. Make the potential need for rework clear to the business so that there are no surprises. Don&#8217;t use potential rework in the future as a reason to do nothing. Or to force approval of a strategic infrastructure project which will deliver sometime in the distant future, a future which may never come.</p></blockquote>
<p>One thing we&#8217;ve learned in this consulting business is that most of the times, good is good enough since perfection takes an eternity.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.dachisgroup.com/2010/01/data-ubiquity-threatening-usefulness-of-enterprise-2-0/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
	</channel>
</rss>

