Technology Integration And Big Data: Extracting Value

Become a Member!

Why Register?

Login

Featured Research

Announcements

Technology Events

Home Profile Peers Wiki Activity Groups Feedback

Technology Integration and Big Data: Extracting Value

Currently 5/5 Stars.
1
2
3
4
5

rate this

Last Update: Aug 16, 2012 | 06:01

Viewed 16345 times | Community Rating: 5

Originating Author: Sid Probstein

Most people think of Big Data as being about volume, but there are other critical dimensions such as velocity and variety. From the Relay TM case, we see that while volume is certainly important, the main driver of business value comes from looking across many disparate sources - both internal and external. A unified view is essential to making this happen.

Architecturally, there are a few different routes to achieve unified information access (UIA) across silos. The following diagram presents the three most prevalent options.

is a federated or virtualized approach. A client calls a query server and provides details on the information it needs. The query server connects to each of many sources, both structured and unstructured, passes the query off to each source, and then aggregates the results and returns them to the client. Building a model like this is complex, although it seems sensible because it doesn't require any normalization. On the other hand, it is also a "brute force" approach that won't perform well on cross-silo analysis when any result set is large.
is a pre-JOINed approach. Data is ingested and normalized into a single model, following an ETL process. A query server resolves queries against it. This model will have more consistent performance. However, it trades-off flexibility at query time, because in order for a new relationship to be used, all of the data must be re-ingested and re-normalized. The ingestion logic is also challenging as data must be modeled prior to ingestion, and the keys between data items must be pre-defined.
is the true agile UIA approach. Data is ingested and modeled just as it was in the source repository - typically in tables with keys identifying relationships. Flat repositories like file systems become tables also. This model has consistent performance and offers complete flexibility at query time as any relationship, even one that is not formal in the data, can be used. The ingestion logic is far simpler than in option #2, as it does not require a normalized model, and thus avoids the ETL step.

Selecting one of these architectures depends heavily on the use case. For solutions that simply need to aggregate information from multiple sources, architecture #1 can be made to work, especially if most of the data is structured. Solutions that require relational algebra might try approach #2 if there are relatively few sources, with limited growth of sources over time. It seems to work particularly well for e-commerce sites where the catalog is central to the experience. Architecture #3 is most suited for integrating multiple silos, at scale, across multiple domains, or for solutions that may support numerous types of analysis.

Action Item: Use a UIA architecture for an upcoming strategic project. This will get your organization and colleagues thinking about how they can build solutions that connect the dots, instead of just creating more silos that require costly and time-consuming integration efforts.

Footnotes:

Comments on 'Technology Integration and Big Data: Extracting Value'

There are currently no comments. Be the first!

Post A Comment

You must be logged in to post a comment, please Sign in

Revision ID	Author	Timestamp	Comment
42224	Wikibon Daemon	12 Aug 16 18:01:07
42223	Wikibon Daemon	12 Aug 16 17:59:53
42222	Wikibon Daemon	12 Aug 16 17:59:27
42216	Wikibon Daemon	12 Aug 16 17:16:47
42215	Wikibon Daemon	12 Aug 16 17:16:38
42214	Wikibon Daemon	12 Aug 16 17:16:23
42213	Wikibon Daemon	12 Aug 16 17:15:58
42212	Wikibon Daemon	12 Aug 16 17:14:39
41929	Bert Latamore	12 Jul 18 18:18:54
41890	Wikibon	12 Jul 16 19:40:41
41766	Wikibon Daemon	12 Jul 12 11:00:40
41760	Wikibon Daemon	12 Jul 12 10:55:35
41749	Wikibon Daemon	12 Jul 12 10:39:46
41748	Wikibon Daemon	12 Jul 12 10:37:52
41747	Wikibon Daemon	12 Jul 12 10:37:32
41746	Wikibon Daemon	12 Jul 12 10:37:14
41745	Wikibon Daemon	12 Jul 12 10:36:55
41744	Wikibon Daemon	12 Jul 12 10:36:32
41742	Wikibon Daemon	12 Jul 12 10:35:03
41741	Wikibon Daemon	12 Jul 12 10:34:48
41740	Wikibon Daemon	12 Jul 12 10:34:32
41737	Sidprobstein	12 Jul 12 10:13:05	Created page with ' Most people think of Big Data as being about volume, but there are other critical dimensions such as velocity and variety. From the Relay TM case, we see that while...'

Wikibon is a professional community solving technology and business problems through an open source sharing of free advisory knowledge.

Become a Member!

Login

Featured Research

Announcements

Technology Events

Comments on 'Technology Integration and Big Data: Extracting Value'

Post A Comment

most recent wikibon articles

latest wikibon blog posts

company profiles

wikibon community information