Preliminary Standard
Business Data Lake Conceptual Framework
As a draft standard, © 2015 Capgemini
Copyright © 2015, The Open Group
The Open Group hereby authorizes you to use this document for any purpose, PROVIDED THAT any copy of this document, or any part thereof, which you make shall retain all copyright and other proprietary notices contained herein.
This document may contain other proprietary notices and copyright information.
Nothing contained herein shall be construed as conferring by implication, estoppel, or otherwise any license or right under any patent or trademark of The Open Group or any third party. Except as expressly provided above, nothing contained herein shall be construed as conferring any license or right under any copyright of The Open Group.
Note that any product, process, or technology in this document may be the subject of other intellectual property rights reserved by The Open Group, and may not be licensed hereunder.
This document is provided “AS IS” WITHOUT WARRANTY OF ANY KIND, EITHER EXPRESSED OR IMPLIED, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE, OR NON-INFRINGEMENT. Some jurisdictions do not allow the exclusion of implied warranties, so the above exclusion may not apply to you.
Any publication of The Open Group may include technical inaccuracies or typographical errors. Changes may be periodically made to these publications; these changes will be incorporated in new editions of these publications. The Open Group may make improvements and/or changes in the products and/or the programs described in these publications at any time without notice.
Should any viewer of this document respond with information including feedback data, such as questions, comments, suggestions, or the like regarding the content of this document, such information shall be deemed to be non-confidential and The Open Group shall have no obligation of any kind with respect to such information and shall be free to reproduce, use, disclose, and distribute the information to others without limitation. Further, The Open Group shall be free to use any ideas, concepts, know-how, or techniques contained in such information for any purpose whatsoever including but not limited to developing, manufacturing, and marketing products incorporating such information.
If you did not obtain this copy through The Open Group, it may not be the latest version. For your convenience, the latest version of this publication may be downloaded at www.opengroup.org/bookstore.
Preliminary Standard
Business Data Lake Key Concepts
ISBN:
Document Number:
Published by The Open Group, .
Comments relating to the material contained in this document may be submitted to:
The Open Group, Apex Plaza, Forbury Road, Reading, Berkshire, RG1 1AX, United Kingdom
or by electronic mail to:
ogedit@opengroup.org
Contents
Contents 3
Preface 5
Trademarks 6
Participants 7
Acknowledgements 8
Referenced Documents 9
1Introduction 10
1.1Objective 10
1.2Overview 10
1.3Linkage to Other Open Group Standards 10
1.3.1Linkage to TOGAF 10
1.3.2Linkage to Archimate 11
1.3.3Linkage to IT4IT 11
1.3.4Linkage to O-DEF 11
1.4Conformance 11
1.5Terminology 11
1.6Future Directions 11
2Definitions 13
2.1Analytics 13
2.2Batch, Micro Batch 13
2.3Big Data 13
2.4Ecosystem 15
2.5Enterprise Data Warehouse (EDW) 16
2.6Knowledge 17
2.7Master Data Management (MDM) 17
2.8Metadata 18
2.9Open Platform 3.0 19
2.10Platform 19
2.11Real-Time, Near Real-Time and Interactive response time 19
2.12Structured Data, Semi-Structured Data, Unstructured Data 19
3Overview of the BDL 21
3.1Business Data Lake Definition 21
3.2How does the BDL work? 21
3.3Relevant Business Scenarios for the BDL 24
3.3.1Enterprise Data Warehouse off-load 24
3.3.2Discovery Platform 24
3.3.3Big Data Apps 25
3.3.4Data-Driven Enterprise 25
3.3.5Data-Driven Ecosystem 25
4The concepts of the Business Data Lake 26
4.1.1Data 27
4.1.2Metadata 27
4.1.3Event 27
4.1.4Stream 27
4.1.5Insight 27
4.3.1Lambda Architecture 29
4.3.2Batch Processing Workflow 30
4.3.3Analytics 31
4.3.4Analytics Engine 32
4.3.5Real-Time processing 33
4.3.6Business compartments 33
4.3.7Actions – Service Layer 34
4.3.8Existing IS landscape 35
4.4.1Master Data Management 35
4.4.2Reference Data Management 35
4.4.3Audit & Policy Management 36
4.4.4Privacy and Protection 36
4.4.5Information Security 36
4.5.1System Monitoring 36
4.5.2System Management 37
Index 38
Preface
The Open Group
The Open Group is a global consortium that enables the achievement of business objectives through IT standards. With more than 400 member organizations, The Open Group has a diverse membership that spans all sectors of the IT community – customers, systems and solutions suppliers, tool vendors, integrators, and consultants, as well as academics and researchers – to:
Capture, understand, and address current and emerging requirements, and establish policies and share best practices
Facilitate interoperability, develop consensus, and evolve and integrate specifications and open source technologies
Offer a comprehensive set of services to enhance the operational efficiency of consortia
Operate the industry’s premier certification service
Further information on The Open Group is available at www.opengroup.org.
The Open Group publishes a wide range of technical documentation, most of which is focused on development of Open Group Standards and Guides, but which also includes white papers, technical studies, certification and testing documentation, and business titles. Full details and a catalog are available at www.opengroup.org/bookstore.
Readers should note that updates – in the form of Corrigenda – may apply to any publication. This information is published at www.opengroup.org/corrigenda.
This Document
This document represents The Open Group Preliminary Standard for the Business Data Lake Conceptual Framework. It has been initially submitted by Capgemini and then further developed and approved by The Open Group.
Trademarks
ArchiMate®, DirecNet®, Making Standards Work®, OpenPegasus®, The Open Group®, TOGAF®, UNIX®, and the Open Brand (“X Device”) are registered trademarks and Boundaryless Information Flow™, Build with Integrity Buy with Confidence™, Dependability Through Assuredness™, FACE™, IT4IT™, Open Platform 3.0™, Open Trusted Technology Provider™, The Open Group Certification Mark (“Open O”), and UDEF™ are trademarks of The Open Group.
Microsoft® is a registered trademark of Microsoft Corporation in the United States and/or other countries.
All other brands, company, and product names are used for identification purposes only and may be trademarks that are the sole property of their respective owners.
Participants
The initial version of this document was a fast-track submission to The Open Group by Capgemini. It was then further developed by the Big Data group of the Open Platform 3.0 Forum from The Open Group. The members of this group were:
Chris Harding, The Open Group, Forum Director for Open Platform 3.0
Ken Street, Conexiam
Olivier Flebus, Capgemini
Robert Weisman, Build The Vision Inc.
Carlos Ferraro Cavallini, Salesforce.com
Sudhir Singapuram, IBM
Seshu Madabhushi, Tata Consultancy Services
Acknowledgements
The Open Group gratefully acknowledges the contribution of the following people in the development of this document:
All the global Pivotal and Capgemini teams who originally crafted the Business Data Lake concepts, especially Steve Jones, Paul Gittins, Lee Brown, Pramod Taneja, John D. McKinney,
The Big Data teams at Capgemini Aerospace & Defence who contributed to and reviewed the initial sumbmission of this document, especially Cédric Cormont, Sébastien Guilloux, Alexandre Diaz, Adrien Calvayrac and Pascal Gillet.
The people who reviewed and helped to improve this document: Kary Framling, Dave Lounsbury, Andrew Josey, Ed Roberts, Tim Vincent, Mandy Chessell, Kelvin Laurence.
Referenced Documents
The following documents are referenced in this Preliminary Standard.
(Please note that the links below are good at the time of writing but cannot be guaranteed for the future.)
The Information Architecture Guide from the Open Group
Share with your friends: |