Dama-Dmbok2 Framework Draft For Download
Dama-Dmbok2 Framework Draft For Download
Dama-Dmbok2 Framework Draft For Download
Patricia Cupoli Susan Earley Deborah Henderson Editor Production Editor Program Director
DAMA-DMBOK2 Framework
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32
Table of Contents
1.1. About This Document .......................................................................................................... 2 1.2. Revision History .................................................................................................................. 4 2. What is the DAMA-DMBOK? ................................................................................................... 5 3. Introduction to DAMA-DMBOK2 ............................................................................................. 5 3.1. Why a Framework Outline? ................................................................................................. 5 3.2. History.................................................................................................................................. 6 3.3. Purpose................................................................................................................................. 6 3.4. Goals .................................................................................................................................... 7 3.5. Audience .............................................................................................................................. 7 3.6. Potential Uses....................................................................................................................... 7 4. Proposed Framework .................................................................................................................. 8 4.1. Knowledge Areas ................................................................................................................. 8 4.2. Knowledge Area-Related Processes, Activities and Elements .......................................... 10 4.2.1. Context Diagrams ........................................................................................................... 10 4.2.2. Activity Groups ............................................................................................................... 11 4.2.3. Environmental Elements ................................................................................................. 12 5. DAMA-DMBOK2 Structure .................................................................................................... 14 5.1. DAMA-DMBOK2 Book Outline ...................................................................................... 14 5.2. Knowledge Area Chapter Structure ................................................................................... 19 6. Concordance between DAMA DMBOK Editions .................................................................... 21 7. Next Steps ................................................................................................................................. 22
Table of Figures
Figure 1. The DAMA-DMBOK2 Knowledge Area Wheel ............................................................ 9 Figure 2. Context Diagram Example ............................................................................................ 11 Figure 3. Environmental Elements ............................................................................................... 13 Figure 4. Environmental Elements Scope Summary ................................................................. 13 Figure 5. Knowledge Area Chapter Outline ................................................................................. 21
DAMA-DMBOK2 Framework
33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51
DAMA-DMBOK2 Framework
52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71
DAMA-DMBOK2 Project Committee Members Patricia Cupoli Editor Susan Earley Production Editor Deborah Henderson Program Director Lisa Pazzano Communications and Marketing Manager Sanjay Shirude DAMA International VP Education Eva Smith Project Infrastructure Manager
DAMA-DMBOK2 Framework
72 73
10
11
Patricia Cupoli
Deborah Henderson
12 13
14
15
74 75
DAMA-DMBOK2 Framework
76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115
Standardization will help data management professionals perform more effectively. Executives in particular need to understand and assign value to data management activities, so they can fully support, fund, and staff the data management function. Moreover standardization will also help us communicate with our teammates, managers, and executives, and ubiquitous use will harden Data Management into a formal discipline around the world.
3. Introduction to DAMA-DMBOK2
3.1. Why a Framework Outline?
The DAMA-DMBOK2 Framework Outline described here exists to provide the proposed structure and outline of content for organizing the second edition of the Data Management Body of Knowledge (DAMA-DMBOK2) document. In order to ensure the work is an accurate
DAMA-DMBOK2 Framework
116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156
reflection of the profession it is essential to gain community consensus for the Framework that becomes the foundation of the document.
3.2. History
The DAMA Data Management Body of Knowledge (DAMA-DMBOK) has undergone an evolution over the years. It began as the Guidelines for Implementing Data Resource Management in 1991. This publication was published by DAMA International in various forms through four versions in collaboration with DAMA Chicago. DAMA International published The DAMA Guide to the Data Management Body of Knowledge (DAMA-DMBOK Guide, 1st edition) in 2009. DAMA-DMBOK was in development for several years as a complete overhaul of the earlier Guidelines. A Framework white paper was written and floated to the data management community for comment and input, and became the basis for the first publication. Full DAMA-DMBOK text development proceeded with input from contributing authors, the DAMA-DMBOK editors, DAMA-DMBOK Editorial Board, and over 120 DAMA member reviewers. In preparation for the 2nd edition, input on existing and proposed content has been collected from DAMA chapter members and Enterprise Data World conference sessions. The DAMA Dictionary of Data Management is now in its 2nd edition and was published in April 2011 containing almost 2000 terms, including terms from the DAMA Certified Data Management Professional (CDMP) exams managed by the Institute for the Certification of Computing Professionals (ICCP). It is aligned to the terms in the DAMA-DMBOK and is the glossary for the DAMA-DMBOK.
3.3. Purpose
The 2nd edition of the DAMA-DMBOK (DAMA-DMBOK2) will continue the 1st edition philosophy of offering DAMA standardization of Data Management guidelines, characteristics and active practices. It will cover the WHAT, WHO and WHY of Data Management and its various knowledge areas. It will be modeled after other professional organizations Bodies of Knowledge (BOKs) such as PMIs PMBOK (Project Management BOK), and IEEEs SWEBOK (Software Engineering BOK). The entire body of knowledge about data management is quite large and constantly growing. The DAMA-DMBOK is a guide that is intended to provide a definitive introduction that body of knowledge. It presents a standard industry view of data management knowledge areas, terminology, and common best practices, without going into implementation details. The DAMA DMBOK Guide introduces alternative views and industry accepted approaches where clear differences of opinion exist. The DAMA-DMBOK Guide should not be read as an attempt to be a complete authority on any specific data management knowledge area. Instead, it points readers to widely recognized
April 23, 2012 2012 DAMA International All Rights Reserved 6
DAMA-DMBOK2 Framework
157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190
publications, articles, and other resources for further reading on the HOW-TO methods and implementation details. DAMA also encourages communities of practice discussions on the topics presented.
3.4. Goals
The goals of the DAMA-DMBOK2 Guide are: 1. To build consensus for a generally applicable view of data management knowledge areas. 2. To provide standard definitions for commonly used data management knowledge areas, deliverables, roles and, other terminology, in conjunction with the DAMA Dictionary of Data Management and thus to move the Data Management Community towards standardization on concepts and activities. 3. To identify guiding principles for data management. 4. To clarify the scope and boundaries of data management activities. 5. To overview commonly accepted good practices, widely adopted techniques, and significant alternative approaches, without reference to specific technology vendors or their products. 6. To briefly identify common organizational and cultural issues. 7. To identify strategies for data management maturity analysis. 8. To guide readers to additional resources for further understanding.
3.5. Audience
The audiences for DAMA-DMBOK2 will be similar to the audiences for the 1st edition. The audiences for the DAMA-DMBOK are quite varied and include: Certified and aspiring data management professionals. Other IT professionals working with data management professionals. Business data stewards at all levels. Executives with an interest in managing data as an enterprise asset. Knowledge workers developing an appreciation of data as an enterprise asset. Consultants conducting assessments of client data management areas and helping to implement and improve data management at these clients. Educators responsible for developing and delivering a data management curriculum. Researchers in the field of data management.
DAMA-DMBOK2 Framework
191 192 193 194 195 196 197 198 199 200 201 202 203 204 205 206 207 208 209 210 211 212 213 214 215 216 217 218 219 220 221 222
Helping data stewards, data owners, and data professionals understand their responsibilities. Providing the basis for assessments of data management effectiveness and maturity. Guiding efforts to implement and improve data management knowledge areas. Pointing readers to additional sources of knowledge about data management. Educating students, new hires, practitioners and executives on data management knowledge areas Guiding the development and delivery of data management curriculum content for higher education. Suggesting areas of further research in the field of data management. Helping data management professionals prepare for Certified Data Management Professional (CDMP) data exams. Assisting organizations in defining their enterprise data strategy.
4. Proposed Framework
4.1. Knowledge Areas
In the 1st edition of the DAMA-DMBOK, Data Management was described as a function that is also known as a high level business process or the name of the program. This process was captured in 10 functions and associated activities. In the DAMA-DMBOK2 we are emphasizing knowledge areas rather than functions. A knowledge area is a category of specialization. It could be made up of one or more topics, which will be handled in separate sections. There are 11 knowledge areas that cover the core areas in DAMA-DMBOK2 that DAMA International considers important for those performing data management. Each knowledge area has many section topics logically grouping activities that are required to be performed. In addition, there are supporting section topics that round out the knowledge requirements for data management professionals in additional chapters. Based on received input, the DAMA-DMBOK2 will use this revised Data Management knowledge area wheel (Figure 1):
DAMA-DMBOK2 Framework
223 224 225 226 227 228 229 230 231 232 233 234 235 236 237 238 239 240
The 11 Data Management Knowledge Areas are: Data Governance planning, oversight, and control over data management and use of data Data Architecture as an integral part of the enterprise architecture Data Modeling and Design analysis, design, building, testing, deployment and maintenance Data Storage structured physical data assets storage management Data Security ensuring privacy, confidentiality and appropriate access Data Integration and Interoperability data acquisition, transformation and movement; managing ETL, federation, or virtualization Documents and Content storing, protecting, indexing, and enabling access to data found in unstructured sources (electronic files and physical records), and making this data available for integration and interoperability with structured (database) data. Reference & Master Data managing gold versions and replicas Data Warehousing and Business Intelligence managing analytical data processing and enabling access to decision support data for reporting and analysis
2012 DAMA International All Rights Reserved 9
DAMA-DMBOK2 Framework
241 242 243 244 245 246 247 248 249 250 251 252 253 254 255 256 257 258 259 260 261 262 263 264 265 266 267 268 269 270 271 272 273 274 275 276 277 278 279 280
Meta-data integrating, controlling and delivering meta-data Data Quality defining, monitoring and improving data quality
10
DAMA-DMBOK2 Framework
281 282 283 284 285 286 287 288 289 290 291 292 293 294
Inputs: What documents or raw materials are directly necessary for a Process to initiate or continue? Supplier Roles: Roles and/or teams that supply the Inputs to the Process. Responsible Roles: Roles and/or teams that perform the Process. Stakeholder Roles: Roles and/or teams Informed or Consulted on the Process. Tools: Technology types used by the Process to perform the Function. Deliverables: What is directly produced by the Processes? Consumer Roles: Roles and/or teams that expect and receive the Deliverables. Metrics: Measurements of how to quantify the success of Processes based on the Goals
This diagram is an example of what the context diagram for a knowledge area would contain. If appropriate, a sub-topic section of a knowledge area may have its own context diagram for clarity.
11
DAMA-DMBOK2 Framework
302 303 304 305 306 307 308 309 310 311 312 313 314
High level or supervisory activities that set the strategic and tactical course for other data management activities. Planning activities may be performed on an iterative basis. Oversight activities performed on an on-going basis. Activities undertaken within projects and recognized as part of the systems development lifecycle (SDLC), creating data deliverables through analysis, design, building, testing, and deployment, performed on an iterative basis. Service and support and maintenance activities performed on an on-going basis.
Below is an overview of the work profile for our four Activities Groups Iterative Planning (P) Development (D) On-going Control (C) Operational (O)
Oversight Attention to Detail 315 316 317 318 319 320 321 322 323 324 325 326 327 328
The Framework identifies the following seven elements in DAMA-DMBOK2 consistent with DAMA-DMBOK version 1. Each Element now has an additional type descriptor; People, Process, or Technology
12
DAMA-DMBOK2 Framework
332 333
13
DAMA-DMBOK2 Framework
Environment Elements relate to the Knowledge Area Context Diagrams and Activity Groups in the following way: Environment Elements Knowledge Area Context Diagrams Definition Goals Activity Inputs & Outputs, Documents, Software Products Supplier Role Responsible Role Consumer Role Stakeholder Role Approver Role Accountable Role Techniques & Metrics Activity Groups
All activities
Practices & Techniques Organization & Culture 338 339 340 341 342 343 344 345 346 347 348 349 350 351 352 353 354 355 356 357
5. DAMA-DMBOK2 Structure
5.1. DAMA-DMBOK2 Book Outline
The proposed outline is as follows below. Examples are provided to clarify the content of the chapter sections; however, they are not necessarily inclusive. Foreword Preface Acknowledgements Chapter 1: Introduction BOK Framework overview Vision Statement Scope of BOK what has changed since DMBOK1 Overlap/Interface with other BOKs and standards frameworks (like ANSI)
14
DAMA-DMBOK2 Framework
358 359 360 361 362 363 364 365 366 367 368 369 370 371 372 373 374 375 376 377 378 379 380 381 382 383 384 385 386 387 388 389 390 391 392 393 394 395 396 397 398 399 400 401
Chapter 2: Use of the DMBOK Case Studies Communicating DM value to the business Data as an asset Placing value on data Chapter 3: Overall Process: Data Management Data vs. Information Core Concepts : o Knowledge Area overview o Environmental Elements overview Knowledge Areas and Value The core Knowledge Areas follow. Here are the general outlines for each chapter. Chapter 4: Knowledge Area: Data Governance Section 1 = Data Governance Data Governance - as oversight for all data management, moving towards a unified theory of data management strategy and control (also within chapters as a focus for each knowledge area) Context: Relationship to Information Governance, IT Governance, IT Service Management, Business Management, PMO, Business Operations Data valuation ROI Data governance and Government Sector Section 2 = Overall Data Management Maturity Model (also within chapters for each knowledge area) Maturity benchmarking Maturity development (targets and activities) Section 3 = Data Stewardship and Ownership Section 4 = Business Cultural Development (SDLC incorporation in various methodologies such as waterfall and agile change management inclusion, communication challenges, Section 5 = Contracting. Service level agreements, outsourcing Data in a Cloud Section 5 = Ethics Chapter 5: Knowledge Area: Data Architecture Section 1 = Enterprise Data Architecture Frameworks: Zachman Framework for Enterprise Architecture, others Working within Enterprise Architecture (Information vs. Infrastructure, Business, and Application, specialized architectures (e.g., network)) Section 2 = Data Architecture Implementation Enterprise architecture models vs. project architecture models Data in the Cloud Linked data architecture
April 23, 2012 2012 DAMA International All Rights Reserved 15
DAMA-DMBOK2 Framework
402 403 404 405 406 407 408 409 410 411 412 413 414 415 416 417 418 419 420 421 422 423 424 425 426 427 428 429 430 431 432 433 434 435 436 437 438 439 440 441 442 443 444
Semantic data architecture (Resource Description Framework (RDF), dereferenced data) Data as a Service SEE DII Big Data architecture Web data architecture (aka information architecture) Specific architectures for knowledge areas are included in those knowledge areas. Data Architecture Governance: Standard data architectures, compliance through project execution Chapter 6: Knowledge Area: Data Modeling & Design Section 1 = Modeling Overview Relational Object DW modeling (star, snowflake, outrigger) Canonical Semantic modeling and Resource Description Framework (RDF) Master Data modeling - SEE Reference and Master Data Section 2 = Conceptual/Logical Modeling Data requirements analysis New and Existing model analysis and integration Data Profiling as it relates to validation of logical models (interrogation and verification of the data behavior) Logical modeling Requirements Normalization Discussion (1st, -6th, other) Modeling techniques for model expansion Industry standard models Section 3 = Physical Modeling Physical DB restrictions discussion De-normalization Discussion Data Vault Overview (modeling, hub, link, satellite) Non-normalized-Storage Modeling Discussion Historical Data Retention Designs, including partitioning Distributed designs ER and Object modeling Big Data Columnar DB modeling Semi-structured modeling Issues in: Views or model? Indexing (map reduce approach, traditional OLTP, hash) leading edge discussions here on retrieval issues and solutions Referential integrity enforcement Data Modeling and Design Governance:
16
DAMA-DMBOK2 Framework
445 446 447 448 449 450 451 452 453 454 455 456 457 458 459 460 461 462 463 464 465 466 467 468 469 470 471 472 473 474 475 476 477 478 479 480 481 482 483 484 485 486
Documenting the Model (including versioning, lineage) and its use as a data governance tool Best practices in Naming conventions
Chapter 7: Knowledge Area: Data Storage Section 1 = DBMS applications [RDBMS, triple store, federation/cloud], transaction vs. bulk load support, backup/recovery, monitoring/tuning, purge/archive, rowbased vs. column-based, configuration management Virtualization (cloud) Object / multi-media database DBMS monitoring, including mobile monitoring, bots. Section 2 = File storage systems (Hadoop) (big data), No SQL Data Storage Governance: Enterprise demands in service management for data storage Chapter 8: Knowledge Area: Data Security Section 1 = Security Requirement Categories (HIPPA, PCI, PII, SOX PIPEDA) Section 2 = Security Management (AAA) Internet security, costs of data breaches (monetary and otherwise) Section 3 = Privacy Data Security Governance = working with Risk Management, Legal, Security breach response, access to information (government), Identity management Chapter 9: Knowledge Area: Data Integration and Interoperability (DII) Section 1 = Approaches: integration or interoperate? Mergers and acquisitions Drivers for DII Data.gov, Open Data (government published data) Standards Architectures o Data as a Service o Batch o Near real time, trickle o Real time Section 2 = Data acquisition (get data in) Buying / selling data, contracting Integrating 3rd party data Section 3 = Data movement/services (move data around), Data integration (combine data for use), Approaches to structured / unstructured data integration / issues Data transformation (change data in place or in combination with above activity) Section 4 = Data interoperability (use separate data together with OR without integration)
17
DAMA-DMBOK2 Framework
487 488 489 490 491 492 493 494 495 496 497 498 499 500 501 502 503 504 505 506 507 508 509 510 511 512 513 514 515 516 517 518 519 520 521 522 523 524 525 526 527 528 529
Data Integration and Interoperability Governance = redundancy control, security, lineage, value chain (impact analysis), data sharing agreements, quality and recombination Chapter 10: Knowledge Area: Document and Content Section 1 = Common activities regardless of document type Architecture Data as evidence Data retention Confidentiality including data marking Section 2 = Content Management (classification, taxonomies, tagging, indexing,) Section 3 = Physical Documents (Printed documents/records) Section 4 = Electronic Documents Documents/records Images/Audio/Video Document and Content Governance = working with Risk Management, Legal, Security breach response, access to information (government), service management Chapter 11: Knowledge Area: Reference and Master Data Section 1 = Common activities regardless of data type Architecture Administration approaches / compliance System of record / Data of record (gold data) Section 2 = Reference Data (including GIS (base spatial data, solids models CAD, temporal data, purchased data such as Bloomberg, Post Office,) GIS business reference data (enterprise specific GIS reference data (where and what company assets are)) Section 3 = Master Data Business rules Data sources Reference and Master Data Governance = Determining systems/data of record, determining and managing business rules Chapter 12: Knowledge Area: Data Warehousing & Business Intelligence Section 1 = back-office specialization, Kimball vs. Inmon, , real-time, near-real-time, data discovery (not modeling, but inventory, classification and assessment) and database inventory (what data is stored where and at what level).Note: ETL(Extract Transform and Load) is covered in Data Integration and Interoperability. Section 2 = front-office specialization, Business Intelligence & Analytics - Analytics, visualization, delivery, (including GIS, Storyboarding, See also suggestions, Dynamic search, autosuggest, sticky notes, and personalization, mashups, big data analytics)
18
DAMA-DMBOK2 Framework
530 531 532 533 534 535 536 537 538 539 540 541 542 543 544 545 546 547 548 549 550 551 552 553 554 555 556 557 558 559 560 561 562 563 564 565 566 567 568 569 570 571 572
Data Warehousing & Business Intelligence Governance = Reporting Strategy, Appropriate use of and interpretation of data, data architecture compliance, training and BICC (Business Intelligence Competency Centre) Chapter 13: Knowledge Area: Meta-data Architecture Section 1 = Semantics and metadata identification, types Multilingual environments Section 2 = Metadata solutions: Business glossary, Repository Architecture, collection and maintenance Meta-data Governance = Standard data definition (models, glossary, value chain, master data, gold data source, owner, and stewards) Chapter 14: Knowledge Area: Data Quality Section 1 = Measuring and Monitoring Defining quality, Impacts of low quality Section 2 = Data Profiling, data correction Data Quality Governance = Ensuring data quality (process engineering, rules, ownership and compliance) Chapter 15: DM Supporting Topics Section 1 = Professional Development (certification, facilitation) Section 2 = Business Data Requirement Development (how to get good data requirements) and deliverable verification to requirements Section 3 = Communicating Data Management value to the business Section 4 = Data management cost control Section 5 = The Data Management organization Section 6 = Facilitation Appendix 1. Primary Contributing Authors (by chapter or section) 2. Contributing Reviewers and Commenters 3. Context Diagram contents by context area consolidations 4. Chart relating DMBOK 1st edition to DMBOK 2nd edition (Concordance) 5. Bibliography
19
DAMA-DMBOK2 Framework
573 574 575 576 577 578 579 580 581 582 583 584 585 586 587 588 589 590 591 592 593 594 595 596 597 598 599 600 601 602 603 604 605 606 607 608 609 610 611 612 613 614 615 616
1. Introduction/Knowledge Area Definition 2. Description of Sections/Topics 3. Section N a. Executive Summary/Context Diagram b. Essential Concepts, Common Vocabulary, and Popular Frameworks c. Business Drivers i. Goals & Principles ii. Government Regulations iii. Industry Standards d. Processes i. Activities e. Inputs and Outputs i. Inputs ii. Outputs f. Technical Drivers i. Toolsets ii. Techniques g. People i. Overall Organization and Culture ii. Roles & Responsibilities 1. Supplier Roles a. Internal b. External 2. Responsible Roles a. Internal b. External 3. Consumer Roles a. Internal b. External 4. Stakeholder Roles a. Internal b. External h. Inter- Section Relationships/Interfaces i. Implementation Guidelines i. Topic Readiness Assessment / Risk Assessment ii. Organization and Cultural Change 1. Process Change 2. Communication/Training 3. Transition 4. Maintenance iii. Top mistakes to avoid j. Conclusion/Summary i. Looking Forward/Trends k. Reference Citations/Additional Reading
April 23, 2012 2012 DAMA International All Rights Reserved 20
DAMA-DMBOK2 Framework
617 618 619 620 621 622 623 624 625 626 627 628 629 630 631 632 633 634 635 636 637 638 639 640 641 642 643 644 645 646 647 648 649 650 651 652 653 654 655 656 657
4. Section N+1 5. Knowledge Area Governance i. Knowledge Area Maturity Model Overview, Metrics ii. Knowledge Area Value Chain / Business Value
The DAMA-DMBOK2 will contain some different concepts than the 1st edition and these concepts are covered in section 4 of this Framework. They include, in summary: Revised DAMA-DMBOK knowledge area wheel to include eleven knowledge areas with the addition of Data Integration and Interoperability A re-ordering of the knowledge area wheel so that Data and Content Management is introduced earlier in the lifecycle of the wheel (as read clockwise)
21
DAMA-DMBOK2 Framework
658 659 660 661 662 663 664 665 666 667 668 669 670 671 672 673 674 675 676
An emphasis on knowledge areas (rather than functions) that will contain multiple subjects A re-structured format for each chapter, including re-formatted context diagrams An emphasis on data governance as a unifying oversight mechanism in data management
A table describing the areas of similarities and differences between editions will be included in the DAMA-DMBOK2 as an appendix.
7. Next Steps
This Framework proposes an outline and structure for DAMA-DMBOK2. It will be made available for a period of time to DAMA members for comment, and then to the general public. The comments will be compiled and be available via www.dama.org. The final Framework will be provided to the primary contributing authors as a guideline for development of their knowledge area or section, and to all reviewers so they can comment within expected content parameters. The editors are responsible for the final DAMA-DMBOK2 presentation and content.
22