Which of the following provides the strongest tangible reason for driving initiation of a Data Governance process in an enterprise?
There are three basic approaches to implementing a Master Data hub environment, including:
A data governance strategy defines the scope and approach to governance efforts. Deliverables include:
Your organization has many employees with official roles as data stewards and data custodians, but they don't seem to know exactly what they're supposed to be doing. Which of the following is most likely to be a root cause of this problem?
To understand and evaluate ethical use of data within the organization what principles should we base our decisions on?
A Business Glossary forces a business to adopt a single definition of a business term.
The term data quality refers to both the characteristics associated with high quality data and to the processes used to measure or improve the quality of data.
When doing reference data management, there many organizations that have standardized data sets that are incredibly valuable and should be subscribed to. Which of these organizations would be least useful?
A database uses foreign keys from code tables for column values. This is a way of
implementing:
A node is a group of computers hosting either processing or data as part of a distributed database.
A data model that consists of a single fact table linked to important concepts of the
business is a:
Industry is struggling to distinguish the accountabilities of CDO and CIO. The definition of their responsibilities may specify parts of:
What area do you not consider when developing a 'Data Governance operating model?
CMDB provide the capability to manage and maintain Metdata specifically related to the IT assets, the relationships among them, and contractual details of the assets.
A limitation of the centralized metadata repository approach is it may be less expensive.
Data Governance includes developing alignment of the data management approach with organizational touchpoints outside of the direct authority of the Chief Data Officer. Select the example of such a touchpoint.
The Belmont principles that may be adapted for Information Management disciplines, include:
The European Commission Article 29 Data Protection Working Party provides a set of criteria to evaluate anonymization methods. What do they recommend?
With reliable Metadata an organization does not know what data it has, what the data represents and how it moves through the systems, who has access to it, or what it means for the data to be of high quality.
SPARC published their three-schema approach to database management. The three key components were:
Drivers for data governance most often focus on reducing risk or improving processes. Please select the elements that relate to the reduction in risk:
Data for Big Data ingestion can also be called the data lake. This needs to be carefully managed, or the data lake will become:
Media monitoring and text analysis are automated methods for retrieving insights from large unstructured or semi-structured data, such as transaction data, social media, blogs, and web news sites.
Content needs to be modular, structured, reusable and device and platform independent.
The library of Alexandria was one of the largest collection of books in the ancient
world. Which DMBoK knowledge area is most aligned with managing the collection?
Service accounts are convenient because they can tailor enhanced access for the processes that use them.
Top down' and "bottom up' data analysis and profiling is best done in concert
because:
Data Governance is at the centre if the data management activities, since governance is required for consistency within and balance between functions.
Governance ensures data is managed, but is not include the actual act of managing data.
A weakness or defect in a system that allows it to be successfully attacked and
compromised is a:
Data Integration and Interoperability is dependent on these other areas of data management:
Valuation information, as an example of data enrichment, is for asset valuation, inventory and sale.
Confidentiality classification schemas might include two or more of the five confidentiality classification levels. Three correct classifications levels are:
Repositories facilitate the collection, publishing and distribution of data in a centralized and possibly standardized way. Data is most often used to:
Obfuscating or redacting data is the practice of making information anonymous ot removing sensitive information. Risks are present in the following instances:
A successful Data Governance program requires that all enterprise data be certified.
The dependencies of enterprise technology architecture are that it acts on specified data according to business requirements.
How do data management professionals maintain commitment of key stakeholders to the data management initiative?
Integration of ETL data flows will usually be developed within tools specialised to manage those flows in a proprietary way.
In a SQL injection attack, a perpetrator inserts authorized database statements into a vulnerable SQL data channel, such as a stored procedure.
Small reference data value sets in the logical data model can be implemented in a physical model in three common ways:
The difference between warehouses and operational systems do not include the following element:
Change Data Capture is a method of reducing bandwidth by filtering to include only data that has been changed within a defined timeframe.
Assessment criteria are broken into levels, and most capability maturity models use five (5) levels. This is important since:
Which of the following are must-do for any successful Data Governance programme?
Implementing a BI portfolio is about identifying the right tools for the right user communities within or across business units.
Effectiveness metrics for a data governance programme includes: achievement of goals and objectives; extend stewards are using the relevant tools; effectiveness of communication; and effectiveness of education.
Validity, as a dimension of data quality, refers to whether data values are consistent with a defined domain of values.
When constructing models and diagrams during formalisation of data architecture there are certain characteristics that minimise distractions and maximize useful information. Characteristics include:
A goal of reference and master data management is for data to ensure shared data is:
How can the Data Governance process best support Regulatory reporting requirements?
Defining quality content requires understanding the context of its production and use, including:
The ISO 11179 Metadata registry, an international standard for representing Metadata in an organization, contains several sections related to data standards, including naming attributes and writing definitions.
Data security includes the planning, development and execution of security policies and procedures to provide authentication, authorisation, access and auditing of data and information assets.
Data Integration and Interoperability (DII) describes processes related to the movement and consolidation of data within and between data stores, applications and organizations.
Reference data management entails the preventative maintenance of undefined domain values, definitions and the relationship within and across domain values.
In Data Modelling, the generalization of the concept of person and organization into a party enables:
Primary deliverables of the Data Warehouse and Business Intelligence context diagram include:
The Zachman Framweork’s communication interrogative columns provides guidance on defining enterprise architecture. Please select answer(s) that is(are) coupled correctly:
Operationality and interoperability depends on the data quality. In order to measure the efficiency of a repository the data quality needs to be:
Achieving security risk reduction in an organisation begins with developing what?
Preparation and pre-processing of historical data needed in a predictive model may be performed in nightly batch processes or in near real-time.
Data security issues, breaches and unwarranted restrictions on employee access to data cannot directly impact operational success.
An advantage of a centralized repository include: Quick metadata retrieval, since the repository and the query reside together.
When starting a Data Governance initiative it is important to understand what the Business cannot achieve due to data issues because:
Examples of concepts that can be standardized within the data quality knowledge area include:
A ‘Golden Record’ means that it is always a 100% complete and accurate representation of all entities within the organization.
Measuring the effects of change management on in five key areas including: Awareness of the need to change; Desire to participate and support the change; Knowledge about how to change; Ability to implement new skills and behaviors; and Reinforcement to keep the change in place.
Common understanding of the core business concepts and terminology is the objective of which deliverable?
Through similarity analysis, slight variation in data can be recognized and data values can be consolidated. Two basic approaches, which can be used together, are:
Data quality management is a key capability of a data management practice and organization.
The scope and focus of any data governance program depend on organizational needs, but most programs include:
Which Data Architecture artefact contains the names of key business entities, their
relationships, critical guiding business rules and critical attributes?
Value is the difference between the cost of a thing and the benefit derived from that thing.
Reference and Master data definition: Managing shared data to meet organizational goals, reduce risks associated with data redundancy, ensure higher quality, and reduce the costs of data integration.
Subtype absorption: The subtype entity attributes are included as nullable columns into a table representing the supertype entity
Misleading visualisations could be an example where a base level of truthfulness and transparency are not adhered to.
The impact of the changes from new volatile data must be isolated from the bulk of the historical, non-volatile DW data. There are three main approaches, including:
Location Master Data includes business party addresses and business party location, as well as facility addresses for locations owned by organizations.
Differentiating between data and information. Please select the correct answers based on the sentence below: Here is a marketing report for the last month [1]. It is based on data from our data warehouse[2]. Next month these results [3] will be used to generate our month-over-month performance measure [4].
DBAs and database architects combine their knowledge of available tools with the business requirements in order to suggest the best possible application of technology to meet organizational goals.
Improving an organization’s ethical behaviour requires an informal Organizational Change Management (OCM) process.
A content strategy should end with an inventory of current state and a gap assessment.
An implemented warehouse and its customer-facing BI tools is a technology product.
Over a decade an organisation has rationalised implementation of party concepts
from 48 systems to 3. This is a result of good:
To build models, data modellers heavily rely on previous analysis and modelling work.
Data parsing is the process of analysing data using pre-determined rules to define its content or value.
An advantage of a centralized repository include: High availability since it is independent of the source systems.
Data mining is a sub-field of supervised learning where users attempt to model data elements and predict future outcomes through the evaluation of probability estimates.
Looking at the DMBoK definition of Data Governance, and other industry definitions, what are some of the common key elements of Data Governance?
In the context of big data the Three V’s refer to: Volume, Velocity and Validity
The categories of the Data Model Scorecard with the highest weightings include:
A security mechanism that searches for customer bank account details in outgoing
emails is achieving the goal of:
What techniques should be used and taught to produce the required ethical data handling deliverables?
Modeling Bid data is a non-technical challenge but critical if an organization that want to describe and govern its data.
Data science involves the iterative inclusion of data sources into models that develop insights. Dat science depends on:
Referential Integrity (RI) is often used to update tables without human intervention. Would this be a good idea for reference tables?
Access to data for Multidimensional databases use a variant of SQL called MDX or Multidimensional expression.
A communication plan includes an engagement model for stakeholders, the type of information to be shared, and the schedule for sharing information.
The advantage of a decentralized data governance model over a centralized model is:
The business glossary application is structured to meet the functional requirements of the three core audiences:
Data governance requires control mechanisms and procedures for, but not limited to, facilitating subjective discussions where managers’ viewpoints are heard.
When trying to integrate a large number of systems, the integration complexities can
be reduced by:
A catastrophic system failure due to processing attachments that are too large may
be solved by:
DAMA International’s Certified Data Management Professional (CDMP) certification required that data management professionals subscribe to a formal code of ethics, including an obligation to handle data ethically for the sake of society beyond the organization that employs them.
A dimensional physical data model is usually a star schema, meaning there is one structure for each dimension.
A deliverable in the data architecture context diagram includes an implementation roadmap.
A goal of a Reference and Master Data Management program include enabling master and reference data to be shared across enterprise functions and applications.
Technical Metadata provides data about the technical data, the systems that store data, and the processes that move between systems.
Effective data management involves a set of complex, interrelated processes that enable an organisation to use its data to achieve strategic goals.
What are the three characteristics of effective Data Governance communication?
A ‘Content Distribution Network’ supporting a multi-national website is likely to use:
The Data Governance Council (DGC) manages data governance initiatives, issues, and escalations.
Please select the correct name for the PDM abbreviation when referring to modelling.
Data modelling tools are software that automate many of the tasks the data modeller performs.
Project that use personal data should have a disciplined approach to the use of that data. They should account for:
There are three recovery types that provide guidelines for how quickly recovery takes place and what it focuses on.
How does the DMBOK refer to an organization that values data as an asset and manages data through all phases of its lifecycle?
Information gaps represent enterprise liabilities with potentially profound impacts on operational effectiveness and profitability.
What position is responsible for the quality and use of their organization’s data assets?
Issue management is the process for identifying, quantifying, prioritizing, and resolving Data Governance issues. Which of the following are areas where that issues might arise:
The most informal enterprise data model is the most detailed data architecture design document.
A e-discovery readiness assessment should examine and identify opportunities for the commercial response program.
No recorded negative ethical outcomes does not mean that the organization is processing data ethically. Legislation cannot keep up with the evolution of the data environment so how do we stay compliant?
The database administrator (DBA) is the most established and the most widely adopted data professional role.
Different levels of policy are required to govern behavior to enterprise security. For example:
The goals of implementing best practices around document and content management include:
An enterprise's organisation chart has multiple levels, each with a single reporting
line. This is an example of a:
Please select correct term for the following sentence: An organization shall assign a senior executive to appropriate individuals, adopt policies and processes to guide staff and ensure program audibility.
Operational Metadata describes details of the processing and accessing of data. Which one is not an example:
The accuracy dimension of data quality refers to the degree that data correctly respresents ‘real-life’ entities.
A change management program supporting formal data governance should focus communication on:
Obtaining buy-in from all stakeholders
A synonym for transformation in ETL is mapping. Mapping is the process of developing the lookup matrix from source to target structures, but not the result of the process.
What are the three characteristics of effective Data Governance communication?