Cloudera Navigator: Integrated Data Management and Governance For Hadoop

Download as pdf or txt
Download as pdf or txt
You are on page 1of 2

DATA SHEET

Manage Data, Achieve Compliance,


Cloudera Navigator
& Accelerate Productivity Integrated Data Management and Governance for Hadoop
Big data represents a major opportunity for enterprises to become information-driven in their
Discover and Explore strategies, operations, and customer engagements - with the ability to store and analyze all
• Easily track, classify, and locate data types of data at any scale and open it up to more users and analytic tools. However, this
within the unified metadata repository opportunity can also lead to data management challenges, with armies of business users
wanting self-service access to discovery data; administrators needing to know how data
Optimize
is being used to optimize analytic performance; and security teams requiring visibility into
• Get the best results from Hadoop
how data is accessed and used to meet compliance regulations. An enterprise big data
with instant workload insights and
platform must be able to address the data management and compliance needs across
optimization guidance
the organization, without limiting the flexibility and benefits of big data itself.
Audit Cloudera Navigator is the only integrated data management and governance solution for
• Maintain a full audit history across big data and Apache Hadoop. Built into the core of Cloudera Enterprise, Cloudera Navigator
all of Hadoop in a single place is trusted by hundreds of users to get unprecedented visibility into their data and provides
the auditing and data protection necessary to comply with even the most stringent regulatory
Trace
restrictions. Powered by the only comprehensive metadata foundation for Hadoop, it
• Visualize the upstream and downstream
automatically brings together all the technical metadata from each platform component
lineage of data to verify reliability
and the user-defined business metadata across the organization into a single, searchable
Protect repository. From this, Cloudera Navigator provides four fundamental components for
• Keep sensitive data secure with effective data management:
performance-optimized encryption
and key management
Self-Service Data Discovery & Analytics
Business users can effortlessly find and trust the data that matters most
Manage Lifecycle • Discover and explore data across the only unified metadata repository
• Define and automate complex data with intuitive full-text search and SQL access
lifecycle activities with integrated • Locate data sets based on business context and classification, combined with
metadata policies automatic technical context - making it easy to find similar, relevant data

Active Data Optimization


Database Administrators can get instant insights to optimize the most critical workloads
“ We are the custodians of financial • Instantly analyzes existing SQL logs for comprehensive visibility into which queries
information for our customers when are the most critical, what data is accessed the most often, and how is that data used.
they’re trusting us to transfer money to • Improve performance and efficiency of Hadoop with intelligent optimization guidance
their near and dear ones. We need to • Reduce workload development time with compatibility identification for fast success
make sure that we are compliant and with Hadoop
have proper monitoring and auditing,
Compliance-Ready Governance & Protection
and that’s where [Cloudera] Navigator
Security teams can track, understand, and protect access to sensitive data
comes in.”
• Automatically maintain a full audit history and track every access attempt,
Western Union right down to the user ID, IP address, and full query text
• Track how data is used and changing with column-level, visual lineage to quickly
identify the origin of a data set and its impact on downstream artifact
• Protect all data with high-performance encryption and key management through
Navigator Encrypt and Navigator Key Trustee
(for more information, download the “Encryption and Key Management Datasheet”)
• Integrate with the leading enterprise metadata, lineage, and SIEM applications,
out-of-the-box
DATA SHEET

Hadoop-Scale Data Lifecycle Automation


Data stewards can efficiently manage and enforce critical lifecycle policies, risk-free
• Automate crucial stewardship and curation activities – such as metadata classification, data archiving and retention,
or even invoking partner products for additional data preparation and transformation – with the flexible policy engine
• Ensure business continuity with the only built-in backup and disaster recovery
• Manage the data lifecycle beyond just Hadoop with seamless partner tool integration

Key Features of Cloudera Navigator


Feature Function Benefit
Discovery & Exploration
Unified Technical Metadata Bring in all metadata from HCatalog, HDFS, Sqoop, etc. Stay in control of growing volumes of data for consistent, precise retrieval
into a single, searchable interface
Comprehensive Business Augment files, tables, and individual columns with custom Unlocking the value of data by classifying it in meaningful ways for various
Metadata business context, tags, and key/value pairs stakeholders, including business analysts, data scientists, and data stewards
Metadata Search Search across all business and technical metadata to find Find diverse data through a common interface
assets (ie data older than 7 years, tables created by certain
users, files that contain sensitive data)
Audit & Access Control
Audit Configuration Automatically configure audit tracking for HDFS, Impala, In a few simple clicks, ensure all necessary audit data is captured
Hive, HBase, and Sentry; set thresholds
Audit Dashboard Visualize and summarize data access via a simple, Single place to quickly identify outliers and security breaches
queryable interface
Third-Party Integration Integrate audit information for use in global Security Integrate data from Hadoop into infrastructure-wide reporting and processes
Information and Event Management (SIEM) systems
Lineage
Intuitive Visualization View upstream and downstream lineage in an easy-to-follow Quickly identify the origin of a data set and the impact on downstream operations
graph at the file, table, and individual column-level
Encrypt & Key Trustee

Transparent Encryption Protect all data without the performance impact Open up even the most sensitive data to analytics, risk-free

Enterprise Key Manage encryption keys and security assets through Integrate with existing HSMs and key management policies for compliance
Management a separate, centralized solution
Optimizer

Active Data Optimization Analyze workloads across all data management systems for Put the right workload in the right place at the right time for the best results
peak performance optimizations and workload rationalization with Hadoop
Metadata Policies

Metadata Policy Trigger actions (such as autoclassification of metadata) for Easily set, monitor, and enforce data management policies and integrate with
Management specific datasets based on arrival or scheduled intervals third-party tools

View upstream and downstream lineage Search metadata for specific items of interest

cloudera.com © 2015 Cloudera, Inc. All rights reserved. Cloudera and the Cloudera logo are trademarks or registered trademarks of Cloudera Inc. in the USA
and other countries. All other trademarks are the property of their respective companies. Information is subject to change without notice.
1-888-789-1488 or 1-650-362-0488
Cloudera, Inc., 1001 Page Mill Road, Palo Alto, CA 94304, USA cloudera-datasheet-navigator-109

You might also like