Data MGMT Foundations
Data MGMT Foundations
Data MGMT Foundations
Tech for
Knowledge
Knowledge
Empowering
Data Management
Foundations
Certification Exam
Alexandre Fagundes
[email protected]
Cloud Architect, Oracle Latin America
Why become Oracle certified?
Make you a better technologist
Preparing for an Oracle Certification Exam will assess & challenge your ability to think & perform.
Earning a highly recognized and valued Oracle Certification will improve your job prospects and job stability!
Accelerate your career growth, became a better technologist \o/
2
Oracle Cloud Infrastructure Data Management Platform
OCI Certifications
Basics
Extensive experience
designing, implementing,
and operating advanced,
large-scale solutions Developer DevOps
using Oracle Cloud
Infrastructure.
Architect Operations
Associate
Experience in designing and
implementing solutions using
Oracle Cloud Infrastructure
Architect
Foundations
4
Exam Topics
5
Cloud Basics
Key terminology
High Availability: A system type with redundant components that provides consistent and
uninterrupted service, even in the event of hardware or software failures
Scalability: The ability to add additional nodes to database environments thereby maintaining
and achieving improved performance
Rolling upgrades: The process where software is developed continuously and the updates are
released in between the major releases. These updates are compatible with earlier versions where
complete re-installation of the software is not required.
Failover: A method of protecting computer systems from failure, in which standby equipment
automatically takes over when the main system fails
Recovery Point Objective (RPO): Tolerance for data loss (sec’s, hours, days); determines
frequency of backups and replication approaches
Recovery Time Objective (RTO): The shorter the Recovery Time Objective (RTO) the quicker you
get back to business
Data Management
Enterprise Data OCI Data Oracle or
Applications data Growth and
& Applications Integration Any Business
Resilience
Applications
☑️ Any
SaaS
Autonomous
Data Warehouse MySQL HeatWave OpenSearch PostgreSQL Redis
GoldenGate Operational
GoldenGate
Social (RT Data Efficiency
IoT Stream
Media Replication) Big Data and Data Lake Analytics
Any
OCI Applications
Data Flow Big Data Data Lake Query Service* Lower
Costs
OCI Data Access, Security, and Governance
Digital Assets Streaming
Israel 2
Toronto
Israel 2 Mumbai
Zurich Dubai
Newport Saudi Arabia 2 Chuncheon
Chicago Hyderabad
Serbia
NEW Madrid
Jerusalem
Milan
Saudi Arabia 3
San Jose Paris Marseille
Abu Dhabi
Ashburn Kenya
Vinhedo NEW Seoul
Phoenix
Jeddah
Singapore 2 Tokyo
Singapore
Bogotá
Osaka
São Paulo
Johannesburg
Queretaro
Santiago
Valparaíso NEW
Monterrey
Melbourne
Sydney
Government
Microsoft Interconnect Azure
Service Models
Traditional IT Infrastructure-as- Platform-as-a- Software-as-a-
a-service (IaaS) service (PaaS) service (SaaS)
You Manage
Applications Applications Applications Applications
You Manage
Runtime Runtime Runtime Runtime
Delivered as-a-service
Middleware Middleware Middleware Middleware
You Manage
Delivered as-a-service
Operating System Operating System Operating System Operating System
Delivered as-a-service
Virtualization Virtualization Virtualization Virtualization
10
Database
Convergence
Organizations typically use many database silos
Types of databases
Data Machine
Lines of Business OLTP Warehouse Document Learning Graph Spatial Blockchain
Finance
Supply Chain
Human Resources
Complete data type and Automation of the Mission-critical apps Important data is
workload support complete dev lifecycle are easy to run easy to secure
• Developers use one database when • Point-and-click cloud native • Automatic performance • Transparent encryption protects data
working with multiple data types development using native tools optimizations on Exadata at rest, in motion, and in backups
reduce tuning requirements
• Multiple application databases run • AutoML helps automate • Data masking and redaction remove
and are managed together using machine learning model creation • Transparent query parallelism sensitive information
Oracle Multitenant simplify performance tuning
• Autoscaling eliminates the need • Separation of duties controls
• Native JSON with high performance for developers to implement • Consistent governance of self- administrative access to data,
and scale simplify modern apps scaling inside applications service databases with fleet software, and infrastructure
management
• Multiple types of analytics and • Autoindexing eliminates the • Label security and virtual private
machine learning eliminate stand- need for DBAs to tune database • Databases run unchanged for databases limit users access to data
alone services indexes for greater performance Oracle and 3rd-party apps
• Blockchain tables enable tamper-
resistant ledgers for centralized apps
19c
• Automatic Indexes
• SQL Quarantine
18c
• Real-Time Statistics
• Autonomous Health Framework
• Automatic Diagnostic Framework
• Automatic Refresh of Clones • Automatic Columnar Flash
• Automatic IM population
•
•
Automatic Memory Management
Automatic Segment Space Mgmt
12c • Automatic Application Continuity
Fastest Analytics
• Automatically offloads SQL processing to parallel smart storage servers
Hot PMEM
• Smart flash cache and Storage Index accelerates database IO automatically
Warm FLASH
• Unique columnarization converts data to fast in-memory formats
Best Consolidation
Cold
• Unique prioritization of latency sensitive and important workloads
• Isolation of multiple tenants and workloads
Oracle Cloud
Self-Repairing
Self-Driving
Self-Securing
Oracle Autonomous Database – serverless or dedicated deployments
Autonomous JSON, Transaction Processing or Autonomous Data Warehouse
Create in Exadata Monitor threats, apply online Perform all OS and SYSDBA
Cloud Infrastructure, security updates, encrypt all operations, adjust settings, fix all
RAC scale-out database, data, secure connections software online, diagnose errors
Active Data Guard standby
The complete database lifecycle is automated
1 2 3 4 5
Oracle
Autonomous
Database
Autonomous Autonomous
Autonomous JSON Database
Data Warehouse (ATP) Transaction Processing (ATP)
• Mobile applications
• Data warehouse, data mart • Transactions, batch, reporting, • User profile management
• Data lake, machine learning IoT • Customer 360
• Application development, • Catalog and content
machine learning management
Deploy in minutes
Data Modeling Low-code APEX Notebooks Data Analysis
{JSON}
Java ODP Java ODP R PHP Ruby Perl Julia Erlang Rust Go Node.js Python Python
DBD:: Oracle.jl oranif rust-oracle godror node- cx_Oracle python-
ruby-oci8
Thin
.NET JDBC-OCI
.NET ROracle OCI8
PDO_OCI Oracle oracledb oracledb
Managed Unmanaged
ODPI-C
Also used by custom apps
Oracle Proprietary Drivers Third Party Open Source Drivers Oracle Open Source Drivers
Oracle maintains key driver APIs and works closely with driver communities
Python
• Default Thin mode: no Oracle Client libraries
• Runtime choice to use Thick Mode
• Support of new platforms: Alpine, Apple M1/M2, IoT
python-oracledb
driver
Users Autonomous • Python 3.6 – 3.11
Database
• Dual Apache 2 or UPL open source license
Python Process
• Binary module for performance
• Python Database API V2 support
Supports automatic data preparation, partitioned model ensembles, integrated text mining
Autonomous
Database
Liquibase + SQLcl Terraform/Ansible support provides industry standard EBR simplifies online
simple schema database-as-code for you automated deployments updates of
versioning, enables you
to execute commands to application schema,
generate a changelog for allows gradual transition
a single object or a full from old applications to
schema editions without
heavyweight cloning
processes
OCI DevOps with
Jenkins or Pipeline APIs for easy code
Process allow for management with
integration of the GitHub, AWS
database to your CodeCommit and
automation processes Azure repos
What is a graph?
• Data model representing entities as vertices and
relationships as edges C B
• Optionally including attributes
What are typical graphs? A D
• Social Networks
• LinkedIn, Facebook, Twitter, ... F
• Physical networks, Supplier networks,...
• Dependency Graphs
E
• Part hierarchies, data lineage, org charts, ...
• Knowledge Graphs
• Apple SIRI, Google Knowledge Graph, ...
Use your existing skills Compatible with existing apps Migrate with zero-downtime
{ JSON }
PL/SQL
No need to learn new database Supports existing workloads and Migrate your database with no
skills or languages because apps from on-premise deployments downtime using Zero Downtime
Autonomous Database fully (Oracle Apps, APEX apps, custom Database Migration Service
supports all of SQL and PL/SQL. apps, data marts, EDWs, etc)
DIY
Zero Downtime
vs.
Migration (ZDM)
Cloud Lift
Services
Database Migration
Service (DMS)
1
Simplify managing applications
Take advantage of Autonomous Database full
managed operations
2
Lower costs and risks
Only pay for the resources you use with auto-
scaling. Backup, restore, and cloning of Oracle
E-Business suite environments
3
Fast-track getting started
Use certified procedures for migrating to
Autonomous Database
Compute 3-nodes
Storage RAC
HeatWave
OCI Database with PostgreSQL
Fully Managed:
• Automated deployment and patching
Highly Available:
• SLA is 99.99%
• Data distributed across multiple redundant Availability
AD1 AD2 AD3 Domains or across multiple Fault Domains (single AD)
• Shared storage architecture with Zero RPO and Low RTO
Primary Read Repl. 1 Read Repl. 2 • Dynamic storage scaling (zero downtime)
High Performance
• 3x higher perf than the open-source version
Scalable:
Database Optimized Storage • Scales linearly with additional read replicas - utilizing
shared storage
Regional Block Volume Secure:
Availability Availability Availability • End-to-end encryption
Domain Domain Domain • Automatic security patching
Best-in-class TCO:
• 60% less expensive than Amazon Aurora PostgreSQL.
• Pay-as-you-grow shared(across nodes) storage
55 Copyright © 2024, Oracle and/or its affiliates. All rights reserved.
Open Source Databases - NoSQL
Data Protection
Gold
24
HA Features,
Silver Reference Replication Configurations & Flashback RMAN + ZDLRA
Architectures Operational
Practices Active Replication
Bronze
Production Site Replicated Site
Active Data Guard GoldenGate
Deployment Choices
Scale Out
Platinum
Gold Mission Critical
Business Critical
Silver Gold
Gold ++
Prod/Departmental Silver + GoldenGate
Bronze Silver + Zero Downtime
Bronze + Edition Based
Golden Redefinition
Gate Cloud Svc.
Maintenance / Migration
Dev, Test, Prod Bronze + Zero Data Loss with
DB Replication
Database HA with DR to
HA andthe
DRCloud Use Case
Zero Downtime Active Data Guard
Single Instance DB High
RAC Availability
RAC
Zero Data Loss Backup
Restartable Application Continuity
to the Cloud
Backup
Backup / and Use Case
Recovery
Restore
All tiers exist with on-premise and cloud. However, platinum currently must be configured manually while bronze to gold are covered with cloud tool automation
Strengthen
authentication
Encrypt Control
data access
Minimize Monitor
attack activity
surface
Assess
configuration
and detect
drift
Copyright © 2024, Oracle and/or its affiliates
Securing the Oracle Database
Strengthen
authentication
Encrypt Control
data access
Minimize
Monitor
attack
activity
surface
Assess
configuration
and detect
drift
Data-Driven Security
Virtual Private Database
Data Redaction
Label Security
Users
Real Application Security
Events
Database Vault
Applications
Alerts
Reports
Test Dev
Policies
Audit Data & Transparent Key Vault Data Masking
Event Logs Data and Subsetting
Audit Vault
Encryption Data Safe
Data Safe
OCI Data Lake is a fully managed service which provides centralized storage for structured and
unstructured data with unified access control and centralized metadata
Autonomous Data
Database
Warehouse
Oracle
MySQL HeatWave
GoldenGate
Accurate
Data Lake (Storage Zone for All Data) Forecasts and
Applications Predictions
Data Sharing
Share data securely across the Enterprise.
OCI Data
Video Integration
Service
Data Catalog Improved Real-
time Visibility
Unified security and governance.
IoT
(Apache Kafka) Scalable raw storage for all data.
OCI Data
Lake
• OCI Data Lake enable Data Engineers to store and work with data in a format of their choice
• OCI Data Lake supports CSV, JSON, Parquet, AVRO, ORC, DELTA and Text files
• DELTA file format provides ACID transactions, time travel support and is best suited for analytics
workloads
Data Sources Data Ingestion Data Storage
Amazon
RDS
OCIExternal
Object Storage
SDK/APIs Mounts/Tables
OCI-Azure Interconnect
Autonomous
Database <2ms RTT latency between clouds AZURE
Exadata
Database FastConnect / VPN Customer
Service Data Center
Customer-dedicated
Exadata Cloud Infrastructure X9M
Disaster recovery
FastConnect / VPN
Code
Data Lake
Notifications Repositories
File Formats
Data integration
sources
Secure object
Direct store access
Query Access Data
Catalogs
OCI
Data Amazon
Catalog Glue
Copyright © 2024, Oracle and/or its affiliates
ODSA creates more options for customers to harness cloud innovation
Build with the best of OCI and Use fully managed Oracle Run exclusive OCI database
Azure services Databases with Azure services with Azure
On-Prem
Customer- Exadata
managed Database App
Oracle DB Machine
Build with the best of OCI and Use fully managed Oracle
Azure services Databases with Azure
Customer-managed
MySQL database for transactions +
analytics database + ML tools + ETL
processes between data stores
Certification:
OCI Data Management Foundations 2023
→ Oracle University:
→ Become an OCI Data Management Foundations Associate 2023
Alexandre Fagundes
[email protected]
Cloud Architect, Oracle Latin America