02 - Data Preparation and Cleaning
Data Analytics for Accounting
OBJECTIVES
[Diagram: Customer Data Table, Sales Data Table, Product Data Table, Payment Data Table]
Data and Relationships in a Relational Database
Supplier Table
Supplier ID   Supplier Name   Address
1             PT ABC          Jl. Maju Jaya 5
2             CV XYZ          Jl. Merdeka no 10

Purchase Order Table
PO No   Date        Amount      Supplier ID
1787    11/1/2020   5.000.000   1
1788    12/1/2020   725.000     2
1789    15/1/2020   3.250.000   1

• Supplier ID is primary key in Supplier Table
• Supplier ID acts as foreign key in Purchase Order Table to connect Supplier Table and Purchase Order Table
• Supplier Name, Address are descriptive columns in Supplier Table
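The primary key / foreign key relationship can be tried out with a small sketch. It uses Python's built-in sqlite3 module and the rows from the slide. The table and column names (supplier, purchase_order, supplier_id, and so on) are assumptions chosen for illustration, amounts are stored as whole numbers without the thousands separators, and dates are kept as plain text exactly as written.

```python
import sqlite3

# In-memory database; nothing is written to disk.
conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")  # SQLite only enforces foreign keys when asked

# Supplier ID is the primary key of the supplier table.
conn.execute("""
    CREATE TABLE supplier (
        supplier_id   INTEGER PRIMARY KEY,
        supplier_name TEXT,
        address       TEXT
    )""")

# Supplier ID appears again in purchase_order as a foreign key.
conn.execute("""
    CREATE TABLE purchase_order (
        po_no       INTEGER PRIMARY KEY,
        po_date     TEXT,
        amount      INTEGER,
        supplier_id INTEGER REFERENCES supplier(supplier_id)
    )""")

conn.executemany("INSERT INTO supplier VALUES (?, ?, ?)", [
    (1, "PT ABC", "Jl. Maju Jaya 5"),
    (2, "CV XYZ", "Jl. Merdeka no 10"),
])
conn.executemany("INSERT INTO purchase_order VALUES (?, ?, ?, ?)", [
    (1787, "11/1/2020", 5000000, 1),
    (1788, "12/1/2020", 725000, 2),
    (1789, "15/1/2020", 3250000, 1),
])

# The foreign key lets each purchase order pick up its supplier's name.
rows = conn.execute("""
    SELECT po.po_no, s.supplier_name, po.amount
    FROM purchase_order AS po
    JOIN supplier AS s ON s.supplier_id = po.supplier_id
    ORDER BY po.po_no
""").fetchall()

# A purchase order that points at a non-existent supplier is rejected.
try:
    conn.execute("INSERT INTO purchase_order VALUES (1790, '16/1/2020', 100000, 3)")
    fk_rejected = False
except sqlite3.IntegrityError:
    fk_rejected = True
```

Here rows holds one tuple per purchase order with the supplier name filled in from the Supplier Table, and the last insert fails because supplier 3 does not exist.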
Data Dictionary
• Once the data have been validated, they will likely need to be cleaned. Some of the more common ways to clean data are:
1. Remove headings or subtotals
2. Clean leading zeros and nonprintable characters
3. Format negative numbers
4. Correct inconsistencies across data
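The four cleaning steps above can be sketched in plain Python. This is only an illustration under stated assumptions: the raw rows are made-up sample data (not from the slides), the helper names are hypothetical, amounts are assumed to use a dot as the thousands separator, and negative amounts are assumed to appear in parentheses or with a leading or trailing minus sign.

```python
import re

def strip_nonprintable(text):
    """Drop characters Python treats as nonprintable (tabs, control codes) and trim spaces."""
    return "".join(ch for ch in text if ch.isprintable()).strip()

def is_heading_or_subtotal(row, header):
    """Step 1: detect repeated heading rows and subtotal/total rows."""
    first = strip_nonprintable(row[0]).lower()
    return row == header or first.startswith(("subtotal", "total"))

def clean_id(text):
    """Step 2: remove nonprintable characters and leading zeros."""
    return strip_nonprintable(text).lstrip("0") or "0"

def parse_amount(text):
    """Step 3: turn '(725.000)', '725.000-' or '-725.000' into a negative integer."""
    cleaned = strip_nonprintable(text)
    negative = (
        (cleaned.startswith("(") and cleaned.endswith(")"))
        or cleaned.startswith("-")
        or cleaned.endswith("-")
    )
    value = int(re.sub(r"[^0-9]", "", cleaned))  # keep digits only
    return -value if negative else value

def normalize_name(text):
    """Step 4: make spellings consistent (case, dots, extra spaces)."""
    cleaned = strip_nonprintable(text).replace(".", " ")
    return " ".join(cleaned.upper().split())

# Hypothetical raw export with repeated headings, a subtotal and messy values.
header = ["PO No", "Supplier", "Amount"]
raw_rows = [
    ["PO No", "Supplier", "Amount"],
    ["0001787", "PT. ABC", "5.000.000"],
    ["0001788\t", "cv xyz ", "(725.000)"],
    ["Subtotal", "", "4.275.000"],
    ["PO No", "Supplier", "Amount"],
    ["0001789", "pt abc", "3.250.000"],
]

cleaned_rows = [
    [clean_id(r[0]), normalize_name(r[1]), parse_amount(r[2])]
    for r in raw_rows
    if not is_heading_or_subtotal(r, header)
]
```

Stripping leading zeros is only safe when they carry no meaning; for codes where the zeros are part of the identifier, that step should be skipped.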
Loading