Normalization FORM
Normalization FORM
Normalization FORM
● Increased storage space which could lead to high cost and over-high resources
consumption.
● Data inconsistency
● Insertion Anomaly
● Deletion Anomaly
● Updation Anomaly
Data Integrity refers to the accuracy, completeness and consistency of data through its life-
cycle.
By accuracy, it means data must reflect reality. For example a customer’s address in the
database table must be up-to-date and correct.
By consistency, it means the data should be homogeneous across different systems and
formats. For example, the customer's name should be the same, no spelling changes
throughout. Example of a table with data – redundancy:
STUDENT-TABLE
student_id student_name department hod phone
Insertion Anomaly
When certain data or attributes cannot be inserted into the database table without the
presence of other data of which existence of both are independent of each other , this situation
is called Insertion Anomaly.
For example, in the student-table above, let's assume to insert a new department (say
Mathematics), the insertion would not be possible without inserting the student name. But
both the student and department values are not dependent on each other for existence. So the
inability to add data to the database table due to the absence of other data whose existence is
not dependent on the former data, is called Insertion Anomaly.
To correct this, another table must be created for departments so that we can add more
departments without depending on the addition of student records.
Deletion Anomaly
If deletion of data from a database table causes deletion of other information(or data ) which
exist independently, such a deletion is said to be an anomaly. For example if there was a need
to delete all the students’ records from the table, the deletion of all the departments would go
together. Since the existence of departments is independent of the student, a separate table
must be created for departments which can give room to deletion of students’ records without
affecting the records of the departments, and vice-versa.
Updation Anomaly
If a change in any single piece of data in a table calls for changes at all the other copies of the
same data which had already come under insertion anomaly, the update is said to be anomaly.
Like if the HOD name of a department is to be updated, the same update must happen at all the
rows where the copies exist. Instead , a separate table for the department will correct this.
Normalization of database table works through a series of stages called Normal Forms:
They are:
1. First Normal Form (1NF)
2. Second Normal Form (2NF)
3. Third Normal Form (3NF)
4. Boyce – Codd Normal Form (BNF)
5. Fourth Normal Form (4NF)
6. Fifth Normal Form (5NF)
Fifth Normal Form (5NF), also known as Projection-Join Normal Form (PJ/NF), is
the highest level of database normalization aimed at eliminating all potential
anomalies and redundancies arising from complex data relationships. While
achieving 5NF is not always necessary or applicable, it represents the purest
form of data organization in relational databases.