Abhitha Bhasuru

Followers

Following

Public Views

黃仙惠北科大-

National Taipei University of Technology

anisha angel

FLSHASE/INSIDE Psychobiology, Neurophysiology University of Luxembourg

Sulastri Riris Manurung

Universitas Sriwijaya

UNIVERSIDAD NACIONAL DE CHIMBORAZO

krishna priya

Rajeev Gandhi Memorial College of Engineering & Technology, Nandyal

Thắng Uông

K HAR

Interests

Uploads

Papers by Abhitha Bhasuru

Data preprocessing

Today's real-world databases are highly susceptible to noisy, missing, and inconsistent data due ... more Today's real-world databases are highly susceptible to noisy, missing, and inconsistent data due to their typically huge size (often several gigabytes or more) and their likely origin from multiple, heterogenous sources. Low-quality data will lead to low-quality mining results. "How can the data be preprocessed in order to help improve the quality of the data and, consequently, of the mining results? How can the data be preprocessed so as to improve the efficiency and ease of the mining process?" There are several data preprocessing techniques. Data cleaning can be applied to remove noise and correct inconsistencies in data. Data integration merges data from multiple sources into a coherent data store such as a data warehouse. Data reduction can reduce data size by, for instance, aggregating, eliminating redundant features, or clustering. Data transformations (e.g., normalization) may be applied, where data are scaled to fall within a smaller range like 0.0 to 1.0. This can improve the accuracy and efficiency of mining algorithms involving distance measurements. These techniques are not mutually exclusive; they may work together. For example, data cleaning can involve transformations to correct wrong data, such as by transforming all entries for a date field to a common format.

Download

Data preprocessing

Download

Abhitha Bhasuru

Uploads

Papers by Abhitha Bhasuru

Log In