Data Preparation

Zahraa S. Abdallah, Lan Du, Geoffrey I Webb

Research output: Chapter in Book/Report/Conference proceeding > Entry for encyclopedia/dictionary


Before data can be analyzed, they must be organized into an appropriate form. Data preparation is the process of manipulating and organizing data prior to analysis.
Data preparation is typically an iterative process of manipulating raw data, which is often unstructured and messy, into a more structured and useful form that is ready for further analysis. The whole preparation process consists of a series of major activities (or tasks), including data profiling, cleansing, integration, and transformation.
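As a rough illustration of the four activities named above, the following minimal sketch runs profiling, cleansing, integration, and transformation over a tiny in-memory dataset. The records, field names, and helper functions are all hypothetical and are not taken from the encyclopedia entry; real pipelines would usually rely on a dedicated data library and far richer rules.

```python
from collections import Counter

# Hypothetical raw records from two made-up sources.
customers = [
    {"id": "1", "name": " Alice ", "age": "34"},
    {"id": "2", "name": "bob", "age": ""},
    {"id": "2", "name": "bob", "age": ""},   # exact duplicate
    {"id": "3", "name": "Carol", "age": "51"},
]
orders = [
    {"customer_id": "1", "total": "20.5"},
    {"customer_id": "3", "total": "7.0"},
]

def profile(rows):
    """Profiling: count empty values per field."""
    missing = Counter()
    for row in rows:
        for key, value in row.items():
            if value.strip() == "":
                missing[key] += 1
    return dict(missing)

def cleanse(rows):
    """Cleansing: trim whitespace, normalize name case,
    and drop duplicates and rows with a missing age."""
    seen, cleaned = set(), []
    for row in rows:
        fixed = {k: v.strip() for k, v in row.items()}
        fixed["name"] = fixed["name"].title()
        key = tuple(sorted(fixed.items()))
        if fixed["age"] and key not in seen:
            seen.add(key)
            cleaned.append(fixed)
    return cleaned

def integrate(customer_rows, order_rows):
    """Integration: inner join of customers and orders on the customer id."""
    totals = {o["customer_id"]: o["total"] for o in order_rows}
    return [{**c, "total": totals[c["id"]]}
            for c in customer_rows if c["id"] in totals]

def transform(rows):
    """Transformation: cast string fields to numeric types."""
    return [{"id": int(r["id"]), "name": r["name"],
             "age": int(r["age"]), "total": float(r["total"])}
            for r in rows]

report = profile(customers)
prepared = transform(integrate(cleanse(customers), orders))
```

In practice the steps are iterative rather than strictly sequential: a profiling report such as `report` often prompts further cleansing rules, after which the data are profiled again.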
Original language: English
Title of host publication: Encyclopedia of Machine Learning and Data Mining
Publication status: Published - 2016

