Data warehouse and data lake both are the centralized storage on an enterprise. But the basic difference between data warehouse and data lake is that the data warehouse has the structured and pre-processed data contrary to this the data lake accommodates a heterogeneous data in its raw format. Data in a data warehouse is analyzed for retrieving strategic information which … [Read more...] about Difference Between Data Warehouse and Data Lake
Data Warehouse and Mining
Data Transformation
Data transformation is data preprocessing technique used to reorganize or restructure the raw data in such a way that the data mining retrieves strategic information efficiently and easily. Data transformation include data cleaning and data reduction processes such as smoothing, clustering, binning, regression, histogram etc. In this section, we will study different … [Read more...] about Data Transformation
Data Reduction
Data reduction is a process that reduced the volume of original data and represents it in a much smaller volume. Data reduction techniques ensure the integrity of data while reducing the data. The time required for data reduction should not overshadow the time saved by the data mining on the reduced data set. In this section, we will discuss data reduction in brief and we … [Read more...] about Data Reduction
Data Integration
Data integration means merging data from several heterogeneous sources. While performing the data integration you have to deal with several issues such as data redundancy, inconsistency, duplicity and many more. In this section, we are going to discuss data integration at a stretch and along with that, we will also discuss the issues or challenges faced during data … [Read more...] about Data Integration
Data Cleaning
Data cleaning is the technique used to eliminate the inconsistencies and irregularities in the data. Redundant or irrelevant data only increase the amount of storage. So, it is very important to clean the data as the inaccurate data not only confuses the data mining programs but also degrades the quality of data. In this section, we will discuss data mining in brief along … [Read more...] about Data Cleaning
