Soft computing is a computing approach used to develop intelligent machines with human-like problem-solving capabilities. These machines offer imprecise but useful solutions to problems that are complex and computationally demanding. The technologies used in soft computing to address such problems include fuzzy logic, neural networks, genetic … [Read more...] about Soft Computing
Data Warehouse and Mining
Genetic Algorithm in Data Mining
A genetic algorithm in data mining is an advanced method of data classification. Data classification involves two steps: the learning step and the classification step. The classification model is constructed in the learning step; in the classification step, the model predicts the output for the provided input. A genetic algorithm is based on the basic principle of … [Read more...] about Genetic Algorithm in Data Mining
Neural Network in Data Mining
A neural network in data mining is a classification method that takes input, trains itself to recognize the pattern in the input data, and predicts the output for new input of a similar kind. Neural networks form the basis of deep learning, a subfield of machine learning, which in turn comes under artificial intelligence. The design of neural network algorithms is inspired by the … [Read more...] about Neural Network in Data Mining
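The train-then-predict idea described above can be illustrated with a single perceptron, the simplest building block of a neural network. This is only a minimal sketch: the AND data set, the learning rate, the epoch count, and the function names are all assumptions made for illustration and are not taken from the article.

```python
def predict(weights, bias, inputs):
    """Fire (return 1) when the weighted sum of the inputs is positive."""
    total = sum(w * x for w, x in zip(weights, inputs)) + bias
    return 1 if total > 0 else 0


def train_perceptron(samples, epochs=20, learning_rate=1):
    """Learn weights with the classic perceptron update rule."""
    weights = [0] * len(samples[0][0])
    bias = 0
    for _ in range(epochs):
        for inputs, target in samples:
            error = target - predict(weights, bias, inputs)
            # Nudge each weight towards the correct answer for this sample.
            weights = [w + learning_rate * error * x
                       for w, x in zip(weights, inputs)]
            bias += learning_rate * error
    return weights, bias


# Hypothetical training data: the logical AND of two binary inputs.
AND_SAMPLES = [((0, 0), 0), ((0, 1), 0), ((1, 0), 0), ((1, 1), 1)]

weights, bias = train_perceptron(AND_SAMPLES)
predictions = [predict(weights, bias, inputs) for inputs, _ in AND_SAMPLES]
print(predictions)
```

A single perceptron can only separate classes with a straight line, so real networks stack many such units in layers.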
Big Data
Big data can be defined as extremely large data sets that keep growing over time. The data comes from disparate sources and is so complex that traditional methods and tools cannot manage it. Big data is not a data repository like a data warehouse; it is a technology invented to manage extremely large volumes of data. In this section, we will be … [Read more...] about Big Data
Data Cube
A data cube in a data warehouse is a multidimensional structure used to store data. The data cube was originally designed for OLAP tools, which need easy access to multidimensional data, but it can also be used for data mining. A data cube represents data in terms of dimensions and facts and is used to represent aggregated data. A data cube is … [Read more...] about Data Cube
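To make the idea of dimensions, facts, and aggregated data concrete, here is a small sketch that aggregates a measure over every combination of dimensions. The sales rows, the dimension names, and the `build_cube` helper are hypothetical and only serve as an illustration; real OLAP tools store and compute cubes far more efficiently.

```python
from collections import defaultdict
from itertools import combinations

# Hypothetical fact rows: three dimensions followed by a sales measure.
FACTS = [
    ("East", "Laptop", "Q1", 100),
    ("East", "Phone", "Q1", 150),
    ("West", "Laptop", "Q1", 200),
    ("West", "Laptop", "Q2", 50),
]
DIMENSIONS = ("region", "product", "quarter")


def build_cube(facts, n_dims):
    """Sum the measure for every subset of dimensions."""
    cube = defaultdict(int)
    for *dims, measure in facts:
        for k in range(n_dims + 1):
            for kept in combinations(range(n_dims), k):
                # "*" marks a dimension that has been aggregated away.
                key = tuple(dims[i] if i in kept else "*"
                            for i in range(n_dims))
                cube[key] += measure
    return dict(cube)


cube = build_cube(FACTS, len(DIMENSIONS))
print(cube[("*", "*", "*")])          # total sales over all dimensions
print(cube[("West", "Laptop", "*")])  # West laptop sales across quarters
```

Each key with one or more `"*"` entries is an aggregate along those dimensions, which is what lets an analyst move between summary and detail views.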
Difference Between Data Warehouse and Data Lake
A data warehouse and a data lake are both centralized storage for an enterprise. The basic difference between them is that a data warehouse holds structured, pre-processed data, whereas a data lake accommodates heterogeneous data in its raw format. Data in a data warehouse is analyzed for retrieving strategic information which … [Read more...] about Difference Between Data Warehouse and Data Lake
Data Transformation
Data transformation is a data preprocessing technique used to reorganize or restructure raw data so that data mining can retrieve strategic information efficiently and easily. Data transformation includes data cleaning and data reduction processes such as smoothing, clustering, binning, regression, and histograms. In this section, we will study different … [Read more...] about Data Transformation
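As an illustration of binning and smoothing, the sketch below sorts the values, splits them into equal-sized bins, and replaces every value with the mean of its bin. The price list and the `smooth_by_bin_means` helper are assumptions chosen for the example, and smoothing by bin means is only one of several possible smoothing schemes.

```python
def smooth_by_bin_means(values, bin_size):
    """Sort the values, cut them into bins, and replace each by its bin mean."""
    ordered = sorted(values)
    smoothed = []
    for start in range(0, len(ordered), bin_size):
        bin_values = ordered[start:start + bin_size]
        mean = sum(bin_values) / len(bin_values)
        smoothed.extend([mean] * len(bin_values))
    return smoothed


# Hypothetical raw prices with some noise.
prices = [21, 4, 28, 8, 34, 15, 24, 21, 25]
smoothed_prices = smooth_by_bin_means(prices, bin_size=3)
print(smoothed_prices)
```

Smaller bins keep more local detail, while larger bins remove more noise at the cost of flattening real variation.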
Data Reduction
Data reduction is a process that reduces the volume of the original data and represents it in a much smaller form. Data reduction techniques preserve the integrity of the data while reducing it. The time required for data reduction should not outweigh the time saved by mining the reduced data set. In this section, we will discuss data reduction in brief and we … [Read more...] about Data Reduction
Data Integration
Data integration means merging data from several heterogeneous sources. While performing data integration, you have to deal with several issues such as data redundancy, inconsistency, and duplication. In this section, we are going to discuss data integration at length and, along with that, we will also discuss the issues or challenges faced during data … [Read more...] about Data Integration
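The redundancy and inconsistency issues mentioned above can be seen in a small sketch that merges records from two sources on a shared key. The source names, the record fields, and the `integrate` helper are hypothetical, and keeping the first value seen for a conflicting field is just one possible policy.

```python
def integrate(source_a, source_b, key="id"):
    """Merge two record lists on a key, dropping duplicates and logging conflicts."""
    merged = {}
    conflicts = []
    for record in list(source_a) + list(source_b):
        existing = merged.get(record[key])
        if existing is None:
            merged[record[key]] = dict(record)
            continue
        for field, value in record.items():
            if field not in existing:
                existing[field] = value  # new attribute from the other source
            elif existing[field] != value:
                # Inconsistent values: keep the first one and report the clash.
                conflicts.append((record[key], field, existing[field], value))
    return list(merged.values()), conflicts


# Hypothetical customer records held by two different systems.
crm = [{"id": 1, "name": "Asha", "city": "Pune"},
       {"id": 2, "name": "Ben", "city": "Delhi"}]
billing = [{"id": 2, "name": "Ben", "city": "Mumbai"},
           {"id": 3, "name": "Chen", "city": "Goa"}]

records, conflicts = integrate(crm, billing)
print(len(records))
print(conflicts)
```

Reporting conflicts instead of silently overwriting values leaves the decision about which source to trust to a later, explicit step.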
Data Cleaning
Data cleaning is the technique used to eliminate inconsistencies and irregularities in the data. Redundant or irrelevant data only increases the amount of storage required. It is therefore very important to clean the data, as inaccurate data not only confuses data mining programs but also degrades the quality of the data. In this section, we will discuss data mining in brief along … [Read more...] about Data Cleaning