AUTHORS : Dr. Saroj Kumar , Mr. Dileep Singh
ISBN : 9789357553674
Syllabus
4.8.3.: Data Warehousing and Data Mining
Module 1: Introduction to Data Ware Housing 10 Hours
Introduction: Fundamentals of Data Mining, Data Mining Functionalities, Classification of Data Mining Systems, Data Mining Task Primitives, Integration of a Data Mining System with a Database or a Data Warehouse System, Major Issues in Data Mining. Data Preprocessing: Need for Preprocessing the Data, Data Cleaning, Data Integration and Transformation, Data Reduction, Discretization and Concept Hierarchy Generation.
Module 2: Data Warehouse and OLAP Technology for Data Mining 10 Hours
Data Warehouse, Multidimensional Data Model, Data Warehouse Architecture, Data Warehouse Implementation, Further Development of Data Cube Technology, From Data Warehousing to Data Mining Data Cube Computation and Data Generalization: Efficient Methods for Data Cube Computation, Further Development of Data Cube and OLAP Technology, Attribute-Oriented Induction.
Module 3: Patterns and Association Rules 10 Hours
Mining Frequent Patterns, Associations and Correlations: Basic Concepts, Efficient and Scalable Frequent Item set Mining Methods, Mining Various Kinds of Association Rules, From Association Mining to Correlation Analysis, Constraint-Based Association Mining
Module 4: Classification and Prediction 10 Hours
Issues Regarding Classification and Prediction, Classification by Decision Tree Induction, Bayesian Classification, Rule-Based Classification, Classification by Back propagation, Support Vector Machines, Associative Classification, Lazy Learners, Other Classification Methods, Prediction, Accuracy and Error measures, Evaluating the Accuracy of a Classifier or a Predictor, Ensemble Methods
Module 5: Cluster Analysis 10 Hours
Cluster Analysis Introduction- Types of Data in Cluster Analysis, A Categorization of Major Clustering Methods, Partitioning Methods, Hierarchical Methods, Density-Based Methods, Grid-Based Methods, Model-Based Clustering Methods, Clustering High-Dimensional Data, Constraint-Based Cluster Analysis, Outlier Analysis.
Module 6: Mining Streams, Time Series and Sequence Data 10 Hours
Mining Data Streams, Mining Time-Series Data, Mining Sequence Patterns in Transactional Databases, Mining Sequence Patterns in Biological Data, Graph Mining, Social Network Analysis and Multirelational Data Mining, Mining Object, Spatial, Multimedia, Text and Web Data: Multidimensional Analysis and Descriptive Mining of Complex Data Objects, Spatial Data Mining, Multimedia Data Mining, Word Cloud, Sentiment Analysis, Text Mining, Mining the World Wide Web.
Specific References
Your review appreciation cannot be sent
Report comment
Report sent
Your report cannot be sent
Write your review
Review sent
Your review cannot be sent
AUTHORS : Dr. Saroj Kumar , Mr. Dileep Singh
ISBN : 9789357553674
check_circle
check_circle