Bachelors Level/Fourth Year/Seventh Semester/Science csit/seventh semester/data warehousing and data mining/syllabus wise questions
B.Sc Computer Science and Information Technology
Institute of Science and Technology, TU
Data Warehousing and Data Mining (CSC420)
Year Asked: 2080, syllabus wise question
Classification and Prediction
1.
How classification differs from regression. Train ID3 classifier using the dataset given below. Then predict class label for the data [Age=Mid, Competition=Yes, Type=HW].
Which algorithm is used for training multi-layer perceptron? Discuss the algorithm in detail.[5]
3.
Write down short notes on: a. Support Vector Machine b. Multi-dimensional Data Model[5]
Cluster Analysis
1.
How K-medoids clustering differs from K-means clustering? Divide the following data points into two clusters using kmedoids algorithm. Show computation up to 3 iterations. {(70,85), (65,80), (72,88), (75,90), (60,50), (64,55), (62,52), (63,58)}.[5]
2.
Discuss working of DBSCAN algorithm.[5]
Data Cube Technology
1.
How many cuboids are possible from 5-dimensional data? Discuss the concept of full cube and iceberg cube.[5]
2.
Explain the OLAP operations with examples.[5]
Data Preprocessing
1.
Discuss different ways of smoothing noisy data along with suitable examples.[5]
Introduction to Data Mining
1.
How KDD differs from data mining? Explain various stages of KDD with suitable block diagram.[5]
Introduction to Data Warehousing
1.
Why the concept of data mart is important? Discuss different data warehouse schema with examples.[10]
Mining Frequent Patterns
1.
State Apriori property. Find frequent item sets and association rules from the transaction database given below using Apriori algorithm. Assume min. support is 50% and min confidence is 75%.