# Q1.Difference between classification of data and clustering of data, You are required to give your o

Aug 15th, 2016
Studypool Tutor
Brick Computer Science Institute
Price: \$15 USD

Tutor description

Q1.Difference between classification of data and clustering of data, You are required to give your opinion about it with any example of your choice. Q2. How we compute ranking cubes for efficient top-k ranking? Can you give example of one application domain where ranking cubes are used? Q3 Can apriori mining algorithm handle convertible constraints? Justify. Q4. Discuss the relationship between colossal and core patterns. Q5. What is boosting? State why it may improve the accuracy of decision tree induction? Q6. Ensemble methods improve classification accuracy. How?

Word Count: 2747
Showing Page: 1/8
Answer--------------1For someone who is new to Data mining, classification and clustering can seem similarbecause both data mining algorithms essentially divide the datasets into sub-datasets; Butthere is difference between them and this blog-post, well see exactly that:CLASSIFICATIONCLUSTERINGWe have a Training setcontaining data that have beenpreviously categorizedBased on this training set, thealgorithms finds the category thatthe new data points belong toWe do not know thecharacteristics of similarity ofdata in advanceUsing statistical concepts, wesplit the datasets into subdatasets such that the Subdatasets have Similar dataSince a Training set exists, wedescribe this technique asSupervised learningSince Training set is not used,we describe this technique asUnsupervised learningExample:We use training datasetwhich categorized customers thathave churned. Now based on thistraining set, we can classifywhether a customer will churn ornot.Example:We use a dataset ofcustomers and split them intosub-datasets of customers withsimilar characteristics. Nowthis information can be used tomarket a product to a specificsegment of customers that hasbeen identified by clusteringalgorithmGive your opinion about it with any example of your choice.-------------------------------------------------Introduces a new mathematical framework for two related classical problems in statisticallearning: data cluster

## Review from student

Studypool Student
" Tutor was very helpful and took the time to explain concepts to me. Very responsive, managed to get replies within the hour. "

1831 tutors are online

Brown University

1271 Tutors

California Institute of Technology

2131 Tutors

Carnegie Mellon University

982 Tutors

Columbia University

1256 Tutors

Dartmouth University

2113 Tutors

Emory University

2279 Tutors

Harvard University

599 Tutors

Massachusetts Institute of Technology

2319 Tutors

New York University

1645 Tutors

Notre Dam University

1911 Tutors

Oklahoma University

2122 Tutors

Pennsylvania State University

932 Tutors

Princeton University

1211 Tutors

Stanford University

983 Tutors

University of California

1282 Tutors

Oxford University

123 Tutors

Yale University

2325 Tutors