Question 1

After reviewing Chapter 6 of your text, provide an example of an association rule from the market basket domain that satisfies each of the following conditions. Also, describe whether such rules are subjectively interesting. A pattern is subjectively interesting if it contradicts the expectation of a user and if it is actionable.

(a) A rule that has high support and high confidence
(b) A rule that has reasonably high support but low confidence
(c) A rule that has low support and low confidence
(d) A rule that has low support and high confidence

Question 2

Clusters of documents can be summarized by finding the top terms (words) for the documents in the cluster, e.g., by taking the most frequent k terms, where k is a constant, say 10, or by taking all terms that occur more frequently than a specified threshold. Suppose that K-means is used to find clusters of both documents and words for a document data set.

How might a set of term clusters defined by the top terms in a document cluster differ from the word clusters found by clustering the terms with K-means?

How could term clustering be used to define clusters of documents?
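For the rule conditions in the first question, it may help to check candidate rules against the standard definitions: the support of a rule A -> B is the fraction of transactions containing both A and B, and its confidence is that count divided by the number of transactions containing A. The sketch below is only an illustration; the `baskets` data and the helper names `support` and `confidence` are made up for this example and do not come from the text.

```python
def support(transactions, itemset):
    """Fraction of transactions that contain every item in `itemset`."""
    itemset = frozenset(itemset)
    return sum(1 for t in transactions if itemset <= t) / len(transactions)


def confidence(transactions, antecedent, consequent):
    """Estimate of P(consequent | antecedent) from the transactions."""
    a = frozenset(antecedent)
    both = a | frozenset(consequent)
    count_a = sum(1 for t in transactions if a <= t)
    count_both = sum(1 for t in transactions if both <= t)
    # A rule whose antecedent never occurs has no defined confidence;
    # returning 0.0 here is a choice made for this sketch.
    return count_both / count_a if count_a else 0.0


# Hypothetical toy baskets, invented purely for illustration.
baskets = [
    frozenset({"bread", "milk"}),
    frozenset({"bread", "milk", "eggs"}),
    frozenset({"bread", "butter"}),
    frozenset({"milk", "eggs"}),
    frozenset({"bread", "milk", "caviar", "vodka"}),
]

# {bread} -> {milk}: relatively high support and high confidence.
print(support(baskets, {"bread", "milk"}), confidence(baskets, {"bread"}, {"milk"}))

# {caviar} -> {vodka}: low support but high confidence.
print(support(baskets, {"caviar", "vodka"}), confidence(baskets, {"caviar"}, {"vodka"}))
```

What counts as "high" or "low" depends on the thresholds chosen for the data set, so the labels in the comments are relative to this tiny example only.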