Answer the questions in the document with no plagiarism.

Homework 2

Answer the following questions (5 points each):

1. Classify the following attributes as binary, discrete, or continuous. Also classify them as qualitative (nominal or ordinal) or quantitative (interval or ratio). Some cases may have more than one interpretation, so briefly indicate your reasoning if you think there may be some ambiguity. Example: Age in years. Answer: Discrete, quantitative, ratio.

(a) Time in terms of AM or PM.

(b) Brightness as measured by a light meter.

(c) Brightness as measured by people’s judgments.

(d) Angles as measured in degrees between 0 and 360.

(e) Bronze, Silver, and Gold medals as awarded at the Olympics.

(f) Height above sea level.

(g) Number of patients in a hospital.

(h) ISBN numbers for books. (Look up the format on the Web.)

(i) Ability to pass light in terms of the following values: opaque, translucent, transparent.

(j) Military rank.

(k) Distance from the center of campus.

(l) Density of a substance in grams per cubic centimeter.

(m) Coat check number. (When you attend an event, you can often give your coat to someone who, in turn, gives you a number that you can use to claim your coat when you leave.)

2. Can you think of a situation in which identification numbers would be useful for prediction?

3. An educational psychologist wants to use association analysis to analyze test results. The test consists of 100 questions with four possible answers each.

(a) How would you convert this data into a form suitable for association analysis?

(b) In particular, what type of attributes would you have and how many of them are there?

4. Which of the following quantities is likely to show more temporal autocorrelation: daily rainfall or daily temperature? Why?
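
As a minimal sketch of what temporal autocorrelation measures, the lag-1 autocorrelation below compares each value in a series with the value that follows it. The function name `lag1_autocorrelation` and both series are made up for illustration; they are not real rainfall or temperature data.

```python
def lag1_autocorrelation(series):
    """Lag-1 autocorrelation: how strongly each value resembles the next one."""
    n = len(series)
    mean = sum(series) / n
    dev = [v - mean for v in series]
    denom = sum(d * d for d in dev)
    if denom == 0:
        return float("nan")  # a constant series has no variation to correlate
    return sum(dev[t] * dev[t + 1] for t in range(n - 1)) / denom


# Hypothetical series: one changes gradually, the other jumps back and forth.
smooth = [1, 2, 3, 4, 5, 4, 3, 2, 1]
erratic = [0, 1, 0, 1, 0, 1]

print(lag1_autocorrelation(smooth))   # positive: neighbours tend to be alike
print(lag1_autocorrelation(erratic))  # negative: neighbours tend to differ
```

A quantity whose neighbouring days tend to resemble each other gives a value near 1, while one that changes abruptly from day to day gives a value near 0 or below.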

5. Many sciences rely on observation instead of (or in addition to) designed experiments. Compare the data quality issues involved in observational science with those of experimental science and data mining.

6. Discuss the difference between the precision of a measurement and the terms single and double precision, as they are used in computer science, typically to represent floating-point numbers that require 32 and 64 bits, respectively.
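
The computer science sense of the term can be illustrated with a short sketch that stores the same value in 32 and 64 bits. It uses Python's standard `struct` module; the helper name `to_single` and the value `0.1` are illustrative assumptions, not part of the exercise.

```python
import struct


def to_single(value: float) -> float:
    """Round a Python float (a 64-bit double) to the nearest 32-bit float."""
    return struct.unpack("<f", struct.pack("<f", value))[0]


reading = 0.1  # hypothetical stored value

print(struct.calcsize("<f") * 8, struct.calcsize("<d") * 8)  # bits per format
print(f"{reading:.20f}")             # the value as held in double precision
print(f"{to_single(reading):.20f}")  # the same value after a single-precision round trip
```

The two printed values differ only in how many digits the format can hold. Neither format says anything about how closely repeated measurements of the same quantity would agree with one another.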

7. Give at least two advantages to working with data stored in text files instead of in a binary format.

8. Distinguish between noise and outliers. Be sure to consider the following questions.

(a) Is noise ever interesting or desirable? Outliers?

(b) Can noise objects be outliers?

(c) Are noise objects always outliers?

(d) Are outliers always noise objects?

(e) Can noise make a typical value into an unusual one, or vice versa?

9. For the following vectors, x and y, calculate the indicated similarity or distance measures.

(a) x = (0,0,1,1), y = (2,2,2,2): cosine, correlation, Euclidean

(b) x = (0,1,0,1), y = (0,1,0,1): cosine, correlation, Euclidean, Jaccard

(c) x = (1,1,0,1), y = (-1,0,-1,0): cosine, correlation, Euclidean

(d) x = (1,0,0,1,0,1), y = (0,1,1,0,0,1): cosine, correlation, Jaccard

(e) x = (2,1,0,2,0,3), y = (1,1,1,0,0,1): cosine, correlation
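
The four measures named in exercise 9 can be sketched in plain Python as follows. This is one common set of definitions, not necessarily the exact formulation your course uses; the function names and the vectors `u` and `v` are hypothetical and deliberately differ from the exercise's vectors.

```python
import math


def cosine(x, y):
    """Cosine similarity: dot product divided by the product of the vector lengths."""
    dot = sum(a * b for a, b in zip(x, y))
    norm_x = math.sqrt(sum(a * a for a in x))
    norm_y = math.sqrt(sum(b * b for b in y))
    if norm_x == 0 or norm_y == 0:
        return float("nan")  # undefined when either vector is all zeros
    return dot / (norm_x * norm_y)


def correlation(x, y):
    """Pearson correlation, computed as the cosine of the mean-centered vectors."""
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    centered_x = [a - mean_x for a in x]
    centered_y = [b - mean_y for b in y]
    # Returns nan when either vector is constant (zero variance, so 0/0).
    return cosine(centered_x, centered_y)


def euclidean(x, y):
    """Euclidean (L2) distance."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(x, y)))


def jaccard(x, y):
    """Jaccard similarity for binary vectors: 1-1 matches over non 0-0 positions."""
    f11 = sum(1 for a, b in zip(x, y) if a == 1 and b == 1)
    not_both_zero = sum(1 for a, b in zip(x, y) if a != 0 or b != 0)
    return f11 / not_both_zero if not_both_zero else float("nan")


# Hypothetical binary vectors, used only to exercise the functions.
u = (1, 0, 1, 0)
v = (1, 1, 0, 0)
print(cosine(u, v))       # approximately 0.5
print(correlation(u, v))  # 0.0
print(euclidean(u, v))    # sqrt(2), about 1.414
print(jaccard(u, v))      # 1/3
```

Correlation is undefined whenever one of the vectors is constant, because the centered vector is all zeros; the sketch reports that case as `nan` rather than raising an error.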

10. This exercise compares and contrasts some similarity and distance measures. For binary data, the L1 distance corresponds to the Hamming distance; that is, the number of bits that are different between two binary vectors. The Jaccard similarity is a measure of the similarity between two binary vectors. Compute the Hamming distance and the Jaccard similarity between the following two binary vectors.

x = 0111010101

y = 0110011010
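
Both measures in exercise 10 can be expressed through the counts of position-wise bit pairs, as in the sketch below. The helper names (`match_counts`, `hamming`, `jaccard`) and the two 4-bit strings are hypothetical and are not the vectors from the exercise.

```python
from collections import Counter


def match_counts(x: str, y: str) -> Counter:
    """Count position-wise bit pairs: '11', '10', '01', and '00'."""
    if len(x) != len(y):
        raise ValueError("bit strings must have the same length")
    return Counter(a + b for a, b in zip(x, y))


def hamming(x: str, y: str) -> int:
    """Hamming distance: number of positions where the bits differ."""
    counts = match_counts(x, y)
    return counts["10"] + counts["01"]


def jaccard(x: str, y: str) -> float:
    """Jaccard similarity: 1-1 matches divided by positions that are not 0-0."""
    counts = match_counts(x, y)
    not_both_zero = counts["11"] + counts["10"] + counts["01"]
    return counts["11"] / not_both_zero if not_both_zero else float("nan")


# Hypothetical 4-bit strings, used only to exercise the functions.
print(hamming("1100", "1010"))  # 2
print(jaccard("1100", "1010"))  # 1/3
```

Writing both measures over the same four counts makes the contrast visible: the Hamming distance counts only mismatches, while the Jaccard similarity also uses the 1-1 matches and ignores the 0-0 matches entirely.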
