By Glenn J. Myatt, Wayne P. Johnson

A proven go-to guide for data analysis, Making Sense of Data I: A Practical Guide to Exploratory Data Analysis and Data Mining, Second Edition focuses on basic data analysis approaches that are necessary to make timely and accurate decisions in a diverse range of projects. Based on the authors' practical experience in implementing data analysis and data mining, the new edition provides clear explanations that guide readers from almost every field of study. To facilitate the steps needed when handling a data analysis or data mining project, a step-by-step approach aids professionals in carefully analyzing data and implementing results, leading to the development of smarter business decisions.

For variables with an even number of values, the average of the two values closest to the middle is selected (sum the two values and divide by 2). The median can be calculated for variables measured on the ordinal, interval, and ratio scales and is often the best indication of central tendency for variables measured on the ordinal scale. It is also a good indication of the central value for a variable measured on the interval or ratio scales since, unlike the mean, it will not be distorted by extreme values.
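The median rule described above can be sketched in a few lines of Python. This is a minimal illustration and is not taken from the book; the function name `median` and the sample values are hypothetical, chosen only to show the even-count case and how the median resists an extreme value while the mean does not.

```python
from statistics import mean


def median(values):
    """Return the median: the middle value of the sorted data, or the
    average of the two middle values when the count is even."""
    ordered = sorted(values)
    n = len(ordered)
    if n == 0:
        raise ValueError("median requires at least one value")
    mid = n // 2
    if n % 2 == 1:
        # Odd count: a single middle value exists.
        return ordered[mid]
    # Even count: sum the two values closest to the middle and divide by 2.
    return (ordered[mid - 1] + ordered[mid]) / 2


# Hypothetical data with an even number of values.
even_sample = [3, 4, 7, 2, 10, 12]
# Same data, but the largest value is replaced by an extreme one.
with_outlier = [3, 4, 7, 2, 10, 120]

print(median(even_sample), mean(even_sample))
print(median(with_outlier), mean(with_outlier))
```

In this made-up sample the median is the same for both lists, whereas the mean shifts substantially once the extreme value is present.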

Figure: Histogram showing an outlier.

Histograms and box plots can be useful in identifying outliers, as previously described. The histogram displays a variable Height where one value is eight times higher than the average of all data points. A particular variable may also have been measured using different units. For example, a variable Weight may have been measured using both pounds and kilograms for different observations, or a variable Price may have been measured in different currencies. These values should be standardized to a single scale so that they can be compared during analysis.
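Both cleaning steps mentioned above can be illustrated with a short Python sketch. Everything here is an assumption made for illustration: the helper names `to_kilograms` and `find_outliers`, the sample observations, and the use of the common 1.5 × IQR box-plot rule for flagging outliers, which the excerpt itself does not specify.

```python
from statistics import quantiles

LB_TO_KG = 0.45359237  # kilograms per pound (international avoirdupois pound)


def to_kilograms(value, unit):
    """Convert a weight recorded in 'kg' or 'lb' to kilograms."""
    if unit == "kg":
        return value
    if unit == "lb":
        return value * LB_TO_KG
    raise ValueError(f"unknown unit: {unit!r}")


def find_outliers(values, k=1.5):
    """Return values lying more than k * IQR beyond the quartiles
    (an assumed box-plot convention, not the book's stated method)."""
    q1, _, q3 = quantiles(values, n=4)
    iqr = q3 - q1
    low, high = q1 - k * iqr, q3 + k * iqr
    return [v for v in values if v < low or v > high]


# Hypothetical Weight observations recorded in mixed units.
observations = [(70.0, "kg"), (154.0, "lb"), (82.5, "kg"), (200.0, "lb")]
weights_kg = [to_kilograms(value, unit) for value, unit in observations]
print(weights_kg)

# Hypothetical Height values (metres) with one implausible entry.
heights = [1.60, 1.65, 1.70, 1.72, 1.75, 1.80, 1.82, 13.6]
print(find_outliers(heights))
```

Converting every observation to one unit before analysis keeps the values comparable. A flagged value should be inspected before it is corrected or removed, since the rule only marks it as unusual.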

FURTHER READING

This chapter has reviewed some of the sources of data used in exploratory data analysis and data mining. The following books provide more information on surveys and polls: Fowler (2009), Rea (2005), and Alreck & Settle (2003). There are many additional resources describing experimental design, including Montgomery (2012), Cochran & Cox (1999), Barrentine (1999), and Antony (2003). Operational databases and data warehouses are summarized in the following books: Oppel (2011) and Kimball & Ross (2013).

