MATH1280 Statistics 1 Discussion forum UNIT3

by 하나는외계인 2022. 8. 14.

MATH1280 Statistics 1 Discussion forum UNIT1

Grade: 9.75 (range 0-10)

 

 

An important practice is to check the validity of any data set that you analyze. One goal is to detect typos in the data, and another would be to detect faulty measurements. Recall that outliers are observations with values outside the “normal” range of values of the rest of the observations.

Specify a large population that you might want to study and describe the type of numeric measurement that you will collect (examples: a count of things, the height of people, a score on a survey, the weight of something). What would you do if you found a couple of outliers in a sample of size 100? What would you do if you found two values that were twice as big as the next highest value?

You may use examples from your area of interest, such as monthly sales levels of a product, file transfer times to different computers on a network, characteristics of people (height, time to run the 100 meter dash, statistics grades, etc.), trading volume on a stock exchange, or other such things.

There is no requirement to use sources from the Internet, but if you use an idea or a quotation from any source, it should be cited (such as putting the author and year at the end of the sentence and then adding a reference at the end to describe the source).

 

---------------------------------------------------------------------------------------------------------------------------------------------------------------

 

I will study the number of wild cats in my town in this discussion forum.

I will take samples at various places in town to collect the data. The data are quantitative and are recorded as counts. "Outliers are values that do not fit with the rest of the data and lie outside of the normal range"; they are "data points with values that are much too large or much too small in comparison to the vast majority of the observations" (Yakir, 2011, chap. 3). If I found a couple of outliers in my sample of 100, I would use the interquartile range (IQR) to confirm that they really fall outside the normal range, and then check whether each one was a recording error before deciding whether to eliminate it (Yakir, 2011).
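
As a rough illustration of the IQR check, here is a minimal Python sketch. The cat counts are made up for the example, the function name `iqr_outliers` is hypothetical, and I am assuming the common rule of flagging values more than 1.5 × IQR below the first quartile or above the third quartile.

```python
import statistics

def iqr_outliers(values, k=1.5):
    """Return the values lying outside [Q1 - k*IQR, Q3 + k*IQR]."""
    # "inclusive" treats the sample minimum and maximum as the 0th and
    # 100th percentiles; this choice of quartile method is an assumption.
    q1, _, q3 = statistics.quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    lower = q1 - k * iqr
    upper = q3 + k * iqr
    return [v for v in values if v < lower or v > upper]

# Hypothetical cat counts from ten sampling places (not real data).
counts = [3, 4, 5, 4, 6, 5, 3, 4, 5, 30]
flagged = iqr_outliers(counts)
print(flagged)
```

A flagged value is only a candidate. I would still go back to the field notes to see whether it was a typo or a faulty count before removing it.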

If I found two values that were twice as large as the next highest value, I would compute the sample standard deviation to examine how spread out the data are, or draw a statistical graph such as a box plot (box-and-whisker plot) to show where the data are concentrated (Yakir, 2011).
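
The check for two unusually large values can be sketched the same way in Python. The counts are again invented, the helper name `top_two_are_double` is hypothetical, and comparing the standard deviation with and without the two largest values is my own way of showing how much they stretch the spread.

```python
import statistics

def top_two_are_double(values):
    """True if the two largest values are each at least twice the third largest."""
    ordered = sorted(values, reverse=True)
    _, second, third = ordered[:3]
    # If the second largest is at least double the third, the largest is too.
    return second >= 2 * third

# Hypothetical cat counts (not real data).
counts = [3, 4, 5, 4, 6, 5, 3, 4, 12, 13]
suspicious = top_two_are_double(counts)

# Sample standard deviation with and without the two largest values.
sd_all = statistics.stdev(counts)
sd_trimmed = statistics.stdev(sorted(counts)[:-2])
print(suspicious, round(sd_all, 2), round(sd_trimmed, 2))
```

A large drop from `sd_all` to `sd_trimmed` suggests that the two values dominate the spread, which is what the whiskers and isolated points of a box plot would show graphically.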

 

References

Yakir, B. (2011). Introduction to statistical thinking (with R, without calculus). The Hebrew University of Jerusalem, Department of Statistics. Retrieved from https://my.uopeople.edu/pluginfile.php/1524524/mod_resource/content/4/IntroStat.pdf
