Stats and soft skills (STATA software)
Assignment3_2022.pdf
I need assistance with something that consists of 20 questions and has a 4-hour time limit. I need this completed no later than Thursday…
In your initial post, share one example of a good graphical representation of data and one example of a bad graphical representation of data. These can be anything you find online, in print, or elsewhere. For each of your two examples, include why you think they are good and bad representations respectively.
You can include visuals and explain why they are good visual representations or bad visual representations, respectively.
EXAMPLE BELOW!!!!!!!
This graph is easy to read at a glance. The colors separate the columns well, and the columns are the same width. The vertical axis is evenly scaled, and there are not a lot of unnecessary numbers above the data totals. The problem with it is that once it is removed from the text, the "during" and "after" labels don’t tell us exactly what is being measured. In the article this is explained in the caption, so I feel that it is an example of a good graphical representation of data.
This pie chart is not easy to read. There are entirely too many categories listed. The labels have to be squeezed together for this reason, so the data is hard to decipher. This would have been better represented by a bar chart (Thales, 2020). For these reasons, I think that this is a bad graphical representation of data.
References:
Gao, T., & Gurd, B. (2019). Hospital size. Chart. BMC Health Services Research, 19(1), 6. https://doiorg.ezproxy.snhu.edu/10.1186/s1291301939076
Thales, B. (2020). Don’t use pie charts. DEV. https://www.dev.to/thalesbruno/dontuespiecharts4177
For this assignment, you are to install the Analysis ToolPak in Excel on your computer for use in this class.
Next, find a set of data in the public domain and copy 20 rows of that data set into an Excel worksheet. Be sure that the data contains at least one quantitative variable.
Using that data and the Analysis ToolPak, calculate the mean, median, range, standard deviation, and variance of the quantitative variable(s) in that data set. Looking briefly at the data set, what types of business questions could you answer using this data?
Submit the Excel worksheet showing the data and the calculation results.
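The assignment itself is done in Excel, but the same five statistics can be cross-checked with a short script. The following is a minimal sketch in Python using only the standard library. The `prices` values are made-up placeholders, not data from the assignment, and the function name `describe` is hypothetical. It assumes the sample (n - 1) formulas for standard deviation and variance, which correspond to Excel's STDEV.S and VAR.S.

```python
import statistics

# Hypothetical sample: 20 made-up prices (in thousands of dollars).
# These values are placeholders, not data from the assignment.
prices = [18.5, 21.0, 22.4, 19.9, 25.3, 30.1, 27.8, 24.6, 23.2, 20.7,
          26.5, 28.9, 31.4, 22.0, 19.3, 24.1, 29.7, 21.8, 23.9, 26.0]


def describe(values):
    """Return the summary statistics the assignment asks for."""
    return {
        "mean": statistics.mean(values),
        "median": statistics.median(values),
        "range": max(values) - min(values),
        # Sample (n - 1) versions, assumed to match STDEV.S and VAR.S
        "std_dev": statistics.stdev(values),
        "variance": statistics.variance(values),
    }


summary = describe(prices)
for name, value in summary.items():
    print(f"{name:>8}: {value:.3f}")
```

If the script and the worksheet disagree on the standard deviation or variance, the usual cause is a mix-up between the sample and population formulas.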
USE BOTH EXCEL FILES PROVIDED, AND INCLUDE THE T-CRITICAL AND Z-CRITICAL VALUES ALONG WITH SE, LOW, AND UP
Using the data set you collected in Week 1, excluding the super car outlier, you should have calculated the mean and standard deviation of the price data during Week 2, along with a p and q from Week 3. Using this information, calculate two 95% confidence intervals. For the first interval, calculate a t confidence interval for the mean. You have the mean, the standard deviation, and the sample size; all you have left to find is the t-critical value, and then you can calculate the interval. For the second interval, calculate a proportion confidence interval using the proportion of cars that fall below the average. You have p, q, and n; all that is left is calculating a z-critical value.
Make sure you include these values in your post so your fellow classmates can use them to calculate their own confidence intervals. Once you calculate the confidence intervals, you will need to interpret each interval and explain in words what it means.
Do the confidence intervals surprise you, knowing what you have learned about confidence intervals, proportions, and the normal distribution? Please see the Week 5 Confidence T-Interval Mean and Unknown SD PDF and the Week 5 Confidence Interval Proportions PDF at the bottom of the discussion. These give a step-by-step example of how to calculate the intervals using Excel. These PDFs will also help you in the Quizzes section.
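As a rough cross-check of the Excel work, the two intervals can be sketched in Python with the standard library. This is a sketch under stated assumptions, not the method from the Week 5 PDFs: the function names `t_interval` and `proportion_interval` are hypothetical, and the mean, standard deviation, p, and n below are made-up placeholders. The standard library has no t distribution, so the t-critical value is passed in by hand (in Excel it can be obtained with T.INV.2T(0.05, n - 1)); the z-critical value comes from `statistics.NormalDist`. The proportion interval assumes the usual normal-approximation formula p ± z·sqrt(pq/n).

```python
import math
from statistics import NormalDist


def t_interval(mean, sd, n, t_crit):
    """t confidence interval for a mean when the population SD is unknown."""
    se = sd / math.sqrt(n)      # standard error of the mean
    margin = t_crit * se        # margin of error
    return se, mean - margin, mean + margin


def proportion_interval(p, n, confidence=0.95):
    """Normal-approximation confidence interval for a proportion."""
    q = 1 - p
    # Two-tailed z-critical value, e.g. the 0.975 quantile for 95% confidence
    z_crit = NormalDist().inv_cdf(1 - (1 - confidence) / 2)
    se = math.sqrt(p * q / n)   # standard error of the proportion
    margin = z_crit * se
    return z_crit, se, p - margin, p + margin


# Hypothetical inputs, not taken from the assignment data.
n = 20
t_crit = 2.093  # two-tailed 95% t-critical value for df = n - 1 = 19
se, low, up = t_interval(mean=24.0, sd=4.0, n=n, t_crit=t_crit)
print(f"t interval: SE={se:.3f}, LOW={low:.3f}, UP={up:.3f}")

z_crit, se_p, low_p, up_p = proportion_interval(p=0.55, n=n)
print(f"proportion interval: z={z_crit:.3f}, SE={se_p:.3f}, "
      f"LOW={low_p:.3f}, UP={up_p:.3f}")
```

Each function returns the SE, LOW, and UP values the post is asked to report, so they can be compared directly against the worksheet cells.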
Once you have posted your initial discussion, you must reply to at least two other learners' posts. Each post must be on a different topic. So, you will have your initial post on one topic, your first follow-up post on a different topic, and your second follow-up post on one of the other topics. Of course, you are more than welcome to respond to more than two learners.
I attached the data set. Use the data set to complete the items below. I need 2 pages.
1. One research question
2. Two variables (in the dataset) to analyze based on your research question:
a) One independent variable (IV) and an explanation of what kind of scale/variable it is (e.g., nominal/groups, interval, ordinal)
b) At least one dependent variable (DV) and an explanation of what kind of scale/variable it is (e.g., nominal/groups, interval, ordinal)
3. At least one hypothesis that you will test using data analysis, hypothesizing the relationship between the IV and DV. Note that your hypothesis and variables should allow you to utilize a hypothesis test we have learned/are learning about in class (e.g., t-test, ANOVA, correlation between two continuous variables)
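To make the pairing of a nominal IV with an interval DV concrete, here is a minimal sketch in Python of a two-group comparison. It computes Welch's unequal-variance t statistic, which is an assumption on my part and may differ from the pooled-variance t-test taught in class. The function name `welch_t` and both groups of scores are hypothetical and do not come from the attached data set.

```python
import math
import statistics


def welch_t(group_a, group_b):
    """Welch's t statistic and approximate df for two independent groups."""
    n_a, n_b = len(group_a), len(group_b)
    # Squared standard error contributed by each group (sample variance / n)
    se_sq_a = statistics.variance(group_a) / n_a
    se_sq_b = statistics.variance(group_b) / n_b
    diff = statistics.mean(group_a) - statistics.mean(group_b)
    t = diff / math.sqrt(se_sq_a + se_sq_b)
    # Welch-Satterthwaite approximation for the degrees of freedom
    df = (se_sq_a + se_sq_b) ** 2 / (
        se_sq_a ** 2 / (n_a - 1) + se_sq_b ** 2 / (n_b - 1)
    )
    return t, df


# Hypothetical example: the IV is group membership (nominal),
# the DV is a test score (interval).
control = [72, 75, 68, 80, 77, 74]
treatment = [78, 82, 79, 85, 81, 76]
t_stat, df = welch_t(control, treatment)
print(f"t = {t_stat:.3f}, df = {df:.1f}")
```

The t statistic would then be compared against a t-critical value for the computed degrees of freedom, or converted to a p-value in Excel or a statistics package.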
When our data aligns with the characteristics of a normal distribution, it has specific properties that help us interpret results and make decisions.
Respond to the following in a minimum of 175 words:
