Question 1
Quantitative data are data in numerical form, such as figures and percentages. Using these figures, the analyst assumes that the calculations can produce an accurate conclusion that can be generalized to a wider population. Qualitative analysis, on the other hand, examines individual cases in depth, with the goal of defining and interpreting meaning through language, narrative or visual evidence by developing themes specific to that group of participants (Kaushik and Mathur, 2014). Qualitative analysis provides knowledge only about the individual cases examined, although the theories it produces are often broader assumptions. Quantitative techniques may then be used to confirm which of those assumptions are valid.
- The response options below are taken from the given question:
Opinion | Response |
Vote against the law | 672 |
Vote for law | 295 |
No opinion | 51 |
The information above is a categorical variable: each respondent's opinion on the tax for the new railroad project. The variable has three categories (vote for, vote against and no opinion) with no inherent ranking or order. Hence the measurement scale is nominal.
- The graphical presentation below shows whether general support is for or against the increase in the development tax needed to start the new railroad project. A bar chart is the most suitable presentation for categories of this kind, so it is used here.
The chart shows that the largest number of votes is against the law, which means that fewer people support paying the development tax required to start the new railroad project.
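The comparison of the three categories can be made concrete by converting the counts into percentage shares. The short sketch below uses only the counts from the response table; the variable names are illustrative, and the rounding to one decimal place is an assumption.

```python
# Vote counts copied from the response table in Question 1.
votes = {
    "Vote against the law": 672,
    "Vote for law": 295,
    "No opinion": 51,
}

total = sum(votes.values())

# Percentage share of each category, rounded to one decimal place.
shares = {opinion: round(100 * count / total, 1) for opinion, count in votes.items()}

for opinion, pct in shares.items():
    print(f"{opinion}: {votes[opinion]} responses ({pct}%)")
```

On these counts, roughly two thirds of the 1018 respondents vote against the law, which is what the bar heights reflect.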
Question 2
To describe how much time is spent in weekly meetings, a data set of 25 observations is used.
- Below is the summary of the total time spent in meetings each week by 25 CEOs, measured in hours:
Statistic | Time spent |
Mean | 18.32 |
Sample Variance | 11.56 |
Mode | 15 |
Range | 11 |
Maximum | 23 |
Standard Deviation | 3.4 |
Median | 19 |
Minimum | 12 |
Quartile3 | 21 |
Quartile1 | 15 |
Quartile2 | 19 |
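The raw observations are not reproduced here, but the reported figures can be cross-checked against one another. The sketch below uses only values from the summary table; the interquartile range is derived from the quartiles and is not part of the original output.

```python
import math

# Values copied from the summary table (hours per week).
sample_variance = 11.56
maximum, minimum = 23, 12
q1, q3 = 15, 21

# The standard deviation is the square root of the sample variance.
std_dev = math.sqrt(sample_variance)

# The range is the gap between the largest and smallest observation.
value_range = maximum - minimum

# The interquartile range (not reported in the table) follows from Q1 and Q3.
iqr = q3 - q1

print(round(std_dev, 2), value_range, iqr)
```

The results agree with the reported standard deviation of 3.4 and range of 11, and show that the middle half of the observations spans 6 hours.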
- To prepare the frequency distribution with percentage frequencies, a class width of two hours is used.
Highest value = 23, Lowest value = 12
Classes | Frequency | Percentage Frequency |
12 | 1 | 4.0% |
14 | 2 | 8.0% |
16 | 6 | 24.0% |
18 | 3 | 12.0% |
20 | 5 | 20.0% |
22 | 4 | 16.0% |
24 | 4 | 16.0% |
Total | 25 | 100% |
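The percentage column follows directly from the frequencies: each class frequency is divided by the total number of observations. A minimal sketch, using the class labels and frequencies from the table (the variable names are illustrative):

```python
# Class labels and frequencies copied from the frequency table.
classes = [12, 14, 16, 18, 20, 22, 24]
frequencies = [1, 2, 6, 3, 5, 4, 4]

n = sum(frequencies)  # total number of observations

# Percentage frequency = class frequency / total observations * 100.
percentages = [100 * f / n for f in frequencies]

for label, f, pct in zip(classes, frequencies, percentages):
    print(f"{label:>2} | {f} | {pct:.1f}% |")
```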
- The diagram below shows the time spent by 25 CEOs in weekly meetings, measured in hours. The tail of the distribution extends toward the left, while most values lie in the higher classes. Together with a mean (18.32) slightly below the median (19), this indicates that the data are negatively skewed.
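The direction of the skew can also be checked numerically from the summary statistics. The sketch below uses Pearson's second skewness coefficient, 3 * (mean - median) / standard deviation. This is a rule-of-thumb measure chosen here for illustration; it is not necessarily the measure behind the original analysis.

```python
# Values copied from the summary table for time spent in meetings (hours).
mean, median, std_dev = 18.32, 19, 3.4

# Pearson's second skewness coefficient: negative when the mean lies below the median.
skewness = 3 * (mean - median) / std_dev

print(round(skewness, 2))
```

A negative value is consistent with a left (negative) skew.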
Question 3
The research questions considered here can be addressed with several data collection methods, and likewise several sampling methods are available. For each research objective, the most suitable data collection and sampling methods are identified below.
- Investigate the voting intentions of Australian voters in the forthcoming election:
Data collection: Since many people will be involved, a questionnaire is the appropriate way to collect the data, because closed-ended questions work well with large samples.
Definition of random sampling: Random sampling is a method in which the sample is drawn at random, so that a conclusion can be reached without bias toward any part of the population.
Significance: In any research where a large number of people or a large amount of data must be investigated, random sampling is the best method, because it is not possible to investigate the entire population before drawing a conclusion. In this case, it is not possible to obtain the views of the entire population of Australia.
Sampling: Random sampling is suitable for this research, as it would not be possible to travel all over Australia asking each citizen about their choice. To draw a conclusion, Australian voters can be picked at random and questioned.
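The idea of simple random sampling can be sketched in a few lines. The sampling frame below is hypothetical (ID numbers standing in for enrolled voters), and the sample size of 500 and the seed are arbitrary assumptions made for illustration.

```python
import random


def simple_random_sample(population, k, seed=None):
    """Draw k distinct units from population, each equally likely to be chosen."""
    rng = random.Random(seed)  # a fixed seed makes the draw reproducible
    return rng.sample(population, k)


# Hypothetical sampling frame: ID numbers standing in for enrolled voters.
voter_ids = list(range(1, 10001))

voter_sample = simple_random_sample(voter_ids, k=500, seed=42)
print(len(voter_sample), "voters selected")
```

Because every unit has the same chance of selection, the sample is not biased toward any region or group, provided the sampling frame itself covers the population.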
- Find out why the top four banks are not passing on to their borrowers the full interest rate cuts introduced by the Reserve Bank of Australia:
Data collection: The interview method can be used to collect this data, with banking and financial experts interviewed to find out the likely reasons.
Definition of convenience sampling: Convenience sampling is a method in which the sample is taken from people or groups who are related to the research and can be approached easily. The only criteria are that people are available and agree to participate.
Significance: In this scenario, the information to be gathered relates to banks, and only people from the banking sector can provide inside information. Convenience sampling is therefore the best method, because it draws on people who are related to the research, have information about it and are easily available.
Sampling: Convenience sampling can be used to carry out this research. For this objective, any expert in the field who is easy to approach can be interviewed to find out the banks' position (Palinkas et al., 2015).
- Understand the demographic profile of the Melbourne community within the Hume City Council area:
Data collection: To understand this, a survey can be carried out in the area to obtain the required details from the people living there.
Definition of stratified sampling: Stratified sampling is a method in which the population is divided into smaller groups (strata) before sampling. People with common characteristics are placed in the same group, and samples are then drawn at random from each group.
Significance: In this scenario, specific information is required from people within the community. Stratified sampling is best suited to gathering it, because groups are formed only from people who belong to the specified community and share common characteristics. Once the groups are formed, samples are drawn at random from within each group to reach a conclusion.
Sampling: For this research, stratified sampling should be used. People with similar characteristics are gathered into subgroups, which makes it easier to establish their demographic profile.
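Stratified sampling as defined above can be sketched as follows. The residents, the age-group strata and the 10% sampling fraction are all hypothetical and chosen only to illustrate the procedure; the helper name `stratified_sample` is likewise an assumption.

```python
import random
from collections import defaultdict


def stratified_sample(units, stratum_of, fraction, seed=None):
    """Draw the same fraction at random from every stratum."""
    rng = random.Random(seed)
    strata = defaultdict(list)
    for unit in units:
        strata[stratum_of(unit)].append(unit)

    sample = []
    for name in sorted(strata):
        members = strata[name]
        # Take at least one unit per stratum so that no group is left out.
        k = max(1, round(fraction * len(members)))
        sample.extend(rng.sample(members, k))
    return sample


# Hypothetical residents: (resident_id, age_group) pairs.
residents = (
    [(i, "18-34") for i in range(0, 400)]
    + [(i, "35-64") for i in range(400, 900)]
    + [(i, "65+") for i in range(900, 1000)]
)

strat_sample = stratified_sample(residents, stratum_of=lambda r: r[1], fraction=0.1, seed=1)
counts = {g: sum(1 for _, grp in strat_sample if grp == g) for g in ("18-34", "35-64", "65+")}
print(counts)  # {'18-34': 40, '35-64': 50, '65+': 10}
```

Each subgroup appears in the sample in the same proportion as in the population, which is what makes the demographic profile easy to read off.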
- Obtain adults' opinions on whether the use of marijuana should be legalized in Australia:
Data collection: The focus group method can be used here, as views on legalizing the use of marijuana in Australia can be gathered from adults. Because the method involves surveying, interviewing and observing the focus group, the adult group can be surveyed, interviewed and observed.
Sampling: In this research the random sampling method can be applied, which means taking the opinions of randomly selected adults.
- Evaluate the average age of children in Melbourne:
Data collection: Records and documents can be used to find the average age of the children. These include census reports or similar sources that help in determining the average age of the children.
Sampling: The stratified sampling method can be used, in which the children's data is separated from the complete data set. That is why stratified sampling should be used.
Question 4
- To find out whether watching television is associated with an increase in weight, a scatterplot can be used to lay out the relationship between time spent in front of the television and being overweight. In the overweight data, a negative number indicates an underweight child.
- The correlation coefficient between the variables can be calculated with the Data Analysis ToolPak in Excel.
Hours/KG | Television | Overweight |
Television | 1 | |
Overweight | 0.89119 | 1 |
Definition of correlation coefficient: It is a statistical measure of the strength of the relationship between the relative movements of two variables. Its value ranges from -1.0 to 1.0. If a calculated value is greater than 1.0 or less than -1.0, the correlation has been measured incorrectly. A value of -1.0 reflects a perfect negative correlation and a value of 1.0 reflects a perfect positive correlation. A value of 0.0 reflects no linear relationship between the movements of the two variables.
The correlation coefficient is 0.89119, which means that there is a strong positive relationship between overweight (kg) and television (hours). As the hours of watching television increase, overweight tends to increase, and vice versa, reflecting a linear relationship between the two variables.
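The definition above corresponds to the Pearson correlation coefficient, which can be computed by hand as sketched below. The data in the example are hypothetical values chosen to show the two extreme cases; they are not the 15 observations from the study, which are not reproduced in this report.

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length with at least 2 points")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    # Sum of cross-products and sums of squared deviations from the means.
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


# Hypothetical illustrative values, NOT the observations from the study.
hours = [10, 20, 30, 40, 50]
rising = [1, 2, 3, 4, 5]    # moves exactly with hours
falling = [5, 4, 3, 2, 1]   # moves exactly against hours

print(pearson_r(hours, rising))   # 1.0
print(pearson_r(hours, falling))  # -1.0
```

The function is undefined when either variable is constant, because the denominator is then zero.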
- Regression output in excel is mentioned below:
SUMMARY OUTPUT
Regression Statistics | Value |
Multiple R | 0.8912 |
R Square | 0.7942 |
Adjusted R Square | 0.7784 |
Standard Error | 1.5894 |
Observations | 15 |
ANOVA
Source | DF | SS | MS | F | Significance F |
Regression | 1 | 126.757503 | 126.757503 | 50.17424642 | 0.0000 |
Residual | 13 | 32.84249702 | 2.526345925 | | |
Total | 14 | 159.6 | | | |
Coefficients
Term | Coefficient | Standard Error | t Stat | P-value | Lower 95% | Upper 95% |
Intercept | -11.069 | 1.973 | -5.611 | 0.000 | -15.331 | -6.807 |
Television(hours) | 0.434 | 0.061 | 7.083 | 0.000 | 0.302 | 0.567 |
The estimated regression equation:
Overweight = -11.069 + 0.434 * Television
Explanation:
Intercept: When television hours are zero, the expected overweight is -11.069 kg on average, that is, about 11.069 kg underweight.
Slope: Overweight increases by 0.434 kg on average for each additional hour of television.
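The fitted equation can be used to produce predictions. The sketch below uses the two reported coefficients; the input of 30 hours is a hypothetical value chosen for illustration, and whether it lies within the range of the observed data is not known from this report.

```python
# Coefficients copied from the regression output.
INTERCEPT = -11.069
SLOPE = 0.434


def predict_overweight(tv_hours):
    """Predicted overweight in kg for a given number of television hours."""
    return INTERCEPT + SLOPE * tv_hours


# Hours at which the fitted line crosses zero (neither overweight nor underweight).
break_even_hours = -INTERCEPT / SLOPE

print(round(predict_overweight(0), 3))   # -11.069
print(round(predict_overweight(30), 3))  # 1.951
print(round(break_even_hours, 1))        # 25.5
```

Predictions far outside the observed range of television hours, including the intercept at zero hours, should be read with caution.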
- The coefficient of determination is 0.7942, which means that 79.42% of the variation in overweight is explained by the hours spent in front of the television. The model therefore fits the data reasonably strongly.
- According to the summary output above, the t statistic for television hours is 7.083 and the p-value is 0.000. Since the p-value is below 0.05, the variable is statistically significant, which indicates a relationship between overweight and television hours.
- According to the summary output above, the standard error is 1.5894 and the p-value of the F statistic is 0.000. As this p-value is below the 0.05 significance level, the model as a whole is statistically significant. The R-square value is also used to assess the model's fit; at 0.7942, it indicates that the model is a good fit.
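The fit statistics quoted in these points can be reproduced from the sums of squares in the ANOVA table. The sketch below uses only values from the regression output; the variable names are illustrative.

```python
import math

# Values copied from the ANOVA block of the regression output.
ss_regression = 126.757503
ss_residual = 32.84249702
ss_total = 159.6
n = 15           # observations
k = 1            # predictors (television hours)
t_slope = 7.083  # reported t statistic for television hours

r_squared = ss_regression / ss_total
adj_r_squared = 1 - (1 - r_squared) * (n - 1) / (n - k - 1)

ms_regression = ss_regression / k
ms_residual = ss_residual / (n - k - 1)

f_stat = ms_regression / ms_residual
standard_error = math.sqrt(ms_residual)  # residual standard error
multiple_r = math.sqrt(r_squared)        # equals |r| in simple regression

print(round(r_squared, 4), round(adj_r_squared, 4))
print(round(standard_error, 4), round(f_stat, 2), round(multiple_r, 4))
# With a single predictor, the squared t statistic of the slope matches F.
print(round(t_slope ** 2, 2))
```

The computed values agree with the reported R Square, Adjusted R Square, Standard Error, F and Multiple R, up to rounding.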
References:
Kaushik, M. and Mathur, B., 2014. Data analysis of students marks with descriptive statistics. International Journal on Recent and Innovation Trends in Computing and Communication, 2(5), pp.1188-1190.
Kim, T.K., 2017. Understanding one-way ANOVA using conceptual figures. Korean Journal of Anesthesiology, 70(1), p.22.
Palinkas, L.A., Horwitz, S.M., Green, C.A., Wisdom, J.P., Duan, N. and Hoagwood, K., 2015. Purposeful sampling for qualitative data collection and analysis in mixed method implementation research. Administration and Policy in Mental Health and Mental Health Services Research, 42(5), pp.533-544.
Suri, H., 2011. Purposeful sampling in qualitative research synthesis. Qualitative Research Journal, 11(2), p.63.
Taherdoost, H., 2016. Sampling methods in research methodology; how to choose a sampling technique for research. How to Choose a Sampling Technique for Research (April 10, 2016).