TSTA602 Assignment 1
April 25, 2020
Instructions. This assignment has a total of 20 marks and worth 20% of the final grade of TSTA602. The detailed marks allocation are provided at the beginning of each question. You are encouraged to discuss the assignment with your peers, but must write down your own solution. Unless otherwise specified, please calculate your answer to four decimal places, if your answer is not exact and you have to approximate your result. This assignment is due at 5pm on Friday of Week 6 (8th May, 2020). Please submit your assignment before the deadline.
Scenario. Bank X provides home loan service to their retail customers and charge a interest rate for their service. Due to market fluctuations, Bank X only offers floating interest rate (FIR) for their home loan customers. A friend of you who works as a banker at Bank X told you that the average yearly FIR in the last thirty years follows a Normal distribution with mean 3.2% (i.e., µ = 0.032) and standard deviation 0.04 (i.e., s = 0.04). In addition, a sample of size 10 (drawn from N(0.032,
0.0016)) is available to you which has been summaried in the table below
-0.021 0.029 -0.009 -0.002 0.002 -0.006 0.006 -0.064 0.023 0.031
Table 1: A sample of FIR
Your friend seek help from you to use your knowledge from TSTA602 to assist Bank X to do the following data analysis. Please write a report to answer all the following questions.
(a). (3 marks) calculate (mannually) the sample mean, and sample standard deviation for the sample in Table 1.
(b). (4 marks) if we draw samples of sizes 10 many times and form a distribution of sample mean, state the distribution of the sample mean and provide the reason for your conclusion; calculate the mean of the sample means and its standard deviation.
(c). (5 marks) based on (b), find the probability that the sample mean is smaller than 0.02. Please keep two decimal places in the calculation of standardization.
(d). (2 marks) if the sample mean from (a) is your observed value, calculate the 95% confidence interval for the sample mean.
(e). (2 marks) An outlier is a data point outside the interval [Q1- 1.5IQR,Q3 +
1.5IQR], where Q1 and Q3 are first and third quartile respectively, and IQR
is the interquartile range. Explain whether there is any outlier appears in the sample in Table 1? You can calculate Q1 and Q3, and IQR using R (in Rstudio).
(f). (4 marks) Use R (in Rstudio) to generate 1,000,000 samples of size 10 from N(0.032, 0.0016) (please set the seed equals to 602), compute the sample mean for each of these samples, draw a histgram (set the frequency parameter to FALSE) for these sample means and add a density curve to the histgram. Please provide the histgram with the density curve here and attach your R code as appendix.