# 🖩 Point Estimation Methods In the previous article we could see how the estimators support through the maximum likelihood method. But now we are going to study another method to obtain point estimators and it is the moment method. ## Moment Method The idea of this method is to match the population characteristics with the corresponding sample characteristics. Suppose we have a random variable x that has a point probability function for the discrete case p(x) or for the continuous case f(x) with a density function. We will call the moment of order k or population moment E(x^k), if the hopes exist. When studying the moment-generating function for a random variable. We can see that the moments are related to the parameters that are part of an associated distribution. Suppose a random sample of order k: x1, x2,…, xn, the sample moment is: `(∑_(i=1)^n▒x_i^k )/n` Given a random sample x1, x2, x3,…, xn from a distribution whose point probability function or perhaps its density function depends on m number of parameters θ1, θ2,…, θm and the moment estimators are obtained by equating the moments populations with their corresponding sample moments, that is, for this it will be necessary to solve the following system of equations: `E(x^k)= (∑_(i=1)^n▒x_i^k )/n` On the left side of the equation we have the population moments and they are equal to the right side of the equation, where we will have the sample moments. If we have a single parameter, we only have to find the first population moment and we will have a single equation: `E(x) = 1/n ∑_(i=1)^n▒x_i` But given the case that we have two parameters, we must solve the system of equations between the first population moment and the second population moment: 1st population moment: `E(x) = 1/n ∑_(i=1)^n▒x_i` 2nd population moment: `E(x^2)= 1/n ∑_(i=1)^n▒〖x_i〗^2` Therefore, as many moments are equal as parameters it finds to estimate. It is important to take into account that the sample moments only depend on the random sample, while the population moments will most likely appear in the parameters that we want to estimate, that is, it is from there that we will obtain the equations to clear the parameters that we require estimate. We are going to find the left side of the equation by calculating the mathematical expectation and in this way we will obtain an equation that contains the parameter to be able to solve it. ## Mathematical hope When we have a random variable X, the mean value that represents this variable is what we call the mathematical expectation or expected value. This expected value is obtained by adding the probabilities of a random event, multiplied by the value of that random event. In other words, it is the mean value of a certain data set or the mean value that an event has occurred. When we talk about discrete distributions, we refer to those that have the same probability for each event and in this case the mean will be equal to the mathematical expectation. ## Example of mathematical expectation Let's go with an example to find the mathematical expectation of an event: suppose that we throw a six-sided die and the random variable is defined by x squared of the result that the die shows when it falls. What would be the mathematical expectation of x? Since x is the square of the result, the values that the random variable will take are: 1, 4, 9, 16, 25 and 36. On the other hand, what is the probability that any of the heads will fall? Each head has a 1/6 chance. |x|1|4|9|16|25|36| |:---:|:---:|:---:|:---:|:---:|:---:|:---:| |P(x)|1\6|1\6|1\6|1\6|1\6|1\6| |:---:|:---:|:---:|:---:|:---:|:---:|:---:| According to the mathematical expectation formula, we must carry out the summation of the multiplication of the values that x takes multiplied by its probability, that is: `E(x) = ∑▒〖x.P(x)〗 = 1. 1/6 + 4. 1/6 + 9. 1/6 + 16. 1/6 + 25. 1/6 + 36. 1/6` `E(x) = 91/6 ≈ 15.17` This value being the mathematical expectation of the event Example 2 to find the mathematical expectation: Suppose we have a discrete random variable whose probability distribution is given by: `P(x) = {█(kx x=1,2,3,4 y 5@ k(10-x) x=6,7,8,9)┤` Find the value of the constant k To find the probability of the values of between 1 and 5, we simply need to multiply them by k. But for the values of x that are between 6 and 9 (both inclusive) we must apply the formula k (10 – x) so: |x|1|2|3|4|5|6|7|8|9| |:---:|:---:|:---:|:---:|:---:|:---:|:---:|:---:|:---:|:---:| |P(x)|1k|2k|3k|4k|5k|k(10-6)|k(10-7)|k(10-8)|k(10-9)| |:---:|:---:|:---:|:---:|:---:|:---:|:---:|:---:|:---:|:---:| |x|1|2|3|4|5|6|7|8|9| |:---:|:---:|:---:|:---:|:---:|:---:|:---:|:---:|:---:|:---:| |P(x)|1k|2k|3k|4k|5k|4k|3k|2k|1k| |:---:|:---:|:---:|:---:|:---:|:---:|:---:|:---:|:---:|:---:| In order to find the constant k we must take into account that the sum of the probabilities is always equal to unity: `∑▒〖P(x)=1〗` `1k + 2k + 3k + 4k + 5k + 4k + 3k + 2k + 1k = 1` `25k = 1` `k = 1/25` Once the value of k is known, we can find the mathematical expectation `E(x) = ∑▒〖x.P(x) 〗 = 1. 1/25 + 2. 2/25 + 3. 3/25 + 4. 4/25 + 5. 5/25 + 6. 4/25 + 7. 3/25 + 8. 2/25 + 9. 1/25` `E(x) = 5` Once we know how to calculate the mathematical expectation, we can carry out a population moment exercise. ## Exercise 1 Of Moments Let x be a random variable with a probability density function that is given by: `f(x) = {█(θx^(θ-1),0 0.05` In this case, we will observe evidence against the null hypothesis given the case that we obtain, a proportion of the sample that is relatively high. **When the type of contrast that we are carrying out when the problem is two samples is:** `Ho: ph = pm` This is an example for the proportion of smokers that exist among the students of the engineering faculty, which is determined as being the same for both men and women and in this case, two samples would have to be drawn, one for each population group . The alternative hypothesis would be that both proportions are simply different: `Ha: ph ≠ pm` We can also establish a somewhat stricter alternative hypothesis, which is that the proportion of smokers that exists among male students is higher than that of female smokers: `Ha: ph > pm` The way to solve the hypothesis test will be different depending on the specification of the alternative hypothesis. For the case where it is established that the alternative hypothesis is to state that the population of male smokers is different from the population of smokers, a two-tailed contrast would be carried out. While in the second alternative hypothesis, where it is established that the population of smokers is greater than that of female students, a single-tailed test would be carried out. ![Male Smoking](_static/images/point-estimation-methods/image3.jpeg) The idea of this type of problem is not precisely to test a numerical value for both parameters, but only to demonstrate that both are equal to each other. However, as the contrast is carried out, the proportions of each population will also be estimated, if they are potentially different or estimate a single proportion, but using both samples, that in the case of not rejecting the null hypothesis. ![Female Smoking](_static/images/point-estimation-methods/image4.jpeg) Another way of dealing with hypothesis testing is through the equality of variance or mathematical expectations between possible different populations. ## Hypothesis testing regarding proportions It is important to find the ideal method when carrying out a hypothesis test, for the case we are studying, where we want to test the population proportion. We are going to keep the assumption about the rule of rejection determined a priori, which is if the estimator of the sample proportion after the advertising campaign is equal to or greater than 0.07. We can commit a type I error when rejecting the null hypothesis being true, since the decision rule is to reject Ho when the proportion of the sample is equal to or greater than 0.07 and is denoted as p = X/n since the level of contrast significance is: `α=P(p≥0.07/ p =0.05)= P( X/n ≥0.07 / p = 0.05)` X being the number of clients who use the perfume among the people interviewed and n is the number of the sample, which in this case is 200. To find the probability, we use the information that, as is a Bernoulli probability distribution, the sum of the values of X would be a function of B(n,p) and therefore: `α=P(p≥0.07/ p =0.05)= P(X ≥ (0.07)(200) = 14 / p = 0.05)` `α=1- ∑_(i=0)^13▒P(X=i/p=0.05) = 1 - ∑_(i=0)^13▒(200¦i) 〖(0.05)〗^i 〖(0.95)〗^(200-i)` As we can see, calculating each of the addends is a somewhat complex process, so it is preferable to use the Poisson approximation in the binomial distribution or the normal approximation can also be used, when the probability p is small, however, the product np is quite large, so we have np = (200) (0.05) = 10. With this value λ = 10 and the number equal to 13 (which is the maximum value to take), we will go to the tables of the Poisson distribution and from there we obtain: The probability that the variable takes these numerical values is: 0.8645, so that α = 1 - 0.8645 = 0.1355, this being the contrast significance. In the same way, we could calculate the probability of committing a type II error, but for this it is necessary to assume that the population proportion has undergone a change and now has a new value of p '. Let's suppose that now p '= 0.6 and even though it is unknown, we must remember that the null hypothesis is not rejected if X <14, so: `β = P( X < 14 / p´ = 0.06) = ∑_(i=0)^13▒(200¦i) 〖(0.06)〗^i 〖(0.94)〗^(200-i) = 0.682` When we look for the value in the Poisson distribution tables for λ = np = (200) (0.06) = 12. But what if we want to calculate assuming that p´ = 0.07, we realize that λ = np = (200)(0.07) = 14 and when calculating β = 0.464. Therefore if we keep increasing the value of p '= 0.8 the value of λ = 16 and β = 0.275. This implies that if the true proportion of current perfume consumers exceeds 5%, the probability that we have of committing a type II error will be less as the difference between the proportion before the advertising campaign and the proportion that is get after the campaign. Depending on the strategy used to carry out the contrast, the values of α and β will vary. If the cosmetics company wanted to exceed 7% of consumers after the campaign, it would be being very demanding when rejecting the null hypothesis. While it is true that this implies a low probability of committing a type I error, the probability of committing a type II error would be quite high and this would imply that we would be demanding a lot of evidence against the null hypothesis to reject it, that is, , would increase the probability of not rejecting the null hypothesis even if it is false. `α=P( X ≥ 16/p= 0.05)= 1-∑_(i=0)^15▒(200¦i) (0.05)^i (0.95)^(200-i) = 1-0.951` `α=0.049` As we can see, the probability of committing type I error is much lower than that obtained with the 7% threshold, remember that it was α = 0.136. We can then come to the conclusion that depending on the strategy adopted a priori when carrying out the contrast, it will directly affect the probability levels of committing both errors. We can also conclude that by varying the rejection threshold, the probability of committing one of the types of error can be reduced, but always at the cost of increasing the other type of error. ## About the Author Idais, Graduated in Mechanical Engineering, and a master’s degree in teaching component, she gave classes in several institutes of mathematics and physics, but she also dedicated several years of my life as a television producer, she did the scripts for mikes, the camera direction, editing of video and even the location. Later she was dedicated to SEO writing for a couple of years. She likes poetry, chess and dominoes.