The mean and standard deviation of a set of $n_{1}$ observations are $\bar{x}_{1}$ and $s_{1},$ respectively,while the mean and standard deviation of another set of $n_{2}$ observations are $\bar{x}_{2}$ and $s_{2},$ respectively. Show that the standard deviation of the combined set of $(n_{1}+n_{2})$ observations is given by $SD = \sqrt{\frac{n_{1}(s_{1})^{2}+n_{2}(s_{2})^{2}}{n_{1}+n_{2}}+\frac{n_{1} n_{2}(\bar{x}_{1}-\bar{x}_{2})^{2}}{(n_{1}+n_{2})^{2}}}$.

Vedclass pdf generator app on play store
Vedclass iOS app on app store
(N/A) Let the two sets of observations be $x_{i}$ $(i=1, 2, \ldots, n_{1})$ and $y_{j}$ $(j=1, 2, \ldots, n_{2})$.
The combined mean $\bar{x}$ is given by $\bar{x} = \frac{n_{1}\bar{x}_{1} + n_{2}\bar{x}_{2}}{n_{1} + n_{2}}$.
The variance of the combined set is $\sigma^{2} = \frac{1}{n_{1}+n_{2}} [\sum (x_{i}-\bar{x})^{2} + \sum (y_{j}-\bar{x})^{2}]$.
Using the identity $\sum (x_{i}-\bar{x})^{2} = \sum (x_{i}-\bar{x}_{1} + \bar{x}_{1}-\bar{x})^{2} = \sum (x_{i}-\bar{x}_{1})^{2} + n_{1}(\bar{x}_{1}-\bar{x})^{2} = n_{1}s_{1}^{2} + n_{1}d_{1}^{2}$,where $d_{1} = \bar{x}_{1}-\bar{x}$.
Similarly,$\sum (y_{j}-\bar{x})^{2} = n_{2}s_{2}^{2} + n_{2}d_{2}^{2}$,where $d_{2} = \bar{x}_{2}-\bar{x}$.
Substituting $d_{1} = \frac{n_{2}(\bar{x}_{1}-\bar{x}_{2})}{n_{1}+n_{2}}$ and $d_{2} = \frac{n_{1}(\bar{x}_{2}-\bar{x}_{1})}{n_{1}+n_{2}}$,we get:
$\sigma^{2} = \frac{n_{1}s_{1}^{2} + n_{2}s_{2}^{2}}{n_{1}+n_{2}} + \frac{n_{1}d_{1}^{2} + n_{2}d_{2}^{2}}{n_{1}+n_{2}}$.
Simplifying the term $\frac{n_{1}d_{1}^{2} + n_{2}d_{2}^{2}}{n_{1}+n_{2}}$ leads to $\frac{n_{1}n_{2}(\bar{x}_{1}-\bar{x}_{2})^{2}}{(n_{1}+n_{2})^{2}}$.
Thus,$SD = \sqrt{\frac{n_{1}s_{1}^{2}+n_{2}s_{2}^{2}}{n_{1}+n_{2}}+\frac{n_{1} n_{2}(\bar{x}_{1}-\bar{x}_{2})^{2}}{(n_{1}+n_{2})^{2}}}$.

Explore More

Similar Questions

The range of the following set of observations $2, 3, 5, 9, 8, 7, 6, 5, 7, 4, 3$ is:

The mean and variance of the observations $x_1, x_2, x_3, \ldots, x_{15}$ are respectively $2$ and $4$. If the mean and variance of the observations $y_1, y_2, \ldots, y_{10}$ are respectively $2$ and $5$,then the variance of the combined observations $x_1, x_2, \ldots, x_{15}, y_1, y_2, \ldots, y_{10}$ is

The mean and variance of seven observations are $8$ and $16$ respectively. If $5$ of the observations are $2, 4, 10, 12, 14$, then the square root of the product of the remaining two observations is (in $\sqrt{3}$)

The mean and variance of a set of $6$ terms are $11$ and $24$ respectively,and the mean and variance of another set of $3$ terms are $14$ and $36$ respectively. Then,the variance of all $9$ terms is equal to:

Suppose a class has $7$ students. The average marks of these students in the mathematics examination is $62$,and their variance is $20$. $A$ student fails in the examination if they get less than $50$ marks. In the worst-case scenario,what is the maximum number of students who can fail?

Vedclass Products

For Students

Vedclass Test Series

Mock tests in real JEE/NEET style with performance analysis. 5-day free trial.

Start Free Trial
For Teachers

Exam Paper Generator

Generate Set A/B/C/D exam papers from 7.5L+ questions in 2 minutes. 3 chapters free.

Try Free
For Institutes

Online Exam Module

Live online exams with unlimited students, 360° analytics & white-label branding.

See Demo