Part-III Treatment of Data 1 OVERVIEW (1) Units of measurement - - PowerPoint PPT Presentation

part iii treatment of data
SMART_READER_LITE
LIVE PREVIEW

Part-III Treatment of Data 1 OVERVIEW (1) Units of measurement - - PowerPoint PPT Presentation

Part-III Treatment of Data 1 OVERVIEW (1) Units of measurement (a) must be indicated in tables/graphs. (b) use scientific notation Examples 2.2x10 -6 2.2 V V 7.1x10 -2 71 m m m 8.34x10 4 83.4 kJ J 2.1x10 5 0.21 MW W 7.3x10 -11


slide-1
SLIDE 1

Part-III Treatment of Data

1

slide-2
SLIDE 2

OVERVIEW (1) Units of measurement (a) must be indicated in tables/graphs. (b) use scientific notation Examples 2.2x10-6 2.2 7.1x10-2 71 8.34x104 83.4 2.1x105 0.21 7.3x10-11 73

V µ

J

m

V

m m

kJ W MW F pF

slide-3
SLIDE 3

Tabulation of Data Whether one is measuring the distance to the moon using laser interferometry

  • r

measuring the soil strength using a penetrometer or bond angles/lengths using XRD, carefully made (and recorded) observations are the cornerstone of a good experimental work. (i) Repeat measurements These are made even when the quantity (or the value) is NOT varying. Timing the fall of an object through a given distance or measuring the wavelength of light emitted from a lamp containing helium gas. (ii) Relationship between two variables : so assign different values to as , and measure ,

y = f(x) x

1

x

......

2

x

y

1 2

y ......

slide-4
SLIDE 4

Table: Fall times (t) for an object to fall in air through 25 m (ambient temperature 298 K, 3 pm, 17-09-2013). t(s) 2.2 2.0 2.6 2.1 1.9 2.2 2.4 2.0 2.3 2.3

(3) Use Scientific Notation

For example, 2 2x10-3 ≅ 2x10-6 Pressure values (Pa) : 1.03x105 1.01x105 9.9x104 9.83x104 1.01x105 1.05x105 Table : Measured values of pressure. Pressure(x105 ) 1.03 1.01 0.99 0.983 1.01 1.05

  • r

Pressure( ) 103 101 99 98.3 101 105

mg

Pa

g kg

kPa

slide-5
SLIDE 5

(4) Uncertainties in measurements

Despite our best efforts and/or the quality of instruments “Variability”/”uncertainty” in experimental data. graduation of a thermometer ∴ there is nothing called “Exact” measurement. (One can only do the best possible !) Fluctuating nature of the mercury in a thermometer ∼ ± 0.5 .

  • C
slide-6
SLIDE 6

What does it mean ? What you mean is that the temperature could be anything from 19.5 to 20.5 , with the possible average value of 20 !

6

  • C
  • C
  • C
  • C

There is an uncertainty of 20 ± 0.5

slide-7
SLIDE 7

One must always include uncertainty estimates.

Table : Dependence of electrical resistance on temperature of a Copper wire (Kirkup : Experimental Methods, Wiley 1994).

T Electrical Resistance 281 0.208 289.5 0.213 296.5 0.222 305.0 0.229 313.5 0.232 327.5 0.243

(K) ± 0.5K

Ω ±0.001Ω

slide-8
SLIDE 8

8

OTHER FACTORS

Compounding of uncertainties? Does uncertainty depend upon the magnitude of the measurement ?

slide-9
SLIDE 9

(5) Significant Figures

If a value is recorded as 5.1 2 your experiment is able to distinguish between 5.1 1 and 5.1 3 Similarly, 5.123 5.122 & 5.124 5.12 3 significant digits 5.123 4 significant digits Resist the temptation to record all the figures given by an instrument display or a calculator or a computer. Let us look at the value of 0.0020409

slide-10
SLIDE 10

how many significant figures are there ? between the first non-zero and the last digit inclusive five significant figures Exercise 2.564 four 0.00489 three 64000 ? 1.20 three 1.2 two 0.20000878 eight

slide-11
SLIDE 11

Rounding off Numbers

11

In calculations of derived variables, rounding off is needed. How to do it ?

slide-12
SLIDE 12

1.8671132 1.8 or 1.9 1.87 1.867 CALCULATION & SIGNIFICANT FIGURES Let us say, for a cylinder, D = 7.9 L = 1.5 X- Sectional area, Is this sensible ? Obviously not. Why ?

Reduce to 4 SF Reduce to 3 SF Reduce to 2 SF

mm mm

( )

2 2

D 7.9 A = = 4 4 π π

2

A = 49.016699mm

slide-13
SLIDE 13

Well, if D is known to 2 significant figures, area can not be calculated to 8 significant figures !! Volume of cylinder ∴ V = 73

2

A = 49mm

( )

2 2 3

D = L 4 = × 7.9 ×1.5 4 = 73.525049mm π π

3

mm

slide-14
SLIDE 14

Guidelines for rounding off

14

Rule # 1 :Addition/Subtraction Round off the result to the same number of decimal places = least number of decimal places of the constituents

  • Ex. 11.39 – 7.897 + 12.3538 = 15.8468

Correct answer is 15.85

slide-15
SLIDE 15

Rule # 2 : Multiplication /Division The number of significant digits in the answer = least number of significant digits in the primary numbers.

CAVEAT Volume

One might be tempted to round it up to 74 ! Do not round off the intermediate results

3.17×3.393×3.3937 = 36.501992

3 significant figures 8 significant figures 36.5

3

= Area×L = 49×1.5 = 73.5mm

3

mm

slide-16
SLIDE 16

PROBLEM

In an experiment, the density (ρ) of a metallic sphere is to be measured. Density ρ is defined as mass per unit volume , i.e., m/v. Lab notebook has the following entries

Mass of sphere = 0.44 Diameter of sphere = 4.76 ∴ Does it make sense ?

( )

3 3 3

d 4.76 V = = = 451.761761m m 6 6 π π

g

3

d V = 6 π

mm

  • 4

0.44 = = 9.739647 ×10 451.761761 ρ

slide-17
SLIDE 17

PRESENTATION OF DATA

(Graphs, bar/ pie charts)

  • 1. Purpose

Graphs: our visual ability is very strong to detect trends as

  • pposed to tabulated information.

x y 9.3 19.0 7.3 15.0 9.8 20 1.8 4 5.3 11

slide-18
SLIDE 18

Graph/plot can be used to show the:

a) Range of data b) Uncertainty in each measurement c) Trend or the absence of a trend d) Data which do not fall in line with the majority

  • f data points.
slide-19
SLIDE 19

Independent variable Dependent variable Horizontal axis (Independent variable) Vertical axis (Dependent variable) DEPENDENT/INDEPENDENT VARIABLES 2-D plots Independent variables : controlled or which is varied systematically P can be varied through a high pressure gas line P Q

P Q

slide-20
SLIDE 20

Effect of temperature on solubility of salt in water Amount of water = 1 m3 Mass of salt which can be dissolved = M kg ∴ solubility, S = M/1 (kg/m3) T (K) S (kg/m3)

S1 S2 S3 298 315 325

Independent variable Dependent variable

slide-21
SLIDE 21

Example: Effect of temperature on the length of a wire

T (K) L (m) 273 1.1155 298 1.1164 323 1.1170 348 1.1172 373 1.1180 398 1.1190 423 1.1199 448 1.1210 473 1.1213 498 1.1223 523 1.1223

slide-22
SLIDE 22

(3) ORIGINS Is it necessary to have a (0,0) point on both axes ? Re-plot the T-L data This is not a good graph ! Why ?

slide-23
SLIDE 23

(4) Error bars These show the uncertainties in both variables: For the independent variable: For the dependent variable: for any ( x, y) data point

Time (s) ± 5 s Temperature (°C) ± 4 °C 10 125 70 116 125 104 190 94 260 87 320 76 370 72 Uncertainty in y- value Uncertainty in x- value Cooling of an object

slide-24
SLIDE 24

NOTE 1: If uncertainties are not constant, the size of the error bars must also vary. NOTE 2: % error is not same for each data point.

slide-25
SLIDE 25

(5) Types of Graphs

  • Linear x-y graphs
  • Semi-log graphs (one scale is linear and one is logarithmic)
  • Log- Log graphs (both scales are logarithmic)

(6) Linear x-y plots

Dependence of relative density on sugar concentration at 298 K.

Concentration (kg/m3) Relative density (-) 1.005 50 1.034 100 1.066 150 1.095 200 1.122 250 1.150

slide-26
SLIDE 26

Draw the best line by naked eye ? For RD = 1.053, calculate the sugar concentration

slide-27
SLIDE 27

CAVEATS

(1) Do not ignore outliers - investigate them further (2) Interpolation of results is justified when there is a sufficient number of data points (3) Extrapolation should always be avoided.

? ?

slide-28
SLIDE 28

y = mx + C

For two data points, one can only draw one line and this is also the best line : (x1,y1), (x2,y2) and C = value of y at x = 0 But when we have a large number of data points (xi ,yi) (i=20 say) which pair of data points should be used to calculate the values of m and C? BEST FIT What are the uncertainties in the best fit values of m and C?

intercept slope

2 1 2 1

y y m x x − = −

What are the two characteristics of a linear graph/plot?

slide-29
SLIDE 29

T (h) Size (mm)

0.5 1.5 ± 0.3 1 2.3 ± 0.3 1.5 3.3 ± 0.3 2 4.3 ± 0.3 2.5 5.4 ± 0.3

Example

slide-30
SLIDE 30

Linearization of Equations

Prior knowledge about the expected form of

dependence.

Original data might exhibit a non-linear relationship. Transform one of the variables in such a fashion that

transformed variable leads to a linear relationship.

M

Period of oscillation (T) ~ mass of the body (M)

slide-31
SLIDE 31

M (kg) T (s)

0.02 0.7 0.05 1.11 0.10 1.6 0.20 2.25 0.30 2.76 0.40 3.18 0.50 3.58 0.60 3.97 0.70 4.16 0.80 4.60 Dependence is not linear

Let us recall our high school physics:

M T ∝ M = 2 π k

(k : spring constant)

∴ T ∝ M

slide-32
SLIDE 32

2 T M k π =

y = m x + C

We expect C = 0 Linear graphs great advantage

slide-33
SLIDE 33

Constant acceleration (a) Initial Velocity @ t = 0 u = u s : distance travelled in time t a = 9.81 m/s2 u = 1 m/s

2

1 2 s ut at = +

Relationship between “s” and “t” is quadratic

EXAMPLE - 1

1 2 s u at t = +

y C mx

slide-34
SLIDE 34

Linearization has come about at a price? We should now estimate uncertainty in (s/t) NOT in s.

m = a/2

s/t

C = u

t 5

slide-35
SLIDE 35

EXAMPLE-2

For a radioactive material, N = N0 exp(-λt) N : undecayed nuclei @ time t N0: Initial value of N @ t = 0 λ : characteristic constant of material

slide-36
SLIDE 36

How to linearize it?

ln N = ln N0 + (-λt) ln e ln e = 1 ln N = (-λ)t + ln N0 y = mx + C “Semi-log” plot

C = ln N0

ln N t

−λ

slide-37
SLIDE 37

Now we must estimate uncertainity in lnN? Finally, let us come to log-log graphs: Motivation Sometimes no matter what we do, it is not possible to choose suitable scales for linear graphs.

Table : Current-Voltage relationship for a silicon diode

Voltage (V) I (Amperes) 0.35 9 x 10-7 0.40 3 x 10-6 0.45 5 x 10-5 0.50 2 x 10-4 0.55 1.7 x 10-3 0.60 1.5 x 10-2 0.65 7.5 x 10-2 0.70 0.55 0.75 3.5

slide-38
SLIDE 38

Semi-Log Linear

V I

0.3 0.4 0.5 0.6 0.7 0.8 0.5 1 1.5 2 2.5 3 3.5 4

V I

0.3 0.4 0.5 0.6 0.7 0.8 10

  • 7

10

  • 6

10

  • 5

10

  • 4

10

  • 3

10

  • 2

10

  • 1

10 10

1

slide-39
SLIDE 39

When both variables entail several

  • rders of magnitudes

Use double log coordinates

y = axb

ln y = ln a + b ln x

ynew C m xnew

slide-40
SLIDE 40

40

log-log scale

slide-41
SLIDE 41

NATURE OF UNCERTAINTIES

Uncertainty is an inevitable evil, both in experimental and numerical studies. Let us look at a simple test: Same object, constant value of S, same operator/equipment.

Time(s) 0.74 0.71 0.73 0.63 0.69 0.75 0.70 0.71 0.74 0.81

What do you make of these measurements?

Uncertainty is an inherent part of experiments

s

water

slide-42
SLIDE 42

Two Questions

1) Identification of sources: temperature, tube not being vertical, object is not being dropped at the same location/different initial condition/air bubbles attached to it…. 2) Quantification of the uncertainty

Practice problems

  • Fill a coffee making kettle with 1L of water and record the time it

takes for the water to boil? What factors will contribute to the variability

  • f the results?
slide-43
SLIDE 43

(1) Uncertainty

(i) Single measurement:

  • No method to establish the extent of uncertainty. Repeat the test at least
  • ne more time.
  • There are situations when it is not possible to repeat a test: biology, radio-

activity, CERN, on the surface of the moon, etc.

  • Test itself is varying with time.

(2) Uncertainty stems from:

(a) Resolution of instruments: What is the minimum value the instrument can measure? Length: 1 mm graduations 0.5 mm For better resolution, one can use a micrometer or Vernier callipers, but these also have their least count.

375 1.2mm 373.8 truevalue 376.2 mm ± → ≤ ≤

This is what is used for a single test, i.e., the uncertainty introduced by the instrument.

slide-44
SLIDE 44
slide-45
SLIDE 45

(b) Reading uncertainty:

N = 0 T Corresponds to the value at a fixed point. N ≠ 0 Thermometer shows wild fluctuations.

T water

heating

So, if you make a single measurement, we can not evaluate the uncertainty arising from the heating process.

(c) Calibration uncertainty:

All instruments require benchmarking or calibration which can change over a period of time !!

slide-46
SLIDE 46

What is the way forward?

The mean or average comes in handy-returning to our earlier example.

Time(s) 0.74 0.71 0.73 0.63 0.69 0.75 0.70 0.71 0.74 0.81

min

t = 0.63 s

max

t =0.81s

On average, this is the result we can expect.

n min i i 1

1 0.74 0.71 0.73 .... t t n 10 t 0.721s

=

+ + + = = =

slide-47
SLIDE 47
  • No. of significant figures?

What is the uncertainty in the mean value?

max min

range (spread) x x = −

max min

x x Uncertainty = n −

For our example:

0.81 - 0.63 Uncertainty = = 0.018 10

Now we should round off, the mean value sensibly to xmean= 0.72 s P robable value of x (or t in our case) = 0.72 0.018s ± ∴

slide-48
SLIDE 48

One can also quote % uncertainty

uncertainty 100 % uncertainty = mean value 0.018 = 100 2.5% 0.72 × × =

mean

Therefore, t = x = 0.72s with ±2.5% uncertainty

Without repeating measurements, one can not estimate the uncertainty.

slide-49
SLIDE 49

(3)True value, accuracy & precision

Aim of an experiment true value But this is impossible to do. On the other hand, we are trying to approximate the true value by an average or mean value, i.e., How many times we must repeat the measurements? Recall, Larger the value of n, closer will be to its true value. If Our measurements are accurate.

true

x x ≈

i

1 x = x n ∑

x

t r u e

x x ≈

slide-50
SLIDE 50

For example, the charge of an electron is known to be i.e., an uncertainty of

What is precision?

Uncertainty is small range is small, but it does not mean that the results are accurate! How?

Let us look at an example:

Boiling point of water @ 1 standard atmosphere T (oC) 102.4 102.6 102.3 102.6 102.4 102.7 102.4 102.4 102.5 102.6

19

( 1.6021773 0.0000005) 10 C

− ± ×

5

3 10 %

×

slide-51
SLIDE 51
  • mean
  • T

x 102.49 C 102.7 102.3 Uncertainty = 0.04 C 10 Boiling point of water = 102.49 0.04 C = = − = ∴ ±

This looks very impressive in terms of precision except that it is not very accurate.!

Let us use a different thermometer (+0.5 oC) T (oC) 101.0 101.0 100.5 100.5 99.0 99.5 99.0 100.5 101.0

  • x

100.2 C 101 99 uncertainty = = 0.2 C 10 Boiling point of water = 100.2 0.2 C = − ∴ ± Less precise, but more accurate experiments.

slide-52
SLIDE 52

In summary,

Accurate close to the true value Precise Low uncertainty, but not necessarily close to the true value Accurate & Precise close to the true value, with a small uncertainty

slide-53
SLIDE 53

TYPES OF UNCERTAINTIES (1) Systematic

Difficult to detect and deal with. Offset uncertainty Melting point of ice

ice+water mixture Thermocouple

  • x( C) -7.5, -6.9, -7.3
  • 7.4, -7.6, -7.4
  • 7.3, -7.7, -7.6
  • 7.6
  • x

7.43, uncertainty = 0.08 C = −

Very precise but inaccurate measurements!

slide-54
SLIDE 54

The true value is expected to be close to zero! There is a big offset error here. Check your calibration, electronic gadgets, warm up period, insensitive thermocouple, etc.

On the other hand, for a plasma furnace (~ 1500 oC), 7.5 oC is not a significant

  • ffset.

Try to develop a feel for the answer you are looking for!

Gain uncertainty: This varies with the magnitude of quantity itself.

Example: Calibration masses and electronic balance

Standard mass (g) 0.00 20.00 40.00 60.00 80.00 100.00 Electronic balance value (g) 0.00 20.18 40.70 61.00 81.12 101.68

slide-55
SLIDE 55

The difference between the two values increases as the mass increases.

slide-56
SLIDE 56

(2) Random Uncertainties:

  • These are responsible for scatter in the measurements.
  • Environmental factors can also introduce random uncertainty:

electrical interference (switching on/off equipment, vibrations caused by rotomachinery, power supply fluctuations (water/air/steam main pressures etc.) If these are truly random, the averaging of several measurements will even out this effect.

slide-57
SLIDE 57

COMBINING UNCERTAINTIES

So far we have talked about uncertainties when we are interested in the measurement directly. Engineering experiments we need to combine several measurements to calculate the quantity of interest Let us say you are given a cylindrical bar of an unknown metal and we want to calculate its density.

slide-58
SLIDE 58

D L

mass volume ρ=

m D L

Uncertainty in the value of depends upon the uncertainties in the measured values of m, D, L ρ

2

m D L 4 ρ = π

D D D L L L m m m ≡ ± ∆ ≡ ± ∆ ≡ ± ∆ ρ ≡ ρ ± ∆ρ

slide-59
SLIDE 59

One way to estimate ∆ρ

∆ρ

D D D ≡ + ∆ D D D ≡ − ∆ L L L ≡ + ∆ L L L ≡ + ∆ L L L ≡ − ∆ L L L ≡ − ∆

m m m ≡ − ∆ m m m ≡ + ∆ m m m ≡ − ∆ m m m ≡ + ∆ m m m ≡ + ∆

m m m ≡ − ∆

m m m ≡ + ∆ m m m ≡ − ∆

This looks like a lot of hard work!!

slide-60
SLIDE 60

We can be a little smarter than this:

( )

2

m D L 4 ρ = π ln ln m ln ln 4 2ln D ln L Differentiate it: m D L 2 m D L ρ= − π+ − − ∆ρ ∆ ∆ ∆ = − − ρ

Multiply this equation by 100 on both sides

% uncertainty in = % uncertainty in m + 2 % uncertainty in D + % uncertainty in L ρ ×

NOTE:

  • Uncertainty in D is multiplied by 2.
  • All terms have been added up.
slide-61
SLIDE 61

Statistics is a science of numbers ! It helps you draw good inferences, but it also gives you confidence to tell lies !! “There are three kind of lies – lies, damned lies, and statistics.”

(attributed to Ben Desraeli & Mark Twain).

Naturally, statistics and statistical methods “large dataset”.

STATISTICAL ANALYSIS OF DATA

slide-62
SLIDE 62

DEFINITIONS

(i) Variance of a dataset

is assumed to be the best estimate of the true value of the quantity. Despite uncertainty in each individual measurement, , the single value ≈ true value.

62

x x

slide-63
SLIDE 63

Example: Time to slide down the plane:

xi (s) 0.74

  • 0.021

0.000441 0.74

  • 0.021

0.000441 0.69 0.029 0.000841 0.68 0.039 0.001521 0.80

  • 0.081

0.006561 0.71 0.009 0.000081 0.78

  • 0.061

0.003721 0.65 0.069 0.004761 0.67 0.049 0.002401 0.73

  • 0.011

0.000121 = 0.719 ∑di = 0 ∑di

2 = 0.02089

i i

d = x-x (s)

( )

=

2 2 i i

d x-x

2

(s )

θ

0.74, 0.74, 0.69, 0.68, 0.80, 0.71, 0.78, 0.65, 0.67, 0.73

i

1 x = x = 0.719 10 x

slide-64
SLIDE 64

2 2

0.02089 0.002089 10 s σ = = Variance, For our example,

( )

2 2 2 i i

x x d n n σ ∑ − ∑ = =

Another related parameter Standard deviation

( )

2 2

0.002089 0.04571

i

x x n s σ σ σ ∑ − = = = = Usually σ is NOT strongly dependent on n.

slide-65
SLIDE 65

Let us say :- We have repeat data sets.

Series I Series II

  • Series

VI 43

  • 55
  • 53
  • 52
  • 55
  • 52
  • 51
  • 54
  • 50
  • 52
  • I

II III IV V VI VII VIII 51 51.7 50.4 51.5 51.7 50.4 52.5 49.5

σ

3.13 3.29 3.07 3.11 3.20 2.94 2.73 3.20

Uncertainty in the mean of repeat measurements:

x

slide-66
SLIDE 66
  • Variability in the “means” < Variability in each set.

Therefore, the best estimate of x is 51.1 ± 0.893 This, however, only eliminates the role of random uncertainty & NOT

  • f the systematic uncertainty.

( ) ( )

1 2

51.1; 0.893

i x

x x x n σ   −   = = =    

x

n σ σ = COMMENTS

slide-67
SLIDE 67

If there are sufficient number of data points without any systematic uncertainties : This is more or less the universal curve which is encountered literally in every application relying on numerous data points. This is called Normal distribution or Bell-shaped curve.

Frequency or distribution

x

slide-68
SLIDE 68

Two metrics are needed to describe this population?

  • Mean () (Height of peak)
  • Standard deviation (σ) (spread of curve)

Line of symmetry

x-σ x +σ x

≤ ≤ x-σ x x +σ :

Area under the curve between these limits α number of data points lying in this range.

± x σ : ± x 2σ : x ± 6 σ :

∼ 70% of the total area ∼ 95% of the total area ∼ 99.9999% of the total area

What is so special about this curve?

slide-69
SLIDE 69

On one hand, we wish to have data which are reliable, reproducible and with as small uncertainty as possible, one can not go on making ∞ repeat measurements. Let us say that a population of measurements with mean µ and σpop: We would like our “sample” (small sub-set of population) such that sample mean ≈ µ and

( )

1 2 2 i i p o p

x n x n µ µ σ ∑ =   ∑ −   =      

Evidently, µ = true value

( )

1 2 2

1

i p o p

x x s n σ   ∑ − = =   −    

Formal Treatment of Population & Sample

slide-70
SLIDE 70

x-σ x x +σ

If 70% of the data lie within ±σ, so we can say that there is a 70% probability to predict the expected outcome within ±σ. If 95% of the data lie within ±2σ, so we can say that there is a 95% probability that we can predict the expected outcome within ̅x̅ ±2σ, etc.

Confidence Bands

x

slide-71
SLIDE 71

REJECTION OF DATA

Some will argue “all data are equal”. It is not correct to through away any data point. Other extreme is that “one data set looks like spurious or suspect” and therefore is less reliable than the other sets. There are statistical tests to deal with this issue. Therefore, the automatic filtering by a computer program or another device should be assessed properly. The question is:“truly spurious” vs.“new phenomenon”? Therefore, meticulous recording of data, observations, unusual features, frequent voltage fluctuations, exceptional temperature, etc. all must be documented in detail in lab notebooks.

slide-72
SLIDE 72

72

Methodology of rejecting data

One data point strikingly disagrees with all the others.

Fall time (seconds) of an object in a liquid : 3.8 , 3.5 , 3.9 , 3.9 , 3.4 , 1.8

Recall that individual data can differ within a band from each other. However, legitimate discrepancy of this size is highly improbable.

Data rejection Controversial Important

very different from all others !!

slide-73
SLIDE 73

73

3.8 3.5 3.9 3.9 3.4 1.8 3.38 6 t + + + + + = ≃

3.4 t s = 0.8s σ =

Our suspect measurement of 1.8s deviates ,

3.4 1.8 1.6s − =

i.e., by 2σ Assuming Normal or Gaussian distribution, we can calculate the probability of a measurement lying outside ±2σ : ؞ Probability (outside 2σ) = 1 – probability (within 2σ)

slide-74
SLIDE 74

74

95.45 %

2σ 3.4

؞ Probability (outside 2σ) = 1 – 0.9545 = 0.0455 < 0.05 Thus, there is only 5% chance of a measurement lying outside ± 2σ , i.e., 1 in 20 measurements could be beyond ± 2σ . Out of 6 measurements, only 6 x0.05 = 0.3 is likely to be beyond ± 2σ . Chauvenet’s criterion : If this number < 0.5 , this data can be rejected.

slide-75
SLIDE 75

75

1 2

For N measurements : , ,.....,

N

x x x : value in doubt

sus

x

sus sus

x x t σ − =

# of standard deviations Find the probability of (outside tsusσ).

  • No. of expected deviants , n = N x Prob (outside tsusσ)

If n < 0.5 reject the data point in question and re-calculate , , etc. x σ

slide-76
SLIDE 76

76

EXAMPLE A student makes 10 measurements of mass (g) as follows : 46 , 48 , 44 , 38 , 45 , 47 , 58 , 44 , 45 , 43. Our suspect is : 58

45.8 , 5.1 x σ = =

؞ 5845.82.45.1

sus

t − = =

i.e., our suspect deviates by 2.4σ. Prob (outside ± 2.4σ) = 1 – Prob (inside ± 2.4σ) = 1 – 0.9836 = 0.016 ؞ In a set of 10 measurements , 10 x 0.016 = 0.16 of data can be

  • utside ± 2.4σ.
slide-77
SLIDE 77

77

Since 0.16 < 5 , we can safely reject this data point. New results are :

44.4, 2.9 x σ = =

Not much change in , but σ has dropped significantly.

x

Remember, the choice of n < 0.5 is arbitrary.

slide-78
SLIDE 78

78

CONCLUDING REMARKS

  • Units of measurements, scientific notation, their

representation, uncertainties, significant figures.

  • Presentation of data (Graphs, tables, bar/pie

charts).

  • Uncertainties: Systematic/Random
  • Statistical analysis of data: linear regression/non-

linear regression, adequacy of fit, R2, etc.