Edexcel A Level Maths: Statistics

Revision Notes



2.3.1 Outliers & Cleaning Data


Outliers

What are outliers?

  • Outliers are extreme data values that do not fit with the general pattern of the data
  • They can come from one or two extreme events or from mistakes in the data collection
  • Outliers will affect some statistics that are calculated from the data
    • They can have a big effect on the mean, but not on the median or usually the mode
    • The range can be changed considerably by a single outlier, whereas the interquartile range is usually affected very little, if at all
    • When calculating the mean or the range it is important to decide whether the outlier(s) should be included in the calculations
      • The question will tell you whether to include the outliers or not
      • You may have to decide which value is the outlier to be removed
      • In general outliers are included if they are a valid piece of data and excluded if it is likely that they are erroneous
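
The effect of an outlier on these statistics can be checked numerically. The short Python sketch below uses a made-up data set (not taken from these notes) and compares the statistics before and after the largest value is replaced by an extreme one. The quartiles come from Python's `statistics.quantiles` with its default method, which is an assumption here and may differ slightly from the quartile convention used in your course.

```python
from statistics import mean, median, quantiles


def summary(data):
    """Return the mean, median, range and interquartile range of a data set."""
    # quantiles() sorts the data itself; the default method is an assumption
    q1, _, q3 = quantiles(data, n=4)
    return {
        "mean": mean(data),
        "median": median(data),
        "range": max(data) - min(data),
        "iqr": q3 - q1,
    }


# Hypothetical data: the second list replaces 9 with the extreme value 90
without_outlier = summary([1, 2, 3, 4, 5, 6, 7, 8, 9])
with_outlier = summary([1, 2, 3, 4, 5, 6, 7, 8, 90])

print(without_outlier)
print(with_outlier)
```

For this particular data set the mean rises from 5 to 14 and the range from 8 to 89, while the median (5) and the interquartile range (5) stay the same.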

How are outliers calculated?

  • Most of the time within this syllabus, an outlier will be a value that lies more than a particular distance below the lower quartile or above the upper quartile
    • The most common way to identify an outlier is to use the following boundaries:
      • A value that is less than $Q_1 - k \times (\text{interquartile range})$
      • A value that is greater than $Q_3 + k \times (\text{interquartile range})$
      • k is a constant that will be given to you in the exam, commonly $k = 1.5$
  • Outliers could also be defined as values that lie more than a given number of standard deviations away from the mean
    • In this case the boundaries are:
      • A value that is less than $\bar{x} - k\sigma$
      • A value that is greater than $\bar{x} + k\sigma$
      • k is a constant that will be given to you in the exam, commonly $k = 2$
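
Both rules above amount to calculating a lower and an upper boundary and then checking whether a value falls outside them. A minimal Python sketch follows. The function names and the summary statistics in the example are hypothetical, chosen only for illustration.

```python
def iqr_boundaries(q1, q3, k=1.5):
    """Outlier boundaries a distance k x IQR beyond the quartiles."""
    iqr = q3 - q1
    return q1 - k * iqr, q3 + k * iqr


def sd_boundaries(mean, sd, k=2):
    """Outlier boundaries k standard deviations either side of the mean."""
    return mean - k * sd, mean + k * sd


def is_outlier(value, boundaries):
    """True if the value lies strictly outside the two boundaries."""
    lower, upper = boundaries
    return value < lower or value > upper


# Hypothetical summary statistics, not taken from the notes
print(iqr_boundaries(20, 30))                  # (5.0, 45.0)
print(sd_boundaries(50, 4))                    # (42, 58)
print(is_outlier(47, iqr_boundaries(20, 30)))  # True
```

In an exam the value of k is given in the question, so always use that value in place of the defaults shown here.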

How are outliers represented on box plots?

  • On a box plot, each outlier is marked with a cross, plotted beyond the minimum or maximum value at the ends of the whiskers
  • If the maximum or minimum value is discovered to be an outlier, the new maximum or minimum value will need to be found for the box plot
    • If the smallest or largest data value that is not an outlier is known, this will become the new minimum or maximum
    • If that data value is not known, the new minimum or maximum will become the outlier boundary
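
When the individual data values are known, finding the new ends of the whiskers can be sketched in Python as below. The function name, the data and the boundaries (5 and 20) are all hypothetical. If the individual values were not known, the whiskers would instead stop at the boundaries themselves.

```python
def box_plot_ends(data, lower_boundary, upper_boundary):
    """Return the whisker ends and the outliers, given the outlier boundaries."""
    outliers = sorted(x for x in data if x < lower_boundary or x > upper_boundary)
    kept = [x for x in data if lower_boundary <= x <= upper_boundary]
    # The whiskers end at the most extreme values that are not outliers
    return min(kept), max(kept), outliers


# Hypothetical data with outlier boundaries 5 and 20 already calculated
low, high, crosses = box_plot_ends([1, 10, 11, 12, 13, 14, 30], 5, 20)
print(low, high, crosses)  # 10 14 [1, 30]
```

Here the whiskers would run from 10 to 14, with crosses drawn at 1 and 30.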

Cleaning Data

When should data be cleaned?

  • The cause of the outlier should be examined by looking into the context of the data
  • For example:
    • a test score of over 100% would most likely be a data collection error
    • a single salary that is much higher than the others would likely be for the CEO of the company
  • If an outlier is determined to be from an error in data collection it should be removed from the data
    • Removing the incorrect data value(s) is called cleaning the data
    • It is important to consider very carefully whether you should remove the data value or not
      • If the data value is not an error it should not be removed from the data
  • If a data value is removed from the data set before calculations are carried out, a justification for the removal of the outlier must be made
  • Cleaning data also involves dealing with missing values and removing other errors in the data
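
As an illustration of the points above, the Python sketch below cleans a made-up list of percentage test scores. It removes missing entries and impossible values, and records a reason for each removal so that the decision can be justified. The function name, the data and the use of `None` for a missing value are assumptions made for this example. Treating a negative score as an error is also an assumption, added alongside the over-100% case mentioned above.

```python
def clean_scores(scores):
    """Split percentage scores into kept values and removed values with reasons."""
    kept, removed = [], []
    for score in scores:
        if score is None:
            removed.append((score, "missing value"))
        elif score < 0 or score > 100:
            removed.append((score, "impossible percentage, likely a recording error"))
        else:
            kept.append(score)
    return kept, removed


# Hypothetical test scores (%), with one missing entry and one impossible value
kept, removed = clean_scores([67, None, 104, 88, 45])
print(kept)  # [67, 88, 45]
for value, reason in removed:
    print(value, "removed:", reason)
```

A value that is merely unusual, such as a very high but possible score, is deliberately left in the data by this sketch.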

Worked Example

The ages, in years, of a number of children attending a birthday party are given below:

2, 7, 5, 4, 8, 4, 6, 5, 5, 29, 2, 5, 13

An outlier is defined as an observation that falls more than 1.5 × the interquartile range above the upper quartile or below the lower quartile.

(i)
Identify any outliers within the data set.

(ii)
Clean the data by deciding which values should be removed. Justify your answer.

[Image: worked example solution]
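
One way to work through part (i) is sketched in Python below. It is an illustration, not the notes' own solution. The `quartile` helper is hypothetical and assumes a common convention for discrete data: find n/4 (or 3n/4), round up to the next position if it is not a whole number, and take the midpoint of that value and the next if it is. Other quartile conventions can give slightly different quartiles.

```python
def quartile(sorted_data, k):
    """k-th quartile (k = 1 or 3) under the assumed convention described above."""
    n = len(sorted_data)
    if (k * n) % 4 == 0:
        i = k * n // 4
        # Whole-number position: midpoint of this value and the next one
        return (sorted_data[i - 1] + sorted_data[i]) / 2
    position = -(-k * n // 4)  # round k*n/4 up to the next whole number
    return sorted_data[position - 1]


ages = sorted([2, 7, 5, 4, 8, 4, 6, 5, 5, 29, 2, 5, 13])
q1, q3 = quartile(ages, 1), quartile(ages, 3)
iqr = q3 - q1
lower, upper = q1 - 1.5 * iqr, q3 + 1.5 * iqr
outliers = [age for age in ages if age < lower or age > upper]

print(q1, q3, iqr)   # 4 7 3
print(lower, upper)  # -0.5 11.5
print(outliers)      # [13, 29]
```

Under this convention the boundaries are -0.5 and 11.5, so 13 and 29 are outliers. For part (ii), one reasonable argument is that 29 is not a plausible age for a child at the party (it may be an adult's age or a recording error), so it could be removed, whereas 13 is a valid age for a child and should be kept.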

Exam Tip

  • Read the question carefully to determine which type of outlier you should be finding and to make sure you are using the correct method.



Author: Amber

Amber gained a first class degree in Mathematics & Meteorology from the University of Reading before training to become a teacher. She is passionate about teaching, having spent 8 years teaching GCSE and A Level Mathematics both in the UK and internationally. Amber loves creating bright and informative resources to help students reach their potential.


© Copyright 2015-2022 Save My Exams Ltd. All Rights Reserved.