Cookies

We use cookies to improve your experience on our website By continuing to browse the site you are agreeing to our use of cookies.
Our privacy policy

Save My Exams Logo
  • GCSE
  • IGCSE
  • AS
  • A Level
  • O Level
  • Pre U
  • IB
  • Login
  •  
MathsBiologyChemistryPhysicsCombined ScienceEnglish LanguageOther Subjects
GCSE > Maths
Edexcel Topic QuestionsRevision NotesPast PapersPast Papers (old spec)
AQA Topic QuestionsRevision NotesPast Papers
OCR Topic QuestionsRevision NotesPast Papers
GCSE > Biology
Edexcel Topic QuestionsRevision NotesPast Papers
AQA Topic QuestionsRevision NotesPast Papers
OCR Gateway Topic QuestionsRevision NotesPast Papers
GCSE > Chemistry
Edexcel Topic QuestionsRevision NotesPast Papers
AQA Topic QuestionsRevision NotesPast Papers
OCR Gateway Topic QuestionsRevision NotesPast Papers
GCSE > Physics
Edexcel Topic QuestionsRevision NotesPast Papers
AQA Topic QuestionsRevision NotesPast Papers
OCR Gateway Topic QuestionsRevision NotesPast Papers
GCSE > Combined Science
Edexcel Combined: Biology Revision NotesPast Papers
Edexcel Combined: Chemistry Revision NotesPast Papers
Edexcel Combined: Physics Revision NotesPast Papers
AQA Combined: Biology Topic QuestionsRevision NotesPast Papers
AQA Combined: Chemistry Topic QuestionsRevision NotesPast Papers
AQA Combined: Physics Topic QuestionsRevision NotesPast Papers
OCR Gateway Combined: Biology Topic QuestionsRevision Notes
OCR Gateway Combined: Physics Revision Notes
GCSE > English Language
AQA Revision NotesPractice PapersPast Papers
Edexcel Past Papers
OCR Past Papers
GCSE > Other Subjects
AQA English LiteratureBusiness StudiesComputer ScienceEconomicsFurther MathsGeographyHistoryPsychologySociologyStatistics
Edexcel English LiteratureBusiness StudiesComputer ScienceGeographyHistoryPsychologyStatistics
OCR English LiteratureBusiness StudiesComputer ScienceEconomicsPsychology
OCR Gateway GeographyHistory
MathsBiologyChemistryPhysicsDouble ScienceEnglish LanguageGeographyOther Subjects
IGCSE > Maths
Edexcel Topic QuestionsRevision NotesPast PapersBronze-Silver-Gold Questions
CIE (Extended) Topic QuestionsRevision NotesPast Papers
CIE (Core) Topic QuestionsPast Papers
IGCSE > Biology
Edexcel Topic QuestionsRevision NotesPast Papers
CIE Topic QuestionsRevision NotesPast Papers
IGCSE > Chemistry
Edexcel Topic QuestionsRevision NotesPast Papers
CIE Topic QuestionsRevision NotesPast Papers
IGCSE > Physics
Edexcel Topic QuestionsRevision NotesPast Papers
CIE Topic QuestionsRevision NotesPast Papers
IGCSE > Double Science
Edexcel Double: Biology Topic QuestionsRevision NotesPast Papers
Edexcel Double: Chemistry Topic QuestionsRevision NotesPast Papers
Edexcel Double: Physics Topic QuestionsRevision NotesPast Papers
IGCSE > English Language
CIE Revision NotesPractice PapersPast Papers
Edexcel Past Papers
IGCSE > Geography
CIE Past Papers
Edexcel Past Papers Topic QuestionsPast Papers
IGCSE > Other Subjects
CIE Additional MathsEnglish LiteratureBusinessComputer ScienceEconomicsHistorySociology
Edexcel English LiteratureBusinessComputer ScienceHistoryFurther Maths
MathsBiologyChemistryPhysicsEnglish LanguageOther Subjects
AS > Maths
Edexcel Pure MathsMechanicsStatistics
AQA Pure MathsMechanicsStatistics
OCR Pure MathsMechanicsStatistics
CIE Pure 1Pure 2MechanicsProbability & Statistics 1
Edexcel IAS Pure 1Pure 2MechanicsStatistics
AS > Biology
AQA Topic QuestionsRevision NotesPast Papers
OCR Revision NotesPast Papers
CIE 2019-2021 Topic QuestionsRevision NotesPast Papers
CIE 2022-2024 Topic QuestionsRevision NotesPast Papers
Edexcel IAL Revision Notes
AS > Chemistry
Edexcel Revision Notes
AQA Topic QuestionsRevision NotesPast Papers
OCR Revision Notes
CIE 2019-2021 Topic QuestionsRevision NotesPast Papers
CIE 2022-2024 Topic QuestionsRevision NotesPast Papers
Edexcel IAL Revision Notes
AS > Physics
Edexcel Revision Notes
AQA Topic QuestionsRevision NotesPast Papers
OCR Revision NotesPast Papers
CIE 2019-2021 Topic QuestionsRevision NotesPast Papers
CIE 2022-2024 Topic QuestionsRevision NotesPast Papers
Edexcel IAL Revision Notes
AS > English Language
AQA Past Papers
Edexcel Past Papers
OCR Past Papers
AS > Other Subjects
AQA Business StudiesComputer ScienceEconomicsEnglish LiteratureFurther MathsGeographyHistoryPsychologySociology
Edexcel Business StudiesEconomicsEnglish LiteratureFurther MathsGeographyHistoryPsychology
OCR Business StudiesComputer ScienceEconomicsEnglish LiteratureFurther Maths AGeographyHistoryPsychologySociology
CIE Further Maths
MathsBiologyChemistryPhysicsEnglish LanguageEconomicsPsychologyOther Subjects
A Level > Maths
Edexcel Pure MathsMechanicsStatistics
AQA Pure MathsMechanicsStatistics
OCR Pure MathsMechanicsStatistics
CIE Pure 1Pure 3MechanicsProbability & Statistics 1Probability & Statistics 2
Edexcel IAL Pure 1Pure 2Pure 3Pure 4Mechanics 1Mechanics 2Statistics 1Statistics 2
A Level > Biology
Edexcel Topic QuestionsPast Papers
Edexcel A (SNAB) Revision Notes
AQA Topic QuestionsRevision NotesPast Papers
OCR Topic QuestionsRevision NotesPast PapersGold Questions
CIE 2019-2021 Topic QuestionsRevision NotesPast Papers
CIE 2022-2024 Topic QuestionsRevision NotesPast Papers
Edexcel IAL Topic QuestionsRevision NotesPast Papers
A Level > Chemistry
Edexcel Topic QuestionsRevision NotesPast Papers
AQA Topic QuestionsRevision NotesPast Papers
OCR Topic QuestionsRevision NotesPast PapersGold Questions
CIE 2019-2021 Topic QuestionsRevision NotesPast Papers
CIE 2022-2024 Topic QuestionsRevision NotesPast Papers
Edexcel IAL Topic QuestionsRevision NotesPast Papers
A Level > Physics
Edexcel Topic QuestionsRevision NotesPast Papers
AQA Topic QuestionsRevision NotesPast Papers
OCR Topic QuestionsRevision NotesPast Papers
CIE 2019-2021 Topic QuestionsRevision NotesPast Papers
CIE 2022-2024 Topic QuestionsRevision NotesPast Papers
Edexcel IAL Topic QuestionsRevision NotesPast Papers
A Level > English Language
AQA Past Papers
CIE Past Papers
Edexcel Past Papers
OCR Past Papers
Edexcel IAL Past Papers
A Level > Economics
Edexcel Past PapersPast Papers Topic Questions
AQA Past PapersPast Papers Topic Questions
OCR Past Papers
CIE Past Papers
A Level > Psychology
AQA Past Papers Topic QuestionsPast Papers
CIE Past Papers
Edexcel Past Papers
OCR Past Papers
Edexcel IAL Past Papers
A Level > Other Subjects
AQA Business StudiesComputer ScienceEconomicsEnglish LiteratureFurther MathsGeographyHistorySociology
CIE BusinessComputer ScienceEconomicsEnglish LiteratureFurther MathsGeographySociology
Edexcel Business StudiesEconomics AEnglish LiteratureFurther MathsGeographyHistory
OCR Business StudiesComputer ScienceEconomicsEnglish LiteratureFurther Maths AGeographyHistorySociology
Edexcel IAL English LiteratureGeography
CIE IAL History
BiologyChemistryPhysicsOther Subjects
O Level > Biology
CIE Topic QuestionsPast Papers
O Level > Chemistry
CIE Topic QuestionsPast Papers
O Level > Physics
CIE Topic QuestionsPast Papers
O Level > Other Subjects
CIE Additional MathsMaths D
MathsBiologyChemistryPhysics
Pre U > Maths
CIE Topic QuestionsPast Papers
Pre U > Biology
CIE Topic QuestionsPast Papers
Pre U > Chemistry
CIE Topic QuestionsPast Papers
Pre U > Physics
CIE Topic QuestionsPast Papers
MathsBiologyChemistryPhysics
IB > Maths
Maths: AA HL Topic QuestionsRevision Notes
Maths: AI HL Topic QuestionsRevision Notes
Maths: AA SL Topic QuestionsRevision NotesPractice Papers
Maths: AI SL Topic QuestionsRevision NotesPractice Papers
IB > Biology
Biology: SL Topic QuestionsRevision Notes
Biology: HL Topic QuestionsRevision Notes
IB > Chemistry
Chemistry: SL Topic QuestionsRevision Notes
Chemistry: HL Topic QuestionsRevision Notes
IB > Physics
Physics: SL Topic QuestionsRevision Notes
Physics: HL Revision Notes

OCR A Level Maths: Statistics

Revision Notes

Home / A Level / Maths: Statistics / OCR / Revision Notes / 2. Data Presentation & Interpretation / 2.4 Correlation & Regression / 2.4.1 Correlation & Regression


2.4.1 Correlation & Regression


Scatter Diagrams

What does bivariate data mean?

  • A lot of statistics is about looking at how different factors, or variables change how data behaves
  • Bivariate data is data which is collected on two variables and looks at how one of the factors affects the other
    • Each data value from one variable will be paired with a data value from the other variable
    • The two variables are often related, but do not have to be

What is a scatter diagram?

  • A scatter diagram is a way of graphing bivariate data
    • You may be asked to plot, or add to, a scatter diagram
    • One variable will be on the begin mathsize 16px style x end style – axis and the other will be on the  begin mathsize 16px style y end style– axis
    • The variable that can be controlled in the data collection is known as the independent or explanatory variable and is plotted on the size 16px x – axis
    • The variable that is measured or discovered in the data collection is known as the dependent or response variable and is plotted on the begin mathsize 16px style y end style – axis
  • Scatter diagrams allow statisticians to look for relationships between the two variables
    • Some scatter diagrams will show a clear relationship know as correlation (see below)
    • Others will not display on obvious relationship
    • If a scatter diagram shows a relationship you may be asked to identify outliers

Worked Example

The scatter diagram below shows the number of Save My Exams question packs completed by a group of students and the percentage score they received in their A-Level Statistics exam.

2-4-1-scatter-diagrams-we-diagrams

(i)
State which of the variables is the explanatory variable and which is the response variable.

 

(ii)
Explain why the number of question packs completed is on the x– axis.

 

(iii)
Another student completed 50 question packs and scored 80% on their A Level Statistics exam, add this data to the scatter diagram.

Q5b59KCo_2-4-1-scatter-diagrams-we-solution-1-part-1

2-4-1-scatter-diagrams-we-solution-1-part-2

Exam Tip

  • Learn the vocabulary for the types of variables as you could be asked a question on this. Make sure you check the scales carefully when plotting any points. 

Correlation

What is correlation?

  • Correlation is how the relationship between the two variables is described
  • Perfect linear correlation means that the bivariate data will all lie on a straight line on a scatter diagram
  • Linear correlation can be positive or negative and it can be strong or weak
    • Positive correlation describes a data set where both variables are increasing
    • Negative correlation describes a data set where one variable is increasing and the other is decreasing
  • When describing correlation you should say whether it is positive or negative and also say whether it is strong or weak
  • If correlation exists then there could be outliers, these will be data points that do not fit the pattern seen on the graph
    • There will likely be a maximum of one or two outliers on any scatter diagram
    • You may be asked to identify the outliers

2-4-1-correlation-diagram-1

What is the difference between correlation and causation?

  • It is important to be aware that just because correlation exists, it does not mean that the change in one of the variables is causing the change in the other variable
    • Correlation does not imply causation!
  • If a change in one variable causes a change in the other then the two variables are said to have a causal relationship
    • Observing correlation between two variables does not always mean that there is a causal relationship
    • Look at the two variables in question and consider the context of the question to decide if there could be a causal relationship
      • If the two variables are temperature and number of ice creams sold at a park then it is likely to be a causal relationship
      • Correlation may exist between global temperatures and the number of monkeys kept as pets in the UK but they are unlikely to have a causal relationship
    • Observing a relationship between two variables can allow you to create a hypothesis about those two variables

Worked Example

The scatter diagram below shows the number of Save My Exams question packs completed by a group of students and the percentage score they received in their A-Level Statistics exam.

2-4-1-correlation-we_diagram-1

(i)
Describe the correlation shown in the scatter diagram.

 

(ii)
Decide if you think there could be a causal relationship between the two variables and explain your reasoning.

2-4-1-correlation-we-solution-2

Linear Regression

What is linear regression?

  • If strong linear correlation exists on a scatter diagram, then a line of best fit can be drawn
    • This is a linear graph added to the scatter diagram that best approximates the relationship between the two variables
    • At GCSE this will have been drawn by eye as a line that fits closest to the data values
    • The data can be used to calculate the equation of the straight line that represents the best fit of the relationship between the two variables
      • You do not need to know how to calculate it but you will need to be able to interpret one
  • The least squares regression line is the line of best fit that minimises the sum of the squares of the gap between the line and each data value
    • This is usually called the regression line and can be calculated either be looking at the vertical or the horizontal distances between the line and the data values
    • If the regression line is calculated by looking at the vertical distances it is called the regression line of y on x
    • If the regression line is calculated by looking at the horizontal distances it is called the regression line of x on y
      • The regression line of x on y is rarely used and you are unlikely to come across it at this level
  • The regression line of y on x is written in the form begin mathsize 16px style y equals a plus b x end style
    • b is the gradient of the line and represents the change in for each individual unit change in begin mathsize 16px style x end style
    • a is the y – intercept and shows the value of begin mathsize 16px style y end style for which begin mathsize 16px style x end style is zero
  • You are not expected to calculate the regression line in your exam but you should be prepared to draw one onto a scatter diagram, interpret or use one to predict values
    • It is useful to know that the point begin mathsize 16px style left parenthesis space x with bar on top comma space y with bar on top space right parenthesis end style  will lie on the regression line

How to use a regression line?

  • Drawing a regression line is done in the same way as drawing a straight line graph, substitute some values from the independent data set to help you
  • The regression line can be used to decide what type of correlation there is if there is no scatter diagram
    • If b is positive then the data set has positive correlation and if b is negative then the data set has negative correlation
  • The value of b can be used to interpret how the data is changing
    • b is the gradient of the line and represents the change in y  for each individual unit change in begin mathsize 16px style x end style
  • The regression line can also be used to predict the value of a dependent variable from an independent variable
    • Predictions should only be made for values of the dependent variable that are within the range of the given data
    • Making a prediction within the range of the given data is called interpolation
    • Making a prediction outside of the range of the given data is called extrapolation and is much less reliable
    • The prediction will be more reliable if the number of data values in the original sample set is bigger

Worked Example

The scatter diagram below shows the number of Save My Exams question packs completed by a group of students, x comma and the percentage score they received in their A-Level Statistics exam, y.

 2-4-1-linear-regression-we-diagram

The equation of the regression line of y on x is y equals 18 plus 1.3 x  

(i)
Draw the regression line onto the scatter diagram.

 

(ii)
Interpret the meaning of the values 18 and 1.3 in the scatter diagram.

 

(iii)
Explain why the regression line given should not be used to estimate the percentage when someone has completed 80 question packs.

nRXh6kD0_2-4-1-linear-regression-we-solution-3-part-1

2-4-1-linear-regression-we-solution-3-part-2

Exam Tip

  • Remember that the value of b is the gradient of the regression line, a greater value of b does not mean stronger correlation. When using a regression line to make a prediction make sure that the value you are predicting from falls within the range of the data used to calculate the regression line.


  • 1. Statistical Sampling
    • 1.1 Sampling & Data Collection
      • 1.1.1 Sampling & Data Collection
    • 2. Data Presentation & Interpretation
      • 2.1 Statistical Measures
        • 2.1.1 Basic Statistical Measures
          • 2.1.2 Frequency Tables
            • 2.1.3 Standard Deviation & Variance
            • 2.2 Data Presentation
              • 2.2.1 Data Presentation
                • 2.2.2 Box Plots & Cumulative Frequency
                  • 2.2.3 Histograms
                  • 2.3 Working with Data
                    • 2.3.1 Outliers & Cleaning Data
                      • 2.3.2 Intrepreting Data
                      • 2.4 Correlation & Regression
                        • 2.4.1 Correlation & Regression
                        • 2.5 Further Correlation & Regression (A Level only)
                          • 2.5.1 PMCC & Non-linear Regression
                            • 2.5.2 Hypothesis Testing for Correlation
                          • 3. Probability
                            • 3.1 Basic Probability
                              • 3.1.1 Calculating Probabilities & Events
                                • 3.1.2 Venn Diagrams
                                  • 3.1.3 Tree Diagrams
                                  • 3.2 Further Probability (A Level only)
                                    • 3.2.1 Set Notation & Conditional Probability
                                      • 3.2.2 Further Venn Diagrams
                                        • 3.2.3 Further Tree Diagrams
                                          • 3.2.4 Probability Formulae
                                        • 4. Statistical Distributions
                                          • 4.1 Probability Distributions
                                            • 4.1.1 Discrete Probability Distributions
                                            • 4.2 Binomial Distribution
                                              • 4.2.1 The Binomial Distribution
                                                • 4.2.2 Calculating Binomial Probabilities
                                                • 4.3 Normal Distribution (A Level only)
                                                  • 4.3.1 The Normal Distribution
                                                    • 4.3.2 Normal Distribution - Calculations
                                                      • 4.3.3 Standard Normal Distribution
                                                      • 4.4 Choosing Distributions (A Level only)
                                                        • 4.4.1 Modelling with Distributions
                                                          • 4.4.2 Normal Approximation of Binomial
                                                        • 5. Hypothesis Testing
                                                          • 5.1 Hypothesis Testing
                                                            • 5.1.1 Hypothesis Testing
                                                            • 5.2 Hypothesis Testing (Binomial Distribution)
                                                              • 5.2.1 Binomial Hypothesis Testing
                                                              • 5.3 Hypothesis Testing (Normal Distribution) (A Level only)
                                                                • 5.3.1 Sample Mean Distribution
                                                                  • 5.3.2 Normal Hypothesis Testing
                                                                • 6. Large Data Set
                                                                  • 6.1 Large Data Set
                                                                    • 6.1 Large Data Set


                                                                    DOWNLOAD PDF

                                                                  Author: Amber

                                                                  Amber gained a first class degree in Mathematics & Meteorology from the University of Reading before training to become a teacher. She is passionate about teaching, having spent 8 years teaching GCSE and A Level Mathematics both in the UK and internationally. Amber loves creating bright and informative resources to help students reach their potential.


                                                                  Save My Exams Logo
                                                                  Resources
                                                                  Home Join Support

                                                                  Members
                                                                  Members Home Account Login

                                                                  Company
                                                                  About Us Contact Us Jobs Terms Privacy Facebook Twitter

                                                                  Quick Links
                                                                  GCSE Revision Notes IGCSE Revision Notes A Level Revision Notes Biology Chemistry Physics Maths 2022 Advance Information

                                                                   
                                                                  © Copyright 2015-2022 Save My Exams Ltd. All Rights Reserved.
                                                                  IBO was not involved in the production of, and does not endorse, the resources created by Save My Exams.