Testing for Correlation

Neil Trivedi

Teacher

Testing for Zero Correlation

↓

Challenging Question

↓

Neil Trivedi

Head of Mathematics

Testing for Zero Correlation

Generally speaking, we say two variables have no correlation if they are independent.

We use hypothesis testing to determine whether we can suggest correlation between two variables given the sample size. Remember, the sample is supposed to represent the general population. Therefore, instead of using , we will use in our null and alternative hypothesis.

• The null hypothesis has the form . So, we are assuming that the population has no correlation unless proven otherwise.

• The alternative hypothesis can have one of three forms:

(for a one-tailed test testing for positive correlation)

(for a one-tailed test testing for negative correlation)

(for a two-tailed test testing for any correlation)

A statistical table lists critical values of Pearson’s correlation coefficient 𝑟 for different sample sizes 𝑛 at significance levels 10%, 5%, 2%, 1% and 0.5%, with a note explaining that smaller samples require stronger correlation to be significant.

The critical value can be found using the table. Look for the column with the required significance level, and the row for the matching sample size.

We reject if the Product Moment Correlation Coefficient (PMCC) we calculated is outside of the critical region.

When we are testing for positive correlation, it would be the case that our PMCC is bigger than the critical value. When we are testing for negative correlation, we reject if the PMCC is less than the negative of the value in the table.

When we are conducting a two-tailed test, we are finding the critical value that is half of the significance level when we are looking through the table. We reject if the PMCC we find is bigger than the critical value or lower than the negative critical value.

We do not reject if the PMCC we calculated is outside the critical region.

Example 1:

Test at the significance level, whether there is positive correlation between the temperature and number of ice creams sold. The PMCC of the sample with size is .

Step 1: Set up the hypotheses.

This is a one-tailed test for positive correlation, so the hypotheses are:

(remember we are assuming no correlation unless proven otherwise)

Step 2: Find the critical value using the table

Sample size

A Student’s 𝑡-distribution critical values table is shown with the row for 𝑛=16 degrees of freedom and the 10% one-tailed (or 20% two-tailed) column highlighted, giving the value 0.3383.

From the PMCC critical values table above, the critical value at the significance level is .

Critical region at the significance: . This means that any calculated PMCC above is strong enough to suggest that there is positive correlation with at least certainty (because of the significance level).

Step 3: Draw a conclusion based on the result.

Since , we do not reject .

There is insufficient evidence to suggest a positive correlation between temperature and the number of ice creams sold.

No answer provided.

Example 2:

From the Edexcel large data set, the daily mean windspeed, knots, and the daily maximum gust, knots, were recorded for the first days in July in Hurn, in 1987.

Day

Windspeed kn

Gust kn

n/a

a) State the meaning of ‘n/a’ in the table.

Single Step: Apply knowledge of the Large Data Set.

‘n/a’ means that the data is not available for that entry.

b) Calculate the PMCC for the remaining days.

Single Step: Calculate PMCC using a calculator.

As the data for the gust on Day 12 are missing, we exclude that day from the calculation.

The PMCC is

c) Test, at the level of significance, whether there is evidence of any correlation between the daily mean windspeed, and the daily maximum gust.

Step 1: Set up the hypotheses.

This is a two-tailed test, so the hypotheses are:

(here we are testing for any correlation as stated in the question, not necessarily positive nor negative)

Step 2: Find the critical value using the table.

Sample size = (excluding day 12)

A Product Moment Correlation Coefficient (PMCC) critical values table is shown with the row for sample size 𝑛=14 and the significance level 0.025 highlighted, giving the critical value 0.5324.

For a two-tailed test at the significance level, each tail has . From the PMCC critical values table: Critical value .

Hence the critical region is or .

Step 3: Draw a conclusion based on the result.

Since , we reject in favour of . There is sufficient evidence to suggest a correlation between the daily mean wind speed and the daily maximum gust. (Note: Because this is a two-tailed test, we can only conclude that a correlation exists – not necessarily a positive one.)

No answer provided.

Challenging Question

Practice Questions

Product Moment Correlation Coefficient

Maths

Year 13 Pure

Algebraic Methods

Proof by Contradiction

Partial Fractions

The Binomial Expansion

Vectors - Intersecting Diagonals

Functions

Function Notation

Domain and Range

Composite and Inverse Functions

Introduction to Modulus Functions and Graphs

Solving with Modulus Functions

Sequences and Series

Arithmetic Sequences

Geometric Sequences

Sigma Notation

Recurrence Relations

Chain Rule

Differentiating Composite Functions

Product Rule

Quotient Rule

Differentiating Reciprocated Trig Functions

Differentiating Inverse Trigonometric Functions

Parametric Differentiation

Implicit Differentiation

Inflection Points and Concavity

Related Rates of Change

Integration

Integration by Recognition

Integration Using Trigonometric Identities

Integration by the Reverse Chain Rule

Substitution

Integration by Parts

Integration using Partial Fractions

Parametric Integration

Separation of Variables

Forming and Solving Differential Equations

Numerical Methods

Trapezium Rule

Fixed Point Iteration

Newton-Raphson Method

Year 13 Mechanics

Moments

Introduction to Moments

Moment of Forces Acting on a Uniform Rod

Moment of Forces Acting on a Non-Uniform Rod

Tilting of Rigid Bodies

Forces

Resolving Forces

Friction

Equilibrium of Forces

Newton's 2nd Law in 2D with Friction

Connected Particles

Statics of Rigid Bodies

Projectile Motion

Horizontal Projections

Acute Angle Projections

Projectile Motion and Vectors

Variable Acceleration

Further Variable Acceleration

Variable Acceleration and Vectors

Vectors and SUVAT

SUVAT in Two Dimensions

Year 13 Statistics

Conditional Probability

Set Notation and Venn Diagrams

Conditional Probability and the Addition Rule

Probability and Venn Diagrams

Probability and Tree Diagrams

Correlation

Product Moment Correlation Coefficient

Testing for Correlation

The Normal Distribution

Normal Distribution Properties

The Inverse and Cumulative Normal Distribution

The Z-Distribution

Conditional Probability and the Normal Distribution

Approximating the Binomial Distribution

Hypothesis Testing on the Normal Distribution

Year 12 Pure (Coming soon)

Algebra & Functions

Index Laws

Change of Base

Surds

Solving Quadratics

Simultaneous Equations

Completing the Square

Algebraic Fractions

Sketching Quadratics

Quadratic Inequalities

Modelling with Quadratics

Discriminant

Graphs of Functions

Transformations of Functions

Factor Theorem

Coordinate Geometry

Equation of a Straight Line

Straight Line Graphs

Equations of Circles

Circles and Lines

Circle Properties

Proof

Methods of Proof

Sequences & Series

Pascal's Triangle

Binomial Expansion

Trigonometry

Trigonometric Ratios

Trigonometric Graphs

Trigonometric Identities

Solving Trigonometric Equations

Differentiation

Differentiation from First Principles

Differentiation

Tangents and Normals

Increasing/Decreasing Functions

Stationary Points

Sketching Gradient Functions

Optimisation

Integration

Definite Integrals

Using Integration to Find Areas

Finding Areas Below The Horizontal Axis and Between Curves

Exponentials & Logarithms

Exponentials and Euler's Number

Logarithms

Laws of Logarithms

Solving with Logarithms and Exponentials

Modelling with Exponentials

Non-Linear Data

Vectors

Introduction to Vectors

Vector Problems and Applications

Year 12 Mechanics (Coming soon)

Kinematics

Graphs of Motion

SUVAT Equations

Motion Under Gravity

Variable Acceleration

Forces & Newton's Laws

Introduction to Forces

Forces and Acceleration

Motion in a Lift and Horizontal Surfaces

The Connected Particles

Pulleys

Vectors

Introduction to Vectors

Vector Problems and Applications

Year 12 Statistics (Coming soon)

Data Presentation & Interpretation

Measures of Location

Measures of Spread

Representation of Data

Correlation

Probability

Venn Diagrams

Tree Diagrams

Statistical Distributions

The Binomial Distribution

Hypothesis Testing

Hypothesis Testing with The Binomial Distribution

Critical Regions

Testing for Correlation

Contents

Testing for Zero Correlation

Challenging Question

Practice Questions