# maximum Corelation coefficient - how the numarator and denominator becomes equal?

Pearsons corelation coefficient is defined as follows

$$r_{x,y} =frac {sum (x_i -bar X)(y_i – bar Y)}{sqrt{sum(x_i-bar X)^2} sqrt{sum(y_i-bar Y)^2}}$$.

Now, maximum magnitude of a correlation coefficient is known to be $$1$$;
which is possible only if

$$Numarator = denominator$$,

$$or, {sum (x_i -bar X)(y_i – bar Y)}={sqrt{sum(x_i-bar X)^2} sqrt{sum(y_i-bar Y)^2}} … (1)$$;

Now; if the dataset is perfectly corelated, then how or why the relation $$… (1)$$ works? or in other words, the denominator or numarator becomes equal?

Samely the alternate equation,

$$r_{xy}= frac {nsum X_iY_i- (sum x_i)(sum y_i)} {sqrt{nsum x_i^2 -(sum x_i)^2} sqrt{nsum y_i^2-(sum y_i)^2}}$$

now, the coefficient of correlation will be =1 only if

$$numerator=denominator$$

$$or,{nsum X_iY_i- (sum x_i)(sum y_i)} = {sqrt{nsum x_i^2 -(sum x_i)^2} sqrt{nsum y_i^2-(sum y_i)^2}} … (2)$$

If the dataset is perfectly correlated ; then how to tell relation $$… (2)$$ will work?

So in brief my question is ….

I am looking for necessarily intuitive explanation of what these mathematical terms mean; such as it is quite clear that $${sum (x_i -bar X)(y_i – bar Y)}$$ is a sum of areas made by the data points which comes from covariance. But why this exact same area will be cancelled out by the denominator, that I could not understand by any way.

1. Searched Google and Stackexchange by search terms.
2. Tried youtube videos like this, this etc.
3. Have visited several internet articles but the articles are appearently lacking visual or intuitive explanations.

Cross Validated Asked on November 21, 2021

## Related Questions

### Forecasting with mixed models

1  Asked on January 13, 2021 by katy

### Why do some researchers use the oxymoron “prevalence rate”?

0  Asked on January 13, 2021

### How to calculate out of sample R squared?

2  Asked on January 13, 2021 by crazydriver

### Denoising 3D matrix

0  Asked on January 13, 2021 by haohan-wang

### In this Bayesian network, where does this posterior probability come from?

1  Asked on January 13, 2021 by vin

### What is wrong with my approach on a custom way of creating Gabor-filter convolution kernels?

0  Asked on January 12, 2021 by g-s-luimstra

### Pseudo-inverse matrix for multivariate linear regression

1  Asked on January 12, 2021 by somethingsomething

### Assessing the representativeness of population sampling

1  Asked on January 12, 2021 by user3136

### How can an A/B test show significant result without enough data

0  Asked on January 11, 2021 by jonas-palaionis

### Cross-lagged model and supplement regressions: Do I have to include my control variables in the supplement regression analyses?

0  Asked on January 11, 2021 by sventon

### Is it Valid to Grid Search Cross Validation for Model Hyperparameter Selection then a separate Cross Validation for Generalisation Error?

2  Asked on January 11, 2021 by benjamin-phua

### Find $E[N^2 | N > 2]$ for a frequency distribution

1  Asked on January 10, 2021 by confusedmathstudent

### Finding meaningful boundaries between two continuous variables in R

0  Asked on January 10, 2021

### Using categorical feature as both a continuous feature, and also doing One hot encoding. Is this overkill?

2  Asked on January 10, 2021 by stats_nerd

0  Asked on January 10, 2021 by anto_zoolander

### Expected value of the residuals

2  Asked on January 10, 2021 by snoopy

### How to determine relationship categorical and numerical data

1  Asked on January 9, 2021 by onhalu

### Multiple Poisson regression (?) in R

2  Asked on January 9, 2021 by jonas8

### Propose a model for this time series

1  Asked on January 8, 2021 by le-anh-dung