Archive for the ‘Research’ Category

Correlation and Data Transformations

By Neep Hazarika | |

In this article, we will show how data transformations can be an important tool for the proper statistical analysis of data. The association, or correlation, between two variables can be visualised by creating a scatterplot of the data. In certain instances, it may appear that the relationship between the two variables is not linear; in…


Majestic SEO Beginners Guide to Correlation: Part 5

By Neep Hazarika | |

Review of Previous Parts In Part 1, we described how two datasets can have strong, weak or no correlation depending on how the points in the resultant scatter plot are located relative to one another along a straight line. In keeping with conventional practice, the two variables are often referred to separately as X, with…


Majestic SEO Beginners Guide to Correlation: Part 4

In Parts 1, 2 and 3, we looked at statistical measures of datasets that were purely one-dimensional in nature, i.e., all the observations constituting the dataset were of the same type.  Examples of such datasets are the time elapsed between eruptions of the Old Faithful Geyser Continue reading


A Study of Subnet/ IP Link Ratios

This short blog is in response to a recent support ticket querying whether particular domains display significant differences in terms of the ratio of referring subnets to referring IP addresses. We were asked what a “normal” value of this ratio could be. A statistical study was performed on the Majestic Million data. We found that…


MajesticSEO Beginners Guide to Correlation: Part 3

In Parts 1 and 2, we have looked at the definitions of correlation, population and sample size. In this part, we will go through the essentials of the mathematics involved.  Before we can proceed any further, however, we have to define certain fundamental statistical concepts. Statistical Measures of a Data Set In this section, it…