Archive for the ‘Research’ Category

Some Useful Excel Templates

Compare two lists. Sometimes it may be necessary to compare data between two columns in an Excel spreadsheet. It is particularly important, when analysing backlinks for example, to de-duplicate lists of data into some kind of useful list of domains that is easier…


Correlation and Data Transformations

By Neep Hazarika

In this article, we will show how data transformations can be an important tool for the proper statistical analysis of data. The association, or correlation, between two variables can be visualised by creating a scatterplot of the data. In certain instances, it may appear that the relationship between the two variables is not linear; in…


Majestic SEO Beginners Guide to Correlation: Part 5

By Neep Hazarika

Review of Previous Parts: In Part 1, we described how two datasets can have strong, weak or no correlation depending on how the points in the resultant scatter plot are located relative to one another along a straight line. In keeping with conventional practice, the two variables are often referred to separately as X, with…


Majestic SEO Beginners Guide to Correlation: Part 4

In Parts 1, 2 and 3, we looked at statistical measures of datasets that were purely one-dimensional in nature, i.e., all the observations constituting the dataset were of the same type. Examples of such datasets are the time elapsed between eruptions of the Old Faithful Geyser…


A Study of Subnet/IP Link Ratios

This short blog is in response to a recent support ticket querying whether particular domains display significant differences in terms of the ratio of referring subnets to referring IP addresses. We were asked what a “normal” value of this ratio could be. A statistical study was performed on the Majestic Million data. We found that…