Dimensions: Which answers the who, what, where, and when of our data
The data that contains qualitative information are categorized as dimensions. These are expressive attributes, like a category of product, address of the customer, or country of origin. We can say, Dimensions can contain numeric characters (like an alphanumeric customer ID) but are not numeric values (It wouldn’t make sense to add up all the ID numbers in a column, for example).
Let us think in this way: if we can’t (or wouldn’t) compute a field, it’s a dimension.
Eg. Title of the product, category of products, vendor list, etc.
Measures: Which are the numerical fields that we can compute
The data that can be quantified are categorized as measures. Fields like subtotal of the order, the number of items purchased, or duration spent on a specific page. “Hence measures are computable”. Say we have a measure, quantity of items purchased: we can do things like calculating the average quantity ordered, sorting by descending quantities, sum all quantities, and so on.
Eg. Price of a product, Customer rating for the product, etc.,
Note: Date fields are dimensions too. Eg, The Year of production will be a dimension because calculating min/max/sum here will not help. Instead, we may group this date according to the year of manufacturing.
Dimensions
Measures
It is an independent variable.
It is a dependent variable.
It is not dependent on the measure.
It is dependent on the dimension.
Adding to the filter will give us insight into the data, it is beneficial to add this in the filters.
Adding to the filter will not give us many insights of the data.
We can’t aggregate it.
We can aggregate it.
Min, max, and sum won’t work.
Min, max, and the sum will work.
It is used to compare the data.
It is a metric that we use to compare the dimension.
It may contain duplication of data.
It does not contain duplication of data.
Headers are generated when added to the rows or columns.
Axes are generated when added to the rows and columns.
It contains qualitative and categorical information.
It contains quantitative data.
It describes data records.
It cannot describe data records.
It cannot be continuous and discrete.
It can be continuous and discrete.
It is not possible to get several records because aggregation does not apply to it.
Due to the aggregation feature, we can get the number of records present in the database no matter how huge the dataset is