Datasets

Information on a range of data sources has been gathered to investigate the extent and availability of data and statistics relating to the construction industry and resource efficiency. Information has been separated into ‘key’ areas, namely:

  • Construction materials
  • Waste
  • Building rates and availability
  • Energy, emissions and transport
  • Building regulations and cost models

It is intended that this summary of data sources will be a useful resource to help find the information required by professionals working within the construction industry. The aim has also been to assess the robustness, availability and update frequency of the sources gathered.

For each data source, two tables of information are provided. The first table provides a summary of the information available in that data source, along with a brief description of the methodology and the availability of the data. The second table provides a statistical assessment of the data in an attempt to provide some measure of the quality of data available for the construction industry.

Each statistical assessment includes both a qualitative assessment of the data and a quantitative ‘score’ under a series of six categories. The scores generated are relative for this study and are based on the information that was available to make the assessment. It does not necessarily indicate a ‘good’ or ‘poor’ data source, but more a reflection of the clarity, objectivity and extent to which the methodology is explained. If no score is given in a particular category, this indicates that it has not been possible to tell how biased, accurate, timely, etc. a data source is.  More often than not, a low score indicates that little information was provided, so it should not be assumed that a data source with a low score is in some way a ‘bad’ source of information. It should be noted that this statistical assessment is meant to be indicative only and is subject to a good deal of interpretation. 

The categories assessed and the grounds on which the scores have been derived are given in the table below:

 

Category

 

 

Definition

 

 

Score*

 

 

Objectivity

 

 

This refers to the objectivity of the group that collected the data and the transparency of collection method.  For example, a study conducted by an independent organisation would presumably be more objective than a company that conducts a review of its own data.

 

 

1: Highly biased
2: Partially biased
3: Unbiased

 

 

Clarity of methodology

 

 

This is a measure of the accuracy and reliability of the methodology. This includes notes on whether the methodology is clearly defined and consistent within the study and with other studies of this kind.

 

 

1: Methodology is poorly explained or implemented
2: Fairly clear
3: Clearly defined and implemented methodology

 

 

Timeliness

 

 

Timeliness refers to whether the study is a one-off or whether the data is reviewed and re-published regularly. Notes will also be included on the consistency of the data and whether the collection process is re-produced in the same way each time.

 

 

1: One-off study
2: Will be reviewed but not regularly
3: Reviewed regularly

 

 

Scope

 

 

This measures how wide the scope of the report is - what geographical region is covered, how many topics are covered, how specific the data is. For the purposes of this work, UK-wide data is considered sufficient geographic scope

 

 

1: Specific data
2: Fairly wide-ranging data
3: Wide-ranging data

 

 

Gap filling

 

 

This category notes whether any gap filling or estimation has been used where exact data is not available.

 

 

1: Lots of estimation used
2: Some estimation used
3: No estimation used

 

 

Statistical clarity

 

 

Notes will be included in this section as to whether the study has been conducted in a statistically unbiased way, whether a sufficient sample size has been used, whether the statistical techniques used are appropriate (if applicable), etc. A low score in this section does not necessarily mean that the study was conducted poorly - the intention with many of these datasets may not be to present statistical data.

 

 

1: The study is not designed to be a statistically unbiased study
2: Some attention to statistics
3: A "statistically sound" study

 

 

Sample size/
response rate

 

 

If information is provided on the sample size or the response rate of the survey, then that information will be given here. This will not be included in the grading system because that information is often not reported.