Curtin Biometry and Agricultural Data Analytics
September 5, 2024
Source: Dr Karyn Reeves, SAGI-West
Source: Dr Karyn Reeves, SAGI-West
Source: Dr Ben Biddulph, DPIRD
Source: Dr Karyn Reeves, SAGI-West
All columns supplied with the trial design should be retained
Every plot needs a unique identifier (e.g., plot number)
Avoid copy-paste (easy to introduce errors)
Ensure embedded equations reference the correct columns
S10is different than
S 10
Good Name | Good Alternative | Avoid |
---|---|---|
max_temp_C | MaxTemp | Maximum Temp (˚C) |
precipitation_mm | precip | precmm |
mean_year_growth | MeanYearGrowth | Mean growth/year |
yield_kg_ha | yield | yield kg/ha |
observation_o1 | first_observation | 1st Obs. |
A spreadsheet with inconsistent date formats. This spreadsheet does not adhere to recommendations for consistency of date format. From Broman and Woo (2018). Also see https://xkcd.com/1179.
Examples of spreadsheets that violate the no empty cells
recommendation. (a) A spreadsheet where only the first of several repeated values was included. (b) A spreadsheet with a complicated layout and some implicit column headers, from Broman and Woo (2018).
rep-plotfor a header and use
1-1,
1-2…
2-1, in the column, etc.
repand
plotas headers and have the values in separated columns
Examples of spreadsheets with nonrectangular layouts. These layouts are likely to cause problems in analysis, from Broman and Woo (2018).
Example A, what not to do.
A spreadsheet with two header rows. It is better to have a single header row, from Broman and Woo (2018).
Example B, what not to do.
An example spreadsheet of the previous example’s data in a rectangular layout, from Broman and Woo (2018).
Yes, do this!
An example data dictionary, from Broman and Woo (2018).
Highlighting in spreadsheets.(a) A potential outlier indicated by highlighting the cell.(b) The preferred method for indicating outliers, via an additional column, from Broman and Woo (2018).
(a) An example spreadsheet. (b) The same data as a plain text file in CSV format, from Broman and Woo (2018).
Exercise (20 min)
Working with your partner on the experiment you designed this morning.