I have a good chunk of the work and the planning now done for how mapping would be done from NWSS to ODM, though acquiring NWSS-formatted data to test this is still on my to-do list.
With that said, here is a a (non-exhaustive) summary of of the challenges/decisions that need addressed before we can move forward:
-
hum_frac_target_chem
&hum_frac_target_mic
- severity: low:
In NWSS they have different fields for chemical normalization targets and microbial normalization targets. In the ODM, all of these would just be captured as a measure of the given target. Do we want to try and capture the microbial vs. chemical distinction? My instinct is to say no. -
other_norm_name
- severity: low:
This field seems to be used in instances where two targets are used for normalization, so this second target can be specified here. This is still (I think) just a measure as with 1, and this structure is not important to maintain. Though a “measure report ID” will need to be programmatically generated to link these measures together. -
vs_mic_chem_units
- severity: medium:
The units used (copies per litre, milligrams per litre, milligrams per gram, etc.), however NWSS also includes 10 copies/L wastewater, etc. Do we want to transform these log values into a normal format? They also specify units in log10 copies/g dry sludge micrograms/L wastewater, and
log10 micrograms/L wastewater etc. Do we want to capture these materials in our units as well? Or should this go elsewhere? Please note that the sample material is still recorded in a different column, and is not dry/wet, etc. -
num_no_target_control
- severity: low:
This asks users to specify the number of NTCs (non-template controls) run on a plate. I think this can be captured as a new measure, but again would need to be connected to other items via a machine-generated measure report ID. -
MHV (PREvalence), BCoV (GT-Digital) - severity: low:
In the list of spike targets murine hepatitis virus and bovine coronavirus (MHV and BCoV, respectively) are already listed, but these two additional and tweaker categories are also included in the list. It’s not clear what the difference is between MHV and MHV (PREvalence) is, but do we think this differentiation is worth preserving? -
vs_reporting_jurisdiction
&vs_wwtp_jurisdiction
- severity: low:
I think these would both be organization IDs, but the reporting jurisdiction is the sites table’srepOrg1
, and the wwtp jurisdiction is the sites tableorgID
. Are folks in agreement with that mapping? -
vs_sample_type
- severity: medium:
For sample type, NWSS has a 195 categories that are actually all just grab samples, passive samples, manual composite samples, time-weighted composite samples, and flow-weighted composite samples - just with different hourly length options. In ODM, the type of sample would be incollType
and the period in hours would be recorded incollPer
.