A bit of a one-off thought, but I was wondering what people thought about creating a data quality flag for mapped data.
The idea came to me as I’ve been developing the “translation keys” for mapping between the ODM and other models/data structures. As I go through, I sometimes think “this mapping works, and is true, but something is maybe also lost in this ‘translation’”. So I wonder if mapped data might benefit from such a flag, so that if there’s any oddness or missing mandatory fields, there’s an explanation for it. Or does that defeat the point of mapping and interoperability? Curious to hear people’s thoughts!
Information about the mapping process would be helpful to capture. This issue combines data quality and data provenance.
Not to make the issue more complicated, but it is common to go through several data mapping stages before reaching the ODM, and mappings can also occur afterwards.
What about identifying what the data was mapped from? The datasets table may be the most appropriate location, and we would likely need a new field, something like “original data dictionary”. Ideally, we would also support recording a chain of mappings or data manipulations, but knowing how the data was first collected may be the most important step.
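To make the idea concrete, here’s a minimal sketch of what recording a chain of mappings alongside a dataset could look like. All names here (`MappingStep`, `mapping_chain`, the example source value) are hypothetical illustrations, not part of the ODM spec:

```python
from dataclasses import dataclass, field

@dataclass
class MappingStep:
    """One stage in the chain of mappings a dataset went through."""
    source_dictionary: str  # data dictionary/structure the data was mapped from
    target_dictionary: str  # what it was mapped to, e.g. "ODM"
    notes: str = ""         # anything lost or altered in the "translation"

@dataclass
class Dataset:
    dataset_id: str
    # Ordered oldest-first: the first element records how the data
    # was originally collected, which may be the most important step.
    mapping_chain: list = field(default_factory=list)

ds = Dataset("ds-001")
ds.mapping_chain.append(
    MappingStep("lab spreadsheet", "ODM", "free-text site names were collapsed")
)
# The original data dictionary is just the first link in the chain.
original_source = ds.mapping_chain[0].source_dictionary if ds.mapping_chain else None
```

Even if we only keep a single “original data dictionary” field for now, storing it as the first link of a chain like this would leave room to add later mapping stages without a schema change.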
Having a data quality flag could work well. However, knowing how the data was mapped to/from the ODM would itself give a good indication of the quality of the mapping process.
I’m still inclined to agree with your idea here, @dmanuel, to add a field for “original data structure” or something to that effect. I agree that it makes the most sense in the datasets table as well. @jeandavidt - any thoughts?
After team discussion, it was decided to add an “originalFormat” field to the datasets table. I’ll come up with a category set for this field as well, to try to keep it relatively clean. It’ll show up in red in the ERD for the next little while, but this resolves the issue.
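As a rough sketch of how a category set could keep the field clean, validation against a controlled vocabulary might look like the following. The category values below are placeholders I made up; the real set for “originalFormat” is still to be defined:

```python
# Hypothetical controlled vocabulary for the new "originalFormat" field.
# These values are illustrative only, not the agreed category set.
ORIGINAL_FORMAT_CATEGORIES = {
    "odm",                # data was collected directly in the ODM
    "customSpreadsheet",  # ad hoc lab or site spreadsheet
    "limsExport",         # export from a laboratory information system
    "otherModel",         # mapped from another published data model
    "unknown",            # provenance not recorded
}

def validate_original_format(value: str) -> str:
    """Return the value if it is in the category set, else fall back to 'unknown'."""
    return value if value in ORIGINAL_FORMAT_CATEGORIES else "unknown"
```

Funnelling free-text entries through a check like this is one way to keep the field from accumulating dozens of near-duplicate spellings.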