DQ: Develop more sophisticated DQ checking #1640

@RDmitchell

Description

Expected Behavior

Users have developed spreadsheets to enable sophisticated data quality checking.

Here are a few examples:

  • If Building Type = Multifamily AND (Electricity Use < 15% of total energy OR ENERGY STAR Score > 95), flag (label) the record as "Problem"
  • Create a new field based on other fields (may involve math), then do DQ on that new field
    • Calculate the % electricity mix: the ratio of total electricity consumption (which might be the sum of several fields, converted from kWh to kBtu) to Total Site Energy
    • Then check whether the % electricity mix > N and, if so, flag (label) the record
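
To make the two examples above concrete, here is a minimal sketch in plain Python of what such a check could look like. This is not SEED code: the field names (`building_type`, `site_energy_kbtu`, `energy_star_score`, the kWh fields), the function names, and the second label name are all hypothetical, and records are assumed to be simple dicts.

```python
KBTU_PER_KWH = 3.412142  # standard conversion: 1 kWh is about 3.412 kBtu


def electricity_share(record, electricity_fields_kwh):
    """Derived field: electricity as a fraction of total site energy.

    Sums the given kWh fields (hypothetical names), converts to kBtu, and
    divides by the record's total site energy. Returns None when the total
    is missing or zero, so no check fires on incomplete data.
    """
    total_site_kbtu = record.get("site_energy_kbtu")
    if not total_site_kbtu:
        return None
    electricity_kwh = sum(record.get(f) or 0.0 for f in electricity_fields_kwh)
    return electricity_kwh * KBTU_PER_KWH / total_site_kbtu


def check_record(record, electricity_fields_kwh, max_share=None):
    """Return the list of labels this record should receive."""
    labels = []
    share = electricity_share(record, electricity_fields_kwh)
    score = record.get("energy_star_score")

    # Example 1: Multifamily AND (electricity < 15% of total OR score > 95)
    low_electricity = share is not None and share < 0.15
    high_score = score is not None and score > 95
    if record.get("building_type") == "Multifamily" and (low_electricity or high_score):
        labels.append("Problem")

    # Example 2: DQ check on the derived field itself (share > N)
    if max_share is not None and share is not None and share > max_share:
        labels.append("High electricity share")  # hypothetical label name
    return labels


record = {
    "building_type": "Multifamily",
    "site_energy_kbtu": 1000.0,
    "grid_kwh": 20.0,
    "solar_kwh": 10.0,
    "energy_star_score": 80,
}
print(check_record(record, ["grid_kwh", "solar_kwh"]))
```

The point of the sketch is that a rule needs three pieces: a derived field (with unit conversion), a boolean expression over raw and derived fields, and a label to apply when the expression is true.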

I can put together some specific examples from cities that are already doing these DQ checks in their spreadsheets.

We need to come up with a way for users to construct these "query expressions" in SEED. As a start, maybe SEED could interpret SQL statements?
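
As a rough illustration of the SQL idea, the sketch below loads records into an in-memory SQLite table with Python's standard `sqlite3` module and evaluates a user-written `WHERE` expression against them. It is only a sketch under stated assumptions: the function name `flag_records`, the table name, and the column names are hypothetical, and nothing here reflects how SEED stores or queries data. It also interpolates the expression directly into the query, which would not be safe for untrusted input in a real deployment.

```python
import sqlite3


def flag_records(records, columns, where_expression):
    """Return the indexes of records matching a user-written SQL expression.

    records: list of dicts; columns: the (hypothetical) field names to expose.
    The expression is pasted into the query as-is, so this is for
    illustration only and is not safe for untrusted input.
    """
    conn = sqlite3.connect(":memory:")
    try:
        column_list = ", ".join(f'"{c}"' for c in columns)
        conn.execute(f"CREATE TABLE records (id INTEGER PRIMARY KEY, {column_list})")
        placeholders = ", ".join("?" for _ in columns)
        conn.executemany(
            f"INSERT INTO records (id, {column_list}) VALUES (?, {placeholders})",
            [(i, *(r.get(c) for c in columns)) for i, r in enumerate(records)],
        )
        rows = conn.execute(
            f"SELECT id FROM records WHERE {where_expression} ORDER BY id"
        )
        return [row[0] for row in rows]
    finally:
        conn.close()


records = [
    {"building_type": "Multifamily", "electricity_kwh": 30.0,
     "site_energy_kbtu": 1000.0, "energy_star_score": 80},
    {"building_type": "Multifamily", "electricity_kwh": 200.0,
     "site_energy_kbtu": 1000.0, "energy_star_score": 80},
    {"building_type": "Office", "electricity_kwh": 30.0,
     "site_energy_kbtu": 1000.0, "energy_star_score": 99},
    {"building_type": "Multifamily", "electricity_kwh": 200.0,
     "site_energy_kbtu": 1000.0, "energy_star_score": 97},
]
columns = ["building_type", "electricity_kwh", "site_energy_kbtu", "energy_star_score"]
rule = (
    "building_type = 'Multifamily' AND "
    "(electricity_kwh * 3.412142 / site_energy_kbtu < 0.15 OR energy_star_score > 95)"
)
print(flag_records(records, columns, rule))
```

One appeal of SQL here is that derived fields (sums, unit conversions, ratios) and boolean logic come for free in the expression syntax. The cost is that users must know SQL and that expressions need validation or sandboxing before they are run.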

Steps to Reproduce

Not applicable: SEED does not currently support this.

Instance Information

instance: LBNL staging / production
SHA: 36cc59d

Metadata

Labels

BPS, Data Quality Checking, Feature, Verify No Longer Relevant
