You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
When we create a Gaussian copula model without defining metadata explicitly, the generated synthetic data holds the properties of the sample data( like min and max values). When we define the same by explicitly passing meta data the generated data is loosing the properties.
From the above screenshot consider the 'col2' column in the original data(df1) min value is 8.37 and max is 196.8 .
now the generated new_data_without_meta holds the min and max values of the sample but the new_data_with_meta generates negative values. I even verified for the distributions of the data for each column, both the models have same distributions.
The same behaviour is seen with constraints also, when passed directly gives similar data, when passed with explicit metadata gives out of bound values.
can you please look into this issue.
Thankyou.
The text was updated successfully, but these errors were encountered:
I agree that we should expect min and max values to be automatically detected the same way in both cases (when metadata is provided and when it is not). We will look into this issue soon.
When we create a Gaussian copula model without defining metadata explicitly, the generated synthetic data holds the properties of the sample data( like min and max values). When we define the same by explicitly passing meta data the generated data is loosing the properties.
rounding_data_test.csv
From the above screenshot consider the 'col2' column in the original data(df1) min value is 8.37 and max is 196.8 .
now the generated new_data_without_meta holds the min and max values of the sample but the new_data_with_meta generates negative values. I even verified for the distributions of the data for each column, both the models have same distributions.
The same behaviour is seen with constraints also, when passed directly gives similar data, when passed with explicit metadata gives out of bound values.
can you please look into this issue.
Thankyou.
The text was updated successfully, but these errors were encountered: