You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Please indicate the following details about the environment in which you found the bug:
RDT version: 0.5.2.dev0
Python version: 3.8
Operating System: macOS 11.5.1
Error Description
After optimizing the OneHotEncoder Transformer, CopulaGAN, CTGAN, and TVAE start failing in SDV. The primary reason for this failure is caused by the line:
where DataTransformer access ohe.dummies. In PR #186, we altered the contents of self.dummies to not include NaN values (because pd.Categorical doesn't accept it as a category), which inconsequence makes us initialize weights of the the wrong dimensions in consequent parts of the code because we are missing the NaN value in ohe.dummies.
Steps to reproduce
Initialize any dataframe with missing values and it will crash
Environment Details
Please indicate the following details about the environment in which you found the bug:
Error Description
After optimizing the OneHotEncoder Transformer,
CopulaGAN
,CTGAN
, andTVAE
start failing in SDV. The primary reason for this failure is caused by the line:https://github.com/sdv-dev/CTGAN/blob/2dbbc18770d0dc17e7c5e83da6980ace424e9326/ctgan/data_transformer.py#L57
where DataTransformer access
ohe.dummies
. In PR #186, we altered the contents ofself.dummies
to not include NaN values (becausepd.Categorical
doesn't accept it as a category), which inconsequence makes us initialize weights of the the wrong dimensions in consequent parts of the code because we are missing the NaN value inohe.dummies
.Steps to reproduce
Initialize any dataframe with missing values and it will crash
the code crashes with the following message
Steps to fix
OneHotEncoder
and letdummies
containnp.nan
when that's the case.np.nan
is part ofdummies
whendummy_na
is set to True.The text was updated successfully, but these errors were encountered: