Float formatter `learn_rounding_scheme` doesn't work on all digits #556

npatki · 2022-09-28T14:55:49Z

Environment Details

RDT version: 1.2.1

Error Description

When using a FloatFormatter with learn_rounding_scheme=True, we expect the transformer to learn the maximum # of significant digits. In practice, we see the following:

If data has 0-14 digits, then the transformer learns rounding scheme [Working as intended]
If the data has 15+ digits, then the transformer learns 0 digits, producing whole numbers instead [Bug]

We expect that case 2 to work. Or as a fallback, at least stop enforcing the rounding if there are already a large number of digits.

Steps to reproduce

import pandas as pd
from rdt import HyperTransformer
from rdt.transformers.numerical import FloatFormatter

# create test data with 16 digits
test_data = pd.DataFrame(data={
    'column': [1.1234567890123456]
})

ht = HyperTransformer()
ht.set_config({
    'sdtypes': { 'column': 'numerical' },
    'transformers': { 'column': FloatFormatter(learn_rounding_scheme=True)}  
})

t = ht.fit_transform(test_data)
ht.reverse_transform(t)

Output: 1.0 (no digits learned)

The text was updated successfully, but these errors were encountered:

npatki added the bug Something isn't working label Sep 28, 2022

npatki modified the milestone: 1.3.0 Sep 28, 2022

npatki mentioned this issue Sep 28, 2022

Incorrectly enforced rounding on numerical/float data columns sdv-dev/SDV#1039

Closed

This was referenced Dec 2, 2022

Cap learn_rounding_scheme correctly sdv-dev/SDV#1126

Merged

Fix learn_rounding_scheme for more than 14 digits #591

Merged

fealho closed this as completed in #591 Dec 7, 2022

npatki added this to the 1.3.0 milestone Jan 11, 2023

npatki mentioned this issue Jan 12, 2023

Using field_transformer option in CTGAN model gives better results than explicitly transforming data outside CTGAN model sdv-dev/SDV#1172

Closed

amontanez24 assigned fealho Jan 17, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Float formatter `learn_rounding_scheme` doesn't work on all digits #556

Float formatter `learn_rounding_scheme` doesn't work on all digits #556

npatki commented Sep 28, 2022 •

edited

Loading

Float formatter learn_rounding_scheme doesn't work on all digits #556

Float formatter learn_rounding_scheme doesn't work on all digits #556

Comments

npatki commented Sep 28, 2022 • edited Loading

Environment Details

Error Description

Steps to reproduce

Float formatter `learn_rounding_scheme` doesn't work on all digits #556

Float formatter `learn_rounding_scheme` doesn't work on all digits #556

npatki commented Sep 28, 2022 •

edited

Loading