Tags: hitsz-ids/synthetic-data-generator

0.2.4

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
New feature support for DataFrameConnector, NormalizedFrequencyEncoder & NormalizedLabelEncoder; CTGAN Optimization and Performance Enhancements. (#247)

* 1

* q

* 1

* 1

* test

* optional encoder

* progress notes

* fix bug, normalization

* 100k

* 100k

* 1ktest

* 1ktest

* 1ktest

* 1ktest

* test

* 1

* 1ktest

* 1ktest

* 1

* test

* 1ktest

* Rfecv

* Rfecv

* Rfecv

* Rfecv

* 1

* param

* test

* test

* test

* test

* test

* test

* remove unused

* test

* batched

* remove

* Update dataframe_connector.py

* fix typo

* fix

* fix

* Update data_transformer.py

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* fix

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* fix

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* test

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* fix

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* doc

* fix

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Update .gitignore

* Update sdgx/data_connectors/dataframe_connector.py

Co-authored-by: Zhongsheng Ji <9573586@qq.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* fix

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* fix

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* update document

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Zhongsheng Ji <9573586@qq.com>

0.2.3

chore: Drop python 3.8 support and improve ci file name (#237)

* chore: Drop python 3.8 support

* Fix ci

* Test against 3.9 and 3.12

* Rename ci file

0.2.2

feat: fix high memory issues in Gaussian copula fitting for high cardinality discrete columns based on frequency encoding. (#233)

0.2.1

Fix Division by Zero Error in Numeric Column Inspection (#220)

0.2.0

Add CTGAN description to README.md (#151)

0.1.5

Add star history chart in README (#122)

0.1.4

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.
Add mutual information metric (#101)

* test

* test_v2

* no-test

* pair_v1

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* remove_old_mi_sim

* modify single&multi_table MISim

* modify single_mi_sim by using pair_sim instance

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* modify multi_mi_sim by using pair_sim instance

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* change_class_name_err

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* modify_paircolumn

* mi only needs dataframe

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* modify based on review

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* complete test_mi_sim

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* modify test file

* change_var_name

* Update sdgx/metrics/multi_table/multitable_mi_sim.py

Co-authored-by: MoooCat <141886018+MooooCat@users.noreply.github.com>

* add MULTI_TABLE_DEMO_DATA

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* modify comments

* JSD->MISIM

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* modify base of pair_column

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* add cls

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* change self into cls instance

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* change cls

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* series2array

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* test

* test

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* add label_encoder for category in mi_sim

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* use series.array

* change le_fit

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* change transform type to np.array instead of list

* add astype

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* series2array

* foo

* change test_suit

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* all right?

* all right

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Co-authored-by: Z712023 <3422685015@qq.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Z712023 <132286135+Z712023@users.noreply.github.com>

0.1.3

Add demo data for multi-table scenario (#98)

* add multi-table demo data

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* update get_demo_multi_table

* add some multi-table pytest fixture

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* fix typo

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* add return type hints

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

0.1.2

CLI for single table synthesizer (#86)

- Introduce `Data Exporter` for exporting sampled data to data sources
- CLI updates for synthesizer

0.1.1

Fix self release