-
Notifications
You must be signed in to change notification settings - Fork 185
[WIP] Staging/dev/profile serialization #908
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Closed
Closed
Changes from all commits
Commits
Show all changes
33 commits
Select commit
Hold shift + click to select a range
7cd3ae4
initial changes to categoricalColumn decoder (#818)
micdavis b4ad93d
Implemented decoding for numerical stats mixin and integer profiles (…
ksneab7 620f0d9
hot fixes for encode and decode of numeric stats mixin and intcol pro…
ksneab7 cd77962
Float column profiler encode decode (#854)
ksneab7 8b4d6e2
Json decode date time column (#861)
tyfarnan 42ad4a4
Added decoding for encoding of ordered column profiles (#864)
ksneab7 f4f7e47
Added ordered col test to ensure correct response to update when diff…
ksneab7 2390004
added decode text_column_profiler functionality and tests (#870)
micdavis 6eb6852
Created encoder for the datalabelercolumn (#869)
ksneab7 6c51368
feat: add test and compiler serialization (#884)
JGSweets 8285293
[WIP] Adds tests validating serialization with Primitive type for com…
JGSweets 12b7dee
Adds deserialization for compilers and validates tests for Primitive;…
JGSweets d9b5f49
Add Serialization and Deserialization Tests for Stats Compiler, plus …
JGSweets f46b8a9
ready datalabeler for deserialization and improvement on serializatio…
ksneab7 3bb1127
Deserialization of datalabeler (#891)
ksneab7 3eb7f75
Encode Options (#875)
micdavis 3ebadb4
[WIP] ColumnDataLabelerCompiler: serialize / deserialize (#888)
taylorfturner 31f2a7a
Quick Test update (#893)
taylorfturner 96ca39f
Decode options (#894)
micdavis ee1f602
refactor: allow options to go through all (#902)
JGSweets ecceaed
StructuredColProfiler Encode / Decode (#901)
taylorfturner 1e03f90
fix: bug and add tests for structuredcolprofiler (#904)
JGSweets 8032bb0
Stuctured profiler encode decode (#903)
ksneab7 4f49819
[WIP] Added NoImplementationError for UnstructuredProfiler (#907)
micdavis 7f6dffa
Added testing for values for test_json_decode_after_update (#915)
ksneab7 2320718
Reuse passed labeler (#924)
JGSweets 1a59a33
BaseProfiler save() for json (#923)
micdavis 5ec8907
refactor: use seed for sample for consistency (#927)
JGSweets 52f54f6
WIP top level load (#925)
tyfarnan 55202ac
quick hot fix for input validation on save() save_metho (#931)
micdavis bd04cd5
BaseProfiler: `load_method` hotfix (#932)
micdavis bcf9eeb
fix: null_rep mat should calculate even if datetime (#933)
JGSweets 423bc0a
Notebook Example save/load Profile (#930)
taylorfturner File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -1,12 +1,92 @@ | ||
| """Package for providing statistics and predictions for a given dataset.""" | ||
| from . import json_decoder | ||
| from .base_column_profilers import BaseColumnProfiler | ||
| from .categorical_column_profile import CategoricalColumn | ||
| from .column_profile_compilers import ( | ||
| BaseCompiler, | ||
| ColumnDataLabelerCompiler, | ||
| ColumnPrimitiveTypeProfileCompiler, | ||
| ColumnStatsProfileCompiler, | ||
| ) | ||
| from .data_labeler_column_profile import DataLabelerColumn | ||
| from .datetime_column_profile import DateTimeColumn | ||
| from .float_column_profile import FloatColumn | ||
| from .int_column_profile import IntColumn | ||
| from .numerical_column_stats import NumericStatsMixin | ||
| from .order_column_profile import OrderColumn | ||
| from .profile_builder import Profiler, StructuredProfiler, UnstructuredProfiler | ||
| from .profile_builder import ( | ||
| Profiler, | ||
| StructuredColProfiler, | ||
| StructuredProfiler, | ||
| UnstructuredProfiler, | ||
| ) | ||
| from .profiler_options import ( | ||
| BaseInspectorOptions, | ||
| BooleanOption, | ||
| CategoricalOptions, | ||
| CorrelationOptions, | ||
| DataLabelerOptions, | ||
| DateTimeOptions, | ||
| FloatOptions, | ||
| HistogramOption, | ||
| IntOptions, | ||
| ModeOption, | ||
| NumericalOptions, | ||
| OrderOptions, | ||
| PrecisionOptions, | ||
| ProfilerOptions, | ||
| StructuredOptions, | ||
| TextOptions, | ||
| TextProfilerOptions, | ||
| UnstructuredOptions, | ||
| ) | ||
| from .text_column_profile import TextColumn | ||
| from .unstructured_labeler_profile import UnstructuredLabelerProfile | ||
|
|
||
| # set here to avoid circular imports | ||
| json_decoder._profiles = { | ||
| CategoricalColumn.__name__: CategoricalColumn, | ||
| FloatColumn.__name__: FloatColumn, | ||
| IntColumn.__name__: IntColumn, | ||
| DateTimeColumn.__name__: DateTimeColumn, | ||
| OrderColumn.__name__: OrderColumn, | ||
| DataLabelerColumn.__name__: DataLabelerColumn, | ||
| TextColumn.__name__: TextColumn, | ||
| } | ||
|
|
||
|
|
||
| json_decoder._compilers = { | ||
| ColumnDataLabelerCompiler.__name__: ColumnDataLabelerCompiler, | ||
| ColumnPrimitiveTypeProfileCompiler.__name__: ColumnPrimitiveTypeProfileCompiler, | ||
| ColumnStatsProfileCompiler.__name__: ColumnStatsProfileCompiler, | ||
| } | ||
|
|
||
| json_decoder._options = { | ||
| BooleanOption.__name__: BooleanOption, | ||
| HistogramOption.__name__: HistogramOption, | ||
| ModeOption.__name__: ModeOption, | ||
| BaseInspectorOptions.__name__: BaseInspectorOptions, | ||
| NumericalOptions.__name__: NumericalOptions, | ||
| IntOptions.__name__: IntOptions, | ||
| PrecisionOptions.__name__: PrecisionOptions, | ||
| FloatOptions.__name__: FloatOptions, | ||
| TextOptions.__name__: TextOptions, | ||
| DateTimeOptions.__name__: DateTimeOptions, | ||
| OrderOptions.__name__: OrderOptions, | ||
| CategoricalOptions.__name__: CategoricalOptions, | ||
| CorrelationOptions.__name__: CorrelationOptions, | ||
| DataLabelerOptions.__name__: DataLabelerOptions, | ||
| TextProfilerOptions.__name__: TextProfilerOptions, | ||
| StructuredOptions.__name__: StructuredOptions, | ||
| UnstructuredOptions.__name__: UnstructuredOptions, | ||
| ProfilerOptions.__name__: ProfilerOptions, | ||
| } | ||
|
|
||
|
|
||
| json_decoder._profilers = { | ||
| StructuredProfiler.__name__: StructuredProfiler, | ||
| } | ||
|
|
||
| json_decoder._structured_col_profiler = { | ||
| StructuredColProfiler.__name__: StructuredColProfiler, | ||
| } |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading. Please reload this page.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
actually concerned there may be an issue with the rebase here.... I thought there was or should be another small class here... but don't see it inmain,dev, orfeature/profile-serialization. Just take your time reviewingUh oh!
There was an error while loading. Please reload this page.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
disregard -- the thing I thought would be here is actually properly in order_column_profile.py