tag:github.com,2008:https://github.com/databrickslabs/discoverx/releases Release notes from discoverx 2024-01-15T13:44:50Z tag:github.com,2008:Repository/604719532/v0.0.8 2024-01-15T13:50:25Z v0.0.8 <ul> <li>Fixed bug for tables containing <code>-</code> character in the table name</li> <li>Added example for cloning all catalog/schema content</li> <li>Added filtering for table format (exclude views from queries by default)</li> <li>Added support for PII detection on non-string columns</li> <li>Updated LICENSE file</li> </ul> edurdevic tag:github.com,2008:Repository/604719532/v0.0.7 2023-11-13T13:29:10Z v0.0.7 <ul> <li>Added filtering for speedup intro message checks</li> <li>Added tags metadata in table info</li> <li>Added map function for arbitrary python code table processing support</li> <li>Added AI example notebooks</li> </ul> edurdevic tag:github.com,2008:Repository/604719532/v0.0.6 2023-10-03T07:22:51Z v0.0.6 <ul> <li>Refactored scan() in order to be chainable with from_tables()</li> <li>Improved metadata fetching speed for table information</li> <li>Refactored to remove duplicated SQL code from scanner class</li> <li>Updated intro messages and documentation</li> <li>Added example for detecting tables with many small files</li> </ul> edurdevic tag:github.com,2008:Repository/604719532/v0.0.5 2023-08-29T01:05:05Z v0.0.5 <ul> <li>Added support for multi-table SQL execution dx.from_tables(...).apply_sql(...)</li> <li>Added example of VACUUM command to multiple tables</li> <li>Added example of PII detection using Presidio over multiple tables</li> </ul> edurdevic tag:github.com,2008:Repository/604719532/v0.0.4 2023-08-03T07:36:32Z v0.0.4 <ul> <li>Removed pydantic dependency</li> <li>Fixed issues with special characters in column names</li> <li>Fixed readme docs</li> <li>Added integer and decimal rules</li> <li>Fixed case insensitive regex expressions</li> </ul> edurdevic tag:github.com,2008:Repository/604719532/v0.0.3 2023-07-05T14:39:25Z v0.0.3 <ul> <li>Upgraded pydantic dependency to 2.0</li> <li>Added support for special characters in column names</li> <li>Updated readme</li> </ul> edurdevic tag:github.com,2008:Repository/604719532/v0.0.2 2023-07-03T08:14:35Z v0.0.2 <ul> <li>Improved Readme and examples</li> <li>Added System tables permissions check with friendly message</li> <li>Refactored save and load methods after customer feedback</li> </ul> edurdevic tag:github.com,2008:Repository/604719532/v0.0.1 2023-07-03T08:09:01Z v0.0.1 <p>First release of DiscoverX.<br> It includes:</p> <ul> <li>Lakehouse scanning with REGEX rules on string columns for 16 class types (email, IP v4, IP v6, URLs, MAC address, FQDNs, credit card numbers, credit card expiry date, ISO date, ISO datetime, US mailing address, US phone number, US social security number, US state, US state abbreviation, US zip code</li> <li>Save and load scan result</li> <li>Cross-table query based on semantic types of columns (rather than column names)</li> </ul> edurdevic tag:github.com,2008:Repository/604719532/v0.0.0 2023-03-15T09:47:13Z v0.0.0 <p>Initial version to reserve PyPi project</p> david-tempelmann