All notable changes to this project will be documented in this file.
The format is based on Keep a Changelog, and this project adheres to Semantic Versioning.
- ENHANCEMENT Update CI Pipeline and dev dependencies, target Python 3.8 issue #154, by Andreas Schuderer.
- REMOVED Drop support for Python 3.6, issue #154, by Andreas Schuderer.
- FEATURE Add training metadata reporting and querying, issue #142, by Andreas Schuderer.
- FEATURE Add support for typed CSVs (option
dtypes_path
of :class:`FileDataSource <mllaunchpad.datasources.FileDataSource>` and :class:`FileDataSink <mllaunchpad.datasources.FileDataSink>`), issue #127, by Elisa Partodikromo. - FEATURE Add Spark support (experimental), see
examples/spark_datasource.py
and issue #145, by Andreas Schuderer. - ENHANCEMENT FileDataSink (csv, euro_csv, raw DataSink types) now attempts to create missing paths if required, issue #148, by Andreas Schuderer.
- FIXED Keep generated RAML files free of command line messages, issue #126, by Andreas Schuderer.
- FIXED Change default text file encoding to UTF-8 for config files, text_file Data Sources/Sinks and JSON model metadata. NOTE: If you have been using non-ASCII characters in any of the above, you will need to check that the encoding of the relevant existing file(s) is UTF-8. issue #128, by Andreas Schuderer.
- FIXED Explicitly support repeated query parameters (and array types like number[]) through RAML definition. Remove IP from training metadata due to problem on macOS Big Sur. Fix doc build by pinning Sphinx dependency docutils to version 0.17.1. issue #147, by Andreas Schuderer.
- FIXED
mllaunchpad --verbose
now correctly logs DEBUG information, issue #119, by Andreas Schuderer. - FIXED Fixed an issue where builtin DataSources could not be found when configured, issue #118, by Andreas Schuderer.
- FIXED Readthedocs now shows the up-to-date :doc:`API docs <mllaunchpad>`, issue #110, by Andreas Schuderer.
- ENHANCEMENT Added
chunksize
parameter for piecemeal data reading to builtin DataSources, issue #120, by Andreas Schuderer. - FEATURE Added functionality to better support unit testing in model development (added optional parameters to :meth:`mllaunchpad.train_model`, :meth:`mllaunchpad.retest` and :meth:`mllaunchpad.predict`, added :meth:`mllaunchpad.get_validated_config_str`), issue #116, by Andreas Schuderer.
- FEATURE Added generic SqlDataSource for RedShift, Postgres, MySQL, SQLite, Oracle, Microsoft SQL (ODBC), and their dialects, issue #121, by Andreas Schuderer.
- ENHANCEMENT New command line interface (usage changes only slightly, see issue), issue #77, by Andreas Schuderer.
- ENHANCEMENT DataSource caching overhaul: data cached separately for different
:meth:`params <mllaunchpad.datasources.FileDataSource.get_dataframe>`,
configurable
cache_size
, issue #97, by Andreas Schuderer. - REMOVED Removed 'api:version:' (deprecated since 0.1.0) from configuration ('model:version:' is now the only location to specify both the model and the API version), issue #66, by Andreas Schuderer.
- FIXED Oracle DataSource's
get_dataframe
now interpretsNull
asnan
, issue #86, by Bob Platte. - ENHANCEMENT Add a truckload of unit tests, issue #46, by Andreas Schuderer.
- FIXED Fix missing classifiers on PyPI, by Andreas Schuderer.
- FIXED Fix misleading error message at WSGI entry point if model could not be loaded, issue #61, by Bob Platte.
- FIXED Use correct reference to werkzeug's FileStorage, issue #63, by Andreas Schuderer.
- ENHANCEMENT Document DataSources and DataSinks, issue #88, by Andreas Schuderer.
- ENHANCEMENT Document configuration, issue #67, by Andreas Schuderer.
- ENHANCEMENT Add utility user function
order_columns
for enforcing equal data column order between data sources and API parameters, issue #37, by Andreas Schuderer. - ENHANCEMENT Config file is now being checked for omitted required keys, PR #65, by Andreas Schuderer.
- FEATURE Add Impala DataSource (in examples), issue #4, by Elisa Partodikromo.
- DEPRECATED 'api:version:' to be removed from configuration ('model:version:' will be the only location to specify both the model and the API version), issue #66, by Andreas Schuderer.
- FIXED Fix examples which could not be run on Windows, issue #34, by Andreas Schuderer.
- FIXED Correcting variable names in TEMPLATE_cfg.yml, issue #43, by Bart Driessen.
- FIXED Changed config fallback file name to the more ugly ./LAUNCHPAD_CFG.yml, direct commit, by Andreas Schuderer.
- FIXED Migrate from
pipenv
topip
withrequirements/*.txt
, issue #36, by Andreas Schuderer. - ENHANCEMENT Extend documentation: getting started, use case, structure, deployment requirements, usage, issue #18, by Andreas Schuderer.
- ENHANCEMENT Improve contribution documentation, issue #35, by Gosia Rorat.
- FEATURE Added funcionality to include sub-config support, issue #28, by Elisa Partodikromo.
- FEATURE Added file upload support (multipart/form-data, experimental), PR #47, by Andreas Schuderer.
- FIXED Link from GitHub README to documentation, issue #18, by Andreas Schuderer.
- FEATURE First release on PyPI, by Andreas Schuderer.