Utilizar o MLFlow para log de métricas e parâmetros #120

dfvneto · 2021-10-20T06:11:01Z

A instalação da PlatIA agora possui o MLFlow como um de seus componentes.
Uma das funcionalidades do MLFlow é o tracking de métricas, dependências e artefatos.

Nesta tarefas iremos usar o SDK do MLFlow para salvar métricas das tarefas nativas da Plataforma.

https://www.mlflow.org/docs/latest/tracking.html

A instalação da PlatIA agora possui o MLFlow como um de seus componentes. Uma das funcionalidades do MLFlow é o tracking de métricas, dependências e artefatos. Nesta tarefas iremos usar o SDK do MLFlow para salvar métricas das tarefas nativas da Plataforma. https://www.mlflow.org/docs/latest/tracking.html

dfvneto · 2021-10-20T06:12:32Z

@fberanizo eu acabei utilizando em todos os casos o método "log_param" e não "log_metrics" não sei se tem diferença ou não.

changed log_param to log_metric

fberanizo

@dfvneto pelo que vi o log_metric parece que deu certo, mas vou deixar algumas orientações para terminar a tarefa:

O mlflow.sklearn.autolog() só faz sentido para Notebooks que usam o sklearn: linear regression, kmeans, isolation forest, logistic regression, mlp-classifier, mlp-regressor, random-forest classifier, random-forest regressor, svc, svr.
O autolog é algo que deve ser executado antes do treinamento (antes do .fit(X_train, y_train)) para que o mlflow detecte os parametros usados no sklearn.
Acredito que a melhor solução é criar uma célula logo após os parâmetros do notebook, lá no início com o start_run e o autolog.

mlflow.start_run()
mlflow.sklearn.autolog()

Por fim, só relembro de não deixar código-fonte comentado. Pode remover o platiagro.save_metrics

fberanizo · 2021-10-21T13:08:33Z

@dfvneto os notebooks falharam os testes por conta da dependência do MLflow:

Será necessário adicionar o @mock nos testes:

@mock.patch(
    "mlflow.log_metric",
)

Changed the placement of the initialization of the mlflow

Changed the placement of the initialization of the mlflow and removed wrong initialization of sklearn in notebooks where sklearn is not used

added mock to tests to include mlflow.log_metric

Changed cell description to fit new metric saving

Changed the number of arguments in mock.patch

Added mocked method as argument to test method so when called the method can assert true

Changed assert call location

Changed assert call location and formatted files

Removed unnecessary assert call

dfvneto · 2021-11-08T07:27:40Z

Lendo alguns resultados do CI percebi que está tendo problema no conda e no pickling na hora de serializar os objetos na hora de salvar

Removed wrong extra import

new utils file for mocked method

Removed sklearn autolog

Changed mock annotation placement

Removed mock of log metrics since it's possible to test and use inside the container

Removed usage of mlflow

Removed autolog

Removed log_metric state because log metric can't be used to log arrays or dataframes

It is necessary to cause this tests to fail and catch the error because they take too long to execute then causing timeout on the continuous integration. The errors are caused passing wrong arguments at the paper_mill.execute_notebook method, triggering the PapermillExecutionError at the majority of the cases (one error in the isolation forest clustering triggers JSONDecodeError).

sonarqubecloud · 2021-11-17T15:11:37Z

Kudos, SonarCloud Quality Gate passed!

0 Bugs
0 Vulnerabilities
0 Security Hotspots
0 Code Smells

No Coverage information
0.0% Duplication

dfvneto · 2021-11-17T15:57:40Z

tests/test_isolation_forest_clustering.py

-                max_features=1.0,
-            ),
-        )
+        with pytest.raises(JSONDecodeError):


O statement desta linha ja faz o assert para o erro. Caso o erro ocorra, o teste é concluído com sucesso.

Esse foi o único caso que consegui criar outro erro além do PapermillExecutionError. Embora todos os erros seguintes sejam os mesmos, as causas são diferentes, mas o papermill trata todos como a mesma exceção. Para capturar a exceção correta e ter acesso aos dados da exceção é necessário encapsular a exceção do papermill com a exceção desejada.

Fonte: nteract/papermill#344

dfvneto · 2021-11-17T15:58:04Z

tests/test_isolation_forest_clustering.py

+                "Experiment.ipynb",
+                "/dev/null",
+                parameters=dict(
+


Nesse caso, o parâmetro removido foi o de dataset

dfvneto · 2021-11-17T15:58:33Z

tests/test_isolation_forest_clustering.py

+
+                    max_samples="auto",
+                    contamination=0.1,
+                    max_features=105665,


Nesse caso, max features excedeu o limite

dfvneto · 2021-11-17T15:58:56Z

tests/test_logistic_regression.py

+                    C=1.0,
+                    fit_intercept=True,
+                    class_weight=None,
+                    solver="arroz",


Nesse caso, o solver foi alterado para um inválido

dfvneto · 2021-11-17T15:59:36Z

tests/test_mlp_classifier.py

+                    one_hot_features="",
+
+                    hidden_layer_sizes=100,
+                    activation="lulu",


Nesse caso, o parâmetro activation foi alterado para um inválido

dfvneto · 2021-11-17T16:00:15Z

tests/test_random_forest_classifier.py

+                    one_hot_features="",
+
+                    n_estimators=10,
+                    criterion="gina",


Nesse caso, o parâmetro criterion foi alterado para um inválido

dfvneto · 2021-11-17T16:00:38Z

tests/test_svc.py

+                    one_hot_features="",
+
+                    C=1.0,
+                    kernel="linux",


Nesse caso, o parâmetro kernel foi alterado para um inválido

dfvneto · 2021-11-17T16:03:46Z

tests/test_isolation_forest_clustering.py

-                max_features=1.0,
-            ),
-        )
+        with pytest.raises(JSONDecodeError):


Esse foi o único caso que consegui criar outro erro além do PapermillExecutionError. Embora todos os erros seguintes sejam os mesmos, as causas são diferentes, mas o papermill trata todos como a mesma exceção. Para capturar a exceção correta e ter acesso aos dados da exceção é necessário encapsular a exceção do papermill com a exceção desejada.

Fonte: nteract/papermill#344

dfvneto · 2021-11-17T16:05:55Z

Os testes que falharam são os testes que estão falhando na master atualmente. Neles foram removidas as alterações referentes ao mlflow e removidas as chamadas referentes ao platia.save_metrics conforme orientado

fberanizo · 2021-11-19T12:52:23Z

@dfvneto vou aguardar uns merges de outros PRs, mas já vou fechar a tarefa no Jira.
Vlws!

removed unnecessary code block

dfvneto requested review from andreluiz27, dnlcesilva and fberanizo October 20, 2021 06:11

96609db

changed log_param to log_metric

fberanizo requested changes Oct 21, 2021

View reviewed changes

dfvneto added 5 commits October 21, 2021 12:32

2bdf5dc

Changed the placement of the initialization of the mlflow

5794f3b

Changed the placement of the initialization of the mlflow

6583ea1

Changed the placement of the initialization of the mlflow and removed wrong initialization of sklearn in notebooks where sklearn is not used

0862b91

added mock to tests to include mlflow.log_metric

248c9e9

Changed cell description to fit new metric saving

dfvneto marked this pull request as ready for review October 21, 2021 19:28

dfvneto added 6 commits October 29, 2021 10:24

7970269

Changed the number of arguments in mock.patch

71e1543

Added mocked method as argument to test method so when called the method can assert true

f197e85

Changed assert call location

78f649a

Changed assert call location and formatted files

aa56b02

Removed unnecessary assert call

c0eee4b

Removed unnecessary assert call

dfvneto added 9 commits November 8, 2021 11:48

05a09cd

Removed wrong extra import

f63a162

new utils file for mocked method

83f8a9e

Removed sklearn autolog

8a67824

Changed mock annotation placement

55829e6

Removed mock of log metrics since it's possible to test and use inside the container

b088557

Removed usage of mlflow

48e30fd

Removed autolog

3dc47d2

Removed log_metric state because log metric can't be used to log arrays or dataframes

038c5f6

Removed log_metric state because log metric can't be used to log arrays or dataframes

dfvneto commented Nov 17, 2021

View reviewed changes

fa31712

removed unnecessary code block

Utilizar o MLFlow para log de métricas e parâmetros #120

Are you sure you want to change the base?

Utilizar o MLFlow para log de métricas e parâmetros #120

Uh oh!

Conversation

dfvneto commented Oct 20, 2021

Uh oh!

dfvneto commented Oct 20, 2021

Uh oh!

fberanizo left a comment

Choose a reason for hiding this comment

Uh oh!

fberanizo commented Oct 21, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

dfvneto commented Nov 8, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

sonarqubecloud bot commented Nov 17, 2021

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

dfvneto commented Nov 17, 2021

Uh oh!

fberanizo commented Nov 19, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

fberanizo commented Oct 21, 2021 •

edited

Loading

dfvneto commented Nov 8, 2021 •

edited

Loading