Skip to content

Fix two issues in linear tree-based model: #1

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 3 commits into from
Dec 12, 2024

Conversation

will945945945
Copy link
Contributor

@will945945945 will945945945 commented Dec 8, 2024

  1. We should not filter any data instance in the root. (In the original code, we do not include the instances with zero label in the training.)

  2. The outputs of a tree-based model should be in the range [0, 1]^{# of label}, which is corresponding to the probability estimates. Moreover, if we want to use sparse matrix to store the prediction values of a tree model, the value ``-inf'' will be a trouble issue. (This one is fixed by PR keep the output of linear tree-based model as probability estimates #2)

What does this PR do?

add data rule for root in tree-based model

Test CLI & API (bash tests/autotest.sh)

Test APIs used by main.py.

  • Test Pass
    • (Copy and paste the last outputted line here.)
  • Not Applicable (i.e., the PR does not include API changes.)

Check API Document

If any new APIs are added, please check if the description of the APIs is added to API document.

  • API document is updated (linear, nn)
  • Not Applicable (i.e., the PR does not include API changes.)

Test quickstart & API (bash tests/docs/test_changed_document.sh)

If any APIs in quickstarts or tutorials are modified, please run this test to check if the current examples can run correctly after the modified APIs are released.

1. We should not filter any data instance in the root.
  (In the original code, we do not include the instances with zero label in the training.)

2. The outputs of a tree-based model should be in the range [0, 1]^{# of label}, which is corresponding to the probability estimates.
  Moreover, if we want to use sparse matrix to store the prediction values of a tree model, the value ``-inf'' will be a trouble issue.
@Eleven1Liu Eleven1Liu requested a review from a team December 10, 2024 14:18
@will945945945 will945945945 merged commit 9b394aa into ntumlgroup:master Dec 12, 2024
1 check passed
@will945945945 will945945945 deleted the fix_bugs_in_tree branch December 12, 2024 06:22
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants