-
Notifications
You must be signed in to change notification settings - Fork 55
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Features/340 gaussian nb #474
Merged
+1,367
−63
Merged
Changes from all commits
Commits
Show all changes
232 commits
Select commit
Hold shift + click to select a range
3a70b0c
added workflow file
d849594
changed branches name
6cb1dc8
changed branches name
3f75147
build always on push
96a7bb4
changed to ubuntu and dependencies are now installed properly
85686f6
changed workflow to run on docker image
f56e8f3
using docker image properly
b364a26
changed to always use bash
68f2cf9
added debug code
9ec0f10
added earlier debug code
14f39e1
changed all to use /bin/bash
4c58b35
changed all to use sudo
cbe4375
running in a container now
1990539
still using /bin/bash
a27842f
removed sudo
ae82b27
added debug information for shell
3c43e35
added more shell debugging
81d44b0
fixed typo
8d2b2f4
added debug for standard shell
a0dae39
fixed more typos
b60b745
removed hostnamectl
54d5fc8
changed way os is fetched
29032a5
added module debug info
4dce1f9
moved debug code into bin/bash
b076661
removed loading of bashrc
2098a73
added installation to workflow
6ff3816
removed sudo
bb2d2ac
added more capabilities to docker
3370de7
changed options syntax
a573c3a
using different image
2b52001
using new image
ea3e932
removed typo
9e0ff24
changed debug information
014d2fd
added new no docker workflow
1cc73ba
fixed type
dc91953
added version info to checkout action
e66aaaa
added sudo
3c88f01
changed libopenmpi version
dd0e5bc
added python setup and testing stages
4c7e878
moved to python 3
b618eb1
added venv dependency
0c6e3e1
added y flag to work in automated shell
088b484
fixed typo and added correct location for test run
e068b65
added some debug code to find current location
961f881
removed folder switch
3e81fe1
fixed typo
32bf0ef
added pytest dependency
a2f43aa
changed run stage to activate virutalenv
62b097b
added more test stages
61c7242
added virtalenv activation for coverage creation
3f42f0f
added dev dependency
bbf313e
create new workflow using a ubuntu image
06f663c
fixed workflow syntax
f59ce5b
fixed type
8332eb8
removed version info
8eeae46
fixed spelling error
0f552a3
moved container usage
77f956b
added virtualenv steps
96d2d3e
checking home path
aecae36
adapted activate path
e526fe9
adding pip check
836b744
added shell checks
456027c
added code to install python packages and run tests
dec13af
fixed typo
7edde39
cleaned up code and added coverage combine steps
f645257
added bash in docker file
ba4403f
removed bash from docker due to not working properly
793eea9
removed bash from docker due to not working properly
02949a5
added fedora step to workflow
841a2dc
fixed type
f6db040
fixed another typo
2587222
changing ubuntu version
c7d7029
added volume
4d3c53e
sourcing module now
6595c3d
added options to docker
ea9e6fa
activating modules
597d0ec
moved module activation to docker file
e3a46af
removed volumes and added number of cpus
5914121
added oversubscribe to allow more than 2 parallel jobs
5da6f3c
fixed typo
e53dc5e
created build matrix
ca63124
fixed build matrix
2e1e7dc
added number of processes to build matrix
e1b9370
added codecv action
9212f0f
added pre commit execution
7a8d228
fixed action import
eab67e9
fixed missing dollar sign
f26e4ba
removed wrong dollar sign
17cec3d
added git installation
58cf0ca
Merge branch 'master' into features/424-github-actions
TheSlimvReal 139dca8
Merge branch 'master' into features/424-github-actions
coquelin77 64f3147
Merge branch 'master' into features/424-github-actions
coquelin77 aba2b80
Merge branch 'master' into features/424-github-actions
coquelin77 696885e
codecov fails if an error occurs in this step
a27c24b
updated docker image
ad1f01a
added deploy workflow
b38d0a5
Merge branch 'features/424-github-actions' of https://github.com/helm…
4b8875d
updated docker images with wheel
725806f
added stage to build python package
8897d2b
added docker image to publish workflow
ee20c60
undone wheel dependency
0ebb316
changed workflow to run without docker image
1de5eb9
changed name
9a8fe32
fixed workflow file
c939a7c
added correct versions to actions
b3b76a6
fixed action paths
dec2fa1
fixed the token for pypi
fbdcd8f
Merge branch 'master' into features/424-github-actions
TheSlimvReal d9e6648
Merge remote-tracking branch 'origin/master' into features/424-github…
83462ad
removed travis file
183334f
Introducing test failure to check GitHub Workflows behaviour
ClaudiaComito 72bfd89
Formatting
ClaudiaComito 778ff73
More formatting and blake pickiness
ClaudiaComito 5d5efb3
Next attempt at failing
ClaudiaComito d1db24e
Undoing all failed failure attempts and bringing code to original state
ClaudiaComito b4fab82
Adding new submodule
ClaudiaComito 2f3fb66
Adding new class
ClaudiaComito 3f90f02
GaussianNB first pass, replaced numpy with heat calls, added input sa…
ClaudiaComito ca33cd3
Removed unused call to sklearn _check_X
ClaudiaComito c7896b9
Replaced call to sklearn.utils.validation.check_X_y with basic shape …
ClaudiaComito 4ca18d8
Merge branch 'master' into features/340-GaussianNB
ClaudiaComito cd55d73
Merge branch 'master' into features/340-GaussianNB
ClaudiaComito 17551db
Added basic checks for sample_weight
ClaudiaComito 57c96df
Added _check_partial_fit_first_call as a staticmethod for now
ClaudiaComito 50f36a2
Removed obsolete comment
ClaudiaComito ae84b5b
Added _BaseNB class from scikit_learn
ClaudiaComito 6cb88e8
Merge branch 'master' into features/340-GaussianNB
ClaudiaComito 3dc93b6
Merge branch 'master' into features/340-GaussianNB
ClaudiaComito 5459a19
Moved relevant _BaseNB methods to GaussianNB class, removed _BaseNB f…
ClaudiaComito 0f66a66
Integrate heat.core.naive_bayes in set up.
ClaudiaComito e1e8542
Formatting
ClaudiaComito 882ba56
Removing sklearn-specific validation calls for now.
ClaudiaComito 04d41c2
Replacing np.unique/np.sort calls with ht.unique(sorted=True) calls
ClaudiaComito a770dec
Merge branch 'master' into features/340-GaussianNB
ClaudiaComito 1a7e70d
Replacing np.in1d call with ht.eq, equivalent in this context.
ClaudiaComito 4affd6d
_partial_fit(), temporary (hacky) replacement for np.searchsorted.
ClaudiaComito 293a54d
Improved temporary searchsorted, debugging
ClaudiaComito b4cab25
Adapted joint_log_likelihood to absence of append() for torch/heat te…
ClaudiaComito 1198f05
Adapted predict() to ht.argmax returning a heat tensor
ClaudiaComito 14ffcfc
Implemented GaussianNB.logsumexp() (hacky early version)
ClaudiaComito 8ea2914
Merge branch 'master' into features/340-GaussianNB
ClaudiaComito 3456ae0
Merge branch 'master' into features/340-GaussianNB
ClaudiaComito c33a2a3
Modified _joint_log_likelihood to not rely on append(). No need to tr…
ClaudiaComito bf20bfe
logsumexp fixes
ClaudiaComito 2bb84c5
Removed print/debugging statements
ClaudiaComito 2638a78
formatting
ClaudiaComito 297423a
Fixed mistake in shape of joint_log_likelihood tensor
ClaudiaComito 80c22fb
Implementing test_gaussiannb(). First pass, test locally.
ClaudiaComito 9d61505
Tidying up comments and #TODOs
ClaudiaComito 7ac033a
Implemented testing of distributed GaussianNB (fails).
ClaudiaComito 3c2f4e1
Merge branch 'master' into features/340-GaussianNB
ClaudiaComito a7a80db
Distributed __getitem__ now returns tensor of values at LIST of indic…
ClaudiaComito 3911a12
Test for distributed case where data and labels are split along axis 0.
ClaudiaComito 374d460
Enforce split=None for list of unique labels for now.
ClaudiaComito b21204f
Updated changelog
ClaudiaComito 7706348
Adding back .travis.yml
ClaudiaComito 731d99f
Removed outdated comment line
ClaudiaComito a818ec2
Refined conditional statements
ClaudiaComito bfd3244
formatting
ClaudiaComito daed150
Renamed internal functions according to HeAT convention (starting wit…
ClaudiaComito e3f9dbd
Testing gnb predictions vs. test labels
ClaudiaComito 1d5a30d
Resolved flake8 conflicts
ClaudiaComito 73e5bb9
More flake-iness
ClaudiaComito 58bf7cd
Line breaks after """
ClaudiaComito ac4436a
Docs rewording
ClaudiaComito 5bc63ff
In-place resplitting
ClaudiaComito 1a0ee4d
Removed confusing reference to scikit-learn version 0.17
ClaudiaComito 552119d
__check_partial_fit_first_call(clf, ...) --> __check_partial_fit_firs…
ClaudiaComito 53e4c67
Reference to #351, added pointer to topic
ClaudiaComito 2580f43
Formatting error messages
ClaudiaComito 6da348d
Added dtype, device calls where missing
ClaudiaComito 045cec6
Added references to Issue #468, sanitation TODOs
ClaudiaComito 36ab57c
Added missing line breaks
ClaudiaComito 058ceec
Added missing line breaks
ClaudiaComito ba138ef
Added docs for __joint_log_likelihood
ClaudiaComito e0a2587
Merge branch 'master' into features/340-GaussianNB
ClaudiaComito f84beef
Move naive_bayes one level up
ClaudiaComito f523587
Removed heat/core/naive_bayes
ClaudiaComito 7d1412d
Updated import heat.core.naive_bayes --> heat.naive_bayes
ClaudiaComito 21f192e
Fixing flake8 "E402 module level import not at top of file" complaint
ClaudiaComito 00d6123
Removed -quiet option from pip install.
ClaudiaComito b356fea
Import version after sys.path.append("./heat/core") in spite of flake…
ClaudiaComito 535e397
Building problems. Setting --progress-bar off for pip install
ClaudiaComito 20fe2d5
Uploading standard sklearn train/test iris data.
ClaudiaComito 4388207
Rewrote test_gaussiannb to compare to sklearn gnb() without importing…
ClaudiaComito 5e56fc1
Bypassing test with 7 procs for now as test dataset is too small
ClaudiaComito 0ba4a73
Setting pip install back to -quiet
ClaudiaComito 2fccd8f
Increasing test coverage. Testing exceptions
ClaudiaComito 266c091
Switching pip install out of quiet mode again
ClaudiaComito d423142
Added checks for sample_weight = 0 locally
ClaudiaComito 8a80683
Added test case for sample_weight not None (split=None)
ClaudiaComito 56713c8
__getitem__ call on sample_weight
ClaudiaComito 78532f7
testing GaussianNB when sample_weight is not None, sample_weight not …
ClaudiaComito 0a37705
nonzero now returns 1-D tensor if input is 1-D
ClaudiaComito f2c0404
Modified __getitem__ for case in which the only input (key) is a list…
ClaudiaComito 310a7c2
ht.average(), improved weighted average when weights are 1-D and inpu…
ClaudiaComito e16e118
Adapted test_average() to changes in statistics.average
ClaudiaComito 1b16fce
Small change to __getitem__ call
ClaudiaComito a956f25
Added test for GaussianNB with weights, both local and distributed
ClaudiaComito 7399eb5
Updated test_average()
ClaudiaComito 061fc27
Skipping test on 7 nodes for now. Cf. #490
ClaudiaComito f8e5d78
Extending test coverage: weighted average with 3d weights.
ClaudiaComito 95de460
ht.average(), NotImplementedError if weights.split != x.split until #…
ClaudiaComito e14e8b0
test_average(), test NotImplementedError when weights.split != x.split
ClaudiaComito 9fa985d
Extending test coverage: testing exceptions.
ClaudiaComito 005ccf6
Increasing test coverage.
ClaudiaComito c6ae05c
Tests fail on 7 nodes, #490
ClaudiaComito 92d4a48
Added tests for gnb.predict_proba()
ClaudiaComito 2c3e4d9
Shape of log_prob_x must match __joint_log_likelihood(X) output
ClaudiaComito 6e6c1f2
scikit-learn predict_proba output for testing/comparison
ClaudiaComito c6a85be
Resolving conflicts with master
ClaudiaComito bbb5610
Updated documentation and example of GaussianNB
ClaudiaComito 165cfbb
Adapted documentation from scikit-learn to HeAT.
ClaudiaComito 887f55c
ht.var() now returns same dtype as input tensor in distributed mode a…
ClaudiaComito 50c113a
Extending test coverage to gnb attributes
ClaudiaComito 1596496
turn gnb.priors into ht.DNDarray only if it isn't already
ClaudiaComito 0f4544a
Test coverage gnb.partial_fit(), gnb.priors
ClaudiaComito 2bac14e
Removed dtype mismatch embuggerance between priors.sum() and 1.0
ClaudiaComito d890e60
More tests
ClaudiaComito 2a14bf5
Merge branch 'master' into features/340-GaussianNB
coquelin77 c15b0c8
Improving tests
ClaudiaComito b4cdd20
Merge branch 'features/340-GaussianNB' of https://github.com/helmholt…
ClaudiaComito ad8804d
7-node test
ClaudiaComito 0ca9244
Removing 7-node testing again
ClaudiaComito b769ad2
Extending test coverage
ClaudiaComito 74b2d5b
Improved conditional statement as per review
ClaudiaComito 915ee4a
Added references section to documentation
ClaudiaComito 2083434
Added References section to __update_mean_variance documentation
ClaudiaComito 907e38f
Test coverage
ClaudiaComito 4dcfa97
More test coverage
ClaudiaComito 721157b
Removed dead code
ClaudiaComito a829607
added extra dtype calls
coquelin77 File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
possible dtype problem here.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
is var() supposed to return a float32? Even if x is float64?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
it should not. i guess i missed a couple dtype calls here. can you add them for me? it should be just in this spot
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
That's what I thought, so is this the possible dtype problem you were talking about? I guess I just misunderstood your first comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
i just also realized that the dtype of n_tot doesnt matter because it is only used internally