
Conversation

@Neeratyoy (Contributor) commented Nov 10, 2020

What does this PR implement/fix? Explain your changes.

  • Fixes some of the unit tests that were failing after the test-server clean-up.

  • Updates the docs to ensure that anything published to the test server during unit testing is also scheduled for deletion (a rough sketch of this pattern follows below).
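
A hedged sketch of that clean-up pattern (the tracking and deletion helpers shown here, e.g. `_delete_entity`, are assumptions about the project's test utilities, not a verbatim excerpt):

```python
import unittest

import openml
from openml import utils


class TestUploadCleanup(unittest.TestCase):
    """Sketch: anything published to the test server during a unit test
    is tracked and scheduled for deletion afterwards."""

    def setUp(self):
        openml.config.server = "https://test.openml.org/api/v1/xml"
        # (entity_type, entity_id) pairs collected while the test runs
        self._tracked_uploads = []

    def _track(self, entity_type, entity_id):
        # Call this right after every publish() in a test body.
        self._tracked_uploads.append((entity_type, entity_id))

    def tearDown(self):
        # Delete everything this test pushed to the test server; the helper
        # name is an assumption based on openml/utils.py.
        for entity_type, entity_id in self._tracked_uploads:
            utils._delete_entity(entity_type, entity_id)
```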

@Neeratyoy (Contributor, Author) commented Nov 10, 2020

@mfeurer this step appears to be required in order to test the subsequent clause under which the failure message is raised. That is, without uploading a task, the test passes without exceptions.

However, on local tests with the current change, it gave the following error:
openml.exceptions.OpenMLServerException: https://test.openml.org/api/v1/xml/task/ returned code 623: Tasks can only be created upon active datasets. - problematic input: source_data, dataset not active.

This issue was addressed by waiting for the dataset to be processed before creating a task on it; a rough sketch of that fix follows below.
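
A sketch of the fix, with hedged assumptions about the create_task / check_datasets_active signatures in openml-python at the time:

```python
import time

import openml
from openml.tasks import TaskType


def create_task_when_ready(dataset_id, target_name, timeout=600, poll=10):
    """Poll the test server until the dataset is active, then create a task.

    Error 623 ("Tasks can only be created upon active datasets") is raised
    when the dataset has been uploaded but not yet processed/activated.
    """
    waited = 0
    while waited < timeout:
        # check_datasets_active returns a {dataset_id: bool} mapping
        if openml.datasets.check_datasets_active([dataset_id]).get(dataset_id, False):
            break
        time.sleep(poll)
        waited += poll
    else:
        raise TimeoutError(f"Dataset {dataset_id} was not activated within {timeout}s")

    # Keyword names are assumptions about the create_task signature.
    return openml.tasks.create_task(
        task_type=TaskType.SUPERVISED_CLASSIFICATION,
        dataset_id=dataset_id,
        target_name=target_name,
        estimation_procedure_id=1,
    )
```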

@Neeratyoy (Contributor, Author):

@mfeurer many more tests fail now. This is a common error:

openml.exceptions.OpenMLServerException: https://test.openml.org/api/v1/xml/flow/exists returned code 104: This is a read-only account, it does not have permission for write operations. - API calls of the read-only user can only be of type GET.

Other fetches from the server return empty results, which makes many of the assertions fail too!

@mfeurer (Collaborator) commented Nov 23, 2020

@Neeratyoy I just discussed with @joaquinvanschoren and he gave the API key write permissions

@Neeratyoy (Contributor, Author):

@mfeurer

  1. If this function is in line with what we discussed, I can then extend it to other tests: https://github.com/openml/openml-python/pull/1000/files#diff-cce4323af466db2f54edb9161a510818e26857ea520e94923c3b987aff28dfc8R35

  2. Given the design of create_task, I wasn't able to build a single meta_data dict that can be used uniformly to create a task, publish it, retrieve it, and check its meta-data, so I needed this line (a fuller sketch follows below):

    task_meta_data["task_type"] = TaskType.SUPERVISED_CLASSIFICATION

@codecov-io commented Dec 8, 2020

Codecov Report

Merging #1000 (b5e1242) into develop (560e952) will decrease coverage by 0.15%.
The diff coverage is 78.57%.

Impacted file tree graph

@@             Coverage Diff             @@
##           develop    #1000      +/-   ##
===========================================
- Coverage    87.86%   87.70%   -0.16%     
===========================================
  Files           36       36              
  Lines         4531     4580      +49     
===========================================
+ Hits          3981     4017      +36     
- Misses         550      563      +13     
Impacted Files Coverage Δ
openml/testing.py 83.92% <70.58%> (-3.58%) ⬇️
openml/_api_calls.py 88.97% <84.37%> (-2.02%) ⬇️
openml/config.py 79.67% <100.00%> (+0.33%) ⬆️
openml/utils.py 91.33% <100.00%> (+0.72%) ⬆️

Continue to review full report at Codecov.

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 560e952...f5e4a3e.

@Neeratyoy Neeratyoy marked this pull request as ready for review December 9, 2020 14:21
@Neeratyoy Neeratyoy requested a review from mfeurer December 9, 2020 14:21
@Neeratyoy Neeratyoy requested a review from mfeurer December 16, 2020 20:41
@mfeurer mfeurer requested a review from PGijsbers December 21, 2020 13:52
@PGijsbers (Collaborator) left a comment

I only looked at _api_calls. I think the main point of discussion is whether or not we want two loops of retries for server connections. Currently the outer loop is run as long as the response content is incorrect (i.e. the hash does not match), and the inner loop in _send_request also retries up to n_retries times until a status 200. If we don't want to nest these loops (and end up with n_retries^2 retries), we should probably allow _send_request to check the response against a checksum for GET queries.
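
To make that alternative concrete, here is an illustrative sketch (not the actual openml._api_calls code) of a single retry loop that treats a non-200 status and a checksum mismatch the same way, so we stay at n_retries instead of n_retries^2:

```python
import hashlib
import time

import requests


def get_with_retries(url, n_retries=5, expected_md5=None, delay=1.0):
    """Retry a GET until the status is 200 AND the payload matches the
    expected checksum (when one is given)."""
    last_error = None
    for attempt in range(n_retries):
        try:
            response = requests.get(url)
            if response.status_code != 200:
                raise RuntimeError(f"status {response.status_code}")
            if expected_md5 is not None:
                md5 = hashlib.md5(response.content).hexdigest()
                if md5 != expected_md5:
                    raise RuntimeError("checksum mismatch")
            return response
        except (requests.RequestException, RuntimeError) as error:
            last_error = error
            time.sleep(delay * (attempt + 1))  # simple linear back-off
    raise RuntimeError(f"GET {url} failed after {n_retries} retries") from last_error
```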

@Neeratyoy Neeratyoy requested a review from PGijsbers December 21, 2020 23:20
@Neeratyoy (Contributor, Author):

@PGijsbers the tests that fail are, oddly, due to either a worker crashing or the issue with attaching entities to a study.

@PGijsbers (Collaborator) left a comment

I think _api_calls looks fine now. I also left a comment on a change that was shown to me when I clicked the notification; I would like some clarification on that first.
Edit: I do still need to look at the CI results as well.

@PGijsbers (Collaborator):

Looks mostly like server failure? Though something like this is suspicious:

=================================== FAILURES ===================================
____________________ TestStudyFunctions.test_publish_study _____________________
[gw3] linux -- Python 3.6.12 /opt/hostedtoolcache/Python/3.6.12/x64/bin/python

self = <test_study_functions.TestStudyFunctions testMethod=test_publish_study>

    @pytest.mark.flaky()
    def test_publish_study(self):
        # get some random runs to attach
        run_list = openml.evaluations.list_evaluations("predictive_accuracy", size=10)
        self.assertEqual(len(run_list), 10)
    
        fixt_alias = None
        fixt_name = "unit tested study"
        fixt_descr = "bla"
        fixt_flow_ids = set([evaluation.flow_id for evaluation in run_list.values()])
        fixt_task_ids = set([evaluation.task_id for evaluation in run_list.values()])
        fixt_setup_ids = set([evaluation.setup_id for evaluation in run_list.values()])
    
        study = openml.study.create_study(
            alias=fixt_alias,
            benchmark_suite=None,
            name=fixt_name,
            description=fixt_descr,
            run_ids=list(run_list.keys()),
        )
        study.publish()
        # not tracking upload for delete since _delete_entity called end of function
        # asserting return status from openml.study.delete_study()
        self.assertGreater(study.id, 0)
        study_downloaded = openml.study.get_study(study.id)
        self.assertEqual(study_downloaded.alias, fixt_alias)
        self.assertEqual(study_downloaded.name, fixt_name)
        self.assertEqual(study_downloaded.description, fixt_descr)
        self.assertEqual(study_downloaded.main_entity_type, "run")
    
        self.assertSetEqual(set(study_downloaded.runs), set(run_list.keys()))
        self.assertSetEqual(set(study_downloaded.setups), set(fixt_setup_ids))
        self.assertSetEqual(set(study_downloaded.flows), set(fixt_flow_ids))
        self.assertSetEqual(set(study_downloaded.tasks), set(fixt_task_ids))
    
        # test whether the list run function also handles study data fine
        run_ids = openml.runs.list_runs(study=study.id)
        self.assertSetEqual(set(run_ids), set(study_downloaded.runs))
    
        # test whether the list evaluation function also handles study data fine
        run_ids = openml.evaluations.list_evaluations(
            "predictive_accuracy", size=None, study=study.id
        )
        self.assertSetEqual(set(run_ids), set(study_downloaded.runs))
    
        # attach more runs
        run_list_additional = openml.runs.list_runs(size=10, offset=10)
>       openml.study.attach_to_study(study.id, list(run_list_additional.keys()))

tests/test_study/test_study_functions.py:164: 
_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ 
openml/study/functions.py:371: in attach_to_study
    result_xml = openml._api_calls._perform_api_call(uri, "post", post_variables)
openml/_api_calls.py:61: in _perform_api_call
    response = __read_url(url, request_method, data)
openml/_api_calls.py:161: in __read_url
    request_method=request_method, url=url, data=data, md5_checksum=md5_checksum
openml/_api_calls.py:192: in _send_request
    __check_response(response=response, url=url, file_elements=files)
_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ 

response = <Response [412]>
url = 'https://test.openml.org/api/v1/xml/study/1685/attach'
file_elements = None

    def __check_response(response, url, file_elements):
        if response.status_code != 200:
>           raise __parse_server_exception(response, url, file_elements=file_elements)
E           openml.exceptions.OpenMLServerException: https://test.openml.org/api/v1/xml/study/1685/attach returned code 1045: Problem attaching entities. Please ensure to only attach entities that exist - None

openml/_api_calls.py:230: OpenMLServerException

That would mean there were no runs on the server (or at most 10)?
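
A quick way to check that hypothesis against the test server (a sketch; the study id is just the one from the traceback above):

```python
import openml

openml.config.server = "https://test.openml.org/api/v1/xml"
study_id = 1685  # the study from the failing test above

# Only attach runs that actually exist on the server. If the test server
# holds at most 10 runs, this listing with offset=10 comes back empty and
# attach_to_study fails with error 1045.
run_list_additional = openml.runs.list_runs(size=10, offset=10)
if run_list_additional:
    openml.study.attach_to_study(study_id, list(run_list_additional.keys()))
else:
    print("Fewer than 11 runs on the server; nothing to attach.")
```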

@Neeratyoy (Contributor, Author):

I also left a comment on a change that was shown to me when I clicked the notification, would like some clarification for that first.

I'm afraid I can't see exactly which comment :/

@PGijsbers (Collaborator):

I'm afraid I can't see exactly which comment :/

It's the one about the time measurements that you commented on 👍

@PGijsbers (Collaborator) commented Dec 23, 2020

Thanks for all the work on this PR! I'm merging this as it's a big improvement over what was there before, and so we don't need to carry it over the break.
Hopefully we can sort out the last failures soon. Ultimately we will need a better test structure one day, but that will be a major undertaking.

@PGijsbers PGijsbers merged commit fba6aab into develop Dec 24, 2020
@PGijsbers PGijsbers deleted the fix_unit_tests branch December 24, 2020 09:08
@mfeurer (Collaborator) commented Jan 4, 2021

Thanks for merging @PGijsbers

The flaky error you found was reported by @Neeratyoy in openml/OpenML#1080
