Skip to content

Conversation

@recursix
Copy link
Collaborator

Sequential studies for being able to launch sequentially multiple agents on webarena with resets.

also add AbstractStudy class to define API and extract reusable code

@recursix recursix requested a review from TLSDC November 15, 2024 17:00
@recursix recursix changed the base branch from main to dev November 15, 2024 17:02
Copy link
Collaborator

@TLSDC TLSDC left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

runs on my end 😃

@TLSDC TLSDC merged commit 3286505 into dev Nov 16, 2024
@TLSDC TLSDC deleted the Study-to-multi-eval branch November 16, 2024 00:49
gasse added a commit that referenced this pull request Nov 20, 2024
* yet another way to kill timedout jobs

* Improve timeout handling in task polling logic

* Add method to override max_steps in Study class

* add support for tab visibility in observation flags and update related components

* fix tests

* black

* Improve timeout handling in task polling logic

* yet another way to kill timedout jobs (#108)

* Add method to override max_steps in Study class

* add support for tab visibility in observation flags and update related components

* fix tests

* black

* black

* Fix sorting bug.
 improve directory content retrieval with summary statistics

* fix test

* black

* tmp

* add error report, add cum cost to summary and ray backend by default

* sequential studies

---------

Co-authored-by: Maxime Gasse <maxime.gasse@gmail.com>
gasse added a commit that referenced this pull request Nov 20, 2024
* yet another way to kill timedout jobs

* Improve timeout handling in task polling logic

* Add method to override max_steps in Study class

* add support for tab visibility in observation flags and update related components

* fix tests

* black

* Improve timeout handling in task polling logic

* yet another way to kill timedout jobs (#108)

* Add method to override max_steps in Study class

* add support for tab visibility in observation flags and update related components

* fix tests

* black

* black

* Fix sorting bug.
 improve directory content retrieval with summary statistics

* fix test

* black

* tmp

* add error report, add cum cost to summary and ray backend by default

* sequential studies

---------

Co-authored-by: Maxime Gasse <maxime.gasse@gmail.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants