-
Notifications
You must be signed in to change notification settings - Fork 107
adding message class and updating generic agent accordingly #68
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Changes from all commits
a0ed59e
285ab1c
e6cb45f
6d5a703
86b4ce8
057b7d4
f41bfa8
8b2b3f3
029309d
0bdcd5e
ad8f26e
88bdea1
285a79a
ac4e05b
a5238bc
1770eba
63e1417
ccff60f
2b92354
25b7831
c1fc907
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
There are no files selected for viewing
| Original file line number | Diff line number | Diff line change |
|---|---|---|
|
|
@@ -19,8 +19,9 @@ | |
|
|
||
| from agentlab.analyze import inspect_results | ||
| from agentlab.experiments.exp_utils import RESULTS_DIR | ||
| from agentlab.llm.chat_api import make_system_message, make_user_message | ||
| from agentlab.experiments.study import get_most_recent_study | ||
| from agentlab.llm.chat_api import make_system_message, make_user_message | ||
| from agentlab.llm.llm_utils import Discussion | ||
|
|
||
| select_dir_instructions = "Select Experiment Directory" | ||
| AGENT_NAME_KEY = "agent.agent_name" | ||
|
|
@@ -581,7 +582,9 @@ def update_chat_messages(): | |
| global info | ||
| agent_info = info.exp_result.steps_info[info.step].agent_info | ||
| chat_messages = agent_info.get("chat_messages", ["No Chat Messages"]) | ||
| messages = [] | ||
| if isinstance(chat_messages, Discussion): | ||
| return chat_messages.to_markdown() | ||
| messages = [] # TODO(ThibaultLSDC) remove this at some point | ||
| for i, m in enumerate(chat_messages): | ||
| if isinstance(m, BaseMessage): # TODO remove once langchain is deprecated | ||
| m = m.content | ||
|
Comment on lines
+585
to
590
Collaborator
Author
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. The Discussion class deprecates a lot code pieces but I figured it might be safer to keep for a while
Collaborator
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. For backward compatibility? Perhaps we can wrap backward compatible code in some isolated function (no need to do now). As long as we're at laeast forward compatible :)
Collaborator
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. How does that work with AgentInfo in browsergym. They type won't be Discussion since it's only defined in AgentLab.
Collaborator
Author
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Haven't thought of that. I feel like it would be a weird usecase to use only browsergym on traces that were made with Agentlab though |
||
|
|
||
| Original file line number | Diff line number | Diff line change |
|---|---|---|
|
|
@@ -177,6 +177,7 @@ def get_reproducibility_info( | |
| "*/reproducibility_script.py", | ||
| "*reproducibility_journal.csv", | ||
| "*main.py", | ||
| "*inspect_results.ipynb", | ||
|
Collaborator
Author
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. That's just convenient |
||
| ), | ||
| ignore_changes=False, | ||
| ): | ||
|
|
||
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Comparison of before/after performances w/ 4o mini on miniwob and workarena.l1
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
awesome!
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Is this with our without the new benchmark class with new miniwob action space?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
1st 3rd line are without
2nd 4th are with