GitHub - mrmoxon/Agent-WebVoyager: Agent-WebVoyager autonomously navigates the web like a human, performing tasks without specific APIs. It uses visual cues and intelligent decisions for web scraping and information retrieval, documenting each step visually. This innovative approach showcases AI's versatility in dynamic web exploration.

Agent-WebVoyager

Overview

Agent-WebVoyager is an agent for web navigation and data extraction. It performs multi-step browsing tasks without platform-specific APIs and records its progress along the way. Mimicking human browsing behavior, it navigates the web, interacts with pages, and extracts information, relying on visual cues and automated decision-making rather than site-specific integrations.

The project demonstrates a "meta-webscrape" task: browsing X (formerly Twitter) and reporting Elon Musk's most recent post purely by simulating a few user interactions with the web page. Because the method does not depend on platform-specific APIs, the same approach can in principle be applied to other sites.

Features

Human-like Web Navigation: Employs visual cues and page elements for navigation, making the process similar to how a human would browse.
No API Required: Performs tasks without relying on specific web service APIs, enabling broader applicability across various platforms.
Intelligent Decision Making: Utilizes a set of defined functions to make decisions, interact with web elements, and navigate through pages. Leverages GPT-4V and LangSmith.
Visual Task Documentation: Generates a visual path history, documenting each step taken during the task execution.
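
To make the "set of defined functions" idea concrete, here is a minimal sketch of how a model's text reply might be turned into one of a fixed set of actions. The action names (click, type, scroll, answer), the `Action: name [label]; text` reply format, and the `parse_action` helper are all assumptions made for illustration; the functions actually defined in agent_voyage.py may differ.

```python
import re
from dataclasses import dataclass

# Hypothetical action vocabulary; the repository's real set of functions may differ.
KNOWN_ACTIONS = {"click", "type", "scroll", "answer"}


@dataclass
class Action:
    name: str
    args: list[str]


def parse_action(reply: str) -> Action:
    """Parse a reply such as 'Action: type [5]; hello' into an Action.

    The reply format is an assumption for this sketch, not the project's own.
    """
    match = re.search(r"Action:\s*(\w+)\s*(.*)", reply, flags=re.IGNORECASE)
    if match is None:
        # No recognizable action line: ask the caller to retry.
        return Action("retry", ["no action line found"])
    name = match.group(1).lower()
    if name not in KNOWN_ACTIONS:
        return Action("retry", [f"unknown action: {name}"])
    raw_args = match.group(2).strip()
    # Split arguments on ';' and drop the brackets around element labels.
    args = [part.strip().strip("[]") for part in raw_args.split(";") if part.strip()]
    return Action(name, args)
```

Restricting the model to a small, known vocabulary and returning a retry action for anything else keeps a malformed reply from being executed against the page.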

Installation

To set up Agent-WebVoyager, follow these steps:

Clone the repository:

```
git clone https://github.com/mrmoxon/Agent-WebVoyager.git
```

Install the required dependencies:
```
pip install -r requirements.txt
```

Usage

To run Agent-WebVoyager for a specific task, execute the following command:

```
python agent_voyage.py --task "Browse Twitter and tell me Musk's most recent tweet."
```

The agent will perform up to 25 steps to navigate through the web and accomplish the task.
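
As an illustration of the command and the step limit described above, the following sketch parses a `--task` argument and runs a loop that stops at the first answer or after 25 steps. Only the `--task` flag and the 25-step cap come from this README; `build_parser`, `run_task`, and the `step` callback are hypothetical names, and the real script may be organized differently.

```python
import argparse
from typing import Callable, Optional

MAX_STEPS = 25  # step cap stated in the usage notes


def build_parser() -> argparse.ArgumentParser:
    """Build a parser that accepts the --task flag shown in the usage command."""
    parser = argparse.ArgumentParser(description="Run a browsing task.")
    parser.add_argument("--task", required=True, help="Natural-language task.")
    return parser


def run_task(
    task: str,
    step: Callable[[str, int], Optional[str]],
    max_steps: int = MAX_STEPS,
) -> Optional[str]:
    """Call step() until it returns an answer or the step cap is reached.

    `step` is a hypothetical callback standing in for one observe-decide-act
    cycle; it returns the final answer, or None to keep going.
    """
    for index in range(1, max_steps + 1):
        answer = step(task, index)
        if answer is not None:
            return answer
    return None  # cap reached without an answer


# Example: parse an argument list and run a stand-in step function.
args = build_parser().parse_args(["--task", "Find the latest post"])
result = run_task(args.task, lambda task, i: "done" if i == 3 else None)
```

A hard cap like this bounds both runtime and model usage when a task cannot be completed.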

Example Task

An example task, "Browse Twitter and tell me Musk's most recent tweet.", demonstrates the agent's ability to perform complex web navigation and information extraction without direct API calls. The agent successfully navigates Twitter, finds Elon Musk's profile, and reports the most recent tweet.

Task Visual Documentation

The process and steps taken by the agent are documented visually in the path history file: path-history/agent_path_(twitter).png. This file illustrates the agent's navigation path, including interactions and key decisions made along the way.
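
The bookkeeping behind such a file could be sketched as follows. This is an assumption-laden illustration: the `Step` and `PathHistory` classes are hypothetical, and image rendering is left out because this README does not say how the PNG is produced. Only the `path-history/agent_path_(...).png` naming pattern is taken from the text above.

```python
from dataclasses import dataclass, field
from pathlib import PurePosixPath


@dataclass
class Step:
    number: int
    action: str
    url: str


@dataclass
class PathHistory:
    """Collects the steps of one run (hypothetical helper, not the repo's code)."""

    label: str
    steps: list[Step] = field(default_factory=list)

    def record(self, action: str, url: str) -> Step:
        """Append a step, numbering from 1 in the order actions were taken."""
        step = Step(len(self.steps) + 1, action, url)
        self.steps.append(step)
        return step

    def output_path(self) -> PurePosixPath:
        """File name following the pattern path-history/agent_path_(label).png."""
        return PurePosixPath("path-history") / f"agent_path_({self.label}).png"

    def captions(self) -> list[str]:
        """One caption per step, suitable for annotating a screenshot."""
        return [f"{s.number}. {s.action} @ {s.url}" for s in self.steps]
```

A renderer could then pair each caption with the screenshot taken at that step and save the combined image to `output_path()`.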

Agent Path Example

Using objective = "Could you go to Google Trends and compare 'p(doom)' to 'e/acc'?"

Note: The example image is a representation. Run the agent to generate current task visual documentation.

Contributing

Contributions to Agent-WebVoyager are welcome. Please feel free to fork the repository, make changes, and submit pull requests. For major changes, please open an issue first to discuss what you would like to change.

License

MIT

Folders and files

path-history
.gitignore
LICENSE
README.md
agent_steps.txt
agent_voyage.py
mark_page.js
requirements.txt
