A versatile Python-based tool for web interaction, Selenium automation, and AI-assisted task execution.
- 🤖 AI-powered command generation and execution
- 🌐 Web browsing and content retrieval
- 📷 Website screenshot capture
- 🔧 Custom Selenium code generation and execution
- 🚀 Interactive session with AI model
- 📊 Performance measurement for AI models
- Python 3.12
- Poetry (for dependency management)
- Chrome browser
- FFmpeg (for video clipping)
Key dependencies include:
- selenium==4.25.0
- requests==2.32.3
- rich==13.8.1
- pillow==10.4.0
- validators==0.34.0
- webdriver-manager==4.0.2
For a full list of dependencies, see the pyproject.toml file.
- Clone this repository
- Ensure you have Python 3.12 and Poetry installed
- Install dependencies using Poetry:
poetry install
- Ensure Chrome and ChromeDriver are installed
- Install FFmpeg for video clipping functionality
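Before running the tool, it can help to confirm that the prerequisites above are actually reachable. The following is a minimal sketch, not part of this repository: the executable names in `REQUIRED_TOOLS` are assumptions and vary by platform (on macOS and Windows, Chrome is usually not on `PATH`, so a missing `chrome` entry is not conclusive).

```python
import shutil
import sys

# Candidate executable names are assumptions; adjust them for your platform.
REQUIRED_TOOLS = {
    "poetry": ["poetry"],
    "chrome": ["google-chrome", "google-chrome-stable", "chromium", "chromium-browser", "chrome"],
    "chromedriver": ["chromedriver"],
    "ffmpeg": ["ffmpeg"],
}


def find_tools(tools=REQUIRED_TOOLS):
    """Return {tool: path or None}, using the first candidate found on PATH."""
    found = {}
    for name, candidates in tools.items():
        paths = (shutil.which(candidate) for candidate in candidates)
        found[name] = next((path for path in paths if path), None)
    return found


def python_ok(minimum=(3, 12)):
    """Check that the running interpreter meets the minimum (major, minor) version."""
    return sys.version_info[:2] >= minimum


if __name__ == "__main__":
    print(f"Python >= 3.12: {'yes' if python_ok() else 'no'}")
    for tool, path in find_tools().items():
        print(f"{tool}: {path or 'not found on PATH'}")
```

A tool reported as "not found on PATH" may still be installed elsewhere, so treat the output as a hint rather than a verdict.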
Run the script in interactive mode:
poetry run python main.py -i --model llama3.2:3b
Available commands in interactive mode:
- /cmd <prompt>: Web interaction or custom Selenium code
- /screenshot: Take a screenshot of the last visited URL
- /selenium <task>: Generate custom Selenium code
- /clip <input_file> <output_file> <start_time> <duration>: Clip a video
- exit: Exit the session
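To illustrate the shape of these commands, here is a rough sketch of how a session line might be split into a command and its arguments, and how the clip arguments could map onto an FFmpeg call. `parse_command` and `build_clip_command` are hypothetical helpers, not functions from this project, and the way the tool actually invokes FFmpeg is an assumption. The `-ss`, `-i`, `-t`, and `-c copy` options themselves are standard FFmpeg options.

```python
import shlex


def parse_command(line):
    """Split an interactive-session line into (command, args). Hypothetical helper."""
    line = line.strip()
    if line == "exit":
        return ("exit", [])
    name, _, rest = line.partition(" ")
    rest = rest.strip()
    if name in ("/cmd", "/selenium"):
        # The prompt or task is free-form text, so keep it as a single argument.
        if not rest:
            raise ValueError(f"{name} needs a prompt or task")
        return (name, [rest])
    if name == "/screenshot":
        return (name, [])
    if name == "/clip":
        # shlex.split lets file names with spaces be quoted.
        args = shlex.split(rest)
        if len(args) != 4:
            raise ValueError("usage: /clip <input_file> <output_file> <start_time> <duration>")
        return (name, args)
    raise ValueError(f"unknown command: {name}")


def build_clip_command(input_file, output_file, start_time, duration):
    """Build an FFmpeg argument list (assumed invocation, using stream copy)."""
    return [
        "ffmpeg",
        "-ss", start_time,   # seek to the start position in the input
        "-i", input_file,
        "-t", duration,      # limit the output duration
        "-c", "copy",        # copy streams without re-encoding
        output_file,
    ]
```

For example, `parse_command("/clip in.mp4 out.mp4 00:00:05 10")` yields the four clip arguments, which can then be passed to `build_clip_command` and on to `subprocess.run`. Because stream copy cuts on keyframes, the clip boundaries may be slightly off; dropping `-c copy` re-encodes for frame-accurate cuts.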
Customize system prompts:
poetry run python main.py -i --model llama3.2:3b --system-prompt-prefix "Your prefix here" --system-prompt-suffix "Your suffix here"
Measure AI model performance:
poetry run python main.py --model llama3.2:3b --prompt "Your prompt here"
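This README does not say which metrics the tool reports. As a minimal sketch of what such a measurement could look like, the snippet below times repeated calls to a model and summarizes wall-clock latency. `measure` and `fake_generate` are hypothetical names; `fake_generate` is a stand-in that only simulates a model call and does not contact any real backend.

```python
import time


def measure(generate, prompt, runs=3):
    """Time generate(prompt) over several runs and summarize wall-clock latency."""
    if runs < 1:
        raise ValueError("runs must be at least 1")
    timings = []
    output = ""
    for _ in range(runs):
        start = time.perf_counter()
        output = generate(prompt)
        timings.append(time.perf_counter() - start)
    return {
        "runs": runs,
        "min_s": min(timings),
        "mean_s": sum(timings) / len(timings),
        "max_s": max(timings),
        "output_chars": len(output),  # length of the last response
    }


def fake_generate(prompt):
    """Stand-in for a real model call; sleeps briefly and echoes the prompt."""
    time.sleep(0.01)
    return prompt.upper()


if __name__ == "__main__":
    print(measure(fake_generate, "Your prompt here"))
```

Replacing `fake_generate` with a function that calls your actual model gives comparable numbers across models, as long as the prompt and run count stay fixed.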
Contributions are welcome! Please feel free to submit a Pull Request.
MIT-ish