GitHub - Skyvern-AI/skyvern: Automate browser-based workflows with LLMs and Computer Vision (github.com)
Skyvern is a tool that automates browser-based workflows using large language models (LLMs) and computer vision. Rather than relying on predefined selectors such as XPath, it uses visual elements and LLMs to navigate and interact with websites in real time. This makes Skyvern adaptable to websites it has not seen before and resistant to layout changes, which improves the reliability of browser-based automation.
Main Points
- Introduction to Skyvern: Skyvern automates browser-based workflows using LLMs and computer vision, and exposes a simple API endpoint for fully automating manual workflows.
- How Skyvern operates: Skyvern uses computer vision and LLMs to parse the items in the viewport in real time, builds a plan for interacting with them, and then carries out that plan.
- About Skyvern Cloud: Skyvern offers a managed cloud version that can run multiple instances in parallel and adds features such as anti-bot detection and CAPTCHA solving.
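
The parse, plan, interact cycle described under "How Skyvern operates" can be pictured as a simple loop. The sketch below is only an illustration of that idea and is not Skyvern's code or API: `Element`, `Action`, `run_task`, and `demo` are hypothetical names, the "page" is a plain dictionary, and the planner is a fixed rule standing in for an LLM call.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass
class Element:
    """One item seen in the viewport (hypothetical structure)."""
    element_id: int
    description: str  # e.g. "button: Submit"


@dataclass
class Action:
    """One planned step: "type", "click", or "done" (hypothetical structure)."""
    kind: str
    element_id: Optional[int] = None
    text: Optional[str] = None


def run_task(
    goal: str,
    observe: Callable[[], List[Element]],
    plan: Callable[[str, List[Element], List[Action]], Action],
    act: Callable[[Action], None],
    max_steps: int = 10,
) -> List[Action]:
    """Repeat observe -> plan -> act until the planner reports "done"."""
    history: List[Action] = []
    for _ in range(max_steps):
        elements = observe()                    # parse what is currently visible
        action = plan(goal, elements, history)  # decide the next interaction
        if action.kind == "done":
            break
        act(action)                             # carry out the interaction
        history.append(action)
    return history


def demo():
    """Run the loop against a toy in-memory form instead of a real browser."""
    page = {"email": "", "submitted": False}

    def observe() -> List[Element]:
        return [
            Element(0, f"input: email (value={page['email']!r})"),
            Element(1, "button: Submit"),
        ]

    def plan(goal: str, elements: List[Element], history: List[Action]) -> Action:
        # Stand-in for the LLM: a fixed rule, not a model call.
        if not page["email"]:
            return Action("type", 0, "user@example.com")
        if not page["submitted"]:
            return Action("click", 1)
        return Action("done")

    def act(action: Action) -> None:
        if action.kind == "type":
            page["email"] = action.text
        elif action.kind == "click":
            page["submitted"] = True

    return run_task("submit the form", observe, plan, act), page
```

Because the loop re-observes the page before every step and never stores a selector, a layout change only affects what `observe` reports, which is the property the summary attributes to Skyvern's approach.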