Category: Multimodal-LLM
Visual-based Web Scraping: Using the Power of Multimodal LLMs for Dynamic Web Content Extraction
With LLMs and vision models more accessible than ever, I started exploring the intersection of web scraping and AI. As a test case, I recently wrote some code that uses LLAMA vision to scrape the content of a webpage by relying on the LLM's multimodality; we can call it Visual-based Web Scraping. Instead of traditional HTML…