Category: GenAI
-
Multimodal LLM vs. Multimodal mom said we have at home.
In a world where AI is evolving into increasingly versatile multimodal systems, I decided to build a tool that brings together state-of-the-art object detection, OCR, and language generation into one cohesive workflow. Inspired by my mom’s extreme positivity when we couldn’t afford to dine outside as kids, I combined YOLOv8 with pytesseract and a Llama-based…
-
Visual-based Web Scraping: Using power of multimodal LLMs to Dynamic Web Content Extraction
With LLMs and vision models becoming more accessible than ever, I started exploring the intersection of web scraping and AI. As a test case, I recently wrote a code leveraging LLAMA vision to scrape the content of a webpage but using the multi-modality of LLM; we can call it Visual-based WebScraping. Instead of traditional HTML…
-
Ever tried turning an open-source LLM into a personal paparazzi?
Ever tried turning an open-source LLM into a personal paparazzi? I did—purely as a proof-of-concept—and it’s both fascinating and a little alarming. I wrote a simple script that waits for Outlook to open, takes a screenshot, and instantly sends it off via Telegram. Yes. Classic programming could do the same thing, sure, but here’s the…
-
GenAI Projects: Bridging the Gap Between Feasibility, Reliability And Suitability
GenAI is a term that refers to the use of large language models, such as GPT-4, to generate natural language content for various purposes. GenAI projects are becoming more popular and promising, as they can potentially offer solutions for many domains and tasks. But GenAI projects are not as straightforward as they may seem. They…
-
Don’t Fall for These 5 Anti-Patterns in GenAI Research and Development
Have you ever wondered what makes a successful GenAI project? Is it the model, the data, the tech stack, or something else? We have been working on various GenAI projects already and along this journey, we have discovered that the most important factor for GenAI’s success is often overlooked – the team crafting solutions. They…