You'll see:
- How to launch separate vLLM servers for different models
- How to split GPU memory between them without running out of VRAM
This approach works when you want to serve several smaller models concurrently on limited hardware; the sketch below shows one way to set it up.
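As a minimal sketch, the commands below launch two OpenAI-compatible vLLM servers on the same GPU, capping each instance at a fraction of VRAM with `--gpu-memory-utilization`. The model names, ports, and memory fractions here are illustrative assumptions; adjust them for your models and hardware.

```bash
# Minimal sketch: two vLLM servers sharing one GPU.
# Model names, ports, and fractions are example values.

# Server 1: cap this instance at ~45% of the GPU's VRAM.
CUDA_VISIBLE_DEVICES=0 vllm serve Qwen/Qwen2.5-1.5B-Instruct \
  --port 8001 \
  --gpu-memory-utilization 0.45 \
  --max-model-len 4096 &

# Server 2: a different model on another port, same GPU.
CUDA_VISIBLE_DEVICES=0 vllm serve meta-llama/Llama-3.2-1B-Instruct \
  --port 8002 \
  --gpu-memory-utilization 0.45 \
  --max-model-len 4096 &

wait
```

The two fractions should sum to comfortably less than 1.0, because vLLM pre-allocates GPU memory (weights plus KV cache) up to the stated utilization at startup; lowering `--max-model-len` further shrinks each instance's KV-cache footprint. Each server exposes an OpenAI-compatible API, so a client simply targets the right port, for example:

```bash
curl http://localhost:8001/v1/completions \
  -H "Content-Type: application/json" \
  -d '{"model": "Qwen/Qwen2.5-1.5B-Instruct", "prompt": "Hello", "max_tokens": 16}'
```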