Andrej Baranovskij Blog
Blog about Oracle, Full Stack, Machine Learning and Cloud
Wednesday, June 10, 2026
Gemma 4 12B vs Ministral 14B: Who Wins at Structured Table Extraction?
›
Head-to-head test: Gemma 4 12B vs Ministral 14B on structured table extraction. In this video, I run a head-to-head test: Gemma 4 12B (8-bit...
Monday, June 1, 2026
Building Agentic AI Pipelines for Document Analysis
›
In this video, I show how to build a local agentic AI pipeline using Sparrow to extract and analyze data from financial documents. The age...
Monday, May 18, 2026
Instruction-Based Data Analysis with Sparrow and Local LLM
›
In this video, I show how to use Sparrow instruction processing pipeline to analyze a bond portfolio JSON extracted from a financial documen...
Monday, May 11, 2026
Smart Document Extraction with Business Rules — Gemma vs Qwen vs Ministral
›
In this video I show how Sparrow hints work — a powerful feature that goes beyond simple field extraction. Using a bank bonds portfolio docu...
Monday, May 4, 2026
Large Table Extraction to JSON with dots.ocr — No Vision LLM Hallucinations
›
Sparrow now supports a dedicated table mode for extracting large, complex tables into structured JSON — without Vision LLM hallucinations. ...
Monday, April 27, 2026
MoE vs Dense Models for Structured Data Extraction — Who Wins?
›
MoE or Dense — which model architecture wins for structured data extraction from documents? It depends on document complexity. In this vide...
Tuesday, April 21, 2026
Gemma 4 for Structured Data Extraction: Can It Beat Qwen 3.5?
›
In this video, I put Gemma 4 to the test on a real-world task — extracting structured data from bank statements — and benchmark it head-to-h...
‹
›
Home
View web version