Monday, March 31, 2025
Extract Structured Data from Documents with Sparrow (Free Tier Available)
I built Sparrow for document data extraction 🚀
It's fully open-source and runs locally on your machine
You can extract structured data from any document using powerful Mistral 24B 8bit and Qwen 2.5 72B 4bit models
It's free to try with no registration (3 calls per 6 hours, max 3-page documents) and doesn't send your documents to third parties
Tuesday, March 25, 2025
Oracle DB 23ai Free Connection Pool in Python
I describe how to connect to Oracle DB from Python. I explain why DB connection pool is important for better performance. Connection is done through thin oracledb mode, without installing Oracle Client.
Monday, March 17, 2025
Temporary Files Cleaner for Gradio Web App
Learn how to implement an automatic temporary file cleanup solution for Gradio web applications. This tutorial shows you how to prevent disk space issues by periodically removing old upload files and folders that Gradio leaves behind. Perfect for developers who deploy Gradio apps in production environments or run memory-intensive applications.
Wednesday, March 12, 2025
Building AI Agent for Local Structured JSON Output
I explain key steps of building AI agent to process document and extract structured JSON data locally. I'm running it with Sparrow and using Qwen VL model for vision processing backend and OCR. The steps are explained with Sparrow code walkthrough.
Monday, March 3, 2025
Querying Non Existing Fields with Qwen2.5 Vision LLM
I describe how Sparrow helps to query non existing fields with Qwen2.5 Vision LLM. Running it locally with MLX and MLX-VLM.
Subscribe to:
Posts (Atom)