Andrej Baranovskij Blog
Blog about Oracle, Full Stack, Machine Learning and Cloud
Sunday, July 21, 2024
Invoice Table Detection with Table Transformer
I show how Table Transformer, an open-source model from Microsoft for table detection and structure recognition, works. The code is integrated into Sparrow Parse and runs on a local CPU. The approach first crops the table area and then gets the coordinates of the table cells. Each cell can then be cropped and its text extracted with OCR. This retains the original table structure, so the result can be reported in JSON or CSV format. Data extraction is not covered in this video; it will be the topic of the next one.
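A minimal sketch of the cell-coordinate step, assuming the structure recognition pass has already produced bounding boxes for table rows and table columns as `(x_min, y_min, x_max, y_max)` tuples. The function names `cell_grid`, `grid_to_records`, and `records_to_csv` are hypothetical and are not part of Sparrow Parse; the OCR step is left out, so the cell texts are taken as given.

```python
import csv
import io

# Hypothetical box format: (x_min, y_min, x_max, y_max) in pixels.
Box = tuple


def intersect(a: Box, b: Box) -> Box:
    """Return the overlapping rectangle of two boxes."""
    return (max(a[0], b[0]), max(a[1], b[1]), min(a[2], b[2]), min(a[3], b[3]))


def cell_grid(row_boxes: list, col_boxes: list) -> list:
    """Intersect every row box with every column box, in reading order."""
    rows = sorted(row_boxes, key=lambda b: b[1])  # top to bottom
    cols = sorted(col_boxes, key=lambda b: b[0])  # left to right
    return [[intersect(r, c) for c in cols] for r in rows]


def grid_to_records(texts: list) -> list:
    """Treat the first row of cell texts as the header and map later rows onto it."""
    header, *body = texts
    return [dict(zip(header, row)) for row in body]


def records_to_csv(records: list) -> str:
    """Serialize the records to CSV text, keeping the header order."""
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=list(records[0].keys()), lineterminator="\n")
    writer.writeheader()
    writer.writerows(records)
    return buf.getvalue()
```

Each box in the grid could be passed to an image crop call and then to OCR; once the text for every cell is collected in the same row and column order, `grid_to_records` and `records_to_csv` (or `json.dumps`) give the JSON or CSV output while preserving the table layout.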
Sunday, July 14, 2024
Sparrow OCR Service with PaddleOCR
In this video, I demonstrate the latest updates to the Sparrow OCR Service using PaddleOCR. I walk you through the OCR service workflow in Sparrow, showcasing its integration with FastAPI and highlighting the enhanced functionalities brought by the recent PaddleOCR update. Join me to see how you can leverage these powerful tools for efficient OCR processing!
Wednesday, July 3, 2024
FastAPI Endpoint for Sparrow LLM Agent
I show how a FastAPI endpoint is used in Sparrow to run LLM agent functionality from an API client.
Sunday, June 23, 2024
Sparrow Parse API for PDF Invoice Data Extraction
I explain how the Sparrow Parse API is integrated into Sparrow for data extraction from PDF documents such as invoices and receipts.
Monday, June 17, 2024
Avoid LLM Hallucinations: Use Sparrow Parse for Tabular PDF Data, Instructor LLM for Forms
LLMs tend to hallucinate and produce incorrect results when extracting table data. For this reason, Sparrow combines two approaches within the same document: Instructor structured output for the LLM to query form data, and Sparrow Parse to process tabular data.
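The combined approach can be sketched roughly as follows. This is an illustration only, not Sparrow's actual code: `extract_document`, `extract_form`, and `extract_tables` are hypothetical names, and the two extractors are passed in as plain callables so that the LLM-based form query and the parser-based table reader stay independent of each other.

```python
from typing import Callable


def extract_document(
    path: str,
    extract_form: Callable[[str], dict],
    extract_tables: Callable[[str], list],
) -> dict:
    """Run both extractors on the same document and merge their results.

    extract_form   -- hypothetical stand-in for the LLM structured-output query
    extract_tables -- hypothetical stand-in for the non-LLM table parser
    """
    fields = extract_form(path)    # form fields, e.g. invoice number, dates
    tables = extract_tables(path)  # table rows, read without an LLM
    return {"source": path, "fields": fields, "tables": tables}
```

Keeping the table rows out of the LLM call means the model is asked only for the form fields, which is where the structured output helps most.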
Monday, June 10, 2024
Effective Table Data Extraction from PDF without LLM
Sparrow Parse helps to read tabular data from PDFs, relying on various libraries, such as Unstructured or PyMuPDF4LLM. This allows us to avoid data hallucination errors often produced by LLMs when processing complex data structures.
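As a rough illustration of table extraction without an LLM, the sketch below parses a Markdown pipe table into a list of row dictionaries. It assumes the Markdown text was produced earlier by a PDF-to-Markdown step (for example `pymupdf4llm.to_markdown("invoice.pdf")`) and that tables appear as GitHub-style pipe tables. The function `parse_markdown_table` is a hypothetical helper, not part of Sparrow Parse or PyMuPDF4LLM.

```python
def split_row(line: str) -> list:
    """Split one pipe-delimited table line into stripped cell strings."""
    inner = line.strip().removeprefix("|").removesuffix("|")
    return [cell.strip() for cell in inner.split("|")]


def parse_markdown_table(markdown: str) -> list:
    """Parse the first pipe table found in a Markdown string into row dicts."""
    table = []
    for line in (ln.strip() for ln in markdown.splitlines()):
        if line.startswith("|"):
            table.append(line)
        elif table:
            break  # the first table has ended
    if len(table) < 2:
        return []
    header = split_row(table[0])
    body = table[1:]
    # Skip the |---|---| separator row if it is present.
    if body and set(body[0]) <= set("-:| "):
        body = body[1:]
    return [dict(zip(header, split_row(line))) for line in body]
```

Because the values are copied from the parsed text rather than generated, a wrong cell is a parsing error that can be traced back to the source, not a hallucinated value.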
Monday, June 3, 2024
Instructor and Ollama for Invoice Data Extraction in Sparrow [LLM, JSON]
Structured output extraction from an invoice document with a local LLM works well with Instructor and Ollama.
Labels: Instructor, LLM, Python