Monday, April 28, 2025
Vision LLM on Mac Mini M4 Pro: Real-World MLX Performance
I discuss the real-world MLX performance of Sparrow for structured data extraction with public access. The current Sparrow online instance runs on a Mac Mini M4 Pro with 64GB of memory. On average, it processes one page in 100 seconds. I explain why tokens-per-second measurements can be misleading when evaluating structured data extraction.
Tuesday, April 22, 2025
Running Vision Models on Apple Silicon with MLX-VLM
I show and explain how to run Qwen and Mistral vision models on Apple Silicon with MLX-VLM. I share technical tips about how to run both models and show how to pass query prompt.
Tuesday, April 15, 2025
Dashboard with Gradio Python
This video showcases the Sparrow dashboard, where you can view statistics on document data extraction events processed by Sparrow. This elegant dashboard is built with Python using Gradio, a server-side web UI framework.
Subscribe to:
Posts (Atom)