I discuss the real-world MLX performance of Sparrow for structured data extraction with public access. The current Sparrow online instance runs on a Mac Mini M4 Pro with 64GB of memory. On average, it processes one page in 100 seconds. I explain why tokens-per-second measurements can be misleading when evaluating structured data extraction.
No comments:
Post a Comment