Andrej Baranovskij Blog
Blog about Oracle, Full Stack, Machine Learning and Cloud
Monday, November 25, 2024
Batch Inference with Qwen2 Vision LLM (Sparrow)
In this post, I share several tips for optimizing Qwen2 Vision LLM performance for batch processing.