• 31 Posts
  • 671 Comments
Joined 3 years ago
cake
Cake day: July 13th, 2023

help-circle



  • Using a LLM for detecting a specific object on an image is possible but stupid: if your object is always the same (like in this case) it’s several orders of magnitude cheaper to train once on that specific object then use the computer vision model running directly on the local server that’s recording the video.

    Otherwise:

    1. the api costs would be colossal, 0.001$ per each image, at 30 fps it’s $100 per hour, nobody would pay that
    2. The detection latency would be several seconds vs almost instant
    3. Without internet connection the system wouldn’t work

    Use cases for LLM-based image recognition is if the object changes at every request or it’s ultra specific with brands and colors