Not Total Recall (1990) (image, lemmy.world)
ordellrb@lemmy.world to linuxmemes@lemmy.world · 8 months ago (edited) · 64 comments · 824 upvotes, 10 downvotes
MacN'Cheezus@lemmy.today · 8 months ago · 3 upvotes
Llava and Bakllava are two Ollama models that can not only extract text but also describe what's happening on screen. Using tesseract-ocr, as the other guy suggested, is probably simpler and less resource-intensive, though.
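For anyone who wants to try both routes, here is a rough Python sketch, not a tested setup. It assumes the `tesseract` binary is on your PATH and that an Ollama server is running on its default local port with the `llava` model already pulled. The function names, the default prompt, and `shot.png` are made up for illustration.

```python
import base64
import json
import subprocess
import urllib.request

# Assumption: Ollama is listening on its default local address.
OLLAMA_URL = "http://localhost:11434/api/generate"


def tesseract_command(image_path, lang="eng"):
    """Build the tesseract CLI call; 'stdout' prints the text instead of writing a file."""
    return ["tesseract", image_path, "stdout", "-l", lang]


def ocr_with_tesseract(image_path, lang="eng"):
    """Plain OCR: cheap and fast, but it only returns the text it finds."""
    result = subprocess.run(
        tesseract_command(image_path, lang),
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout.strip()


def llava_payload(image_bytes, prompt, model="llava"):
    """Build the JSON body for Ollama's generate endpoint (images go in as base64)."""
    return {
        "model": model,
        "prompt": prompt,
        "images": [base64.b64encode(image_bytes).decode("ascii")],
        "stream": False,  # ask for one JSON reply instead of a stream of chunks
    }


def describe_with_llava(image_path, prompt="Describe what is happening on this screen."):
    """Vision model: heavier, but it can describe the screenshot as well as read it."""
    with open(image_path, "rb") as f:
        payload = llava_payload(f.read(), prompt)
    request = urllib.request.Request(
        OLLAMA_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read())["response"]
```

Usage would be something like `ocr_with_tesseract("shot.png")` for the cheap route and `describe_with_llava("shot.png")` for the heavy one. Swapping `model="bakllava"` into `llava_payload` should work the same way if you have that model pulled instead.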