Synthetic intelligence-based instruments can generate photographs and illustrations from textual content descriptions. However related instruments can do the alternative: flip photos into textual content.
Listed below are six of my favorites.
Accessibility and search engine optimisation
Picture to Textual content. AI’s understanding of photos is new and imperfect. Nonetheless, it’s useful in my expertise.
Picture to Textual content gives brief, AI-powered descriptions of a picture. Add a picture, and the device will describe it. (It’s much less useful for illustrations, nonetheless.) Picture to Textual content presents free and premium variations.

Picture to Textual content gives brief descriptions of a picture, corresponding to “a younger woman sitting at a desk writing on a chunk of paper.”
Gradio’s InkyMM, one other device, gives free detailed descriptions of any picture. It presents two fashions: MPT and Dolly. The latter produced significantly better leads to my testing, even for complicated illustrations.

Gradio’s InkyMM gives detailed descriptions of any picture, corresponding to this portray of two llamas.
Each instruments can create alt textual content, important for visually-handicapped customers and search engine marketing. For search engine optimisation, think about tweaking the textual content with focused key phrases.
Social Media Captions
CaptionIt is a freemium telephone app that creates captions for social media. Add a photograph and select the caption’s model. CaptionIt will then generate captions based mostly on these settings and the picture content material. The device has elevated my productiveness and improved my captions.
CaptionIt’s free model is proscribed. The (a lot) extra sturdy Professional model is $1.99 per thirty days.
Textual content-from-image Extraction
Textual content extraction instruments will not be new. Many accessibility display readers embody them. AI makes these instruments extra correct — for accessibility, search engine optimisation, video scripts, and extra. The device extract textual content from photos, video frames, and presentation slides.
Nanonet’s free text-from-image extraction device can course of any picture as much as 30 MB in seconds. The output is a downloadable textual content file. The device may also extract hand-written textual content however with inconsistent leads to my check. Nanonets additionally presents a free Google Chrome extension.
Google Lens is a cellular app various to Nanonets. It’s constructed into the Google Search app for iPhone and Android. Grant the app entry to your photographs, select a picture, after which navigate Textual content > Choose all > Copy textual content.
For extreme textual content on photos, think about extracting after which pasting it into ChatGPT for a abstract.
Picture-to-text Translation
Google Translate is a well-liked and free web-based device to translate textual content alone or on photos.
Google Translate will detect textual content (typed or handwritten) on any picture and produce that picture translated into the chosen language or as textual content alone.
Translate, like Lens, is constructed into Google’s Search app.
—