Vision Models

Overview

Our API lets you use machine learning models for tasks that require visual capabilities. We refer to these as vision models. The API offers two categories of vision models: OCR and OFR.

OCR: Optical Character Recognition

With OCR, you can analyze a document and extract its text, along with other characters and symbols. OCR can detect:

  • Text

  • Paragraph blocks

  • Handwriting

  • Text inside PDF/TIFF files
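As a rough illustration of how the OCR options above might map onto a request, here is a minimal sketch. The model identifier "ocr", the field names, and the helper build_ocr_request are all hypothetical and are not taken from the API reference; only the detection targets (text, paragraph blocks, handwriting, PDF/TIFF files) come from the list above.

```python
from pathlib import Path

# File types the list above names as document containers (PDF/TIFF).
MULTIPAGE_SUFFIXES = {".pdf", ".tif", ".tiff"}


def build_ocr_request(filename: str, handwriting: bool = False) -> dict:
    """Build a hypothetical OCR request body for a single file.

    All keys and values here are illustrative assumptions, not the real API schema.
    """
    suffix = Path(filename).suffix.lower()
    detect = ["text", "paragraph_blocks"]
    if handwriting:
        detect.append("handwriting")
    return {
        "model": "ocr",                              # assumed model identifier
        "file": filename,                            # assumed field name
        "multipage": suffix in MULTIPAGE_SUFFIXES,   # PDF/TIFF input
        "detect": detect,                            # assumed field name
    }


if __name__ == "__main__":
    print(build_ocr_request("invoice.pdf"))
    print(build_ocr_request("note.png", handwriting=True))
```

Check the API reference for the actual request format before relying on any of these names.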


OFR: Optical Feature Recognition

Unlike OCR, OFR lets you analyze images as well as documents. You can narrow down exactly what you want to find in an image by choosing which features to detect:

  • Crop hints

  • Faces

  • Image properties

  • Labels

  • Landmarks

  • Logos

  • Multiple objects

  • Explicit content

  • Web entities and pages

  • And many more
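Selecting features from the list above could look something like the following sketch. The identifiers on the right-hand side of OFR_FEATURES, the model identifier "ofr", the field names, and the helper build_ofr_request are hypothetical; only the feature names themselves come from the list above.

```python
# Feature names from the list above, mapped to assumed request identifiers.
OFR_FEATURES = {
    "crop hints": "crop_hints",
    "faces": "faces",
    "image properties": "image_properties",
    "labels": "labels",
    "landmarks": "landmarks",
    "logos": "logos",
    "multiple objects": "objects",
    "explicit content": "explicit_content",
    "web entities and pages": "web",
}


def build_ofr_request(image: str, features: list[str]) -> dict:
    """Build a hypothetical OFR request body that asks only for the given features.

    Raises ValueError for a feature name that is not in OFR_FEATURES.
    """
    unknown = [f for f in features if f.lower() not in OFR_FEATURES]
    if unknown:
        raise ValueError(f"Unknown OFR features: {unknown}")
    return {
        "model": "ofr",   # assumed model identifier
        "image": image,   # assumed field name
        "features": [OFR_FEATURES[f.lower()] for f in features],
    }


if __name__ == "__main__":
    print(build_ofr_request("storefront.jpg", ["Logos", "Labels"]))
```

Requesting only the features you need keeps the intent of the call explicit; whether the real API accepts a feature list in this form is an assumption to verify against the API reference.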

