Skip to main content

OCR Inference API

POSThttps://ocrdoc.infer.nt-ai.cloud/predict
Header
    X-API-Key string required

    Your API key

form-data body

Request Body
  • files Filerequired

    Image or document raw files in a form of multi-part form data using the key name files.

optional

Send with the form of multi-part form data

    box_threshold float

    Default value: 0.4

    Adjusting the box_threshold value, ranged between 0 to 1, affects the detection of text in documents. A lower value allows the model to detect more bounding boxes, while a higher value reduces detection sensitivity. It is recommended to start with the default value of 0.4 and gradually increment the value by 0.1 until achieving the desired result for the document being used. (Number between 0 - 1)

Responses


Schema
  • Array [
  • object
    filename string

    File name

    status string

    success | failed

    Status of request

    result Array [
    object
    page number

    The page number corresponding to the retrieved text.

    full_text string

    The full text content of the specified page. This includes all text present on the page, with newline characters (\n) representing line breaks.

    data Array [
    object
    bbox: [[x1, y1], [x2, y2], [x3, y3], [x4, y4]]

    4-pixel coordinate x, y of text box.

    text string

    The extracted text in each box.

    ]
  • ]
Loading...