PaddleOCR VL 1.5

PaddleOCR VL 1.5 is available after commercial review or workspace setup.

Model detailAccess reviewMultimodal Transformer

Params

1B OCR

Context

33K

Max Output

N/A

License

Open

API Surface

/v1/responses multimodal

Access

Access review

Why pick it

Access is enabled only after workspace setup or commercial review.
Pricing appears only when verified by the live service catalog.

Pricing

TierPublicCachedNote

Realtime$0.000 / $0.000$0.000Current published price

Batch$0.000 / $0.000$0.000Published batch rates appear here when available

Current pricing synced from the BatchIn catalog.

Quick start

This model is not yet open for public testing. Contact the team for the right access path.

Contact team Open pricing

Python

from openai import OpenAI

client = OpenAI(
    base_url="https://api.batchin.tech/v1",
    api_key="BATCHIN_API_KEY"
)

resp = client.responses.create(
    model="paddleocr-vl-1.5",
    input=[{
        "role": "user",
        "content": [
            {"type": "input_text", "text": "Describe the uploaded image"},
            {"type": "input_image", "image_url": "https://example.com/sample.png"}
        ]
    }]
)

print(resp.output_text)

JavaScript

import OpenAI from "openai";

const client = new OpenAI({
  baseURL: "https://api.batchin.tech/v1",
  apiKey: process.env.BATCHIN_API_KEY,
});

const resp = await client.responses.create({
  model: "paddleocr-vl-1.5",
  input: [{
    role: "user",
    content: [
      { type: "input_text", text: "Describe the uploaded image" },
      { type: "input_image", image_url: "https://example.com/sample.png" }
    ],
  }],
});

console.log(resp.output_text);

cURL

curl https://api.batchin.tech/v1/responses \  -H "Authorization: Bearer ***" \  -H "Content-Type: application/json" \  -d '{
    "model": "paddleocr-vl-1.5",
    "input": [{
      "role": "user",
      "content": [
        {"type":"input_text","text":"Describe the uploaded image"},
        {"type":"input_image","image_url":"https://example.com/sample.png"}
      ]
    }]
  }'

Specs

Architecture

Multimodal Transformer

Vendor

Baidu

Context window

33K

Max output

N/A

Best for

baidu

request-access

Related models

Back to model center