Baidu

paddleocr-vl-1.5

PaddleOCR VL 1.5

PaddleOCR VL 1.5 is available after commercial review or workspace setup.

Model detail | Access review | Multimodal Transformer

Params

1B OCR

Context

33K

Max Output

N/A

License

Open

API Surface

/v1/responses multimodal

Access

Access review

Why pick it

  • Access is enabled only after workspace setup or commercial review.
  • Pricing appears only when verified by the live service catalog.

Pricing

Tier      Public            Cached  Note
Realtime  $0.000 / $0.000   $0.000  Current published price
Batch     $0.000 / $0.000   $0.000  Published batch rates appear here when available
Current pricing synced from the BatchIn catalog.

Quick start

This model is not yet open for public testing. Contact the team for the right access path.

Python
import os

from openai import OpenAI

# Point the OpenAI SDK at the BatchIn endpoint and read the key from the environment.
client = OpenAI(
    base_url="https://api.batchin.tech/v1",
    api_key=os.environ["BATCHIN_API_KEY"],
)

# Send one user turn containing a text prompt and an image URL.
resp = client.responses.create(
    model="paddleocr-vl-1.5",
    input=[{
        "role": "user",
        "content": [
            {"type": "input_text", "text": "Describe the uploaded image"},
            {"type": "input_image", "image_url": "https://example.com/sample.png"}
        ]
    }]
)

print(resp.output_text)

JavaScript
import OpenAI from "openai";

const client = new OpenAI({
  baseURL: "https://api.batchin.tech/v1",
  apiKey: process.env.BATCHIN_API_KEY,
});

const resp = await client.responses.create({
  model: "paddleocr-vl-1.5",
  input: [{
    role: "user",
    content: [
      { type: "input_text", text: "Describe the uploaded image" },
      { type: "input_image", image_url: "https://example.com/sample.png" }
    ],
  }],
});

console.log(resp.output_text);
cURL
curl https://api.batchin.tech/v1/responses \
  -H "Authorization: Bearer $BATCHIN_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "paddleocr-vl-1.5",
    "input": [{
      "role": "user",
      "content": [
        {"type":"input_text","text":"Describe the uploaded image"},
        {"type":"input_image","image_url":"https://example.com/sample.png"}
      ]
    }]
  }'
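
The quick-start requests reference an image by public URL. For a scan that exists only on disk, one common approach with OpenAI-compatible endpoints is to embed the image as a base64 data URL. The sketch below builds such a payload. The helper name `build_ocr_request` is hypothetical, and whether the BatchIn endpoint accepts data URLs in `image_url` is an assumption that should be confirmed once access is granted.

```python
import base64
import mimetypes


def build_ocr_request(image_bytes: bytes, filename: str, prompt: str) -> dict:
    """Build a /v1/responses payload that embeds a local image as a data URL.

    Hypothetical helper: data-URL support on this endpoint is assumed, not confirmed.
    """
    # Infer the MIME type from the file extension, e.g. "scan.png" -> "image/png".
    mime_type, _ = mimetypes.guess_type(filename)
    if mime_type is None or not mime_type.startswith("image/"):
        raise ValueError(f"not a recognized image file: {filename}")

    # Base64-encode the raw bytes so they can travel inside the JSON body.
    encoded = base64.b64encode(image_bytes).decode("ascii")

    return {
        "model": "paddleocr-vl-1.5",
        "input": [{
            "role": "user",
            "content": [
                {"type": "input_text", "text": prompt},
                {"type": "input_image",
                 "image_url": f"data:{mime_type};base64,{encoded}"},
            ],
        }],
    }
```

With the `client` from the Python quick start, the result could be sent as `client.responses.create(**build_ocr_request(data, "scan.png", "Extract the text"))`, where `data` holds the file's bytes.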

Specs

Architecture

Multimodal Transformer

Vendor

Baidu

Context window

33K

Max output

N/A

Best for

baidu
request-access
