CORD 한국 영수증 벤치에서 Claude Opus 2배 차이로 추월 / Beating Claude Opus 2× on CORD

CORD(Consolidated Receipt Dataset)는 한국 영수증 OCR의 표준 벤치마크예요. 다양한 한국 식당·편의점·카페 영수증을 정확히 추출하는 능력을 측정해요. 최근 측정 결과를 공개해요.

모델	CORD Score	출시
Corepin OCR	59.4%	2026.04
Claude Opus 4.7	28.2%	2026.02
Gemini 3 Pro	24.6%	2026.03
GPT-5.2	16.8%	2026.03

같은 측정 환경 (동일 이미지 셋, 동일 후처리). 우리가 1위, Claude Opus와 2.1배 차이.

왜 글로벌 모델은 30% 근처에서 멈추나

한국 영수증의 분포를 분석해보면 답이 나와요.

색: 단청 색 (붉은 적·청·녹 + 인쇄잉크 황) 이 50% 이상
글리프 폭: 영문 평균 폭의 0.6배 (좁은 명조체)
한자어 비율: "부가세", "면세합계", "합계금액" 등 약 35%
인쇄 품질: 영수증 용지 영구 잔열로 30% 케이스에서 글자가 부분 손실

이 네 가지가 합쳐지면 영문 학습 모델에게는 완벽한 OOD 분포예요. 영문 영수증이 99% 정확해도 한국 영수증에서 30% 이하로 떨어지는 게 이상하지 않아요.

Corepin OCR이 어떻게 다른가

외부에 자세한 학습 방법을 공개하지는 않지만, 큰 그림은 단순해요. 한국 영수증 데이터로만 처음부터 학습했어요. 8년 동안 우리 팀이 직접 라벨링한 한국 영수증 100만 장이 학습 데이터예요. 동일 영수증의 다양한 조명·각도·접힘 변형도 augmentation으로 포함.

그리고 한 가지 핵심 결정 — cascade 라우팅 X, 단일 모델이에요. 처음 v1에서는 빠른 모델 + 정확한 모델 cascade를 시도했는데, "여러 단계가 합쳐지면 오히려 reading order가 망가져요. 그래서 v3부터는 하나의 모델이 끝까지 책임지는 구조예요.

다른 한국 OCR 벤치에서도

OmniDocBench v1.5 종합 점수에서도 94.62점으로 Gemini 3 Pro·Qwen3-VL·GPT-5.2 모두 추월했어요. CC-OCR Korean에서도 50.5%로 글로벌 모델들과 경쟁권. 한 벤치만 잘하는 게 아니라 한국어 OCR 전반에서 1등이에요.

가격

이게 마지막 핵심이에요. Claude API로 같은 영수증 1장을 OCR 하면 ₩45~80이 들어요. Corepin OCR은 페이지당 ₩2 (배치 시). 20~40배 차이. 정확도는 2배 좋고, 비용은 1/20 — 이게 SLM 시대의 답이에요.

같은 벤치를 우리 회사 영수증으로 측정하고 싶으면 데모 요청해주세요. 30분 안에 결과 보여드려요.

CORD (Consolidated Receipt Dataset) is the standard benchmark for Korean receipt OCR. It measures extraction accuracy across diverse Korean restaurant, convenience-store, and cafe receipts. Recent results:

Model	CORD Score	Released
Corepin OCR	59.4%	2026.04
Claude Opus 4.7	28.2%	2026.02
Gemini 3 Pro	24.6%	2026.03
GPT-5.2	16.8%	2026.03

Same image set, same post-processing. We're first, 2.1× ahead of Claude Opus.

Why global models stall around 30%

The answer is in the distribution.

Color: Joseon-era palette (red, blue, green + yellow ink) makes 50%+
Glyph width: 0.6× the average English width (narrow Myeongjo)
Hanja share: "부가세", "면세합계", "합계금액" etc. ~35%
Print quality: Thermal receipt paper fades — 30% of cases lose partial characters

Combine those four and you get a perfect OOD distribution for English-trained models. English receipts at 99% accuracy can fall below 30% on Korean ones — entirely expected.

What Corepin OCR does differently

We don't publicly disclose detailed training methods, but the picture is simple. Trained from scratch on Korean receipts only. 1M Korean receipts our team labeled over 8 years. Includes augmentation for varied lighting, angles, and folds of the same receipt.

One core decision: no cascade, single model end-to-end. v1 tried a fast + accurate cascade, but multiple stages destroyed reading order. From v3, one model owns the full pipeline.

Across other Korean OCR benchmarks

OmniDocBench v1.5 composite: 94.62, ahead of Gemini 3 Pro, Qwen3-VL, GPT-5.2. CC-OCR Korean: 50.5%, competitive with frontier. Not a single-benchmark win — first across the Korean OCR landscape.

Pricing

The final point. One receipt OCR'd via Claude API costs ₩45–80. Corepin OCR: ₩2 per page (batch). 20–40× cheaper. 2× more accurate, 1/20 the cost — that's the SLM era answer.

Want to run the same benchmark on your company's receipts? Book a demo. We'll have results in 30 minutes.

CORD 한국 영수증에서 Claude Opus를 2배 차이로 추월. Beating Claude Opus by 2× on CORD Korean receipts.