CatchTheTornado/text-extract-api

Overview

A document extraction and parsing API for PDF, Word, PPTX, and other formats, built on state-of-the-art OCR engines and Ollama-supported models. It can anonymize documents, remove PII, and convert any document or picture to structured JSON or Markdown.

Repository information

Stars	★ 2,987
Forks	252
Language	Python
License	MIT
Created	2024/10/23
Last updated	2025/12/9
Issues	47

Topics

anonymization, api, extract, json, llm, ocr, ocr-python, pdf, pii
