pdf-processing
MervinPraison/PraisonAI
SKILL.md
name: pdf-processing
description: Process and extract information from PDF documents. Use this skill when the user asks to read, analyze, or extract data from PDF files.
license: Apache-2.0
compatibility: Works with PraisonAI Agents
metadata:
  author: praisonai
  version: "1.0"
PDF Processing Skill
Overview
This skill enables agents to process PDF documents, extract text content, and analyze the information within them.
When to Use
Activate this skill when:
- User asks to read or analyze a PDF file
- User needs to extract text from a PDF
- User wants to summarize a PDF document
- User needs to search for information within a PDF
Instructions
- First, verify the PDF file exists at the specified path
- Read the PDF content with a PDF-capable tool available to the agent
- Extract text while preserving structure where possible
- For large PDFs, process in chunks to manage context
- Summarize or analyze based on user's specific request
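The path check and chunking steps above can be sketched with the standard library alone. This is a minimal illustration, not part of the skill itself: `verify_pdf_path`, `chunk_pages`, and the `max_chars` budget are hypothetical names, and the per-page text is assumed to come from whichever PDF reader the agent uses.

```python
from pathlib import Path


def verify_pdf_path(path_str: str) -> Path:
    """Return the path if it points at an existing PDF file, else raise."""
    path = Path(path_str).expanduser()
    if not path.is_file():
        raise FileNotFoundError(f"No PDF file found at: {path}")
    # PDF files start with the "%PDF-" marker; check it rather than
    # trusting the file extension.
    with path.open("rb") as fh:
        if fh.read(5) != b"%PDF-":
            raise ValueError(f"File does not look like a PDF: {path}")
    return path


def chunk_pages(pages: list[str], max_chars: int = 4000) -> list[str]:
    """Group page texts into chunks of at most max_chars characters."""
    if max_chars <= 0:
        raise ValueError("max_chars must be positive")
    chunks: list[str] = []
    current = ""
    for page in pages:
        if not page:
            continue  # skip pages with no extracted text
        # A single page larger than the budget is split into pieces first.
        pieces = [page[i:i + max_chars] for i in range(0, len(page), max_chars)]
        for piece in pieces:
            sep = "\n\n" if current else ""
            if current and len(current) + len(sep) + len(piece) > max_chars:
                chunks.append(current)  # budget exceeded: start a new chunk
                current = piece
            else:
                current += sep + piece
    if current:
        chunks.append(current)
    return chunks
```

Chunking on page boundaries keeps each chunk aligned with the document's own structure; the character budget is only a rough stand-in for a model's context limit and would need tuning for a real tokenizer.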
Best Practices
- Always confirm the file path before processing
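- Handle encrypted PDFs gracefully: report clearly that the file is encrypted instead of failing silently
- Scanned PDFs may contain page images with no text layer; if extraction returns little or no text, note that OCR may be required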
- Preserve important formatting like tables and lists
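The encrypted and scanned cases above can be flagged before any analysis starts. The sketch below uses only the standard library and rests on two stated assumptions: an encrypted PDF carries an `/Encrypt` entry in its trailer, so a raw byte search is a crude heuristic rather than a real parser, and a page whose extracted text is nearly empty is probably a scanned image. All function names and the `min_chars` threshold are hypothetical.

```python
from pathlib import Path


def looks_encrypted(path: str) -> bool:
    """Crude heuristic: search the raw bytes for an /Encrypt entry.

    This reads the whole file and can give false positives if the
    literal string appears elsewhere; a PDF library is more reliable.
    """
    return b"/Encrypt" in Path(path).read_bytes()


def pages_needing_ocr(pages: list[str], min_chars: int = 20) -> list[int]:
    """Return 1-based numbers of pages whose extracted text is nearly empty."""
    return [
        number
        for number, text in enumerate(pages, start=1)
        if len(text.strip()) < min_chars
    ]


def preflight_messages(encrypted: bool, ocr_pages: list[int]) -> list[str]:
    """Turn the two checks into user-facing messages."""
    messages = []
    if encrypted:
        messages.append(
            "This PDF appears to be encrypted; a password may be needed to read it."
        )
    if ocr_pages:
        listed = ", ".join(str(n) for n in ocr_pages)
        messages.append(
            f"Pages {listed} returned little or no text and may need OCR."
        )
    return messages
```

Returning messages rather than raising lets the agent report every problem at once and still process the pages that did yield text.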