Invoice Upload & OCR
Learn how to upload invoices and use SOPHIOS’s AI-powered OCR to automatically extract invoice data.
Overview
SOPHIOS uses advanced Optical Character Recognition (OCR) powered by private AI infrastructure to automatically extract data from invoices, reducing manual data entry by up to 80%.
95%+ Accuracy on structured invoices with confidence scoring for quality assurance.
Uploading an Invoice
Select Your Asset
Ensure the correct asset is selected in the asset selector at the top of the page.
Navigate to Invoices
Click “Invoices” in the main sidebar.
Upload Invoice File
Option 1: Drag and Drop
- Drag your invoice file directly onto the upload zone
- File will upload automatically
Option 2: Click to Browse
- Click the upload zone
- Select file from your computer
- Click “Open”
Supported Formats:
- PDF (Portable Document Format)
- JPEG / JPG (photos of invoices)
- PNG (scanned images)
File Size Limits:
- Maximum: 10 MB per file
- Recommended: Under 5 MB for faster processing
Pro Tip: PDFs provide the best OCR accuracy. If you have paper invoices, scan them as PDF rather than taking photos.
Wait for OCR Processing
SOPHIOS processes the invoice automatically:
- Processing Time: 10-30 seconds (depending on file size)
- Progress Indicator: Shows “Processing…”
- Status Updates: Real-time feedback
OCR Data Extraction
What Gets Extracted
SOPHIOS’s AI automatically identifies and extracts:
Vendor Information:
- Vendor name
- Vendor address
- Vendor contact details
- VAT/Tax ID number
Invoice Details:
- Invoice number
- Invoice date
- Due date
- Purchase order number (if present)
Line Items:
- Item descriptions
- Quantities
- Unit prices
- Line totals
Financial Data:
- Subtotal
- Tax amount(s)
- Total amount
- Currency
Additional Data:
- Payment terms
- Bank details (if present)
- Notes and references
Reviewing OCR Results
After processing, review the extracted data:
Check Extraction Quality
SOPHIOS provides a confidence score for each field:
- 🟢 High (90-100%) - Very confident, likely accurate
- 🟡 Medium (70-89%) - Fairly confident, review recommended
- 🔴 Low (<70%) - Low confidence, manual verification required
Verify Key Fields
Always verify these critical fields:
- Total Amount - Most important!
- Invoice Number - Ensure uniqueness
- Invoice Date - Check format (DD/MM/YYYY or MM/DD/YYYY)
- Vendor Name - Verify spelling
Correct Errors
If OCR made mistakes:
- Click the field you want to edit
- Type the correction
- Tab or click away to save
Machine Learning: SOPHIOS learns from your corrections and improves accuracy over time.
Review Line Items
Check extracted line items:
- Add Missing Items - Click ”+ Add Line Item”
- Edit Items - Click any item to modify
- Delete Items - Click delete icon (🗑️) to remove
Categorizing Invoices
Automatic Categorization
SOPHIOS suggests categories based on:
- Vendor Name - Recognizes known suppliers
- Description Keywords - Matches item descriptions
- Historical Patterns - Learns from past categorizations
- Machine Learning - AI-powered suggestions
Manual Categorization
Override auto-suggestion:
- Click “Category” dropdown
- Select from available categories
- Create new category if needed
Common Categories:
- Fuel & Lubricants
- Maintenance & Repairs
- Marina/Port/Hangar Fees
- Catering & Provisions
- Insurance
- Crew Expenses
- Equipment & Supplies
Assigning to Budgets
Link invoices to budgets for tracking:
Select Budget (Optional)
- Click “Assign to Budget” dropdown
- Choose relevant budget
- Invoice amount deducts from budget automatically
Prorate Expense (Optional)
For large expenses to distribute:
- Check “Prorate this expense”
- Choose proration method:
- Temporal - Spread over months
- Category - Distribute across categories
- Manual - Custom allocation
- Configure proration settings
See Expense Proration for details.
Invoice Status Workflow
Invoices progress through these statuses:
PENDING → PROCESSING → NEEDS_REVIEW → VERIFIED → APPROVED → EXECUTEDStatus Definitions:
| Status | Description | Actions Available |
|---|---|---|
| PENDING | Uploaded, waiting for OCR | Cancel upload |
| PROCESSING | OCR in progress | Wait |
| NEEDS_REVIEW | OCR complete, review needed | Review & edit |
| VERIFIED | Data verified by accountant | Approve/reject |
| APPROVED | Approved for payment | Mark as executed |
| REJECTED | Rejected, needs revision | Re-submit |
| EXECUTED | Payment processed | Archive |
Invoice Approval Workflow
For Accountants
- Review OCR-extracted data
- Verify against original invoice
- Correct any errors
- Change status to VERIFIED
- Submit for approval
For Approvers (Admin/Owner)
- Review verified invoices
- Check budget impact
- Approve or reject with notes
- Mark as APPROVED
For Executors (Owner/Admin)
- Review approved invoices
- Confirm payment has been made
- Mark as EXECUTED
Expense Classification
Expense Type
Each invoice can be classified by expense type:
- OPEX - Operational expenditure (day-to-day costs)
- CAPEX - Capital expenditure (investments, improvements)
- MIXED - Invoices containing both OPEX and CAPEX items
Spending Line Assignment
Invoices can be assigned to spending lines (cost centers) for granular cost tracking:
- Assign during invoice creation or editing
- Change spending line assignment at any time via the inline badge
- View spending breakdowns in the Financial Reports dashboard
Advanced Features
Duplicate Detection
SOPHIOS automatically detects potential duplicates:
- Same vendor + invoice number
- Same vendor + similar amount + similar date
- Warns before saving
Review duplicate warnings carefully to avoid processing the same invoice twice.
Anomaly Detection
AI flags unusual transactions:
- Amount significantly higher than usual
- New vendor (never seen before)
- Missing required fields
- Unusual payment terms
Vendor Matching
Automatic vendor management:
- Creates vendor records from invoices
- Links all invoices to vendors
- Tracks spending per vendor
- Suggests vendor name corrections
OCR Accuracy Tips
Best Practices for Scanning
To maximize OCR accuracy:
Optimal Settings
- File Format: Use PDF when possible
- Resolution: Scan at 300 DPI or higher
- Orientation: Ensure invoice is right-side up
- Lighting: Good lighting for photos, no shadows
- Focus: Clear, not blurry images
- Cropping: Include entire invoice, minimal margins
- Color: Color or grayscale (better than black & white)
What Reduces Accuracy
Avoid these common issues:
- ❌ Blurry or out-of-focus images
- ❌ Poor lighting or shadows
- ❌ Handwritten invoices (not supported)
- ❌ Skewed or rotated images
- ❌ Low resolution (under 150 DPI)
- ❌ Watermarks obscuring text
- ❌ Crumpled or damaged invoices
Handling Different Invoice Formats
Structured Invoices
Best OCR Performance:
- Standard invoice templates
- Clear table structures
- Printed text
- Professional formats
Examples:
- Fuel company invoices
- Marina/port invoices
- Supplier invoices from major vendors
Unstructured Invoices
More Manual Review:
- Non-standard formats
- Receipts without headers
- Invoices in unusual layouts
Best Practice: For unstructured invoices, verify all fields carefully.
Multi-Page Invoices
SOPHIOS handles multi-page PDFs:
- Processes all pages automatically
- Extracts items from multiple pages
- Combines into single invoice record
Editing After Save
Edit Invoice Data
After saving, you can still edit:
- Navigate to Invoices list
- Click the invoice to open details
- Click “Edit” button
- Modify any field
- Click “Save Changes”
Re-Process OCR
If OCR failed or produced poor results:
- Open invoice details
- Click three-dot menu (⋮)
- Select “Re-process OCR”
- Upload new/better quality file (optional)
- Review new results
Bulk Upload
For multiple invoices:
Prepare Files
- Collect all invoice files
- Name them clearly (optional)
- Ensure all files under 10 MB each
Drag Multiple Files
- Navigate to Invoices
- Drag all files at once onto upload zone
- SOPHIOS processes each file sequentially
Review Queue
Monitor processing queue:
- See all uploaded files
- Check processing status
- Review as each completes
Processing Time: Bulk uploads process one at a time. For 10 invoices, expect 3-5 minutes total.
Invoice Attachments
Attach Supporting Documents
Add additional files to invoices:
- Purchase orders
- Delivery notes
- Contracts
- Payment confirmations
How to attach:
- Open invoice details
- Scroll to “Attachments” section
- Click “Add Attachment”
- Upload file(s)
Exporting Invoices
Export Options
Single Invoice:
- Open invoice details
- Click three-dot menu (⋮)
- Select “Export as PDF” or “Download Original”
Multiple Invoices:
- Go to Invoices list
- Select checkboxes for invoices
- Click “Export Selected”
- Choose format (PDF, CSV, Excel)
Troubleshooting OCR
OCR Failed
If OCR fails completely:
- Check file format - Must be PDF, JPG, or PNG
- Check file size - Must be under 10 MB
- Check image quality - Must be clear and legible
- Try re-upload - Sometimes temporary glitch
- Manual entry - Enter data manually as fallback
Low Accuracy
If OCR accuracy is poor:
- Check original quality - Rescan if blurry
- Convert to PDF - Photos work best as PDFs
- Crop properly - Remove excess whitespace
- Rotate correctly - Ensure upright orientation
- Report issue - Contact support with sample
Missing Fields
If OCR missed some fields:
- Check original - Field might genuinely be missing
- Manual entry - Type in missing data
- Mark incomplete - Flag for later completion
Common Questions
Does OCR work in multiple languages?
Yes, SOPHIOS OCR supports:
- English
- Spanish
- French
- Italian
- German
- Portuguese
- (More languages being added)
Can I edit OCR results?
Absolutely! Always verify and correct OCR results. Your corrections train the AI.
What if my invoice has no text?
Image-only invoices with very poor quality may have limited OCR support. Manual entry recommended for handwritten documents.
How secure is invoice upload?
Very secure:
- Encrypted transfer (HTTPS/TLS)
- Encrypted storage (AES-256)
- Access control (role-based permissions)
- Audit trail (all actions logged)
Ready to try it? Upload your first invoice and experience the power of AI-powered OCR!
Next Steps: