inspirationull/texttractor

A Swift package to extract text from images and PDFs using Apple's Vision framework.

Features

Extract text from various image formats (PNG, JPEG, etc.).
Extract text from PDF documents (multi-page supported).
Simple command-line interface.
Lightweight, easy-to-integrate library for use in other Swift projects.

Requirements

macOS 13.0 or later.
Swift 5.9 or later.
Xcode 14 or later.

CLI Usage

1. Build the CLI

First, build the project from the root directory:

swift build -c release

The executable text-tractor will be located in .build/release/.

2. Run the Executable

The CLI takes a single required argument: the path to the input file. It can either print the extracted text to the console or save it to a file.

Syntax:

.build/release/text-tractor <path-to-image-or-pdf> [options]

Arguments:

inputPath: The path to the input image or PDF file.

Options:

-o, --output-path <path>: The path to save the output .txt file. If omitted, text is printed to standard output.

Examples:

Extract text and print to console:

``bash .build/release/text-tractor /path/to/my/image.png ``

Extract text and save to a file:

``bash .build/release/text-tractor /path/to/my/document.pdf -o /path/to/output/extracted.txt ``

Library Integration for GUI Apps

You can easily use TextTractorCore in your own Xcode project (e.g., a SwiftUI or AppKit app).

1. Add Package Dependency

In Xcode, go to File > Add Packages... and enter the repository URL for this project.

https://github.com/inspirationull/texttractor

Xcode will fetch the package. When prompted to "Choose Package Products for TextTractor", select TextTractorCore and add it to your app's target.

2. Use in Code

Now you can import TextTractorCore and use the OCRService to process files.

import SwiftUI
import TextTractorCore

struct ContentView: View {
    @State private var extractedText = "Loading..."
    private let ocrService = OCRService()

    var body: some View {
        ScrollView {
            Text(extractedText)
                .padding()
        }
        .onAppear(perform: processFile)
    }

    private func processFile() {
        // Get a URL to a local image or PDF file
        guard let fileURL = Bundle.main.url(forResource: "my-document", withExtension: "pdf") else {
            self.extractedText = "Error: File not found."
            return
        }

        Task {
            do {
                let text = try await ocrService.processFile(at: fileURL)
                DispatchQueue.main.async {
                    self.extractedText = text
                }
            } catch {
                DispatchQueue.main.async {
                    self.extractedText = "An error occurred: \(error.localizedDescription)"
                }
            }
        }
    }
}

API

The public API of the TextTractorCore library is simple and focused.

`OCRService`

This is the main struct you will interact with.

public struct OCRService {
    public init() {}

    /// Main entry point: Process a file at a URL (Image or PDF)
    public func processFile(at url: URL) async throws -> String
}

init(): Creates a new instance of the service.
processFile(at: URL): Asynchronously processes the file at the given URL. It automatically detects whether the file is an image or a PDF. It returns the extracted text as a String or throws an OCRError.

`OCRError`

A custom error enum for handling processing failures.

public enum OCRError: Error {
    case invalidFile
    case pdfConversionFailed
    case processingFailed(Error) // Wraps an underlying system error
}

Package Metadata

Repository: inspirationull/texttractor

Default branch: main

README: README.md