nombra

command module

v0.0.0-...-b41d105 Latest Latest Go to latest Published: Mar 7, 2026 License: Apache-2.0 Imports: 19 Imported by: 0

Details

Valid go.mod file
Redistributable license
Tagged version
Stable version
Learn more about best practices

Repository

github.com/rtyx/nombra

Links

Open Source Insights

README ¶

Nombra - AI-Powered PDF Filename Generator

Overview

Nombra is a CLI tool that analyzes the content of PDF documents and generates meaningful file names using OpenAI's API. This is useful for organizing documents, automating metadata generation, and improving file management.

Features

Extracts text from PDFs
Uses AI to generate relevant titles
Supports OCR (via Tesseract) for scanned PDFs
Falls back to OpenAI image analysis when OCR/text extraction finds nothing
Handles special characters and long filenames safely
Verbose mode for debugging

Installation

Prerequisites

Go 1.24 or later
OpenAI API Key (required for AI-generated titles)
Optional: tesseract-ocr and pdftoppm for OCR functionality

Install Nombra

# Clone the repository
git clone https://github.com/YOUR_USERNAME/nombra.git
cd nombra

# Build the binary with version information
go build -ldflags="-X main.version=$(git rev-parse --short HEAD)" -o nombra

Make Nombra a Global Command

To use nombra from anywhere in your system, move the built binary to a directory included in your PATH, such as /usr/local/bin:

sudo mv nombra /usr/local/bin/

You can now run nombra from any directory:

nombra myfile.pdf

If /usr/local/bin is not in your PATH, you can check your PATH with:

echo $PATH

And add it to your shell configuration if needed.

Usage

Basic usage

./nombra myfile.pdf

This will generate a title based on the PDF’s content and rename the file accordingly.

Processing Multiple Files

You can process several files in one command:

./nombra file1.pdf file2.pdf file3.pdf

You can also process all PDFs in a directory:

./nombra --dir ./documents

Control parallel processing with workers:

./nombra --dir ./documents --workers 6

Using an API key

./nombra myfile.pdf --key YOUR_OPENAI_API_KEY

Or set the environment variable:

export OPENAI_API_KEY=your_api_key
./nombra myfile.pdf

Forcing OCR

While nombra automatically opts for OCR when needed, you can also force it:

./nombra myfile.pdf --ocr

If both native extraction and OCR fail, Nombra analyzes the first PDF page as an image using OpenAI (gpt-4o-mini) and uses that description to generate a filename.

Verbose Mode

./nombra myfile.pdf --verbose

Interactive Confirmation

Use interactive mode to approve the rename before applying it:

./nombra myfile.pdf --interactive

Note: --interactive currently supports a single input file at a time.

Selecting an OpenAI model

By default, Nombra uses gpt-5.4 for title generation. You can choose a different model with the --model flag:

./nombra myfile.pdf --model gpt-5.4-pro

Setting Reasoning Effort (GPT-5 family)

You can control GPT-5 reasoning depth with --reasoning-effort:

./nombra myfile.pdf --reasoning-effort none
./nombra myfile.pdf --reasoning-effort medium

Checking the Version

Display the Git commit the binary was built from:

./nombra --version

Adjusting Content Length

You can control how much text is sent to the language model. The --max-content-length flag specifies the maximum number of characters that will be included when generating a title. Experimenting with this value can help strike a balance between providing enough context and keeping requests small:

./nombra myfile.pdf --max-content-length 5000

The minimum length required for processing can also be changed via --min-content-length.

Content Extraction Optimization

extractTextFromPDF concatenates the text from every page. When the combined text would exceed the configured limit, Nombra keeps portions from the start and end of the document and discards the middle. This strategy preserves important sections like titles, parties, and dates even when large documents are truncated.

Contributing

Pull requests and issues are welcome! Please follow these guidelines:

Report bugs and feature requests via GitHub issues.
Follow Go best practices and maintain code readability.
Add comments where necessary to explain logic.

License

This project is licensed under the Apache License 2.0 - see the LICENSE file for details.

Documentation ¶

There is no documentation for this package.

Source Files ¶

View all Source files

nombra.go

?	: This menu
/	: Search site
f or F	: Jump to
y or Y	: Canonical URL