pdf cleanup by andiwand · Pull Request #541 · opendocument-app/OpenDocument.core

andiwand · 2026-06-17T20:39:00Z

move stateless pdf parsing functions from DocumentParser to FileParser
improve Dictionary interface
stream version for read_xref_stream_table
upstream decrypt_strings to Decryptor
make some classes final

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: af7d826733

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".

chatgpt-codex-connector · 2026-06-17T20:54:30Z

+  std::istringstream in(std::move(data));
+  Xref xref = FileParser(in).read_xref_stream_table(field_widths, subsections);


Parse xref streams from the decoded buffer

For PDFs whose cross-reference stream is filtered (for example the common /Filter /FlateDecode case), decode(...) puts the usable entry table in decoded.data, but this new stream is built from data, which was already moved into decode and is still the encoded payload at best. That makes xref-stream PDFs parse garbage or immediately fall into recovery, and recovery loses trailer-only data such as /Encrypt from xref-stream dictionaries, so encrypted modern PDFs can no longer open correctly.

Useful? React with 👍 / 👎.

cleanup

af7d826

andiwand marked this pull request as ready for review June 17, 2026 20:51

andiwand enabled auto-merge (squash) June 17, 2026 20:51

chatgpt-codex-connector Bot reviewed Jun 17, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

pdf cleanup#541

pdf cleanup#541
andiwand wants to merge 1 commit into
mainfrom
pdf-cleanup

andiwand commented Jun 17, 2026

Uh oh!

chatgpt-codex-connector Bot left a comment

Uh oh!

chatgpt-codex-connector Bot Jun 17, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

		std::istringstream in(std::move(data));
		Xref xref = FileParser(in).read_xref_stream_table(field_widths, subsections);

Conversation

andiwand commented Jun 17, 2026

Uh oh!

chatgpt-codex-connector Bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector Bot Jun 17, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant