-
Notifications
You must be signed in to change notification settings - Fork 2
Description
Describe the Compatibility Issue
EncypherAI uses standard Unicode variation selectors to embed metadata invisibly. When text encoded with EncypherAI is viewed in LibreOffice Writer using the Liberation Serif font, these variation selectors are not rendered invisibly. Instead, they appear as visible diamond/question mark characters, disrupting the text's appearance.
Steps to Reproduce
Please provide detailed steps to reliably reproduce the behavior:
- Environment Setup: Open LibreOffice Writer (e.g., version 7.6.x) on a system where Liberation Serif is the active font (common on many Linux distributions).
- Action: Copy the following EncypherAI-encoded text (or any text generated by
encypher-ai):
This is the first sentence containing hidden metadata. - Observation: Observe the pasted text within the LibreOffice Writer document.
Expected Behavior
The pasted text should appear normally and legibly as: "This is the first sentence containing hidden metadata." The embedded Unicode variation selectors representing the metadata should be entirely invisible and have zero width, not affecting layout.
Actual Behavior
Visible diamond or question-mark placeholder characters (<?>) appear interspersed throughout the text, typically adjacent to spaces or punctuation. This makes the text difficult to read and appear corrupted. See screenshot (to be attached to issue).
Environment Details
Please provide as much detail as possible about the environment where the issue occurs:
- Operating System: Windows 10/11)
- Application & Version: LibreOffice Writer 7.6.5 (Observed in various 7.x versions)
- Font & Version (if known): Liberation Serif (Standard version bundled with LibreOffice/Ubuntu)
- EncypherAI Version Used (if generating text):
encypher-aiv2.0.0 (or latest)
Screenshots / Logs
Additional Context / Workarounds
- This rendering issue is due to the LibreOffice rendering engine and/or the Liberation Serif font not correctly handling these standard Unicode variation selectors according to their defined non-printing, zero-width properties.
- A similar issue occurs in some terminal emulators where the characters render as empty boxes (
□). - Copying the text from LibreOffice Writer (after pasting it in) and pasting it elsewhere (e.g., a plain text editor) introduces extraneous line breaks and appears to corrupt or alter the original sequence of variation selectors, potentially breaking metadata verification.
- No known workarounds within EncypherAI itself, other than advising users to view/verify text in compatible environments or use fonts with better Unicode variation selector support if available.
