PDF Glossary
A plain-language reference for the most important PDF terms, standards, and technologies. Whether you're a beginner or a professional, this glossary has you covered.
A
- AES (Advanced Encryption Standard)
- A symmetric block cipher adopted by the U.S. government for encrypting classified information. AES-128 and AES-256 are the two key lengths used in PDF encryption. AES-256 is the strongest option available and is the default used by SmartPDFSuite's Protect tool. Try the Protect Tool
- Annotation
- A layer of content added on top of a PDF page — such as highlights, sticky notes, text comments, stamps, or drawn shapes. Annotations are stored separately from the page content and can usually be removed without affecting the underlying document.
B
- Bleed
- The area of a printed document that extends beyond the final trim edge. Bleed ensures that ink coverage reaches the very edge of the page after cutting. PDF supports bleed boxes to define this area for professional print workflows.
- Bookmark
- A navigational shortcut in a PDF that links to a specific page or section. Bookmarks appear in a sidebar panel and are commonly used in long documents like manuals, contracts, and textbooks to let readers jump between chapters.
C
- Certificate Authority (CA)
- A trusted organization that issues digital certificates used to verify the identity of the signer in digital signature workflows. When you see a green checkmark on a signed PDF, it means the certificate chain traces back to a recognized CA. Try the eSign Tool
- Compression
- The process of reducing a PDF's file size by optimizing images, removing redundant data, and re-encoding content streams. Lossless compression preserves every bit of data; lossy compression trades some quality for a smaller file. Try the Compress Tool
- Content Stream
- The binary data inside a PDF that describes how text, images, and vector graphics are rendered on a page. Content streams use a PostScript-like syntax and can be compressed with algorithms like Flate (zlib).
- Cross-Reference Table (xref)
- An internal lookup table in a PDF file that records the byte offset of every object. The xref table allows PDF readers to jump directly to any object without scanning the entire file, enabling fast random access even in large documents.
D
- Digital Signature
- A cryptographically secure type of electronic signature that uses public-key infrastructure (PKI) to verify the signer's identity and prove the document has not been tampered with after signing. Stronger than a simple electronic signature. Try the eSign Tool
- DPI (Dots Per Inch)
- A measure of print resolution. Higher DPI means finer detail. For screen viewing, 72–150 DPI is sufficient; for standard print, 150–200 DPI works well; for professional press output, 300 DPI is the minimum standard.
E
- Electronic Signature (eSignature)
- Any electronic indication of intent to agree to the contents of a document — including typed names, drawn signatures, and "I Agree" buttons. Legally valid in most jurisdictions under laws like the ESIGN Act and eIDAS, but not as cryptographically secure as a digital signature. Try the eSign Tool
- Encryption
- The process of encoding a PDF's content so it can only be read with the correct password or key. PDF supports RC4 (deprecated) and AES encryption. A user password prevents opening; an owner password restricts actions like printing and copying. Try the Protect Tool
F
- Flattening
- The process of merging annotations, form field values, or transparency layers into the base page content. After flattening, these elements become permanent parts of the page and can no longer be edited or removed independently.
- Font Embedding
- Including the font data inside the PDF file so that text renders correctly on any device, even if the recipient does not have the font installed. PDF/A mandates full font embedding. Font subsetting embeds only the characters actually used, reducing file size.
- Font Subsetting
- A technique that embeds only the specific characters (glyphs) used in the document rather than the entire font file. This reduces file size significantly — often by 80–90% for the font data — while still ensuring the document renders correctly.
- Form Fields
- Interactive elements in a PDF form — text boxes, checkboxes, radio buttons, dropdown menus, and signature fields. Form fields are part of the AcroForm or XFA specification and allow users to fill in data within a PDF viewer.
L
- Linearization (Fast Web View)
- A PDF optimization that rearranges the file's internal structure so the first page can be displayed before the entire file has been downloaded. This is especially useful for large PDFs served from a web server.
- Lossless Compression
- A compression method that reduces file size without any loss of data or quality. The original content can be perfectly reconstructed from the compressed version. Examples include Flate/Deflate (used in PDF content streams) and PNG for images.
- Lossy Compression
- A compression method that achieves smaller file sizes by discarding some data. The process is irreversible — original quality cannot be fully recovered. JPEG is the most common lossy image compression used in PDFs.
M
- Merge
- Combining two or more separate PDF files into a single document. Pages from all source files are joined sequentially, preserving their content, formatting, and resolution. Try the Merge Tool
- Metadata
- Descriptive information embedded in a PDF — typically title, author, subject, keywords, creation date, and producing software. Metadata can be viewed in document properties and is indexed by search engines. PDF/A requires certain metadata fields.
O
- OCR (Optical Character Recognition)
- A technology that converts images of text (from scanned documents or photographs) into machine-readable text characters. OCR makes scanned PDFs searchable and allows text to be copied, edited, and indexed.
- Owner Password
- A PDF password that restricts what actions a user can perform — printing, copying, editing, form-filling, etc. Unlike a user password, the owner password does not encrypt the actual content; restrictions are enforced by the viewer software and can be bypassed by non-compliant tools. Try the Protect Tool
P
- Page Tree
- The hierarchical structure inside a PDF that organizes pages. The page tree determines page order, allows efficient access to any page by number, and stores inherited properties like media box dimensions and rotation.
- PDF (Portable Document Format)
- A file format created by Adobe in 1993 and standardized as ISO 32000. PDFs preserve the exact visual layout of a document across all devices, operating systems, and printers. The format supports text, images, vector graphics, forms, annotations, encryption, and digital signatures.
- PDF/A
- An ISO-standardized subset of PDF (ISO 19005) designed for long-term archival. PDF/A requires all fonts to be embedded, prohibits JavaScript and encryption, and ensures the file is entirely self-contained so it can be rendered identically decades into the future.
- PDF/UA
- The PDF standard for universal accessibility (ISO 14289). PDF/UA requires proper structure tags, alternative text for images, logical reading order, and other features that make the document usable with assistive technologies like screen readers.
- PDF/X
- A subset of PDF (ISO 15930) standardized for prepress and professional print production. PDF/X files must meet strict color-management and font-embedding requirements to ensure predictable printed output.
- PKI (Public Key Infrastructure)
- A framework of roles, policies, and technology used to create, manage, and verify digital certificates. In PDF signing, PKI ensures that the signer's identity is authenticated by a trusted Certificate Authority and that the signed document has not been modified.
R
- Rasterization
- The process of converting vector graphics and text into a pixel-based image. PDF viewers rasterize each page for display on screen. High rasterization quality preserves sharp text and smooth curves.
- Redaction
- The permanent, irreversible removal of sensitive content from a PDF. True redaction replaces the selected text or image with a black box and deletes the underlying data from the file, ensuring it cannot be recovered by any means. Try the Redact Tool
- Rendering
- The process by which a PDF viewer interprets the file's content streams and draws the visual result on screen or paper. Rendering involves decoding fonts, decompressing images, compositing transparent layers, and applying color spaces.
S
- Split
- Extracting specific pages or page ranges from a PDF to create one or more new, smaller documents. The original file remains unchanged. Try the Split Tool
T
- Tagged PDF
- A PDF that contains structural tags identifying headings, paragraphs, tables, lists, images, and reading order. Tags enable screen readers and assistive technologies to interpret the document correctly. Required for PDF/UA compliance.
U
- User Password (Open Password)
- A PDF password that must be entered before the document can be viewed. The user password triggers actual decryption of the file's content — without it, the data is unreadable scrambled bytes. This is the stronger of the two PDF password types. Try the Unlock Tool
V
- Vector Graphics
- Graphics defined by mathematical paths (lines, curves, shapes) rather than pixel grids. Vectors can be scaled to any size without losing quality and are used in PDFs for text rendering, logos, diagrams, and illustrations.
W
- Watermark
- A visible overlay — typically text (e.g., "DRAFT," "CONFIDENTIAL") or a semi-transparent image — added to PDF pages for branding, copyright protection, or status indication. Watermarks can appear behind or in front of the page content. Try the Watermark Tool
X
- XFA (XML Forms Architecture)
- A legacy XML-based form format supported in some PDFs. XFA forms are more powerful than AcroForms but are poorly supported outside Adobe products. The PDF 2.0 specification deprecated XFA in favor of AcroForms.
Want to Learn More?
Our blog dives deeper into PDF topics — from compression techniques to encryption best practices.
Read the Blog