Read a ZUGFeRD PDF: Extract the Embedded XML
A ZUGFeRD invoice looks like a normal PDF — but the interesting part is hidden invisibly inside it: the embedded XML with the structured data.
Where is the XML?
ZUGFeRD and Factur-X files are PDF/A-3 documents with an embedded XML file (in the UN/CEFACT CII format). Depending on the version, this file is named factur-x.xml or zugferd-invoice.xml and contains all invoice data in machine-readable form. It is the authoritative part: if the display in the PDF and the data in the XML differ, the XML counts.
How to read out the XML
- Choose the PDF: Drag the ZUGFeRD PDF into the tool or pick it via the file dialog.
- Extract automatically: The tool detects the embedded XML and reads it out.
- View readably & validate: The structured data is displayed clearly and checked against EN 16931.
This way you can see at a glance which data is really in the invoice — and whether it is valid.
Why read it out at all?
- Posting: Accounting software takes the data straight from the XML.
- Verification: You can tell whether the PDF display and the data match.
- Profile check: Not every ZUGFeRD profile is a valid e-invoice — MINIMUM and BASIC WL are not enough. The extracted XML reveals the profile used.
Plain XML instead of PDF?
If you have instead received a plain XRechnung (XML without PDF), the guide Open an e-invoice will help you.
Local and no upload
Reading it out runs entirely in the browser — the PDF never leaves your computer. Try it right away: Read your ZUGFeRD PDF now.
Frequently asked questions
How do I read the XML out of a ZUGFeRD PDF?
You open the PDF in a tool that detects and extracts the embedded XML file. In a browser tool this happens automatically — the data is displayed readably and validated.
Where is the XML inside a ZUGFeRD file?
As an attachment in the PDF/A-3. Depending on the version the file is named factur-x.xml or zugferd-invoice.xml and contains the structured invoice data in CII format.
Which part prevails if they differ?
The embedded XML is the authoritative (leading) part. If the PDF display and the XML data differ, the structured data in the XML counts.