mirror of
git://git.proxmox.com/git/proxmox-backup.git
synced 2025-01-20 14:03:53 +03:00
5cff9c6fe8
Describes the pxar metadata archive and the corresponding pxar payload file-format layout. Signed-off-by: Christian Ebner <c.ebner@proxmox.com>
167 lines
4.5 KiB
ReStructuredText
167 lines
4.5 KiB
ReStructuredText
File Formats
|
|
============
|
|
|
|
.. _pxar-format:
|
|
|
|
Proxmox File Archive Format (``.pxar``)
|
|
---------------------------------------
|
|
|
|
.. graphviz:: pxar-format-overview.dot
|
|
|
|
.. _pxar-meta-format:
|
|
|
|
Proxmox File Archive Format - Meta (``.mpxar``)
|
|
-----------------------------------------------
|
|
|
|
Pxar metadata archive with same structure as a regular pxar archive, with the
|
|
exception of regular file payloads not being contained within the archive
|
|
itself, but rather being stored as payload references to the corresponding pxar
|
|
payload (``.ppxar``) file.
|
|
|
|
Can be used to lookup all the archive entries and metadata without the size
|
|
overhead introduced by the file payloads.
|
|
|
|
.. graphviz:: meta-format-overview.dot
|
|
|
|
.. _ppxar-format:
|
|
|
|
Proxmox File Archive Format - Payload (``.ppxar``)
|
|
--------------------------------------------------
|
|
|
|
Pxar payload file storing regular file payloads to be referenced and accessed by
|
|
the corresponding pxar metadata (``.mpxar``) archive. Contains a concatenation
|
|
of regular file payloads, each prefixed by a `PAYLOAD` header. Further, the
|
|
actual referenced payload entries might be separated by padding (full/partial
|
|
payloads not referenced), introduced when reusing chunks of a previous backup
|
|
run, when chunk boundaries did not aligned to payload entry offsets.
|
|
|
|
All headers are stored as little-endian.
|
|
|
|
.. list-table::
|
|
:widths: auto
|
|
|
|
* - ``PAYLOAD_START_MARKER``
|
|
- header of ``[u8; 16]`` consisting of type hash and size;
|
|
marks start
|
|
* - ``PAYLOAD``
|
|
- header of ``[u8; 16]`` cosisting of type hash and size;
|
|
referenced by metadata archive
|
|
* - Payload
|
|
- raw regular file payload
|
|
* - Padding
|
|
- partial/full unreferenced payloads, caused by unaligned chunk boundary
|
|
* - ...
|
|
- further concatenation of payload header, payload and padding
|
|
* - ``PAYLOAD_TAIL_MARKER``
|
|
- header of ``[u8; 16]`` consisting of type hash and size;
|
|
marks end
|
|
.. _data-blob-format:
|
|
|
|
Data Blob Format (``.blob``)
|
|
----------------------------
|
|
|
|
The data blob format is used to store small binary data. The magic number
|
|
decides the exact format:
|
|
|
|
.. list-table::
|
|
:widths: auto
|
|
|
|
* - ``[66, 171, 56, 7, 190, 131, 112, 161]``
|
|
- unencrypted
|
|
- uncompressed
|
|
* - ``[49, 185, 88, 66, 111, 182, 163, 127]``
|
|
- unencrypted
|
|
- compressed
|
|
* - ``[123, 103, 133, 190, 34, 45, 76, 240]``
|
|
- encrypted
|
|
- uncompressed
|
|
* - ``[230, 89, 27, 191, 11, 191, 216, 11]``
|
|
- encrypted
|
|
- compressed
|
|
|
|
The compression algorithm used is ``zstd``. The encryption cipher is
|
|
``AES_256_GCM``.
|
|
|
|
Unencrypted blobs use the following format:
|
|
|
|
.. list-table::
|
|
:widths: auto
|
|
|
|
* - ``MAGIC: [u8; 8]``
|
|
* - ``CRC32: [u8; 4]``
|
|
* - ``Data: (max 16MiB)``
|
|
|
|
Encrypted blobs additionally contain a 16 byte initialization vector (IV),
|
|
followed by a 16 byte authenticated encryption (AE) tag, followed by the
|
|
encrypted data:
|
|
|
|
.. list-table::
|
|
|
|
* - ``MAGIC: [u8; 8]``
|
|
* - ``CRC32: [u8; 4]``
|
|
* - ``IV: [u8; 16]``
|
|
* - ``TAG: [u8; 16]``
|
|
* - ``Data: (max 16MiB)``
|
|
|
|
|
|
.. _fixed-index-format:
|
|
|
|
Fixed Index Format (``.fidx``)
|
|
-------------------------------
|
|
|
|
All numbers are stored as little-endian.
|
|
|
|
.. list-table::
|
|
|
|
* - ``MAGIC: [u8; 8]``
|
|
- ``[47, 127, 65, 237, 145, 253, 15, 205]``
|
|
* - ``uuid: [u8; 16]``,
|
|
- Unique ID
|
|
* - ``ctime: i64``,
|
|
- Creation Time (epoch)
|
|
* - ``index_csum: [u8; 32]``,
|
|
- SHA-256 over the index (without header) ``SHA256(digest1||digest2||...)``
|
|
* - ``size: u64``,
|
|
- Image size
|
|
* - ``chunk_size: u64``,
|
|
- Chunk size
|
|
* - ``reserved: [u8; 4016]``,
|
|
- Overall header size is one page (4096 bytes)
|
|
* - ``digest1: [u8; 32]``
|
|
- First chunk digest
|
|
* - ``digest2: [u8; 32]``
|
|
- Second chunk digest
|
|
* - ...
|
|
- Next chunk digest ...
|
|
|
|
|
|
.. _dynamic-index-format:
|
|
|
|
Dynamic Index Format (``.didx``)
|
|
--------------------------------
|
|
|
|
All numbers are stored as little-endian.
|
|
|
|
.. list-table::
|
|
|
|
* - ``MAGIC: [u8; 8]``
|
|
- ``[28, 145, 78, 165, 25, 186, 179, 205]``
|
|
* - ``uuid: [u8; 16]``,
|
|
- Unique ID
|
|
* - ``ctime: i64``,
|
|
- Creation Time (epoch)
|
|
* - ``index_csum: [u8; 32]``,
|
|
- SHA-256 over the index (without header) ``SHA256(offset1||digest1||offset2||digest2||...)``
|
|
* - ``reserved: [u8; 4032]``,
|
|
- Overall header size is one page (4096 bytes)
|
|
* - ``offset1: u64``
|
|
- End of first chunk
|
|
* - ``digest1: [u8; 32]``
|
|
- First chunk digest
|
|
* - ``offset2: u64``
|
|
- End of second chunk
|
|
* - ``digest2: [u8; 32]``
|
|
- Second chunk digest
|
|
* - ...
|
|
- Next chunk offset/digest
|