aboutsummaryrefslogtreecommitdiffstats
path: root/Documentation/technical
diff options
context:
space:
mode:
authorJunio C Hamano <gitster@pobox.com>2021-02-08 14:05:53 -0800
committerJunio C Hamano <gitster@pobox.com>2021-02-08 14:05:54 -0800
commit71e83b2e7da61f699f4405233cc20ebf9cb7c66e (patch)
tree4e51b2e70cb669f1a4f5d444c1732e40f98a5605 /Documentation/technical
parent5731e4040998d978108dd6ca80f4f67129b6453e (diff)
parent7b77f5a13effd01c40b3b354d93749ece79e0acc (diff)
downloadgit-71e83b2e7da61f699f4405233cc20ebf9cb7c66e.tar.gz
Merge branch 'ma/doc-pack-format-varint-for-sizes' into maint
Doc update. * ma/doc-pack-format-varint-for-sizes: pack-format.txt: document sizes at start of delta data
Diffstat (limited to 'Documentation/technical')
-rw-r--r--Documentation/technical/pack-format.txt17
1 files changed, 16 insertions, 1 deletions
diff --git a/Documentation/technical/pack-format.txt b/Documentation/technical/pack-format.txt
index f96b2e605f..96d2fc589f 100644
--- a/Documentation/technical/pack-format.txt
+++ b/Documentation/technical/pack-format.txt
@@ -55,6 +55,18 @@ Valid object types are:
Type 5 is reserved for future expansion. Type 0 is invalid.
+=== Size encoding
+
+This document uses the following "size encoding" of non-negative
+integers: From each byte, the seven least significant bits are
+used to form the resulting integer. As long as the most significant
+bit is 1, this process continues; the byte with MSB 0 provides the
+last seven bits. The seven-bit chunks are concatenated. Later
+values are more significant.
+
+This size encoding should not be confused with the "offset encoding",
+which is also used in this document.
+
=== Deltified representation
Conceptually there are only four object types: commit, tree, tag and
@@ -73,7 +85,10 @@ Ref-delta can also refer to an object outside the pack (i.e. the
so-called "thin pack"). When stored on disk however, the pack should
be self contained to avoid cyclic dependency.
-The delta data is a sequence of instructions to reconstruct an object
+The delta data starts with the size of the base object and the
+size of the object to be reconstructed. These sizes are
+encoded using the size encoding from above. The remainder of
+the delta data is a sequence of instructions to reconstruct the object
from the base object. If the base object is deltified, it must be
converted to canonical form first. Each instruction appends more and
more data to the target object until it's complete. There are two