Documentation / git-pack-objects.txton commit test-lib: try harder to ensure a working jgit (abd0f28)
   1git-pack-objects(1)
   2===================
   3
   4NAME
   5----
   6git-pack-objects - Create a packed archive of objects
   7
   8
   9SYNOPSIS
  10--------
  11[verse]
  12'git pack-objects' [-q | --progress | --all-progress] [--all-progress-implied]
  13        [--no-reuse-delta] [--delta-base-offset] [--non-empty]
  14        [--local] [--incremental] [--window=<n>] [--depth=<n>]
  15        [--revs [--unpacked | --all]] [--keep-pack=<pack-name>]
  16        [--stdout [--filter=<filter-spec>] | base-name]
  17        [--shallow] [--keep-true-parents] [--sparse] < object-list
  18
  19
  20DESCRIPTION
  21-----------
  22Reads list of objects from the standard input, and writes either one or
  23more packed archives with the specified base-name to disk, or a packed
  24archive to the standard output.
  25
  26A packed archive is an efficient way to transfer a set of objects
  27between two repositories as well as an access efficient archival
  28format.  In a packed archive, an object is either stored as a
  29compressed whole or as a difference from some other object.
  30The latter is often called a delta.
  31
  32The packed archive format (.pack) is designed to be self-contained
  33so that it can be unpacked without any further information. Therefore,
  34each object that a delta depends upon must be present within the pack.
  35
  36A pack index file (.idx) is generated for fast, random access to the
  37objects in the pack. Placing both the index file (.idx) and the packed
  38archive (.pack) in the pack/ subdirectory of $GIT_OBJECT_DIRECTORY (or
  39any of the directories on $GIT_ALTERNATE_OBJECT_DIRECTORIES)
  40enables Git to read from the pack archive.
  41
  42The 'git unpack-objects' command can read the packed archive and
  43expand the objects contained in the pack into "one-file
  44one-object" format; this is typically done by the smart-pull
  45commands when a pack is created on-the-fly for efficient network
  46transport by their peers.
  47
  48
  49OPTIONS
  50-------
  51base-name::
  52        Write into pairs of files (.pack and .idx), using
  53        <base-name> to determine the name of the created file.
  54        When this option is used, the two files in a pair are written in
  55        <base-name>-<SHA-1>.{pack,idx} files.  <SHA-1> is a hash
  56        based on the pack content and is written to the standard
  57        output of the command.
  58
  59--stdout::
  60        Write the pack contents (what would have been written to
  61        .pack file) out to the standard output.
  62
  63--revs::
  64        Read the revision arguments from the standard input, instead of
  65        individual object names.  The revision arguments are processed
  66        the same way as 'git rev-list' with the `--objects` flag
  67        uses its `commit` arguments to build the list of objects it
  68        outputs.  The objects on the resulting list are packed.
  69        Besides revisions, `--not` or `--shallow <SHA-1>` lines are
  70        also accepted.
  71
  72--unpacked::
  73        This implies `--revs`.  When processing the list of
  74        revision arguments read from the standard input, limit
  75        the objects packed to those that are not already packed.
  76
  77--all::
  78        This implies `--revs`.  In addition to the list of
  79        revision arguments read from the standard input, pretend
  80        as if all refs under `refs/` are specified to be
  81        included.
  82
  83--include-tag::
  84        Include unasked-for annotated tags if the object they
  85        reference was included in the resulting packfile.  This
  86        can be useful to send new tags to native Git clients.
  87
  88--window=<n>::
  89--depth=<n>::
  90        These two options affect how the objects contained in
  91        the pack are stored using delta compression.  The
  92        objects are first internally sorted by type, size and
  93        optionally names and compared against the other objects
  94        within --window to see if using delta compression saves
  95        space.  --depth limits the maximum delta depth; making
  96        it too deep affects the performance on the unpacker
  97        side, because delta data needs to be applied that many
  98        times to get to the necessary object.
  99+
 100The default value for --window is 10 and --depth is 50. The maximum
 101depth is 4095.
 102
 103--window-memory=<n>::
 104        This option provides an additional limit on top of `--window`;
 105        the window size will dynamically scale down so as to not take
 106        up more than '<n>' bytes in memory.  This is useful in
 107        repositories with a mix of large and small objects to not run
 108        out of memory with a large window, but still be able to take
 109        advantage of the large window for the smaller objects.  The
 110        size can be suffixed with "k", "m", or "g".
 111        `--window-memory=0` makes memory usage unlimited.  The default
 112        is taken from the `pack.windowMemory` configuration variable.
 113
 114--max-pack-size=<n>::
 115        In unusual scenarios, you may not be able to create files
 116        larger than a certain size on your filesystem, and this option
 117        can be used to tell the command to split the output packfile
 118        into multiple independent packfiles, each not larger than the
 119        given size. The size can be suffixed with
 120        "k", "m", or "g". The minimum size allowed is limited to 1 MiB.
 121        This option
 122        prevents the creation of a bitmap index.
 123        The default is unlimited, unless the config variable
 124        `pack.packSizeLimit` is set.
 125
 126--honor-pack-keep::
 127        This flag causes an object already in a local pack that
 128        has a .keep file to be ignored, even if it would have
 129        otherwise been packed.
 130
 131--keep-pack=<pack-name>::
 132        This flag causes an object already in the given pack to be
 133        ignored, even if it would have otherwise been
 134        packed. `<pack-name>` is the the pack file name without
 135        leading directory (e.g. `pack-123.pack`). The option could be
 136        specified multiple times to keep multiple packs.
 137
 138--incremental::
 139        This flag causes an object already in a pack to be ignored
 140        even if it would have otherwise been packed.
 141
 142--local::
 143        This flag causes an object that is borrowed from an alternate
 144        object store to be ignored even if it would have otherwise been
 145        packed.
 146
 147--non-empty::
 148        Only create a packed archive if it would contain at
 149        least one object.
 150
 151--progress::
 152        Progress status is reported on the standard error stream
 153        by default when it is attached to a terminal, unless -q
 154        is specified. This flag forces progress status even if
 155        the standard error stream is not directed to a terminal.
 156
 157--all-progress::
 158        When --stdout is specified then progress report is
 159        displayed during the object count and compression phases
 160        but inhibited during the write-out phase. The reason is
 161        that in some cases the output stream is directly linked
 162        to another command which may wish to display progress
 163        status of its own as it processes incoming pack data.
 164        This flag is like --progress except that it forces progress
 165        report for the write-out phase as well even if --stdout is
 166        used.
 167
 168--all-progress-implied::
 169        This is used to imply --all-progress whenever progress display
 170        is activated.  Unlike --all-progress this flag doesn't actually
 171        force any progress display by itself.
 172
 173-q::
 174        This flag makes the command not to report its progress
 175        on the standard error stream.
 176
 177--no-reuse-delta::
 178        When creating a packed archive in a repository that
 179        has existing packs, the command reuses existing deltas.
 180        This sometimes results in a slightly suboptimal pack.
 181        This flag tells the command not to reuse existing deltas
 182        but compute them from scratch.
 183
 184--no-reuse-object::
 185        This flag tells the command not to reuse existing object data at all,
 186        including non deltified object, forcing recompression of everything.
 187        This implies --no-reuse-delta. Useful only in the obscure case where
 188        wholesale enforcement of a different compression level on the
 189        packed data is desired.
 190
 191--compression=<n>::
 192        Specifies compression level for newly-compressed data in the
 193        generated pack.  If not specified,  pack compression level is
 194        determined first by pack.compression,  then by core.compression,
 195        and defaults to -1,  the zlib default,  if neither is set.
 196        Add --no-reuse-object if you want to force a uniform compression
 197        level on all data no matter the source.
 198
 199--sparse::
 200        Use the "sparse" algorithm to determine which objects to include in
 201        the pack, when combined with the "--revs" option. This algorithm
 202        only walks trees that appear in paths that introduce new objects.
 203        This can have significant performance benefits when computing
 204        a pack to send a small change. However, it is possible that extra
 205        objects are added to the pack-file if the included commits contain
 206        certain types of direct renames.
 207
 208--thin::
 209        Create a "thin" pack by omitting the common objects between a
 210        sender and a receiver in order to reduce network transfer. This
 211        option only makes sense in conjunction with --stdout.
 212+
 213Note: A thin pack violates the packed archive format by omitting
 214required objects and is thus unusable by Git without making it
 215self-contained. Use `git index-pack --fix-thin`
 216(see linkgit:git-index-pack[1]) to restore the self-contained property.
 217
 218--shallow::
 219        Optimize a pack that will be provided to a client with a shallow
 220        repository.  This option, combined with --thin, can result in a
 221        smaller pack at the cost of speed.
 222
 223--delta-base-offset::
 224        A packed archive can express the base object of a delta as
 225        either a 20-byte object name or as an offset in the
 226        stream, but ancient versions of Git don't understand the
 227        latter.  By default, 'git pack-objects' only uses the
 228        former format for better compatibility.  This option
 229        allows the command to use the latter format for
 230        compactness.  Depending on the average delta chain
 231        length, this option typically shrinks the resulting
 232        packfile by 3-5 per-cent.
 233+
 234Note: Porcelain commands such as `git gc` (see linkgit:git-gc[1]),
 235`git repack` (see linkgit:git-repack[1]) pass this option by default
 236in modern Git when they put objects in your repository into pack files.
 237So does `git bundle` (see linkgit:git-bundle[1]) when it creates a bundle.
 238
 239--threads=<n>::
 240        Specifies the number of threads to spawn when searching for best
 241        delta matches.  This requires that pack-objects be compiled with
 242        pthreads otherwise this option is ignored with a warning.
 243        This is meant to reduce packing time on multiprocessor machines.
 244        The required amount of memory for the delta search window is
 245        however multiplied by the number of threads.
 246        Specifying 0 will cause Git to auto-detect the number of CPU's
 247        and set the number of threads accordingly.
 248
 249--index-version=<version>[,<offset>]::
 250        This is intended to be used by the test suite only. It allows
 251        to force the version for the generated pack index, and to force
 252        64-bit index entries on objects located above the given offset.
 253
 254--keep-true-parents::
 255        With this option, parents that are hidden by grafts are packed
 256        nevertheless.
 257
 258--filter=<filter-spec>::
 259        Requires `--stdout`.  Omits certain objects (usually blobs) from
 260        the resulting packfile.  See linkgit:git-rev-list[1] for valid
 261        `<filter-spec>` forms.
 262
 263--no-filter::
 264        Turns off any previous `--filter=` argument.
 265
 266--missing=<missing-action>::
 267        A debug option to help with future "partial clone" development.
 268        This option specifies how missing objects are handled.
 269+
 270The form '--missing=error' requests that pack-objects stop with an error if
 271a missing object is encountered.  This is the default action.
 272+
 273The form '--missing=allow-any' will allow object traversal to continue
 274if a missing object is encountered.  Missing objects will silently be
 275omitted from the results.
 276+
 277The form '--missing=allow-promisor' is like 'allow-any', but will only
 278allow object traversal to continue for EXPECTED promisor missing objects.
 279Unexpected missing object will raise an error.
 280
 281--exclude-promisor-objects::
 282        Omit objects that are known to be in the promisor remote.  (This
 283        option has the purpose of operating only on locally created objects,
 284        so that when we repack, we still maintain a distinction between
 285        locally created objects [without .promisor] and objects from the
 286        promisor remote [with .promisor].)  This is used with partial clone.
 287
 288--keep-unreachable::
 289        Objects unreachable from the refs in packs named with
 290        --unpacked= option are added to the resulting pack, in
 291        addition to the reachable objects that are not in packs marked
 292        with *.keep files. This implies `--revs`.
 293
 294--pack-loose-unreachable::
 295        Pack unreachable loose objects (and their loose counterparts
 296        removed). This implies `--revs`.
 297
 298--unpack-unreachable::
 299        Keep unreachable objects in loose form. This implies `--revs`.
 300
 301--delta-islands::
 302        Restrict delta matches based on "islands". See DELTA ISLANDS
 303        below.
 304
 305
 306DELTA ISLANDS
 307-------------
 308
 309When possible, `pack-objects` tries to reuse existing on-disk deltas to
 310avoid having to search for new ones on the fly. This is an important
 311optimization for serving fetches, because it means the server can avoid
 312inflating most objects at all and just send the bytes directly from
 313disk. This optimization can't work when an object is stored as a delta
 314against a base which the receiver does not have (and which we are not
 315already sending). In that case the server "breaks" the delta and has to
 316find a new one, which has a high CPU cost. Therefore it's important for
 317performance that the set of objects in on-disk delta relationships match
 318what a client would fetch.
 319
 320In a normal repository, this tends to work automatically. The objects
 321are mostly reachable from the branches and tags, and that's what clients
 322fetch. Any deltas we find on the server are likely to be between objects
 323the client has or will have.
 324
 325But in some repository setups, you may have several related but separate
 326groups of ref tips, with clients tending to fetch those groups
 327independently. For example, imagine that you are hosting several "forks"
 328of a repository in a single shared object store, and letting clients
 329view them as separate repositories through `GIT_NAMESPACE` or separate
 330repos using the alternates mechanism. A naive repack may find that the
 331optimal delta for an object is against a base that is only found in
 332another fork. But when a client fetches, they will not have the base
 333object, and we'll have to find a new delta on the fly.
 334
 335A similar situation may exist if you have many refs outside of
 336`refs/heads/` and `refs/tags/` that point to related objects (e.g.,
 337`refs/pull` or `refs/changes` used by some hosting providers). By
 338default, clients fetch only heads and tags, and deltas against objects
 339found only in those other groups cannot be sent as-is.
 340
 341Delta islands solve this problem by allowing you to group your refs into
 342distinct "islands". Pack-objects computes which objects are reachable
 343from which islands, and refuses to make a delta from an object `A`
 344against a base which is not present in all of `A`'s islands. This
 345results in slightly larger packs (because we miss some delta
 346opportunities), but guarantees that a fetch of one island will not have
 347to recompute deltas on the fly due to crossing island boundaries.
 348
 349When repacking with delta islands the delta window tends to get
 350clogged with candidates that are forbidden by the config. Repacking
 351with a big --window helps (and doesn't take as long as it otherwise
 352might because we can reject some object pairs based on islands before
 353doing any computation on the content).
 354
 355Islands are configured via the `pack.island` option, which can be
 356specified multiple times. Each value is a left-anchored regular
 357expressions matching refnames. For example:
 358
 359-------------------------------------------
 360[pack]
 361island = refs/heads/
 362island = refs/tags/
 363-------------------------------------------
 364
 365puts heads and tags into an island (whose name is the empty string; see
 366below for more on naming). Any refs which do not match those regular
 367expressions (e.g., `refs/pull/123`) is not in any island. Any object
 368which is reachable only from `refs/pull/` (but not heads or tags) is
 369therefore not a candidate to be used as a base for `refs/heads/`.
 370
 371Refs are grouped into islands based on their "names", and two regexes
 372that produce the same name are considered to be in the same
 373island. The names are computed from the regexes by concatenating any
 374capture groups from the regex, with a '-' dash in between. (And if
 375there are no capture groups, then the name is the empty string, as in
 376the above example.) This allows you to create arbitrary numbers of
 377islands. Only up to 14 such capture groups are supported though.
 378
 379For example, imagine you store the refs for each fork in
 380`refs/virtual/ID`, where `ID` is a numeric identifier. You might then
 381configure:
 382
 383-------------------------------------------
 384[pack]
 385island = refs/virtual/([0-9]+)/heads/
 386island = refs/virtual/([0-9]+)/tags/
 387island = refs/virtual/([0-9]+)/(pull)/
 388-------------------------------------------
 389
 390That puts the heads and tags for each fork in their own island (named
 391"1234" or similar), and the pull refs for each go into their own
 392"1234-pull".
 393
 394Note that we pick a single island for each regex to go into, using "last
 395one wins" ordering (which allows repo-specific config to take precedence
 396over user-wide config, and so forth).
 397
 398SEE ALSO
 399--------
 400linkgit:git-rev-list[1]
 401linkgit:git-repack[1]
 402linkgit:git-prune-packed[1]
 403
 404GIT
 405---
 406Part of the linkgit:git[1] suite