1git-fast-import(1) 2================== 3 4NAME 5---- 6git-fast-import - Backend for fast Git data importers 7 8 9SYNOPSIS 10-------- 11[verse] 12frontend | 'git fast-import' [options] 13 14DESCRIPTION 15----------- 16This program is usually not what the end user wants to run directly. 17Most end users want to use one of the existing frontend programs, 18which parses a specific type of foreign source and feeds the contents 19stored there to 'git fast-import'. 20 21fast-import reads a mixed command/data stream from standard input and 22writes one or more packfiles directly into the current repository. 23When EOF is received on standard input, fast import writes out 24updated branch and tag refs, fully updating the current repository 25with the newly imported data. 26 27The fast-import backend itself can import into an empty repository (one that 28has already been initialized by 'git init') or incrementally 29update an existing populated repository. Whether or not incremental 30imports are supported from a particular foreign source depends on 31the frontend program in use. 32 33 34OPTIONS 35------- 36--date-format=<fmt>:: 37 Specify the type of dates the frontend will supply to 38 fast-import within `author`, `committer` and `tagger` commands. 39 See ``Date Formats'' below for details about which formats 40 are supported, and their syntax. 41 42-- done:: 43 Terminate with error if there is no 'done' command at the 44 end of the stream. 45 46--force:: 47 Force updating modified existing branches, even if doing 48 so would cause commits to be lost (as the new commit does 49 not contain the old commit). 50 51--max-pack-size=<n>:: 52 Maximum size of each output packfile. 53 The default is unlimited. 54 55--big-file-threshold=<n>:: 56 Maximum size of a blob that fast-import will attempt to 57 create a delta for, expressed in bytes. The default is 512m 58 (512 MiB). Some importers may wish to lower this on systems 59 with constrained memory. 60 61--depth=<n>:: 62 Maximum delta depth, for blob and tree deltification. 63 Default is 10. 64 65--active-branches=<n>:: 66 Maximum number of branches to maintain active at once. 67 See ``Memory Utilization'' below for details. Default is 5. 68 69--export-marks=<file>:: 70 Dumps the internal marks table to <file> when complete. 71 Marks are written one per line as `:markid SHA-1`. 72 Frontends can use this file to validate imports after they 73 have been completed, or to save the marks table across 74 incremental runs. As <file> is only opened and truncated 75 at checkpoint (or completion) the same path can also be 76 safely given to \--import-marks. 77 78--import-marks=<file>:: 79 Before processing any input, load the marks specified in 80 <file>. The input file must exist, must be readable, and 81 must use the same format as produced by \--export-marks. 82 Multiple options may be supplied to import more than one 83 set of marks. If a mark is defined to different values, 84 the last file wins. 85 86--import-marks-if-exists=<file>:: 87 Like --import-marks but instead of erroring out, silently 88 skips the file if it does not exist. 89 90--relative-marks:: 91 After specifying --relative-marks the paths specified 92 with --import-marks= and --export-marks= are relative 93 to an internal directory in the current repository. 94 In git-fast-import this means that the paths are relative 95 to the .git/info/fast-import directory. However, other 96 importers may use a different location. 97 98--no-relative-marks:: 99 Negates a previous --relative-marks. Allows for combining 100 relative and non-relative marks by interweaving 101 --(no-)-relative-marks with the --(import|export)-marks= 102 options. 103 104--cat-blob-fd=<fd>:: 105 Write responses to `cat-blob` and `ls` queries to the 106 file descriptor <fd> instead of `stdout`. Allows `progress` 107 output intended for the end-user to be separated from other 108 output. 109 110--done:: 111 Require a `done` command at the end of the stream. 112 This option might be useful for detecting errors that 113 cause the frontend to terminate before it has started to 114 write a stream. 115 116--export-pack-edges=<file>:: 117 After creating a packfile, print a line of data to 118 <file> listing the filename of the packfile and the last 119 commit on each branch that was written to that packfile. 120 This information may be useful after importing projects 121 whose total object set exceeds the 4 GiB packfile limit, 122 as these commits can be used as edge points during calls 123 to 'git pack-objects'. 124 125--quiet:: 126 Disable all non-fatal output, making fast-import silent when it 127 is successful. This option disables the output shown by 128 \--stats. 129 130--stats:: 131 Display some basic statistics about the objects fast-import has 132 created, the packfiles they were stored into, and the 133 memory used by fast-import during this run. Showing this output 134 is currently the default, but can be disabled with \--quiet. 135 136 137Performance 138----------- 139The design of fast-import allows it to import large projects in a minimum 140amount of memory usage and processing time. Assuming the frontend 141is able to keep up with fast-import and feed it a constant stream of data, 142import times for projects holding 10+ years of history and containing 143100,000+ individual commits are generally completed in just 1-2 144hours on quite modest (~$2,000 USD) hardware. 145 146Most bottlenecks appear to be in foreign source data access (the 147source just cannot extract revisions fast enough) or disk IO (fast-import 148writes as fast as the disk will take the data). Imports will run 149faster if the source data is stored on a different drive than the 150destination Git repository (due to less IO contention). 151 152 153Development Cost 154---------------- 155A typical frontend for fast-import tends to weigh in at approximately 200 156lines of Perl/Python/Ruby code. Most developers have been able to 157create working importers in just a couple of hours, even though it 158is their first exposure to fast-import, and sometimes even to Git. This is 159an ideal situation, given that most conversion tools are throw-away 160(use once, and never look back). 161 162 163Parallel Operation 164------------------ 165Like 'git push' or 'git fetch', imports handled by fast-import are safe to 166run alongside parallel `git repack -a -d` or `git gc` invocations, 167or any other Git operation (including 'git prune', as loose objects 168are never used by fast-import). 169 170fast-import does not lock the branch or tag refs it is actively importing. 171After the import, during its ref update phase, fast-import tests each 172existing branch ref to verify the update will be a fast-forward 173update (the commit stored in the ref is contained in the new 174history of the commit to be written). If the update is not a 175fast-forward update, fast-import will skip updating that ref and instead 176prints a warning message. fast-import will always attempt to update all 177branch refs, and does not stop on the first failure. 178 179Branch updates can be forced with \--force, but it's recommended that 180this only be used on an otherwise quiet repository. Using \--force 181is not necessary for an initial import into an empty repository. 182 183 184Technical Discussion 185-------------------- 186fast-import tracks a set of branches in memory. Any branch can be created 187or modified at any point during the import process by sending a 188`commit` command on the input stream. This design allows a frontend 189program to process an unlimited number of branches simultaneously, 190generating commits in the order they are available from the source 191data. It also simplifies the frontend programs considerably. 192 193fast-import does not use or alter the current working directory, or any 194file within it. (It does however update the current Git repository, 195as referenced by `GIT_DIR`.) Therefore an import frontend may use 196the working directory for its own purposes, such as extracting file 197revisions from the foreign source. This ignorance of the working 198directory also allows fast-import to run very quickly, as it does not 199need to perform any costly file update operations when switching 200between branches. 201 202Input Format 203------------ 204With the exception of raw file data (which Git does not interpret) 205the fast-import input format is text (ASCII) based. This text based 206format simplifies development and debugging of frontend programs, 207especially when a higher level language such as Perl, Python or 208Ruby is being used. 209 210fast-import is very strict about its input. Where we say SP below we mean 211*exactly* one space. Likewise LF means one (and only one) linefeed 212and HT one (and only one) horizontal tab. 213Supplying additional whitespace characters will cause unexpected 214results, such as branch names or file names with leading or trailing 215spaces in their name, or early termination of fast-import when it encounters 216unexpected input. 217 218Stream Comments 219~~~~~~~~~~~~~~~ 220To aid in debugging frontends fast-import ignores any line that 221begins with `#` (ASCII pound/hash) up to and including the line 222ending `LF`. A comment line may contain any sequence of bytes 223that does not contain an LF and therefore may be used to include 224any detailed debugging information that might be specific to the 225frontend and useful when inspecting a fast-import data stream. 226 227Date Formats 228~~~~~~~~~~~~ 229The following date formats are supported. A frontend should select 230the format it will use for this import by passing the format name 231in the \--date-format=<fmt> command line option. 232 233`raw`:: 234 This is the Git native format and is `<time> SP <offutc>`. 235 It is also fast-import's default format, if \--date-format was 236 not specified. 237+ 238The time of the event is specified by `<time>` as the number of 239seconds since the UNIX epoch (midnight, Jan 1, 1970, UTC) and is 240written as an ASCII decimal integer. 241+ 242The local offset is specified by `<offutc>` as a positive or negative 243offset from UTC. For example EST (which is 5 hours behind UTC) 244would be expressed in `<tz>` by ``-0500'' while UTC is ``+0000''. 245The local offset does not affect `<time>`; it is used only as an 246advisement to help formatting routines display the timestamp. 247+ 248If the local offset is not available in the source material, use 249``+0000'', or the most common local offset. For example many 250organizations have a CVS repository which has only ever been accessed 251by users who are located in the same location and timezone. In this 252case a reasonable offset from UTC could be assumed. 253+ 254Unlike the `rfc2822` format, this format is very strict. Any 255variation in formatting will cause fast-import to reject the value. 256 257`rfc2822`:: 258 This is the standard email format as described by RFC 2822. 259+ 260An example value is ``Tue Feb 6 11:22:18 2007 -0500''. The Git 261parser is accurate, but a little on the lenient side. It is the 262same parser used by 'git am' when applying patches 263received from email. 264+ 265Some malformed strings may be accepted as valid dates. In some of 266these cases Git will still be able to obtain the correct date from 267the malformed string. There are also some types of malformed 268strings which Git will parse wrong, and yet consider valid. 269Seriously malformed strings will be rejected. 270+ 271Unlike the `raw` format above, the timezone/UTC offset information 272contained in an RFC 2822 date string is used to adjust the date 273value to UTC prior to storage. Therefore it is important that 274this information be as accurate as possible. 275+ 276If the source material uses RFC 2822 style dates, 277the frontend should let fast-import handle the parsing and conversion 278(rather than attempting to do it itself) as the Git parser has 279been well tested in the wild. 280+ 281Frontends should prefer the `raw` format if the source material 282already uses UNIX-epoch format, can be coaxed to give dates in that 283format, or its format is easily convertible to it, as there is no 284ambiguity in parsing. 285 286`now`:: 287 Always use the current time and timezone. The literal 288 `now` must always be supplied for `<when>`. 289+ 290This is a toy format. The current time and timezone of this system 291is always copied into the identity string at the time it is being 292created by fast-import. There is no way to specify a different time or 293timezone. 294+ 295This particular format is supplied as it's short to implement and 296may be useful to a process that wants to create a new commit 297right now, without needing to use a working directory or 298'git update-index'. 299+ 300If separate `author` and `committer` commands are used in a `commit` 301the timestamps may not match, as the system clock will be polled 302twice (once for each command). The only way to ensure that both 303author and committer identity information has the same timestamp 304is to omit `author` (thus copying from `committer`) or to use a 305date format other than `now`. 306 307Commands 308~~~~~~~~ 309fast-import accepts several commands to update the current repository 310and control the current import process. More detailed discussion 311(with examples) of each command follows later. 312 313`commit`:: 314 Creates a new branch or updates an existing branch by 315 creating a new commit and updating the branch to point at 316 the newly created commit. 317 318`tag`:: 319 Creates an annotated tag object from an existing commit or 320 branch. Lightweight tags are not supported by this command, 321 as they are not recommended for recording meaningful points 322 in time. 323 324`reset`:: 325 Reset an existing branch (or a new branch) to a specific 326 revision. This command must be used to change a branch to 327 a specific revision without making a commit on it. 328 329`blob`:: 330 Convert raw file data into a blob, for future use in a 331 `commit` command. This command is optional and is not 332 needed to perform an import. 333 334`checkpoint`:: 335 Forces fast-import to close the current packfile, generate its 336 unique SHA-1 checksum and index, and start a new packfile. 337 This command is optional and is not needed to perform 338 an import. 339 340`progress`:: 341 Causes fast-import to echo the entire line to its own 342 standard output. This command is optional and is not needed 343 to perform an import. 344 345`done`:: 346 Marks the end of the stream. This command is optional 347 unless the `done` feature was requested using the 348 `--done` command line option or `feature done` command. 349 350`cat-blob`:: 351 Causes fast-import to print a blob in 'cat-file --batch' 352 format to the file descriptor set with `--cat-blob-fd` or 353 `stdout` if unspecified. 354 355`ls`:: 356 Causes fast-import to print a line describing a directory 357 entry in 'ls-tree' format to the file descriptor set with 358 `--cat-blob-fd` or `stdout` if unspecified. 359 360`feature`:: 361 Require that fast-import supports the specified feature, or 362 abort if it does not. 363 364`option`:: 365 Specify any of the options listed under OPTIONS that do not 366 change stream semantic to suit the frontend's needs. This 367 command is optional and is not needed to perform an import. 368 369`commit` 370~~~~~~~~ 371Create or update a branch with a new commit, recording one logical 372change to the project. 373 374.... 375 'commit' SP <ref> LF 376 mark? 377 ('author' (SP <name>)? SP LT <email> GT SP <when> LF)? 378 'committer' (SP <name>)? SP LT <email> GT SP <when> LF 379 data 380 ('from' SP <committish> LF)? 381 ('merge' SP <committish> LF)? 382 (filemodify | filedelete | filecopy | filerename | filedeleteall | notemodify)* 383 LF? 384.... 385 386where `<ref>` is the name of the branch to make the commit on. 387Typically branch names are prefixed with `refs/heads/` in 388Git, so importing the CVS branch symbol `RELENG-1_0` would use 389`refs/heads/RELENG-1_0` for the value of `<ref>`. The value of 390`<ref>` must be a valid refname in Git. As `LF` is not valid in 391a Git refname, no quoting or escaping syntax is supported here. 392 393A `mark` command may optionally appear, requesting fast-import to save a 394reference to the newly created commit for future use by the frontend 395(see below for format). It is very common for frontends to mark 396every commit they create, thereby allowing future branch creation 397from any imported commit. 398 399The `data` command following `committer` must supply the commit 400message (see below for `data` command syntax). To import an empty 401commit message use a 0 length data. Commit messages are free-form 402and are not interpreted by Git. Currently they must be encoded in 403UTF-8, as fast-import does not permit other encodings to be specified. 404 405Zero or more `filemodify`, `filedelete`, `filecopy`, `filerename`, 406`filedeleteall` and `notemodify` commands 407may be included to update the contents of the branch prior to 408creating the commit. These commands may be supplied in any order. 409However it is recommended that a `filedeleteall` command precede 410all `filemodify`, `filecopy`, `filerename` and `notemodify` commands in 411the same commit, as `filedeleteall` wipes the branch clean (see below). 412 413The `LF` after the command is optional (it used to be required). 414 415`author` 416^^^^^^^^ 417An `author` command may optionally appear, if the author information 418might differ from the committer information. If `author` is omitted 419then fast-import will automatically use the committer's information for 420the author portion of the commit. See below for a description of 421the fields in `author`, as they are identical to `committer`. 422 423`committer` 424^^^^^^^^^^^ 425The `committer` command indicates who made this commit, and when 426they made it. 427 428Here `<name>` is the person's display name (for example 429``Com M Itter'') and `<email>` is the person's email address 430(``\cm@example.com''). `LT` and `GT` are the literal less-than (\x3c) 431and greater-than (\x3e) symbols. These are required to delimit 432the email address from the other fields in the line. Note that 433`<name>` and `<email>` are free-form and may contain any sequence 434of bytes, except `LT`, `GT` and `LF`. `<name>` is typically UTF-8 encoded. 435 436The time of the change is specified by `<when>` using the date format 437that was selected by the \--date-format=<fmt> command line option. 438See ``Date Formats'' above for the set of supported formats, and 439their syntax. 440 441`from` 442^^^^^^ 443The `from` command is used to specify the commit to initialize 444this branch from. This revision will be the first ancestor of the 445new commit. The state of the tree built at this commit will begin 446with the state at the `from` commit, and be altered by the content 447modifications in this commit. 448 449Omitting the `from` command in the first commit of a new branch 450will cause fast-import to create that commit with no ancestor. This 451tends to be desired only for the initial commit of a project. 452If the frontend creates all files from scratch when making a new 453branch, a `merge` command may be used instead of `from` to start 454the commit with an empty tree. 455Omitting the `from` command on existing branches is usually desired, 456as the current commit on that branch is automatically assumed to 457be the first ancestor of the new commit. 458 459As `LF` is not valid in a Git refname or SHA-1 expression, no 460quoting or escaping syntax is supported within `<committish>`. 461 462Here `<committish>` is any of the following: 463 464* The name of an existing branch already in fast-import's internal branch 465 table. If fast-import doesn't know the name, it's treated as a SHA-1 466 expression. 467 468* A mark reference, `:<idnum>`, where `<idnum>` is the mark number. 469+ 470The reason fast-import uses `:` to denote a mark reference is this character 471is not legal in a Git branch name. The leading `:` makes it easy 472to distinguish between the mark 42 (`:42`) and the branch 42 (`42` 473or `refs/heads/42`), or an abbreviated SHA-1 which happened to 474consist only of base-10 digits. 475+ 476Marks must be declared (via `mark`) before they can be used. 477 478* A complete 40 byte or abbreviated commit SHA-1 in hex. 479 480* Any valid Git SHA-1 expression that resolves to a commit. See 481 ``SPECIFYING REVISIONS'' in linkgit:gitrevisions[7] for details. 482 483The special case of restarting an incremental import from the 484current branch value should be written as: 485---- 486 from refs/heads/branch^0 487---- 488The `^0` suffix is necessary as fast-import does not permit a branch to 489start from itself, and the branch is created in memory before the 490`from` command is even read from the input. Adding `^0` will force 491fast-import to resolve the commit through Git's revision parsing library, 492rather than its internal branch table, thereby loading in the 493existing value of the branch. 494 495`merge` 496^^^^^^^ 497Includes one additional ancestor commit. The additional ancestry 498link does not change the way the tree state is built at this commit. 499If the `from` command is 500omitted when creating a new branch, the first `merge` commit will be 501the first ancestor of the current commit, and the branch will start 502out with no files. An unlimited number of `merge` commands per 503commit are permitted by fast-import, thereby establishing an n-way merge. 504However Git's other tools never create commits with more than 15 505additional ancestors (forming a 16-way merge). For this reason 506it is suggested that frontends do not use more than 15 `merge` 507commands per commit; 16, if starting a new, empty branch. 508 509Here `<committish>` is any of the commit specification expressions 510also accepted by `from` (see above). 511 512`filemodify` 513^^^^^^^^^^^^ 514Included in a `commit` command to add a new file or change the 515content of an existing file. This command has two different means 516of specifying the content of the file. 517 518External data format:: 519 The data content for the file was already supplied by a prior 520 `blob` command. The frontend just needs to connect it. 521+ 522.... 523 'M' SP <mode> SP <dataref> SP <path> LF 524.... 525+ 526Here usually `<dataref>` must be either a mark reference (`:<idnum>`) 527set by a prior `blob` command, or a full 40-byte SHA-1 of an 528existing Git blob object. If `<mode>` is `040000`` then 529`<dataref>` must be the full 40-byte SHA-1 of an existing 530Git tree object or a mark reference set with `--import-marks`. 531 532Inline data format:: 533 The data content for the file has not been supplied yet. 534 The frontend wants to supply it as part of this modify 535 command. 536+ 537.... 538 'M' SP <mode> SP 'inline' SP <path> LF 539 data 540.... 541+ 542See below for a detailed description of the `data` command. 543 544In both formats `<mode>` is the type of file entry, specified 545in octal. Git only supports the following modes: 546 547* `100644` or `644`: A normal (not-executable) file. The majority 548 of files in most projects use this mode. If in doubt, this is 549 what you want. 550* `100755` or `755`: A normal, but executable, file. 551* `120000`: A symlink, the content of the file will be the link target. 552* `160000`: A gitlink, SHA-1 of the object refers to a commit in 553 another repository. Git links can only be specified by SHA or through 554 a commit mark. They are used to implement submodules. 555* `040000`: A subdirectory. Subdirectories can only be specified by 556 SHA or through a tree mark set with `--import-marks`. 557 558In both formats `<path>` is the complete path of the file to be added 559(if not already existing) or modified (if already existing). 560 561A `<path>` string must use UNIX-style directory separators (forward 562slash `/`), may contain any byte other than `LF`, and must not 563start with double quote (`"`). 564 565A path can use C-style string quoting; this is accepted in all cases 566and mandatory if the filename starts with double quote or contains 567`LF`. In C-style quoting, the complete name should be surrounded with 568double quotes, and any `LF`, backslash, or double quote characters 569must be escaped by preceding them with a backslash (e.g., 570`"path/with\n, \\ and \" in it"`). 571 572The value of `<path>` must be in canonical form. That is it must not: 573 574* contain an empty directory component (e.g. `foo//bar` is invalid), 575* end with a directory separator (e.g. `foo/` is invalid), 576* start with a directory separator (e.g. `/foo` is invalid), 577* contain the special component `.` or `..` (e.g. `foo/./bar` and 578 `foo/../bar` are invalid). 579 580The root of the tree can be represented by an empty string as `<path>`. 581 582It is recommended that `<path>` always be encoded using UTF-8. 583 584`filedelete` 585^^^^^^^^^^^^ 586Included in a `commit` command to remove a file or recursively 587delete an entire directory from the branch. If the file or directory 588removal makes its parent directory empty, the parent directory will 589be automatically removed too. This cascades up the tree until the 590first non-empty directory or the root is reached. 591 592.... 593 'D' SP <path> LF 594.... 595 596here `<path>` is the complete path of the file or subdirectory to 597be removed from the branch. 598See `filemodify` above for a detailed description of `<path>`. 599 600`filecopy` 601^^^^^^^^^^^^ 602Recursively copies an existing file or subdirectory to a different 603location within the branch. The existing file or directory must 604exist. If the destination exists it will be completely replaced 605by the content copied from the source. 606 607.... 608 'C' SP <path> SP <path> LF 609.... 610 611here the first `<path>` is the source location and the second 612`<path>` is the destination. See `filemodify` above for a detailed 613description of what `<path>` may look like. To use a source path 614that contains SP the path must be quoted. 615 616A `filecopy` command takes effect immediately. Once the source 617location has been copied to the destination any future commands 618applied to the source location will not impact the destination of 619the copy. 620 621`filerename` 622^^^^^^^^^^^^ 623Renames an existing file or subdirectory to a different location 624within the branch. The existing file or directory must exist. If 625the destination exists it will be replaced by the source directory. 626 627.... 628 'R' SP <path> SP <path> LF 629.... 630 631here the first `<path>` is the source location and the second 632`<path>` is the destination. See `filemodify` above for a detailed 633description of what `<path>` may look like. To use a source path 634that contains SP the path must be quoted. 635 636A `filerename` command takes effect immediately. Once the source 637location has been renamed to the destination any future commands 638applied to the source location will create new files there and not 639impact the destination of the rename. 640 641Note that a `filerename` is the same as a `filecopy` followed by a 642`filedelete` of the source location. There is a slight performance 643advantage to using `filerename`, but the advantage is so small 644that it is never worth trying to convert a delete/add pair in 645source material into a rename for fast-import. This `filerename` 646command is provided just to simplify frontends that already have 647rename information and don't want bother with decomposing it into a 648`filecopy` followed by a `filedelete`. 649 650`filedeleteall` 651^^^^^^^^^^^^^^^ 652Included in a `commit` command to remove all files (and also all 653directories) from the branch. This command resets the internal 654branch structure to have no files in it, allowing the frontend 655to subsequently add all interesting files from scratch. 656 657.... 658 'deleteall' LF 659.... 660 661This command is extremely useful if the frontend does not know 662(or does not care to know) what files are currently on the branch, 663and therefore cannot generate the proper `filedelete` commands to 664update the content. 665 666Issuing a `filedeleteall` followed by the needed `filemodify` 667commands to set the correct content will produce the same results 668as sending only the needed `filemodify` and `filedelete` commands. 669The `filedeleteall` approach may however require fast-import to use slightly 670more memory per active branch (less than 1 MiB for even most large 671projects); so frontends that can easily obtain only the affected 672paths for a commit are encouraged to do so. 673 674`notemodify` 675^^^^^^^^^^^^ 676Included in a `commit` `<notes_ref>` command to add a new note 677annotating a `<committish>` or change this annotation contents. 678Internally it is similar to filemodify 100644 on `<committish>` 679path (maybe split into subdirectories). It's not advised to 680use any other commands to write to the `<notes_ref>` tree except 681`filedeleteall` to delete all existing notes in this tree. 682This command has two different means of specifying the content 683of the note. 684 685External data format:: 686 The data content for the note was already supplied by a prior 687 `blob` command. The frontend just needs to connect it to the 688 commit that is to be annotated. 689+ 690.... 691 'N' SP <dataref> SP <committish> LF 692.... 693+ 694Here `<dataref>` can be either a mark reference (`:<idnum>`) 695set by a prior `blob` command, or a full 40-byte SHA-1 of an 696existing Git blob object. 697 698Inline data format:: 699 The data content for the note has not been supplied yet. 700 The frontend wants to supply it as part of this modify 701 command. 702+ 703.... 704 'N' SP 'inline' SP <committish> LF 705 data 706.... 707+ 708See below for a detailed description of the `data` command. 709 710In both formats `<committish>` is any of the commit specification 711expressions also accepted by `from` (see above). 712 713`mark` 714~~~~~~ 715Arranges for fast-import to save a reference to the current object, allowing 716the frontend to recall this object at a future point in time, without 717knowing its SHA-1. Here the current object is the object creation 718command the `mark` command appears within. This can be `commit`, 719`tag`, and `blob`, but `commit` is the most common usage. 720 721.... 722 'mark' SP ':' <idnum> LF 723.... 724 725where `<idnum>` is the number assigned by the frontend to this mark. 726The value of `<idnum>` is expressed as an ASCII decimal integer. 727The value 0 is reserved and cannot be used as 728a mark. Only values greater than or equal to 1 may be used as marks. 729 730New marks are created automatically. Existing marks can be moved 731to another object simply by reusing the same `<idnum>` in another 732`mark` command. 733 734`tag` 735~~~~~ 736Creates an annotated tag referring to a specific commit. To create 737lightweight (non-annotated) tags see the `reset` command below. 738 739.... 740 'tag' SP <name> LF 741 'from' SP <committish> LF 742 'tagger' (SP <name>)? SP LT <email> GT SP <when> LF 743 data 744.... 745 746where `<name>` is the name of the tag to create. 747 748Tag names are automatically prefixed with `refs/tags/` when stored 749in Git, so importing the CVS branch symbol `RELENG-1_0-FINAL` would 750use just `RELENG-1_0-FINAL` for `<name>`, and fast-import will write the 751corresponding ref as `refs/tags/RELENG-1_0-FINAL`. 752 753The value of `<name>` must be a valid refname in Git and therefore 754may contain forward slashes. As `LF` is not valid in a Git refname, 755no quoting or escaping syntax is supported here. 756 757The `from` command is the same as in the `commit` command; see 758above for details. 759 760The `tagger` command uses the same format as `committer` within 761`commit`; again see above for details. 762 763The `data` command following `tagger` must supply the annotated tag 764message (see below for `data` command syntax). To import an empty 765tag message use a 0 length data. Tag messages are free-form and are 766not interpreted by Git. Currently they must be encoded in UTF-8, 767as fast-import does not permit other encodings to be specified. 768 769Signing annotated tags during import from within fast-import is not 770supported. Trying to include your own PGP/GPG signature is not 771recommended, as the frontend does not (easily) have access to the 772complete set of bytes which normally goes into such a signature. 773If signing is required, create lightweight tags from within fast-import with 774`reset`, then create the annotated versions of those tags offline 775with the standard 'git tag' process. 776 777`reset` 778~~~~~~~ 779Creates (or recreates) the named branch, optionally starting from 780a specific revision. The reset command allows a frontend to issue 781a new `from` command for an existing branch, or to create a new 782branch from an existing commit without creating a new commit. 783 784.... 785 'reset' SP <ref> LF 786 ('from' SP <committish> LF)? 787 LF? 788.... 789 790For a detailed description of `<ref>` and `<committish>` see above 791under `commit` and `from`. 792 793The `LF` after the command is optional (it used to be required). 794 795The `reset` command can also be used to create lightweight 796(non-annotated) tags. For example: 797 798==== 799 reset refs/tags/938 800 from :938 801==== 802 803would create the lightweight tag `refs/tags/938` referring to 804whatever commit mark `:938` references. 805 806`blob` 807~~~~~~ 808Requests writing one file revision to the packfile. The revision 809is not connected to any commit; this connection must be formed in 810a subsequent `commit` command by referencing the blob through an 811assigned mark. 812 813.... 814 'blob' LF 815 mark? 816 data 817.... 818 819The mark command is optional here as some frontends have chosen 820to generate the Git SHA-1 for the blob on their own, and feed that 821directly to `commit`. This is typically more work than it's worth 822however, as marks are inexpensive to store and easy to use. 823 824`data` 825~~~~~~ 826Supplies raw data (for use as blob/file content, commit messages, or 827annotated tag messages) to fast-import. Data can be supplied using an exact 828byte count or delimited with a terminating line. Real frontends 829intended for production-quality conversions should always use the 830exact byte count format, as it is more robust and performs better. 831The delimited format is intended primarily for testing fast-import. 832 833Comment lines appearing within the `<raw>` part of `data` commands 834are always taken to be part of the body of the data and are therefore 835never ignored by fast-import. This makes it safe to import any 836file/message content whose lines might start with `#`. 837 838Exact byte count format:: 839 The frontend must specify the number of bytes of data. 840+ 841.... 842 'data' SP <count> LF 843 <raw> LF? 844.... 845+ 846where `<count>` is the exact number of bytes appearing within 847`<raw>`. The value of `<count>` is expressed as an ASCII decimal 848integer. The `LF` on either side of `<raw>` is not 849included in `<count>` and will not be included in the imported data. 850+ 851The `LF` after `<raw>` is optional (it used to be required) but 852recommended. Always including it makes debugging a fast-import 853stream easier as the next command always starts in column 0 854of the next line, even if `<raw>` did not end with an `LF`. 855 856Delimited format:: 857 A delimiter string is used to mark the end of the data. 858 fast-import will compute the length by searching for the delimiter. 859 This format is primarily useful for testing and is not 860 recommended for real data. 861+ 862.... 863 'data' SP '<<' <delim> LF 864 <raw> LF 865 <delim> LF 866 LF? 867.... 868+ 869where `<delim>` is the chosen delimiter string. The string `<delim>` 870must not appear on a line by itself within `<raw>`, as otherwise 871fast-import will think the data ends earlier than it really does. The `LF` 872immediately trailing `<raw>` is part of `<raw>`. This is one of 873the limitations of the delimited format, it is impossible to supply 874a data chunk which does not have an LF as its last byte. 875+ 876The `LF` after `<delim> LF` is optional (it used to be required). 877 878`checkpoint` 879~~~~~~~~~~~~ 880Forces fast-import to close the current packfile, start a new one, and to 881save out all current branch refs, tags and marks. 882 883.... 884 'checkpoint' LF 885 LF? 886.... 887 888Note that fast-import automatically switches packfiles when the current 889packfile reaches \--max-pack-size, or 4 GiB, whichever limit is 890smaller. During an automatic packfile switch fast-import does not update 891the branch refs, tags or marks. 892 893As a `checkpoint` can require a significant amount of CPU time and 894disk IO (to compute the overall pack SHA-1 checksum, generate the 895corresponding index file, and update the refs) it can easily take 896several minutes for a single `checkpoint` command to complete. 897 898Frontends may choose to issue checkpoints during extremely large 899and long running imports, or when they need to allow another Git 900process access to a branch. However given that a 30 GiB Subversion 901repository can be loaded into Git through fast-import in about 3 hours, 902explicit checkpointing may not be necessary. 903 904The `LF` after the command is optional (it used to be required). 905 906`progress` 907~~~~~~~~~~ 908Causes fast-import to print the entire `progress` line unmodified to 909its standard output channel (file descriptor 1) when the command is 910processed from the input stream. The command otherwise has no impact 911on the current import, or on any of fast-import's internal state. 912 913.... 914 'progress' SP <any> LF 915 LF? 916.... 917 918The `<any>` part of the command may contain any sequence of bytes 919that does not contain `LF`. The `LF` after the command is optional. 920Callers may wish to process the output through a tool such as sed to 921remove the leading part of the line, for example: 922 923==== 924 frontend | git fast-import | sed 's/^progress //' 925==== 926 927Placing a `progress` command immediately after a `checkpoint` will 928inform the reader when the `checkpoint` has been completed and it 929can safely access the refs that fast-import updated. 930 931`cat-blob` 932~~~~~~~~~~ 933Causes fast-import to print a blob to a file descriptor previously 934arranged with the `--cat-blob-fd` argument. The command otherwise 935has no impact on the current import; its main purpose is to 936retrieve blobs that may be in fast-import's memory but not 937accessible from the target repository. 938 939.... 940 'cat-blob' SP <dataref> LF 941.... 942 943The `<dataref>` can be either a mark reference (`:<idnum>`) 944set previously or a full 40-byte SHA-1 of a Git blob, preexisting or 945ready to be written. 946 947Output uses the same format as `git cat-file --batch`: 948 949==== 950 <sha1> SP 'blob' SP <size> LF 951 <contents> LF 952==== 953 954This command can be used anywhere in the stream that comments are 955accepted. In particular, the `cat-blob` command can be used in the 956middle of a commit but not in the middle of a `data` command. 957 958See ``Responses To Commands'' below for details about how to read 959this output safely. 960 961`ls` 962~~~~ 963Prints information about the object at a path to a file descriptor 964previously arranged with the `--cat-blob-fd` argument. This allows 965printing a blob from the active commit (with `cat-blob`) or copying a 966blob or tree from a previous commit for use in the current one (with 967`filemodify`). 968 969The `ls` command can be used anywhere in the stream that comments are 970accepted, including the middle of a commit. 971 972Reading from the active commit:: 973 This form can only be used in the middle of a `commit`. 974 The path names a directory entry within fast-import's 975 active commit. The path must be quoted in this case. 976+ 977.... 978 'ls' SP <path> LF 979.... 980 981Reading from a named tree:: 982 The `<dataref>` can be a mark reference (`:<idnum>`) or the 983 full 40-byte SHA-1 of a Git tag, commit, or tree object, 984 preexisting or waiting to be written. 985 The path is relative to the top level of the tree 986 named by `<dataref>`. 987+ 988.... 989 'ls' SP <dataref> SP <path> LF 990.... 991 992See `filemodify` above for a detailed description of `<path>`. 993 994Output uses the same format as `git ls-tree <tree> -- <path>`: 995 996==== 997 <mode> SP ('blob' | 'tree' | 'commit') SP <dataref> HT <path> LF 998==== 9991000The <dataref> represents the blob, tree, or commit object at <path>1001and can be used in later 'cat-blob', 'filemodify', or 'ls' commands.10021003If there is no file or subtree at that path, 'git fast-import' will1004instead report10051006====1007 missing SP <path> LF1008====10091010See ``Responses To Commands'' below for details about how to read1011this output safely.10121013`feature`1014~~~~~~~~~1015Require that fast-import supports the specified feature, or abort if1016it does not.10171018....1019 'feature' SP <feature> ('=' <argument>)? LF1020....10211022The <feature> part of the command may be any one of the following:10231024date-format::1025export-marks::1026relative-marks::1027no-relative-marks::1028force::1029 Act as though the corresponding command-line option with1030 a leading '--' was passed on the command line1031 (see OPTIONS, above).10321033import-marks::1034import-marks-if-exists::1035 Like --import-marks except in two respects: first, only one1036 "feature import-marks" or "feature import-marks-if-exists"1037 command is allowed per stream; second, an --import-marks=1038 or --import-marks-if-exists command-line option overrides1039 any of these "feature" commands in the stream; third,1040 "feature import-marks-if-exists" like a corresponding1041 command-line option silently skips a nonexistent file.10421043cat-blob::1044ls::1045 Require that the backend support the 'cat-blob' or 'ls' command.1046 Versions of fast-import not supporting the specified command1047 will exit with a message indicating so.1048 This lets the import error out early with a clear message,1049 rather than wasting time on the early part of an import1050 before the unsupported command is detected.10511052notes::1053 Require that the backend support the 'notemodify' (N)1054 subcommand to the 'commit' command.1055 Versions of fast-import not supporting notes will exit1056 with a message indicating so.10571058done::1059 Error out if the stream ends without a 'done' command.1060 Without this feature, errors causing the frontend to end1061 abruptly at a convenient point in the stream can go1062 undetected. This may occur, for example, if an import1063 front end dies in mid-operation without emitting SIGTERM1064 or SIGKILL at its subordinate git fast-import instance.10651066`option`1067~~~~~~~~1068Processes the specified option so that git fast-import behaves in a1069way that suits the frontend's needs.1070Note that options specified by the frontend are overridden by any1071options the user may specify to git fast-import itself.10721073....1074 'option' SP <option> LF1075....10761077The `<option>` part of the command may contain any of the options1078listed in the OPTIONS section that do not change import semantics,1079without the leading '--' and is treated in the same way.10801081Option commands must be the first commands on the input (not counting1082feature commands), to give an option command after any non-option1083command is an error.10841085The following commandline options change import semantics and may therefore1086not be passed as option:10871088* date-format1089* import-marks1090* export-marks1091* cat-blob-fd1092* force10931094`done`1095~~~~~~1096If the `done` feature is not in use, treated as if EOF was read.1097This can be used to tell fast-import to finish early.10981099If the `--done` command line option or `feature done` command is1100in use, the `done` command is mandatory and marks the end of the1101stream.11021103Responses To Commands1104---------------------1105New objects written by fast-import are not available immediately.1106Most fast-import commands have no visible effect until the next1107checkpoint (or completion). The frontend can send commands to1108fill fast-import's input pipe without worrying about how quickly1109they will take effect, which improves performance by simplifying1110scheduling.11111112For some frontends, though, it is useful to be able to read back1113data from the current repository as it is being updated (for1114example when the source material describes objects in terms of1115patches to be applied to previously imported objects). This can1116be accomplished by connecting the frontend and fast-import via1117bidirectional pipes:11181119====1120 mkfifo fast-import-output1121 frontend <fast-import-output |1122 git fast-import >fast-import-output1123====11241125A frontend set up this way can use `progress`, `ls`, and `cat-blob`1126commands to read information from the import in progress.11271128To avoid deadlock, such frontends must completely consume any1129pending output from `progress`, `ls`, and `cat-blob` before1130performing writes to fast-import that might block.11311132Crash Reports1133-------------1134If fast-import is supplied invalid input it will terminate with a1135non-zero exit status and create a crash report in the top level of1136the Git repository it was importing into. Crash reports contain1137a snapshot of the internal fast-import state as well as the most1138recent commands that lead up to the crash.11391140All recent commands (including stream comments, file changes and1141progress commands) are shown in the command history within the crash1142report, but raw file data and commit messages are excluded from the1143crash report. This exclusion saves space within the report file1144and reduces the amount of buffering that fast-import must perform1145during execution.11461147After writing a crash report fast-import will close the current1148packfile and export the marks table. This allows the frontend1149developer to inspect the repository state and resume the import from1150the point where it crashed. The modified branches and tags are not1151updated during a crash, as the import did not complete successfully.1152Branch and tag information can be found in the crash report and1153must be applied manually if the update is needed.11541155An example crash:11561157====1158 $ cat >in <<END_OF_INPUT1159 # my very first test commit1160 commit refs/heads/master1161 committer Shawn O. Pearce <spearce> 19283 -04001162 # who is that guy anyway?1163 data <<EOF1164 this is my commit1165 EOF1166 M 644 inline .gitignore1167 data <<EOF1168 .gitignore1169 EOF1170 M 777 inline bob1171 END_OF_INPUT11721173 $ git fast-import <in1174 fatal: Corrupt mode: M 777 inline bob1175 fast-import: dumping crash report to .git/fast_import_crash_843411761177 $ cat .git/fast_import_crash_84341178 fast-import crash report:1179 fast-import process: 84341180 parent process : 13911181 at Sat Sep 1 00:58:12 200711821183 fatal: Corrupt mode: M 777 inline bob11841185 Most Recent Commands Before Crash1186 ---------------------------------1187 # my very first test commit1188 commit refs/heads/master1189 committer Shawn O. Pearce <spearce> 19283 -04001190 # who is that guy anyway?1191 data <<EOF1192 M 644 inline .gitignore1193 data <<EOF1194 * M 777 inline bob11951196 Active Branch LRU1197 -----------------1198 active_branches = 1 cur, 5 max11991200 pos clock name1201 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~1202 1) 0 refs/heads/master12031204 Inactive Branches1205 -----------------1206 refs/heads/master:1207 status : active loaded dirty1208 tip commit : 00000000000000000000000000000000000000001209 old tree : 00000000000000000000000000000000000000001210 cur tree : 00000000000000000000000000000000000000001211 commit clock: 01212 last pack :121312141215 -------------------1216 END OF CRASH REPORT1217====12181219Tips and Tricks1220---------------1221The following tips and tricks have been collected from various1222users of fast-import, and are offered here as suggestions.12231224Use One Mark Per Commit1225~~~~~~~~~~~~~~~~~~~~~~~1226When doing a repository conversion, use a unique mark per commit1227(`mark :<n>`) and supply the \--export-marks option on the command1228line. fast-import will dump a file which lists every mark and the Git1229object SHA-1 that corresponds to it. If the frontend can tie1230the marks back to the source repository, it is easy to verify the1231accuracy and completeness of the import by comparing each Git1232commit to the corresponding source revision.12331234Coming from a system such as Perforce or Subversion this should be1235quite simple, as the fast-import mark can also be the Perforce changeset1236number or the Subversion revision number.12371238Freely Skip Around Branches1239~~~~~~~~~~~~~~~~~~~~~~~~~~~1240Don't bother trying to optimize the frontend to stick to one branch1241at a time during an import. Although doing so might be slightly1242faster for fast-import, it tends to increase the complexity of the frontend1243code considerably.12441245The branch LRU builtin to fast-import tends to behave very well, and the1246cost of activating an inactive branch is so low that bouncing around1247between branches has virtually no impact on import performance.12481249Handling Renames1250~~~~~~~~~~~~~~~~1251When importing a renamed file or directory, simply delete the old1252name(s) and modify the new name(s) during the corresponding commit.1253Git performs rename detection after-the-fact, rather than explicitly1254during a commit.12551256Use Tag Fixup Branches1257~~~~~~~~~~~~~~~~~~~~~~1258Some other SCM systems let the user create a tag from multiple1259files which are not from the same commit/changeset. Or to create1260tags which are a subset of the files available in the repository.12611262Importing these tags as-is in Git is impossible without making at1263least one commit which ``fixes up'' the files to match the content1264of the tag. Use fast-import's `reset` command to reset a dummy branch1265outside of your normal branch space to the base commit for the tag,1266then commit one or more file fixup commits, and finally tag the1267dummy branch.12681269For example since all normal branches are stored under `refs/heads/`1270name the tag fixup branch `TAG_FIXUP`. This way it is impossible for1271the fixup branch used by the importer to have namespace conflicts1272with real branches imported from the source (the name `TAG_FIXUP`1273is not `refs/heads/TAG_FIXUP`).12741275When committing fixups, consider using `merge` to connect the1276commit(s) which are supplying file revisions to the fixup branch.1277Doing so will allow tools such as 'git blame' to track1278through the real commit history and properly annotate the source1279files.12801281After fast-import terminates the frontend will need to do `rm .git/TAG_FIXUP`1282to remove the dummy branch.12831284Import Now, Repack Later1285~~~~~~~~~~~~~~~~~~~~~~~~1286As soon as fast-import completes the Git repository is completely valid1287and ready for use. Typically this takes only a very short time,1288even for considerably large projects (100,000+ commits).12891290However repacking the repository is necessary to improve data1291locality and access performance. It can also take hours on extremely1292large projects (especially if -f and a large \--window parameter is1293used). Since repacking is safe to run alongside readers and writers,1294run the repack in the background and let it finish when it finishes.1295There is no reason to wait to explore your new Git project!12961297If you choose to wait for the repack, don't try to run benchmarks1298or performance tests until repacking is completed. fast-import outputs1299suboptimal packfiles that are simply never seen in real use1300situations.13011302Repacking Historical Data1303~~~~~~~~~~~~~~~~~~~~~~~~~1304If you are repacking very old imported data (e.g. older than the1305last year), consider expending some extra CPU time and supplying1306\--window=50 (or higher) when you run 'git repack'.1307This will take longer, but will also produce a smaller packfile.1308You only need to expend the effort once, and everyone using your1309project will benefit from the smaller repository.13101311Include Some Progress Messages1312~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~1313Every once in a while have your frontend emit a `progress` message1314to fast-import. The contents of the messages are entirely free-form,1315so one suggestion would be to output the current month and year1316each time the current commit date moves into the next month.1317Your users will feel better knowing how much of the data stream1318has been processed.131913201321Packfile Optimization1322---------------------1323When packing a blob fast-import always attempts to deltify against the last1324blob written. Unless specifically arranged for by the frontend,1325this will probably not be a prior version of the same file, so the1326generated delta will not be the smallest possible. The resulting1327packfile will be compressed, but will not be optimal.13281329Frontends which have efficient access to all revisions of a1330single file (for example reading an RCS/CVS ,v file) can choose1331to supply all revisions of that file as a sequence of consecutive1332`blob` commands. This allows fast-import to deltify the different file1333revisions against each other, saving space in the final packfile.1334Marks can be used to later identify individual file revisions during1335a sequence of `commit` commands.13361337The packfile(s) created by fast-import do not encourage good disk access1338patterns. This is caused by fast-import writing the data in the order1339it is received on standard input, while Git typically organizes1340data within packfiles to make the most recent (current tip) data1341appear before historical data. Git also clusters commits together,1342speeding up revision traversal through better cache locality.13431344For this reason it is strongly recommended that users repack the1345repository with `git repack -a -d` after fast-import completes, allowing1346Git to reorganize the packfiles for faster data access. If blob1347deltas are suboptimal (see above) then also adding the `-f` option1348to force recomputation of all deltas can significantly reduce the1349final packfile size (30-50% smaller can be quite typical).135013511352Memory Utilization1353------------------1354There are a number of factors which affect how much memory fast-import1355requires to perform an import. Like critical sections of core1356Git, fast-import uses its own memory allocators to amortize any overheads1357associated with malloc. In practice fast-import tends to amortize any1358malloc overheads to 0, due to its use of large block allocations.13591360per object1361~~~~~~~~~~1362fast-import maintains an in-memory structure for every object written in1363this execution. On a 32 bit system the structure is 32 bytes,1364on a 64 bit system the structure is 40 bytes (due to the larger1365pointer sizes). Objects in the table are not deallocated until1366fast-import terminates. Importing 2 million objects on a 32 bit system1367will require approximately 64 MiB of memory.13681369The object table is actually a hashtable keyed on the object name1370(the unique SHA-1). This storage configuration allows fast-import to reuse1371an existing or already written object and avoid writing duplicates1372to the output packfile. Duplicate blobs are surprisingly common1373in an import, typically due to branch merges in the source.13741375per mark1376~~~~~~~~1377Marks are stored in a sparse array, using 1 pointer (4 bytes or 81378bytes, depending on pointer size) per mark. Although the array1379is sparse, frontends are still strongly encouraged to use marks1380between 1 and n, where n is the total number of marks required for1381this import.13821383per branch1384~~~~~~~~~~1385Branches are classified as active and inactive. The memory usage1386of the two classes is significantly different.13871388Inactive branches are stored in a structure which uses 96 or 1201389bytes (32 bit or 64 bit systems, respectively), plus the length of1390the branch name (typically under 200 bytes), per branch. fast-import will1391easily handle as many as 10,000 inactive branches in under 2 MiB1392of memory.13931394Active branches have the same overhead as inactive branches, but1395also contain copies of every tree that has been recently modified on1396that branch. If subtree `include` has not been modified since the1397branch became active, its contents will not be loaded into memory,1398but if subtree `src` has been modified by a commit since the branch1399became active, then its contents will be loaded in memory.14001401As active branches store metadata about the files contained on that1402branch, their in-memory storage size can grow to a considerable size1403(see below).14041405fast-import automatically moves active branches to inactive status based on1406a simple least-recently-used algorithm. The LRU chain is updated on1407each `commit` command. The maximum number of active branches can be1408increased or decreased on the command line with \--active-branches=.14091410per active tree1411~~~~~~~~~~~~~~~1412Trees (aka directories) use just 12 bytes of memory on top of the1413memory required for their entries (see ``per active file'' below).1414The cost of a tree is virtually 0, as its overhead amortizes out1415over the individual file entries.14161417per active file entry1418~~~~~~~~~~~~~~~~~~~~~1419Files (and pointers to subtrees) within active trees require 52 or 641420bytes (32/64 bit platforms) per entry. To conserve space, file and1421tree names are pooled in a common string table, allowing the filename1422``Makefile'' to use just 16 bytes (after including the string header1423overhead) no matter how many times it occurs within the project.14241425The active branch LRU, when coupled with the filename string pool1426and lazy loading of subtrees, allows fast-import to efficiently import1427projects with 2,000+ branches and 45,114+ files in a very limited1428memory footprint (less than 2.7 MiB per active branch).14291430Signals1431-------1432Sending *SIGUSR1* to the 'git fast-import' process ends the current1433packfile early, simulating a `checkpoint` command. The impatient1434operator can use this facility to peek at the objects and refs from an1435import in progress, at the cost of some added running time and worse1436compression.14371438GIT1439---1440Part of the linkgit:git[1] suite