1git-filter-branch(1) 2==================== 3 4NAME 5---- 6git-filter-branch - Rewrite branches 7 8SYNOPSIS 9-------- 10[verse] 11'git-filter-branch' [--env-filter <command>] [--tree-filter <command>] 12 [--index-filter <command>] [--parent-filter <command>] 13 [--msg-filter <command>] [--commit-filter <command>] 14 [--tag-name-filter <command>] [--subdirectory-filter <directory>] 15 [--original <namespace>] [-d <directory>] [-f | --force] 16 [<rev-list options>...] 17 18DESCRIPTION 19----------- 20Lets you rewrite git revision history by rewriting the branches mentioned 21in the <rev-list options>, applying custom filters on each revision. 22Those filters can modify each tree (e.g. removing a file or running 23a perl rewrite on all files) or information about each commit. 24Otherwise, all information (including original commit times or merge 25information) will be preserved. 26 27The command will only rewrite the _positive_ refs mentioned in the 28command line (i.e. if you pass 'a..b', only 'b' will be rewritten). 29If you specify no filters, the commits will be recommitted without any 30changes, which would normally have no effect. Nevertheless, this may be 31useful in the future for compensating for some git bugs or such, 32therefore such a usage is permitted. 33 34*WARNING*! The rewritten history will have different object names for all 35the objects and will not converge with the original branch. You will not 36be able to easily push and distribute the rewritten branch on top of the 37original branch. Please do not use this command if you do not know the 38full implications, and avoid using it anyway, if a simple single commit 39would suffice to fix your problem. 40 41Always verify that the rewritten version is correct: The original refs, 42if different from the rewritten ones, will be stored in the namespace 43'refs/original/'. 44 45Note that since this operation is extensively I/O expensive, it might 46be a good idea to redirect the temporary directory off-disk with the 47'-d' option, e.g. on tmpfs. Reportedly the speedup is very noticeable. 48 49 50Filters 51~~~~~~~ 52 53The filters are applied in the order as listed below. The <command> 54argument is always evaluated in shell using the 'eval' command (with the 55notable exception of the commit filter, for technical reasons). 56Prior to that, the $GIT_COMMIT environment variable will be set to contain 57the id of the commit being rewritten. Also, GIT_AUTHOR_NAME, 58GIT_AUTHOR_EMAIL, GIT_AUTHOR_DATE, GIT_COMMITTER_NAME, GIT_COMMITTER_EMAIL, 59and GIT_COMMITTER_DATE are set according to the current commit. 60 61A 'map' function is available that takes an "original sha1 id" argument 62and outputs a "rewritten sha1 id" if the commit has been already 63rewritten, and "original sha1 id" otherwise; the 'map' function can 64return several ids on separate lines if your commit filter emitted 65multiple commits. 66 67 68OPTIONS 69------- 70 71--env-filter <command>:: 72 This is the filter for modifying the environment in which 73 the commit will be performed. Specifically, you might want 74 to rewrite the author/committer name/email/time environment 75 variables (see gitlink:git-commit[1] for details). Do not forget 76 to re-export the variables. 77 78--tree-filter <command>:: 79 This is the filter for rewriting the tree and its contents. 80 The argument is evaluated in shell with the working 81 directory set to the root of the checked out tree. The new tree 82 is then used as-is (new files are auto-added, disappeared files 83 are auto-removed - neither .gitignore files nor any other ignore 84 rules *HAVE ANY EFFECT*!). 85 86--index-filter <command>:: 87 This is the filter for rewriting the index. It is similar to the 88 tree filter but does not check out the tree, which makes it much 89 faster. For hairy cases, see gitlink:git-update-index[1]. 90 91--parent-filter <command>:: 92 This is the filter for rewriting the commit's parent list. 93 It will receive the parent string on stdin and shall output 94 the new parent string on stdout. The parent string is in 95 a format accepted by gitlink:git-commit-tree[1]: empty for 96 the initial commit, "-p parent" for a normal commit and 97 "-p parent1 -p parent2 -p parent3 ..." for a merge commit. 98 99--msg-filter <command>:: 100 This is the filter for rewriting the commit messages. 101 The argument is evaluated in the shell with the original 102 commit message on standard input; its standard output is 103 used as the new commit message. 104 105--commit-filter <command>:: 106 This is the filter for performing the commit. 107 If this filter is specified, it will be called instead of the 108 gitlink:git-commit-tree[1] command, with arguments of the form 109 "<TREE_ID> [-p <PARENT_COMMIT_ID>]..." and the log message on 110 stdin. The commit id is expected on stdout. 111+ 112As a special extension, the commit filter may emit multiple 113commit ids; in that case, ancestors of the original commit will 114have all of them as parents. 115+ 116Note that the 'map' function is not available in the commit filter yet. 117This will be changed in a future version. 118 119--tag-name-filter <command>:: 120 This is the filter for rewriting tag names. When passed, 121 it will be called for every tag ref that points to a rewritten 122 object (or to a tag object which points to a rewritten object). 123 The original tag name is passed via standard input, and the new 124 tag name is expected on standard output. 125+ 126The original tags are not deleted, but can be overwritten; 127use "--tag-name-filter cat" to simply update the tags. In this 128case, be very careful and make sure you have the old tags 129backed up in case the conversion has run afoul. 130+ 131Note that there is currently no support for proper rewriting of 132tag objects; in layman terms, if the tag has a message or signature 133attached, the rewritten tag won't have it. Sorry. (It is by 134definition impossible to preserve signatures at any rate.) 135 136--subdirectory-filter <directory>:: 137 Only look at the history which touches the given subdirectory. 138 The result will contain that directory (and only that) as its 139 project root. 140 141--original <namespace>:: 142 Use this option to set the namespace where the original commits 143 will be stored. The default value is 'refs/original'. 144 145-d <directory>:: 146 Use this option to set the path to the temporary directory used for 147 rewriting. When applying a tree filter, the command needs to 148 temporary checkout the tree to some directory, which may consume 149 considerable space in case of large projects. By default it 150 does this in the '.git-rewrite/' directory but you can override 151 that choice by this parameter. 152 153-f\|--force:: 154 `git filter-branch` refuses to start with an existing temporary 155 directory or when there are already refs starting with 156 'refs/original/', unless forced. 157 158<rev-list-options>:: 159 When options are given after the new branch name, they will 160 be passed to gitlink:git-rev-list[1]. Only commits in the resulting 161 output will be filtered, although the filtered commits can still 162 reference parents which are outside of that set. 163 164 165Examples 166-------- 167 168Suppose you want to remove a file (containing confidential information 169or copyright violation) from all commits: 170 171------------------------------------------------------- 172git filter-branch --tree-filter 'rm filename' HEAD 173------------------------------------------------------- 174 175A significantly faster version: 176 177-------------------------------------------------------------------------- 178git filter-branch --index-filter 'git update-index --remove filename' HEAD 179-------------------------------------------------------------------------- 180 181Now, you will get the rewritten history saved in the branch 'newbranch' 182(your current branch is left untouched). 183 184To set a commit (which typically is at the tip of another 185history) to be the parent of the current initial commit, in 186order to paste the other history behind the current history: 187 188------------------------------------------------------------------- 189git filter-branch --parent-filter 'sed "s/^\$/-p <graft-id>/"' HEAD 190------------------------------------------------------------------- 191 192(if the parent string is empty - which happens when we are dealing with 193the initial commit - add graftcommit as a parent). Note that this assumes 194history with a single root (that is, no merge without common ancestors 195happened). If this is not the case, use: 196 197-------------------------------------------------------------------------- 198git filter-branch --parent-filter \ 199 'cat; test $GIT_COMMIT = <commit-id> && echo "-p <graft-id>"' HEAD 200-------------------------------------------------------------------------- 201 202or even simpler: 203 204----------------------------------------------- 205echo "$commit-id $graft-id" >> .git/info/grafts 206git filter-branch $graft-id..HEAD 207----------------------------------------------- 208 209To remove commits authored by "Darl McBribe" from the history: 210 211------------------------------------------------------------------------------ 212git filter-branch --commit-filter ' 213 if [ "$GIT_AUTHOR_NAME" = "Darl McBribe" ]; 214 then 215 shift; 216 while [ -n "$1" ]; 217 do 218 shift; 219 echo "$1"; 220 shift; 221 done; 222 else 223 git commit-tree "$@"; 224 fi' HEAD 225------------------------------------------------------------------------------ 226 227The shift magic first throws away the tree id and then the -p 228parameters. Note that this handles merges properly! In case Darl 229committed a merge between P1 and P2, it will be propagated properly 230and all children of the merge will become merge commits with P1,P2 231as their parents instead of the merge commit. 232 233To restrict rewriting to only part of the history, specify a revision 234range in addition to the new branch name. The new branch name will 235point to the top-most revision that a 'git rev-list' of this range 236will print. 237 238*NOTE* the changes introduced by the commits, and which are not reverted 239by subsequent commits, will still be in the rewritten branch. If you want 240to throw out _changes_ together with the commits, you should use the 241interactive mode of gitlink:git-rebase[1]. 242 243 244Consider this history: 245 246------------------ 247 D--E--F--G--H 248 / / 249A--B-----C 250------------------ 251 252To rewrite only commits D,E,F,G,H, but leave A, B and C alone, use: 253 254-------------------------------- 255git filter-branch ... C..H 256-------------------------------- 257 258To rewrite commits E,F,G,H, use one of these: 259 260---------------------------------------- 261git filter-branch ... C..H --not D 262git filter-branch ... D..H --not C 263---------------------------------------- 264 265To move the whole tree into a subdirectory, or remove it from there: 266 267--------------------------------------------------------------- 268git filter-branch --index-filter \ 269 'git ls-files -s | sed "s-\t-&newsubdir/-" | 270 GIT_INDEX_FILE=$GIT_INDEX_FILE.new \ 271 git update-index --index-info && 272 mv $GIT_INDEX_FILE.new $GIT_INDEX_FILE' HEAD 273--------------------------------------------------------------- 274 275 276Author 277------ 278Written by Petr "Pasky" Baudis <pasky@suse.cz>, 279and the git list <git@vger.kernel.org> 280 281Documentation 282-------------- 283Documentation by Petr Baudis and the git list. 284 285GIT 286--- 287Part of the gitlink:git[7] suite