gitweb.git
pack-objects: allow "thin" packs to exceed depth limitsJunio C Hamano Fri, 24 Feb 2006 07:04:52 +0000 (23:04 -0800)

pack-objects: allow "thin" packs to exceed depth limits

When creating a new pack to be used in .git/objects/pack/
directory, we carefully count the depth of deltified objects to
be reused, so that the generated pack does not to exceed the
specified depth limit for runtime efficiency. However, when we
are generating a thin pack that does not contain base objects,
such a pack can only be used during network transfer that is
expanded on the other end upon reception, so being careful and
artificially cutting the delta chain does not buy us anything
except increased bandwidth requirement. This patch disables the
delta chain depth limit check when reusing an existing delta.

Signed-off-by: Junio C Hamano <junkio@cox.net>

Merge branch 'ar/win'Junio C Hamano Fri, 24 Feb 2006 06:35:55 +0000 (22:35 -0800)

Merge branch 'ar/win'

* ar/win:
PATCH: simplify calls to git programs in git-fmt-merge-msg

Merge branch 'jc/send-insane-refs'Junio C Hamano Fri, 24 Feb 2006 06:34:39 +0000 (22:34 -0800)

Merge branch 'jc/send-insane-refs'

* jc/send-insane-refs:
send-pack: do not give up when remote has insanely large number of refs.

Merge fixes early for next maint series.Junio C Hamano Fri, 24 Feb 2006 06:27:03 +0000 (22:27 -0800)

Merge fixes early for next maint series.

Merge branches 'jc/fix-co-candy', 'jc/fix-rename-leak... Junio C Hamano Fri, 24 Feb 2006 06:25:32 +0000 (22:25 -0800)

Merge branches 'jc/fix-co-candy', 'jc/fix-rename-leak' and 'ar/fix-win' into maint

* jc/fix-co-candy:
checkout - eye candy.

* jc/fix-rename-leak:
diffcore-rename: plug memory leak.

* ar/fix-win:
fix t5600-clone-fail-cleanup.sh on windows

Merge branch 'ak/gitview'Junio C Hamano Fri, 24 Feb 2006 06:20:30 +0000 (22:20 -0800)

Merge branch 'ak/gitview'

* ak/gitview:
gitview: Display the lines joining commit nodes clearly.

gitview: Display the lines joining commit nodes clearly.Aneesh Kumar K.V Thu, 23 Feb 2006 19:29:42 +0000 (00:59 +0530)

gitview: Display the lines joining commit nodes clearly.

Since i wanted to limit the graph box size i was resetting
the window after an index of 5. This result in line joining
commit nodes to pass over nodes which are not related. The
changes fixes the same

Signed-off-by: Aneesh Kumar K.V <aneesh.kumar@gmail.com>
Signed-off-by: Junio C Hamano <junkio@cox.net>

git-am: do not allow empty commits by mistake.Junio C Hamano Fri, 24 Feb 2006 06:14:47 +0000 (22:14 -0800)

git-am: do not allow empty commits by mistake.

Running "git-am --resolved" without doing anything can create an empty
commit. Prevent it.

Thanks for Eric W. Biederman for spotting this.

Signed-off-by: Junio C Hamano <junkio@cox.net>

Tweak break/merge score to adjust to the new delta... Junio C Hamano Fri, 24 Feb 2006 01:02:56 +0000 (17:02 -0800)

Tweak break/merge score to adjust to the new delta generation code.

This lowers the default merge threshold score to 75% from
earlier 80%. The break threshold stays the same at 50% for now,
but we might want to revisit it (and the rename detection limit
as well).

* break score: this much edit (both insertion of new material
and deletion of old material) needs to be there in the file
before we consider this _might_ be a rewrite and break the
filepair.

* merge score: after a filepair is broken by the above criteria
and goes through rename detection, if their pieces did not
match with other files as rename/copy, we merge them back
into one as if nothing happened. If the filepair had at
least this much deletion of old material, however, we say
this is completely rewritten with dissimilarity index X% when
we do so.

The updated delta code by Nico is so good that what we earlier
thought to be complete rewrite now reuses a lot more from the
source material (reducing the counted "delete"), so this
adjustment is needed to keep the perceived behaviour similar to
what we had earlier.

Signed-off-by: Junio C Hamano <junkio@cox.net>

count-delta: fix counting of copied source.Junio C Hamano Thu, 23 Feb 2006 10:58:37 +0000 (02:58 -0800)

count-delta: fix counting of copied source.

The previous one wrongly coalesced a span with the next one
even though the span being added does not reach it.

Signed-off-by: Junio C Hamano <junkio@cox.net>

PATCH: simplify calls to git programs in git-fmt-merge-msgAlex Riesen Thu, 23 Feb 2006 10:26:46 +0000 (11:26 +0100)

PATCH: simplify calls to git programs in git-fmt-merge-msg

It also makes it work on ActiveState Perl.

Signed-off-by: Junio C Hamano <junkio@cox.net>

fix t5600-clone-fail-cleanup.sh on windowsAlex Riesen Thu, 23 Feb 2006 11:25:20 +0000 (12:25 +0100)

fix t5600-clone-fail-cleanup.sh on windows

In windows you cannot remove current or opened directory,
an opened file, a running program, a loaded library, etc...

[jc: signoffs? With a minor quoting fix.]

Signed-off-by: Junio C Hamano <junkio@cox.net>

Merge part of pack-thin branchJunio C Hamano Thu, 23 Feb 2006 11:00:59 +0000 (03:00 -0800)

Merge part of pack-thin branch

Merge branch 'np/delta' into nextJunio C Hamano Thu, 23 Feb 2006 10:59:24 +0000 (02:59 -0800)

Merge branch 'np/delta' into next

* np/delta:
count-delta: tweak counting of copied source material.
diff-delta: produce optimal pack data

count-delta: tweak counting of copied source material.Junio C Hamano Thu, 23 Feb 2006 10:58:37 +0000 (02:58 -0800)

count-delta: tweak counting of copied source material.

With the finer grained delta algorithm, count-delta algorithm
started overcounting copied source material, since the new delta
output tends to reuse the same source range more than once and
more aggressively. This broke an earlier assumption that the
number of bytes copied out from the source buffer is a good
approximation how much source material is actually remaining in
the result.

This uses fairly inefficient algorithm to keep track of ranges
of source material that are actually copied out to the
destination buffer. With this tweak, the obvious rename/break
detection tests in the testsuite start to work again.

Signed-off-by: Junio C Hamano <junkio@cox.net>

pack-objects: use full pathname to help hashing with... Junio C Hamano Thu, 23 Feb 2006 06:10:24 +0000 (22:10 -0800)

pack-objects: use full pathname to help hashing with "thin" pack.

This uses the same hashing algorithm to the "preferred base
tree" objects and the incoming pathnames, to group the same
files from different revs together, while spreading files with
the same basename in different directories.

Signed-off-by: Junio C Hamano <junkio@cox.net>

pack-objects: thin pack micro-optimization.Junio C Hamano Thu, 23 Feb 2006 05:45:45 +0000 (21:45 -0800)

pack-objects: thin pack micro-optimization.

Since we sort objects by type, hash, preferredness and then
size, after we have a delta against preferred base, there is no
point trying a delta with non-preferred base. This seems to
save expensive calls to diff-delta and it also seems to save the
output space as well.

Signed-off-by: Junio C Hamano <junkio@cox.net>

Merge branches 'maint', 'jc/fix-co-candy' and 'jc/fix... Junio C Hamano Thu, 23 Feb 2006 03:46:59 +0000 (19:46 -0800)

Merge branches 'maint', 'jc/fix-co-candy' and 'jc/fix-rename-leak' into next

* maint:
Give no terminating LF to error() function.

* jc/fix-co-candy:
checkout - eye candy.

* jc/fix-rename-leak:
diffcore-rename: plug memory leak.

diffcore-rename: plug memory leak.Junio C Hamano Thu, 23 Feb 2006 03:45:48 +0000 (19:45 -0800)

diffcore-rename: plug memory leak.

Spotted by Nicolas Pitre.

Signed-off-by: Junio C Hamano <junkio@cox.net>

Merge branch 'ml/cvs'Junio C Hamano Thu, 23 Feb 2006 03:20:55 +0000 (19:20 -0800)

Merge branch 'ml/cvs'

* ml/cvs:
Introducing git-cvsserver -- a CVS emulator for git.

Merge branch 'ra/anno'Junio C Hamano Thu, 23 Feb 2006 03:20:08 +0000 (19:20 -0800)

Merge branch 'ra/anno'

* ra/anno:
Use Ryan's git-annotate instead of jsannotate
Add git-annotate, a tool for assigning blame.

Give no terminating LF to error() function.Junio C Hamano Thu, 23 Feb 2006 01:47:10 +0000 (17:47 -0800)

Give no terminating LF to error() function.

Signed-off-by: Junio C Hamano <junkio@cox.net>

checkout - eye candy.Junio C Hamano Thu, 23 Feb 2006 03:02:39 +0000 (19:02 -0800)

checkout - eye candy.

This implements "eye candy" similar to the pack-object/unpack-object
to entertain users while a large tree is being checked out after
a clone or a pull.

Signed-off-by: Junio C Hamano <junkio@cox.net>

Merge branch 'cw/remove' into nextJunio C Hamano Thu, 23 Feb 2006 01:15:01 +0000 (17:15 -0800)

Merge branch 'cw/remove' into next

* cw/remove:
git-rm: Fix to properly handle files with spaces, tabs, newlines, etc.
Add new git-rm command with documentation

git-rm: Fix to properly handle files with spaces, tabs... Carl Worth Thu, 23 Feb 2006 00:37:27 +0000 (16:37 -0800)

git-rm: Fix to properly handle files with spaces, tabs, newlines, etc.

New tests are added to the git-rm test case to cover this as well.

Signed-off-by: Carl Worth <cworth@cworth.org>
Signed-off-by: Junio C Hamano <junkio@cox.net>

Add new git-rm command with documentationCarl Worth Tue, 21 Feb 2006 23:04:51 +0000 (15:04 -0800)

Add new git-rm command with documentation

This adds a git-rm command which provides convenience similar to
git-add, (and a bit more since it takes care of the rm as well if
given -f).

Like git-add, git-rm expands the given path names through
git-ls-files. This means it only acts on files listed in the
index. And it does act recursively on directories by default, (no -r
needed as in the case of rm itself). When it recurses, it does not
remove empty directories that are left behind.

Signed-off-by: Junio C Hamano <junkio@cox.net>

Merge master to get fixes up to GIT 1.2.3Junio C Hamano Thu, 23 Feb 2006 00:38:21 +0000 (16:38 -0800)

Merge master to get fixes up to GIT 1.2.3

Merge fixes up to GIT 1.2.3Junio C Hamano Thu, 23 Feb 2006 00:15:42 +0000 (16:15 -0800)

Merge fixes up to GIT 1.2.3

git-fetch: follow tag only when tracking remote branch. v1.2.3Junio C Hamano Wed, 22 Feb 2006 21:10:37 +0000 (13:10 -0800)

git-fetch: follow tag only when tracking remote branch.

Unless --no-tags flag was given, git-fetch tried to always
follow remote tags that point at the commits we picked up.

It is not very useful to pick up tags from remote unless storing
the fetched branch head in a local tracking branch. This is
especially true if the fetch is done to merge the remote branch
into our current branch as one-shot basis (i.e. "please pull"),
and is even harmful if the remote repository has many irrelevant
tags.

This proposed update disables the automated tag following unless
we are storing the a fetched branch head in a local tracking
branch.

Signed-off-by: Junio C Hamano <junkio@cox.net>

pack-objects eye-candy: finishing touches.Junio C Hamano Thu, 23 Feb 2006 00:02:59 +0000 (16:02 -0800)

pack-objects eye-candy: finishing touches.

This updates the progress output to match "every one second or
every percent whichever comes early" used by unpack-objects, as
discussed on the list.

Signed-off-by: Junio C Hamano <junkio@cox.net>

also adds progress when actually writing a packNicolas Pitre Wed, 22 Feb 2006 22:41:32 +0000 (17:41 -0500)

also adds progress when actually writing a pack

If that pack is big, it takes significant time to write and might
benefit from some more eye candies as well. This is however disabled
when the pack is written to stdout since in that case the output is
usually piped into unpack_objects which already does its own progress
reporting.

Signed-off-by: Nicolas Pitre <nico@cam.org>
Signed-off-by: Junio C Hamano <junkio@cox.net>

nicer eye candies for pack-objectsNicolas Pitre Wed, 22 Feb 2006 21:00:08 +0000 (16:00 -0500)

nicer eye candies for pack-objects

This provides a stable and simpler progress reporting mechanism that
updates progress as often as possible but accurately not updating more
than once a second. The deltification phase is also made more
interesting to watch (since repacking a big repository and only seeing a
dot appear once every many seconds is rather boring and doesn't provide
much food for anticipation).

Signed-off-by: Nicolas Pitre <nico@cam.org>
Signed-off-by: Junio C Hamano <junkio@cox.net>

Keep Porcelainish from failing by broken ident after... Junio C Hamano Sun, 19 Feb 2006 04:51:26 +0000 (20:51 -0800)

Keep Porcelainish from failing by broken ident after making changes.

"empty ident not allowed" error makes commit-tree fail, so we
are already safer in that we would not end up with commit
objects that have bogus names on the author or committer fields.
However, before commit-tree is called there are already changes
made to the index file and the working tree. The operation can
be resumed after fixing the environment problem, but when this
triggers to a newcomer with unusable gecos, the first question
becomes "what did I lose and how would I recover".

This patch modifies some Porcelainish commands to verify
GIT_COMMITTER_IDENT as soon as we know we are going to make some
commits before doing much damage to prevent confusion.

Signed-off-by: Junio C Hamano <junkio@cox.net>

Delay "empty ident" errors until they really matter.Junio C Hamano Sun, 19 Feb 2006 04:31:05 +0000 (20:31 -0800)

Delay "empty ident" errors until they really matter.

Previous one warned people upfront to encourage fixing their
environment early, but some people just use repositories and git
tools read-only without making any changes, and in such a case
there is not much point insisting on them having a usable ident.

This round attempts to move the error until either "git-var"
asks for the ident explicitly or "commit-tree" wants to use it.

Signed-off-by: Junio C Hamano <junkio@cox.net>

Make "empty ident" error message a bit more helpful.Junio C Hamano Sat, 18 Feb 2006 09:20:06 +0000 (01:20 -0800)

Make "empty ident" error message a bit more helpful.

It appears that some people who did not care about having bogus
names in their own commit messages are bitten by the recent
change to require a sane environment [*1*].

While it was a good idea to prevent people from using bogus
names to create commits and doing sign-offs, the error message
is not very informative. This patch attempts to warn things
upfront and hint people how to fix their environments.

[Footnote]

*1* The thread is this one.

http://marc.theaimsgroup.com/?t=113868084800004

Especially this message.

http://marc.theaimsgroup.com/?m=113932830015032

Signed-off-by: Junio C Hamano <junkio@cox.net>

pack-objects: avoid delta chains that are too long.Junio C Hamano Sat, 18 Feb 2006 04:58:45 +0000 (20:58 -0800)

pack-objects: avoid delta chains that are too long.

This tries to rework the solution for the excess delta chain
problem. An earlier commit worked it around ``cheaply'', but
repeated repacking risks unbound growth of delta chains.

This version counts the length of delta chain we are reusing
from the existing pack, and makes sure a base object that has
sufficiently long delta chain does not get deltified.

Signed-off-by: Junio C Hamano <junkio@cox.net>

git-repack: allow passing a couple of flags to pack... Junio C Hamano Thu, 16 Feb 2006 19:57:18 +0000 (11:57 -0800)

git-repack: allow passing a couple of flags to pack-objects.

A new flag -q makes underlying pack-objects less chatty.
A new flag -f forces delta to be recomputed from scratch.

Signed-off-by: Junio C Hamano <junkio@cox.net>

pack-objects: finishing touches.Junio C Hamano Thu, 16 Feb 2006 19:55:51 +0000 (11:55 -0800)

pack-objects: finishing touches.

This introduces --no-reuse-delta option to disable reusing of
existing delta, which is a large part of the optimization
introduced by this series. This may become necessary if
repeated repacking makes delta chain too long. With this, the
output of the command becomes identical to that of the older
implementation. But the performance suffers greatly.

It still allows reusing non-deltified representations; there is
no point uncompressing and recompressing the whole text.

It also adds a couple more statistics output, while squelching
it under -q flag, which the last round forgot to do.

$ time old-git-pack-objects --stdout >/dev/null <RL
Generating pack...
Done counting 184141 objects.
Packing 184141 objects....................
real 12m8.530s user 11m1.450s sys 0m57.920s
$ time git-pack-objects --stdout >/dev/null <RL
Generating pack...
Done counting 184141 objects.
Packing 184141 objects.....................
Total 184141, written 184141 (delta 138297), reused 178833 (delta 134081)
real 0m59.549s user 0m56.670s sys 0m2.400s
$ time git-pack-objects --stdout --no-reuse-delta >/dev/null <RL
Generating pack...
Done counting 184141 objects.
Packing 184141 objects.....................
Total 184141, written 184141 (delta 134833), reused 47904 (delta 0)
real 11m13.830s user 9m45.240s sys 0m44.330s

There is one remaining issue when --no-reuse-delta option is not
used. It can create delta chains that are deeper than specified.

A<--B<--C<--D E F G

Suppose we have a delta chain A to D (A is stored in full either
in a pack or as a loose object. B is depth1 delta relative to A,
C is depth2 delta relative to B...) with loose objects E, F, G.
And we are going to pack all of them.

B, C and D are left as delta against A, B and C respectively.
So A, E, F, and G are examined for deltification, and let's say
we decided to keep E expanded, and store the rest as deltas like
this:

E<--F<--G<--A

Oops. We ended up making D a bit too deep, didn't we? B, C and
D form a chain on top of A!

This is because we did not know what the final depth of A would
be, when we checked objects and decided to keep the existing
delta. Unfortunately, deferring the decision until just before
the deltification is not an option. To be able to make B, C,
and D candidates for deltification with the rest, we need to
know the type and final unexpanded size of them, but the major
part of the optimization comes from the fact that we do not read
the delta data to do so -- getting the final size is quite an
expensive operation.

To prevent this from happening, we should keep A from being
deltified. But how would we tell that, cheaply?

To do this most precisely, after check_object() runs, each
object that is used as the base object of some existing delta
needs to be marked with the maximum depth of the objects we
decided to keep deltified (in this case, D is depth 3 relative
to A, so if no other delta chain that is longer than 3 based on
A exists, mark A with 3). Then when attempting to deltify A, we
would take that number into account to see if the final delta
chain that leads to D becomes too deep.

However, this is a bit cumbersome to compute, so we would cheat
and reduce the maximum depth for A arbitrarily to depth/4 in
this implementation.

Signed-off-by: Junio C Hamano <junkio@cox.net>

pack-objects: reuse data from existing packs.Junio C Hamano Thu, 16 Feb 2006 01:34:29 +0000 (17:34 -0800)

pack-objects: reuse data from existing packs.

When generating a new pack, notice if we have already needed
objects in existing packs. If an object is stored deltified,
and its base object is also what we are going to pack, then
reuse the existing deltified representation unconditionally,
bypassing all the expensive find_deltas() and try_deltas()
calls.

Also, notice if what we are going to write out exactly match
what is already in an existing pack (either deltified or just
compressed). In such a case, we can just copy it instead of
going through the usual uncompressing & recompressing cycle.

Without this patch, in linux-2.6 repository with about 1500
loose objects and a single mega pack:

$ git-rev-list --objects v2.6.16-rc3 >RL
$ wc -l RL
184141 RL
$ time git-pack-objects p <RL
Generating pack...
Done counting 184141 objects.
Packing 184141 objects....................
a1fc7b3e537fcb9b3c46b7505df859f0a11e79d2

real 12m4.323s
user 11m2.560s
sys 0m55.950s

With this patch, the same input:

$ time ../git.junio/git-pack-objects q <RL
Generating pack...
Done counting 184141 objects.
Packing 184141 objects.....................
a1fc7b3e537fcb9b3c46b7505df859f0a11e79d2
Total 184141, written 184141, reused 182441

real 1m2.608s
user 0m55.090s
sys 0m1.830s

Signed-off-by: Junio C Hamano <junkio@cox.net>

detect broken alternates.Junio C Hamano Wed, 22 Feb 2006 19:16:38 +0000 (11:16 -0800)

detect broken alternates.

The real problem triggered an earlier fix was that an alternate
entry was pointing at a removed directory. Complaining on
object/pack directory that cannot be opendir-ed produces noise
in an ancient repository that does not have object/pack
directory and has never been packed.

Detect the real user error and report it. Also if opendir
failed for other reasons (e.g. no read permissions), report that
as well.

Spotted by Andrew Vasquez <andrew.vasquez@qlogic.com>.

Signed-off-by: Junio C Hamano <junkio@cox.net>

gitview: ls-remote invocation shellquote safety.Junio C Hamano Wed, 22 Feb 2006 11:47:20 +0000 (03:47 -0800)

gitview: ls-remote invocation shellquote safety.

This will allow you to point GIT_DIR at directories with funny names.

Signed-off-by: Junio C Hamano <junkio@cox.net>

Merge branch 'ml/cvs' into nextJunio C Hamano Wed, 22 Feb 2006 10:17:56 +0000 (02:17 -0800)

Merge branch 'ml/cvs' into next

* ml/cvs:
Introducing git-cvsserver -- a CVS emulator for git.

Introducing git-cvsserver -- a CVS emulator for git.Martin Langhoff Wed, 22 Feb 2006 09:50:15 +0000 (22:50 +1300)

Introducing git-cvsserver -- a CVS emulator for git.

git-cvsserver is highly functional. However, not all methods are implemented,
and for those methods that are implemented, not all switches are implemented.
All the common read operations are implemented, and add/remove/commit are
supported.

Testing has been done using both the CLI CVS client, and the Eclipse CVS
plugin. Most functionality works fine with both of these clients.

Currently git-cvsserver only works over SSH connections, see the
Documentation for more details on how to configure your client. It
does not support pserver for anonymous access but it should not be
hard to implement. Anonymous access will need tighter input validation.

In our very informal tests, it seems to be significantly faster than a real
CVS server.

This utility depends on a version of git-cvsannotate that supports -S and on
DBD::SQLite.

Licensed under GPLv2. Copyright The Open University UK.

Authors: Martyn Smith <martyn@catalyst.net.nz>
Martin Langhoff <martin@catalyst.net.nz>

Signed-off-by: Martin Langhoff <martin@catalyst.net.nz>
Signed-off-by: Junio C Hamano <junkio@cox.net>

Merge branch 'ra/anno' into nextJunio C Hamano Wed, 22 Feb 2006 10:07:20 +0000 (02:07 -0800)

Merge branch 'ra/anno' into next

* ra/anno:
Use Ryan's git-annotate instead of jsannotate

Use Ryan's git-annotate instead of jsannotateJohannes Schindelin Mon, 20 Feb 2006 15:20:10 +0000 (16:20 +0100)

Use Ryan's git-annotate instead of jsannotate

Since Ryan's git-annotate is much faster, and has support for renames,
it is likely it goes into the mainstream git soon. Adapt it a little to
work with gitcvs, and actually use it.

Signed-off-by: Johannes Schindelin <Johannes.Schindelin@gmx.de>
Signed-off-by: Junio C Hamano <junkio@cox.net>

Merge branch 'jc/send-insane-refs' into nextJunio C Hamano Wed, 22 Feb 2006 09:48:49 +0000 (01:48 -0800)

Merge branch 'jc/send-insane-refs' into next

* jc/send-insane-refs:
send-pack: do not give up when remote has insanely large number of refs.
rev-list.c: fix non-grammatical comments.

send-pack: do not give up when remote has insanely... Junio C Hamano Wed, 22 Feb 2006 02:59:37 +0000 (18:59 -0800)

send-pack: do not give up when remote has insanely large number of refs.

Stephen C. Tweedie noticed that we give up running rev-list when
we see too many refs on the remote side. Limit the number of
negative references we give to rev-list and continue.

Not sending any negative references to rev-list is very bad --
we may be pushing a ref that is new to the other end.

Signed-off-by: Junio C Hamano <junkio@cox.net>

rev-list.c: fix non-grammatical comments.Junio C Hamano Wed, 22 Feb 2006 09:27:02 +0000 (01:27 -0800)

rev-list.c: fix non-grammatical comments.

Signed-off-by: Junio C Hamano <junkio@cox.net>

Merge part of np/deltaJunio C Hamano Wed, 22 Feb 2006 08:57:43 +0000 (00:57 -0800)

Merge part of np/delta

diff-delta: produce optimal pack dataNicolas Pitre Wed, 22 Feb 2006 01:45:36 +0000 (20:45 -0500)

diff-delta: produce optimal pack data

Indexing based on adler32 has a match precision based on the block size
(currently 16). Lowering the block size would produce smaller deltas
but the indexing memory and computing cost increases significantly.

For optimal delta result the indexing block size should be 3 with an
increment of 1 (instead of 16 and 16). With such low params the adler32
becomes a clear overhead increasing the time for git-repack by a factor
of 3. And with such small blocks the adler 32 is not very useful as the
whole of the block bits can be used directly.

This patch replaces the adler32 with an open coded index value based on
3 characters directly. This gives sufficient bits for hashing and
allows for optimal delta with reasonable CPU cycles.

The resulting packs are 6% smaller on average. The increase in CPU time
is about 25%. But this cost is now hidden by the delta reuse patch
while the saving on data transfers is always there.

Signed-off-by: Nicolas Pitre <nico@cam.org>
Signed-off-by: Junio C Hamano <junkio@cox.net>

diff-delta: big code simplificationNicolas Pitre Wed, 22 Feb 2006 01:43:17 +0000 (20:43 -0500)

diff-delta: big code simplification

This is much smaller and hopefully clearer code now.

Signed-off-by: Nicolas Pitre <nico@cam.org>
Signed-off-by: Junio C Hamano <junkio@cox.net>

diff-delta: fold two special tests into one plus cleanupsNicolas Pitre Wed, 22 Feb 2006 01:41:41 +0000 (20:41 -0500)

diff-delta: fold two special tests into one plus cleanups

Testing for realloc and size limit can be done with only one test per
loop. Make it so and fix a theoretical off-by-one comparison error in
the process.

The output buffer memory allocation is also bounded by max_size when
specified.

Finally make some variable unsigned to allow the handling of files up to
4GB in size instead of 2GB.

Signed-off-by: Nicolas Pitre <nico@cam.org>
Signed-off-by: Junio C Hamano <junkio@cox.net>

relax delta selection filtering in pack-objectsNicolas Pitre Wed, 22 Feb 2006 01:39:25 +0000 (20:39 -0500)

relax delta selection filtering in pack-objects

This change provides a 8% saving on the pack size with a 4% CPU time
increase for git-repack -a on the current git archive.

Signed-off-by: Nicolas Pitre <nico@cam.org>
Signed-off-by: Junio C Hamano <junkio@cox.net>

Merge git://git.kernel.org/pub/scm/gitk/gitkJunio C Hamano Wed, 22 Feb 2006 08:35:18 +0000 (00:35 -0800)

Merge git://git.kernel.org/pub/scm/gitk/gitk

* git://git.kernel.org/pub/scm/gitk/gitk:
gitk: Make "find" on "Files" work again.

Merge branch 'fix'Junio C Hamano Wed, 22 Feb 2006 08:35:07 +0000 (00:35 -0800)

Merge branch 'fix'

* fix:
git-push: Update documentation to describe the no-refspec behavior.
format-patch: pretty-print timestamp correctly.
git-add: Add support for --, documentation, and test.

Merge branch 'jc/perl'Junio C Hamano Wed, 22 Feb 2006 06:51:21 +0000 (22:51 -0800)

Merge branch 'jc/perl'

* jc/perl:
cvsimport: avoid open "-|" list form for Perl 5.6
svnimport: avoid open "-|" list form for Perl 5.6
send-email: avoid open "-|" list form for Perl 5.6
rerere: avoid open "-|" list form for Perl 5.6
fmt-merge-msg: avoid open "-|" list form for Perl 5.6

Merge branch 'jc/pack-reuse'Junio C Hamano Wed, 22 Feb 2006 06:38:43 +0000 (22:38 -0800)

Merge branch 'jc/pack-reuse'

* jc/pack-reuse:
pack-objects: avoid delta chains that are too long.
git-repack: allow passing a couple of flags to pack-objects.
pack-objects: finishing touches.
pack-objects: reuse data from existing packs.

Merge branch 'jc/nostat'Junio C Hamano Wed, 22 Feb 2006 06:33:21 +0000 (22:33 -0800)

Merge branch 'jc/nostat'

* jc/nostat:
cache_name_compare() compares name and stage, nothing else.
"assume unchanged" git: documentation.
ls-files: split "show-valid-bit" into a different option.
"Assume unchanged" git: --really-refresh fix.
ls-files: debugging aid for CE_VALID changes.
"Assume unchanged" git: do not set CE_VALID with --refresh
"Assume unchanged" git

Merge branch 'js/portable'Junio C Hamano Wed, 22 Feb 2006 06:28:40 +0000 (22:28 -0800)

Merge branch 'js/portable'

* js/portable:
Fix "gmake -j"
Really honour NO_PYTHON
avoid makefile override warning
Fixes for ancient versions of GNU make

git-push: Update documentation to describe the no-refsp... Carl Worth Wed, 22 Feb 2006 04:28:50 +0000 (20:28 -0800)

git-push: Update documentation to describe the no-refspec behavior.

It turns out that the git-push documentation didn't describe what it
would do when not given a refspec, (not on the command line, nor in a
remotes file). This is fairly important for the user who is trying to
understand operations such as:

git clone git://something/some/where
# hack, hack, hack
git push origin

I tracked the mystery behavior down to git-send-pack and lifted the
relevant portion of its documentation up to git-push, (namely that all
refs existing both locally and remotely are updated).

Signed-off-by: Carl Worth <cworth@cworth.org>
Signed-off-by: Junio C Hamano <junkio@cox.net>

gitview: Use monospace font to draw the branch and... aneesh.kumar@gmail.com Tue, 21 Feb 2006 10:30:10 +0000 (16:00 +0530)

gitview: Use monospace font to draw the branch and tag name

This patch address the below:
Use monospace font to draw branch and tag name
set the font size to 13.
Make the graph column resizable. This helps to accommodate large tag names

Signed-off-by: Aneesh Kumar K.V <aneesh.kumar@gmail.com>
Signed-off-by: Junio C Hamano <junkio@cox.net>

gitview: Read tag and branch information using git... aneesh.kumar@gmail.com Tue, 21 Feb 2006 10:30:04 +0000 (16:00 +0530)

gitview: Read tag and branch information using git ls-remote

This fix the below bug

Junio C Hamano <junkio@cox.net> writes:

>
> It does not work in my repository, since you do not seem to
> handle branch and tag names with slashes in them. All of my
> topic branches live in directories with two-letter names
> (e.g. ak/gitview).

Signed-off-by: Aneesh Kumar K.V <aneesh.kumar@gmail.com>
Signed-off-by: Junio C Hamano <junkio@cox.net>

git-ls-files: Fix, document, and add test for --error... Carl Worth Tue, 21 Feb 2006 20:48:30 +0000 (12:48 -0800)

git-ls-files: Fix, document, and add test for --error-unmatch option.

Signed-off-by: Junio C Hamano <junkio@cox.net>

Fix typo in git-rebase.sh.Jason Riedy Tue, 21 Feb 2006 20:56:14 +0000 (12:56 -0800)

Fix typo in git-rebase.sh.

s/upsteram/upstream in git-rebase.sh.

Signed-off-by: Jason Riedy <ejr@cs.berkeley.edu>
Signed-off-by: Junio C Hamano <junkio@cox.net>

New test to verify that when git-clone fails it cleans... Carl Worth Fri, 17 Feb 2006 21:33:28 +0000 (13:33 -0800)

New test to verify that when git-clone fails it cleans up the new directory.

Signed-off-by: Carl Worth <cworth@cworth.org>
Signed-off-by: Junio C Hamano <junkio@cox.net>

Merge branch 'pj/portable'Junio C Hamano Wed, 22 Feb 2006 02:16:29 +0000 (18:16 -0800)

Merge branch 'pj/portable'

* pj/portable:
Makefile tweaks: Solaris 9+ dont need iconv / move up uname variables

format-patch: pretty-print timestamp correctly.Junio C Hamano Wed, 22 Feb 2006 02:13:32 +0000 (18:13 -0800)

format-patch: pretty-print timestamp correctly.

Perl is not C and does not truncate the division result. Arghh!

Signed-off-by: Junio C Hamano <junkio@cox.net>

git-rebase: Clarify usage statement and copy it into... Carl Worth Wed, 22 Feb 2006 01:10:12 +0000 (17:10 -0800)

git-rebase: Clarify usage statement and copy it into the actual documentation.

I found a paper thin man page for git-rebase, but was quite happy to
see something much more useful in the usage statement of the script
when I went there to find out how this thing worked. Here it is
cleaned up slightly and expanded a bit into the actual documentation.

Signed-off-by: Carl Worth <cworth@cworth.org>
Signed-off-by: Junio C Hamano <junkio@cox.net>

git-add: Add support for --, documentation, and test.Carl Worth Tue, 21 Feb 2006 23:33:49 +0000 (15:33 -0800)

git-add: Add support for --, documentation, and test.

This adds support to git-add to allow the common -- to separate
command-line options and file names. It adds documentation and a new
git-add test case as well.

[jc: this should apply to 1.2.X maintenance series, so I reworked
git-ls-files --error-unmatch test. ]

Signed-off-by: Junio C Hamano <junkio@cox.net>

Fix "gmake -j"Johannes Schindelin Tue, 21 Feb 2006 12:08:21 +0000 (13:08 +0100)

Fix "gmake -j"

In my attempt to port git to IRIX, I broke it. Sorry.

Signed-off-by: Johannes E. Schindelin <Johannes.Schindelin@gmx.de>
Signed-off-by: Junio C Hamano <junkio@cox.net>

Merge branch 'fk/blame' into nextJunio C Hamano Tue, 21 Feb 2006 09:08:21 +0000 (01:08 -0800)

Merge branch 'fk/blame' into next

* fk/blame:
Add git-blame, a tool for assigning blame.

Merge branch 'pj/portable' into nextJunio C Hamano Tue, 21 Feb 2006 09:07:57 +0000 (01:07 -0800)

Merge branch 'pj/portable' into next

* pj/portable:
Makefile tweaks: Solaris 9+ dont need iconv / move up uname variables
Merge part of jc/portable branch
git-mktree: reverse of git-ls-tree.
Merge branch 'lt/merge-tree'
Merge branch 'jc/ident'
cherry-pick/revert: error-help message rewording.
Fix fmt-merge-msg counting.

Makefile tweaks: Solaris 9+ dont need iconv / move... Paul Jakma Mon, 20 Feb 2006 23:36:28 +0000 (23:36 +0000)

Makefile tweaks: Solaris 9+ dont need iconv / move up uname variables

- Solaris 9 and up do not need -liconv, so NEEDS_LIBICONV should be set
only for S8.
- Move the declaration of the uname variables to early in the Makefile
so they can be referenced by prefix and gitexecdir variables.
- gitexecdir defaults to being same as bindir, it might as well reference
that variable.

[jc: corrupt patch, sneakily tried to remove inclusion of GIT-VERSION-FILE
I do not know why I am applying this...]

Signed-off-by: Paul Jakma <paul@quagga.net>
Signed-off-by: Junio C Hamano <junkio@cox.net>

Add git-blame, a tool for assigning blame.Fredrik Kuivinen Mon, 20 Feb 2006 23:40:54 +0000 (00:40 +0100)

Add git-blame, a tool for assigning blame.

I have also been working on a blame program. The algorithm is pretty
much the one described by Junio in his blame.perl. My variant doesn't
handle renames, but it shouldn't be too hard to add that. The output
is minimal, just the line number followed by the commit SHA1.

An interesting observation is that the output from my git-blame and
your git-annotate doesn't match on all files in the git
repository. One example where several lines differ is read-cache.c. I
haven't investigated it further to find out which one is correct.

The code should be considered as a work in progress. It certainly has
a couple of rough edges. The output looks fairly sane on the few files
I have tested it on, but it wouldn't be too surprising if it gets some
cases wrong.

[jc: adding it to pu for wider comments. I did minimum
whitespace fixups but it still needs an indent run and
-Wdeclaration-after-statement fixups.]

Signed-off-by: Fredrik Kuivinen <freku045@student.liu.se>
Signed-off-by: Junio C Hamano <junkio@cox.net>

Merge part of jc/portable branchJunio C Hamano Tue, 21 Feb 2006 08:52:18 +0000 (00:52 -0800)

Merge part of jc/portable branch

git-mktree: reverse of git-ls-tree.Junio C Hamano Tue, 21 Feb 2006 05:50:01 +0000 (21:50 -0800)

git-mktree: reverse of git-ls-tree.

This reads data in the format a (non recursive) ls-tree outputs
and writes a tree object to the object database. The created
tree object name is output to the standard output.

For convenience, the input data does not need to be sorted; the
command sorts the input lines internally.

By request from Tommi Virtanen.

Signed-off-by: Junio C Hamano <junkio@cox.net>

Merge branch 'lt/merge-tree'Junio C Hamano Tue, 21 Feb 2006 08:49:38 +0000 (00:49 -0800)

Merge branch 'lt/merge-tree'

* lt/merge-tree:
git-merge-tree: generalize the "traverse <n> trees in sync" functionality
Handling large files with GIT
Handling large files with GIT

Merge branch 'jc/ident'Junio C Hamano Tue, 21 Feb 2006 08:46:07 +0000 (00:46 -0800)

Merge branch 'jc/ident'

* jc/ident:
Keep Porcelainish from failing by broken ident after making changes.
Delay "empty ident" errors until they really matter.
Make "empty ident" error message a bit more helpful.

cherry-pick/revert: error-help message rewording.Junio C Hamano Tue, 21 Feb 2006 08:28:04 +0000 (00:28 -0800)

cherry-pick/revert: error-help message rewording.

It said "after fixing up, commit the result using -F .msg", but
it was not clear for new people how "fix up" should be done.
Hint "git-update-index <path>".

We could recommend "git commit -a -F .msg" instead, but I am
hesitant to give that suggestion in the blind -- you could do a
cherry-pick, revert or a merge in general in a dirty working
tree as long as local modifications do not overlap with the
merge, but using "commit -a" would include them in the result.

Signed-off-by: Junio C Hamano <junkio@cox.net>

Fix fmt-merge-msg counting.Junio C Hamano Tue, 21 Feb 2006 03:26:21 +0000 (19:26 -0800)

Fix fmt-merge-msg counting.

Signed-off-by: Junio C Hamano <junkio@cox.net>

Merge branch 'jc/perl' into nextJunio C Hamano Mon, 20 Feb 2006 22:25:50 +0000 (14:25 -0800)

Merge branch 'jc/perl' into next

* jc/perl:
cvsimport: avoid open "-|" list form for Perl 5.6
svnimport: avoid open "-|" list form for Perl 5.6
send-email: avoid open "-|" list form for Perl 5.6
rerere: avoid open "-|" list form for Perl 5.6
fmt-merge-msg: avoid open "-|" list form for Perl 5.6

Merge branch 'ra/anno' into nextJunio C Hamano Mon, 20 Feb 2006 22:25:46 +0000 (14:25 -0800)

Merge branch 'ra/anno' into next

* ra/anno:
Add git-annotate, a tool for assigning blame.
git-svn: 0.9.1: add --version and copyright/license (GPL v2+) information
contrib/git-svn: add Makefile, test, and associated ignores
git-svn: fix several corner-case and rare bugs with 'commit'
contrib/git-svn.txt: add a note about renamed/copied directory support
git-svn: change ; to && in addremove()
git-svn: remove any need for the XML::Simple dependency
git-svn: Allow for more argument types for commit (from..to)
git-svn: allow --find-copies-harder and -l<num> to be passed on commit
git-svn: fix a typo in defining the --no-stop-on-copy option

cvsimport: avoid open "-|" list form for Perl 5.6Junio C Hamano Mon, 20 Feb 2006 22:17:28 +0000 (14:17 -0800)

cvsimport: avoid open "-|" list form for Perl 5.6

Signed-off-by: Junio C Hamano <junkio@cox.net>

svnimport: avoid open "-|" list form for Perl 5.6Junio C Hamano Mon, 20 Feb 2006 22:14:15 +0000 (14:14 -0800)

svnimport: avoid open "-|" list form for Perl 5.6

Signed-off-by: Junio C Hamano <junkio@cox.net>

send-email: avoid open "-|" list form for Perl 5.6Junio C Hamano Mon, 20 Feb 2006 22:23:51 +0000 (14:23 -0800)

send-email: avoid open "-|" list form for Perl 5.6

Signed-off-by: Junio C Hamano <junkio@cox.net>

rerere: avoid open "-|" list form for Perl 5.6Junio C Hamano Mon, 20 Feb 2006 22:02:47 +0000 (14:02 -0800)

rerere: avoid open "-|" list form for Perl 5.6

Signed-off-by: Junio C Hamano <junkio@cox.net>

fmt-merge-msg: avoid open "-|" list form for Perl 5.6Junio C Hamano Mon, 20 Feb 2006 22:02:25 +0000 (14:02 -0800)

fmt-merge-msg: avoid open "-|" list form for Perl 5.6

Signed-off-by: Junio C Hamano <junkio@cox.net>

Add git-annotate, a tool for assigning blame.Ryan Anderson Mon, 20 Feb 2006 10:46:09 +0000 (05:46 -0500)

Add git-annotate, a tool for assigning blame.

Signed-off-by: Ryan Anderson <ryan@michonline.com>
Signed-off-by: Junio C Hamano <junkio@cox.net>

git-svn: 0.9.1: add --version and copyright/license... Eric Wong Mon, 20 Feb 2006 18:57:29 +0000 (10:57 -0800)

git-svn: 0.9.1: add --version and copyright/license (GPL v2+) information

Signed-off-by: Eric Wong <normalperson@yhbt.net>
Signed-off-by: Junio C Hamano <junkio@cox.net>

contrib/git-svn: add Makefile, test, and associated... Eric Wong Mon, 20 Feb 2006 18:57:29 +0000 (10:57 -0800)

contrib/git-svn: add Makefile, test, and associated ignores

Signed-off-by: Eric Wong <normalperson@yhbt.net>
Signed-off-by: Junio C Hamano <junkio@cox.net>

git-svn: fix several corner-case and rare bugs with... Eric Wong Mon, 20 Feb 2006 18:57:28 +0000 (10:57 -0800)

git-svn: fix several corner-case and rare bugs with 'commit'

None of these were really show-stoppers (or even triggered)
on most of the trees I've tracked.

* Node change prevention for identically named nodes. This is
a limitation of SVN, but we find the error and exit before
it's passed to SVN so we don't dirty our working tree when our
commit fails. git-svn will exit with an error code 1 if any
of the following conditions are found:

1. a directory is removed and a file of the same name of the
removed directory is created
1a. a file has its parent directory removed and the file is
takes the name of the removed parent directory::
baz/zzz => baz
2. a file is removed and a directory of the same name of the
removed file is created.
2a. a file is moved into a deeper directory that shares the
previous name of the file::
dir/$file => dir/file/$file

Since SVN cannot handle these cases, the user will have to
manually split the commit into several parts.

* --rmdir now handles nested/deep removals. If dir/a/b/c/d/e/file
is removed, and everything else is in the dir/ hierarchy is
otherwise empty, then dir/ will be deleted when file is deleted
from svn and --rmdir specified.

* Always assert that we have written the tree we want to write
on commits. This helped me find several bugs in the symlink
handling code (which as been fixed).

* Several symlink handling fixes. We now refuse to set
permissions on symlinks. We also always unlink a file
if we're going to overwrite it.

* Apply changes in a pre-determined order, so we always have
rename from locations handy before we delete them.

Signed-off-by: Eric Wong <normalperson@yhbt.net>
Signed-off-by: Junio C Hamano <junkio@cox.net>

contrib/git-svn.txt: add a note about renamed/copied... Eric Wong Mon, 20 Feb 2006 18:57:28 +0000 (10:57 -0800)

contrib/git-svn.txt: add a note about renamed/copied directory support

Signed-off-by: Eric Wong <normalperson@yhbt.net>
Signed-off-by: Junio C Hamano <junkio@cox.net>

git-svn: change ; to && in addremove()Eric Wong Mon, 20 Feb 2006 18:57:28 +0000 (10:57 -0800)

git-svn: change ; to && in addremove()

Signed-off-by: Eric Wong <normalperson@yhbt.net>
Signed-off-by: Junio C Hamano <junkio@cox.net>

git-svn: remove any need for the XML::Simple dependencyEric Wong Mon, 20 Feb 2006 18:57:28 +0000 (10:57 -0800)

git-svn: remove any need for the XML::Simple dependency

XML::Simple was originally required back when I made svn-arch-mirror
because I needed to explictly track renames with Arch. Then I carried
it over to git-svn because I was afraid somebody could commit an svn
log message that could throw off a non-XML log parser. Then I noticed
the <n> lines column in the header. So, no more XML :)

Signed-off-by: Eric Wong <normalperson@yhbt.net>
Signed-off-by: Junio C Hamano <junkio@cox.net>

git-svn: Allow for more argument types for commit ... Eric Wong Mon, 20 Feb 2006 18:57:26 +0000 (10:57 -0800)

git-svn: Allow for more argument types for commit (from..to)

Allow 'from..to' notation from the command line.

More liberal sha1 parsing when reading from stdin no longer requires the
sha1 to be the first character, so a leading 'commit ' string is OK.

Signed-off-by: Eric Wong <normalperson@yhbt.net>
Signed-off-by: Junio C Hamano <junkio@cox.net>

git-svn: allow --find-copies-harder and -l<num> to... Eric Wong Mon, 20 Feb 2006 18:57:26 +0000 (10:57 -0800)

git-svn: allow --find-copies-harder and -l<num> to be passed on commit

Both of these options are passed directly to git-diff-tree when
committing to a SVN repository.

Signed-off-by: Eric Wong <normalperson@yhbt.net>
Signed-off-by: Junio C Hamano <junkio@cox.net>

git-svn: fix a typo in defining the --no-stop-on-copy... Eric Wong Mon, 20 Feb 2006 18:57:25 +0000 (10:57 -0800)

git-svn: fix a typo in defining the --no-stop-on-copy option

Just a typo, I doubt anybody would use (and I highly recommend not
using) this option anyways. But you never know...

Signed-off-by: Eric Wong <normalperson@yhbt.net>
Signed-off-by: Junio C Hamano <junkio@cox.net>

Merge branch 'jc/pack-thin' into nextJunio C Hamano Mon, 20 Feb 2006 08:45:38 +0000 (00:45 -0800)

Merge branch 'jc/pack-thin' into next

* jc/pack-thin:
Use thin pack transfer in "git fetch".
Add git-push --thin.

Use thin pack transfer in "git fetch".Junio C Hamano Mon, 20 Feb 2006 08:38:39 +0000 (00:38 -0800)

Use thin pack transfer in "git fetch".

Signed-off-by: Junio C Hamano <junkio@cox.net>

Add git-push --thin.Junio C Hamano Mon, 20 Feb 2006 08:09:41 +0000 (00:09 -0800)

Add git-push --thin.

Maybe we would want to make this default before it graduates to
the master branch, but in the meantime to help testing things,
this allows you to say "git push --thin destination".

Signed-off-by: Junio C Hamano <junkio@cox.net>