was this when they decided, nah fk it, let's just acquire the thing? https://blogs.msdn.microsoft.com/bharry/2017/05/24/the-large...

I've heard many people claim Microsoft has the largest git repo and I've never heard anyone successfully dispute that claim.

they have definitely changed the source control infrastructure and are now on git.

https://blogs.msdn.microsoft.com/bharry/2017/05/24/the-large...


I believe the entire Windows codebase is in a git monorepo now, and has been for some years: https://devblogs.microsoft.com/bharry/the-largest-git-repo-o...

> the complete editing history of the Linux kernel (probably the largest Git repo ever created)

Linux is definitely not the largest git repo ever created [0]. The big corporate monorepos are definitely larger; I know MS has moved Windows to git, and MS itself claims it is the largest ever created (~300GB as of 2017, per [1]). Google and Facebook both eschew git, though.

Finding data on the largest open repos is more difficult. The largest classes of projects are those that develop in monorepos that implement critical operating system [2] functionality, browser engines, and compiler implementations. The shortlist I'd make comes out to these projects (in no particular order):

* gcc

* LLVM

* Mozilla

* Chromium

* Linux

* OpenJDK

I haven't finished downloading all of these repos (my disk is begging me to stop right now), but it looks like Linux is larger than gecko-dev by a very thin margin (so a putative gecko-dev that included comm-central with its CVS history as well would easily outstrip Linux), and Chromium seems to be an order of magnitude larger than both.

[0] To be clear here, I'm mostly thinking in terms of primarily textual repositories. Repositories with large binary assets are clearly not relevant for your purposes.

[1] https://devblogs.microsoft.com/bharry/the-largest-git-repo-o..., although https://news.ycombinator.com/item?id=14411724 claims that the 300 GB measures the size of the checked-out directory on disk, not the putative size of a full .git folder.

[2] I'm including both kernel roles as well as key userspace roles. Qt and Gnome would both be on my list of putative largest repos were they monorepos, but they appear to use many small repos instead.
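The distinction raised in [1], between the size of a checked-out directory and the size of the .git folder, can be checked on any local clone. Below is a minimal sketch, assuming a standard non-bare clone with a `.git` directory at its root; the helper names `dir_size` and `repo_sizes` are made up for illustration, and it sums apparent file sizes rather than actual disk blocks.

```python
import os


def dir_size(path, skip=()):
    """Sum apparent file sizes under path, pruning directory names in skip."""
    total = 0
    for root, dirs, files in os.walk(path):
        # Prune in place so os.walk does not descend into skipped directories.
        dirs[:] = [d for d in dirs if d not in skip]
        for name in files:
            full = os.path.join(root, name)
            # Ignore symlinks so link targets are not counted.
            if not os.path.islink(full):
                total += os.path.getsize(full)
    return total


def repo_sizes(repo):
    """Return byte counts for the .git folder and for the checked-out files.

    Assumption: repo is a normal clone whose history lives in repo/.git.
    Any nested directory named .git is also excluded from the checkout count.
    """
    return {
        "history": dir_size(os.path.join(repo, ".git")),
        "checkout": dir_size(repo, skip={".git"}),
    }
```

Calling `repo_sizes(".")` from the root of a clone returns both numbers in bytes, which makes it clear which of the two a quoted figure such as 300 GB would correspond to.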


Keep in mind that to manage the insane scale, Microsoft internally hides git behind a custom virtual file system:

https://vfsforgit.org/

https://github.com/Microsoft/VFSForGit


Interesting tidbit:

> The developers making Microsoft Windows use Git


Google and Facebook don't use Git. Microsoft really loves Git and GitHub.

I didn't know Microsoft used git. Lolz.

Microsoft definitely does use git; whether it's a monorepo, I am not so sure.

Or Microsoft (Windows is using Git).

Considering how comparatively rare it is, it's unlikely to ever be supported natively. Microsoft has invested a lot into git and now uses it for the Windows codebase, which is the largest repo by several orders of magnitude.

Most companies of this size don't use git. ;)

It's worth pointing out that they have Git support built in as well.

reminds me of a certain hub that uses git.

You are actually right and the downvoters have no clue (and this is what I hate about HN: if you don't comment, then don't downvote).

To answer your question though - Microsoft has a lot of git extensions that they are slowly submitting upstream. Hence git is usable for their megarepo.

Look up GVFS on GitHub, very cool work.

Same for Twitter (they are not submitting upstream though as far as I know)

Similar for fb/mercurial


Nice how they cite their git repository in the references.

Something this article goes into in more detail with regard to Git:

https://blogs.msdn.microsoft.com/devops/2017/05/30/optimizin...
