This is awesome and I can't believe I've never found it even though it's over ten years old. Anyone know of any open projects that use these data that are taking on contributors?
Perhaps you should let some of the open data citizen groups know about this so they can add more data. Also, if you haven't already, take a look at CKAN[1] for datasets to add.
I wish the work you did could be open sourced, including the data set. Work like that is lost to humanity because it's done for only a few entities, and once they are replaced, the initial work disappears with them.
Having access to the data set itself could unlock a lot of new, creative ideas and applications beyond the expected ones. That's one great thing I've learned from the open source community.
It could be used not only for search, but also for data analysis and whatnot. I think it would be fairly beneficial for GitHub to do it, actually. Easy to work with, up-to-date dataset -> interesting projects -> GitHub brand value++.
I'm looking to play with some open source data sets, but I can't find any that are particularly interesting. Or the interesting ones aren't big enough.