Hacker Read top | best | new | newcomments | leaders | about | bookmarklet login

Would be nice if this dataset was open sourced and the community could add more candidates.


sort by: page size:

Would be nice if there was an open source version of this, where the data was published for the public to learn from

Yes, please open source this data! Making it easily searched is a great public service but technical people can do even more with the text data.

According to this page, they have already open sourced some datasets: https://crowdsource.google.com/about/open-source/

This is awesome and I can't believe I've never found it even though it's over ten years old. Anyone know of any open projects that use these data that are taking on contributors?

Perhaps you should let some of the open data citizen groups know about this so they can add more data. Also, if you haven't already then take a look at CKAN[1] for datasets to add.

[1] http://ckan.net/


Good. Hopefully this data will be open sourced.

Are you planning to open source the dataset generated?

I wish the work you did could be open sourced, including the data set. Work like that is lost to humanity because it's done for only a few entities, and once they are being replaced, the initial work disapear with it.

This is great, would love to see support for more data sources.

Do you happen to know if their dataset is available to the public?

I think this is a great idea.

Having access to the data set itself could unlock a lot of new, creative ideas and applications beyond the expected ones. That's one great thing I've learned from the open source community.

It could not only be used for search, but some data analysis, and what not. I think it would be fairly beneficial for github to do it, actually. Easy to work with, up to date dataset -> interesting projects -> github brand value++.


I'm looking to play with some open source data sets, but I can't find any with anything particularly interesting. Or the interesting ones aren't big enough.

Put some open datasets link. let's make some Open Source Deep Learning projects.

Yes exactly! It would be nice if this data were more opened up so that tools around the data could be created.

Do you intend to make the dataset available? I've attempted to do something similar, but had difficulty, would love access to the dataset.

Now open source the data. :)

Check out one of Data for Democracy's projects and see if you'd like to contribute to any of their open source work: http://datafordemocracy.org/

Sounds interesting! I work with these data, is any of that going to be Open Source or commercial only?

Is the data open sourced too?
next

Legal | privacy