Hacker Read top | best | new | newcomments | leaders | about | bookmarklet login

Nice. It will be interesting to see how the Dataset API matures over time.


sort by: page size:

Will be interesting to play around with that dataset.

This is wonderful news. Anyone here get started on projects leveraging this dataset yet?

This is great - looking forward to seeing the available datasets grow!

I am working on this for a different definition of term dataset. I started learning deep learning which led me to start building datasets.

Wanting to store versions of the datasets efficiently I started building a version control system for them. It tracks objects and annotations and can roll back to any point in time. It helps answer questions like what has changed since the last release and which user made which changes.

Still working on the core library but I'm excited for it.


Oh nice, could be helpful if this takes off with the data!

Hey thanks! If you're still reading this, I just soft-launched (I will get an email but I plan on automate it) a way to add new datasets

Great, I would be interested on how you built the dataset/database. Please share when you have the post.

Nice! I didn't realize so much had been done on dataprotocols.org yet!

That dataset looks cool. Good work either way, I'm sure it'll go somewhere

Looking forward to that, it will be a very nice data set

Great work! Love to see more datasets coming!

That's pretty awesome. Please keep in touch, let's see if we can apply the tool on the new data sets.

Love it! It'd be nice to have a couple more example datasets, and I'd promote them to the top level (rather than in 'more'). I think the first thing many users will want to do is just try to tool with some pre-provided datasets - and when it works nicely (which it does!) then maybe import their own data

Yeah it's a pretty fun dataset that you can do a lot of stuff with. Very slept on.

Neat way for Microsoft to have other developers build their labeled datasets, I guess.

This seems like something that could be useful for data scientists.

Nice! Glad it resonated. Never quite sure how a project like this will land.

Thanks for sharing those - will check them out. Interested to see what happens as the size of the dataset grows.

I have not looked deeply, but Typesense[1] seems like another interesting project. Similar to ES or Algolia, easy to self-host, & with a seemingly efficient memory & disk footprint.

[1]https://github.com/typesense/typesense


That's one of the most interesting dataset I've seen released. I know what I'll be playing with this weekend :)

Thanks! Custom dataset indeed on the todo list. The feature is half done already, I should be able to release it soon.
next

Legal | privacy