
That should be easily solvable using 3D convolutions and processing a short clip (~10 frames) instead of a single picture.
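A minimal sketch of that idea, using a naive NumPy 3D convolution over a short clip; the clip size and the single random 3x3x3 spatiotemporal kernel are placeholders, not a real trained model:

```python
import numpy as np

def conv3d(clip, kernel):
    """Naive valid-mode 3D convolution over a (frames, H, W) clip."""
    kf, kh, kw = kernel.shape
    f, h, w = clip.shape
    out = np.zeros((f - kf + 1, h - kh + 1, w - kw + 1))
    for i in range(out.shape[0]):
        for j in range(out.shape[1]):
            for k in range(out.shape[2]):
                out[i, j, k] = np.sum(clip[i:i+kf, j:j+kh, k:k+kw] * kernel)
    return out

clip = np.random.rand(10, 32, 32)   # a ~10-frame clip instead of one image
kernel = np.random.rand(3, 3, 3)    # one placeholder spatiotemporal filter
features = conv3d(clip, kernel)     # responses across space AND time
```

The point is only that the kernel slides along the time axis too, so motion across neighboring frames contributes to each output value.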



I like to imagine doing this with 3D Gaussian splatting of the videos. Not computationally feasible yet.

Interesting idea. The issue is that (I'm guessing) you'd also need some intense software to decipher the image and convert it into a 3D file.

What's new about this? That it's faster? People have been reconstructing 3D images from multiple photos for over a decade. The experimental work today is constructing a 3D image from a single photo, using a neural net to fill in a reasonable model of the stuff you can't see.

Yep. There has been some work to figure out frame-to-frame coherence on a sequence of images in 2D. But, I think this skips over that problem by working in 3D.

What's the next step? Generating 3d scenes out of these images? Would that be feasible?

All of these videos that zoom into the static image get me every time I see one. Does anyone have any insights on how to create a somewhat accurate 3D image like this? I'm sure there's a dataset available for public use since it was publicly funded. I'm really curious how much RAM/CPU/GPU it takes, and what kind of render times are involved in making these types of videos.

I think that 3D scanners would be a perfect solution to this. It becomes easier and easier to digitize real world objects, even based on a single-camera video.

http://youtu.be/gu5Ywwb4RaU


The video is wild. Now we need an AI 3D infill to block in all the missing data at the edges of the view.

Here's something I'd like to see someone try.

Take pictures of a 3D object from, say, 100 different angles, remembering which angles they were taken from. Now use this Mona Lisa algorithm to put 3D polygons inside a box of sufficient size, keeping those that most resemble the original object when viewed from all those angles.

Will you get a compressed 3D representation of an object this way?
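A toy version of that keep-it-if-it-looks-right loop, with two simplifying assumptions: a voxel grid stands in for polygons, and axis-aligned sum-projections stand in for rendered camera views:

```python
import numpy as np

rng = np.random.default_rng(0)

# The "object" to reconstruct, and its three projected views.
target = np.zeros((8, 8, 8))
target[2:6, 2:6, 2:6] = 1.0
target_views = [target.sum(axis=a) for a in range(3)]

def error(scene):
    """How different the scene's projections look from the target's."""
    return sum(np.abs(scene.sum(axis=a) - v).sum()
               for a, v in enumerate(target_views))

scene = np.zeros_like(target)          # start with an empty box
best = error(scene)
for _ in range(5000):
    i, j, k = rng.integers(0, 8, size=3)
    scene[i, j, k] = 1.0 - scene[i, j, k]   # random mutation: toggle a voxel
    e = error(scene)
    if e < best:
        best = e                            # keep mutations that help...
    else:
        scene[i, j, k] = 1.0 - scene[i, j, k]  # ...revert the rest
```

Whether the result is "compressed" depends on the primitive count; a voxel grid isn't, but the same loop over a small set of polygons could be.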


Probably with some simple optical flow view morphing. There's enough data from all the cameras to enable creating a stereo image.
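A crude sketch of view morphing, assuming a dense flow field between the two camera views is already available; it forward-warps one image halfway along the flow, with no occlusion handling and nearest-pixel splatting:

```python
import numpy as np

def morph_halfway(img_a, flow):
    """Forward-warp img_a halfway along a dense (dy, dx) flow field,
    approximating the view midway between two cameras."""
    h, w = img_a.shape
    mid = np.zeros_like(img_a)
    for y in range(h):
        for x in range(w):
            dy, dx = flow[y, x]
            ty = int(round(y + 0.5 * dy))   # move only half the way
            tx = int(round(x + 0.5 * dx))
            if 0 <= ty < h and 0 <= tx < w:
                mid[ty, tx] = img_a[y, x]
    return mid

img = np.zeros((8, 8)); img[2, 2] = 1.0
flow = np.full((8, 8, 2), 4.0)   # toy flow: every pixel moves (+4, +4)
mid = morph_halfway(img, flow)   # the bright pixel lands at (4, 4)
```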

You'd need four pictures, two of each side, to capture enough information to generate a 3D model, instead of just the one.
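For recovering actual 3D positions, even two calibrated views suffice per point. A standard linear (DLT) triangulation sketch, with toy camera matrices chosen purely for illustration:

```python
import numpy as np

def triangulate(P1, P2, x1, x2):
    """Linear (DLT) triangulation of one 3D point from two views.
    P1, P2: 3x4 camera projection matrices; x1, x2: (u, v) image points."""
    A = np.array([
        x1[0] * P1[2] - P1[0],
        x1[1] * P1[2] - P1[1],
        x2[0] * P2[2] - P2[0],
        x2[1] * P2[2] - P2[1],
    ])
    _, _, Vt = np.linalg.svd(A)
    X = Vt[-1]                       # null vector = homogeneous 3D point
    return X[:3] / X[3]

# Toy setup: identity camera, and one translated 1 unit along x.
P1 = np.hstack([np.eye(3), np.zeros((3, 1))])
P2 = np.hstack([np.eye(3), np.array([[-1.0], [0.0], [0.0]])])
X_true = np.array([0.5, 0.2, 4.0])
x1 = X_true[:2] / X_true[2]                               # image point in view 1
x2 = (X_true[:2] + np.array([-1.0, 0.0])) / X_true[2]     # image point in view 2
X_est = triangulate(P1, P2, x1, x2)
```

With noise-free correspondences the SVD recovers the point exactly; real matches need more views or a robust solver.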

Better, but still vulnerable.


That's super interesting. Are you trying to do single-view image to 3D?

Wow, I really want to see a photogrammetry system that can turn that video into a reconstruction of the area.

The next step is video. Adding a temporal dimension will make it easier to extrapolate the true 3D shapes of recognized objects.

With enough training data I’m sure it can be done... I’ve seen CNNs that fill in 3D scenes and animate them from two images. I would guess this was a simpler problem?

A more effective version of this would capture a 3D depth map with the 2D image.
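Once you have a depth map alongside the 2D image, back-projecting it into a point cloud is a few lines with a pinhole camera model; the intrinsics (fx, fy, cx, cy) below are placeholders for real calibration values:

```python
import numpy as np

def depth_to_points(depth, fx, fy, cx, cy):
    """Back-project a depth map into an (N, 3) point cloud
    using a pinhole camera model."""
    h, w = depth.shape
    v, u = np.mgrid[0:h, 0:w]        # pixel coordinates
    z = depth
    x = (u - cx) * z / fx            # invert the pinhole projection
    y = (v - cy) * z / fy
    return np.stack([x, y, z], axis=-1).reshape(-1, 3)

depth = np.full((4, 4), 2.0)         # toy depth map: a flat wall 2 m away
points = depth_to_points(depth, fx=100.0, fy=100.0, cx=2.0, cy=2.0)
```

Each RGB pixel can then be attached to its corresponding point to give a colored cloud.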

Probably the easiest solution is a 360° video camera, since YT allows you to view them... but I'm not sure how good the quality is or how bad the distortion.

This is from last year's conference, running on a laptop in real time: https://www.youtube.com/watch?v=oJt3Ln8H03s

'Computer tricks' are already here, with full 3D reconstruction in real time.


Suggestion: Use "Photosynth"-style techniques to fuse the video frames from the moving cameras into a 3D model of the room.
