I think we will need to look at streaming scenes from a more powerful machine rather than creating them on board. I don't believe we are ready in any way to process that much information on such a small system at this time (the HoloLens also gives off quite a lot of heat from its on-board processor). The old Pi had trouble streaming higher bitrates, and since we still have the same Ethernet, I'm not sure how much is possible.
I'm waiting for the day when you can get a Raspberry Pi-style machine that will do full 1080p and transcode anything I throw at it in real time or faster. It can't be that far off, can it? A couple of years, maybe?
The Pi's biggest strength is its GPU. Using some of the OpenMAX APIs for video or 3D encoding would probably be a better use than a CPU-bound task like web serving.
The Pi is an interesting system in that it has a truly impressive GPU (according to the manufacturer, "capable of BluRay quality playback"), especially compared to its bottom-shelf CPU. That means you need everything encoded in H.264 or some other format the GPU can handle, but that's achievable.
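As a sketch of what "let the GPU do it" looks like in practice, here's one way to hand the encode off to the Pi's hardware block from Python via ffmpeg (`h264_v4l2m2m` is ffmpeg's name for the Pi's V4L2 mem2mem encoder; the helper function and filenames are illustrative, not from the thread):

```python
import shlex

def hw_encode_cmd(src, dst, bitrate="4M"):
    """Build an ffmpeg command that uses the Pi's hardware H.264
    encoder (exposed to ffmpeg as h264_v4l2m2m via V4L2 mem2mem),
    keeping the slow CPU out of the pixel-crunching path."""
    return [
        "ffmpeg", "-i", src,
        "-c:v", "h264_v4l2m2m",   # hardware encoder block
        "-b:v", bitrate,
        "-c:a", "copy",           # leave audio untouched
        dst,
    ]

# Print the shell-quoted command you'd run on the Pi:
print(shlex.join(hw_encode_cmd("in.mov", "out.mp4")))
```

The same idea works with GStreamer's `v4l2h264enc` element; the point is just that the encoder selection, not the container handling, is what decides whether the bottom-shelf CPU is in the loop.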
If bandwidth isn't much of an issue, you can just dump frames to a separate device for encoding. As an aside, I can't imagine someone using a Pi 5 just for camera usage (aside from projects needing cameras).
"In future we’ll have to do something, but for Pi 5 we feel the hardware encode is a mm^2 too far."
Sounds reasonable, given a fast CPU and less-than-optimal hw-accelerated encoding options. As for that "something", maybe:
1) Drop hw-accelerated encoding and decoding entirely, and use the freed-up silicon for much beefier CPUs (ones with bigger vector units, more cores, etc. Cortex-X?). That would be useful for any CPU-heavy application.
2) Include a hw encoder for a common, relatively 'heavy' codec, and a hw decoder for the same plus maybe others.
3) Only include decoder(s), as they seem to have done for the RPi 5.
4) Include some kind of flexible compute fabric that can be configured to do the heavy lifting for popular video codecs.
Combined with:
5) Move to a newer silicon node for higher efficiency and a larger transistor budget.
Whichever route a future RPi goes, imho hw-accelerated decoding is much more useful than encoding.
The only catch with the Pi might be decode/encode performance if you plan to do anything with the video on that side of things: you'd need video software that can take advantage of hardware acceleration on the Pi.
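A minimal sketch of what "taking advantage of hardware acceleration" can mean in practice, assuming ffmpeg is the video software in question (`h264_v4l2m2m` is ffmpeg's name for the Pi's V4L2 mem2mem H.264 decoder; the helper itself is hypothetical):

```python
def pick_h264_decoder(available):
    """Prefer the Pi's hardware H.264 decoder (exposed by ffmpeg
    as h264_v4l2m2m) and fall back to software decode if the
    build doesn't include it."""
    for name in ("h264_v4l2m2m", "h264"):
        if name in available:
            return name
    raise RuntimeError("no H.264 decoder available")

# `available` could come from parsing `ffmpeg -decoders` output:
print(pick_h264_decoder({"h264_v4l2m2m", "h264", "hevc"}))  # h264_v4l2m2m
print(pick_h264_decoder({"h264"}))                          # h264
```

Software that doesn't do this kind of probing silently falls back to CPU decode, which is exactly the trap being described.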
Personally I'd rather folks shoot high, aim for great, and if someone has special needs, they can transcode out their suboptimal version as they like.
Software rendering at 1080p should work just fine for everyone. Even if a Pi can't decode a high-bitrate AV1 stream in real time, it can probably re-encode a movie overnight, maybe in a day. That's a chore, yes, but in return we get to transfer the best quality available and get much better results.
I don't think there's any reason to wait. The good stuff should start getting seeded. We need media servers to help fill in the gap and do overnight, non-real-time transcoding.
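The overnight re-encode claim is easy to sanity-check with arithmetic (the 3 fps software-AV1 encode rate below is an assumed figure for a small ARM board, not a benchmark):

```python
def transcode_hours(duration_min, source_fps, encode_fps):
    """Rough wall-clock estimate for a non-real-time re-encode:
    total frames divided by the achievable encode rate."""
    frames = duration_min * 60 * source_fps
    return frames / encode_fps / 3600

# A 2-hour, 24 fps movie encoded at an assumed 3 fps software
# AV1 rate finishes well within a day:
print(round(transcode_hours(120, 24, 3), 1))  # 16.0 hours
```

So "overnight, maybe in a day" holds as long as the box manages even a few frames per second, which is the whole argument for non-real-time media servers.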
> You need way more processing power than an RPi to do this at 30fps, and C/C++, not Python. (There are literally dozens of projects for the RPi and TFlow online but they all get like 0.1 fps or less by using Flask and browser reload of a PNG... great for POC but not for real video)
I think 8 streams at 15 fps (aka 120 fps total) is possible with a ($35) Raspberry Pi 4 + ($75) Coral USB Accelerator. I say "I think" because I haven't tested on this exact setup yet. My Macbook Pro and Intel NUC are a lot more pleasant to experiment on (much faster compilation times). A few notes:
* I'm currently just using the coral.ai prebuilt 300x300 MobileNet SSD v2 models. I haven't done much testing but can see it has notable false negatives and positives. It'd be wonderful to put together some shared training data [1] to use for transfer learning. I think then results could be much better. Anyone interested in starting something? I'd be happy to contribute!
* iirc, I got the Coral USB Accelerator to do about 180 fps with this model. [edit: but don't trust my memory; it could have been as low as 100 fps.] It's easy enough to run detection at a lower frame rate than the input as well: do the H.264 decoding on every frame but only run inference at fixed PTS (presentation timestamp) intervals.
* You can also attach multiple Coral USB Accelerators to one system and make use of all of them.
* Decoding the 8 streams is likely possible on the Pi 4 depending on your resolution. I haven't messed with this yet, but I think it might even be possible in software, and the Pi has hardware H.264 decoding that I haven't tried to use yet.
* I use my cameras' 704x480 "sub" streams for motion detection and downsample that full image to the model's expected 300x300 input. Apparently some people do things like multiple inference against tiles of the image or running a second round of inference against a zoomed-in object detection region to improve confidence. That obviously increases the demand on both the CPU and TPU.
* The Orange Pi AI Stick Lite is crazy cheap ($20) and supposedly comparable to the Coral USB Accelerator in speed. At that price, if it works, buying one per camera doesn't sound too crazy. But I'm not sure if driver/toolchain support is any good. I have a PLAI Plug (basically the same thing but sold by the manufacturer). The PyTorch-based image classification on a prebuilt model works fine, but I don't have the software to build models or do object detection, so it's basically useless right now. They want to charge an unknown price for the missing software, but I think Orange Pi's rebrand might include it with the device?
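The decode-every-frame / infer-at-fixed-PTS-intervals idea above can be sketched as pure scheduling logic (the function and the numbers are illustrative, not from any real pipeline):

```python
def frames_to_infer(pts_list, interval):
    """Decode every frame, but only run the (expensive) detector
    when a frame's presentation timestamp reaches the next
    inference deadline. Returns the PTS values that get inference."""
    chosen, next_due = [], 0.0
    for pts in pts_list:
        if pts >= next_due:
            chosen.append(pts)
            next_due = pts + interval
    return chosen

# 30 fps input, one inference every 0.5 s -> ~2 fps of detector
# work per stream, so 8 streams fit easily within the TPU budget.
pts = [i / 30 for i in range(90)]  # 3 seconds of frames
print(len(frames_to_infer(pts, 0.5)))
```

This is also why a single accelerator rated around 100-180 fps can plausibly serve 8 cameras: the detector only sees a few frames per second per stream, while the hardware decoder handles the rest.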
I'm not at all familiar with video encoding/decoding processes or what they entail. Can you explain how hardware support for it would drastically change power requirements?
Video engineer here. Many seemingly network-restricted tasks could be unlocked by faster CPUs doing advanced compression and decompression.
1. Video Calls
In video calls, encoding and decoding are actually a significant cost, not just networking. Right now the peak is Zoom's 30 onscreen video streams, but with 1000x CPUs you could have hundreds of high-quality streams with advanced face detection and superscaling[1]. Advanced computer-vision models could analyze each face, creating a face mesh of vectors, then send those vector changes across the wire instead of a video frame. The receiving computers could then reconstruct the face for each frame. This could turn video calling into an entirely CPU-restricted task.
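To put rough numbers on the vectors-instead-of-frames idea (a back-of-the-envelope sketch: the 468-landmark count is borrowed from MediaPipe-style face meshes, and the 0.1 bits/pixel figure is an assumed typical H.264 ratio, not a measurement):

```python
def mesh_bytes(landmarks=468, coords=3, bytes_per=4):
    """Payload for one frame of face-mesh vectors:
    landmarks x (x, y, z) x float32."""
    return landmarks * coords * bytes_per

def frame_bytes(width=1280, height=720, bits_per_pixel=0.1):
    """Very rough H.264 frame size at an assumed compression
    ratio (~0.1 bits per pixel for a talking-head stream)."""
    return int(width * height * bits_per_pixel / 8)

print(mesh_bytes())    # 5616 bytes of mesh data per frame
print(frame_bytes())   # 11520 bytes per encoded 720p frame
```

Even with these generous assumptions the mesh is only about half the size of a compressed frame, and unlike a frame it compresses further as deltas between timesteps; the real saving is that all the reconstruction work moves onto the receiving CPU.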
2. Incredibly Realistic and Vast Virtual Worlds
Imagine the most advanced movie-realistic CGI being generated for each frame. Something like the new Lion King, or Avatar-like worlds, being created before you through your VR headset. With extremely advanced eye tracking and graphics, VR would hit that next level of realism. AR and VR use cases could explode with incredibly light headsets.
To be imaginative, you could have everything from huge concerts to regular meetings take place in the real world, but be scanned and sent to VR participants in real time. The entire space, including the room and whiteboard or live audience, could be rendered in real time for all VR participants.
Currently they're bandwidth bound, but I'm working on dissolving that. In the next year I imagine it'll become more memory bound. Then that will turn into being I/O bound, but that can be solved with memory and more clever caching schemes. CPU should never be an issue unless I move the video encoding back in house, but I don't see that happening soon, and I would separate that into its own server farm anyway.
I think this overstates it a bit. I know what hardware video encoding is, but I still think I'm part of the target audience for the Pi.
I have several Pis around the house that I use for various projects, and none of them have ever involved video encoding. The one thing I can think of where it might be beneficial is my Plex server, but that's an x86 machine right now, and it'll probably stay that way.
There is a video of NVIDIA's video-pipeline framework that might give a hint of what is possible. The Nano has H.264 encoders and decoders that can feed eight simultaneous 720p streams.
I presume this is over Ethernet or from flash, but it might not be applicable to CSI, since the CSI port/lane count seems lowish. Never tried it, though.