OpenCV 5 Is Here: The Biggest Leap in Years for Computer Vision

opencv.org - 532 poäng - 100 kommentarer - 303002 sekunder sedan

Kommentarer (21)

plasticeagle - 28146 sekunder sedan
The thing I love about OpenCV is that it remains hands down the best library for simply loading images and video. I've never even used any of its fancy computer vision features, but if I need to load a video file and look at the pixels - which I did need to do recently for an art project - OpenCV does it in about four lines of code.
pzo - 25139 sekunder sedan
Quite a good release although not sure why they invest so much time into their ONNX engine. I don't think they have enough stuff and big pockets to compete with ONNXRuntime, CoreAI, ExecuTorch, LiteRT.
I'm happy they added option for ONNXRuntime. I wish their cv.dnn was mostly that unified wrapper around many different backends (ONNXRuntime, Executorch, LiteRT, CoreAI) and maybe just some tooling around it (performance metrics tools, model downloads etc). Transformers(.js) approach looks better for me.
Wish they also invested more time into better production ready Camera I/O (for mobiles, device/format discovery, manual settings, depthmap support, etc) and better Highgui that could use different backends (skia, webgpu) and on mobiles.
ftchd - 38024 sekunder sedan
> One practical detail is worth knowing. The new engine is CPU-only at the moment, so if you select a non-CPU backend and target (for example CUDA or OpenVINO through setPreferableBackend and setPreferableTarget), you will want the classic engine.
So there's room for even better performance!
boredemployee - 7980 sekunder sedan
How can I learn the practical side of computer vision in 2026?
I'm not interested in understanding papers or the math behind it, but rather in how to put a system into production, whether it's object detection, running 20 cameras in parallel on a single computer, like sizing hardware for a specific task, and so on.
Any tips?
GreenSalem - 21759 sekunder sedan
AI written release post and it shows...
ge96 - 5156 sekunder sedan
I remember trying to do photo stitching myself (panoramas) then I failed miserably but it's built into opencv ha. I've used quite a bit of OpenCV features eg. laplace variance for an automatic zoom/focusing mechanical lens camera system (steppers) and contour/blob finding for crude color segmentation.
arcanine - 35320 sekunder sedan
They really improved the performance. I tested yolov8 medium segmentation model on intel i7 11th gen cpu.
Opencv 4.11 : ~255ms Opencv 5.0.0 : ~185ms
with the same code.
owenpalmer - 3070 sekunder sedan
> This is not just another incremental release. OpenCV 5 is a major step forward.
Am I the only one that finds this sentence very cheesey?
shelled - 30173 sekunder sedan
A few years ago I was using OpenCV is a commercial Android SDK (it might still be being used; also because iOS provided almost all of those "needs" ready-made and Android just didn't, neither did Firebase, or Jetpack suites/tools). I was the one who had added it in the SDK. There was a lot I/we could do but as an Android developer (barely any exposure to CV or even C/C++) what I felt we lacked was documentation, a community. We struggled with even shaving off parts that we did not want to ship with our SDK. Speed was such an issue. The problem was someone who just wanted to use the lib (on mobile) a lot of things felt esoteric and out of reach i.e difficult. It didn't have to be.Sadly LLM wasn't at full speed back then, barely useable, not even talked about. Something like this would have been a perfect use case of AI/LLM. A coder, not from the exact/specific field the tool was made in/from, but being able to take full advantage of its capabilities in a nuanced/selective manner.
- 4319 sekunder sedan
hbcondo714 - 260204 sekunder sedan
> LLMs and VLMs, Running Inside OpenCV…Qwen 2.5, Gemma 3, PaliGemma, and the GPT-2 / GPT-4 family
Why these specific models / versions?
maelito - 32909 sekunder sedan
Can it detect the speed of the car without any hand-made measurement ?
globalnode - 38365 sekunder sedan
does this mean im actually able to try object detection in opencv now? i mean i know basic image processing techniques, and i know "in theory" how ML works but ive never really seen a case where i can just say "heres an image now detect all the apples". theres always 1. find a model that has the knowledge, 2. hook it up to an inference engine, 3. do something useful. i always get stuck at 1.
Magnets - 27739 sekunder sedan
The announcement itself is pure AI slop
charankilari - 30299 sekunder sedan
wow its been ages
xavierforge - 11803 sekunder sedan
[flagged]
cdogukank - 13472 sekunder sedan
[dead]
imJack - 32821 sekunder sedan
[dead]
pimlottc - 21751 sekunder sedan
[dead]
leoncos - 295371 sekunder sedan
When I use Codex/Claude to complete a computer vision task, such as extracting assets from an image, OpenCV is their default solution. However, I believe that using YOLO and other methods is outdated. The best solution now is to directly use Nano Banana or other AI image models. A paper has proven that image generation models can perform most CV tasks well. I believe the new OpenCV should become a wrapper for VLM or AI image models.
oliveiracwb - 34107 sekunder sedan
Computer vision was the formative school for many autodidacts. Although I acquired substantial knowledge from articles translated via Power Translator and Babylon (whose outputs closely mirror those of any 2-million-parameter SLM), it was OpenCV that made concepts like convolutions, softmax, minmax, and others finally click for me. I have consistently viewed OpenCV as an intrinsically open, educational, and adaptable library. Any developer can dissect its codebase to extract a specific filter or algorithmic implementation and tailor it to their requirements. It is certainly not cruising at the velocity of trillion-dollar capital. But it holds its altitude. And it will always be there.