I am trying to integrate the Darknet YOLO (https://pjreddie.com/darknet/yolo/) object detection package into Unity for use on the HoloLens. So far I have only exported the package as a DLL and made marshalled function calls to it from Unity. I want to feed it a continuous video stream rather than single pictures. The problem is that processing a single picture and getting the bounding box information back takes several seconds, and I would like it to run in real time (that is, continuously draw bounding boxes around the objects it detects). Is there a reason the package is so slow to return detection information when I call a function that reads the image from a file, and is there a way to optimize this?
I'm also not sure how (or whether) I can use Unity's VideoCapture functionality with the YOLO package, but I would like to, so that a real-time video feed can be analyzed on the fly and the results displayed on the HoloLens.
Any steps in the right direction would be much appreciated!