A go at making the depth_packet_stream_parser better. #75

larshg · 2014-10-22T13:20:29Z

Iterates through all bytes to find footer but copy data in chunks.
Doesn't use additional working buffer.

I tried to find out why I was getting a lot of errors as shown here #72 (comment) .

And I found out that its probably not about the code, but rather a hardware/software issue of my dell Precision m6600, with a Renesas USB 3.0 controller not being able to process and acquire all the data that the Kinectv2 sends out.

Also when data is transferred using ISO-mode, the packages are not guaranteed to be received either in order or at all.

I don't have any measures if this is faster/better than the previous, but it avoids using the extra working buffer.

Update: I added a FPS counter(locally) which showed my laptop running about 20(min 16, max 24). Tried the code on my stationary pc and there are a lot fewer errors and showed about 29-30 fps, hence sometimes a frame is lost. I tried a official depth app also and it runs about 29-30 fps - maybe just a bit more stable at the 30 :)

christiankerl · 2015-02-02T09:17:43Z

examples/protonect/src/depth_packet_stream_parser.cpp


-    size_t max_length = std::min<size_t>(wb.capacity - wb.length, in_length - 8);
+  for (size_t i = 0; i < in_length; i++)


upper bound should be in_length - sizeof(DepthSubPacketFooter) otherwise we might cause a segfault!?

xlz · 2015-02-11T22:10:37Z

For your reference one of the image data too short bug before this patch was sometime it detected the footer in the middle of a 33792-byte transfer which has actually no footer. I found this by adding a in_length != 33792 condition to the footer detection code and the error disappeared.

xlz · 2015-02-12T02:03:16Z

Here is my take. This is a complete rewrite, please ignore the diff and just look at the new function. It has been tested extensively .

Several considerations for the rewrite:

Scanning magic bytes for footer is dangerous. Many image data too short errors happened because it found the magic bytes in the middle where it is not actually the footer.
The footer seems to only appear at the very end of a transfer. This is also consistent with RGB transfers. So far I have not seen anything in experiments that contradicts this.
A packet and its subpackets all have fixed sizes, therefore the work buffer can be just a pointer to the actual buffer to save many memcpy's.
Correct packets and subpackets must have continuous sequence and subsequence numbers. If the buffer overruns, subpackets don't have 512*424*11/8 bytes, or the (sub)sequence numbers are not continuous, it must drop the entire packet and wait until the next packet. In all these cases there is no way to determine what has been lost within the current packet and the buffer will not be clean to continue assembling the current packet.

Some additional observations:

ISO transfer is very sensitive to USB autosuspend. If USB autosuspend is turned on, you can barely receive any data.
Booting with USB plugged in or hot-plugging might make a difference. In my case I have the Kinect plugged in before booting and disabled autosuspend, I received a lot of short subpackets (less than 512*424*11/8 bytes but footers detected).

larshg · 2015-02-16T19:57:51Z

Hi @xlz

I have tried out your implementation, but I get really bad performance :(

I can't even manage to stop and find the average framerate printout...

But I like a lot of the ideas you bring up - so I'll try to figure out why its not working at my part.

I have added the implementation on a branch in my fork here

xlz · 2015-02-16T20:05:46Z

My version should be strictly faster than the current one because it does not use an extra buffer and does not memcpy unnecessarily. I have tested it for 70 fps without data loss.

It may be that something is different on Windows that needs change. So this depends on your investigation. You can check if you have correctly disabled USB autosuspend settings, and you can try running it without visualization (comment out cv::imshow). Thanks.

larshg · 2015-02-16T20:29:36Z

Hi @xlz

The above test was on my dell precision m6600 - and I just tried it on my stationary I7 GTX480 and here it runs a lot more smooth. (I see same behavior with the current(master/my go at it) implementation - simply more processing power :) )

But I'll try to investigate why its running slower than the current implementation on the laptop.

I think have seen various data transfers sometimes on the USB - this might be the problem (ie. looking at the end/memory scanning).

I have never done anything about USB autosuspend, but I'll try find out where to change this :)

It has nothing to do with the visualization.

xlz · 2015-02-16T20:57:02Z

@larshg It might also be the slow scrolling of cmd.exe output. How is it when you comment out processor_->process(packet); and it's only stream parsing without processing?

Iterates through all bytes to find footer. Copy data in chunks. Doesn't use additional working buffer. Adds timestamp

xlz · 2015-04-18T19:41:03Z

@larshg
I got an interesting performance characteristic. In my approach above I eliminated the work buffer and copied usb buffer directly into the main packet buffer, and immediately sent the packet buffer to the depth processor. After adding in VA-API JPEG decoder, this particular order of memcpys seems to be causing performance regression, that is:

Slow (0.5x performance)

Collect transfers directly into buffer_
Send buffer_ to depth processor upon subsequence == 9

Fast

Collect transfers into work_buffer_
Send buffer_ to depth processor upon subsequence == 0
Copy work_buffer_ into buffer_

I don't know what is exactly the problem, but I reverted my patch to follow the original order of memcpys.

xlz · 2015-05-01T18:59:59Z

After closer look at libusb ISO transfer, it seems it is not even necessary to detect the footer at all. ISO packets length will provide clue about the end of a subpacket. And zero packet length indicates splitting between packets.

An example with one transfer a line:
(max usb iso packet size 33792, 8 usb packets per usb transfer, a depth subpacket data + footer = 512*424*11/8 + 152 bytes, 10 subpackets per depth packet)

 0 0 0 0 0 0 0 0
 0 0 0 0 0 0 0 0
 0 0 0 0 0 0 0 33792
 33792 33792 33792 33792 33792 33792 33792 28312
 0 0 0 33792 33792 33792 33792 33792
 33792 33792 33792 28312 0 0 33792 33792
 33792 33792 33792 33792 33792 33792 28312 0
 0 33792 33792 33792 33792 33792 33792 33792
 33792 28312 0 0 33792 33792 33792 33792
 33792 33792 33792 33792 28312 0 33792 33792
 33792 33792 33792 33792 33792 33792 28312 0
 0 0 33792 33792 33792 33792 33792 33792
 33792 33792 28312 0 33792 33792 33792 33792
 33792 33792 33792 33792 28312 0 0 0
 33792 33792 33792 33792 33792 33792 33792 33792
 28312 0 0 0 0 0 0 0
 0 0 0 0 0 0 0 0
 0 0 0 0 0 0 0 0
 0 0 0 0 0 0 0 0
 0 33792 33792 33792 33792 33792 33792 33792
 33792 28312 0 0 0 0 0 0
 0 0 0 0 0 0 0 0
 0 0 0 0 0 0 0 0
 0 0 0 0 0 0 0 0
 0 0 0 0 0 0 0 0

Also as can be seen, the 10th auxiliary subpacket lags in time and is not actually used for the depth image.

xlz · 2015-05-02T17:54:10Z

@larshg
I have updated the depth stream parser in my stream-parsers branch according to findings in the above comment.

I have removed enforcing of sequence number, and mostly minimized changes made to the original code to preserve the original execution order. The only changes now are 1) removing magic footer scanning, and assuming fixed sizes.

I have tested the patch under Linux and Windows. Please see if it works for you.

larshg force-pushed the depthpacketparse branch 4 times, most recently from 492ba42 to 0c1f09c Compare October 28, 2014 13:05

larshg mentioned this pull request Oct 28, 2014

Rgb packet handling #73

Closed

larshg force-pushed the depthpacketparse branch from 0c1f09c to 8ba584a Compare November 2, 2014 19:45

larshg force-pushed the depthpacketparse branch from 8ba584a to 54db33b Compare November 17, 2014 08:02

larshg force-pushed the depthpacketparse branch from 54db33b to e460b71 Compare January 13, 2015 14:02

christiankerl reviewed Feb 2, 2015
View reviewed changes

larshg force-pushed the depthpacketparse branch from e460b71 to 58d88e9 Compare February 5, 2015 17:05

larshg force-pushed the depthpacketparse branch from 58d88e9 to 67f14c4 Compare February 12, 2015 18:02

larshg mentioned this pull request Feb 19, 2015

New libusb version not working on AMD #164

Closed

A go at making the depth_packet_stream_parser better.

7e757b5

Iterates through all bytes to find footer. Copy data in chunks. Doesn't use additional working buffer. Adds timestamp

larshg force-pushed the depthpacketparse branch from 67f14c4 to 7e757b5 Compare April 8, 2015 11:24

xlz mentioned this pull request Apr 29, 2015

Crashes on OSX with Nvidia graphics #205

Open

xlz mentioned this pull request May 4, 2015

Improve RGB and depth stream parsers #221

Merged

larshg closed this May 5, 2015

larshg deleted the depthpacketparse branch May 5, 2015 12:40

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

A go at making the depth_packet_stream_parser better. #75

A go at making the depth_packet_stream_parser better. #75

Uh oh!

larshg commented Oct 22, 2014

Uh oh!

christiankerl Feb 2, 2015

Uh oh!

xlz commented Feb 11, 2015

Uh oh!

xlz commented Feb 12, 2015

Uh oh!

larshg commented Feb 16, 2015

Uh oh!

xlz commented Feb 16, 2015

Uh oh!

larshg commented Feb 16, 2015

Uh oh!

xlz commented Feb 16, 2015

Uh oh!

xlz commented Apr 18, 2015

Uh oh!

xlz commented May 1, 2015

Uh oh!

xlz commented May 2, 2015

Uh oh!

Uh oh!


		size_t max_length = std::min<size_t>(wb.capacity - wb.length, in_length - 8);
		for (size_t i = 0; i < in_length; i++)

A go at making the depth_packet_stream_parser better. #75

A go at making the depth_packet_stream_parser better. #75

Uh oh!

Conversation

larshg commented Oct 22, 2014

Uh oh!

christiankerl Feb 2, 2015

Choose a reason for hiding this comment

Uh oh!

xlz commented Feb 11, 2015

Uh oh!

xlz commented Feb 12, 2015

Uh oh!

larshg commented Feb 16, 2015

Uh oh!

xlz commented Feb 16, 2015

Uh oh!

larshg commented Feb 16, 2015

Uh oh!

xlz commented Feb 16, 2015

Uh oh!

xlz commented Apr 18, 2015

Uh oh!

xlz commented May 1, 2015

Uh oh!

xlz commented May 2, 2015

Uh oh!

Uh oh!