Optoelectronic notes: How Kinect works with PrimeSense

Dec 2, 2010

How Kinect works with PrimeSense

Microsoft applys 3D motion sensing technology into XBox - Kinect, letting users interact with gaming freely without controller in hand. Camera and gesture recognition make the function of motion sensing. PrimeSense delivers the chip and structure to the 3D sensing technology, helping Microsoft to achieve controller-free entertainment device.

PrimeSense, a Israel company, comes up with a camera-based solution to 3D sensing. The module consist of :

Color image sensor
IR image sensor for depth measurement
IR light source - laser diode and diffuser

Spec

color image size: UXGA 1600x1200
depth image size: VGA 640x480
image throughput: 60 fps
operation range: 0.8m~3.5m
spatial x/y resolution: 3mm @2m distance
depth z resolution: 1cm @2m distance

3D Mapping
There are two speckle pattern could appear on the camera: one being primary speckle coming from the diffuser(projected on the object) and the other being secondary speckle formed during the imaging due to the lens aperture and object material roughness. It only concentrates in primary speckle. The primary speckle pattern, produced by the diffuser and DOE and then projected on the object, varies with z-axis. From the graph below, the object (human hand) shows different speckle pattern at different distance.

The speckle pattern is what PrimeSense names "structured light". Why the structured light has such property? ....See DOE below.

3D correlation will construct the 3D mapping of object. Because of the distance-dependent property, each position has its specific shape and spacing. Processor estimate the depth by correlating each window with the reference data.

Triangulation
For relative motion in z-axis, the speckle pattern will move along the x-axis on image plane of camera, because camera see the object at different angle. Based on the triangulation principle, correlation with x-direction can calculate the z-shift as the absolute distance is known from 3D mapping and the distance between projector and camera is defined. Each successive image is served as a reference image for the next frame calculation.
For example, object locates at 2m, spacing between projector and camera is 20cm, and given pixel size of sensor is 6um --> z-resolution is 15mm, calculated on triangulation relationship.

Calibration is carried out the the time of manufacture. A set of reference images were taken at different locations then stored in the memory.

Diffractive Optical Element (DOE)
In the illumination parts:

laser diode
diffuser - produce pseudo random pattern
DOE

Based on the extended depth of field (EDof), DOE is the embodiment of astigmatic optical element, which has different focus for different angle direction. From the picture above, the speckle of pseudo random distribution of diffuser would be focused to spot of different orientation at different focal distance.

Besides, DOE is designed to reduce the divergence angle so that light intensity would vary slower with distance.

PrimeSense's 3D sensing technology is very unique and special. There is no one has the similar patent in depth measurement. PrimeSense claims that the profile like human face can be distinct at 1m distance but loses the detail as it moves far away.

[Related articles]
How kinect works with PrimeSense-2
Microsoft: 3D Gesture Recognition

10 comments:

UnknownFebruary 18, 2011 at 6:21 AM
Hi, I would like to know where you got those diagrammes from, I can not seem to find them in the patent files I have come across.

Thank you for a very helpful and informative article.
ReplyDelete
Replies
ZHChenFebruary 18, 2011 at 2:53 PM
Only first pic is from PrimeSense website, and others are all from the patents of PrimeSense. Because patent just emphasis claims not technique, I survey almost all patents PrimeSense assigned to piece up how they make Kinect. You can find more patents in WIPO because it is Isreal company. I may list where pics from later long.
ReplyDelete
Replies
AnonymousMarch 30, 2011 at 9:48 AM
Thanks for post!
ReplyDelete
Replies
UnknownMay 23, 2013 at 11:47 AM
Thank you very much!
ReplyDelete
Replies
AnonymousJune 18, 2013 at 9:12 PM
Where do you have these information from?
ReplyDelete
Replies
Smith BrownMay 24, 2016 at 7:13 PM
I really enjoy reading your blog as the postings are so simple to read and follow. Outstanding. Please keep it up. Thanks.
Patented Gesture control technology
ReplyDelete
Replies
Isabel BentJuly 19, 2016 at 3:02 AM
Really very amazing and nice information about the technology i really books mark it and read it again.
What is Spot.IM
ReplyDelete
Replies
Andrew CarterNovember 10, 2016 at 3:58 PM
I visited your blog for the first time and just been your fan and get many informative information about the technology and I Will be back often to check up on new stuff you post well done.
IT jobs
ReplyDelete
Replies
Isabel BentNovember 12, 2016 at 4:55 PM
Very informative site and article, i must bookmark it and keep posting interesting articles.
it information technology
ReplyDelete
Replies
Isabel BentNovember 28, 2016 at 8:25 PM
Nice blog and your all presenting information are very great and it's really good well done.
Escape the Groundhog Life
ReplyDelete
Replies

Add comment