headtracker / mag lens / macro lens

From: linaccess at yokoy.de
Date: Thu Sep 6 14:44:43 EDT 2007


Hi Luke,

thanks for the suggestion, it sounds good to me.

Whether we use absolute or relative values depends on the user and their abilities. First and foremost the headtracker is an accessibility project; second, we all benefit from accessibility software. I would really like to get rid of the touchpad. Using a touchpad is a bit like Zen :-) Having both hands on the keyboard all the time speeds up the workflow enormously.
Maybe we could put a slider into the activity where the user selects absolute, relative, or something in between.
The user should also be able to define an active area. Outside this active area there could be car traffic or smokers with lighters; it doesn't matter. There is no need to process that region, and it causes fewer problems with a jumping mouse pointer.

Is it possible to restrict the camera itself to an active area? That would speed up the processing and maybe increase the frame rate.
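
For illustration, a minimal sketch of what I mean by restricting the processing, assuming the frame arrives as a plain width x height grayscale buffer (Rect and find_brightest_in_area are made-up names):

typedef struct { int x0, y0, x1, y1; } Rect;

/* Scan only the user-defined active area; whatever happens outside it
   (car traffic, lighters, ...) is never even looked at. */
int find_brightest_in_area(const unsigned char *frame, int width,
                           Rect area, int *best_x, int *best_y) {
  int best = -1;
  for (int y = area.y0; y < area.y1; y++)
    for (int x = area.x0; x < area.x1; x++)
      if (frame[y * width + x] > best) {
        best = frame[y * width + x];
        *best_x = x;
        *best_y = y;
      }
  return best;  /* brightest value found, or -1 if the area was empty */
}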

regards,
yokoy



On Wed, 5 Sep 2007 19:52:38 -0400
"Luke Hutchison" <luke.hutch at gmail.com> wrote:

> I see, so the camera sensor area covered by moving your head in-place to
> point to the four corners of the display is approximately 90x30.
> 
> I think you are saying that the image of the reflective dot is a blob of
> neighboring pixels of size 5x3 to 20x12 somewhere in the camera image.  If that
> is the case, then taking the centroid is probably the best approach.
> However, let me make a suggestion to improve accuracy:
> 
> -- Threshold image at some minimum brightness
> -- Find largest blob of connected pixels above threshold, and put them into
> a set B
> -- Put all pixels just inside or just outside the edge of the blob into a
> set E
> -- Find all internal pixels I = B \ E  (all pixels in B that are not in E)
> -- Linearly scale the (floating point) intensity of all pixels in E so the
> brightest is 1.0 and the darkest is 0.0.  (Beware of division by zero if all
> edge pixels are the same intensity.)
> -- Set the intensity of all pixels in I to 1.0
> -- Find the weighted centroid of (I Union E), by multiplying the pixel
> intensity by the x and y coords, and summing for x and y, then dividing x
> and y sums by the total of all pixel intensities.
> 
> What this effectively does is give all internal pixels equal weight in
> calculating centroid position, and then it uses the normalized shade of edge
> pixels to fractionally adjust the centroid position.  This effectively
> reverses the antialiasing that occurs at the edge of the image of the dot on
> the CCD.  You really should be able to get some good resolution using some
> sort of approach like this.
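> 
> Here's a rough sketch of those steps in C, assuming blob/edge extraction has
> already produced the two pixel sets (the Pixel struct and function name are
> only illustrative):
> 
> typedef struct { int x, y; double v; } Pixel;   /* v = intensity */
> 
> /* Weighted centroid of I (internal) and E (edge): internal pixels get
>    weight 1.0, edge pixels their intensity rescaled to [0.0, 1.0].
>    Assumes both sets are non-empty. */
> void blob_centroid(const Pixel *in, int n_in,
>                    const Pixel *edge, int n_edge,
>                    double *cx, double *cy) {
>   double lo = edge[0].v, hi = edge[0].v;
>   for (int i = 1; i < n_edge; i++) {
>     if (edge[i].v < lo) lo = edge[i].v;
>     if (edge[i].v > hi) hi = edge[i].v;
>   }
>   double range = hi - lo;   /* zero if all edge pixels are equal... */
>   double w_sum = 0.0, x_sum = 0.0, y_sum = 0.0;
>   for (int i = 0; i < n_in; i++) {
>     w_sum += 1.0; x_sum += in[i].x; y_sum += in[i].y;
>   }
>   for (int i = 0; i < n_edge; i++) {
>     double w = range > 0.0 ? (edge[i].v - lo) / range : 0.5;  /* ...so guard */
>     w_sum += w; x_sum += w * edge[i].x; y_sum += w * edge[i].y;
>   }
>   *cx = x_sum / w_sum;      /* w_sum > 0 whenever n_in > 0 */
>   *cy = y_sum / w_sum;
> }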
> 
> By the way, as someone pointed out in another post, trying to hold your head
> in position and just tilt it slightly to reach the corners of the laptop is
> likely to give you a knot in your neck!  You should probably allow the user
> to move their head further (because their effective range is extended by
> their eyes).  You might also want to consider doing some eye tracking, so
> that the user can blink twice to turn the head tracking on or off, for
> example, and maybe wink to trigger a L/R mouse button action.
> 
> Also you should look at relative positioning and not just absolute
> positioning: e.g. the cursor doesn't move unless you move your head quickly,
> and the movement is only part of the width of the screen each time -- that
> way you can move the cursor in several steps, by moving quickly forward in
> one direction, then slowly back, much the same way as you do with a touch
> pad, except that touching/raising your finger is emulated by the speed
> threshold.
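> 
> A toy sketch of that relative mode (SPEED_THRESHOLD and GAIN are made-up
> tuning constants):
> 
> #include <math.h>
> 
> #define SPEED_THRESHOLD 2.0   /* sensor pixels per frame */
> #define GAIN            8.0   /* head motion -> cursor motion */
> 
> /* Fast head motion drags the cursor; slow motion just re-centers the head,
>    like lifting your finger off a touchpad. */
> void update_cursor(double head_x, double head_y,
>                    double *prev_x, double *prev_y,
>                    double *cur_x, double *cur_y) {
>   double dx = head_x - *prev_x, dy = head_y - *prev_y;
>   if (sqrt(dx * dx + dy * dy) > SPEED_THRESHOLD) {
>     *cur_x += dx * GAIN;
>     *cur_y += dy * GAIN;
>   }
>   *prev_x = head_x; *prev_y = head_y;
> }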
> 
> Luke
> 
> 
> 
> 
> On 9/5/07, linaccess at yokoy.de <linaccess at yokoy.de> wrote:
> >
> > Hello Luke,
> >
> > I will consider the paraboloid thing, thanks for the code.
> >
> > I am not tracking only one point. I am tracking a bundle of points - say
> > 5x3 to 20x12 - and its size and aspect ratio change all the time. Out of
> > those pixels (and maybe over time) I have to build an average pixel with a
> > defined XY value; the average pixel != the brightest pixel. For that
> > average pixel I could use subpixel accuracy, too. In fact I have to,
> > because I have to map the low resolution onto 1200x900 pixels if I use
> > absolute coordinates.
> > Where do I get the size 90x30 from? It is not a pixel-exact value but an
> > approximation; it varies.
> > I tried to move my head, not my eyes, looking at the four corners and the
> > center of the XO display. The display is very small, so I really did not
> > move my head much. One infrared-filtered result merged from 5 snapshots
> > (four corners and center) is here:
> >
> > http://www.olpcaustria.org/mediawiki/upload/7/79/Headtracker_area_small.jpg
> > It is downsized from 640x480 to 320x240 px, but the proportions are the same.
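> >
> > For illustration, mapping the averaged sub-pixel position inside the roughly
> > 90x30 active area to absolute 1200x900 screen coordinates is just a linear
> > rescale (the area origin here is a made-up placement):
> >
> > void map_to_screen(double px, double py, double *sx, double *sy) {
> >   const double area_x = 275.0, area_y = 225.0;  /* active-area origin */
> >   const double area_w = 90.0,  area_h = 30.0;   /* active-area size */
> >   *sx = (px - area_x) / area_w * 1200.0;
> >   *sy = (py - area_y) / area_h * 900.0;
> > }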
> >
> > Maybe I have a knot in my brain. I would really like to get the headtracker
> > working well without an additional lens.
> > Again the link to the project site:
> > http://www.olpcaustria.org/mediawiki/index.php/Headtracker
> >
> > greetings,
> > yokoy
> >
> >
> >
> >
> > On Wed, 5 Sep 2007 12:57:54 -0400
> > "Luke Hutchison" <luke.hutch at gmail.com> wrote:
> >
> > > PS there's a "cheap" way you can accomplish subpixel accuracy, as follows.
> > > You basically take a bunch of 1D samples through the brightest pixel,
> > > looking at 2 neighbors in each direction, and then take a weighted sum of
> > > the results.  This calls the code I pasted in the last email.  It's not
> > > going to be as good as paraboloid regression, but it should allow you to
> > > test feasibility.
> > >
> > >
> > > // Do a 2D parabolic fit to give sub-pixel accuracy for translational
> > > // registration, by performing four 1D parabolic fits through the closest
> > > // integer correlation offset: horizontally, vertically, and on the leading
> > > // and trailing diagonals.  Take the weighted centroid of all four to get
> > > // the true correlation offset.
> > >
> > > // off_x and off_y are the one-pixel-accurate correlation offsets recovered
> > > // by correlation.
> > >
> > > // x1d and y1d are x and y values for the 1D quadratic fit function
> > > double y1d[9] = {     // Get magnitudes at (off_x,off_y) and 8 neighbors
> > >   dft_mag(dft, off_x - 1, off_y - 1),
> > >   dft_mag(dft, off_x    , off_y - 1),
> > >   dft_mag(dft, off_x + 1, off_y - 1),
> > >   dft_mag(dft, off_x - 1, off_y    ),
> > >   dft_mag(dft, off_x    , off_y    ),
> > >   dft_mag(dft, off_x + 1, off_y    ),
> > >   dft_mag(dft, off_x - 1, off_y + 1),
> > >   dft_mag(dft, off_x    , off_y + 1),
> > >   dft_mag(dft, off_x + 1, off_y + 1)
> > > };
> > >
> > > // Sum contributions to centroid of each quadratic fit
> > > double x1d_tot = 0.0, y1d_tot = 0.0, x1d;
> > >
> > > // Parabolic fit in horiz direction through correlation maximum
> > > x1d = parabolic_fit(-1, y1d[3], 0, y1d[4], 1, y1d[5]);
> > > x1d_tot += x1d;
> > >
> > > // Parabolic fit in vert direction through correlation maximum
> > > x1d = parabolic_fit(-1, y1d[1], 0, y1d[4], 1, y1d[7]);
> > > y1d_tot += x1d;   // [x1d is x in parabola space, but y in correlation space]
> > >
> > > // Weight contributions of diagonals by the inverse of their distance
> > > #define RT2_OV_2  0.7071067811865475244   // sqrt(2)/2  (= 1/sqrt(2))
> > >
> > > // Parabolic fit in leading diagonal direction through correlation maximum
> > > x1d = parabolic_fit(-1, y1d[0], 0, y1d[4], 1, y1d[8]);
> > > x1d_tot += x1d * RT2_OV_2;
> > > y1d_tot += x1d * RT2_OV_2;
> > >
> > > // Parabolic fit in trailing diagonal direction through correlation maximum
> > > x1d = parabolic_fit(-1, y1d[2], 0, y1d[4], 1, y1d[6]);
> > > x1d_tot -= x1d * RT2_OV_2;
> > > y1d_tot += x1d * RT2_OV_2;
> > >
> > > // Take centroid of all parabolic fits, weighting diagonals by RT2_OV_2;
> > > // make relative to correlation coords by adding off_x, off_y
> > > double subpix_off_x = off_x + x1d_tot / (2.0 + 2.0 * RT2_OV_2);
> > > double subpix_off_y = off_y + y1d_tot / (2.0 + 2.0 * RT2_OV_2);
> > >
> > >
> > >
> > > On 9/5/07, Luke Hutchison <luke.hutch at gmail.com> wrote:
> > > >
> > > > Where do you get the size 90x30 from though?  Are you saying you can't
> > > > get at the full-sized frame through the API currently?
> > > >
> > > > You really should consider fitting a paraboloid over the dot to get
> > > > sub-pixel resolution.  Note that if the dot is bigger (more than a few
> > > > pixels), you probably want to just use the weighted centroid, but if
> > > > it's small, a paraboloid is the right approach.  You really will get at
> > > > least a 10x increase in accuracy in both x and y, bringing your effective
> > > > resolution to something like 900x300 for the example you gave.  You may
> > > > not even need a lens.  I have used this before with success for an image
> > > > processing project.
> > > >
> > > >
> > > > Here's the code for the 1D version:
> > > >
> > > > // Fit a parabola to three points, and return the x coord of the turning
> > > > // point (point 2 is the central point, points 1 and 3 are its neighbors)
> > > > double parabolic_fit(double x1, double y1,
> > > >                      double x2, double y2,
> > > >                      double x3, double y3) {
> > > >
> > > >   double a = (y3 - y2) / ((x3 - x2) * (x3 - x1)) -
> > > >              (y1 - y2) / ((x1 - x2) * (x3 - x1));
> > > >
> > > >   double b = (y1 - y2 + a * (x2 * x2 - x1 * x1)) / (x1 - x2);
> > > >
> > > >   double xmin = x2;       // Just use central point if parabola is flat
> > > >   if (fabs(a) > EPSILON)
> > > >     xmin = -b / (2 * a);  // [avoids division by zero]
> > > >
> > > >   // Use the following to calculate the y-value at the turning point
> > > >   // of the parabola:
> > > >   //
> > > >   //   double c = y1 - a * x1 * x1 - b * x1;
> > > >   //   double ymin = a * xmin * xmin + b * xmin + c;
> > > >
> > > >   return xmin;
> > > > }
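> > > >
> > > > For example -- with I() standing in for whatever pixel accessor you use --
> > > > the sub-pixel x coordinate of the brightest pixel at (bx, by) would be:
> > > >
> > > >   double sub_x = bx + parabolic_fit(-1.0, I(bx - 1, by),
> > > >                                      0.0, I(bx,     by),
> > > >                                      1.0, I(bx + 1, by));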
> > > >
> > > > I don't have code for the 2D version unfortunately.
> > > >
> > > > The 2D version (fitting a paraboloid to a neighborhood of more than four
> > > > points total) is overspecified, as is the 1D version if fitting a parabola
> > > > to more than three points (e.g. using two neighbors on either side of the
> > > > brightest pixel).  Thus you need to do some sort of regression to find the
> > > > best fit.  I'm sure there is code out there to accomplish this.
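> > > >
> > > > For the 1D case the regression is small enough to do by hand. A sketch,
> > > > assuming samples centered on the brightest pixel and solving the 3x3
> > > > normal equations by Cramer's rule (fine at this size):
> > > >
> > > > #include <math.h>
> > > >
> > > > // Least-squares parabola y = a*x^2 + b*x + c through n >= 3 points,
> > > > // e.g. the brightest pixel plus two neighbors on either side;
> > > > // returns the x coordinate of the vertex.
> > > > double parabola_lsq_vertex(const double *x, const double *y, int n) {
> > > >   double s0 = n, s1 = 0, s2 = 0, s3 = 0, s4 = 0;
> > > >   double t0 = 0, t1 = 0, t2 = 0;
> > > >   for (int i = 0; i < n; i++) {
> > > >     double xi = x[i], xi2 = xi * xi;
> > > >     s1 += xi;   s2 += xi2;       s3 += xi2 * xi;  s4 += xi2 * xi2;
> > > >     t0 += y[i]; t1 += xi * y[i]; t2 += xi2 * y[i];
> > > >   }
> > > >   // Normal equations: [s4 s3 s2; s3 s2 s1; s2 s1 s0](a,b,c)^T = (t2,t1,t0)^T
> > > >   double d  = s4 * (s2 * s0 - s1 * s1) - s3 * (s3 * s0 - s1 * s2)
> > > >             + s2 * (s3 * s1 - s2 * s2);
> > > >   double da = t2 * (s2 * s0 - s1 * s1) - s3 * (t1 * s0 - t0 * s1)
> > > >             + s2 * (t1 * s1 - t0 * s2);
> > > >   double db = s4 * (t1 * s0 - t0 * s1) - t2 * (s3 * s0 - s1 * s2)
> > > >             + s2 * (s3 * t0 - s2 * t1);
> > > >   double a = da / d, b = db / d;            // d != 0 for distinct x values
> > > >   return fabs(a) > 1e-12 ? -b / (2 * a) : x[n / 2];  // flat: use center
> > > > }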
> > > >
> > > > Luke
> > > >
> > > >
> > > > On 9/5/07, linaccess at yokoy.de <linaccess at yokoy.de> wrote:
> > > > >
> > > > > Hello Luke,
> > > > >
> > > > > On Tue, 4 Sep 2007 16:11:34 -0700
> > > > > "Luke Hutchison" < luke.hutch at gmail.com> wrote:
> > > > >
> > > > > > Is the processing time for 640x480 the reason you're only using 90x30?
> > > > > >
> > > > >
> > > > > No, I am not yet at the point of thinking about optimizing the
> > > > > processing time.
> > > > > You don't want to dance in front of the camera to control the mouse
> > > > > pointer; you just want to move your head a few degrees, as if you were
> > > > > looking at the edges of the display without moving your eyes. The area
> > > > > the camera sees is then very small. It is like moving the mouse only one
> > > > > or two millimeters to move the pointer across the whole desktop. To get
> > > > > more 'delta pixels' I need a magnifying lens, I think.
> > > > >
> > > > > regards,
> > > > >
> > > > > yokoy
> > > > >
> > > > >
> > > > >
> > > > > > You can actually dramatically increase the precision to which you can
> > > > > > read back the bright point's location by fitting a paraboloid to the
> > > > > > intensity values in the neighborhood of the brightest pixel, then
> > > > > > reading off the location of the extremum of the paraboloid.  You will
> > > > > > get at least one order of magnitude more accuracy that way than looking
> > > > > > at the integer coords of the brightest pixel (perhaps as much as two
> > > > > > orders of magnitude).
> > > > > >
> > > > > > Luke
> > > > > >
> > > > > >
> > > > > > On 9/4/07, linaccess at yokoy.de <linaccess at yokoy.de> wrote:
> > > > > > >
> > > > > > > Hello Mary Lou,
> > > > > > >
> > > > > > > On Tue, 04 Sep 2007 12:13:34 -0400
> > > > > > > Mary Lou Jepsen <mljatolpc at gmail.com> wrote:
> > > > > > >
> > > > > > > > lenses are cheap.  it depends on what exactly you are doing with
> > > > > > > > the software.
> > > > > > >
> > > > > > > tracking a small shiny dot on the head and transforming it into
> > > > > > > mouse-pointer movements. Here is the description:
> > > > > > > http://www.olpcaustria.org/mediawiki/index.php/Headtracker
> > > > > > >
> > > > > > > With the XO camera we typically use only 90x30 pixels of the 640x480
> > > > > > > frame. So I want to magnify the operative area with a lens.
> > > > > > > Here is a picture of the area:
> > > > > > >
> > > > > > > http://www.olpcaustria.org/mediawiki/index.php/Headtracker#magnification_lens
> > > > > > >
> > > > > > >
> > > > > > >
> > > > > > >
> > > > > > > > American Science and Surplus is a good way to experiment:
> > > > > > > > http://sciplus.com/category.cfm?subsection=21
> > > > > > > >
> > > > > > >
> > > > > > > thank you for that link. A plastic lens is what I am searching for.
> > > > > > >
> > > > > > >
> > > > > > > > then to China for mass production at a very low price point.
> > > > > > > >
> > > > > > > >
> > > > > > > > - Mary Lou
> > > > > > > >
> > > > > > > >
> > > > > > >
> > > > > > > regards,
> > > > > > >
> > > > > > > yokoy
> 

