Computer Pointer Controller

This project - Computer Pointer Controller - moves the mouse pointer to the direction of the eye gaze. It does this by using a combination of 4 different computer vision models -face detection model, landmark detection model, head-pose estimation model, and gaze estimation. The final output, which is the x and y coordinates of the eye gaze, from the combined models is then fed to a mouse controller which moves the mouse pointer to the given coordinates.

Demo video of the App running

Project Set Up and Installation

Install OpenVINO (You can run this script to automate the installation of OpenVINO)
Clone/download this repo.
Use the requirements.txt file to install the required packages, i.e.

pip3 install requirements.txt

Use the OpenVINO model downloader to download the following models:

a. Face detection model

python /opt/intel/openvino/deployment_tools/tools/model_downloader/downloader.py --name "face-detection-adas-binary-0001"

b. Landmark regression model

python /opt/intel/openvino/deployment_tools/tools/model_downloader/downloader.py --name "landmarks-regression-retail-0009"

c. Head-pose estimation model

python /opt/intel/openvino/deployment_tools/tools/model_downloader/downloader.py --name "head-pose-estimation-adas-0001"

d. Gaze estimation model

python /opt/intel/openvino/deployment_tools/tools/model_downloader/downloader.py --name "gaze-estimation-adas-0002"

Directory Structure

Demo

From terminal, navigate to the src folder on the cloned directory, and run

python3 main.py -i ../bin/demo.mp4 \
-m_f <path to face detection model xml file> \
-m_l <path to landmark detection model xml file> \
-m_h <path to head-pose estimation model xml file> \
-m_g <path to gaze estimation model xml file>

Documentation

The required command line arguments are:

-i, which can either be the path of the input video or cam for camera
-m_f, path to face detection model
-m_l, path to landmark detection model
-m_h, path to head-pose estimation model
-m_g, path to gaze estimation model

The optional command line arguments are:

-l, path for MKLDNN (CPU)-targeted custom layers
-d, target device type e.g. CPU, FPGA
-p, path (in the cloned directory) to store performance statistics i.e. inference time, fps, and model loading time.
-vf, specify flags from m_f, m_l, m_h, m_g e.g. -vf m_f m_l m_h m_g (seperate each flag by space) for visualization of the output of intermediate models

Benchmarks

Results

Of the four models, the face detection model has the most latency across the precision types. Hence, the combined inferencing speed of the four models is mostly dependent on that of the face detection model.

It can also be seen that there is a general decrease in the processed frames per second with increase in precision. This can be attributed to the increase in floating point numbers with increase in precision, hence the calculations become more computational intensive.

FP32 precision gives better accuracy than the rest, the increased accuracy is more noticeable in the output for the gaze estimation. This could be as a result of the gaze estimation model being the last model before the final output to the mouse controller, hence, the losses of lower precisions are being built up from the first model down to the gaze estimation model.

Edge Cases

Certain situations make the inferencing to fail. If the lighting conditions are poor, the application may not be able to detect the face, and should incase it detects the face and can't pick out the left and right eyes, a message is logged that the image is too dark or eyes are covered, hence it can't pick out the features.

Also, if there are multiple people in the frame, it takes the first detected face and uses it in the inferencing flow. There will be certain situations that will break your inference flow. For instance, lighting changes or multiple people in the frame. Explain some of the edge cases you encountered in your project and how you solved them to make your project more robust.

Name		Name	Last commit message	Last commit date
Latest commit History 21 Commits
bin		bin
models		models
src		src
.gitignore		.gitignore
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Computer Pointer Controller

Demo video of the App running

Project Set Up and Installation

Directory Structure

Demo

Documentation

Benchmarks

Results

Edge Cases

About

Uh oh!

Releases

Packages

Languages

Tob-iee/mouseController

Folders and files

Latest commit

History

Repository files navigation

Computer Pointer Controller

Demo video of the App running

Project Set Up and Installation

Directory Structure

Demo

Documentation

Benchmarks

Results

Edge Cases

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages