save readme

eef54182 · vincent · d5d9ba7e · eef54182
Commit eef54182 authored Oct 16, 2018 by vincent
Hide whitespace changes
Inline Side-by-side

Showing with 409 additions and 195 deletions

README.md README.md +409 -195

No files found.
--- a/README.md
+++ b/README.md
@@ -4,51 +4,67 @@

 **JavaScript API for face detection and face recognition in the browser implemented on top of the tensorflow.js core API ([tensorflow/tfjs-core](https://github.com/tensorflow/tfjs-core))**

+Table of Contents:
+
+* **[Resources](#resources)**
+  * **[Live Demos](#live-demos)**
+  * **[Tutorials](#tutorials)**
+* **[Examples](#examples)**
+  * **[Running the Examples](#running-the-examples)**
+* **[Available Models](#models)**
+  * **[Face Detection Models](#models-face-detection)**
+  * **[68 Point Face Landmark Detection Models](#models-face-landmark-detection)**
+  * **[Face Recognition Model](#models-face-recognition)**
+* **[Usage](#usage)**
+  * **[Loading the Models](#usage-loading-models)**
+  * **[High Level API](#usage-high-level-api)**
+  * **[Displaying Detection Results](#usage-displaying-detection-results)**
+  * **[Face Detection Options](#usage-face-detection-options)**
+  * **[Utility Classes](#usage-utility-classes)**
+  * **[Other Useful Utility](#other-useful-utility)**
+
+<a name="resources"></a>
+
+# Resources
+
+<a name="live-demos"></a>
+
+## Live Demos
+
+**[Check out the live demos!](https://justadudewhohacks.github.io/face-api.js/)**
+
+<a name="tutorials"></a>
+
+## Tutorials
+
 Check out my face-api.js tutorials:

 * **[face-api.js — JavaScript API for Face Recognition in the Browser with tensorflow.js](https://itnext.io/face-api-js-javascript-api-for-face-recognition-in-the-browser-with-tensorflow-js-bcc2a6c4cf07)**
 * **[Realtime JavaScript Face Tracking and Face Recognition using face-api.js’ MTCNN Face Detector](https://itnext.io/realtime-javascript-face-tracking-and-face-recognition-using-face-api-js-mtcnn-face-detector-d924dd8b5740)**

-**Check out the live demos [here](https://justadudewhohacks.github.io/face-api.js/)!**
+<a name="examples"></a>

-Table of Contents:
-
-* **[Running the Examples](#running-the-examples)**
-* **[About the Package](#about-the-package)**
-  * **[Face Detection - SSD Mobilenet v1](#about-face-detection-ssd)**
-  * **[Face Detection - Tiny Yolo v2](#about-face-detection-yolo)**
-  * **[Face Detection & 5 Point Face Landmarks - MTCNN](#about-face-detection-mtcnn)**
-  * **[Face Recognition](#about-face-recognition)**
-  * **[68 Point Face Landmark Detection](#about-face-landmark-detection)**
-* **[Usage](#usage)**
-  * **[Loading the Models](#usage-load-models)**
-  * **[Face Detection - SSD Mobilenet v1](#usage-face-detection-ssd)**
-  * **[Face Detection - Tiny Yolo v2](#usage-face-detection-yolo)**
-  * **[Face Detection & 5 Point Face Landmarks - MTCNN](#usage-face-detection-mtcnn)**
-  * **[Face Recognition](#usage-face-recognition)**
-  * **[68 Point Face Landmark Detection](#usage-face-landmark-detection)**
-  * **[Shortcut Functions for Full Face Description](#shortcut-functions)**
+# Examples

-## Examples
-
-### Face Recognition
+## Face Recognition

 ![preview_face-detection-and-recognition](https://user-images.githubusercontent.com/31125521/41526995-1a90e4e6-72e6-11e8-96d4-8b2ccdee5f79.gif)

 ![preview_face-recognition_gif](https://user-images.githubusercontent.com/31125521/40313021-c3afdfec-5d14-11e8-86df-cf89a00668e2.gif)

-### Face Similarity
+## Face Similarity

 ![preview_face-similarity](https://user-images.githubusercontent.com/31125521/40316573-0a1190c0-5d1f-11e8-8797-f6deaa344523.gif)

-### Face Landmarks
+## Face Landmark Detection

 ![face_landmarks_boxes_1](https://user-images.githubusercontent.com/31125521/46063403-fff9f480-c16c-11e8-900f-e4b7a3828d1d.jpg)
+
 ![face_landmarks_boxes_2](https://user-images.githubusercontent.com/31125521/46063404-00928b00-c16d-11e8-8f29-e9c50afd2bc8.jpg)

 ![preview_face_landmarks](https://user-images.githubusercontent.com/31125521/41507950-e121b05e-723c-11e8-89f2-d8f9348a8e86.png)

-### Live Face Detection
+## Realtime Face Tracking

 **SSD Mobilenet v1**

@@ -63,115 +79,92 @@ Table of Contents:
 ## Running the Examples

 ``` bash
-cd examples
+git clone https://github.com/justadudewhohacks/face-api.js.git
+cd face-api.js/examples
 npm i
 npm start
 ```

 Browse to http://localhost:3000/.

-<a name="about-the-package"></a>
+<a name="models"></a>

-## About the Package
+# Available Models

-<a name="about-face-detection-ssd"></a>
+<a name="models-face-detection"></a>

-### Face Detection - SSD Mobilenet v1
+## Face Detection Models

-For face detection, this project implements a SSD (Single Shot Multibox Detector) based on MobileNetV1. The neural net will compute the locations of each face in an image and will return the bounding boxes together with it's probability for each face. This face detector is aiming towards obtaining high accuracy in detecting face bounding boxes instead of low inference time.
+## SSD Mobilenet V1
+
+For face detection, this project implements a SSD (Single Shot Multibox Detector) based on MobileNetV1. The neural net will compute the locations of each face in an image and will return the bounding boxes together with it's probability for each face. This face detector is aiming towards obtaining high accuracy in detecting face bounding boxes instead of low inference time. The size of the quantized model is about 5.4 MB (**ssd_mobilenetv1_model**).

 The face detection model has been trained on the [WIDERFACE dataset](http://mmlab.ie.cuhk.edu.hk/projects/WIDERFace/) and the weights are provided by [yeephycho](https://github.com/yeephycho) in [this](https://github.com/yeephycho/tensorflow-face-detection) repo.

-<a name="about-face-detection-yolo"></a>
+## Tiny Face Detector

-### Face Detection - Tiny Yolo v2
+The Tiny Face Detector is a very performant, realtime face detector, which is much faster, smaller and less resource consuming compared to the SSD Mobilenet V1 face detector, in return it performs slightly less well on detecting small faces. This model is extremely mobile and web friendly, thus it should be your GO-TO face detector on mobile devices and resource limited clients. The size of the quantized model is only 190 KB (**tiny_face_detector_model**).

-The Tiny Yolo v2 implementation is a very performant face detector, which can easily adapt to different input image sizes, thus can be used as an alternative to SSD Mobilenet v1 to trade off accuracy for performance (inference time). In general the models ability to locate smaller face bounding boxes is not as accurate as SSD Mobilenet v1.
+The face detector has been trained on a custom dataset of ~14K images labeled with bounding boxes. Furthermore the model has been trained to predict bounding boxes, which entirely cover facial feature points, thus it in general produces better results in combination with subsequent face landmark detection than SSD Mobilenet V1.

-The face detector has been trained on a custom dataset of ~10K images labeled with bounding boxes and uses depthwise separable convolutions instead of regular convolutions, which ensures very fast inference and allows to have a quantized model size of only 1.7MB making the model extremely mobile and web friendly. Thus, the Tiny Yolo v2 face detector should be your GO-TO face detector on mobile devices.
+This model is basically an even tinier version of Tiny Yolo V2, replacing the regular convolutions of Yolo with depthwise separable convolutions. Yolo is fully convoltuional, thus can easily adapt to different input image sizes to trade off accuracy for performance (inference time).

-<a name="about-face-detection-mtcnn"></a>
+## MTCNN

-### Face Detection & 5 Point Face Landmarks - MTCNN
+**Note, this model is mostly kept in this repo for experimental reasons. In general the other face detectors should perform better, but of course you are free to play around with MTCNN.**

-MTCNN (Multi-task Cascaded Convolutional Neural Networks) represents an alternative face detector to SSD Mobilenet v1 and Tiny Yolo v2, which offers much more room for configuration. By tuning the input parameters, MTCNN is able to detect a wide range of face bounding box sizes. MTCNN is a 3 stage cascaded CNN, which simultaneously returns 5 face landmark points along with the bounding boxes and scores for each face. By limiting the minimum size of faces expected in an image, MTCNN allows you to process frames from your webcam in realtime. Additionally with the model size is only 2MB.
+MTCNN (Multi-task Cascaded Convolutional Neural Networks) represents an alternative face detector to SSD Mobilenet v1 and Tiny Yolo v2, which offers much more room for configuration. By tuning the input parameters, MTCNN should be able to detect a wide range of face bounding box sizes. MTCNN is a 3 stage cascaded CNN, which simultaneously returns 5 face landmark points along with the bounding boxes and scores for each face. Additionally the model size is only 2MB.

 MTCNN has been presented in the paper [Joint Face Detection and Alignment using Multi-task Cascaded Convolutional Networks](https://kpzhang93.github.io/MTCNN_face_detection_alignment/paper/spl.pdf) by Zhang et al. and the model weights are provided in the official [repo](https://github.com/kpzhang93/MTCNN_face_detection_alignment) of the MTCNN implementation.

-<a name="about-face-recognition"></a>
-
-### Face Recognition
+<a name="models-face-landmark-detection"></a>

-For face recognition, a ResNet-34 like architecture is implemented to compute a face descriptor (a feature vector with 128 values) from any given face image, which is used to describe the characteristics of a persons face. The model is **not** limited to the set of faces used for training, meaning you can use it for face recognition of any person, for example yourself. You can determine the similarity of two arbitrary faces by comparing their face descriptors, for example by computing the euclidean distance or using any other classifier of your choice.
-
-The neural net is equivalent to the **FaceRecognizerNet** used in [face-recognition.js](https://github.com/justadudewhohacks/face-recognition.js) and the net used in the [dlib](https://github.com/davisking/dlib/blob/master/examples/dnn_face_recognition_ex.cpp) face recognition example. The weights have been trained by [davisking](https://github.com/davisking) and the model achieves a prediction accuracy of 99.38% on the LFW (Labeled Faces in the Wild) benchmark for face recognition.
+## 68 Point Face Landmark Detection Models

-<a name="about-face-landmark-detection"></a>
+This package implements a very lightweight and fast, yet accurate 68 point face landmark detector. The default model has a size of only 350kb (**face_landmark_68_model**) and the tiny model is only 80kb (**face_landmark_68_tiny_model**). Both models employ the ideas of depthwise separable convolutions as well as densely connected blocks. The models have been trained on a dataset of ~35k face images labeled with 68 face landmark points.

-### 68 Point Face Landmark Detection
+<a name="models-face-recognition"></a>

-This package implements a very lightweight and fast, yet accurate 68 point face landmark detector. The default model has a size of only 350kb and the tiny model is only 80kb. Both models employ the ideas of depthwise separable convolutions as well as densely connected blocks. The models have been trained on a dataset of ~35k face images labeled with 68 face landmark points.
+## Face Recognition Model

-<a name="usage"></a>
-
-## Usage
-
-Get the latest build from dist/face-api.js or dist/face-api.min.js and include the script:
+For face recognition, a ResNet-34 like architecture is implemented to compute a face descriptor (a feature vector with 128 values) from any given face image, which is used to describe the characteristics of a persons face. The model is **not** limited to the set of faces used for training, meaning you can use it for face recognition of any person, for example yourself. You can determine the similarity of two arbitrary faces by comparing their face descriptors, for example by computing the euclidean distance or using any other classifier of your choice.

-``` html
-<script src="face-api.js"></script>
-```
+The neural net is equivalent to the **FaceRecognizerNet** used in [face-recognition.js](https://github.com/justadudewhohacks/face-recognition.js) and the net used in the [dlib](https://github.com/davisking/dlib/blob/master/examples/dnn_face_recognition_ex.cpp) face recognition example. The weights have been trained by [davisking](https://github.com/davisking) and the model achieves a prediction accuracy of 99.38% on the LFW (Labeled Faces in the Wild) benchmark for face recognition.

-Or install the package:
+The size of the quantized model is roughly 6.2 MB (**face_recognition_model**).

-``` bash
-npm i face-api.js
-```
+# Usage

-<a name="usage-load-models"></a>
+<a name="usage-loading-models"></a>

-### Loading the Models
+## Loading the Models

 To load a model, you have provide the corresponding manifest.json file as well as the model weight files (shards) as assets. Simply copy them to your public or assets folder. The manifest.json and shard files of a model have to be located in the same directory / accessible under the same route.

 Assuming the models reside in **public/models**:

 ``` javascript
-await faceapi.loadFaceDetectionModel('/models')
+await faceapi.loadSsdMobilenetv1Model('/models')
 // accordingly for the other models:
+// await faceapi.loadTinyFaceDetectorModel('/models')
+// await faceapi.loadMtcnnModel('/models')
 // await faceapi.loadFaceLandmarkModel('/models')
 // await faceapi.loadFaceLandmarkTinyModel('/models')
 // await faceapi.loadFaceRecognitionModel('/models')
-// await faceapi.loadMtcnnModel('/models')
-// await faceapi.loadTinyYolov2Model('/models')
 ```

-As an alternative, you can also create instance of the neural nets:
+Alternatively, you can also create instance of the neural nets:

 ``` javascript
-const net = new faceapi.FaceDetectionNet()
-// accordingly for the other models:
-// const net = new faceapi.FaceLandmark68Net()
-// const net = new faceapi.FaceLandmark68TinyNet()
-// const net = new faceapi.FaceRecognitionNet()
-// const net = new faceapi.Mtcnn()
-// const net = new faceapi.TinyYolov2()
-
-await net.load('/models/face_detection_model-weights_manifest.json')
-// await net.load('/models/face_landmark_68_model-weights_manifest.json')
-// await net.load('/models/face_landmark_68_tiny_model-weights_manifest.json')
-// await net.load('/models/face_recognition_model-weights_manifest.json')
-// await net.load('/models/mtcnn_model-weights_manifest.json')
-// await net.load('/models/tiny_yolov2_separable_conv_model-weights_manifest.json')
+const net = new faceapi.SsdMobilenetv1()
+await net.load('/models')
 ```

 Using instances, you can also load the weights as a Float32Array (in case you want to use the uncompressed models):

 ``` javascript
 // using fetch
-const res = await fetch('/models/face_detection_model.weights')
-const weights = new Float32Array(await res.arrayBuffer())
-net.load(weights)
+net.load(await faceapi.fetchNetWeights('/models/face_detection_model.weights'))

 // using axios
 const res = await axios.get('/models/face_detection_model.weights', { responseType: 'arraybuffer' })
@@ -179,166 +172,382 @@ const weights = new Float32Array(res.data)
 net.load(weights)
 ```

-<a name="usage-face-detection-ssd"></a>
+## High Level API

-### Face Detection - SSD Mobilenet v1
+In the following **input** can be an HTML img, video or canvas element or the id of that element.

-Detect faces and get the bounding boxes and scores:
+``` html
+<img id="myImg" src="images/example.png" />
+<video id="myVideo" src="media/example.mp4" />
+<canvas id="myCanvas" />
+```

 ``` javascript
-// optional arguments
-const minConfidence = 0.8
-const maxResults = 10
+const input = document.getElementById('myImg')
+// const input = document.getElementById('myVideo')
+// const input = document.getElementById('myCanvas')
+// or simply:
+// const input = 'myImg'
+```

-// inputs can be html canvas, img or video element or their ids ...
-const myImg = document.getElementById('myImg')
-const detections = await faceapi.ssdMobilenetv1(myImg, minConfidence, maxResults)
+### Detecting Faces
+
+Detect all faces in an image. Returns **Array<[FaceDetection](#interface-face-detection)>**:
+
+``` javascript
+const detections = await faceapi.detectAllFaces(input)
 ```

-Draw the detected faces to a canvas:
+Detect the face with the highest confidence score in an image. Returns **[FaceDetection](#interface-face-detection) | undefined**:

 ``` javascript
+const detection = await faceapi.detectSingleFace(input)
+```
+
+By default **detectAllFaces** and **detectSingleFace** utilize the SSD Mobilenet V1 Face Detector. You can specify the face detector by passing the corresponding options object:
+
+``` javascript
+const detections1 = await faceapi.detectAllFaces(input, new SsdMobilenetv1Options())
+const detections2 = await faceapi.detectAllFaces(input, new TinyFaceDetectorOptions())
+const detections3 = await faceapi.detectAllFaces(input, new MtcnnOptions())
+```
+
+You can tune the options of each face detector as shown [here](#usage-face-detection-options).
+
+### Detecting 68 Face Landmark Points
+
+**After face detection, we can furthermore predict the facial landmarks for each detected face as follows:**
+
+Detect all faces in an image + computes 68 Point Face Landmarks for each detected face. Returns **Array<[FaceDetectionWithLandmarks](#interface-face-detection-with-landmarks)>**:
+
+``` javascript
+const detectionsWithLandmarks = await faceapi.detectAllFaces(input).withFaceLandmarks()
+```
+
+Detect the face with the highest confidence score in an image + computes 68 Point Face Landmarks for that face. Returns **[FaceDetectionWithLandmarks](#interface-face-detection-with-landmarks) | undefined**:
+
+``` javascript
+const detectionWithLandmarks = await faceapi.detectSingleFace(input).withFaceLandmarks()
+```
+
+You can also specify to use the tiny model instead of the default model:
+
+``` javascript
+const useTinyModel = true
+const detectionsWithLandmarks = await faceapi.detectAllFaces(input).withFaceLandmarks(useTinyModel)
+```
+
+### Computing Face Descriptors
+
+**After face detection and facial landmark prediction the face descriptors for each face can be computed as follows:**
+
+Detect all faces in an image + computes 68 Point Face Landmarks for each detected face. Returns **Array<[FullFaceDescription](#interface-full-face-description)>**:
+
+``` javascript
+const fullFaceDescriptions = await faceapi.detectAllFaces(input).withFaceLandmarks().withFaceDescriptors()
+```
+
+Detect the face with the highest confidence score in an image + computes 68 Point Face Landmarks and face descriptor for that face. Returns **[FullFaceDescription](#interface-full-face-description) | undefined**:
+
+``` javascript
+const fullFaceDescription = await faceapi.detectSingleFace(input).withFaceLandmarks().withFaceDescriptor()
+```
+
+### Face Recognition by Matching Descriptors
+
+To perform face recognition, one can use faceapi.FaceMatcher to compare reference face descriptors to query face descriptors.
+
+First, we initialize the FaceMatcher with the reference data, for example we can simply detect faces in a **referenceImage** and match the descriptors of the detected faces to faces of subsquent images:
+
+``` javascript
+const fullFaceDescriptions = await faceapi
+  .detectAllFaces(referenceImage)
+  .withFaceLandmarks()
+  .withFaceDescriptors()
+
+if (!fullFaceDescriptions.length) {
+  return
+}
+
+// create FaceMatcher with automatically assigned labels
+// from the detection results for the reference image
+const faceMatcher = new faceapi.FaceMatcher(fullFaceDescriptions)
+```
+
+Now we can recognize a persons face shown in **queryImage1**:
+
+``` javascript
+const singleFullFaceDescription = await faceapi
+  .detectSingleFace(queryImage1)
+  .withFaceLandmarks()
+  .withFaceDescriptor()
+
+if (singleFullFaceDescription) {
+  const bestMatch = faceMatcher.findBestMatch(singleFullFaceDescription.descriptor)
+  console.log(bestMatch.toString())
+}
+```
+
+Or we can recognize all faces shown in **queryImage2**:
+
+``` javascript
+const fullFaceDescriptions = await faceapi
+  .detectAllFaces(queryImage2)
+  .withFaceLandmarks()
+  .withFaceDescriptors()
+
+fullFaceDescriptions.forEach(fd => {
+  const bestMatch = faceMatcher.findBestMatch(fd.descriptor)
+  console.log(bestMatch.toString())
+})
+```
+
+You can also create labeled reference descriptors as follows:
+
+``` javascript
+const labeledDescriptors = [
+  new faceapi.LabeledFaceDescriptors(
+    'obama'
+    [descriptorObama1, descriptorObama2]
+  ),
+  new faceapi.LabeledFaceDescriptors(
+    'trump'
+    [descriptorTrump]
+  )
+]
+
+const faceMatcher = new faceapi.FaceMatcher(labeledDescriptors)
+```
+
+<a name="usage-displaying-detection-results"></a>
+
+## Displaying Detection Results
+
+Drawing the detected faces into a canvas:
+
+``` javascript
+const detections = await faceapi.detectAllFaces(input)
+
 // resize the detected boxes in case your displayed image has a different size then the original
-const detectionsForSize = detections.map(det => det.forSize(myImg.width, myImg.height))
+const detectionsForSize = detections.map(det => det.forSize(input.width, input.height))
+// draw them into a canvas
 const canvas = document.getElementById('overlay')
-canvas.width = myImg.width
-canvas.height = myImg.height
-faceapi.drawDetection(canvas, detectionsForSize, { withScore: false })
+canvas.width = input.width
+canvas.height = input.height
+faceapi.drawDetection(canvas, detectionsForSize, { withScore: true })
 ```

-You can also obtain the tensors of the unfiltered bounding boxes and scores for each image in the batch (tensors have to be disposed manually):
+Drawing face landmarks into a canvas:

 ``` javascript
-const { boxes, scores } = await net.forward('myImg')
+const detectionsWithLandmarks = await faceapi
+  .detectAllFaces(input)
+  .withFaceLandmarks()
+
+// resize the detected boxes and landmarks in case your displayed image has a different size then the original
+const detectionsWithLandmarksForSize = detectionsWithLandmarks.map(det => det.forSize(input.width, input.height))
+// draw them into a canvas
+const canvas = document.getElementById('overlay')
+canvas.width = input.width
+canvas.height = input.height
+faceapi.drawLandmarks(canvas, detectionsWithLandmarks, { drawLines: true })
 ```

-<a name="usage-face-detection-yolo"></a>
+Finally you can also draw boxes with custom text:

-### Face Detection - Tiny Yolo v2
+``` javascript
+const boxesWithText = [
+  new faceapi.BoxWithText(new faceapi.Rect(x, y, width, height), text))
+  new faceapi.BoxWithText(new faceapi.Rect(0, 0, 50, 50), 'some text'))
+]

-Detect faces and get the bounding boxes and scores:
+const canvas = document.getElementById('overlay')
+faceapi.drawDetection(canvas, boxesWithText)
+```
+
+<a name="usage-face-detection-options"></a>
+
+## Face Detection Options
+
+### SsdMobilenetv1Options

 ``` javascript
-// defaults parameters shown:
-const forwardParams = {
-  scoreThreshold: 0.5,
-  // any number or one of the predefined sizes:
-  // 'xs' (224 x 224) | 'sm' (320 x 320) | 'md' (416 x 416) | 'lg' (608 x 608)
-  inputSize: 'md'
+export interface ISsdMobilenetv1Options {
+  // minimum confidence threshold
+  // default: 0.5
+  minConfidence?: number
+
+  // maximum number of faces to return
+  // default: 100
+  maxResults?: number
 }

-const detections = await faceapi.tinyYolov2(document.getElementById('myImg'), forwardParams)
+// example
+const options new SsdMobilenetv1Options({ minConfidence: 0.8 })
 ```

-<a name="usage-face-detection-mtcnn"></a>
+### TinyFaceDetectorOptions
+
+``` javascript
+export interface ITinyFaceDetectorOptions {
+  // size at which image is processed, the smaller the faster,
+  // but less precise in detecting smaller faces, must be divisible
+  // by 32, common sizes are 128, 160, 224, 320, 416, 512, 608,
+  // for face tracking via webcam I would recommend using smaller sizes,
+  // e.g. 128, 160, for detecting smaller faces use larger sizes, e.g. 512, 608
+  // default: 416
+  inputSize?: number
+
+  // minimum confidence threshold
+  // default: 0.5
+  scoreThreshold?: number
+}

-### Face Detection & 5 Point Face Landmarks - MTCNN
+// example
+const options new TinyFaceDetectorOptions({ inputSize: 320 })
+```

-Detect faces and get the bounding boxes and scores:
+### MtcnnOptions

 ``` javascript
-// defaults parameters shown:
-const forwardParams = {
+export interface IMtcnnOptions {
+  // minimum face size to expect, the higher the faster processing will be,
+  // but smaller faces won't be detected
+  // default: 20
+  minFaceSize?: number
+
+  // the score threshold values used to filter the bounding
+  // boxes of stage 1, 2 and 3
+  // default: [0.6, 0.7, 0.7]
+  scoreThresholds?: number[]
+
+  // scale factor used to calculate the scale steps of the image
+  // pyramid used in stage 1
+  // default: 0.709
+  scaleFactor?: number
+
  // number of scaled versions of the input image passed through the CNN
  // of the first stage, lower numbers will result in lower inference time,
  // but will also be less accurate
-  maxNumScales: 10,
-  // scale factor used to calculate the scale steps of the image
-  // pyramid used in stage 1
-  scaleFactor: 0.709,
-  // the score threshold values used to filter the bounding
-  // boxes of stage 1, 2 and 3
-  scoreThresholds: [0.6, 0.7, 0.7],
-  // minimum face size to expect, the higher the faster processing will be,
-  // but smaller faces won't be detected
-  minFaceSize: 20
+  // default: 10
+  maxNumScales?: number
+
+  // instead of specifying scaleFactor and maxNumScales you can also
+  // set the scaleSteps manually
+  scaleSteps?: number[]
 }

-const results = await faceapi.mtcnn(document.getElementById('myImg'), forwardParams)
+// example
+const options new MtcnnOptions({ minFaceSize: 100, scaleFactor: 0.8 })
 ```

-Alternatively you can also specify the scale steps manually:
+<a name="usage-utility-classes"></a>
+
+## Utility Classes
+
+### IBox

 ``` javascript
-const forwardParams = {
-  scaleSteps: [0.4, 0.2, 0.1, 0.05]
+export interface IBox {
+  x: number
+  y: number
+  width: number
+  height: number
 }
-
-const results = await faceapi.mtcnn(document.getElementById('myImg'), forwardParams)
 ```

-Finally you can draw the returned bounding boxes and 5 Point Face Landmarks into a canvas:
+<a name="interface-face-detection"></a>
+
+### IFaceDetection

 ``` javascript
-const minConfidence = 0.9
+export interface IFaceDetection {
+  score: number
+  box: Box
+}
+```

-if (results) {
-  results.forEach(({ faceDetection, faceLandmarks }) => {
+<a name="interface-face-landmarks"></a>

-    // ignore results with low confidence score
-    if (faceDetection.score < minConfidence) {
-      return
-    }
+### IFaceLandmarks

-    faceapi.drawDetection('overlay', faceDetection)
-    faceapi.drawLandmarks('overlay', faceLandmarks)
-  })
+``` javascript
+export interface IFaceLandmarks {
+  positions: Point[]
+  shift: Point
 }
 ```

-<a name="usage-face-recognition"></a>
-
-### Face Recognition
+<a name="interface-face-detection-with-landmarks"></a>

-Compute and compare the descriptors of two face images:
+### IFaceDetectionWithLandmarks

 ``` javascript
-// inputs can be html canvas, img or video element or their ids ...
-const descriptor1 = await faceapi.computeFaceDescriptor('myImg')
-const descriptor2 = await faceapi.computeFaceDescriptor(document.getElementById('myCanvas'))
-const distance = faceapi.euclideanDistance(descriptor1, descriptor2)
+export interface IFaceDetectionWithLandmarks {
+  detection: FaceDetection
+  faceLandmarks: FaceLandmarks
+}
+```
+
+<a name="interface-full-face-description"></a>

-if (distance < 0.6)
-  console.log('match')
-else
-  console.log('no match')
+### IFullFaceDescription
+
+``` javascript
+export interface IFullFaceDescription extends IFaceDetectionWithLandmarks {
+  faceDescriptor: Float32Array
+}
 ```

-Or simply obtain the tensor (tensor has to be disposed manually):
+<a name="other-useful-utility"></a>
+
+## Other Useful Utility
+
+### Using the Low Level API
+
+Instead of using the high level API, you can directly use the forward methods of each neural network:

 ``` javascript
-const t = await net.forward('myImg')
+const detections1 = await faceapi.ssdMobilenetv1(input, options)
+const detections2 = await faceapi.tinyFaceDetector(input, options)
+const detections3 = await faceapi.mtcnn(input, options)
+const landmarks1 = await faceapi.detectFaceLandmarks(input, options)
+const landmarks2 = await faceapi.detectFaceLandmarksTiny(input, options)
+const descriptor = await faceapi.computeDescriptor(input, options)
 ```

-<a name="usage-face-landmark-detection"></a>
+All global neural network instances are exported via faceapi.nets:

-### Face Landmark Detection
+``` javascript
+console.log(faceapi.nets)
+```

-Detect face landmarks:
+### Extracting a Canvas for an Image Region

 ``` javascript
-// inputs can be html canvas, img or video element or their ids ...
-const myImg = document.getElementById('myImg')
-const landmarks = await faceapi.detectLandmarks(myImg)
+const regionsToExtract = [
+  new faceapi.Rect(0, 0, 100, 100)
+]
+// actually extractFaces is meant to extract face regions from bounding boxes
+// but you can also use it to extract any other region
+const canvases = await faceapi.extractFaces(input, regionsToExtract)
 ```

-Draw the detected face landmarks to a canvas:
+### Euclidean Distance

 ``` javascript
-// adjust the landmark positions in case your displayed image has a different size then the original
-const landmarksForSize = landmarks.forSize(myImg.width, myImg.height)
-const canvas = document.getElementById('overlay')
-canvas.width = myImg.width
-canvas.height = myImg.height
-faceapi.drawLandmarks(canvas, landmarksForSize, { drawLines: true })
+// ment to be used for computing the euclidean distance between two face descriptors
+const dist = faceapi.euclideanDistance([0, 0], [0, 10])
+console.log(dist) // 10
 ```

-Retrieve the face landmark positions:
+### Retrieve the Face Landmark Points and Contours

 ``` javascript
 const landmarkPositions = landmarks.getPositions()

-// or get the positions of individual contours
+// or get the positions of individual contours,
+// only available for 68 point face ladnamrks (FaceLandmarks68)
 const jawOutline = landmarks.getJawOutline()
 const nose = landmarks.getNose()
 const mouth = landmarks.getMouth()
@@ -348,48 +557,52 @@ const leftEyeBbrow = landmarks.getLeftEyeBrow()
 const rightEyeBrow = landmarks.getRightEyeBrow()
 ```

-Compute the Face Landmarks for Detected Faces:
+### Fetch and Display Images from an URL

-``` javascript
-const detections = await faceapi.ssdMobilenetv1(input)
-
-// get the face tensors from the image (have to be disposed manually)
-const faceTensors = await faceapi.extractFaceTensors(input, detections)
-const landmarksByFace = await Promise.all(faceTensors.map(t => faceapi.detectLandmarks(t)))
-
-// free memory for face image tensors after we computed their descriptors
-faceTensors.forEach(t => t.dispose())
+``` html
+<img id="myImg" src="">
 ```

-<a name="shortcut-functions"></a>
+``` javascript
+const image = await faceapi.fetchImage('/images/example.png')

-### Shortcut Functions for Full Face Description
+console.log(image instanceof HTMLImageElement) // true

-After face detection has been performed, I would recommend to align the bounding boxes of the detected faces before passing them to the face recognition net, which will make the computed face descriptor much more accurate. Fortunately, the api can do this for you under the hood by providing convenient shortcut functions. You can obtain the full face descriptions (location, landmarks and descriptor) of each face in an input image as follows.
+// displaying the fetched image content
+const myImg = document.getElementById('myImg')
+myImg.src = image.src
+```

-Using the SSD Mobilenet v1 face detector + 68 point face landmark detector:
+### Fetching JSON

 ``` javascript
-const fullFaceDescriptions = await faceapi.allFacesSsdMobilenetv1(input, minConfidence)
+const json = await faceapi.fetchJson('/files/example.json')
 ```

-Using the Tiny Yolo v2 face detector + 68 point face landmark detector:
+### Creating an Image Picker

-``` javascript
-const fullFaceDescriptions = await faceapi.allFacesTinyYolov2(input, { inputSize: 'md' })
+``` html
+<img id="myImg" src="">
+<input id="myFileUpload" type="file" onchange="uploadImage()" accept=".jpg, .jpeg, .png">
 ```

-Or with MTCNN face detection + 5 point face landmarks:
-
 ``` javascript
-const fullFaceDescriptions = await faceapi.allFacesMtcnn(input, { minFaceSize: 20 })
+async function uploadImage() {
+  const imgFile = document.getElementById('myFileUpload').files[0]
+  // create an HTMLImageElement from a Blob
+  const img = await faceapi.bufferToImage(imgFile)
+  document.getElementById('myImg').src = img.src
+}
 ```

-The shortcut functions return an array of FullFaceDescriptions:
+### Creating a Canvas Element from an Image or Video Element

-``` javascript
-const fullFaceDescription0 = fullFaceDescriptions[0]
-console.log(fullFaceDescription0.detection) // bounding box & score
-console.log(fullFaceDescription0.landmarks) // face landmarks
-console.log(fullFaceDescription0.descriptor) // face descriptor
+``` html
+<img id="myImg" src="images/example.png" />
+<video id="myVideo" src="media/example.mp4" />
 ```
+
+``` javascript
+const canvas1 = faceapi.createCanvasFromMedia(document.getElementById('myImg'))
+const canvas2 = faceapi.createCanvasFromMedia(document.getElementById('myVideo'))
+```
\ No newline at end of file