# Predictor APIs

You can deploy models from any Python framework by implementing Cortex's Predictor interface. The interface consists of an `init()` function and a `predict()` function. The `init()` function is responsible for preparing the model for serving (downloading vocabulary files, etc.). The `predict()` function is called on every request and is responsible for responding with a prediction.

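For orientation, here is a minimal sketch of the shape of such a file. The argument names (`model_path`, `sample`, `metadata`) are assumptions based on the configuration fields documented below, not a definitive spec:

```python
# predictor.py -- a sketch of the Predictor interface; the argument
# names here are assumptions, not a definitive spec
def init(model_path, metadata):
    # one-time setup: load the model, download vocabulary files, etc.
    pass

def predict(sample, metadata):
    # called on every request: run inference and return the prediction
    pass
```
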
In addition to supporting Python models via the Predictor interface, Cortex can serve the following exported model formats:

- [TensorFlow](tensorflow.md)
- [ONNX](onnx.md)

## Configuration

```yaml
- kind: api
  name: <string> # API name (required)
  endpoint: <string> # the endpoint for the API (default: /<deployment_name>/<api_name>)
  predictor:
    path: <string> # path to the predictor Python file, relative to the Cortex root (required)
    model: <string> # S3 path to a file or directory (e.g. s3://my-bucket/exported_model) (optional)
    python_path: <string> # path to the root of your Python folder that will be appended to PYTHONPATH (default: folder containing cortex.yaml)
    metadata: <string: value> # dictionary that can be used to configure custom values (optional)
  tracker:
    key: <string> # the JSON key in the response to track (required if the response payload is a JSON object)
    model_type: <string> # model type, must be "classification" or "regression" (required)
  compute:
    min_replicas: <int> # minimum number of replicas (default: 1)
    max_replicas: <int> # maximum number of replicas (default: 100)
    init_replicas: <int> # initial number of replicas (default: <min_replicas>)
    target_cpu_utilization: <int> # CPU utilization threshold (as a percentage) to trigger scaling (default: 80)
    cpu: <string | int | float> # CPU request per replica (default: 200m)
    gpu: <int> # GPU request per replica (default: 0)
    mem: <string> # memory request per replica (default: Null)
```

### Example

```yaml
- kind: api
  name: my-api
  predictor:
    path: predictor.py
  compute:
    gpu: 1
```

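A matching `predictor.py` for this API might look like the following sketch. It assumes a TorchScript model file shipped alongside the project (the file name `model.pt` is hypothetical) and the `init()`/`predict()` argument names assumed above:

```python
# predictor.py -- an illustrative sketch for the API above; the model
# file name and the argument names are assumptions
import torch

model = None
device = "cuda" if torch.cuda.is_available() else "cpu"

def init(model_path, metadata):
    # no `model` field is set in the configuration above, so load a
    # local TorchScript file and move it to the GPU requested above
    global model
    model = torch.jit.load("model.pt", map_location=device)
    model.eval()

def predict(sample, metadata):
    # run inference on the request payload and return a JSON-serializable value
    with torch.no_grad():
        tensor = torch.tensor(sample["input"], dtype=torch.float32, device=device)
        return model(tensor).tolist()
```
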
## Debugging

You can log information about each request by adding a `?debug=true` parameter to your requests. This will print:

1. The raw sample
2. The value after running the `predict` function

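For example, using the `requests` library (the endpoint and payload below are placeholders, not values from this project):

```python
import requests

# placeholder endpoint (default form: /<deployment_name>/<api_name>) and
# payload; ?debug=true makes the API log the raw sample and the value
# returned by predict() for this request
response = requests.post(
    "https://<operator_endpoint>/my-deployment/my-api",
    params={"debug": "true"},
    json={"input": [1.0, 2.0, 3.0]},
)
print(response.text)
```
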
# Predictor

A Predictor is a Python file that describes how to initialize a model and use it to make a prediction.

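As a concrete illustration (a sketch only: the pickled scikit-learn model, the argument names, and the `labels` metadata key are all assumptions), a Predictor might look like:

```python
# predictor.py -- illustrative sketch for a pickled scikit-learn
# classifier; argument names and the `labels` metadata key are assumptions
import pickle

model = None

def init(model_path, metadata):
    # load the model once, before the API starts serving requests;
    # model_path is assumed to be a local copy of the `model` field
    # from the API configuration
    global model
    with open(model_path, "rb") as f:
        model = pickle.load(f)

def predict(sample, metadata):
    # run inference on the request payload and map the predicted class
    # index to a human-readable label via the metadata dictionary
    prediction = model.predict([sample["features"]])[0]
    return metadata["labels"][int(prediction)]
```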