Labelbox
diff --git a/examples/basics/basics.ipynb b/examples/basics/basics.ipynb
Lines changed: 36 additions & 22 deletions
diff --git a/examples/basics/data_rows.ipynb b/examples/basics/data_rows.ipynb
Lines changed: 59 additions & 28 deletions
@@ -13,15 +13,15 @@
    "id": "smaller-syndication",
    "metadata": {},
    "source": [
-    "#### Quick install instructions\n",
+    "### Quick install instructions\n",
     "The quick version is basically just\n",
     "1. `!pip install labelbox`\n",
     "2. `export LABELBOX_API_KEY=\"<your_api_key>\"`\n",
     "* Get this from the UI under (Account -> API -> Create API Key)\n",
-    "\n",
+    "* You can also set the api_key below in the notebook.\n",
     "\n",
     "This only works for cloud deployments.\n",
-    "* For more details : https://docs.labelbox.com/python-sdk/en/index-en#labelbox-python-sdk"
+    "* For more details : https://docs.labelbox.com/python-sdk/en/index-en#labelbox-python-sdk\n"
    ]
   },
   {
@@ -34,6 +34,16 @@
     "    *    https://docs.labelbox.com/python-sdk/en/index-en#fundamental-concepts"
    ]
   },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "id": "indie-bracket",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "!pip install labelbox"
+   ]
+  },
   {
    "cell_type": "code",
    "execution_count": 5,
@@ -42,7 +52,8 @@
    "outputs": [],
    "source": [
     "from labelbox import Client\n",
-    "from labelbox import Project, Dataset"
+    "from labelbox import Project, Dataset\n",
+    "import os"
    ]
   },
   {
@@ -74,15 +85,19 @@
     "PROJECT_ID = \"ckk4q1viuc0w20704eh69u28h\"\n",
     "DATASET_ID = \"ckk4q1vjznyhu087203wlghfr\"\n",
     "PROJECT_NAME = \"Sample Project\"\n",
-    "DATASET_NAME = \"Example Jellyfish Dataset\""
+    "DATASET_NAME = \"Example Jellyfish Dataset\"\n",
+    "# In Colab, set API_KEY directly. Otherwise this reads the LABELBOX_API_KEY environment variable.\n",
+    "API_KEY = os.environ[\"LABELBOX_API_KEY\"]\n",
+    "# Only update this if you have an on-prem deployment\n",
+    "ENDPOINT = \"https://api.labelbox.com/graphql\""
    ]
   },
   {
    "cell_type": "markdown",
    "id": "chinese-playing",
    "metadata": {},
    "source": [
-    "#### Client\n",
+    "### Client\n",
     "* Starting point for all db interactions"
    ]
   },
@@ -93,9 +108,7 @@
    "metadata": {},
    "outputs": [],
    "source": [
-    "#Client is used for all DB interactions.\n",
-    "#This is usually the starting point for all usage.\n",
-    "client = Client()"
+    "client = Client(api_key = API_KEY, endpoint = ENDPOINT)"
    ]
   },
   {
@@ -136,7 +149,7 @@
    "id": "popular-nylon",
    "metadata": {},
    "source": [
-    "#### Fields\n",
+    "### Fields\n",
     "* All db objects have fields (look at the source code to see them https://github.com/Labelbox/labelbox-python/blob/develop/labelbox/schema/project.py)\n",
     "* These fields are attributes of the object"
    ]
@@ -187,7 +200,7 @@
    "id": "viral-power",
    "metadata": {},
    "source": [
-    "#### Pagination\n",
+    "### Pagination\n",
     "* Queries that return a list of database objects return them as a PaginatedCollection\n",
     "* The goal here is to limit the data being returned to only the necessary data."
    ]
@@ -232,17 +245,18 @@
     }
    ],
    "source": [
-    "#Iterate over them to get the items out.\n",
+    "# Note that if you selected a project_id without any labels this will raise StopIteration\n",
+    "# Iterate over them to get the items out.\n",
     "next(labels_paginated_collection)\n",
-    "#Be careful not to call list(paginated_collection) on a large collection"
+    "# Be careful not to call list(paginated_collection) on a large collection"
    ]
   },
   {
    "cell_type": "markdown",
    "id": "widespread-startup",
    "metadata": {},
    "source": [
-    "#### Query parameters\n",
+    "### Query parameters\n",
     "* Query with the following conventions:\n",
     "    * `DbObject.Field`"
    ]
@@ -273,21 +287,21 @@
     "    (Project.description == \"new description field\")\n",
     "))\n",
     "                            \n",
-    "#The above two queries return PaginatedCollections because the filter parameters aren't guarenteed to be unique.\n",
-    "#So even if there is one element returned it is in a paginatedCollection.\n",
+    "# The above two queries return PaginatedCollections because the filter parameters aren't guaranteed to be unique.\n",
+    "# So even if only one element is returned, it is in a PaginatedCollection.\n",
     "print(projects)\n",
     "print(next(projects, None))\n",
     "print(next(projects, None))\n",
     "print(next(projects, None))\n",
-    "#We can see there is only one."
+    "# We can see there is only one."
    ]
   },
   {
    "cell_type": "markdown",
    "id": "french-toner",
    "metadata": {},
    "source": [
-    "#### Querying Limitations\n",
+    "### Querying Limitations\n",
     "* The DbObject used for the query must be the same as the DbObject returned by the querying function.  \n",
     "* eg. is not valid since get_project returns a Project but we are filtering on a Dataset\n",
     ">  `>>> projects = client.get_projects(where = Dataset.name == \"dataset_name\")`\n"
@@ -298,7 +312,7 @@
    "id": "defensive-bidder",
    "metadata": {},
    "source": [
-    "#### Relationship\n",
+    "### Relationship\n",
     "* This solves the above problem of querying by a relationship\n",
     "* You can find all realtionships of a DB object in the source code\n",
     "    * eg. for a Project ( https://github.com/Labelbox/labelbox-python/blob/develop/labelbox/schema/project.py))"
@@ -322,9 +336,9 @@
     }
    ],
    "source": [
-    "#Dataset has a Relationship to a Project so we can use the following\n",
+    "# Dataset has a Relationship to a Project so we can use the following\n",
     "list(dataset.projects())\n",
-    "#This will return all projects that are attached to this dataset"
+    "# This will return all projects that are attached to this dataset"
    ]
   },
   {
@@ -354,7 +368,7 @@
    "id": "metric-speaker",
    "metadata": {},
    "source": [
-    "#### Delete\n",
+    "### Delete\n",
     "* Most DBObjects support deletion"
    ]
   },
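
Note on the Pagination cells above: the added comments warn that `next()` raises `StopIteration` on an empty collection and that `list()` on a large collection is expensive. The sketch below illustrates both points with a hypothetical `LazyPages` class and `make_fetcher` helper. These names are assumptions made for illustration; this is not Labelbox's `PaginatedCollection` implementation, only a minimal lazy iterator that fetches one page at a time.

```python
from typing import Callable, Iterator, List


class LazyPages:
    """Hypothetical stand-in for a paginated collection (not Labelbox's code)."""

    def __init__(self, fetch_page: Callable[[int], List[str]]):
        self._fetch_page = fetch_page  # returns one page of items, [] when exhausted
        self._page = 0
        self._buffer: List[str] = []
        self.pages_fetched = 0  # counts simulated backend requests

    def __iter__(self) -> Iterator[str]:
        return self

    def __next__(self) -> str:
        if not self._buffer:
            # Only hit the (simulated) backend when the local buffer is empty.
            self._buffer = list(self._fetch_page(self._page))
            self._page += 1
            self.pages_fetched += 1
            if not self._buffer:
                raise StopIteration
        return self._buffer.pop(0)


def make_fetcher(items: List[str], page_size: int) -> Callable[[int], List[str]]:
    """Simulate a backend that serves `items` in pages of `page_size`."""
    def fetch(page: int) -> List[str]:
        start = page * page_size
        return items[start:start + page_size]
    return fetch


labels = LazyPages(make_fetcher(["a", "b", "c", "d", "e"], page_size=2))
first = next(labels)                   # fetches only the first page
fetched_after_first = labels.pages_fetched

empty = LazyPages(make_fetcher([], page_size=2))
missing = next(empty, None)            # a default avoids StopIteration when empty

remaining = list(labels)               # drains the collection, fetching every remaining page
```

Passing a default to `next()`, as the query-parameters cell already does with `next(projects, None)`, keeps the cell from failing when a project has no labels.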
 
@@ -20,30 +20,62 @@
     "* A datarow is a member of a dataset \n",
     "* A datarow cannot exist without belonging to a dataset.\n",
     "* Datarows are staged to be labeled by attaching the dataset that they are members of to a project\n",
-    "    * See dataset notebook on information about datasets\n",
-    "---\n",
-    "To run this notebook on your own data, set the variables in the next cell"
+    "    * See dataset notebook on information about datasets"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": null,
-   "id": "rural-fellow",
+   "id": "posted-nation",
    "metadata": {},
    "outputs": [],
    "source": [
-    "PROJECT_ID = \"ckk4q1viuc0w20704eh69u28h\""
+    "!pip install labelbox"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": null,
-   "id": "filled-disaster",
+   "id": "beautiful-ready",
    "metadata": {},
    "outputs": [],
    "source": [
     "from labelbox import Project, Dataset, DataRow, Client\n",
-    "import uuid"
+    "import uuid\n",
+    "import os"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "legendary-harvard",
+   "metadata": {},
+   "source": [
+    "* Set the variables in the following cell to run this notebook on your own data"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "id": "rural-fellow",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "# Pick a project that has a dataset attached\n",
+    "PROJECT_ID = \"ckk4q1viuc0w20704eh69u28h\"\n",
+    "# In Colab, set API_KEY directly. Otherwise this reads the LABELBOX_API_KEY environment variable.\n",
+    "API_KEY = os.environ[\"LABELBOX_API_KEY\"]\n",
+    "# Only update this if you have an on-prem deployment\n",
+    "ENDPOINT = \"https://api.labelbox.com/graphql\""
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "id": "proof-detective",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "client = Client(api_key = API_KEY, endpoint = ENDPOINT)"
    ]
   },
   {
@@ -53,11 +85,10 @@
    "metadata": {},
    "outputs": [],
    "source": [
-    "client = Client()\n",
     "project = client.get_project(PROJECT_ID)\n",
     "dataset = next(project.datasets())\n",
-    "#This is the same as\n",
-    "#-> dataset = client.get_dataset(dataset_id)"
+    "# This is the same as\n",
+    "# -> dataset = client.get_dataset(dataset_id)"
    ]
   },
   {
@@ -86,7 +117,7 @@
    "metadata": {},
    "outputs": [],
    "source": [
-    "#Url\n",
+    "# Url\n",
     "print(\"Associated dataset\", datarow.dataset())\n",
     "print(\"Associated label(s)\",  next(datarow.labels()))\n",
     "print(\"External id\", datarow.external_id)"
@@ -99,7 +130,7 @@
    "metadata": {},
    "outputs": [],
    "source": [
-    "#External ids can be a reference to your internal datasets\n",
+    "# External ids can be a reference to your internal datasets\n",
     "datarow = dataset.data_row_for_external_id(datarow.external_id)\n",
     "print(datarow)"
    ]
@@ -123,8 +154,8 @@
     "dataset = client.create_dataset(name = \"testing-dataset\")\n",
     "dataset.create_data_row(row_data = \"https://picsum.photos/200/300\")\n",
     "\n",
-    "#It is reccomended that you use external ids but optional.\n",
-    "#These are useful for users to maintain references to a data_row.\n",
+    "# Using external ids is recommended but optional.\n",
+    "# These are useful for users to maintain references to a data_row.\n",
     "dataset.create_data_row(row_data = \"https://picsum.photos/200/300\", external_id = str(uuid.uuid4()))\n"
    ]
   },
@@ -135,7 +166,7 @@
    "metadata": {},
    "outputs": [],
    "source": [
-    "#Bulk create datarows\n",
+    "# Bulk create datarows\n",
     "task1 = dataset.create_data_rows([{DataRow.row_data : \"https://picsum.photos/200/300\"}\n",
     "                          , {DataRow.row_data : \"https://picsum.photos/200/300\"}])"
    ]
@@ -147,7 +178,7 @@
    "metadata": {},
    "outputs": [],
    "source": [
-    "#Local paths\n",
+    "# Local paths\n",
     "local_data_path = '/tmp/test_data_row.txt'\n",
     "with open(local_data_path, 'w') as file:\n",
     "    file.write(\"sample data\")\n",
@@ -162,7 +193,7 @@
    "metadata": {},
    "outputs": [],
    "source": [
-    "#You can mix local files with urls\n",
+    "# You can mix local files with urls\n",
     "task3 = dataset.create_data_rows([{DataRow.row_data : \"https://picsum.photos/200/300\"}, local_data_path])"
    ]
   },
@@ -173,8 +204,8 @@
    "metadata": {},
    "outputs": [],
    "source": [
-    "#Note that you cannot set external_ids at this time when uploading from local files.\n",
-    "#To do this you have to first\n",
+    "# Note that you cannot set external_ids at this time when uploading from local files.\n",
+    "# To do this you have to first\n",
     "item_url = client.upload_file(local_data_path)\n",
     "task4 = dataset.create_data_rows([{DataRow.row_data : item_url, DataRow.external_id : str(uuid.uuid4())}])"
    ]
@@ -186,7 +217,7 @@
    "metadata": {},
    "outputs": [],
    "source": [
-    "#Blocking wait until complete\n",
+    "# Blocking wait until complete\n",
     "task1.wait_till_done()\n",
     "task2.wait_till_done()\n",
     "task3.wait_till_done()\n",
@@ -210,7 +241,7 @@
    "metadata": {},
    "outputs": [],
    "source": [
-    "#Useful for resigning urls\n",
+    "# Useful for re-signing urls\n",
     "new_id = str(uuid.uuid4())\n",
     "datarow.update(external_id = new_id)\n",
     "print(datarow.external_id, new_id)\n"
@@ -223,12 +254,12 @@
    "metadata": {},
    "outputs": [],
    "source": [
-    "#We can also attach metadata (See metadata tutorial for more)\n",
-    "#Metadata is visible for all projects with this datarow attached\n",
+    "# We can also attach metadata (See metadata tutorial for more)\n",
+    "# Metadata is visible for all projects with this datarow attached\n",
     "datarow.create_metadata(meta_type = \"TEXT\", meta_value = \"LABELERS WILL SEE THIS \")\n",
-    "#See more information here:\n",
-    "#https://docs.labelbox.com/en/import-data/attachments\n",
-    "#Note that meta_value must always be a string (url to a video/image or a text value to display)"
+    "# See more information here:\n",
+    "# https://docs.labelbox.com/en/import-data/attachments\n",
+    "# Note that meta_value must always be a string (url to a video/image or a text value to display)"
    ]
   },
   {
@@ -247,7 +278,7 @@
    "outputs": [],
    "source": [
     "datarow.delete()\n",
-    "#Will remove from the dataset too"
+    "# Will remove from the dataset too"
    ]
   },
   {
@@ -257,7 +288,7 @@
    "metadata": {},
    "outputs": [],
    "source": [
-    "#Bulk delete a list of datarows (in this case all of them we just uploaded)\n",
+    "# Bulk delete a list of datarows (in this case all of them we just uploaded)\n",
     "DataRow.bulk_delete(list(dataset.data_rows()))"
    ]
   }
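
Note on the `API_KEY` cells in both notebooks: `os.environ["LABELBOX_API_KEY"]` raises `KeyError` when the variable is unset, which is the Colab case the comment describes. One possible way to handle both cases is sketched below. `resolve_api_key` is a hypothetical helper written for this note; it is not part of the `labelbox` package.

```python
import os
from typing import Mapping, Optional


def resolve_api_key(explicit: Optional[str] = None,
                    env: Mapping[str, str] = os.environ) -> str:
    """Return `explicit` if given, else LABELBOX_API_KEY from `env`.

    Hypothetical helper for illustration; not part of the labelbox package.
    """
    key = explicit or env.get("LABELBOX_API_KEY")
    if not key:
        raise RuntimeError(
            "No API key found: pass one explicitly or set LABELBOX_API_KEY.")
    return key


# In Colab, pass the key as the first argument; elsewhere rely on the environment.
# The env mappings here are stand-ins so the example does not depend on your shell.
key_from_arg = resolve_api_key("my-key", env={})
key_from_env = resolve_api_key(env={"LABELBOX_API_KEY": "env-key"})
```

In the notebook this would reduce to `API_KEY = resolve_api_key()`, with the resulting value passed to `Client(api_key = API_KEY, endpoint = ENDPOINT)` as in the cells above.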