From f983267530bb334ba5848fa23762b405e5c7e978 Mon Sep 17 00:00:00 2001 From: Emin Orhan Date: Wed, 9 Jun 2021 10:04:19 -0400 Subject: [PATCH] adding some shared public datasets in LakeGroup --- cassio/DATASETS.md | 24 +++++++++++++++++++++--- 1 file changed, 21 insertions(+), 3 deletions(-) diff --git a/cassio/DATASETS.md b/cassio/DATASETS.md index b642cd8..66686eb 100644 --- a/cassio/DATASETS.md +++ b/cassio/DATASETS.md @@ -1,12 +1,12 @@ -**Last update: 01/18/2020** +**Last update: 06/09/2021** Here we store up-to-date information about the datasets stored in CILVR/CDS filesystems. Please send a PR with edited file if you want to contribute here! -# NLP data +# NLP datasets -`/misc/vlgscratch4/BowmanGroup/datasets` : +`/misc/vlgscratch4/BowmanGroup/datasets`: * bert_trees @@ -47,3 +47,21 @@ Please send a PR with edited file if you want to contribute here! * wikipedia_corpus_tiny * WikiText103 + +`/misc/vlgscratch5/LakeGroup/shared_data`: + +* [pile](https://pile.eleuther.ai/) + +# Computer vision datasets + +`/misc/vlgscratch5/LakeGroup/shared_data`: + +* [ava_v2](http://research.google.com/ava/download.html) + +* [ecoset](https://www.kietzmannlab.org/ecoset/) + +* [openimages](https://storage.googleapis.com/openimages/web/index.html) + +* [places365](http://places2.csail.mit.edu/download.html) + +* [imagenet_winter21_whole](https://image-net.org/download-images.php)