This repository was archived by the owner on Nov 23, 2023. It is now read-only.

Commit 96ae111

Amendment 9:41
1 parent 5a635ee commit 96ae111

1 file changed

+24
-0
lines changed

Wade Jacobs - Base Model/Team_GM5_BaseModel.ipynb

Lines changed: 24 additions & 0 deletions
@@ -11314,6 +11314,30 @@
11314 11314
"---"
11315 11315
]
11316 11316
},
11317+
{
11318+
"cell_type": "markdown",
11319+
"id": "741da55e",
11320+
"metadata": {},
11321+
"source": [
11322+
"---\n",
11323+
"\n",
11324+
"Initial observations:\n",
11325+
"\n",
11326+
"1. One feature in particular, \"Valencia Pressure\", is missing a number of observations. \n",
11327+
"\n",
11328+
"2. There is a feature called \"Unnamed: 0\", which does not add any substantive information to the overall dataset. \n",
11329+
"\n",
11330+
"3. As will be shown more fully below, three features, namely \"time\", \"Valencia_wind_deg\" and \"Seville_pressure\", are incorrectly typed as object. \n",
11331+
"\n",
11332+
"4. There are quite a few features that are largely empty and do not contain any valuable information that contributes to the overall dataset, such as the \"Seville_clouds_all\" and \"Bilbao_snow_3h\" columns.\n",
11333+
"\n",
11334+
"5. As will be shown more fully below, there are quite a few outliers present, which distort the distributions of the affected features. \n",
11335+
"\n",
11336+
"6. Multicollinearity appears to be extremely prevalent among the various features. \n",
11337+
"\n",
11338+
"---"
11339+
]
11340+
},
11317 11341
{
11318 11342
"cell_type": "markdown",
11319 11343
"id": "50f14031",

0 commit comments
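
The observations listed in the added markdown cell (missing values, object-typed columns, largely empty features, outliers, multicollinearity) could be checked with a short pandas sketch along the following lines. This is not code from the notebook: the helper name `summarize_issues`, the 1.5 × IQR outlier rule, the reading of "largely empty" as "mostly zero", and both thresholds are assumptions made for illustration.

```python
import numpy as np
import pandas as pd


def summarize_issues(df: pd.DataFrame,
                     corr_threshold: float = 0.9,
                     zero_share: float = 0.9) -> dict:
    """Hypothetical helper: quick diagnostics for the issues listed above."""
    # Missing values per column, keeping only columns that have any.
    missing = df.isna().sum()
    missing = missing[missing > 0].to_dict()

    # Columns stored with the generic object dtype (candidates for conversion).
    object_columns = df.select_dtypes(include="object").columns.tolist()

    numeric = df.select_dtypes(include="number")

    # Assumption: "largely empty" means the share of zeros reaches zero_share.
    mostly_zero = numeric.columns[(numeric == 0).mean() >= zero_share].tolist()

    # Outliers per column using the common 1.5 * IQR rule (an assumed choice).
    q1, q3 = numeric.quantile(0.25), numeric.quantile(0.75)
    iqr = q3 - q1
    is_outlier = (numeric < q1 - 1.5 * iqr) | (numeric > q3 + 1.5 * iqr)
    outlier_counts = is_outlier.sum()
    outlier_counts = outlier_counts[outlier_counts > 0].to_dict()

    # Feature pairs whose absolute Pearson correlation exceeds the threshold.
    # Only the upper triangle is scanned so each pair is reported once.
    corr = numeric.corr().abs()
    upper = corr.where(np.triu(np.ones(corr.shape, dtype=bool), k=1))
    correlated_pairs = [
        (row, col)
        for row in upper.index
        for col in upper.columns
        if upper.loc[row, col] > corr_threshold
    ]

    return {
        "missing": missing,
        "object_columns": object_columns,
        "mostly_zero": mostly_zero,
        "outlier_counts": outlier_counts,
        "correlated_pairs": correlated_pairs,
    }
```

Calling `summarize_issues(df)` on the training DataFrame (whatever name the notebook gives it) returns a dictionary that can be compared with the six observations. An index-like column such as "Unnamed: 0" can then be removed with `df.drop(columns=["Unnamed: 0"])`.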