You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+10-10Lines changed: 10 additions & 10 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -29,32 +29,32 @@ Below examples include the intense usage of industry-hot frameworks (i.e. Pytorc
29
29
30
30
### 2020 Edition
31
31
32
-
#### [End-to-End Machine Learning Model using PySpark and MLlib](https://github.com/hyunjoonbok/Python-Projects/blob/master/vanilla/End-to-End%20Machine%20Learning%20Model%20using%20PySpark%20and%20MLlib.ipynb):
32
+
#### [Multi-class Text Classification Problem with PySpark and MLlib](https://github.com/hyunjoonbok/Python-Projects/blob/master/vanilla/Multi-class%20Text%20Classification%20Problem%20with%20PySpark%20and%20MLlib.ipynb):
33
33
<p>
34
-
We build a complete ML model (Binary Classification with Imbalanced Classes problem) leveraging Spark's computation. Full cycle of ML (EDA, feature engineering, model building) is covered. In-Memory computation and Parallel-Processing are some of the major reasons that Apache Spark has become very popular in the big data industry to deal with data products at large scale and perform faster analysis
34
+
Apache Spark is quickly gaining steam both in the headlines and real-world adoption, mainly because of its ability to process streaming data. With so much data being processed on a daily basis, it has become essential for us to be able to stream and analyze it in real time. We use Spark Machine Learning Library (Spark MLlib) to solve this multi-class text classification problem.
35
35
</p>
36
-
July 3, 2020
36
+
Jul 6, 2020
37
37
38
38
39
-
#### [End-to-End Machine Learning Model using PySpark and MLlib](https://github.com/hyunjoonbok/Python-Projects/blob/master/vanilla/End-to-End%20Machine%20Learning%20Model%20using%20PySpark%20and%20MLlib.ipynb):
39
+
#### [Retail Price Recommendation model with Gradient Boosting Tree](https://github.com/hyunjoonbok/Python-Projects/blob/master/vanilla/Retail%20Price%20Recommendation%20model%20with%20Gradient%20Boosting%20Tree.ipynb):
40
40
<p>
41
-
We build a complete ML model (Binary Classification with Imbalanced Classes problem) leveraging Spark's computation. Full cycle of ML (EDA, feature engineering, model building) is covered. In-Memory computation and Parallel-Processing are some of the major reasons that Apache Spark has become very popular in the big data industry to deal with data products at large scale and perform faster analysis
41
+
Mercari (Japan’s biggest shopping app) would like to offer pricing suggestions to sellers, but this is not easy because their sellers are enabled to put just about anything, or any bundle of things, on Mercari’s marketplace. In this machine learning project, we will build a model that automatically suggests the right product prices. Here we build a complete price recommendation model leveraging LightGBM.
42
42
</p>
43
-
July 3, 2020
43
+
Jul 4, 2020
44
44
45
45
46
-
#### [Full Pytorch Implementation of Recommender System (Collaborative Filtering)](https://github.com/hyunjoonbok/Python-Projects/blob/master/vanilla/End-to-End%20Machine%20Learning%20Model%20using%20PySpark%20and%20MLlib.ipynb):
46
+
#### [Full Pytorch Implementation of Recommender System (Collaborative Filtering)](https://github.com/hyunjoonbok/Python-Projects/blob/master/Pytorch/Full%20Pytorch%20Implementation%20of%20Recommender%20System%20(Collaborative%20Filtering).ipynb):
47
47
<p>
48
-
We build a complete ML model (Binary Classification with Imbalanced Classes problem) leveraging Spark's computation. Full cycle of ML (EDA, feature engineering, model building) is covered. In-Memory computation and Parallel-Processing are some of the major reasons that Apache Spark has become very popular in the big data industry to deal with data products at large scale and perform faster analysis
48
+
We utilize Pytoch's embeddings layers to build a simple recommendation system. Our model will predict user ratings for specific movies.
49
49
</p>
50
-
July 3, 2020
50
+
Jul 2, 2020
51
51
52
52
53
53
#### [End-to-End Machine Learning Model using PySpark and MLlib](https://github.com/hyunjoonbok/Python-Projects/blob/master/vanilla/End-to-End%20Machine%20Learning%20Model%20using%20PySpark%20and%20MLlib.ipynb):
54
54
<p>
55
55
We build a complete ML model (Binary Classification with Imbalanced Classes problem) leveraging Spark's computation. Full cycle of ML (EDA, feature engineering, model building) is covered. In-Memory computation and Parallel-Processing are some of the major reasons that Apache Spark has become very popular in the big data industry to deal with data products at large scale and perform faster analysis
56
56
</p>
57
-
July 3, 2020
57
+
Jun 28, 2020
58
58
59
59
60
60
#### [ML Model for predicting a Crew Size](https://github.com/hyunjoonbok/Python-Projects/blob/master/vanilla/Ship_Crew_Size_ML_Model.ipynb):
0 commit comments