What would you like to be added?
Hello,
Are there any plans to integrate with Kubeflow Spark Applications? A unified Python solution would be very helpful: it would let us run Spark tasks from a pipeline with minimal direct interaction with the Kubernetes API, customize how jobs are started, and monitor their progress.
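To make the request a bit more concrete, here is a minimal sketch of the kind of boilerplate a Python integration could hide. It assumes the Spark Operator's `SparkApplication` custom resource (`sparkoperator.k8s.io/v1beta2`). The helper names `build_spark_application` and `is_finished`, the image, the file path, the Spark version, and the `spark` service account are all hypothetical and only for illustration; this is not an existing Kubeflow API.

```python
import json

# Simplified assumption: treat only these two states as terminal.
# The operator reports other states too (e.g. submission failures).
TERMINAL_STATES = {"COMPLETED", "FAILED"}


def build_spark_application(name, namespace, image, main_file, executors=2):
    """Return a SparkApplication manifest as a plain dict (hypothetical helper)."""
    return {
        "apiVersion": "sparkoperator.k8s.io/v1beta2",
        "kind": "SparkApplication",
        "metadata": {"name": name, "namespace": namespace},
        "spec": {
            "type": "Python",
            "mode": "cluster",
            "image": image,                    # placeholder image
            "mainApplicationFile": main_file,  # placeholder path
            "sparkVersion": "3.5.0",           # assumed version
            "driver": {"cores": 1, "memory": "1g", "serviceAccount": "spark"},
            "executor": {"instances": executors, "cores": 1, "memory": "1g"},
        },
    }


def is_finished(resource):
    """Check a fetched SparkApplication object for a terminal state."""
    status = resource.get("status", {})
    state = status.get("applicationState", {}).get("state", "")
    return state in TERMINAL_STATES


manifest = build_spark_application(
    name="example-job",
    namespace="default",
    image="example.invalid/spark-py:latest",
    main_file="local:///opt/app/main.py",
)
print(json.dumps(manifest, indent=2))
```

Submitting this manifest and polling its status would still require calls to the Kubernetes API, which is exactly the part it would be nice to have wrapped in a pipeline component.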
Why is this needed?
Spark is an important tool for working with data in machine learning pipelines. The Kubeflow ecosystem includes a Spark solution, but there is currently no tool for easily integrating and using it within a Kubeflow pipeline.
Love this feature?
Give it a 👍. We prioritize the features with the most 👍.