Formation OpenClassrooms - Parcours data scientist - Projet n°8 - Déployez un modèle dans le cloud - 70 h
-
Updated
Oct 13, 2022 - HTML
Formation OpenClassrooms - Parcours data scientist - Projet n°8 - Déployez un modèle dans le cloud - 70 h
Pyspark, machine learning, python
This repository contains Databricks projects utilizing RDDs, DataFrames, and SQL to process and analyze various real-world datasets. Data cleaning and analysis have been performed using PySpark functions to handle challenges such as inconsistent formats, missing values, and complex data structures. The project ensures efficient data transformation
Add a description, image, and links to the pyspark-python topic page so that developers can more easily learn about it.
To associate your repository with the pyspark-python topic, visit your repo's landing page and select "manage topics."