A VS Code Extension to make it easier to manage and develop Spark jobs on EMR
-
Updated
Feb 17, 2025 - TypeScript
A VS Code Extension to make it easier to manage and develop Spark jobs on EMR
A data pipeline to generate stats from logs.
Explore a smarter way to shop online with this full-stack project built on the infrastructure of Google Cloud Platform (GCP) for RAG based e-commerce with LLM.
An end-to-end example of a serverless machine learning pipeline for multiclass classification on AWS with SageMaker Pipelines, Data Wrangler, Athena and XGBoost.
An end-to-end content-based TMDB movie recommendation engine developed using PySpark, Flask, and Angular.
StanQuant is a secure, scalable platform for testing algorithmic trading strategies using real historical stock data. Built with Django, Next.js, Spark, Kubernetes, and more, it allows users to upload, run, and evaluate trading algorithms at scale in isolated cloud containers.
Add a description, image, and links to the pyspark topic page so that developers can more easily learn about it.
To associate your repository with the pyspark topic, visit your repo's landing page and select "manage topics."