This repository is an educational project focused on big data processing using Python. It includes scripts and datasets for tasks such as word count analysis and data manipulation.
-
Updated
Mar 11, 2025 - Python
This repository is an educational project focused on big data processing using Python. It includes scripts and datasets for tasks such as word count analysis and data manipulation.
This repository presents a 2-round coreset-based MapReduce algorithm designed to address the k-center problem with z outliers.
This is a small project for Big Data Computing course, applying Dimensionality Reduction, Sampling and Clustering for topic detection in text documents.
Homeworks from the Big Data Computing course, UniPD, 2021/22
Add a description, image, and links to the big-data-computing topic page so that developers can more easily learn about it.
To associate your repository with the big-data-computing topic, visit your repo's landing page and select "manage topics."