spark-dataframes

Here are 7 public repositories matching this topic...

jkoth / Data-Lake-with-Spark-and-AWS-S3

Create a data lake on AWS S3 to store dimensional tables after processing data with Spark on an AWS EMR cluster

apache-spark aws-s3 aws-emr pyspark data-engineering data-lake json-format udacity-nanodegree spark-dataframes dimensional-model star-schema etl-pipeline

Updated Oct 10, 2019
Python

LucasDLee / CMPT-353-Final-Project

This is our final project for SFU's CMPT 353 taught by Greg Baker during Summer 2023

python data-science statistics university-project spark-dataframes

Updated Aug 23, 2023
Python

chinmayms / propinvestment

Predict current property investment opportunities using data analysis (Big Data, Spark ML)

django spark apache pandas spark-dataframes spark-ml

Updated Jun 18, 2017
Python

the-timoye / spark-examples

python spark data-wrangling spark-sql spark-dataframes data-engin

Updated Sep 3, 2023
Python

milesgranger / pontem

Treat Spark like pandas.

pandas pyspark dataframes dataframe-api spark-dataframes distributed-dataframe

Updated Sep 3, 2017
Python

airztz / Python4fun

Some batch processing demos with various data warehouses like local, S3 and HDFS in AWS

aws-s3 batch-processing pandas-dataframes spark-dataframes hadoop-hdfs

Updated Feb 27, 2018
Python

on2e / ntua-atdb

Advanced Topics in Databases course project - NTUA ECE - 2022-23

apache-spark pyspark spark-dataframes advanced-database apache-hadoop ntua-ece spark-rdd

Updated Mar 30, 2023
Python
