Here are 874 public repositories matching this topic "dataframe"
Repository
Created on May 7, 2017, 3:43 am
cuDF - GPU DataFrame Library
Last updated on December 4, 2023, 6:08 am
Repository
Created on February 23, 2023, 5:16 pm
data-science
python
dag
data-engineering
dataframe
etl
etl-framework
etl-pipeline
feature-engineering
featurization
Your single tool to express data, ML, and LLM pipelines with simple python functions. Runs anywhere that python runs, E.G. spark, airflow, jupyter, fastapi, etc. Incrementally adoptable. Use Hamilton to build testable, reusable, and self-documenting dataflows with lineage and metadata out of the box.
Last updated on December 4, 2023, 5:13 am
Repository
Created on May 13, 2020, 7:45 pm
Dataframes powered by a multithreaded, vectorized query engine, written in Rust
Last updated on December 4, 2023, 6:43 am
Repository
Created on April 17, 2021, 3:40 pm
Apache Arrow DataFusion SQL Query Engine
Last updated on December 4, 2023, 5:30 am
Repository
Created on October 7, 2023, 4:12 pm
Web scraper to get updates of my master degrees schedule and send and email to my classmates when a change occurs
Last updated on November 27, 2023, 10:12 am
Repository
Created on December 3, 2023, 11:13 am
data-science
dataframe
excel
jupyter-notebook
pandas
python
regular-expression
scraper
scraping
selenium
Data analysis of trending FIFA players. Data was collected using web scraping by selenium and visualized by Tableau. Our Tableau link is attached below:
Last updated on December 3, 2023, 12:48 pm
Repository
Created on November 25, 2023, 2:49 pm
data-analysis
data-visualization
dataframe
dataset
ggplot2
kaggle
kaggle-dataset
r
rstudio
tidyverse
This repository serves as a comprehensive resource for individuals interested in exploring the intricacies of cleaning and analyzing used cars datasets. By delving into the provided scripts and datasets, users can gain valuable insights into the world of used cars and understand the methodologies employed to ensure data accuracy and reliability.
Last updated on November 30, 2023, 1:59 pm
Repository
Created on April 25, 2022, 10:02 pm
image-processing
machine-learning
python
data-engineering
data-science
dataframe
deep-learning
distributed-computing
rust
The Python DataFrame for Complex Data
Last updated on December 3, 2023, 1:41 pm
Repository
Created on June 21, 2018, 9:35 pm
Modin: Scale your Pandas workflows by changing a single line of code
Last updated on December 3, 2023, 12:35 pm
Repository
Created on October 28, 2017, 5:25 pm
numerical-analysis
dataframe
data-analysis
multidimensional-data
cpp
large-data
heterogeneous-data
statistical-analysis
financial-data-analysis
financial-engineering
C++ DataFrame for statistical, Financial, and ML analysis -- in modern C++ using native types and contiguous memory storage
Last updated on December 3, 2023, 6:20 pm
Repository
Created on September 9, 2016, 9:41 pm
Mimesis is a powerful Python library that empowers developers to generate massive amounts of synthetic data efficiently.
Last updated on December 2, 2023, 7:40 pm
Repository
Created on April 8, 2023, 5:59 pm
automation
diagram
file-management
flask
flowchart
pdf
python
robotic-process-automation
software-design
txt-files
File Management, School Automation, Text Automation, Web Crawler, Web Automation, Data Preprocessing, Dataframe Editor
Last updated on September 24, 2023, 4:03 am
Repository
Created on February 19, 2019, 4:41 pm
python3
pandas
pandas-extension
technical-analysis
technical-analysis-indicators
technical-analysis-library
finance
fundamental-analysis
trading
trading-algorithms
Technical Analysis Indicators - Pandas TA is an easy to use Python 3 Pandas Extension with 130+ Indicators
Last updated on December 4, 2023, 6:16 am
Repository
Created on April 19, 2021, 6:24 pm
Snowflake Snowpark Python API
Last updated on December 1, 2023, 2:36 pm
Repository
Created on March 18, 2021, 4:10 pm
Type safety for spark columns
Last updated on December 3, 2023, 9:57 pm
Repository
Created on April 27, 2020, 8:46 am
Structured data processing in Kotlin
Last updated on December 3, 2023, 8:13 pm
Repository
Created on September 26, 2022, 9:05 am
pldf is a framework for working with JavaScript objects as if they were DataFrames
Last updated on December 29, 2022, 3:43 pm
Repository
Created on November 16, 2023, 6:35 pm
Analysis of inventory data.
Last updated on November 18, 2023, 6:02 pm
Repository
Created on February 16, 2023, 5:17 am
data-analysis
pandas
tableau
tableau-alternative
visualization
data-exploration
dataframe
matplotlib
plotly
PyGWalker: Turn your pandas dataframe into an interactive UI for visual analysis
Last updated on December 4, 2023, 6:44 am
Repository
Created on November 29, 2023, 6:56 am
Access file as a database with interactive SQL query experience. Built on Streamlit and DuckDB.
Last updated on November 30, 2023, 11:29 am
Repository
Created on December 1, 2023, 1:33 am
data-science
dataframe
ganeshkavhar
ganeshkavhargithub
ganeshkavharpythontutorials
pandas
pyspark-notebook
python
learn Python Pandas Basics
Last updated on December 1, 2023, 1:35 am
Repository
Created on May 19, 2022, 2:32 pm
Apache Arrow Ballista Distributed Query Engine
Last updated on December 2, 2023, 6:39 pm
Repository
Created on June 28, 2017, 1:57 am
datascience
data-analysis
data-analytics
regression
regression-models
principal-component-analysis
finance
quantitative-finance
dataframe
dataframe-library
The foundational library of the Morpheus data science framework
Last updated on December 2, 2023, 4:08 pm
Repository
Created on July 11, 2023, 5:29 pm
Repository of practical activities of the Data Analyst professional course.
Last updated on September 20, 2023, 10:43 pm
Repository
Created on January 13, 2021, 10:21 pm
Fastest library to load data from DB to DataFrames in Rust and Python
Last updated on December 4, 2023, 6:06 am
Repository
Created on June 11, 2019, 7:24 am
python
elasticsearch
pandas
data-analysis
machine-learning
time-series-forecasting
etl
big-data
dataframe
eland
Python Client and Toolkit for DataFrames, Big Data, Machine Learning and ETL in Elasticsearch
Last updated on November 28, 2023, 11:21 pm
Repository
Created on February 3, 2023, 4:25 am
A very much Pandas-like JavaScript library for data science
Last updated on November 26, 2023, 4:25 pm
Repository
Created on November 28, 2023, 8:42 pm
Data Frame, Data Set and Parquet
Last updated on November 28, 2023, 8:44 pm
Repository
Created on April 18, 2020, 10:46 am
clojure
clojure-library
spark
data-science
data-engineering
high-performance-computing
machine-learning
dataframe
distributed-computing
clojure-repl
A Clojure dataframe library that runs on Spark
Last updated on November 24, 2023, 8:01 pm
Repository
Created on September 6, 2018, 7:01 am
Multi-dimensional data arrays with labeled dimensions
Last updated on November 16, 2023, 6:31 am