Ponder(@ponderdata) 's Twitter Profileg
Ponder

@ponderdata

Run pandas at scale directly in your data warehouse—be it Snowflake, BigQuery or DuckDB—all from the comfort of your favorite IDE or notebook.

ID:1409999723784671234

linkhttp://ponder.io/product calendar_today29-06-2021 22:19:53

684 Tweets

2,4K Followers

167 Following

Ponder(@ponderdata) 's Twitter Profile Photo

We’re excited to announce Snowflake's intent to acquire Ponder to bring Ponder's Python data science innovations to all its customers.

Thank you to all of the Ponder and Modin supporters who brought us to this point -- We're excited for what's to come!

ponder.io/snowflake-to-a…

account_circle
Ponder(@ponderdata) 's Twitter Profile Photo

Great coverage in the Real Python podcast of Mahesh Vashishtha's article: 'Why Are There So Many Python Dataframes?'

Here's the link to jump straight to that 7-minute portion:
realpython.com/podcasts/rpp/1…

Link to article: ponder.io/why-are-there-…

account_circle
Ponder(@ponderdata) 's Twitter Profile Photo

You can run NumPy 🔢 at scale in your database with Ponder 💥

Through Modin Project, Ponder supports:
- Element-wise matrix ops (addition, subtraction, division, power)
- Axis-collapsing or reducing ops (min, max, sum, product, mean)
& more: ponder.io/numpy-on-snowf…

account_circle
Ponder(@ponderdata) 's Twitter Profile Photo

If you're backed into a dark alley, facing a pack of ravenous NaN dogs, you'll want a sidekick like pandas dropna.

Drop rows with n missing values using thresh=n.

Drop entirely empty rows with how='all'.

Learn more from Matt Harrison: ponder.io/professional-p…

account_circle
Ponder(@ponderdata) 's Twitter Profile Photo

A 2-hour pandas workflow (150 million rows) can take 2 minutes with Ponder on Snowflake.

Read more: ponder.io/ponder-in-publ…

Or watch the demo here: youtube.com/watch?v=N7D2Wn…

account_circle
Ponder(@ponderdata) 's Twitter Profile Photo

How do you use database connections & cursors in Python?

It can be hard to figure out because database concepts & Python objects don't map 1:1.

To learn more (with an emphasis on the Snowflake-Python connector) check out: ponder.io/top-5-things-t…

account_circle
Ponder(@ponderdata) 's Twitter Profile Photo

If you want to automatically parallelize your pandas code, you should check out open-source Modin: github.com/modin-project/…

Modin Project uses the pandas API, so all you need to do is change 'import pandas as pd' to 'import modin.pandas as pd'

account_circle
Python Hub(@PythonHub) 's Twitter Profile Photo

How the Python Dataframe Interchange Protocol Makes Life Better

In this article, we answer three questions about the Python Dataframe Interchange Protocol: What it is + what problems it solves; how it works; and how extensively it's been adopted.

ponder.io/how-the-python…

account_circle
pandas(@pandas_dev) 's Twitter Profile Photo

We're happy to announce the release of 2.1.1. You can install it with `pip install pandas` or `mamba install -c conda-forge pandas`. You can find what's new in this version in the release notes. Thanks to all contributors and sponsors who made this release possible!

account_circle
Ponder(@ponderdata) 's Twitter Profile Photo

Ever had hourly data, but wished it were monthly? Daily, but wished it were weekly? 😢

pandas resample will dry your tears.

It makes upsampling + downsampling really slick, & now it's possible to run it directly in your database: ponder.io/how-to-use-pan…

Ever had hourly data, but wished it were monthly? Daily, but wished it were weekly? 😢 pandas resample will dry your tears. It makes upsampling + downsampling really slick, & now it's possible to run it directly in your database: ponder.io/how-to-use-pan… #pandas #database
account_circle
Ponder(@ponderdata) 's Twitter Profile Photo

Congratulations to @MotherDuck 🦆 on your Series B 📈

We're excited to have Ponder be part of the Modern Duck Stack!

motherduck.com/blog/motherduc…

Congratulations to @MotherDuck 🦆 on your Series B 📈 We're excited to have Ponder be part of the Modern Duck Stack! motherduck.com/blog/motherduc… #pandas #Python #duckdb
account_circle
Ponder(@ponderdata) 's Twitter Profile Photo

Pulling data from Snowflake to work w/ scikit-learn, LightGBM, XGBoost?

💥Snowpark ML lets you stay in Snowflake but use important ML classes: LGBMClassifier, Lasso, PCA, XGBClassifier, etc.

💥Ponder lets you do your pandas preprocessing in Snowflake
ponder.io/using-snowpark…

account_circle
Ponder(@ponderdata) 's Twitter Profile Photo

Ponder typically runs pandas code in your database by translating it to SQL. But pandas apply is different.

For apply, we create a temporary Python user-defined table function (UDTF).

Read more about apply + how Ponder handles it: ponder.io/pandas-apply-i…

Ponder typically runs pandas code in your database by translating it to SQL. But pandas apply is different. For apply, we create a temporary Python user-defined table function (UDTF). Read more about apply + how Ponder handles it: ponder.io/pandas-apply-i… #Python #pandas #SQL
account_circle
Ponder(@ponderdata) 's Twitter Profile Photo

.Software Dev Tools saw a workload speedup of 4X with Modin Project (the scalable drop-in replacement for pandas) compared to pandas!

Read more from Intel here: intel.com/content/www/us…

account_circle
Ponder(@ponderdata) 's Twitter Profile Photo

The Python Dataframe Interchange Protocol is a quiet hero 🦸‍♀️.

It makes it easy for libraries to accept many dataframes (Modin, Polars, Ibis, cuDF, Dask).

Read our encomium: ponder.io/how-the-python…

Thanks to Marco Gorelli & Ralf Gommers for your work!

account_circle