Quentin Lhoest(@qlhoest) 's Twitter Profile Photo

Finally !! You can query ANY dataset on πŸ€—Hugging Face with DuckDB πŸ’›πŸ’›πŸ’›

It's now EASY to access datasets in SQL, e.g.

```
SELECT * FROM 'hf://datasets/<repo-id>/<file>'
```

Here I queried the dataset used for Cohere For AI Aya 101 or the new ✨Aya 23 ✨ multilingual LLMs:

Finally !! You can query ANY dataset on πŸ€—@huggingface with @duckdb πŸ’›πŸ’›πŸ’›

It's now EASY to access datasets in SQL, e.g.

```
SELECT * FROM 'hf://datasets/<repo-id>/<file>'
```

Here I queried the dataset used for @CohereForAI Aya 101 or the new ✨Aya 23 ✨ multilingual LLMs:
account_circle
WrenAI(@getwrenai) 's Twitter Profile Photo

blog.getwren.ai/how-to-load-hu…

Yesterday, @DuckDB and Hugging Face announced support for the hf:// path to 150,000+ datasets on Hugging Face. Today, we also added support for the latest DuckDB version on WrenAI, allowing you to load datasets directly from Hugging Face and use natural

account_circle
Sylvain Lesage(@severo_dev) 's Twitter Profile Photo

πŸ¦† βœ– πŸ€—
DuckDB 0.10.3 natively supports Hugging Face datasets!

Why does it matter?

Because it unlocks new use cases, possibly the one you need!

⬇️

πŸ¦† βœ– πŸ€—
@duckdb 0.10.3 natively supports @huggingface datasets!

Why does it matter?

Because it unlocks new use cases, possibly the one you need!

⬇️
account_circle
The Moisrex(@the_moisrex) 's Twitter Profile Photo

DuckDB is a very interesting project. Its C++11 implementation could use some modernizing and a bit less allocations, but the idea very interesting.
shell.duckdb.org

DuckDB is a very interesting project. Its C++11 implementation could use some modernizing and a bit less allocations, but the idea very interesting.
shell.duckdb.org
account_circle
just-js(@justjs14) 's Twitter Profile Photo

'With DuckDB as a browser for the data cloud, relational datasets are always just a hyperlink away' nikolasgoebel.com/2024/05/28/duc…

'With DuckDB as a browser for the data cloud, relational datasets are always just a hyperlink away' nikolasgoebel.com/2024/05/28/duc…
account_circle
T.W. Buley(@taylorbuley) 's Twitter Profile Photo

duckdb as a 'browser for the data cloud' is a concept I can get behind nikolasgoebel.com/2024/05/28/duc…

account_circle
DuckDB(@duckdb) 's Twitter Profile Photo

New blog post: Access 150k+ Datasets from Hugging Face with DuckDB

This blog post, co-authored by the Hugging Face and DuckDB teams, describes how you can use the hf:// prefix in DuckDB to access datasets in Hugging Face repositories.

Read more at duckdb.org/2024/05/29/acc…

New blog post: Access 150k+ Datasets from Hugging Face with DuckDB

This blog post, co-authored by the @huggingface and DuckDB teams, describes how you can use the hf:// prefix in DuckDB to access datasets in Hugging Face repositories.

Read more at duckdb.org/2024/05/29/acc…
account_circle
Equan P.(@junwatu) 's Twitter Profile Photo

Pake multi-shot prompting untuk generate SQL.

Supaya hasil generasi gak keluar dari pakem SQL di database (DuckDB) maka cara paling aman adalah dengan memberikan contoh perintah SQL (contoh: INSERT)

Pake multi-shot prompting untuk generate SQL. 

Supaya hasil generasi gak keluar dari pakem SQL di database (DuckDB) maka cara paling aman adalah dengan memberikan contoh perintah SQL (contoh: INSERT)
account_circle
Tidier.jl(@Tidierjl) 's Twitter Profile Photo

TidierDB.jl ❀️ DuckDB

TidierDB lets you run queries lazily in DuckDB without instantiating the results in memory (like dbplyr + duckdb). You can even reuse and build on top of the queries by using the `from_query()` function.

TidierDB.jl ❀️ DuckDB

TidierDB lets you run queries lazily in DuckDB without instantiating the results in memory (like dbplyr + duckdb). You can even reuse and build on top of the queries by using the `from_query()` function.
account_circle
Khuyen Tran(@KhuyenTran16) 's Twitter Profile Photo

Traditional database systems require a predefined table schema and a subsequent data import process when working with CSV data.

In contrast, allows for direct reading of CSV files from disk, eliminating the need for explicit table creation and data loading.

Traditional database systems require a predefined table schema and a subsequent data import process when working with CSV data.

In contrast, #DuckDB allows for direct reading of CSV files from disk, eliminating the need for explicit table creation and data loading.
account_circle
Trent Hauck(@trent_hauck) 's Twitter Profile Photo

Wrote up a one-pager on using DuckDB (or really any OLAP, local or not) to query traces for statistical inference. E.g. measuring the mean difference between two random variables. I'll try to show a multi-level example at some point.

Wrote up a one-pager on using DuckDB (or really any OLAP, local or not) to query traces for statistical inference. E.g. measuring the mean difference between two random variables. I'll try to show a multi-level example at some point.
account_circle