Did you know Databricks integrates with DeltaStream? Now you can process streaming data and write results directly to #DeltaLake . Keep your Delta Tables always up-to-date!
deltastream.io/integrating-de…
#DataEngineering #StreamingData #DeltaLake
Z Order in #DeltaLake organizes data in storage to improve query performance.
In this example, the query has to scan through 8 separate files to find rows where id = 5. However, with Z Order optimization, the query only needs to scan one file to locate the desired rows.
Have you tried the #OneTable quickstart? Within minutes you can have a pipeline simultaneously using Hudi, Delta, and Iceberg. Check out the docs here: onetable.dev/docs/how-to/
#ApacheHudi #ApacheIceberg #DeltaLake #DataLakehouse
If you use Apache Doris as a data lakehouse, this is what the data stacks look like:
Read more: doris.apache.org/docs/lakehouse…
#Hive #Iceberg #Hudi #DeltaLake #ApachePaimon
Support for liquid clustering is now generally available using Databricks Runtime +15.2
Getting started with Delta Lake Liquid clustering youtube.com/watch?v=6g685a…
#DeltaLake #Databricks
You can now enable liquid clustering on existing tables without the need to rewrite the underlying data.
It requires DBR 14.3 LTS+
#DeltaLake #Databricks
Want to learn how to time travel?? 🚀
This is the second part in our YouTube Short series on the things you need to know about #DeltaLake - and Chris is talking time travel!
Catch the first part here: hubs.la/Q01ZLlky0
Things you need to know about Delta Lake: Part 3 - Optimisation 🎯
What can you do to make your #DeltaLake faster, more reliable, and more scalable? Chris has the answer you're looking for!
#DataEngineering #DataOptimisation
Saddle up for #DataCouncil 🤠. Let's corral a fireside chat on #lakehouse table formats #ApacheHudi , #DeltaLake , and #ApacheIceberg . You don't want to miss this 🌶 discussion 3/27 11:30am. I will also intro the brand new Apache XTable (Incubating) (prev known as OneTable)
#apachextable
Ah, so that's the explanation why I didn't see TRUNCATE TABLE in #DeltaLake OSS (and on #Databricks ).
It is simply not supported in OSS 🤷♂️
Jupyter notebooks are a great tool for data analysis in #Python . They are easy to use and give you an intuitive, interactive interface to process and visualize your data.
Learn how you can use Delta Lake from a #Jupyter Notebook ➡ delta.io/blog/delta-lak…
#deltalake #opensource
The Apache Druid community just added a Delta Lake connector via Delta Kernel Java.
Delta Kernel is an ambitious project to abstract all the core Delta logic into Java/Rust codebases, so each connector doesn't need to write all Delta processing logic from scratch.
#deltalake
🚨 Register to join me live 12/14 10am PST and I will answer all your Qs about @OnetableOSS. You no longer have to pick between #ApacheHudi , #ApacheIceberg , and #DeltaLake . Register to watch live, or get the recording: 👉 linkedin.com/events/onetabl…
#datalakehouse #apachepaimon
deu certo saporra! É para glorificar de pé! hahahaha
Milhares de linhas duplicadas que não iriam permitir inserirmos os dados usando upsert no Delta com Streaming.
#databricks #healthtech #datasus #deltaLake #s3 #streaming #TeoMeWhy