Huanchen Zhang(@huanchenzhang) 's Twitter Profileg
Huanchen Zhang

@huanchenzhang

Assistant Professor @Tsinghua_Uni. Formerly @CarnegieMellon

ID:987064905693061122

linkhttp://people.iiis.tsinghua.edu.cn/~huanchen/ calendar_today19-04-2018 20:26:27

121 Tweets

1,0K Followers

233 Following

Andy Pavlo (@andy_pavlo@discuss.systems)(@andy_pavlo) 's Twitter Profile Photo

Columnar file formats like Parquet/ORC are ubiquitous. Our VLDB paper with Xinyu Zeng + Huanchen Zhang + Wes McKinney studies their internals.

TLDR: They're not optimized for modern hardware. Something new is needed.

Paper: vldb.org/pvldb/vol17/p1…
Code: github.com/XinyuZeng/Eval…

Columnar file formats like Parquet/ORC are ubiquitous. Our VLDB paper with @XinyuZeng218 + @huanchenzhang + @wesmckinn studies their internals. TLDR: They're not optimized for modern hardware. Something new is needed. Paper: vldb.org/pvldb/vol17/p1… Code: github.com/XinyuZeng/Eval…
account_circle
Andy Pavlo (@andy_pavlo@discuss.systems)(@andy_pavlo) 's Twitter Profile Photo

Somebody tipped me off that a 2022 paper out of Saudia Arabia blatantly stole our entire 2019 ICDE Bulletin survey paper on using ML automatically optimize databases.

+ 2022 Plagiarism: eajournals.org/ejcsit/vol10-i…
+ 2019 Original: db.cs.cmu.edu/papers/2019/pa…

Somebody tipped me off that a 2022 paper out of Saudia Arabia blatantly stole our entire 2019 ICDE Bulletin survey paper on using ML automatically optimize databases. + 2022 Plagiarism: eajournals.org/ejcsit/vol10-i… + 2019 Original: db.cs.cmu.edu/papers/2019/pa…
account_circle
Andy Pavlo (@andy_pavlo@discuss.systems)(@andy_pavlo) 's Twitter Profile Photo

My #1 PhD student Matt Butrovich successfully completed his PhD defense. Thanks to the committee (@pateljm billions of packets Sam Madden). Matt's thesis is on accelerating databases with eBPF (telemetry, proxies, OLTP stores).

You have 60 days to hire him. Expect fierce competition.

My #1 PhD student @butro successfully completed his PhD defense. Thanks to the committee (@pateljm @justinesherry @samrmadden). Matt's thesis is on accelerating databases with eBPF (telemetry, proxies, OLTP stores). You have 60 days to hire him. Expect fierce competition.
account_circle
SIGMOD/PODS 2024(@SIGMODConf) 's Twitter Profile Photo

The SIGMOD Jim Gray Dissertation Award has started receiving nominations! Deadline: March 15th, 2024. For more information please visit sigmod.org/sigmod-awards/…

account_circle
Nesime Tatbul(@tatbul) 's Twitter Profile Photo

CfP: Data Management on New Hardware Workshop, co-located with SIGMOD/PODS 2024 in Santiago, Chile. Papers due: March 15, 2024. Carsten Binnig and I are looking forward to your submissions to this *** 20th special edition of DaMoN ***! damon-db.org

account_circle
Programming Wisdom(@CodeWisdom) 's Twitter Profile Photo

'I had this crazy idea that I’m going to build a database engine that does not have a server, that talks directly to disk, and ignores the data types, and if you asked any of the experts of the day, they would say, “That’s impossible. That will never work. That’s a stupid idea.”

account_circle
Andy Pavlo (@andy_pavlo@discuss.systems)(@andy_pavlo) 's Twitter Profile Photo

I'm back again with my annual retrospective of the last year in the world of databases. Major highlights include vector databases, @MariaDB problems, SQL:2023, the FAA database crash, and the most expensive password change ever: ottertune.com/blog/2023-data…

account_circle
Peter Boncz(@peterabcz) 's Twitter Profile Photo

Last month the 'DB Research Meeting' was held @mitcsail, hosted by Sam Madden and Natassa Ailamaki (🙏). An encounter of the who's who in data systems research.

I warned for the declining impact of DB research & pitched better incentives for system work: bit.ly/dbmeeting-boncz

account_circle
Andrew Akbashev(@Andrew_Akbashev) 's Twitter Profile Photo

Overpublishing puts enormous stress on students and PIs.

And brings tons of money to publishers in STEM.

A new study shows that the number of papers is increasing FASTER than the number of graduates.

It’s an amazing work with very useful statistics. Huge kudos to the

Overpublishing puts enormous stress on students and PIs. And brings tons of money to publishers in STEM. A new study shows that the number of papers is increasing FASTER than the number of #PhD graduates. It’s an amazing work with very useful statistics. Huge kudos to the
account_circle
Wes McKinney(@wesmckinn) 's Twitter Profile Photo

Important stuff here 👇 It's great to see this transformation of the data stack gaining steam (and raising more capital) but much work still remains

account_circle
Wes McKinney(@wesmckinn) 's Twitter Profile Photo

Some long form thoughts on composable data management systems (and ApacheArrow, pandas, and more) on the heels of VLDB 2023 (also migrated my blog to Quarto!):

wesmckinney.com/blog/looking-b…

account_circle
Marc Brooker(@MarcJBrooker) 's Twitter Profile Photo

'My Favorite Bits of OSDI/ATC'23', new blog post looking at some trends, some papers I enjoyed, and a few random themes from this week's conference: brooker.co.za/blog/2023/07/1…

account_circle
Nesime Tatbul(@tatbul) 's Twitter Profile Photo

Join us on 6/19 for the 19th International Workshop on Data Management on New Hardware (DaMoN) SIGMOD/PODS 2024 !

- Invited talks by Frank Hady (Intel) & Huanchen Zhang (Tsinghua Univ)

- 17 exciting papers

See you in Seattle! Norman May

damon-db.org

account_circle
Andy Pavlo (@andy_pavlo@discuss.systems)(@andy_pavlo) 's Twitter Profile Photo

Thanks for this hot take dude who doesn't know the 60 year history of databases. H/T Prem Viswanathan

Mike and I have a WIP paper that analyzes all the (failed) attempts to replace SQL + relational model. This tweet has motivated me to finish it and submit it.

Thanks for this hot take dude who doesn't know the 60 year history of databases. H/T @prempv Mike and I have a WIP paper that analyzes all the (failed) attempts to replace SQL + relational model. This tweet has motivated me to finish it and submit it.
account_circle
Paul Dix(@pauldix) 's Twitter Profile Photo

Great read, this quote is particularly insightful: “To put it another way, new technologies that require throwing away old technologies are harder to scale than new technologies that somehow bring along old technologies.”

account_circle
Matthias Boehm(@matthiasboehm7) 's Twitter Profile Photo

Why is PACMMOD using single-column ACM style, and requiring SIGMOD/PODS 2024 camera-ready papers to reformat last minute? This is wrong at so many levels.

account_circle