Andreas Mueller (also at mastodon)(@amuellerml) 's Twitter Profileg
Andreas Mueller (also at mastodon)

@amuellerml

Machine learner, Python geek and scikit-learn developer.
Principal Research SDE @AzureData @Microsoft

ID:471550563

linkhttp://amueller.github.io calendar_today23-01-2012 00:40:44

9,5K Tweets

49,5K Followers

1,0K Following

Follow People
Andy Pavlo (@andy_pavlo@discuss.systems)(@andy_pavlo) 's Twitter Profile Photo

Columnar file formats like Parquet/ORC are ubiquitous. Our VLDB paper with Xinyu Zeng + Huanchen Zhang + Wes McKinney studies their internals.

TLDR: They're not optimized for modern hardware. Something new is needed.

Paper: vldb.org/pvldb/vol17/p1ā€¦
Code: github.com/XinyuZeng/Evalā€¦

Columnar file formats like Parquet/ORC are ubiquitous. Our VLDB paper with @XinyuZeng218 + @huanchenzhang + @wesmckinn studies their internals. TLDR: They're not optimized for modern hardware. Something new is needed. Paper: vldb.org/pvldb/vol17/p1ā€¦ Code: github.com/XinyuZeng/Evalā€¦
account_circle
William Fedus(@LiamFedus) 's Twitter Profile Photo

Not only is this the best model in the world, but it's available for free in ChatGPT, which has never before been the case for a frontier model.

account_circle
William Fedus(@LiamFedus) 's Twitter Profile Photo

GPT-4o is our new state-of-the-art frontier model. Weā€™ve been testing a version on the LMSys arena as im-also-a-good-gpt2-chatbot šŸ™‚. Hereā€™s how itā€™s been doing.

GPT-4o is our new state-of-the-art frontier model. Weā€™ve been testing a version on the LMSys arena as im-also-a-good-gpt2-chatbot šŸ™‚. Hereā€™s how itā€™s been doing.
account_circle
Andrej Karpathy(@karpathy) 's Twitter Profile Photo

Congrats to AI at Meta on Llama 3 release!! šŸŽ‰
ai.meta.com/blog/meta-llamā€¦
Notes:

Releasing 8B and 70B (both base and finetuned) models, strong-performing in their model class (but we'll see when the rankings come in @ lmsys.org :))
400B is still training, but already encroachingā€¦

account_circle
Yann LeCun(@ylecun) 's Twitter Profile Photo

šŸ„ Llama3 is out šŸ„
8B and 70B models available today.
8k context length.
Trained with 15 trillion tokens on a custom-built 24k GPU cluster.
Great performance on various benchmarks, with Llam3-8B doing better than Llama2-70B in some cases.
More versions are coming over the nextā€¦

šŸ„ Llama3 is out šŸ„ 8B and 70B models available today. 8k context length. Trained with 15 trillion tokens on a custom-built 24k GPU cluster. Great performance on various benchmarks, with Llam3-8B doing better than Llama2-70B in some cases. More versions are coming over the nextā€¦
account_circle
Ibis Project(@IbisData) 's Twitter Profile Photo

We often get questions around why Voltron Data supports the Ibis project -- we've answered them here!

TL;DR: open standards are critical for the composable data ecosystem and tightly coupling Python dataframes to execution engines is bad for everyone

ibis-project.org/posts/why-vodaā€¦

account_circle
Julien Le Dem(@J_) 's Twitter Profile Photo

The rumors are true! I started a(nother) blog. sympathetic.ink

The first post is an adaption of my talk, recalling the pas 10+ years of building open source standards and the lessons learned along the way. sympathetic.ink/2024/01/24/Tenā€¦

account_circle
Nick Erickson(@innixma) 's Twitter Profile Photo

Kaggle's (@Kaggle) latest competition's top 11 highest scoring notebooks all use šŸš€@AutoGluon AutoMLšŸš€ to achieve their strong performance!

When I said that AutoGluon 1.0 was the largest jump in the state-of-the-art in 4 years, I meant it.

Competition: kaggle.com/competitions/pā€¦

Kaggle's (@Kaggle) latest competition's top 11 highest scoring notebooks all use šŸš€@AutoGluon AutoMLšŸš€ to achieve their strong performance! When I said that AutoGluon 1.0 was the largest jump in the state-of-the-art in 4 years, I meant it. Competition: kaggle.com/competitions/pā€¦
account_circle
Andrew Lamb(@andrewlamb1111) 's Twitter Profile Photo

I am going to speak about ApacheArrow , Apache Parquet and Apache Arrow DataFusion at the Data Council this March. Should be a good conference datacouncil.ai/talks24/buildiā€¦

account_circle
OtterTune(@OtterTuneAI) 's Twitter Profile Photo

From the rise of vector databases to SQL:2023 to MariaDB troubles and the FAA outage, 2023 was an exciting year in database history. OtterTune CEO Andy Pavlo (@[email protected]) covers all that, plus database VC funding. ottertune.com/blog/databasesā€¦

account_circle
hazyresearch(@HazyResearch) 's Twitter Profile Photo

Thank you so much for the fun keynote, NeurIPS Conference

As in every year, our lab had a blast! We've enjoyed connecting with so many smart, enthusiastic people--and learning about your work. What an exciting time in AI!

Some asked for slides: cs.stanford.edu/~chrismre/papeā€¦ and videoā€¦

Thank you so much for the fun keynote, @NeurIPSConf As in every year, our lab had a blast! We've enjoyed connecting with so many smart, enthusiastic people--and learning about your work. What an exciting time in AI! Some asked for slides: cs.stanford.edu/~chrismre/papeā€¦ and videoā€¦
account_circle
Daniel Mas Montserrat(@_danielmas) 's Twitter Profile Photo

What if you could train an MLP with milliseconds instead of hours and still obtain state-of-the-art accuracy?

We introduce HyperFast: a hypernetwork for instant classification of tabular data that matches the accuracy of XGBoost while being much faster!

openreview.net/pdf?id=VRBhaU8ā€¦

What if you could train an MLP with milliseconds instead of hours and still obtain state-of-the-art accuracy? We introduce HyperFast: a hypernetwork for instant classification of tabular data that matches the accuracy of XGBoost while being much faster! openreview.net/pdf?id=VRBhaU8ā€¦
account_circle
Andreas Mueller (also at mastodon)(@amuellerml) 's Twitter Profile Photo

I'm excited to share our results on MotherNet, a new hyper-network architecture based on TabPFN that can learn an MLP in-context using a single forward pass. This substantially improves prediction times over predicting with TabPFN directly: arxiv.org/abs/2312.08598

account_circle
Andreas Mueller (also at mastodon)(@amuellerml) 's Twitter Profile Photo

Really amazing and inspiring talk by Chris Re! Hope the recording will be available soon, I'll have to re-watch it a couple of times, I think. Also you should follow Dillon Niederhut PhD for amazing NeurIPS coverage.

account_circle
Andreas Mueller (also at mastodon)(@amuellerml) 's Twitter Profile Photo

In other news, I think I'll mostly be leaving this platform for LinkedIn, which seems to have higher quality engagement these days. Though this is obviously the premier platform for trolling and subtweeting (sub-X-ing?)

account_circle
Andreas Mueller (also at mastodon)(@amuellerml) 's Twitter Profile Photo

Christopher Re at his keynote 'I think Ilya deserves the Touring award. Maybe not Employee of the month, but definitely the touring award'. [Most of the keynote was deeply technical and quite inspiring.]

account_circle
Shao-Hua Sun(@shaohua0116) 's Twitter Profile Photo

Since everyone on my Twitter timeline is attending . I thought it would be helpful to share this Inigo Montoya conference networking tip for initiating conversations, which works great for me.

Since everyone on my Twitter timeline is attending #NeurIPS2023. I thought it would be helpful to share this Inigo Montoya conference networking tip for initiating conversations, which works great for me.
account_circle
Dillon Niederhut PhD(@dillonniederhut) 's Twitter Profile Photo

I'm worried that we're heading into an LLM replicability crisis. How many of the results that we've seen are due to very careful prompts? How many would be robust to a small change?

- Meredith Ringel Morris at

account_circle
Miro Dudik(@MiroDudik) 's Twitter Profile Photo

šŸšØDeadline for ML/AI postdocs at Microsoft Research NYC is extended to December 15. If you are at and want to chat about these positions, please DM me.

account_circle
Andreas Mueller (also at mastodon)(@amuellerml) 's Twitter Profile Photo

Amazing tutorial by Nihar Shah on the many issues in peer review, definitely check out slides and material! cs.cmu.edu/~nihars/tutoriā€¦ my take away is that there's really no hope for the current model of peer review. Which was kind of my intuition before...

account_circle