Shalini De Mello(@shalinidemello) 's Twitter Profileg
Shalini De Mello

@shalinidemello

Director of Research & Distinguished Research Scientist @NVIDIA, doing AI and graphics research. PhD from UT Austin, ECE. Views are my own.

ID:1037319684708655104

linkhttps://research.nvidia.com/person/shalini-de-mello/ calendar_today05-09-2018 12:41:00

657 Tweets

2,0K Followers

406 Following

Prof. Anima Anandkumar(@AnimaAnandkumar) 's Twitter Profile Photo

For the first time, we show that the Llama 7B LLM can be trained on a single consumer-grade GPU (RTX 4090) with only 24GB memory. This represents more than 82.5% reduction in memory for storing optimizer states during training.

Training LLMs from scratch currently requires huge

account_circle
Leshem Choshen 🤖🤗(@LChoshen) 's Twitter Profile Photo

DoRA explores the magnitude and direction and
surpasses LoRA quite significantly

This is done with an empirical finding that I can't wrap my head around
NVIDIA AI
arxiv.org/abs/2402.09353
Shih-Yang Sean Liu Chien-Yi Wang Hongxu (Danny) Yin Pavlo Molchanov Min-Hung (Steve) Chen

DoRA explores the magnitude and direction and surpasses LoRA quite significantly This is done with an empirical finding that I can't wrap my head around @NVIDIAAI arxiv.org/abs/2402.09353 @nbasyl_tw @chienyi_wang @yin_hongxu @PavloMolchanov @CMHungSteven
account_circle
Wonmin Byeon(@wonmin_byeon) 's Twitter Profile Photo

ConvSSM: State Space Models for long videos
🎉 We finally released the code and the pretrained models.
Code: github.com/NVlabs/ConvSSM
Paper: arxiv.org/abs/2310.19694
NVIDIA AI Jimmy Smith

account_circle
NVIDIA AI(@NVIDIAAI) 's Twitter Profile Photo

Join @NVIDIA and Dell Technologies today at 2:00 p.m. ET to hear from Target and Canadian Tire as they discuss the top retail use cases that are driving revenue, improving customer experiences, and boosting employee productivity. nvda.ws/48rGHa3

account_circle
Yann LeCun(@ylecun) 's Twitter Profile Photo

Meta spends twice more on R&D than Amazon, Alphabet, and Microsoft, and 4 times more than Apple, when normalized by revenue.
The only other big tech that comes close is Nvidia.
That tells you a lot about who is building the technology of tomorrow.

account_circle
Soumith Chintala(@soumithchintala) 's Twitter Profile Photo

gpt-fast now supports mixtral-8x7B, in addition to gpt/llama.
1000 lines of simple pytorch code blazing it out!
github.com/pytorch-labs/g…

gpt-fast now supports mixtral-8x7B, in addition to gpt/llama. 1000 lines of simple pytorch code blazing it out! github.com/pytorch-labs/g…
account_circle
Lijie Fan(@lijie_fan) 's Twitter Profile Photo

🚀 Is the future of vision models Synthetic? Introducing SynCLR: our new pipeline leveraging LLMs & Text-to-image models to train vision models with only synthetic data!
🔥 Outperforming SOTAs like DinoV2 & CLIP on real images! SynCLR excels in fine-grained classification &

🚀 Is the future of vision models Synthetic? Introducing SynCLR: our new pipeline leveraging LLMs & Text-to-image models to train vision models with only synthetic data! 🔥 Outperforming SOTAs like DinoV2 & CLIP on real images! SynCLR excels in fine-grained classification &
account_circle
Bryan Catanzaro(@ctnzr) 's Twitter Profile Photo

I worked at Intel on Larrabee applications in 2007. Then I went to NVIDIA to work on ML in 2008. So I was there at both places at that time and I can say:

NVIDIA's dominance didn't come from luck. It came from vision and execution. Which Intel lacked.

account_circle
Anurag Ranjan(@anuragranj) 's Twitter Profile Photo

Introducing FastViT, fast, super-small, general purposes vision transformers running at ~ 1ms on mobile. Code and pre-trained models are live.
github.com/apple/ml-fastv…

account_circle
Hao Li(@HaoLi81) 's Twitter Profile Photo

Introducing “VOODOO 3D: VOlumetric pOrtrait Disentanglement fOr One-shot 3D head reenactment”.
We present a real-time 3D aware one-shot head reenactment method that can generate consistent views from any angle MBZUAI ETH Zürich VinAI Pinscreen URL shorturl.at/gnr17

account_circle
Umar Iqbal(@UmarIqb) 's Twitter Profile Photo

ian Splatting based ian Avatar of Carl Friedrich

Did I say Gaussians a bit too many times? There is more. Happy to present our latest work by
Ye Yuan and Xueting Li

Project page: nvlabs.github.io/GAvatar/

account_circle
Xueting Li(@XueT_Li) 's Twitter Profile Photo

Excited to share our text to animatable avatar project. TL;DR: primitive-based Gaussian representation + Implicit Gaussian attribute fields + normal/mesh extraction from Gaussians.
w/ Ye Yuan Yangyi Huang Shalini De Mello Koki Nagano Jan Kautz @umariqb

account_circle
AK(@_akhaliq) 's Twitter Profile Photo

COLMAP-Free 3D Gaussian Splatting

paper page: huggingface.co/papers/2312.07…

While neural rendering has led to impressive advances in scene reconstruction and novel view synthesis, it relies heavily on accurately pre-computed camera poses. To relax this constraint, multiple efforts

account_circle
Kripasindhu Sarkar(@ksarkar89) 's Twitter Profile Photo

Introducing “LitNeRF”

syntec-research.github.io/LitNeRF/

With a sparse, mobile, and easy-to-build capture setup, LitNeRF performs novel view synthesis and relighting by decomposing scene radiance into interpretable components that are motivated by physically-based rendering.
Google AR & VR

account_circle
Xi(@xiwang1212) 's Twitter Profile Photo

We are organizing the first Rhobin workshop on Reconstruction of Human-Object Interactions at #CVPR2024 with an amazing lineup of speakers. Join us and submit a paper with an extended deadline on March 20.
rhobin-challenge.github.io

We are organizing the first Rhobin workshop on Reconstruction of Human-Object Interactions at @CVPR with an amazing lineup of speakers. Join us and submit a paper with an extended deadline on March 20. rhobin-challenge.github.io
account_circle