AD Scientific Index 2025
Mostofa Patwary
nVidia Corporation / United States
Engineering & Technology / Computer Science
AD Scientific Index ID: 4414240
Mostofa Patwary's MOST POPULAR ARTICLES
1-) Megatron-LM: Training Multi-Billion Parameter Language Models Using GPU Model Parallelism. M Shoeybi, M Patwary, R Puri, P LeGresley, J Casper, B Catanzaro. arXiv preprint arXiv:1909.08053, 2019. Cited by 1260.
2-) Scalable Bayesian Optimization Using Deep Neural Networks. J Snoek, O Rippel, K Swersky, R Kiros, N Satish, N Sundaram, M Patwary, ... arXiv preprint arXiv:1502.05700, 2015. Cited by 1222.
3-) Deep learning scaling is predictable, empirically. J Hestness, S Narang, N Ardalani, G Diamos, H Jun, H Kianinejad, ... arXiv preprint arXiv:1712.00409, 2017. Cited by 657.
4-) Using DeepSpeed and Megatron to Train Megatron-Turing NLG 530B, A Large-Scale Generative Language Model. S Smith, M Patwary, B Norick, P LeGresley, S Rajbhandari, J Casper, ... arXiv preprint arXiv:2201.11990, 2022. Cited by 738*.
5-) Efficient large-scale language model training on GPU clusters using Megatron-LM. D Narayanan, M Shoeybi, J Casper, P LeGresley, M Patwary, ... Proceedings of the International Conference for High Performance Computing …, 2021. Cited by 593.