NEWS
Institutional Subscription: Comprehensive Analyses to Enhance Your Global and Local Impact
New Feature: Compare Your Institution with the Previous Year
Find a Professional: Explore Experts Across 197 Disciplines in 221 Countries!
Find a Professional
Print Your Certificate
New! Young University / Institution Rankings 2025
New! Art & Humanities Rankings 2025
New! Social Sciences and Humanities Rankings 2025
Highly Cited Researchers 2025
AD
Scientific Index 2025
Scientist Rankings
University Rankings
Subject Rankings
Country Rankings
Login
Register & Pricing
insights
H-Index Rankings
insights
i10 Productivity Rankings
format_list_numbered
Citation Rankings
subject
University Subject Rankings
school
Young Universities
format_list_numbered
Top 100 Scientists
format_quote
Top 100 Institutions
format_quote
Compare & Choose
local_fire_department
Country Reports
person
Find a Professional
Shuming Ma
Microsoft Research - - / United States
Engineering & Technology / Computer Science
AD Scientific Index ID: 4382351
Registration, Add Profile,
Premium Membership
Print Your Certificate
Ranking &
Analysis
Job
Experiences (0)
Education
Information (0)
Published Books (0)
Book Chapters (0)
Articles (0)
Presentations (0)
Lessons (0)
Projects (0)
Co-Authors
Subject Leaders
Editorship, Referee &
Scientific Board (0 )
Patents /
Designs (0)
Academic Grants
& Awards (0)
Artistic
Activities (0)
Certificate / Course
/ Trainings (0)
Association &
Society Memberships (0)
Contact, Office
& Social Media
person_outline
Shuming Ma's MOST POPULAR ARTICLES
1-)
Kosmos-2: Grounding multimodal large language models to the worldZ Peng, W Wang, L Dong, Y Hao, S Huang, S Ma, F WeiarXiv preprint arXiv:2306.14824, 20235722023
2-)
SGM: sequence generation model for multi-label classificationP Yang, X Sun, W Li, S Ma, W Wu, H WangarXiv preprint arXiv:1806.04822, 20184952018
3-)
Language is not all you need: Aligning perception with language modelsS Huang, L Dong, W Wang, Y Hao, S Singhal, S Ma, T Lv, L Cui, ...Advances in Neural Information Processing Systems 36, 20243312024
4-)
Why can gpt learn in-context? language models implicitly perform gradient descent as meta-optimizersD Dai, Y Sun, L Dong, Y Hao, S Ma, Z Sui, F WeiarXiv preprint arXiv:2212.10559, 20223772022
5-)
Retentive network: A successor to transformer for large language modelsY Sun, L Dong, S Huang, S Ma, Y Xia, J Xue, J Wang, F WeiarXiv preprint arXiv:2307.08621, 20233202023
ARTICLES
Add your articles
We use cookies to personalize our website and offer you a better experience. If you accept cookies, we can offer you special services.
Cookie Policy
Accept