Thank you everyone for trying the Galactica model demo. We appreciate the feedback we have received so far from the community, and have paused the demo for now. Our models are available for researchers who want to learn more about the work and reproduce results in the paper.
Galactica is basically GPT-3 for science. It can write whitepapers, reviews, wikipedia pages and code. It knows how to cite and how to write equations. It's kind of big deal 1/ 🧵
🪐 Introducing Galactica. A large language model for science.
Can summarize academic literature, solve math problems, generate Wiki articles, write scientific code, annotate molecules and proteins, and more.
Explore and get weights: galactica.org
Today a 120B model called “Galactica” is open-sourced by @paperswithcode. It’s capable of writing math notations, citations, code, chemical formula, DNA, etc. Here’s why I think Galactica is a huge milestone in open foundation models, scientific automation, and responsible AI: 🧵
The new language model for science galactica.org. Upon few quick tries, it seems to generate professional text in the areas I am familiar with. And 7 years ago we were *joking* about ML writing papers!
🪐 Introducing Galactica. A large language model for science.
Can summarize academic literature, solve math problems, generate Wiki articles, write scientific code, annotate molecules and proteins, and more.
Explore and get weights: galactica.org
This is just the first step on our mission to organize science. And there is a lot more work to be done. We look forward to seeing what the open ML community builds with the model.
Despite not being trained on a general corpus, Galactica outperforms BLOOM and OPT-175B on BIG-bench. Galactica is also significantly less toxic than other language models based on evaluations.
🪐 Introducing Galactica. A large language model for science.
Can summarize academic literature, solve math problems, generate Wiki articles, write scientific code, annotate molecules and proteins, and more.
Explore and get weights: galactica.org
We have explored some of the latest progress, architectural improvements, and emerging new techniques for long-range modeling. We'll continue to keep track of the progress on long-range modeling and LRA. More threads like this coming soon! Follow @paperswithcode for more.
10/10
Besides transformers, other types of models have been tested on LRA. Some of the top performing models are attained by S4 variants which are based on state space models. A recent, improved S4 variant (Liquid-S4) attained competitive results with Mega (current SoTA).
9/10
How well do machine learning models perform on long sequences?
This is a question of high interest in ML research so let’s take a look at what we know so far?
1/10
1.2M Followers 788 FollowingProfessor at NYU & Executive Chairman at AMI Labs.
Ex-Chief AI Scientist at Meta.
Researcher in AI, Machine Learning, Robotics, etc.
ACM Turing Award Laureate.
1.6M Followers 1K FollowingCo-Founder of Coursera; Stanford CS adjunct faculty. Former head of Baidu AI Group/Google Brain. #ai #machinelearning, #deeplearning #MOOCs
804K Followers 323 FollowingTogether with the AI community, we are pushing the boundaries of what’s possible through open science to create a more connected world.
305K Followers 1K FollowingBuilding new things @thinkymachines. Also dabble in robotics at NYU. Cofounded @PyTorch. AI is delicious when it is accessible and open-source.
86K Followers 10K FollowingOn X we surface the AI research that matters and explain the ideas behind it. In the newsletter, we connect the dots between AI’s past, present, and future ⬇️
30K Followers 614 FollowingLLMs and retrieval by day and other genres of AI when I get the chance
🧪 Senior AI Eng @NVIDIAAI
🏫 @fastdotai trained DL Eng
📝 https://t.co/By87iXx5Pu
4 Followers 47 FollowingToying with AI - and their humans - for fun!
If your AI is misbehaving and you want to know why, give it the full inquisition.
LLM INQUISITOR • link in bio.
2 Followers 279 FollowingData & AI Leader helping businesses turn data into competitive advantage. Trusted data. AI at scale. Business impact. #DataAI #EnterpriseAI #TrustedData
1 Followers 57 FollowingCS undergrad @SRM chennai
Building AI from first principles. No libraries until I understand the math. Targeting top AI PhD 2028. Documenting the real grind
305K Followers 1K FollowingBuilding new things @thinkymachines. Also dabble in robotics at NYU. Cofounded @PyTorch. AI is delicious when it is accessible and open-source.
11K Followers 1K FollowingCo-founder and CEO @GenReasoning. Previously lots of other things like: reasoning lead Meta AI, Llama 3/2, Galactica, Papers with Code.