NLP Newsletter

Share this post

🥇Top ML Papers of the Week

nlp.elvissaravia.com

Discover more from NLP Newsletter

The NLP newsletter covers the latest trending natural language processing (NLP) and machine learning (ML) news, projects, resources, and research papers.
Over 10,000 subscribers
Continue reading
Sign in

🥇Top ML Papers of the Week

The top ML Papers of the Week (April 3 - April 9)

elvis
Apr 9, 2023
12
Share this post

🥇Top ML Papers of the Week

nlp.elvissaravia.com
Share

1). Segment Anything Model - presents a set of resources to establish foundational models for image segmentation; releases the largest segmentation dataset with over 1 billion masks on 11M licensed images; the model’s zero-shot performance is competitive with or even superior to fully supervised results. (paper)

Twitter avatar for @MetaAI
Meta AI @MetaAI
Today we're releasing the Segment Anything Model (SAM) — a step toward the first foundation model for image segmentation. SAM is capable of one-click segmentation of any object from any photo or video + zero-shot transfer to other segmentation tasks ➡️ bit.ly/433YuBI
1:01 PM ∙ Apr 5, 2023
6,758Likes1,792Retweets

2). Instruction Tuning with GPT-4 - presents GPT-4-LLM, a "first attempt" to use GPT-4 to generate instruction-following data for LLM fine-tuning; the dataset is released and includes 52K unique English and Chinese instruction-following data; the dataset is used to instruction-tune LLaMA models which leads to superior zero-shot performance on new tasks. (paper)

Twitter avatar for @omarsar0
elvis @omarsar0
Okay, this is awesome! Instruction Tuning with GPT-4! This paper presents GPT-4-LLM, a "first attempt" to use GPT-4 to generate instruction-following data for LLM fine-tuning. The dataset is released and includes 52K unique English and Chinese instruction-following data.… https://t.co/OnFyePSrqR
Image
1:01 AM ∙ Apr 7, 2023
1,037Likes218Retweets

3). 8 Things to Know about LLMs - discusses important considerations regarding the capabilities and limitations of LLMs. (paper)

Twitter avatar for @sleepinyourhat
Sam Bowman @sleepinyourhat
I’m sharing a draft of a slightly-opinionated survey paper I’ve been working on for the last couple of months. It's meant for a broad audience—not just LLM researchers. (🧵)
A paper header for "Eight things to know about large language models" by Sam Bowman.
7:47 PM ∙ Apr 2, 2023
1,342Likes259Retweets

4). A Survey of LLMs - a new 50 pages survey on large language models. (paper)

Twitter avatar for @omarsar0
elvis @omarsar0
A Survey of LLMs A new 50 pages survey on large language models just dropped on arXiv. arxiv.org/abs/2303.18223
Image
1:19 AM ∙ Apr 3, 2023
1,579Likes389Retweets

5). Baize - an open-source chat model fine-tuned with LoRA. Leverages 100K dialogs generated from ChatGPT chatting with itself; it releases the dialogs along with 7B, 13B, and 30B parameter models. (paper)

Twitter avatar for @XuCanwen
Canwen Xu @XuCanwen
We are announcing Baize, an open-source chat model trained with ChatGPT self-chat data. We are releasing 150k high-quality dialogs with 7B, 13B and 30B models. ArXiv: arxiv.org/abs/2304.01196 Demo: huggingface.co/spaces/project… Github: github.com/project-baize/…
Image
2:12 AM ∙ Apr 4, 2023
733Likes134Retweets

6). MACHIAVELLI - a new benchmark of 134 text-based Choose-Your-Own-Adventure games to evaluate the capabilities and unethical behaviors of LLMs. (paper)

Twitter avatar for @DanHendrycks
Dan Hendrycks @DanHendrycks
Do models like GPT-4 behave safely when given the ability to act? We develop the Machiavelli benchmark to measure deception, power-seeking tendencies, and other unethical behaviors in complex interactive environments that simulate the real world. Paper: arxiv.org/abs/2304.03279
Image
4:08 PM ∙ Apr 7, 2023
754Likes170Retweets

7). Better Language Models of Code through Self-Improvement - generates pseudo data from knowledge gained through pre-training and fine-tuning; adds the data to the training dataset for the next step; results show that different frameworks can be improved in performance using code-related generation tasks. (paper)

Twitter avatar for @johnjnay
John Nay @johnjnay
Simple Self-Improvement of Code LLMs 1) Pre-train & Fine-tune code LLM, gaining knowledge 2) LLM then generates pseudo outputs 3) Add that to original data & train for next epoch Significantly improves code summarization & code generation performance arxiv.org/abs/2304.01228
Image
1:27 AM ∙ Apr 6, 2023
880Likes166Retweets

8). Summary of ChatGPT/GPT-4 Research - an overview of applications of ChatGPT and GPT-4; the analysis is done on 194 relevant papers and discusses capabilities, limitations, concerns, and more. (paper)

Twitter avatar for @omarsar0
elvis @omarsar0
Good overview of applications of ChatGPT and GPT-4. The analysis is done on 194 relevant papers and discusses capabilities, limitations, concerns, and more. A great read for AI developers and researchers. arxiv.org/abs/2304.01852
Image
2:12 PM ∙ Apr 5, 2023
1,348Likes311Retweets

9). Pythia - a suite for analyzing LLMs across training and scaling; includes 16 LLMs trained on public data and ranging in size from 70M to 12B parameters. (paper)

Twitter avatar for @BlancheMinerva
Stella Rose Biderman @BlancheMinerva
Have you ever wanted to do an experiment on LLMs and found that none of the existing model suites met your needs? At @AiEleuther we got tired of this happening and so designed a model suite that centers enabling scientific research as its primary goal arxiv.org/abs/2304.01373
12:34 AM ∙ Apr 5, 2023
809Likes168Retweets

10). SegGPT - unifies segmentation tasks into a generalist model through an in-context framework that supports different kinds of data. (paper)

Twitter avatar for @_akhaliq
AK @_akhaliq
SegGPT: Segmenting Everything In Context abs: arxiv.org/abs/2304.03284 github: github.com/baaivision/Pai… @Gradio demo: dev.ssi.plus:43533
1:19 AM ∙ Apr 7, 2023
498Likes126Retweets
12
Share this post

🥇Top ML Papers of the Week

nlp.elvissaravia.com
Share
Comments
Top
New
Community

No posts

Ready for more?

© 2023 elvis
Privacy ∙ Terms ∙ Collection notice
Start WritingGet the app
Substack is the home for great writing