1). BloombergGPT - a new 50B parameter large language model for finance. Claims the largest domain-specific dataset yet with 363 billion tokens... further augmented with 345 billion tokens from general-purpose datasets; outperforms existing models on financial tasks while not sacrificing performance on general LLM benchmarks. (paper)
![Twitter avatar for @omarsar0](https://substackcdn.com/image/twitter_name/w_96/omarsar0.jpg)
![Image](https://substackcdn.com/image/fetch/w_600,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fpbs.substack.com%2Fmedia%2FFsjKUWgXgAA-7WW.jpg)
2). ALOHA - a low-cost system that performs end-to-end imitation learning from real demonstrations; also presents an algorithm called Action Chunking with Transformers to learn a generative model that allows a robot to learn difficult tasks in the real world. (paper | code)
![Twitter avatar for @tonyzzhao](https://substackcdn.com/image/twitter_name/w_96/tonyzzhao.jpg)
3). HuggingGPT - a system that leverages LLMs like ChatGPT to conduct task planning, select models and act as a controller to execute subtasks and summarize responses according to execution results. (paper)
![Twitter avatar for @johnjnay](https://substackcdn.com/image/twitter_name/w_96/johnjnay.jpg)
![Image](https://substackcdn.com/image/fetch/w_600,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fpbs.substack.com%2Fmedia%2FFsgpjBNWIAcpnAR.jpg)
4). ChatDoctor - a medical chat model fine-tuned on LLaMA using medical domain knowledge. Collects data on around 700 diseases and generated 5K doctor-patient conversations to finetune the LLM. (paper | code)
![Twitter avatar for @omarsar0](https://substackcdn.com/image/twitter_name/w_96/omarsar0.jpg)
![Image](https://substackcdn.com/image/fetch/w_600,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fpbs.substack.com%2Fmedia%2FFsRQ9snWYAQ5kea.png)
5). LLaMA-Adapter - a lightweight adaption method to efficiently fine-tune LLaMA into an instruction-following model; generates responses comparable to Alpaca with fully fine-tuned 7B parameter; it’s also extended for multi-modal input support. (paper | code)
![Twitter avatar for @rasbt](https://substackcdn.com/image/twitter_name/w_96/rasbt.jpg)
![Image](https://substackcdn.com/image/fetch/w_600,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fpbs.substack.com%2Fmedia%2FFsefD4NaYAA4kFv.jpg)
6). ChatGPT Outperforms Crowd-Workers for Text-Annotation Tasks - demonstrates that ChatGPT can outperform crowd-workers for several annotation tasks such as relevance, topics, and frames detection; besides better zero-shot accuracy, the per-annotation cost of ChatGPT is less 20 times cheaper than MTurk. (paper)
![Twitter avatar for @AlphaSignalAI](https://substackcdn.com/image/twitter_name/w_96/AlphaSignalAI.jpg)
![Image](https://substackcdn.com/image/fetch/w_600,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fpbs.substack.com%2Fmedia%2FFsfEdRQWYAIJPAR.png)
7). LLMs for Computer Tasks - shows that a pre-trained LLM agent can execute computer tasks using a simple prompting scheme where the agent recursively criticizes and improves its outputs. (paper)
![Twitter avatar for @arankomatsuzaki](https://substackcdn.com/image/twitter_name/w_96/arankomatsuzaki.jpg)
![Image](https://substackcdn.com/image/fetch/w_600,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fpbs.substack.com%2Fmedia%2FFsgq-d2aUAE7lhP.png)
8). Dialog-Enabled Resolving Agents (DERA) - a paradigm to enhance large language model completions by allowing models to communicate feedback and iteratively improve output; DERA outperforms base GPT-4 on clinically-focused tasks. (paper)
![Twitter avatar for @johnjnay](https://substackcdn.com/image/twitter_name/w_96/johnjnay.jpg)
![Image](https://substackcdn.com/image/fetch/w_600,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fpbs.substack.com%2Fmedia%2FFsolbUQWAAEWUuH.jpg)
9). Natural Selection Favors AIs over Humans - discusses why AI systems will become more fit than humans and the potential dangers and risks involved, including ways to mitigate them. (paper)
![Twitter avatar for @DanHendrycks](https://substackcdn.com/image/twitter_name/w_96/DanHendrycks.jpg)
![Forces that fuel selfishness and erode safety.](https://substackcdn.com/image/fetch/w_600,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fpbs.substack.com%2Fmedia%2FFsZdwmJaYAYgRBp.jpg)
![Darwinism and evolution apply to more than just biological organisms and generalized across different domains.](https://substackcdn.com/image/fetch/w_600,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fpbs.substack.com%2Fmedia%2FFsZd27CaAAELp5_.jpg)
10). ML for Partial Differential Equations - a review examining avenues of partial differential equations research advanced by machine learning. (paper)
![Twitter avatar for @DynamicsSIAM](https://substackcdn.com/image/twitter_name/w_96/DynamicsSIAM.jpg)