|
7 | 7 | Some ideas:
|
8 | 8 |
|
9 | 9 | - [The History of Open-Source LLMs: Better Base Models (part 2)](https://cameronrwolfe.substack.com/p/the-history-of-open-source-llms-better) (LLaMA, MPT, Falcon, LLaMA-2)
|
10 |
| -- [Papers I've read this week, Mixture of Experts edition](https://finbarrtimbers.substack.com/p/papers-ive-read-this-week-mixture) (conditional routing models) |
| 10 | +- [Papers I've read this week, Mixture of Experts edition](https://www.artfintel.com/p/papers-ive-read-this-week-mixture) (conditional routing models) |
11 | 11 | - [AI and Memory Wall](https://medium.com/riselab/ai-and-memory-wall-2cb4265cb0b8)
|
12 | 12 | - https://github.com/imaurer/awesome-decentralized-llm
|
13 | 13 | - https://github.com/huggingface/transformers/blob/main/awesome-transformers.md
|
@@ -45,7 +45,7 @@ ChatGPT is sensitive to tweaks to the input phrasing or attempting the same prom
|
45 | 45 |
|
46 | 46 | #### GPT-4
|
47 | 47 |
|
48 |
| -[GPT-4 is a language model developed by OpenAI](https://openai.com/research/gpt-4). It is the successor to GPT-3 and has been made publicly available via the paid chatbot product ChatGPT Plus and via OpenAI's API. It is a large multimodal model that can accept image and text inputs and emit text outputs, [though multimodal capabilities aren't released to the public yet](https://analyticsindiamag.com/what-happened-to-multimodal-gpt-4). It exhibits human-level performance on various professional and academic benchmarks and can follow complex instructions in natural language and solve difficult problems with accuracy. It can handle input prompts of up to 32k tokens, which is a significant increase from GPT-3.5's 4k tokens. It can solve complex mathematical and scientific problems beyond the capabilities of GPT-3.5, such as advanced calculus problems or simulating chemical reactions [more effectively than its predecessor](https://www.searchenginejournal.com/gpt-4-vs-gpt-3-5/482463). It is more reliable, creative, and able to handle much more nuanced instructions than GPT-3.5. |
| 48 | +[GPT-4 is a language model developed by OpenAI](https://openai.com/research/gpt-4). It is the successor to GPT-3 and has been made publicly available via the paid chatbot product ChatGPT Plus and via OpenAI's API. It is a large multimodal model that can accept image and text inputs and emit text outputs, [though multimodal capabilities aren't released to the public yet](http://analyticsindiamag.com/what-happened-to-multimodal-gpt-4/). It exhibits human-level performance on various professional and academic benchmarks and can follow complex instructions in natural language and solve difficult problems with accuracy. It can handle input prompts of up to 32k tokens, which is a significant increase from GPT-3.5's 4k tokens. It can solve complex mathematical and scientific problems beyond the capabilities of GPT-3.5, such as advanced calculus problems or simulating chemical reactions [more effectively than its predecessor](https://www.searchenginejournal.com/gpt-4-vs-gpt-3-5/482463). It is more reliable, creative, and able to handle much more nuanced instructions than GPT-3.5. |
49 | 49 |
|
50 | 50 | Despite its capabilities, [GPT-4 still sometimes "hallucinates"](https://www.reddit.com/r/ChatGPT/comments/12fmrcd/examples_of_gpt4_hallucination) facts and makes reasoning errors.
|
51 | 51 |
|
@@ -111,7 +111,7 @@ Before [](#chatgpt)'s (GPT-3.5) public release we had [GPT-3](https://en.wikiped
|
111 | 111 | There has been a few visible marks across modalities of AI models, highly catalysing growth of open source:
|
112 | 112 |
|
113 | 113 | - [Meta AI launches LLaMA](https://ai.meta.com/blog/large-language-model-llama-meta-ai), open sourcing the code but not the weights.
|
114 |
| -- [StabilityAI released Stable Diffusion](https://stability.ai/blog/stable-diffusion-announcement). |
| 114 | +- [StabilityAI released Stable Diffusion](https://stability.ai/news/stable-diffusion-announcement). |
115 | 115 |
|
116 | 116 | #### [Stable Diffusion](https://registry.premai.io/detail.html?service=stable-diffusion-1-5)
|
117 | 117 |
|
@@ -371,15 +371,15 @@ Till now we've mostly been looking at LLMs in general and not other models, let'
|
371 | 371 |
|
372 | 372 | #### [Stable Diffusion XL](https://registry.premai.io/detail.html?service=stable-diffusion-xl-with-refiner)
|
373 | 373 |
|
374 |
| -[StabilityAI released Stable Diffusion XL 1.0 (SDXL)](https://stability.ai/blog/stable-diffusion-sdxl-1-announcement) models on 26th July, being current State of the Art for text-to-image and image-to-image generation open sourced models. They released a [base model](https://huggingface.co/stabilityai/stable-diffusion-xl-base-1.0) and a [refinement model](https://huggingface.co/stabilityai/stable-diffusion-xl-refiner-1.0) which is used to improve the visual fidelity of samples generated by SDXL. |
| 374 | +[StabilityAI released Stable Diffusion XL 1.0 (SDXL)](https://stability.ai/news/stable-diffusion-sdxl-1-announcement) models on 26th July, being current State of the Art for text-to-image and image-to-image generation open sourced models. They released a [base model](https://huggingface.co/stabilityai/stable-diffusion-xl-base-1.0) and a [refinement model](https://huggingface.co/stabilityai/stable-diffusion-xl-refiner-1.0) which is used to improve the visual fidelity of samples generated by SDXL. |
375 | 375 |
|
376 | 376 | Few months back they released Stable-diffusion-xl {cite}`podell2023sdxl` [base](https://huggingface.co/stabilityai/stable-diffusion-xl-base-0.9) and [refinement](https://huggingface.co/stabilityai/stable-diffusion-xl-refiner-0.9) models versioned as 0.9, where license permitting only research purpose usages.
|
377 | 377 |
|
378 | 378 | SDXL consistently surpasses all previous versions of Stable Diffusion models by a significant margin:
|
379 | 379 |
|
380 | 380 | ```{figure} https://static.premai.io/book/models_sdxl-winrate.png
|
381 | 381 | :width: 60%
|
382 |
| -[SDXL Winrate](https://stability.ai/blog/stable-diffusion-sdxl-1-announcement) |
| 382 | +[SDXL Winrate](https://stability.ai/news/stable-diffusion-sdxl-1-announcement) |
383 | 383 | ```
|
384 | 384 |
|
385 | 385 | ##### Uniqueness
|
|
0 commit comments