Update blog post

This commit is contained in:
Jake R 2024-08-17 00:14:12 -07:00
parent 9acf42fb63
commit 690b6a3d7d
2 changed files with 12 additions and 2 deletions

View File

@ -35,9 +35,19 @@ The most impressive leap forward has been in text generation, but that being sai
In the past we had to do with specially trained models that typically generated text with a lot of errors, usually smashing together characters. In the past we had to do with specially trained models that typically generated text with a lot of errors, usually smashing together characters.
The current state of text generation is far better, the text that is specified in the prompt is treated specially and is correct more than it is incorrect. The current state of text generation is far better, the text that is specified in the prompt is treated specially and is correct more than it is incorrect.
Despite this, background text (anything that not generated as a result of explicitly asking for it in the prompt) frequently has some of the original issues we faced. Despite this, background text (anything that not generated as a result of explicitly asking for it in the prompt) frequently has some of the original issues we faced, but that can be worked around.
![img alt](./tobeeandmaymay.png) ![img alt](./badtext.png)
Other things that are better in this model include hands and feet - previously generating anything with exposed hands or feet would often end up looking more like a horror illustration, but now they look more humanlike than not.
There are still issues with multiple hands showing up where there should only be one, but there is a large improvement in getting the correct number of digits, and making them curl in the correct ways.
Overall, a great step forward!
![img alt](./apple.png)
One thing that hasn't seen much improvement is generating photos of eggs. For some reason whole peeled eggs haven't come out very well in my experience.
![img alt](./eggs.png)
Overall, the steps are positive, and the release of new tools means we have new limits to push the boundaries of. Overall, the steps are positive, and the release of new tools means we have new limits to push the boundaries of.

Binary file not shown.

Before

Width:  |  Height:  |  Size: 1.1 MiB