- LLMs excel at converting unstructured => structured data
- Will become less expensive over time
- When GPT-4 image support launches publicly, would be a cool integration / fallback for cases where the code-based extraction fails to produce desired results
- In theory works on any website regardless of format / tech
What I think is super compelling is other AI techniques excel at reasoning about structured data and making complex inferences. Using a feedback cycle ensemble model between LLMs and other techniques I think is how the true power of LLMs will be unlocked. For instance many techniques can reason about stuff expressed in RDF, and gpt4 does a pretty good job changing text blobs like web pages into decent and well formed RDF. The output of those techniques are often in RDF, which gpt4 does a good job of ingesting and converting into human consumable format.
I would love for multimodal models to learn generative art process. e.g. processing or houdini, etc. Being able to map programs in those languages to how they look visually would be a great multiplier for generative artists. Then exploring the latent space through text.
- LLMs excel at converting unstructured => structured data
- Will become less expensive over time
- When GPT-4 image support launches publicly, would be a cool integration / fallback for cases where the code-based extraction fails to produce desired results
- In theory works on any website regardless of format / tech