Leveraging technology that generates text is coming to the main theaters and Forbes is the most recent one: “The Biggest Opportunity In Generative AI Is Language, Not Images”
Different names are in use: generative AI, as in the article; synthetic text, following the popular term “synthetic data”; NLG (Natural Language Generation) is the most traditional term maybe not so trendy just for that reason.
Synthetic Text, as we will call it, started to follow the path of synthetic image recently. Synthetic image and video have been a huge success in sectors like self-driven cars.
For text, the initial successes have come from tabular data. In structured or tabular text, what’s generated is names (James O’Reilly, Bethesda Pharmaceuticals Inc.) or phrases (Junior Accountant, out of order) properly combined in tables or relational structures.
The next step in synthetic text seems to be unstructured data, where actual full sentences are produced, rather than phrases or names in tables.
Report generation, based on numeric tables, is an intermediate step between generating tabular data and actually generating full sentences from scratch. It’s very popular for sectors like e-commerce, finance or pharma.
At Bitext, we are focused on generating unstructured text for customer service applications and solving problems like:
You can take a look at a sample data in our GitHub Repository
A robust discussion persists within the technical and academic communities about the suitability of LLMs…
Chinese, Southeast Asian, and Arabic names require transliteration, often resulting in inconsistent spellings in Roman…
Customizing Large Language Models in 2 steps via fine-tuning is a very efficient way to…
Discover the advantages of using symbolic approaches over traditional data generation techniques in GenAI. Learn…
In the blog "General Purpose Models vs. Verticalized Enterprise GenAI," the focus is on the…
Bitext introduced the Copilot, a natural language interface that replaces static forms with a conversational,…