Resources
A collection of free resources for you to download, including guides, white papers, e-books, benchmarks, and case studies.
Quick Downloads
Arabic Sentiment Text Similarity
Pre Built Training Data
Arabic Embeddings
Lexical Data Resources
List of Services
Synonym Data Resources
Guides
Morphological Analyzer
Twitter Sentiment Analysis
Understanding Your Results
POS Approach
Create Your Coding Plan
Booleans and POS Tagging
Last Articles
Transforming the Business Landscape with ChatGPT Enterprise: A Detailed Look
ChatGPT Enterprise: OpenAI's Game-Changer for Businesses In a groundbreaking move,...
Harnessing Large Language Models (LLMs) and Artificial Intelligence
Empower Your Business with Question and Answer Datasets Revolutionizing Business...
Bitext’s Free Customer Support Dataset
We have shown in previous posts why Synthetic Training Data is the best way to boost the...
Synthetic Text: The Moment for Enterprise Applications Is Now
Leveraging technology that generates text is coming to the main theaters and Forbes is...
Unstructured Synthetic Text: Beyond Tabular Data
The case for evaluation of NLU platforms Synthetic image and video have proven to be a...
Multilingual Synthetic Training Data For Intent Detection
What Is Synthetic training data? Synthetic Training data is the data that is used to...
How to Change Your Order in 10,000 Different Ways
How Synthetic Text can solve your training and evaluation problems for your virtual...
Why Linguistics for Text Analysis?
In previous posts, we have outlined the crucial role of Machine Learning for Analytics...
How to Make Machine Learning More Effective Using Linguistic Analysis
Text analysis is becoming a pervasive task in many business areas. Machine Learning is...
How to Automate the Generation of Training Data for Conversational Bots
Everything looks promising in the world of bots: big players are pushing platforms to...
Benchmarks
Benchmark on Amazon Lex
Check out how we improved Amazon Lex accuracy by 50% using our training data
Benchmark on Entity Extration
This report compares Bitext’s entity extraction software to 3 other engines (CRFSuite, Stanford and SENNA)
Benchmark on Microsoft LUIS
Increase accuracy on the LUIS platform up to 40% using Bitext training data.
Benchmark on Lemmatization
A brief comparison of stemmers and lemmatizers
Benchmark on Dialogflow
A benchmark based on Dialogflow shows accuracy increases of up to 40%
Case Studies
Chatbot Multilingual Synthetic Data
Deploying a bot capable to engage in successful conversations for retail.
TechCrunch
Download the Case Study about our work with TechCrunch.
Consumer Insights in Minutes
Learn how Movistar saved 75% using Bitext services.
Automotive Industry
Market research leader saves 65% of its time when looking for insights.
Automating Manual Coding
Learn how a market research leader achieved 65% savings using Bitext.
Philz Coffee
Discover how one of our customers began saving time and money.
E-books and Cheat Sheets
How Linguistics Can Improve Chatbots
Solving chatbot issues using linguistics.
Lemmatization vs Stemming
Download practical examples of the two methods in different languages.
Lemmatization and POS Tagging for Deep Learning
How both impact Deep Learning.
Anonymization for GDPR Compliance
Are you ready for the GDPR?
White Papers
Lemmatization for Topic Modeling
Discover how lemmatization impacts Topic Modeling.
How to Solve Chatbot Problems
How to solve 3 common chatbot issues.