Is The Visual Bot the New Frontier in the History of Chatbots?

Contents

The Amazing Chatbot Discovery
The History of Chatbots
Chatbots Today
Why is a Visual Bot Key to Successful CS?
Humans have eyes and bots don’t
Early Stage Visual Bot Has Arrived
The advantages of bots with eyes
Chatbot Development Methodology: The Path to Evolution
Next Step: Teaching the Visual Bot
The Challenge of Developing a Data-Bank
A Visual Chatbot = The Smart Investment for Business Owners

The world may be enamored with bots at the moment, but they’ve actually been around for quite some time. In the following article we will explore the topics of how chatbots are evolving and what is the future of chatbots, with a focus on the visual bot.

The Amazing Chatbot Discovery

Chatbot discovery began in 1966 with ELIZA – the first chatbot created. She answered some very simple decision tree questions. It operated by recognizing keywords or phrases and responding from a bank of pre-programmed responses, based on these keywords.

1972 saw the development of PARRY, a bot that tried to model the behavior of a paranoid schizophrenic. This was followed by RACTER in 1983, a storytelling bot, designed to entertain and amuse users. In 2005, JABBERWACKY was created which tried to mimic human conversations.

However, only recently the chatbot discovery has gained momentum in the world of technology.

The History of Chatbots

The first chatbots created for the finance industry were already used more than a decade ago, automatically buying and selling equities based on key market indicators. It was a novel concept at the time, but the technology is now ubiquitous in the industry, with the financial robo-advice market projected to grow to $7 trillion by 2025, according to CNBC.

Today’s bots have evolved to become much more capable than their ancestors. Conversational AI platforms, known as chatbots, automate and scale one-on-one conversations. They have a large number of use cases that extend well beyond the finance industry, into the sales, marketing and customer support domains.

What’s more, they’re continuing to evolve from their predecessors; just a few years ago, the notion that a bot could answer a text message or suggest a product for purchase was revolutionary. This is now commonplace, with chatbots a near-standard Help feature on websites and other online platforms.

The next evolutionary stage in bot technology should have entrepreneurs salivating.

What if chatbots had eyes?

Chatbots Today

Businesses utilizing chatbot technology today have likely done so for two main reasons: to enhance the customer experience and save money. Juniper Research projects bots will cut business expenses by as much as $8 billion by 2022. Without a doubt, this technology can make a huge impact for both SMBs and enterprises.

Yet chatbots today still come with a multitude of problems for entrepreneurs, especially as it pertains to customer experience. Sometimes chatbots fail to deliver user experiences that are as seamless, efficient, and pleasant as hoped. And often the reason why is simple: chatbots cannot see.

Why is a Visual Bot Key to Successful CS?

When a customer interacts with a chatbot, the success of the communication is highly dependent on the customer’s ability to accurately describe – and type – the issue at hand. The chatbot’s ability to interpret the customer’s phrases, nuances and complex reality is limited as well. This carries over into the chatbot’s ability to help the customer solve the problem. The bot’s responses are even further limited by a specific pool of words and texts.

Humans have eyes and bots don’t

According to a PointSource survey, 59% of customers say bots aren’t getting the job done, because customers are more than text. They are emotional, visual creatures who communicate with body language and subtle cues. Humans use their eyes and brains to see and visually sense the world around them. That’s why we’ve seen a huge spike in visual search engines, video tutorials, and more visual customer assistance.

For business owners, the difference between visually walking a customer through resolution steps and typing words about mechanical actions is immense. Visual engagement reduces frustration and empowers the customer rather than escalates dissatisfaction.

Early Stage Visual Bot Has Arrived

Computer vision AI is already being utilized in a wide range of applications, for example:

It recognizes faces and smiles in cameras.
It helps self-driving cars read traffic signs and avoid pedestrians.
It allows factory robots to monitor problems on the production line.

In customer engagement, it will help the visual bot see the problem, as a virtual assistant. The implications for business owners who incorporate AI in chatbot development methodology, are immense.

‘These jeans look great on you!’ advises the visual bot stylist

The e-commerce industry, and the fashion industry in particular, has been among the early adopters of visual bots. Levi’s AI-powered virtual stylist can advise the shopper about products or styles most suited to them.

The advantages of bots with eyes

If brands can use visual bots to “see” and understand their customers on an individual level, then they can truly up their efforts at personalized sales, marketing, and service.

These are exciting developments, but there are many more use cases along the customer journey that still remain untapped.

Chatbot Development Methodology: The Path to Evolution

For mass adoption of visual bots, vendors and enterprises are required to adopt the core technologies that support its development: computer vision AI and Augmented Reality (AR).

This chatbot development methodology evolution will encompass a number of phases.

Phase One: Text to Image

At the early stage, the visual chatbot receives text-based inputs from the customer, interprets the input and retrieves a relevant visual from a knowledge base or a search engine. This can be a reply for a specific request, such as, “Please show me the room in the hotel I’ve reserved.” It can also be a reply to a general request, for instance, “How do I program my coffee machine?”

Phase Two: Image to Image/ Text

At this more advanced phase in the methodology of chatbots, the bots apply computer vision AI to process the input received. The visual bots then reply either with words or visuals.

An example of this stage in chatbot development methodology would be when museum-goers snap a photo of an item of interest and a museum chatbot recognizes the item. The museum’s visual bot then shares more details about the artist and the item’s background.

Phase Three: Image to Smart Image

At this stage, the bot applies computer vision upon processing the input as well as when processing the reply. For example, the customer contacts his insurance company following a car accident. The visual bot then does the following:

asks the customer to upload images of the vehicle
identifies the damaged areas
detects the extent of the damage, and
estimates the potential cost of repairs.

The visual bot has acquired information that speeds up the claim cycle and saves money for the business.

Insurance companies have been focused on developing these capabilities as part of their chatbot development methodology. Computer vision has added value to the user experience and has resulted in the maturing of ‘virtual adjuster’ bots.

Phase Four: Interactive Visual Conversation

The most advanced stage in the chatbot discovery evolution is when the chatbot can switch to real-time video mode, enabling the customer to show the issue and receive interactive AR guidance. This advanced visual bot can perform complicated tasks while guiding customers and can also provide feedback and correct them in an interactive manner.

For example, when unboxing a new router, a ‘virtual technician’ recognizes the cables and inputs, and guides the customer using AR through the installation process.

Next Step: Teaching the Visual Bot

Advanced visual bots harness deep learning technologies to recognize and analyze visual images to the highest degree of accuracy. Deep learning requires the creation of a massive data set in order to effectively train the model. In order for the visual bot to correctly identify vehicular damage, as in the above example, the bot must have had the opportunity to process tens of thousands of images of each damage type.

To identify a coffee machine’s specific model, the visual bot needs to have processed a massive amount of images of each specific model; in various lighting, angles and positions.

The Challenge of Developing a Data-Bank

Building these massive datasets is extremely time consuming and labor-intensive, and simply out of scope for many enterprises and vendors. It will be time intensive. It will be costly. But when confronted with the question of what is the future of chatbots, we can confidently answer that it absolutely will be done. Just as with all technology, the price of developing visual chatbots will drop over time and become affordable for all sorts of businesses.

A Visual Chatbot = The Smart Investment for Business Owners

Chatbots today are quickly becoming an integral part of the user experience, and as long as humans are involved it is clear what is the future of chatbots: the visual bot. The transformation to bots with eyes will be an evolutionary process, where gradually the bots move from traditional text-based understanding to image processing, and eventually to full visual interactions.

For entrepreneurs and business owners, the potential upside is far-reaching, including for example:

improved customer experience
loyalty and retention
lower costs, and
more generated revenue thanks to personalized sales and service.

To read more about visual bots today , check out this article about how visual bots enable self-service solutions.

Ronen Rozenberg, AI & Image Processing Manager

With 14 years dedicated to developing algorithms and research, Ronen Rozenberg leads teams in the fields of computer vision AI to design and implement Computer Vision, Machine Learning and Deep Learning.