Cognitive Services gets better at handling rich information

Blog|by Mary Branscombe|14 May 2019

The Azure Cognitive Services APIs and SDKs are a straightforward way to use pre-trained but often customisable machine learning models for everything from image recognition to speech synthesis. Developers can add them to a website or an app or call them as part of a distributed machine learning pipeline in Spark ML, making them very flexible components whatever your level of data science expertise. At the Build 2019 Conference, Microsoft announced that over 1.3 million developers are already using Cognitive Services.

Four existing services came out of preview at Build 2019: Neural Text-To-Speech for generating voices lifelike enough to use for large amounts of text like reading an audiobook; Computer Vision Read for recognising text and handwriting in images; Named Entity Recognition in Text Analytics for finding the names of people, places, products and organisations mentioned in text; and Cognitive Search in Azure Search, for exploring and finding insights in large and complex sets of unstructured and semi-structured content like images and transcripts.

The QnA Maker tool for turning FAQs into an interactive chatbot can now handle multi-turn dialogs and the Language Understanding Service, LUIS, can extract multiple intents from a long sentence, so if a user types, say, “send a large pepperoni pizza over to my home after 9pm this evening, but make sure it is a deep dish with the new spicy pepperoni,” LUIS can more easily translate what they want into the right pizza order.

Microsoft also added a new category of services, Decision (in addition to Vision, Speech, Language and Search). Decision includes the existing Content Moderator tool for reviewing text and images that users contribute that might not be the kind of content you want to include (whether it’s offensive, age-inappropriate or about your competitors) and the recently released Anomaly Detector service.

You can run anomaly detection to monitor time series data looking for values that aren’t what they should be, in the cloud if you’re monitoring a website or online business process for fraud or worrying trends, or locally in a Docker container. That’s ideal if you want to check whether a really unusual figure is an actual disaster like fire or flood, or if the sensor on an IoT device might be failing.

There’s also a brand new service in preview, Personalizer – for picking the best thing to show to users based on their behaviour in real time, whether that’s ads, shopping recommends, news headlines, filters to apply to a photo, the next best action to complete a business process, or the best answer for a chatbot to give. You give the Rank API a list of the possible content your app could show and information about your users; the API can rank the actions with its current model or explore new choices that might change the model. Rank returns a single action that your app shows to users. Based on whether they click the link, buy the product or choose a completely different photo filter, you return a score with the Reward API that’s used to update the model.

Microsoft Build 2019: Cognitive Services gets better at handling rich information — *Source: Microsoft*
**How the Rank and Reward APIs work in the new Cognitive Services Personalizer**

Personalizer uses reinforcement learning, a relatively new and advanced machine learning technique that technology companies like Google have been using internally. Microsoft may be the first to offer it as a commercial service for developers to take advantage of (the similar Custom Decision Service has been available as an experimental service through Cognitive Services Labs).

Another of the Cognitive Services Labs experiments has also graduated to a full service. Ink Recognizer converts handwriting into text and drawn shapes into smooth polygons; developers can use it in an app to recognise handwriting, tidy up drawings so they’re easier to understand, make handwritten lists line up neatly on the page, make ink searchable without converting it to text, or making it easy to fill out forms with fields – even down to only recognising digits when someone is writing in a file that’s supposed to be for phone numbers. Because the digital ink is stored as JSON, ink recognition works on mobile and in web apps as well as in Windows apps (this is the same ink recognition that’s been in PowerPoint since 2018), and it’s available for 63 languages and locales initially.

If you’re dealing with forms that have already been filled out on paper and then scanned (something that many organisations have in their archives), the new Form Recognizer API can extract text from fields and tables in documents (stored as PDFs, PNG or JPEG) and store them as key value pairs to turn them into structured data. Because most organisations have their own specific form layouts, you can use sample documents (as few as five) to customise the image recognition model. Because it’s a REST API you can then pipe those recognised forms into existing search indexes and business workflows. (For less structured document layouts like a menu or the order of service for a wedding, try the Computer Vision Read API instead.)

And if you’re dealing with forms that have sensitive or regulated data that you might have issues taking to the cloud, you can run that trained model locally in a container. Initially it’s only available for English, and in the West US and West Europe Azure regions, but more languages and regions will be supported soon.

Organisations using Teams and Azure Streams can already get transcripts of meetings and videos. That same transcription of meetings and conversations is now available as a real-time Conversation Transcription feature in Speech Services. Initially it needs a circular seven microphone array like the one Microsoft used to demo meeting transcription at Build in 2018 (which is available as a development kit as part of the Microsoft Speech Services SDK) and is suitable for small groups of people (especially as you need to train it with sample recordings for each person speaking and create user profiles for them). You can give the service multiple custom vocabularies, so it can more accurately recognise words from your industry and your company – and teaching it about, say, health terms won’t compromise its accuracy recognising terms from the transport industry if what your business does is transport pharmaceuticals.

Microsoft is planning to expand conversation transcription to larger groups, and to take advantage of existing microphones in a video conferencing-equipped room. And a demo at Build 2019 showed it working with the microphones on the laptops and phones of meeting attendees and turning those into an array microphone, making it easier to work out who is speaking and putting the right name on their portion of the conversation. This is one of the hardest problems in speech recognition, so those developments may take a little time, but the state of the art here is moving very rapidly.

The members of the Grey Matter Managed Services team are experts at understanding and managing Azure-related projects. If you need their technical advice or wish to discuss Azure and Visual Studio options and costs, call them on 01364 654100 or complete the form below.

14 May 2019 | Blog

Contact Grey Matter

If you have any questions or want some extra information, complete the form below and one of the team will be in touch ASAP. If you have a specific use case, please let us know and we'll help you find the right solution faster.

By submitting this form you are agreeing to our Privacy Policy and Website Terms of Use.

Mary Branscombe

Mary Branscombe is a freelance tech journalist. Mary has been a technology writer for nearly two decades, covering everything from early versions of Windows and Office to the first smartphones, the arrival of the web and most things in between.

Cookie	Duration	Description
cookielawinfo-checkbox-advertisement	1 year	Set by the GDPR Cookie Consent plugin, this cookie records the user consent for the cookies in the "Advertisement" category.
cookielawinfo-checkbox-analytics	1 year	Set by the GDPR Cookie Consent plugin, this cookie records the user consent for the cookies in the "Analytics" category.
cookielawinfo-checkbox-functional	1 year	The GDPR Cookie Consent plugin sets the cookie to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary	1 year	Set by the GDPR Cookie Consent plugin, this cookie records the user consent for the cookies in the "Necessary" category.
cookielawinfo-checkbox-others	1 year	Set by the GDPR Cookie Consent plugin, this cookie stores user consent for cookies in the category "Others".
cookielawinfo-checkbox-performance	1 year	Set by the GDPR Cookie Consent plugin, this cookie stores the user consent for cookies in the category "Performance".
CookieLawInfoConsent	1 year	CookieYes sets this cookie to record the default button state of the corresponding category and the status of CCPA. It works only in coordination with the primary cookie.
csrftoken	1 year	This cookie is associated with Django web development platform for python. Used to help protect the website against Cross-Site Request Forgery attacks
JSESSIONID	session	New Relic uses this cookie to store a session identifier so that New Relic can monitor session counts for an application.
SRCHD	1 year 24 days	Bing sets this cookie to display map content using Bing Maps.
SRCHUID	1 year 24 days	Bing sets this cookie to display map content using Bing Maps.
SRCHUSR	1 year 24 days	Bing sets this cookie to display map content using Bing Maps.
viewed_cookie_policy	1 year	The GDPR Cookie Consent plugin sets the cookie to store whether or not the user has consented to use cookies. It does not store any personal data.

Cookie	Duration	Description
_an_uid	7 days	No description available.
_cfuvid	session	Description is currently not available.
6suuid	1 year 1 month 4 days	No description available.
AN	1 month	No description available.
AS	session	No description available.
debug	never	No description available.
ebEventToTrack	1 month	No description available.
eblang	1 year	No description available.
gm_country_code	7 days	Description is currently not available.
guest	1 month	No description available.
JOTFORM_SESSION	1 month	No description available.
loglevel	never	No description available.
receive-cookie-deprecation	1 year 1 month 4 days	Description is currently not available.
SP	session	Description is currently not available.
SRCHHPGUSR	1 year 24 days	No description available.
SS	session	Description is currently not available.
stableId	1 year	Description is currently not available.
TESTCOOKIESENABLED	1 minute	Description is currently not available.
userReferer	1 month	No description available.
VISITOR_PRIVACY_METADATA	6 months	Description is currently not available.
zoom	never	No description available.

Cookie	Duration	Description
_SS	session	Bing sets this cookie to collect information on how visitors behave on multiple websites and to understand how they access the website, to provide relevant ads.
ANONCHK	10 minutes	The ANONCHK cookie, set by Bing, is used to store a user's session ID and verify ads' clicks on the Bing search engine. The cookie helps in reporting and personalization as well.
bcookie	1 year	LinkedIn sets this cookie from LinkedIn share buttons and ad tags to recognize browser IDs.
bscookie	1 year	LinkedIn sets this cookie to store performed actions on the website.
fr	3 months	Facebook sets this cookie to show relevant advertisements by tracking user behaviour across the web, on sites with Facebook pixel or Facebook social plugin.
guest_id	1 year 1 month	Twitter sets this cookie to identify and track the website visitor. It registers if a user is signed in to the Twitter platform and collects information about ad preferences.
IDE	1 year 24 days	Google DoubleClick IDE cookies store information about how the user uses the website to present them with relevant ads according to the user profile.
li_sugr	3 months	LinkedIn sets this cookie to collect user behaviour data to optimise the website and make advertisements on the website more relevant.
mgref	1 year	This cookie is set by Eventbrite to deliver content tailored to the end user's interests and improve content creation. It is also used for event-booking purposes.
muc_ads	1 year 1 month 4 days	Twitter sets this cookie to collect user behaviour and interaction data to optimize the website.
MUID	1 year 24 days	Bing sets this cookie to recognise unique web browsers visiting Microsoft sites. This cookie is used for advertising, site analytics, and other operations.
personalization_id	1 year 1 month 4 days	Twitter sets this cookie to integrate and share features for social media and also store information about how the user uses the website, for tracking and targeting.
SUID	12 hours	Google Analytics sets this cookie to collect data on user preferences and/or interaction with web campaign content (Microsoft).
test_cookie	15 minutes	doubleclick.net sets this cookie to determine if the user's browser supports cookies.
UserMatchHistory	1 month	LinkedIn sets this cookie for LinkedIn Ads ID syncing.
VISITOR_INFO1_LIVE	5 months 27 days	YouTube sets this cookie to measure bandwidth, determining whether the user gets the new or old player interface.
YSC	session	Youtube sets this cookie to track the views of embedded videos on Youtube pages.
yt-remote-connected-devices	never	YouTube sets this cookie to store the user's video preferences using embedded YouTube videos.
yt-remote-device-id	never	YouTube sets this cookie to store the user's video preferences using embedded YouTube videos.
yt.innertube::nextId	never	YouTube sets this cookie to register a unique ID to store data on what videos from YouTube the user has seen.
yt.innertube::requests	never	YouTube sets this cookie to register a unique ID to store data on what videos from YouTube the user has seen.

Cookie	Duration	Description
_clck	1 year	Microsoft Clarity sets this cookie to retain the browser's Clarity User ID and settings exclusive to that website. This guarantees that actions taken during subsequent visits to the same website will be linked to the same user ID.
_clsk	1 day	Microsoft Clarity sets this cookie to store and consolidate a user's pageviews into a single session recording.
_fbp	3 months	Facebook sets this cookie to display advertisements when either on Facebook or on a digital platform powered by Facebook advertising after visiting the website.
_ga	1 year 1 month 4 days	Google Analytics sets this cookie to calculate visitor, session and campaign data and track site usage for the site's analytics report. The cookie stores information anonymously and assigns a randomly generated number to recognise unique visitors.
_ga_*	1 year 1 month 4 days	Google Analytics sets this cookie to store and count page views.
_gat_UA-*	1 minute	Google Analytics sets this cookie for user behaviour tracking.
_gcl_au	3 months	Google Tag Manager sets the cookie to experiment advertisement efficiency of websites using their services.
_gd_session	4 hours	This cookie is used for collecting information on users visit to the website. It collects data such as total number of visits, average time spent on the website and the pages loaded.
_gd_svisitor	1 year 1 month 4 days	This cookie is set by the Google Analytics. This cookie is used for tracking the signup commissions via affiliate program.
_gd_visitor	1 year 1 month 4 days	This cookie is used for collecting information on the users visit such as number of visits, average time spent on the website and the pages loaded for displaying targeted ads.
_gid	1 day	Google Analytics sets this cookie to store information on how visitors use a website while also creating an analytics report of the website's performance. Some of the collected data includes the number of visitors, their source, and the pages they visit anonymously.
_s	1 year	This cookie is associated with Shopify's analytics suite.
ajs_anonymous_id	never	This cookie is set by Segment to count the number of people who visit a certain site by tracking if they have visited before.
ajs_group_id	never	This cookie is set by Segment to track visitor usage and events within the website.
ajs_user_id	never	This cookie is set by Segment to help track visitor usage, events, target marketing, and also measure application performance and stability.
AnalyticsSyncHistory	1 month	Linkedin set this cookie to store information about the time a sync took place with the lms_analytics cookie.
CLID	1 year	Microsoft Clarity set this cookie to store information about how visitors interact with the website. The cookie helps to provide an analysis report. The data collection includes the number of visitors, where they visit the website, and the pages visited.
CONSENT	2 years	YouTube sets this cookie via embedded YouTube videos and registers anonymous statistical data.
ln_or	1 day	Linkedin sets this cookie to registers statistical data on users' behaviour on the website for internal analytics.
MR	7 days	This cookie, set by Bing, is used to collect user information for analytics purposes.
MUIDB	1 year 24 days	Bing sets this cookie to determine how the user uses the website and any advertising that the end user may have seen before visiting the said website.
SM	session	Microsoft Clarity cookie set this cookie for synchronizing the MUID across Microsoft domains.
vuid	1 year 1 month 4 days	Vimeo installs this cookie to collect tracking information by setting a unique ID to embed videos on the website.
wow.anonymousId	1 year 1 month 4 days	This is a analytic cookie used to store anonymous visitor ID. It tracks the visitor uniquely between visits.
wow.session	20 minutes	This cookie is set by the provider Communigator.This cookie is used to track the Internet Information Services(IIS) session state.
wow.utmvalues	20 minutes	This cookie is from Communigator. This cookie is used to store UTM values for the session.UTM values are specific text strings that are appended to URLs that allow Communigator to track the URLs and the UTM values when they get clicked on

Cookie	Duration	Description
__cf_bm	30 minutes	Cloudflare set the cookie to support Cloudflare Bot Management.
_EDGE_S	session	Bing sets this cookie to display map content using Bing Maps.
_EDGE_V	1 year 24 days	Bing sets this cookie to display map content using Bing Maps.
li_gc	5 months 27 days	Linkedin set this cookie for storing visitor's consent regarding using cookies for non-essential purposes.
lidc	1 day	LinkedIn sets the lidc cookie to facilitate data center selection.
sp_landing	1 day	The sp_landing is set by Spotify to implement audio content from Spotify on the website and also registers information on user interaction related to the audio content.
sp_t	1 year	The sp_t cookie is set by Spotify to implement audio content from Spotify on the website and also registers information on user interaction related to the audio content.
TawkConnectionTime	session	Tawk.to, a live chat functionality, sets this cookie. For improved service, this cookie helps remember users so that previous chats can be linked together.
twk_idm_key	session	Tawk set this cookie to allow the website to recognise the visitor in order to optimize the chat-box functionality.

Cognitive Services gets better at handling rich information

Contact Grey Matter

Mary Branscombe

Intel oneAPI 2024.1 A Milestone Release

ISV Partner Day Shortlisted for CRN Sales & Marketing Award

Microsoft 365 and Azure Security Tools: Microsoft Intune

Women in Tech: A New Era | Roundtable

About

Solutions

Vendors

Certifications

Select Your Region

Cognitive Services gets better at handling rich information

Contact Grey Matter

Mary Branscombe

Related News

Intel oneAPI 2024.1 A Milestone Release

ISV Partner Day Shortlisted for CRN Sales & Marketing Award

Microsoft 365 and Azure Security Tools: Microsoft Intune

Women in Tech: A New Era | Roundtable

Select Your Region