HMS ML Kit: Text to Speech
With HMS ML Kit, Huawei gives developers a simple way to implement text-to-speech (TTS) features in their apps. Text to speech turns text content into natural-sounding speech. This service uses the deep neural network (DNN) synthesis mode and can be quickly integrated through the on-device SDK to generate audio data in real time. It supports the download of offline models. In the current version, two standard male voices and six standard female voices are available. The service is available to developers worldwide.
Using the ML Kit demo APK, you can see this technology in action.
How Developers are Implementing Text to Speech
TTS is commonly used in broadcasting, news, voice navigation, and audio reading. Developers can use this feature to let their users convert large amounts of text into speech output. HMS TTS also works seamlessly with navigation data, which enables developers to create powerful navigation apps. ML Kit can also synthesize voice segments into the navigation voice, so that navigation is more personalized.
TTS is currently available only to users who have a Huawei device. The service has several limitations. The text to be read is capped at 500 characters. TTS in French, Spanish, German, and Italian is currently deployed only in Asia, Africa, Latin America, and Europe. TTS depends on on-cloud APIs, so ensure that the device can access the Internet during commissioning and usage. The default specifications of the real-time output audio data are as follows: PCM mono, 16-bit depth, and 16 kHz audio sampling rate.
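The 500-character cap and the default PCM format lend themselves to a small illustration. The following is a minimal sketch in Python, not part of the ML Kit SDK: the helper names `split_for_tts` and `pcm_duration_seconds` are hypothetical, and the sentence-first splitting strategy is only one possible way to stay under the cap. The constants come from the limits stated above.

```python
import re

MAX_CHARS = 500          # text cap per request, as stated above
SAMPLE_RATE_HZ = 16_000  # default output sampling rate
BYTES_PER_SAMPLE = 2     # 16-bit depth
CHANNELS = 1             # mono


def split_for_tts(text, limit=MAX_CHARS):
    """Split text into chunks of at most `limit` characters (hypothetical helper).

    Sentences are kept whole where possible. A sentence longer than the
    limit is cut at the last space that fits, or hard-cut if it has none.
    """
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    chunks = []
    current = ""
    for sentence in sentences:
        if not sentence:
            continue
        candidate = f"{current} {sentence}".strip()
        if len(candidate) <= limit:
            current = candidate
            continue
        # The sentence does not fit: flush what has been collected so far.
        if current:
            chunks.append(current)
            current = ""
        # Break up a sentence that is itself longer than the limit.
        while len(sentence) > limit:
            cut = sentence.rfind(" ", 0, limit + 1)
            if cut <= 0:
                cut = limit  # no usable space, fall back to a hard cut
            chunks.append(sentence[:cut].rstrip())
            sentence = sentence[cut:].lstrip()
        current = sentence
    if current:
        chunks.append(current)
    return chunks


def pcm_duration_seconds(num_bytes):
    """Playback time of raw PCM data in the default output format."""
    bytes_per_second = SAMPLE_RATE_HZ * BYTES_PER_SAMPLE * CHANNELS
    return num_bytes / bytes_per_second
```

Each chunk would then be handed to the TTS engine in turn; that call is left out here because it depends on the SDK. At the default format, one second of audio occupies 16,000 × 2 × 1 = 32,000 bytes, which is useful when sizing playback buffers.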
Developer Resources
Huawei provides plenty of documentation and guides to help you get started with ML Kit. You can get started with HMS ML Kit Text to Speech with this guide here.
For all of the functions of ML Kit, refer to its service portal here.
For an overview of its APIs, browse the comprehensive resource library here.
You can also look at different ways that ML Kit can be implemented by browsing a collection of sample code here.
Related
For more information like this, visit the HUAWEI Developer Forum.
ML Kit
Features:
Added the liveness detection service, which supports silent liveness detection and captures faces in real time. It can determine whether a face is of a real user without requiring the user to follow specific instructions.
Added the image super-resolution service, which removes the compression noise of images to obtain clearer images, with the resolution unchanged.
Added the document skew correction service, which automatically identifies the location of a document in an image and adjusts the shooting angle to the angle facing the document, even if the document is tilted.
Added the hand keypoint detection service, improved the speed and accuracy of bank card recognition, enhanced the translation service, optimized the text to speech service, and made other improvements.
Link: https://developer.huawei.com/consumer/en/doc/development/HMSCore-Guides/version-changehistory-0000001050040023
Scene Kit
Features:
Added APIs for you to create apps which blend the virtual and the real, with features such as dynamic face stickers, 3D Qmojis, and virtual object placement.
Link: https://developer.huawei.com/consumer/en/hms/huawei-scenekit/
Push Kit
Features:
Added messaging by user time zone, scenario, and geofence, improving user experience.*
*The functions involving user data must be implemented with users' authorization.
Link: https://developer.huawei.com/consumer/en/doc/development/HMSCore-Guides/automated-notifications-0000001051072150#EN-US_TOPIC_0000001051072150__section17240145319447
Analytics Kit
Features:
Supported integration of web apps with Analytics Kit to implement data collection and unified analysis.
Supported automatic collection from mobile phones and tablets as well as configuration of the app installation source.
Link: https://developer.huawei.com/consumer/en/hms/huawei-analyticskit
Updates of all HMS Core versions
Learn more: https://developer.huawei.com/consumer/en/doc/development/HMSCore-Guides-V5/hmssdk-kit-0000001050042513-V5
Scene Kit
Sample Code:
Added the ARView and FaceView development procedures.
Github: https://github.com/HMS-Core/hms-scene-demo
ML Kit
Sample Code:
Added a demo for the messaging by scenario.
Github: https://github.com/HMS-Core/hms-ml-demo
Push Kit
Sample Code:
Added a demo to illustrate how to integrate web apps with Analytics Kit.
Github: https://github.com/HMS-Core/hms-push-clientdemo-android
Analytics Kit
Sample Code:
Added a demo to illustrate how to integrate web apps with Analytics Kit.
Github: https://github.com/HMS-Core/hms-analytics-demo-javascript
[July 15, 2021] Huawei launched HMS Core 6.0 today to global app developers, bringing multiple new open capabilities and updating some existing services and features. As of now, HMS Core (APK) on all user devices has been updated to the 6.0 version. You can access HUAWEI Developers anytime to get the new services and features.
HMS Core 6.0 further opens up Huawei's advanced technologies in media, graphics, telecommunications, and other fields. For media apps, Huawei provides AV Pipeline Kit with preset pipelines for video super-resolution, sound event detection, and other media services to address audio/video app developers' pain points such as difficulties with capability building and high power consumption. In the graphics field, 3D Modeling Kit offers AI-based capabilities such as 3D object reconstruction and material generation for e-commerce and animation apps, making it possible to generate 3D models even with mobile phones as long as there is an RGB camera installed, bringing more fun to user interactions. This service is currently available to all Android device models.
In addition, Computer Graphics Kit debuts the "volumetric fog" feature powered by the volume rendering technique for dynamic lighting and shadowing fog effects on mobile devices, making indoor scenes more realistic for next-generation mobile games, especially for Cyberpunk-style mobile games. This feature is available to all Android devices supporting Vulkan.
In the telecommunications field where Huawei has gained a leading position, Network Kit has updated its AI-powered network environment prediction function so that network parameters can be optimized based on service network access rules and differentiated network requirements, thereby reducing apps' network latency by more than 20%.
HMS Core 6.0 also provides other services such as Membership Kit, Video Editor Kit, and Audio Editor Kit. You can visit the HUAWEI Developers website to find out more.
As a collection of Huawei's software and hardware open capabilities, HMS Core has always been developer-centric and is continuously opening up Huawei's core technologies and capabilities to more developers, providing a high-quality consistent experience for cross-platform and cross-device apps. HMS Core empowers you to develop Web apps, Quick Apps, and apps in Android and HarmonyOS running on phones, tablets, smart displays, wearables, and head units, using cross-platform development frameworks such as React Native and Cordova.
HMS Core 6.0, which was released globally on July 15, offers SmartCoder as part of HMS Toolkit. SmartCoder is an intelligent programming assistant designed to make HMS Core integration seamless and efficient, so that developing innovative apps is easier than ever.
With SmartCoder, you can find official HMS Core resources (currently covering Account Kit, Ads Kit, Analytics Kit, IAP, and Push Kit) in Android Studio with ease, without ever needing to switch between Android Studio and other interfaces, for a document-free programming experience. This alone can boost development efficiency by more than 30%.
SmartCoder comes equipped with two functions: code auto-completion and smart search. To activate code auto-completion, you only need to enter hms in the editing area. The function helps you complete coding quickly, thanks to the following advantages: it completes single APIs or API call chains; provides floating windows that describe APIs in rich text, and quick documentation in pop-ups that lists code details as well as a short description; and allows you to edit auto-completed code snippets. Smart search provides instant access to a wealth of resources, including Kit (for kit-related documentation), Template (for integration steps), Snippet (for sample code snippets), and API (for official API references).
HMS Toolkit is an IDE plug-in that provides all the tools required for developing and releasing HMS Core-integrated apps in Android Studio. You can use it to create, code, convert, test, and release your apps, for a more productive and cost-effective development process.
Notebloc is a Spanish app that scans, saves, and shares notes, documents, receipts, drawings, photos, and images of any type. It allows users to crop documents or images, and automatically corrects misaligned pages. Notebloc, which is now available worldwide and supports over 30 languages, has already been downloaded by more than 7 million users around the world.
Notebloc's core functions are centered around documents. The integration of the HMS Core Pencil Engine into the Notebloc app offers specialized features such as brush effects, handwriting editing, stroke estimate, smart shape, and double-tapping. These advanced tools provide a superior handwriting experience. Now, users can effortlessly edit documents, by using the marker to annotate, mark up, and add notes to a file, and they can also unleash their creativity by adding sketches or diagrams. This is how Huawei's Pencil Engine allows Notebloc to bring users' best ideas to life.
Notebloc also integrates the HMS Core ML Kit text recognition service, which enables the app to accurately identify and extract text from images of receipts, business cards, and documents, and provides precise, structured transcription of important information in text, greatly improving user satisfaction.
Teamwork Timeline:
2013: Notebloc was founded in Barcelona, Spain.
2021: In September, team meetings were held regarding co-development between Notebloc and Huawei. In November, the project began. Huawei's HMS Core DTSE team helped Notebloc's developers overcome difficulties, such as a lack of test devices and insufficient sample documents.
2022: In January, HMS Core was successfully integrated into the Notebloc app, and became available on the HUAWEI AppGallery.
Customer feedback:
STEM Alliance reported that Notebloc, one of the first apps in Europe to integrate Pencil Engine, was able to provide users with substantially improved note-taking services by working with HMS Core. More users can now access and use a myriad of editing tools, and can easily, and securely, scan and share documents.
The Notebloc team confirmed its intention to integrate other HMS Core capabilities in order to attract app users and increase monetization in the future.
To learn more, please visit:
>> Reddit to join developer discussions
>> GitHub to download the sample code
>> Stack Overflow to solve integration problems
Follow our official account for the latest HMS Core-related news and updates.
Our lives are now packed with advanced devices, such as mobile gadgets, wearables, smart home appliances, telematics devices, and more.
Of all the features that make them advanced, the major one is the ability to understand user speech. Speaking into a device and telling it to do something is naturally easier and more satisfying than using input devices (like a keyboard and mouse) for the same purpose.
To help devices understand human speech, HMS Core ML Kit introduced the automatic speech recognition (ASR) service, to create a smoother human-machine interaction experience.
Service Introduction
ASR can recognize and simultaneously convert speech (no longer than 60s) into text, by using industry-leading deep learning technologies. Boasting regularly updated algorithms and data, currently the service delivers a recognition accuracy of 95%+. The supported languages now are: Mandarin Chinese (including Chinese-English bilingual speech), English, French, German, Spanish, Italian, Arabic, Russian, Thai, Malay, Filipino, and Turkish.
Demo
Use Cases
ASR covers many fields spanning life and work, and enhances recognition capabilities of searching for products, movies, TV series, and music, as well as the capabilities for navigation services. When a user searches for a product in a shopping app through speech, this service recognizes the product name or feature in speech as text for search.
Similarly, when a user uses a music app, this service recognizes the song name or singer input by voice as text to search for the song.
On top of these, ASR can even contribute to driving safety. While driving, when users are not supposed to use their phone to, for example, search for a place, ASR allows them to say where they want to go and converts the speech into text for the navigation app, which can then offer the search results to users.
Features
Real-time result output
Available options: with and without speech pickup UI
Endpoint detection: Start and end points of speech can be accurately located.
Silence detection: No voice packet is sent for silent parts.
Intelligent conversion of number formats: For example, when the speech is "year two thousand twenty-two", the text output by ASR will be "2022".
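To make the number-format conversion above concrete, here is a minimal sketch in Python of how spoken English number words could be turned into digits. It is an illustration only and says nothing about how ML Kit does it internally. The function name `words_to_number` is hypothetical, and the sketch handles only plain cardinal phrasing, not year-style readings such as "nineteen ninety-nine".

```python
UNITS = {
    "zero": 0, "one": 1, "two": 2, "three": 3, "four": 4, "five": 5,
    "six": 6, "seven": 7, "eight": 8, "nine": 9, "ten": 10,
    "eleven": 11, "twelve": 12, "thirteen": 13, "fourteen": 14,
    "fifteen": 15, "sixteen": 16, "seventeen": 17, "eighteen": 18,
    "nineteen": 19,
}
TENS = {
    "twenty": 20, "thirty": 30, "forty": 40, "fifty": 50,
    "sixty": 60, "seventy": 70, "eighty": 80, "ninety": 90,
}
SCALES = {"thousand": 1_000, "million": 1_000_000}


def words_to_number(phrase):
    """Convert a cardinal number phrase to an int (hypothetical helper)."""
    words = phrase.lower().replace("-", " ").split()
    if not words:
        raise ValueError("empty phrase")
    total = 0    # value of the groups already closed by a scale word
    current = 0  # value of the group being read (always below 1000)
    for word in words:
        if word == "and":
            continue
        if word in UNITS:
            current += UNITS[word]
        elif word in TENS:
            current += TENS[word]
        elif word == "hundred":
            # A bare "hundred" is read as "one hundred".
            current = max(current, 1) * 100
        elif word in SCALES:
            total += max(current, 1) * SCALES[word]
            current = 0
        else:
            raise ValueError(f"not a number word: {word!r}")
    return total + current


print(words_to_number("two thousand twenty-two"))  # prints 2022
```

A production recognizer would also need to find number phrases inside longer sentences and handle other languages, which this sketch leaves out.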
How to Integrate ML Kit?
For guidance about ML Kit integration, please refer to its official document. You are also welcome to visit the HUAWEI Developers website, where you can find other resources for reference.