Yandex SpeechKit Mobile SDK

The SpeechKit SDK allows you to embed Yandex speech recognition and synthesis, as well as voice activation, into your mobile app.

Categories: Dex General Purpose Libraries Utility
GroupId: com.yandex.android
ArtifactId: speechkit
Last Version: 3.12.2
Type: aar
Project URL: https://tech.yandex.com/speechkit/mobilesdk/
Project Organization: Yandex LLC
Source Code Management: https://github.com/yandexmobile/yandex-speechkit-android.git

How to add to project

Maven:

<!-- https://jarcasting.com/artifacts/com.yandex.android/speechkit/ -->
<dependency>
    <groupId>com.yandex.android</groupId>
    <artifactId>speechkit</artifactId>
    <version>3.12.2</version>
    <type>aar</type>
</dependency>

Gradle (Groovy):

// https://jarcasting.com/artifacts/com.yandex.android/speechkit/
implementation 'com.yandex.android:speechkit:3.12.2'

Gradle (Kotlin):

// https://jarcasting.com/artifacts/com.yandex.android/speechkit/
implementation("com.yandex.android:speechkit:3.12.2")

Buildr:

'com.yandex.android:speechkit:aar:3.12.2'

Ivy:

<dependency org="com.yandex.android" name="speechkit" rev="3.12.2">
  <artifact name="speechkit" type="aar" />
</dependency>

Grape:

@Grapes(
@Grab(group='com.yandex.android', module='speechkit', version='3.12.2')
)

SBT:

libraryDependencies += "com.yandex.android" % "speechkit" % "3.12.2"

Leiningen:

[com.yandex.android/speechkit "3.12.2"]

Dependencies

compile (2)

Group : Artifact                      Type  Version
com.android.support : support-v4      jar   25.3.1
com.getkeepsafe.relinker : relinker   aar   1.2.3

Project Modules

There are no modules declared in this project.

Yandex SpeechKit

Use the SpeechKit library to integrate speech recognition, text-to-speech, music identification, and Yandex voice activation into your Android mobile app. Voice control makes your app more user-friendly, especially for people who use it on the go or whose hands are busy. Without touching the screen, users can activate the desired feature with a single phrase.

SpeechKit supports the following languages for speech recognition and text-to-speech:

  • Russian
  • English
  • Ukrainian
  • Turkish

If the language you need isn't supported, SpeechKit allows you to switch to the Google speech recognition system.

If the number of voice commands accessing your application does not exceed 10,000 per day, you can use the basic version of the SpeechKit Mobile SDK for free. If you have more requests, you can either pay for the amount over the limit, or switch to the commercial pricing plan. The commercial plan removes the restrictions and adds extra functionality. For instance, you can create unique voices and speech models for specific contexts.
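
To see how close an app is to the free-tier threshold described above, requests can be counted on the client. The following is a minimal sketch only: `DailyQuota` is a hypothetical helper that is not part of SpeechKit, and it assumes the limit resets per calendar day, which the text does not specify. How Yandex actually counts requests may differ.

```java
import java.time.LocalDate;

/** Hypothetical helper (not part of SpeechKit): counts requests per calendar day. */
class DailyQuota {
    static final int FREE_LIMIT = 10_000; // free-tier limit stated in the text

    private LocalDate day;
    private int used;

    DailyQuota(LocalDate today) {
        this.day = today;
    }

    /** Records one request made on the given date and returns the count for that day. */
    int record(LocalDate today) {
        if (!today.equals(day)) { // assumed reset rule: the counter restarts on a new day
            day = today;
            used = 0;
        }
        return ++used;
    }

    /** Number of requests above the free limit for the current day. */
    int overLimit() {
        return Math.max(0, used - FREE_LIMIT);
    }
}
```

Calling `record` once per recognition or synthesis request and checking `overLimit` gives a rough local estimate of the billable overage.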

Where to start

  • Find out what SpeechKit is, what functionality is available in the library, and how it can improve your application.
  • Download SpeechKit samples and try to run them. This is a good way to quickly learn how to use the main components of the library.
  • Read the Quick Start, which describes the configuration required for the library to work, along with recommended steps for fast and successful integration of SpeechKit in your application.
  • Review the Documentation for the library's API: the main classes and their functions, features, and restrictions.
  • If you have previously used SpeechKit, we recommend that you also read the Guide for migrating to SpeechKit 3.12.2, which explains what has changed since the previous version.

Before you get started

Before you start working with APIs of Yandex services, you must create an API key:

  • Go to the Developer Dashboard.
  • Click Get key.
  • Enter a name for the key and select SpeechKit Mobile SDK in the list.
  • Enter your information and the type of project.
  • Click Submit.

After the API key is generated, you will be redirected to the Developer Dashboard. Under Your API keys, you can see all the keys that you have created.

Installation

You can add SpeechKit to a project from the Maven Central repository using Gradle. To do this, open the project-level build.gradle file and add Maven Central to the repositories section:

repositories {
    ...
    mavenCentral()
}

Open the build.gradle file for the application (module). In the dependencies section, add the dependency:

dependencies {
    ...
    // Use 'compile' instead on Android Gradle plugin versions older than 3.0.
    implementation 'com.yandex.android:speechkit:3.12.2'
}

In the build.gradle file for the application (module), go to the defaultConfig section and enter the minimum Android API version required by your application (14 or later):

defaultConfig {
    ...
    minSdkVersion 14
}

Requirements

Minimum system requirements:

  • Android API level 14 (minSdkVersion 14)
  • Android Studio 2.3.3 or later

Architecture

Global settings

  • SpeechKit
    • LocationProvider
    • EventLogger
    • LogLevel

Speech recognition

  • OnlineRecognizer
    • OnlineRecognizer.Builder
      • Language
      • OnlineModel
      • SoundFormat
    • Recognizer
    • RecognizerListener
    • Track
      • Artist
    • Recognition
      • RecognitionHypothesis
        • RecognitionWord
      • Biometry
        • BiometryGroup
        • BiometryEmotion
        • LanguageScore
  • RecognizerActivity
    • AudioProcessingMode
    • RecognizerUIEarcons
      • RecognizerUIEarcons.Builder
      • DefaultEarconsBundle

Speech synthesis

  • OnlineVocalizer
    • OnlineVocalizer.Builder
      • Language
      • Voice
      • Emotion
      • SoundFormat
      • SoundQuality
    • Vocalizer
      • TextSynthesizingMode
    • VocalizerListener
    • Synthesis
      • SoundBuffer

Voice activation

  • PhraseSpotter
    • PhraseSpotter.Builder
    • PhraseSpotterListener

Additional functionality

  • AudioSource
    • AutoStartStopAudioSource
    • ManualStartStopAudioSource
    • AudioSourceListener
  • UniProxySession

Usage

SpeechKit

Singleton class for configuring and controlling the library. You don't need to create or destroy SpeechKit instances explicitly. To access the instance, call the getInstance method, which creates it on first use. The instance is destroyed when the application closes.

Before using any of the SpeechKit functionality, you must configure SpeechKit using the API key (for more information, see Before you get started).

SpeechKit.getInstance().init(getApplicationContext(), "developer_api_key");
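
The lazy creation behind getInstance can be illustrated with a plain Java singleton. This is only a sketch of the general pattern: `LazySingleton` is a hypothetical stand-in, and nothing here is taken from the actual SpeechKit implementation.

```java
/** Hypothetical stand-in for a lazily created singleton; not the real SpeechKit class. */
final class LazySingleton {
    private static LazySingleton instance; // stays null until the first access

    private LazySingleton() {
        // Private constructor: callers cannot create instances directly.
    }

    /** Creates the instance on the first call and returns the same object afterwards. */
    static synchronized LazySingleton getInstance() {
        if (instance == null) {
            instance = new LazySingleton();
        }
        return instance;
    }
}
```

Every call to `LazySingleton.getInstance()` returns the same object, which is why configuration applied once is visible everywhere in the app.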

OnlineRecognizer

// Here "this" is passed as the listener that receives the callback below.
OnlineRecognizer recognizer = new OnlineRecognizer.Builder(Language.ENGLISH, OnlineModel.QUERIES, this)
        .setDisableAntimat(false)
        .setEnablePunctuation(true)
        .build();
recognizer.prepare();
recognizer.startRecording();

// Listener callback, invoked as partial results arrive.
@Override
public void onPartialResults(@NonNull Recognizer recognizer, @NonNull Recognition recognition, boolean endOfUtterance) {
    System.out.println("Partial result: " + recognition);
    if (endOfUtterance) {
        System.out.println("Request ID: " + recognition.getRequestId());
    }
}
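
The role of the endOfUtterance flag can be shown without the SDK. The sketch below uses hypothetical types (`PartialResultListener`, `UtteranceCollector`) modeled loosely on the callback above; they are not SpeechKit classes. It also assumes that each partial result replaces the previous one, which is an assumption and not documented behavior.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical callback modeled loosely on onPartialResults; not a SpeechKit type. */
interface PartialResultListener {
    void onPartialResult(String text, boolean endOfUtterance);
}

/** Keeps the latest partial hypothesis and stores it once the utterance ends. */
class UtteranceCollector implements PartialResultListener {
    private final List<String> utterances = new ArrayList<>();
    private String latest = "";

    @Override
    public void onPartialResult(String text, boolean endOfUtterance) {
        latest = text; // assumption: a newer partial result supersedes the older one
        if (endOfUtterance) {
            utterances.add(latest); // the utterance is complete, keep its final text
            latest = "";
        }
    }

    /** Completed utterances, in the order they finished. */
    List<String> utterances() {
        return utterances;
    }
}
```

In an app, the same idea applies inside the real listener: update the on-screen text on every partial result and act on it only when the utterance has ended.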

OnlineVocalizer

// build() returns an OnlineVocalizer, not a Builder.
OnlineVocalizer vocalizer = new OnlineVocalizer.Builder(Language.ENGLISH, this)
        .setEmotion(Emotion.GOOD)
        .setVoice(Voice.ERMIL)
        .build();
vocalizer.prepare();
vocalizer.synthesize("What's up kid?", Vocalizer.TextSynthesizingMode.APPEND);

Credits

The SpeechKit Mobile SDK is created and developed by Yandex.

License

The licensing terms for using SpeechKit are described in the Terms of Use. If you do not agree to any of the terms described in the license agreement, you cannot use SpeechKit.

Contacts

  • If you have any general questions about the library, look for answers in the FAQ.
  • If you found a bug in the library, and you can provide the exact steps for reproducing it or logs containing the necessary data, write to us via the feedback form.
  • If you need help, contact us at [email protected].
  • If you need to increase the maximum number of requests for speech recognition and/or speech synthesis, send an inquiry to [email protected].
Versions

  • 3.12.2
  • 2.5.0