If set to "true" allows SAPI4 Recognizers to "guess" the current SpeakerProfile, so that
subsequent calls to SpeakerManager.getCurrentSpeaker will return the profile that the engine is
currently using.
Basic audio format converter, which is guaranteed to
convert between any two javax.sound.sampled.AudioFormats - this is unlike the
javax.sound.sampled.AudioSystem conversion,
which sometimes can't provide the needed conversions.
Basic audio format converter, which is guaranteed to
convert between any two AudioFormats provided both are
16-bit - this is unlike the javax.sound.sampled.AudioSystem conversion,
which sometimes can't provide the needed conversions.
AudioSink wrapper for a javax.media playback device, enabling audio data to be
played to speakers etc (avoiding the bugs encountered by the
javax.sound.sampled playback devices when a TargetDataLine is running at
the same time).
AudioSource wrapper for a javax.media capture device, enabling audio data to be
captured from a microphone etc (avoiding the bugs encountered by the
javax.sound.sampled capture devices when a SourceDataLine is running at
the same time).
Writes audio data to a URL using the JMF packages, with content type defined by
one of the FileTypeDescriptor String fields, eg MPEG_AUDIO, QUICKTIME etc
A class which has an internal buffer, which it uses to store data read from
it's source (set using the setSource method) before being sent to it's
sink (set using the setSink method).
A class designed to allow a single AudioSource to supply the same audio
data to multiple AudioSinks (useful, say, to broadcast speech to multiple
remote clients).
Returns false if the AudioFormat cannot be set - eg some Recognizers
may not be able to change their input format, so the CGAudioManager
(which is an AudioSink) for that Recognizer will return false here.
This class opens two RTP steams - one receives audio data from the url "rtp://:12346/audio"
and plays it to the local output device, and the other sends audio data (captured from the
local audio capture device) to the url "rtp://:12344/audio".
This class provides the standard methods of the EngineCentral interface,
but also allows synchronization with the AWT EventQueue to be turned on
or off.
This class provides the standard methods of the EngineProperties interface,
but also allows UI components specific to the Microsoft speech API to be
displayed.
Implements both the FinalRuleResult and FinalDictationResult interfaces,
as described in the JSAPI documentation, but adds two methods allowing
the user to obtain the recognizer's confidence in it's estimate of the current result.
Returns false unless you have selected a Profile and/or Voice, depending
on the mode of this panel - ie, if the mode is SHOW_SYNTHESIZERS | SHOW_RECOGNIZERS
both a SpeakerProfile and Voice will need to be selected for this method to return true.
Used by a server to receive notices that a client has attached or removed itself to or from
an AudioSocketSource, and also to be informed of the bytes transferred to or from a
specific client.
This package provides classes originally intended for redirecting output
from a Synthesizer to a File, a SourceDataLine, a remote client or a custom
AudioSink, and providing audio data to a Recognizer from a File, TargetDataLine,
remote client or a custom AudioSource.
The com.cloudgarden.speech package provides public access
to a few classes which implement
interfaces in the javax.speech packages but add a few additional functions.
The com.cloudgarden.speech.userinterface package provides classes for
drawing a mouth shape as defined by a CGSpeakableEvent
displaying speech engines in an extension of a JTree (a SpeechEngineTree),
with the SpeechEngineChooser class providing various dialogs for selecting speech
engines, SpeakerProfiles and Voices.
Reads audio data from a file, and uses an AudioSplitter to send the
data to a recognizer as well as another file, saving the audio data
in the same format as the recognizer, which may be different from the
input file.
Tests out basic dictation from the default audio device (usually the microphone) -
also demonstrates getting the list of speaker profiles and setting the current speaker.
Tests whether a duplex sound card is being used - keep saying one of the
five commands ("Nice day", "Hello", "How are you" etc) while the computer
is replying - the computer should hear what you said while it was still talking
and reply when it has finished it's current reply.
Demonstrates network transmission of audio data in compressed (GSM) and
uncompressed (RAW) formats - for ease of demonstration incorporates both
server and client since both run on the localhost.
Returns a modal SpeechEngineChooser initialized to display all available Recognizers
that match the reqRec parameter and all Synthesizers that match the reqSyn parameter..
Returns a modal SpeechEngineChooser initialized to display all available Recognizers
that match the reqRec parameter and all Synthesizers that match the reqSyn parameter..
Returns a confidence value from 0 to 100, indicating the degree of
confidence the recognizer has in a certain alternate
result (0 for the set of best tokens).
Returns a confidence value in a range determined
by the engine , indicating the degree of
confidence the recognizer has in a certain alternate
result (0 for the set of best tokens).
If this is a MOUTH_SHAPE event (or a VISEME event after convertToMouthShapeEvent has been called),
returns a number from 0 to 255 specifying the openess of the jaw.
If this is a MOUTH_SHAPE event (or a VISEME event after convertToMouthShapeEvent has been called),
returns a number from 0 to 255 specifying the horizontal tension of the lips.
If this is a MOUTH_SHAPE event (or a VISEME event after convertToMouthShapeEvent has been called),
returns a number from 0 to 255 specifying the total height of the mouth (from min height to max height)
If this is a MOUTH_SHAPE event (or a VISEME event after convertToMouthShapeEvent has been called),
returns a number from 0 to 255 specifying the total upturn of the mouth (from max downturn to max upturn)
If this is a MOUTH_SHAPE event (or a VISEME event after convertToMouthShapeEvent has been called),
returns a number from 0 to 255 specifying the total width of the mouth (from min width to max width)
If from a SAPI4 engine, returns the IPA phoneme corresponding to this event,
otherwise, if from a SAPI5 engine, returns the Microsoft PhoneID for this
sound - you will need to translate from the PhoneID to a unicode value.
If this is a MOUTH_SHAPE event (or a VISEME event after convertToMouthShapeEvent has been called),
returns a number from 0 to 255 specifying the visibility of the lower teeth.
If this is a MOUTH_SHAPE event (or a VISEME event after convertToMouthShapeEvent has been called),
returns a number from 0 to 255 specifying the visibility of the upper teeth.
If this is a MOUTH_SHAPE event (or a VISEME event after convertToMouthShapeEvent has been called),
returns a number from 0 to 255 specifying the position of the tongue (0 being lowest, 255 being highest).
getTopLip() -
Method in class com.cloudgarden.speech.userinterface.Mouth
Returns the polygon of points in this Component outlining the upper lip.
Tests activating/deactivating rules and grammars in response to spoken commands - one grammar
contains the words "alpha","bravo","charlie","delta", but only one rule is active at a time
and the other grammar contains "one","two","three","four" - again, only one rule
active at a time: also, only one of the grammars is active at a time - switch between
grammars with the command "switch" and activate the next rule in each grammar
with the command "next".