1518:
1736:. The software was licensed from third-party developers Joseph Katz and Mark Barton (later, SoftVoice, Inc.) and was featured during the 1984 introduction of the Macintosh computer. This January demo required 512 kilobytes of RAM memory. As a result, it could not run in the 128 kilobytes of RAM the first Mac actually shipped with. So, the demo was accomplished with a prototype 512k Mac, although those in attendance were not told of this and the synthesis demo created considerable excitement for the Macintosh. In the early 1990s Apple expanded its capabilities offering system wide text-to-speech support. With the introduction of faster PowerPC-based computers they included higher quality voice sampling. Apple also introduced
408:
1650:. The program was available for non-Macintosh Apple computers (including the Apple II, and the Lisa), various Atari models and the Commodore 64. The Apple version preferred additional hardware that contained DACs, although it could instead use the computer's one-bit audio output (with the addition of much distortion) if the card was not present. The Atari made use of the embedded POKEY audio chip. Speech playback on the Atari normally disabled interrupt requests and shut down the ANTIC chip during vocal output. The audible output is extremely distorted speech when the screen is on. The Commodore 64 made use of the 64's embedded SID audio chip.
980:; however, many concatenative systems also have rules-based components. Many systems based on formant synthesis technology generate artificial, robotic-sounding speech that would never be mistaken for human speech. However, maximum naturalness is not always the goal of a speech synthesis system, and formant synthesis systems have advantages over concatenative systems. Formant-synthesized speech can be reliably intelligible, even at very high speeds, avoiding the acoustic glitches that commonly plague concatenative systems. High-speed synthesized speech is used by the visually impaired to quickly navigate computers using a
567:
1894:, a text-to-speech utility for people who have visual impairment. Third-party programs such as JAWS for Windows, Window-Eyes, Non-visual Desktop Access, Supernova and System Access can perform various text-to-speech tasks such as reading text aloud from a specified website, email account, text document, the Windows clipboard, the user's keyboard typing, etc. Not all programs can use speech synthesis directly. Some programs can use plug-ins, extensions or add-ons to read text aloud. Third-party programs are available that can read text from the system clipboard.
574:
68:
1208:. The company states its software is built to adjust the intonation and pacing of delivery based on the context of language input used. It uses advanced algorithms to analyze the contextual aspects of text, aiming to detect emotions like anger, sadness, happiness, or alarm, which enables the system to understand the user's sentiment, resulting in a more realistic and human-like inflection. Other features include multilingual speech generation and long-form content creation with contextually-aware voices.
523:(LSP) method for high-compression speech coding, while at NTT. From 1975 to 1981, Itakura studied problems in speech analysis and synthesis based on the LSP method. In 1980, his team developed an LSP-based speech synthesizer chip. LSP is an important technology for speech synthesis and coding, and in the 1990s was adopted by almost all international speech coding standards as an essential component, contributing to the enhancement of digital speech communication over mobile channels and the internet.
832:(DSP) to the recorded speech. DSP often makes recorded speech sound less natural, although some systems use a small amount of signal processing at the point of concatenation to smooth the waveform. The output from the best unit-selection systems is often indistinguishable from real human voices, especially in contexts for which the TTS system has been tuned. However, maximum naturalness typically require unit-selection speech databases to be very large, in some systems ranging into the
1324:
also be read as "one three two five", "thirteen twenty-five" or "thirteen hundred and twenty five". A TTS system can often infer how to expand a number based on surrounding words, numbers, and punctuation, and sometimes the system provides a way to specify the context if it is ambiguous. Roman numerals can also be read differently depending on context. For example, "Henry VIII" reads as "Henry the Eighth", while "Chapter VIII" reads as "Chapter Eight".
2367:
877:. Diphone synthesis suffers from the sonic glitches of concatenative synthesis and the robotic-sounding nature of formant synthesis, and has few of the advantages of either approach other than small size. As such, its use in commercial applications is declining, although it continues to be used in research because there are a number of freely available software implementations. An early example of Diphone synthesis is a teaching robot,
1346:
193:
1814:
1316:" to aid in disambiguating homographs. This technique is quite successful for many cases such as whether "read" should be pronounced as "red" implying past tense, or as "reed" implying present tense. Typical error rates when using HMMs in this fashion are usually below five percent. These techniques also work well for most European languages, although access to required training
1597:). The synthesizer uses a variant of linear predictive coding and has a small in-built vocabulary. The original intent was to release small cartridges that plugged directly into the synthesizer unit, which would increase the device's built-in vocabulary. However, the success of software text-to-speech in the Terminal Emulator II cartridge canceled that plan.
1439:
approach works on any input, but the complexity of the rules grows substantially as the system takes into account irregular spellings or pronunciations. (Consider that the word "of" is very common in
English, yet is the only word in which the letter "f" is pronounced .) As a result, nearly all speech synthesis systems use a combination of these approaches.
1844:", which allowed command-line users to redirect text output to speech. Speech synthesis was occasionally used in third-party programs, particularly word processors and educational software. The synthesis software remained largely unchanged from the first AmigaOS release and Commodore eventually removed speech synthesis support from AmigaOS 2.1 onward.
1489:, reported that listeners to voice recordings could determine, at better than chance levels, whether or not the speaker was smiling. It was suggested that identification of the vocal features that signal emotional content may be used to help make synthesized speech sound more natural. One of the related issues is modification of the
1447:
loanwords, whose pronunciations are not obvious from their spellings. On the other hand, speech synthesis systems for languages like
English, which have extremely irregular spelling systems, are more likely to rely on dictionaries, and to use rule-based methods only for unusual words, or words that are not in their dictionaries.
1584:
In the early 1980s, TI was known as a pioneer in speech synthesis, and a highly popular plug-in speech synthesizer module was available for the TI-99/4 and 4A. Speech synthesizers were offered free with the purchase of a number of cartridges and were used by many TI-written video games (games offered
1455:
The consistent evaluation of speech synthesis systems may be difficult because of a lack of universally agreed objective evaluation criteria. Different organizations often use different speech data. The quality of speech synthesis systems also depends on the quality of the production technique (which
1243:
used to create convincing speech sentences that sound like specific people saying things they did not say. This technology was initially developed for various applications to improve human life. For example, it can be used to produce audiobooks, and also to help people who have lost their voices (due
756:
Concatenative synthesis is based on the concatenation (stringing together) of segments of recorded speech. Generally, concatenative synthesis produces the most natural-sounding synthesized speech. However, differences between natural variations in speech and the nature of the automated techniques for
2447:
Content creators have used voice cloning tools to recreate their voices for podcasts, narration, and comedy shows. Publishers and authors have also used such software to narrate audiobooks and newsletters. Another area of application is AI video creation with talking heads. Webapps and video editors
1943:
Text-to-speech (TTS) refers to the ability of computers to read text aloud. A TTS engine converts written text to a phonemic representation, then converts the phonemic representation to waveforms that can be output as sound. TTS engines with different languages, dialects and specialized vocabularies
1323:
Deciding how to convert numbers is another problem that TTS systems have to address. It is a simple programming challenge to convert a number into words (at least in
English), like "1325" becoming "one thousand three hundred twenty-five". However, numbers occur in many different contexts; "1325" may
897:
Domain-specific synthesis concatenates prerecorded words and phrases to create complete utterances. It is used in applications where the variety of texts the system will output is limited to a particular domain, like transit schedule announcements or weather reports. The technology is very simple to
836:
of recorded data, representing dozens of hours of speech. Also, unit selection algorithms have been known to select segments from a place that results in less than ideal synthesis (e.g. minor words become unclear) even when a better choice exists in the database. Recently, researchers have proposed
2491:
In the 2010s, Singing synthesis technology has taken advantage of the recent advances in artificial intelligence—deep listening and machine learning to better represent the nuances of the human voice. New high fidelity sample libraries combined with digital audio workstations facilitate editing in
1438:
Each approach has advantages and drawbacks. The dictionary-based approach is quick and accurate, but completely fails if it is given a word which is not in its dictionary. As dictionary size grows, so too does the memory space requirements of the synthesis system. On the other hand, the rule-based
901:
Because these systems are limited by the words and phrases in their databases, they are not general-purpose and can only synthesize the combinations of words and phrases with which they have been preprogrammed. The blending of words within naturally spoken language however can still cause problems
1620:
speech synthesizer chip on a removable cartridge. The
Narrator had 2kB of Read-Only Memory (ROM), and this was utilized to store a database of generic words that could be combined to make phrases in Intellivision games. Since the Orator chip could also accept speech data from external memory, any
1430:
is stored by the program. Determining the correct pronunciation of each word is a matter of looking up each word in the dictionary and replacing the spelling with the pronunciation specified in the dictionary. The other approach is rule-based, in which pronunciation rules are applied to words to
1327:
Similarly, abbreviations can be ambiguous. For example, the abbreviation "in" for "inches" must be differentiated from the word "in", and the address "12 St John St." uses the same abbreviation for both "Saint" and "Street". TTS systems with intelligent front ends can make educated guesses about
1446:
have a very regular writing system, and the prediction of the pronunciation of words based on their spellings is quite successful. Speech synthesis systems for such languages often use the rule-based method extensively, resorting to dictionaries only for those few words, like foreign names and
898:
implement, and has been in commercial use for a long time, in devices like talking clocks and calculators. The level of naturalness of these systems can be very high because the variety of sentence types is limited, and they closely match the prosody and intonation of the original recordings.
2492:
fine detail, such as shifting of formats, adjustment of vibrato, and adjustments to vowels and consonants. Sample libraries for various languages and various accents are available. With today's advancements in vocal synthesis, artists sometimes use sample libraries in lieu of backing singers.
2377:
Speech synthesis techniques are also used in entertainment productions such as games and animations. In 2007, Animo
Limited announced the development of a software application package based on its speech synthesis software FineSpeech, explicitly geared towards customers in the entertainment
1078:
More recent synthesizers, developed by Jorge C. Lucero and colleagues, incorporate models of vocal fold biomechanics, glottal aerodynamics and acoustic wave propagation in the bronchi, trachea, nasal and oral cavities, and thus constitute full systems of physics-based speech simulation.
583:
586:
582:
727:
Naturalness describes how closely the output sounds like human speech, while intelligibility is the ease with which the output is understood. The ideal speech synthesizer is both natural and intelligible. Speech synthesis systems usually try to maximize both characteristics.
1576:
1573:
2443:
Text-to-speech is also used in second language acquisition. Voki, for instance, is an educational tool created by
Oddcast that allows users to create their own talking avatar, using different accents. They can be emailed, embedded on websites or shared on social media.
2330:
Speech synthesis has long been a vital assistive technology tool and its application in this area is significant and widespread. It allows environmental barriers to be removed for people with a wide range of disabilities. The longest application has been in the use of
1847:
Despite the
American English phoneme limitation, an unofficial version with multilingual speech synthesis was developed. This made use of an enhanced version of the translator library which could translate a number of languages, given a set of rules for each language.
687:
Early electronic speech-synthesizers sounded robotic and were often barely intelligible. The quality of synthesized speech has steadily improved, but as of 2016 output from contemporary speech synthesis systems remains clearly distinguishable from actual human speech.
1634:
390:
in the late 1940s and completed it in 1950. There were several different versions of this hardware device; only one currently survives. The machine converts pictures of the acoustic patterns of speech in the form of a spectrogram back into sound. Using this device,
5034:
1927:
1663:
1572:
1289:
that all require expansion into a phonetic representation. There are many spellings in
English which are pronounced differently based on context. For example, "My latest project is to learn how to better project my voice" contains two pronunciations of "project".
1631:
1152:
584:
1924:
1805:
1660:
1840:. The synthesis system was divided into a translator library which converted unrestricted English text into a standard set of phonetic codes and a narrator device which implemented a formant model of speech generation.. AmigaOS also featured a high-level "
1630:
1149:
1923:
1716:
1703:
1659:
1211:
The DNN-based speech synthesizers are approaching the naturalness of the human voice. Examples of disadvantages of the method are low robustness when the data are not sufficient, lack of controllability and low performance in auto-regressive models.
88:
87:
1164:(DNN) to produce artificial speech from text (text-to-speech) or spectrum (vocoder). The deep neural networks are trained using a large amount of recorded speech and, in the case of a text-to-speech system, the associated labels and/or input text.
1574:
543:
1174:—hundreds of voices are trained concurrently rather than sequentially, decreasing the required training time and enabling the model to learn and generalize shared emotional context, even for voices with no exposure to such emotional context. The
1148:
530:
was released, and was one of the first Speech
Synthesis systems. It consisted of a stand-alone computer hardware and a specialized software that enabled it to read Italian. A second version, released in 1978, was also able to sing Italian in an
1621:
additional words or phrases needed could be stored inside the cartridge itself. The data consisted of strings of analog-filter coefficients to modify the behavior of the chip's synthetic vocal-tract model, rather than simple digitized samples.
1802:
1764:), the user can choose out of a wide range list of multiple voices. VoiceOver voices feature the taking of realistic-sounding breaths between sentences, as well as improved clarity at high read rates over PlainTalk. Mac OS X also includes
1632:
1801:
1456:
may involve analogue or digital recording) and on the facilities used to replay the speech. Evaluating speech synthesis systems has therefore often been compromised by differences between production techniques and replay facilities.
1925:
1661:
1713:
1700:
1075:. The system, first marketed in 1994, provides full articulatory-based text-to-speech conversion using a waveguide or transmission-line analog of the human oral and nasal tracts controlled by Carré's "distinctive region model".
1150:
5042:
1699:
1712:
540:
169:
provides the largest output range, but may lack clarity. For specific usage domains, the storage of entire words or sentences allows for high-quality output. Alternatively, a synthesizer can incorporate a model of the
1244:
to throat disease or other medical problems) to get them back. Commercially, it has opened the door to several opportunities. This technology can also create more personalized digital assistants and natural-sounding
1803:
539:
683:
In 1976, Computalker
Consultants released their CT-1 Speech Synthesizer. Designed by D. Lloyd Rice and Jim Cooper, it was an analog synthesizer built to work with microcomputers using the S-100 bus standard.
459:
was visiting his friend and colleague John Pierce at the Bell Labs Murray Hill facility. Clarke was so impressed by the demonstration that he used it in the climactic scene of his screenplay for his novel
1714:
1701:
85:
4578:
853:
of the language: for example, Spanish has about 800 diphones, and German about 2500. In diphone synthesis, only one example of each diphone is contained in the speech database. At runtime, the target
5511:"Hot AI startup ElevenLabs, founded by ex-Google and Palantir staff, is set to raise $ 18 million at a $ 100 million valuation. Check out the 14-slide pitch deck it used for its $ 2 million pre-seed"
3139:
1992:. On the other hand, on-line RSS-readers are available on almost any personal computer connected to the Internet. Users can download generated audio files to portable devices, e.g. with a help of
541:
885:. Leachim contained information regarding class curricular and certain biographical information about the students whom it was programmed to teach. It was tested in a fourth grade classroom in
821:, the desired target utterance is created by determining the best chain of candidate units from the database (unit selection). This process is typically achieved using a specially weighted
1935:
From 1971 to 1996, Votrax produced a number of commercial speech synthesizer components. A Votrax synthesizer was included in the first generation Kurzweil Reading Machine for the Blind.
2158:
which included a Formant synthesis capability. Sequences of up to 512 individual vowel and consonant formants could be stored and replayed, allowing short vocal phrases to be synthesized.
2211:
since the early 2000s has improved beyond the point of human's inability to tell a real human imaged with a real camera from a simulation of a human imaged with a simulation of a camera.
1260:
reporter Joseph Cox published findings that he had recorded five minutes of himself talking and then used a tool developed by ElevenLabs to create voice deepfakes that defeated a bank's
208:. The front-end has two major tasks. First, it converts raw text containing symbols like numbers and abbreviations into the equivalent of written-out words. This process is often called
177:
The quality of a speech synthesizer is judged by its similarity to the human voice and by its ability to be understood clearly. An intelligible text-to-speech program allows people with
2428:
Text-to-speech for disability and impaired communication aids have become widely available. Text-to-speech is also finding new applications; for example, speech synthesis combined with
1832:
text-to-speech system. It featured a complete system of voice emulation for American English, with both male and female voices and "stress" indicator markers, made possible through the
1501:
residual). Such pitch synchronous pitch modification techniques need a priori pitch marking of the synthesis speech database using techniques such as epoch extraction using dynamic
5650:
2112:) were capable of text-to-phoneme synthesis or reciting complete words and phrases (text-to-dictionary), using a very popular Speech Synthesizer peripheral. TI used a proprietary
2063:
Following the commercial failure of the hardware-based Intellivoice, gaming developers sparingly used software synthesis in later games. Earlier systems from Atari, such as the
84:
1182:: each time that speech is generated from the same string of text, the intonation of the speech will be slightly different. The application also supports manually altering the
765:
Unit selection synthesis uses large databases of recorded speech. During database creation, each recorded utterance is segmented into some or all of the following: individual
4961:
2351:. Work to personalize a synthetic voice to better match a person's personality or historical voice is becoming available. A noted application, of speech synthesis, was the
2007:. It can deliver TTS functionality to anyone (for reasons of accessibility, convenience, entertainment or information) with access to a web browser. The non-profit project
1426:). The simplest approach to text-to-phoneme conversion is the dictionary-based approach, where a large dictionary containing all the words of a language and their correct
585:
5452:
1776:
Standard Additions includes a say verb that allows a script to use any of the installed voices and to control the pitch, speaking rate and modulation of the spoken text.
4465:
4540:
Prathosh, A. P.; Ramakrishnan, A. G.; Ananthapadmanabha, T. V. (December 2013). "Epoch extraction based on integrated linear prediction residual using plosion index".
86:
5368:
3118:
1575:
984:. Formant synthesizers are usually smaller programs than concatenative systems because they do not have a database of speech samples. They can therefore be used in
4595:
2485:
1633:
3528:
6458:
4575:
3309:. (2003). CMU ARCTIC databases for speech synthesis. CMU-LTI-03-177. Language Technologies Institute, School of Computer Science, Carnegie Mellon University.
1740:
into its systems which provided a fluid command set. More recently, Apple has added sample-based voices. Starting as a curiosity, the speech system of Apple
1071:
in the late 1980s and merged with Apple Computer in 1997), the Trillium software was published under the GNU General Public License, with work continuing as
4336:
1926:
1662:
6618:
5343:
4253:
Chadha, Anupama; Kumar, Vaibhav; Kashyap, Sonu; Gupta, Mayank (2021), Singh, Pradeep Kumar; Wierzchoń, Sławomir T.; Tanwar, Sudeep; Ganzha, Maria (eds.),
3706:
Englert, Marina; Madazio, Glaucya; Gielow, Ingrid; Lucero, Jorge; Behlau, Mara (2016). "Perceptual error identification of human and synthesized voices".
2168:
1297:
representations of their input texts, as processes for doing so are unreliable, poorly understood, and computationally ineffective. As a result, various
1151:
2249:
that generates high-quality voices from an assortment of fictional characters from a variety of media sources was released. Initial characters included
7232:
5218:
4239:
3585:
1867:
1493:
of the sentence, depending upon whether it is an affirmative, interrogative or exclamatory sentence. One of the techniques for pitch modification uses
996:
power are especially limited. Because formant-based systems have complete control of all aspects of the output speech, a wide variety of prosodies and
4428:
345:, Hungary, described in a 1791 paper. This machine added models of the tongue and lips, enabling it to produce consonants as well as vowels. In 1837,
4901:
4867:
3090:
1988:. On one hand, online RSS-narrators simplify information delivery by allowing users to listen to their favourite news sources and to convert them to
1871:
3201:
2795:
1804:
7212:
4730:
3581:
2008:
1765:
2378:
industries, able to generate narration and lines of dialogue according to user specifications. The application reached maturity in 2008, when NEC
1190:(a term coined by this project), a sentence or phrase that conveys the emotion of the take that serves as a guide for the model during inference.
5701:
2866:
1026:. Creating proper intonation for these projects was painstaking, and the results have yet to be matched by real-time text-to-speech interfaces.
5534:
1328:
ambiguous abbreviations, while others provide the same result in all cases, resulting in nonsensical (and sometimes comical) outputs, such as "
2467:
and includes models of vocal frequency jitter and tremor, airflow noise and laryngeal asymmetries. The synthesizer has been used to mimic the
1715:
1702:
1059:
Until recently, articulatory synthesis models have not been incorporated into commercial speech synthesis systems. A notable exception is the
1044:
and the articulation processes occurring there. The first articulatory synthesizer regularly used for laboratory experiments was developed at
6596:
3995:
2322:, for example, includes tags related to speech recognition, dialogue management and touchtone dialing, in addition to text-to-speech markup.
742:. Each technology has strengths and weaknesses, and the intended uses of a synthesis system will typically determine which approach is used.
3617:
259:
information together make up the symbolic linguistic representation that is output by the front-end. The back-end—often referred to as the
5292:
4840:
3860:
542:
3573:
334:
4865:
Jia, Ye; Zhang, Yu; Weiss, Ron J. (2018-06-12), "Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis",
757:
segmenting the waveforms sometimes result in audible glitches in the output. There are three main sub-types of concatenative synthesis.
6159:
5911:
4801:, PDA's and MP3-Players, Proceedings of the 18th International Conference on Database and Expert Systems Applications, Pages: 575–579
2813:
2401:
5262:
1003:
Examples of non-real-time but highly accurate intonation control in formant synthesis include the work done in the late 1970s for the
7007:
6451:
3830:
Valle, Rafael (2020). "Mellotron: Multispeaker expressive voice synthesis by conditioning on rhythm, pitch and global style tokens".
3767:
706:
caused speech synthesizers to become cheaper and more accessible, more people would benefit from the use of text-to-speech programs.
559:
at MIT, and the Bell Labs system; the latter was one of the first multilingual language-independent systems, making extensive use of
474:
puts it to sleep. Despite the success of purely electronic speech synthesis, research into mechanical speech-synthesizers continues.
5510:
7176:
4704:
4496:
2945:
2388:
263:—then converts the symbolic linguistic representation into sound. In certain systems, this part includes the computation of the
4513:
Muralishankar, R.; Ramakrishnan, A. G.; Prathibha, P. (February 2004). "Modification of pitch using DCT in the source domain".
3345:
2839:
527:
5186:
5141:
4687:
4274:
4161:
4094:
3639:
3506:
3241:
3054:
2712:
2641:
2577:
6917:
6608:
6444:
5322:
3890:
3471:
3115:
2543:
2183:
to achieve text-to-speech synthesis, that can be made to sound almost like anybody from a speech sample of only 5 seconds.
1067:, where much of the original research was conducted. Following the demise of the various incarnations of NeXT (started by
7171:
5971:
5827:
5694:
4777:
2269:
17:
2884:"A History of Realtime Digital Speech on Packet Networks: Part II of Linear Predictive Coding and the Internet Protocol"
7227:
6778:
6229:
4602:
3166:
1566:
1546:
1509:
regions of speech. In general, prosody remains a challenge for speech synthesizers, and is an active research topic.
1459:
Since 2005, however, some researchers have started to evaluate speech synthesis systems using a common speech dataset.
1023:
818:
509:
139:
6932:
6763:
4806:
4116:
3274:
2508:
1393:
1367:
4303:
3370:
2916:
1517:
1375:
6703:
6306:
6002:
5967:
3523:
3387:
Muralishankar, R; Ramakrishnan, A.G.; Prathibha, P (2004). "Modification of Pitch using DCT in the Source Domain".
2300:
2040:
5477:
1676:
42:
7120:
6773:
6224:
6068:
2452:
allow users to create video content involving AI avatars, who are made to speak using text-to-speech technology.
2175:
presented the work 'Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis', which
1040:
Articulatory synthesis consists of computational techniques for synthesizing speech based on models of the human
929:, many final consonants become no longer silent if followed by a word that begins with a vowel, an effect called
797:
set to a "forced alignment" mode with some manual correction afterward, using visual representations such as the
720:
326:
310:
220:
3446:
2683:
952:
synthesis does not use human speech samples at runtime. Instead, the synthesized speech output is created using
364:, which automatically analyzed speech into its fundamental tones and resonances. From his work on the vocoder,
6768:
6513:
5904:
5722:
5687:
4798:
2348:
1371:
1142:
1052:, Tom Baer, and Paul Mermelstein. This synthesizer, known as ASY, was based on vocal tract models developed at
809:
of the units in the speech database is then created based on the segmentation and acoustic parameters like the
660:" programming technique to produce a synthesized speech waveform. Another early example, the arcade version of
493:
5065:"Effect of Text-to-Speech and Human Reader on Listening Comprehension for Students with Learning Disabilities"
7217:
7037:
6758:
1531:
903:
462:
373:
5237:
2382:
announced a web service that allows users to create phrases from the voices of characters from the Japanese
937:
cannot be reproduced by a simple word-concatenation system, which would require additional complexity to be
6730:
5799:
5766:
4896:
4568:
2233:. It was driven only by a voice track as source data for the animation after the training phase to acquire
1468:
1200:, AI-assisted text-to-speech software, Speech Synthesis, which can produce lifelike speech by synthesizing
957:
471:
407:
2237:
and wider facial information from training material consisting of 2D videos with audio had been completed.
1000:
can be output, conveying not just questions and statements, but a variety of emotions and tones of voice.
922:
914:
7075:
7060:
7032:
6897:
6892:
6467:
5962:
5868:
5812:
5727:
4261:, Lecture Notes in Networks and Systems, vol. 203, Singapore: Springer Singapore, pp. 557–566,
3741:
3293:
2518:
2433:
2229:
2017 an audio driven digital look-alike of upper torso of Barack Obama was presented by researchers from
605:
portable calculator for the blind in 1976. Other devices had primarily educational purposes, such as the
560:
424:
314:
4734:
3371:
The MBROLA Project: Towards a set of high quality speech synthesizers of use for non commercial purposes
2455:
Speech synthesis is a valuable computational aid for the analysis and assessment of speech disorders. A
2215:
1406:
Speech synthesis systems use two basic approaches to determine the pronunciation of a word based on its
7222:
6812:
6783:
6561:
6219:
5817:
5807:
5732:
4432:
1953:
1179:
349:
produced a "speaking machine" based on von Kempelen's design, and in 1846, Joseph Faber exhibited the "
282:, some people tried to build machines to emulate human speech. Some early legends of the existence of "
5394:
3937:
3414:
3086:
2863:
1063:-based system originally developed and marketed by Trillium Sound Research, a spin-off company of the
7207:
6655:
6508:
6234:
5897:
5064:
3319:
2133:
1617:
1494:
938:
934:
874:
858:
829:
696:
38:
4020:
2593:
Rubin, P.; Baer, T.; Mermelstein, P. (1981). "An articulatory synthesizer for perceptual research".
2460:
7181:
7105:
6837:
6793:
6678:
6576:
6204:
6048:
5656:
5014:
3577:
3018:; Nebbia, Luciano (1 November 1995). "Interactive voice technology at work: The CSELT experience".
2533:
2503:
2335:
for people with visual impairment, but text-to-speech systems are now commonly used by people with
2230:
2194:
system with similar aims at the 2018 NeurIPS conference, though the result is rather unconvincing.
2050:
1900:
is a server-based package for voice synthesis and recognition. It is designed for network use with
1643:
1482:
1431:
determine their pronunciations based on their spellings. This is similar to the "sounding out", or
1356:
1219:
required and sometimes the output of speech synthesizer may result in the mistakes of tone sandhi.
1205:
997:
862:
837:
various automated methods to detect unnatural segments in unit-selection speech synthesis systems.
477:
3266:
1646:
was the first commercial all-software voice synthesis program. It was later used as the basis for
613:
in 1978. Fidelity released a speaking version of its electronic chess computer in 1979. The first
7085:
7055:
6722:
6395:
6280:
6149:
6134:
5789:
5344:"Now hear this: Voice cloning AI startup ElevenLabs nabs $ 19M from a16z and other heavy hitters"
4446:
4337:"Val Kilmer Gets His Voice Back After Throat Cancer Battle Using AI Technology: Hear the Results"
3565:
3553:
2528:
1897:
1825:
1360:
1278:
1240:
1008:
751:
154:
6556:
5210:
4986:
3182:
2343:
as well as by pre-literate children. They are also frequently employed to aid those with severe
77:
6942:
6635:
6613:
6603:
6571:
6546:
6390:
6344:
5762:
5284:
5211:"Evolution of Reading Machines for the Blind: Haskins Laboratories" Research as a Case History"
5126:
2022 Wave Electronics and its Application in Information and Telecommunication Systems (WECONF)
4259:
Proceedings of Second International Conference on Computing, Communications, and Cyber-Security
4193:"Anticipating and addressing the ethical implications of deepfakes in the context of elections"
3852:
3593:
3487:
2698:("Mechanism of the human speech with description of its speaking machine", J. B. Degen, Wien).
2513:
1769:
1536:
1313:
1035:
790:
703:
667:
639:
606:
513:
241:
225:
143:
6802:
6359:
6098:
5981:
5742:
5419:
4962:"An artificial-intelligence first: Voice-mimicking software reportedly used in a major theft"
3911:
3613:
3605:
3569:
2657:
Van Santen, J. (April 1994). "Assignment of segmental duration in text-to-speech synthesis".
2421:
2208:
1789:
1761:
1472:
1104:
1100:
1064:
961:
854:
810:
662:
597:
electronics featuring speech synthesis began emerging in the 1970s. One of the first was the
419:
The first computer-based speech-synthesis systems originated in the late 1950s. Noriko Umeda
338:
256:
229:
4626:
3258:
2817:
2315:. Although each of these was proposed as a standard, none of them have been widely adopted.
849:(sound-to-sound transitions) occurring in a language. The number of diphones depends on the
828:
Unit selection provides the greatest naturalness, because it applies only a small amount of
7155:
6831:
6807:
6660:
6354:
6063:
6021:
5878:
5752:
4192:
3782:
2760:
2602:
2356:
2201:
researchers know of 3 cases where digital sound-alikes technology has been used for crime.
2027:
2000:
1965:
1680:
1587:
1443:
1045:
501:
428:
383:
3806:
2043:
which uses diphone-based synthesis, as well as more modern and better-sounding techniques.
1215:
For tonal languages, such as Chinese or Taiwanese language, there are different levels of
508:
during the 1970s. LPC was later the basis for early speech synthesizer chips, such as the
8:
7135:
7065:
7022:
6978:
6750:
6740:
6735:
6623:
5860:
5850:
5774:
4254:
3601:
3259:
2449:
2180:
2141:
1969:
1593:
1261:
1161:
1088:
989:
917:
is usually only pronounced when the following word has a vowel as its first letter (e.g.
598:
520:
205:
5008:
4844:
3786:
2942:
Proceedings of the 5th International Conference on Spoken Language Processing (ICSLP'98)
2764:
2606:
1733:
7145:
7017:
6882:
6645:
6628:
6486:
6374:
6364:
6275:
5837:
5779:
5747:
5453:"Arrested Succession Parody On YouTube Features 'Narration' By AI-Generated Ron Howard"
5369:"Sztuczna inteligencja czyta głosem Jarosława Kuźniara. Rewolucja w radiu i podcastach"
5192:
5147:
5102:
4910:
4876:
4557:
4488:
4280:
4220:
4167:
4139:
4110:
4071:
4053:
3831:
3678:
3333:
Automatic Detection of Unnatural Word-Level Segments in Unit-Selection Speech Synthesis
3015:
2934:
2696:
Mechanismus der menschlichen Sprache nebst der Beschreibung seiner sprechenden Maschine
2630:
2628:
van Santen, Jan P. H.; Sproat, Richard W.; Olive, Joseph P.; Hirschberg, Julia (1997).
2429:
2340:
2304:
2219:
2204:
This increases the stress on the disinformation situation coupled with the facts that
2151:
2091:
1891:
1875:
1737:
1305:, like examining neighboring words and using statistics about frequency of occurrence.
1249:
1124:
1112:
1092:
953:
882:
794:
379:
346:
210:
182:
174:
and other human voice characteristics to create a completely "synthetic" voice output.
147:
4304:"AI gave Val Kilmer his voice back. But critics worry the technology could be misused"
3962:
2965:
2307:
in 2004. Older speech synthesis markup languages include Java Speech Markup Language (
2018:
through the W3C Audio Incubator Group with the involvement of The BBC and Google Inc.
566:
7150:
6862:
6670:
6581:
6339:
5873:
5822:
5737:
5485:
5427:
5315:"『Portal』のGLaDOSや『UNDERTALE』のサンズがテキストを読み上げてくれる。文章に込められた感情まで再現することを目指すサービス「15.ai」が話題に"
5196:
5182:
5151:
5137:
5133:
5106:
5094:
4802:
4799:
The Pediaphon – Speech Interface to the free Knowledge Encyclopedia for Mobile Phones
4708:
4683:
4369:
4311:
4284:
4270:
4224:
4212:
4171:
4157:
4098:
4075:
4045:
3970:
3912:"Generative AI comes for cinema dubbing: Audio AI startup ElevenLabs raises pre-seed"
3883:"『Portal』のGLaDOSや『UNDERTALE』のサンズがテキストを読み上げてくれる。文章に込められた感情まで再現することを目指すサービス「15.ai」が話題に"
3798:
3723:
3659:
3635:
3422:
3270:
3237:
3162:
3050:
3031:
2908:
2776:
2637:
2573:
2566:
2538:
2437:
2344:
2176:
1863:
1506:
1498:
1432:
1183:
1053:
1004:
766:
610:
489:
350:
279:
178:
162:
127:
4561:
4492:
4153:
4067:
3682:
3349:
2843:
2318:
Speech synthesis markup languages are distinguished from dialogue markup languages.
6912:
6887:
6688:
6591:
6214:
6164:
5784:
5616:
5174:
5129:
5084:
5076:
4549:
4522:
4480:
4361:
4262:
4204:
4149:
4063:
3790:
3715:
3670:
3396:
3225:
3027:
2898:
2768:
2666:
2610:
2523:
2393:
2260:
2011:
was created in 2006 to provide a similar web-based TTS interface to the Knowledge.
1901:
1757:
1725:
1672:
1329:
930:
806:
644:
485:
456:
439:
computer to synthesize speech, an event among the most prominent in the history of
387:
186:
5535:"AI-Generated Voice Firm Clamps Down After 4chan Makes Celebrity Voices for Abuse"
5178:
5171:
2021 IEEE EMBS International Conference on Biomedical and Health Informatics (BHI)
5165:
Zhao, Yunxin; Song, Minguang; Yue, Yanghao; Kuruvilla-Dugdale, Mili (2021-07-27).
5080:
3674:
3332:
2727:
1841:
1820:
The second operating system to feature advanced speech synthesis capabilities was
873:. or more recent techniques such as pitch modification in the source domain using
573:
7139:
7100:
7095:
6963:
6693:
6566:
6541:
6523:
6058:
5845:
5645:
4582:
4526:
4484:
4266:
4046:"Probing the phonetic and phonological knowledge of tones in Mandarin TTS models"
3719:
3532:
3400:
3122:
3104:
2993:
2870:
2415:
2409:
2370:
2352:
2292:
2255:
1857:
1294:
985:
926:
878:
671:
618:
556:
412:
291:
5314:
4393:
3882:
1760:) there was only one standard voice shipping with Mac OS X. Starting with 10.6 (
267:(pitch contour, phoneme durations), which is then imposed on the output speech.
6847:
6827:
6551:
6405:
5166:
5121:
4929:
4679:
4131:
2198:
2187:
2095:
2079:
1981:
1753:
1729:
1245:
1236:
1229:
993:
497:
432:
392:
6436:
5673:
5665:
4553:
2197:
By 2019 the digital sound-alikes found their way to the hands of criminals as
161:. Systems differ in the size of the stored speech units; a system that stores
7201:
7110:
6922:
6902:
6683:
6425:
6415:
6334:
6078:
5928:
5641:
5489:
5431:
5098:
4373:
4315:
4216:
4208:
4136:
2020 IEEE International Conference on Multimedia & Expo Workshops (ICMEW)
4102:
3974:
3545:
3426:
3306:
3289:
3233:
2912:
2456:
2332:
2191:
2083:
1973:
1609:
1540:
1490:
1427:
1277:
The process of normalizing text is rarely straightforward. Texts are full of
1201:
1197:
1175:
981:
822:
814:
676:
481:
5621:
4895:
Arık, Sercan Ö.; Chen, Jitong; Peng, Kainan; Ping, Wei; Zhou, Yanqi (2018),
4756:"How to configure and use Text-to-Speech in Windows XP and in Windows Vista"
4021:"ElevenLabs' Powerful New AI Tool Lets You Make a Full Audiobook in Minutes"
3996:"Voice-generating platform ElevenLabs raises $ 19M, launches detection tool"
3794:
2935:"The Distance Measure for Line Spectrum Pairs Applied to Speech Recognition"
2366:
7090:
6708:
6420:
6369:
6209:
6038:
5120:
Triandafilidi, Ioanis I.; Tatarnikova, T. M.; Poponin, A. S. (2022-05-30).
5007:
Suwajanakorn, Supasorn; Seitz, Steven; Kemelmacher-Shlizerman, Ira (2017),
4576:
TI will exit dedicated speech-synthesis chips, transfer products to Sensory
3727:
3609:
3320:
Language Generation and Speech Synthesis in Dialogues for Language Learning
2670:
2274:
2127:
2108:
Some models of Texas Instruments home computers produced in 1979 and 1981 (
1905:
1887:
1785:
1613:
1286:
1129:
Sinewave synthesis is a technique for synthesizing speech by replacing the
1049:
1015:
850:
793:. Typically, the division into segments is done using a specially modified
365:
287:
201:
5584:
4819:
3802:
2883:
2780:
2373:
was one of the most famous people to use a speech computer to communicate.
2116:
to embed complete spoken phrases into applications, primarily video games.
1996:
receiver, and listen to them while walking, jogging or commuting to work.
1809:
Example of speech synthesis with the included Say utility in Workbench 1.3
423:
developed the first general English text-to-speech system in 1968, at the
7047:
6927:
6640:
6533:
6481:
6349:
6265:
6073:
5952:
5710:
5610:
4755:
3669:. Lyon, France: International Speech Communication Association: 587–591.
3597:
2155:
2004:
1977:
1773:
1419:
1317:
1216:
1096:
1056:
in the 1960s and 1970s by Paul Mermelstein, Cecil Coker, and colleagues.
1041:
1019:
802:
621:
452:
318:
295:
283:
276:
171:
138:) system converts normal language text into speech; other systems render
6184:
5679:
5089:
577:
Fidelity Voice Chess Challenger (1979), the first talking chess computer
6650:
6290:
6255:
6154:
6144:
6083:
5006:
3549:
3502:
3126:
2903:
2486:
Music technology (electronic and digital) § Vocal synthesis after 2010s
2279:
2264:
2068:
2064:
1883:
1879:
1257:
1193:
1068:
731:
The two primary technologies generating synthetic speech waveforms are
692:
614:
532:
448:
103:
2475:
speakers with controlled levels of roughness, breathiness and strain.
547:
DECtalk demo recording using the Perfect Paul and Uppity Ursula voices
244:. The process of assigning phonetic transcriptions to words is called
6518:
6285:
6129:
6108:
6103:
5957:
4671:
4539:
2772:
2614:
2472:
2464:
2123:
2102:
2046:
1829:
1749:
1745:
1741:
1647:
1302:
1298:
1108:
1072:
965:
886:
626:
505:
440:
369:
357:
342:
3511:
Proceedings ESCA-NATO Workshop and Applications of Speech Technology
2796:"Louis Gerstman, 61, a Specialist In Speech Disorders and Processes"
2751:
Klatt, D (1987). "Review of text-to-speech conversion for English".
1557:
Popular systems offering speech synthesis as a built-in capability.
1345:
1091:, also called Statistical Parametric Synthesis. In this system, the
845:
Diphone synthesis uses a minimal speech database containing all the
6993:
6973:
6958:
6937:
6907:
6852:
6817:
6698:
6316:
6260:
6179:
6174:
6093:
6053:
5947:
5889:
5285:"ゲームキャラ音声読み上げソフト「15.ai」公開中。『Undertale』や『Portal』のキャラに好きなセリフを言ってもらえる"
4915:
4881:
4144:
4058:
3853:"ゲームキャラ音声読み上げソフト「15.ai」公開中。『Undertale』や『Portal』のキャラに好きなセリフを言ってもらえる"
3836:
2336:
2319:
2242:
2234:
2226:
2109:
1687:
1526:
1423:
1411:
1407:
1130:
973:
902:
unless the many variations are taken into account. For example, in
833:
798:
778:
774:
653:
594:
496:(NTT) in 1966. Further developments in LPC technology were made by
467:
396:
250:
158:
123:
5615:(Master of Music Music thesis). Florida International University.
4512:
3386:
3369:
T. Dutoit, V. Pagel, N. Pierret, F. Bataille, O. van der Vrecken.
3202:"Ann Syrdal, Who Helped Give Computers a Female Voice, Dies at 74"
2686:, Helsinki University of Technology, Retrieved on November 4, 2006
1505:
index applied on the integrated linear prediction residual of the
817:), duration, position in the syllable, and neighboring phones. At
200:
A text-to-speech system (or "engine") is composed of two parts: a
7130:
6988:
6968:
6842:
6586:
6501:
6043:
5976:
4987:"Face2Face: Real-time Face Capture and Reenactment of RGB Videos"
4934:
2379:
2355:
which incorporated text-to-phonetics software based on work from
2105:
incorporated the Texas Instruments TMS5220 speech synthesis chip.
2087:
2072:
1993:
1989:
1837:
1821:
1502:
1415:
949:
857:
of a sentence is superimposed on these minimal units by means of
846:
770:
737:
635:
552:
444:
436:
361:
330:
306:
302:
166:
4841:"Smithsonian Speech Synthesis History Project (SSSHP) 1986–2002"
4647:
3589:
2295:
have been established for the rendition of text as speech in an
1239:(also known as voice cloning or deepfake audio) is a product of
6496:
6491:
6400:
6270:
5997:
5943:
5660:
5119:
4858:
4596:"1400XL/1450XL Speech Handler External Reference Specification"
2627:
2468:
2397:
2360:
2250:
2172:
2145:
2137:
2034:
1917:
1606:
1282:
870:
786:
656:, for which the game's developer, Hiroshi Suzuki, developed a "
237:
233:
115:
4407:
4240:"Deepfake Audio Boom Exploits One Billion-Dollar Startup's AI"
3745:
3013:
2840:"Where "HAL" First Spoke (Bell Labs Speech Synthesis website)"
2214:
2D video forgery techniques were presented in 2016 that allow
2075:
and Open Sesame), also had games utilizing software synthesis.
714:
The most important qualities of a speech synthesis system are
395:
and colleagues discovered acoustic cues for the perception of
313:
won the first prize in a competition announced by the Russian
7186:
6822:
6410:
6311:
6239:
6139:
6113:
6088:
6007:
3766:
Remez, R.; Rubin, P.; Pisoni, D.; Carrell, T. (22 May 1981).
3047:
Multilingual Text-to-Speech Synthesis: The Bell Labs Approach
2564:
Allen, Jonathan; Hunnicutt, M. Sharon; Klatt, Dennis (1987).
2383:
2312:
2246:
2113:
1833:
1167:
969:
866:
322:
192:
185:
to listen to written words on a home computer. Many computer
5164:
4989:. Proc. Computer Vision and Pattern Recognition (CVPR), IEEE
3629:
1813:
1301:
techniques are used to guess the proper way to disambiguate
6169:
4431:. University of Portsmouth. January 9, 2008. Archived from
2969:
2308:
1720:
MacinTalk 2 demo featuring the Mr. Hughes and Marvin voices
1683:
to enable World English Spelling text-to-speech synthesis.
1060:
1012:
782:
691:
Synthesized voices typically sounded male until 1990, when
5128:. St. Petersburg, Russian Federation: IEEE. pp. 1–5.
3705:
2148:
and others use speech synthesis for automobile navigation.
1824:, introduced in 1985. The voice synthesis was licensed by
1156:
Speech synthesis example using the HiFi-GAN neural vocoder
6983:
5669:
4939:
3963:"This Podcast Is Not Hosted by AI Voice Clones. We Swear"
2296:
2119:
2015:
1985:
189:
have included speech synthesizers since the early 1990s.
94:
A synthetic voice announcing an arriving train in Sweden.
5263:"Code Geass Speech Synthesizer Service Offered in Japan"
4731:"Accessibility Tutorials for Windows XP: Using Narrator"
3294:
Perfect synthesis for all of the people all of the time.
2711:
Mattingly, Ignatius G. (1974). Sebeok, Thomas A. (ed.).
2459:
synthesizer, developed by Jorge C. Lucero et al. at the
1756:(10.4). During 10.4 (Tiger) and first releases of 10.5 (
1690:
computers were sold with "stspeech.tos" on floppy disk.
4252:
4132:"Deepfake Detection: Current Challenges and Next Steps"
3765:
2713:"Speech synthesis for phonetic and phonological models"
1486:
1308:
Recently TTS systems have begun to use HMMs (discussed
1107:) of speech are modeled simultaneously by HMMs. Speech
368:
developed a keyboard-operated voice-synthesizer called
5420:"Generative AI Podcasts Are Here. Prepare to Be Bored"
5122:"Speech Synthesis System for People with Disabilities"
4447:"Smile – And The World Can Hear You, Even If You Hide"
4091:
Weaponised deep fakes: National security and democracy
3657:
1828:
from SoftVoice, Inc., who also developed the original
1772:
application that converts text to audible speech. The
976:
of artificial speech. This method is sometimes called
118:. A computer system used for this purpose is called a
5167:"Personalizing TTS Voices for Progressive Dysarthria"
4466:"The vocal communication of different kinds of smile"
4191:
Diakopoulos, Nicholas; Johnson, Deborah (June 2020).
3492:. World Future Society. 1978. pp. 359, 360, 361.
3346:"Pitch-Synchronous Overlap and Add (PSOLA) Synthesis"
3265:. Cambridge, UK: Cambridge University Press. p.
3146:. Vol. 11, no. 2. pp. 134-175 (160-3).
1671:
Arguably, the first speech system integrated into an
1410:, a process which is often called text-to-phoneme or
5666:
Simulated singing with the singing robot Pavarobotti
4362:"AI-Generated Voice Deepfakes Aren't Scary Good—Yet"
3447:"1960 - Rudy the Robot - Michael Freeman (American)"
2592:
2286:
1972:
and gadgets that can read messages directly from an
5559:
5063:Brunow, David A.; Cullen, Theresa A. (2021-07-03).
4089:Smith, Hannah; Mansted, Katherine (April 1, 2020).
3768:"Speech perception without traditional speech cues"
2563:
2169:
Conference on Neural Information Processing Systems
1999:A growing field in Internet based TTS is web-based
1878:. SAPI 4.0 was available as an optional add-on for
1087:HMM-based synthesis is a synthesis method based on
353:". In 1923, Paget resurrected Wheatstone's design.
5306:
5219:Journal of Rehabilitation Research and Development
4705:"Translator Library (Multilingual-speech version)"
4296:
4246:
4190:
3658:Lucero, J. C.; Schoentgen, J.; Behlau, M. (2013).
3505:, J.L. Gauvain, B. Prouts, C. Bouhier, R. Boesch.
3331:William Yang Wang and Kallirroi Georgila. (2011).
3161:. Vol. 1. SMG Szczepaniak. pp. 544–615.
2864:Anthropomorphic Talking Robot Waseda-Talker Series
2629:
2565:
1580:TI-99/4A speech demo using the built-in vocabulary
1462:
1293:Most text-to-speech (TTS) systems do not generate
590:Speech output from Fidelity Voice Chess Challenger
228:to each word, and divides and marks the text into
5395:"AI Can Clone Your Favorite Podcast Host's Voice"
4902:Advances in Neural Information Processing Systems
4868:Advances in Neural Information Processing Systems
4775:
4184:
3938:"AI Can Clone Your Favorite Podcast Host's Voice"
2988:
2986:
2440:using 15.ai and external voice control software.
1679:computers. These used the Votrax SC01 chip and a
551:Dominant systems in the 1980s and 1990s were the
7199:
6669:
5560:"Usage of text-to-speech in AI video generation"
5010:Synthesizing Obama: Learning Lip Sync from Audio
4894:
3874:
3140:"The Replay Years: Reflections from Eddie Adlum"
1616:Voice Synthesis module in 1982. It included the
1552:
1272:
1136:
1133:(main bands of energy) with pure tone whistles.
1111:are generated from HMMs themselves based on the
411:Computer and speech synthesizer housing used by
372:(Voice Demonstrator), which he exhibited at the
6466:
5238:"Speech Synthesis Software for Anime Announced"
2432:allows for interaction with mobile devices via
2014:Other work is being done in the context of the
157:pieces of recorded speech that are stored in a
4778:"An introduction to Text-To-Speech in Android"
3660:"Physics-based synthesis of disordered voices"
3507:Generation and Synthesis of Broadcast Messages
3159:The Untold History of Japanese Game Developers
2983:
2726:. Mouton, The Hague: 2451–2487. Archived from
1944:are available through third-party publishers.
1521:A speech synthesis kit produced by Bell System
6452:
5905:
5695:
4864:
4748:
2932:
1931:Votrax Type 'N Talk speech synthesizer (1980)
1335:
5062:
4930:"Fake voices 'help cyber-crooks steal cash'"
4542:IEEE Trans. Audio Speech Language Processing
4088:
3618:Escape from the Planet of the Robot Monsters
3193:
3187:Smithsonian Speech Synthesis History Project
2933:Zheng, F.; Song, Z.; Li, L.; Yu, W. (1998).
2753:Journal of the Acoustical Society of America
2595:Journal of the Acoustical Society of America
2436:interfaces. Some users have also created AI
1744:has evolved into a fully supported program,
1320:is frequently difficult in these languages.
892:
565:
5026:
3156:
3095:: "Talking electronic game", April 27, 1982
2684:History and Development of Speech Synthesis
2049:which uses articulatory synthesis from the
2003:, e.g. 'Browsealoud' from a UK company and
1874:components to support speech synthesis and
1752:was for the first time featured in 2005 in
1724:The first speech system integrated into an
1585:with speech during this promotion included
1374:. Unsourced material may be challenged and
1332:" being rendered as "Ulysses South Grant".
760:
709:
329:notation: , , , and ). There followed the
23:
6459:
6445:
5912:
5898:
5702:
5688:
5478:"Can A.I. Be Funny? This Troupe Thinks So"
5276:
4670:
4627:"It Sure Is Great To Get Out Of That Bag!"
3823:
2996:. IEEE Global History Network. 20 May 2009
2656:
2154:produced a music synthesizer in 1999, the
2037:which supports a broad range of languages.
1956:added support for speech synthesis (TTS).
745:
470:computer sings the same song as astronaut
24:
5709:
5620:
5312:
5088:
4914:
4897:"Neural Voice Cloning with a Few Samples"
4888:
4880:
4143:
4057:
3880:
3844:
3835:
3554:Star Trek: Strategic Operations Simulator
3322:, masters thesis, Section 5.6 on page 54.
2902:
2710:
1938:
1394:Learn how and when to remove this message
1029:
617:to feature speech synthesis was the 1980
5392:
5282:
4953:
4723:
3935:
3850:
3224:
3199:
2365:
1984:. Some specialized software can narrate
1921:
1799:
1710:
1697:
1657:
1628:
1570:
1516:
1450:
1309:
1146:
972:levels are varied over time to create a
580:
572:
537:
519:In 1975, Fumitada Itakura developed the
406:
255:conversion. Phonetic transcriptions and
191:
32:This is an accepted version of this page
7213:Applications of artificial intelligence
5000:
4978:
4922:
4733:. Microsoft. 2011-01-29. Archived from
4463:
4334:
4203:(7) (published 2020-06-05): 2072–2098.
3993:
3960:
3742:"The HMM-based Speech Synthesis System"
2960:
2958:
2793:
2677:
2389:Code Geass: Lelouch of the Rebellion R2
2162:
702:Kurzweil predicted in 2005 that as the
14:
7200:
5475:
5173:. Athens, Greece: IEEE. pp. 1–4.
4702:
4237:
3256:
3044:
2568:From Text to Speech: The MITalk system
2353:Kurzweil Reading Machine for the Blind
2299:-compliant format. The most recent is
2110:Texas Instruments TI-99/4 and TI-99/4A
1082:
443:. Kelly's voice recorder synthesizer (
114:is the artificial production of human
7233:History of human–computer interaction
6440:
5893:
5683:
5674:how the robot synthesized the singing
5608:
5336:
4984:
4095:Australian Strategic Policy Institute
3829:
3701:
3699:
3653:
3651:
3630:John Holmes and Wendy Holmes (2001).
3137:
2750:
2392:. 15.ai has been frequently used for
2359:and a black-box synthesizer built by
1512:
1481:by Amy Drahota and colleagues at the
1118:
555:system, based largely on the work of
484:, began development with the work of
402:
315:Imperial Academy of Sciences and Arts
153:Synthesized speech can be created by
59:Artificial production of human speech
6918:Simple Knowledge Organization System
5919:
5527:
5508:
5386:
4959:
4619:
4335:Etienne, Vanessa (August 19, 2021).
3744:. Hts.sp.nitech.ac.j. Archived from
3586:Indiana Jones and the Temple of Doom
2955:
2881:
2544:Text to speech in digital television
2478:
1851:
1675:was the circa 1983 unreleased Atari
1560:
1422:to describe distinctive sounds in a
1372:adding citations to reliable sources
1339:
1160:Deep learning speech synthesis uses
944:
840:
146:into speech. The reverse process is
5502:
5313:Yoshiyuki, Furushima (2021-01-18).
4429:"Smile -and the world can hear you"
4129:
4043:
3987:
3929:
3881:Yoshiyuki, Furushima (2021-01-18).
3014:Billi, Roberto; Canavesio, Franco;
2403:My Little Pony: Friendship Is Magic
2270:My Little Pony: Friendship Is Magic
2126:included VoiceType, a precursor to
1748:, for people with vision problems.
451:", with musical accompaniment from
140:symbolic linguistic representations
56:
6230:Texas Instruments LPC Speech Chips
5612:Vocal Synthesis and Deep Listening
5417:
5393:Ashworth, Boone (April 12, 2023).
5265:. Animenewsnetwork.com. 2008-09-09
5032:
4703:Devitt, Francesco (30 June 1995).
4359:
4238:Murphy, Margi (20 February 2024).
3936:Ashworth, Boone (April 12, 2023).
3696:
3648:
3476:. New York Media, LLC. 1979-07-30.
3116:Gaming's most important evolutions
1812:
1567:Texas Instruments LPC Speech Chips
1547:Texas Instruments LPC Speech Chips
1222:
510:Texas Instruments LPC Speech Chips
399:segments (consonants and vowels).
335:acoustic-mechanical speech machine
66:
57:
7244:
6933:Thesaurus (information retrieval)
5633:
5450:
4780:. Android-developers.blogspot.com
2794:Lambert, Bruce (March 21, 1992).
2509:Comparison of speech synthesizers
2287:Speech synthesis markup languages
1964:Currently, there are a number of
1178:model used by the application is
321:that could produce the five long
317:for models he built of the human
6307:Speech Synthesis Markup Language
5968:Festival Speech Synthesis System
5602:
5577:
5552:
5469:
5444:
5411:
5361:
5134:10.1109/WECONF55058.2022.9803600
4843:. Mindspring.com. Archived from
4776:Jean-Michel Trivi (2009-09-23).
4018:
3632:Speech Synthesis and Recognition
3415:"Education: Marvel of The Bronx"
3183:"A Short History of Computalker"
2951:from the original on 2022-10-09.
2922:from the original on 2022-10-09.
2484:This section is an excerpt from
2301:Speech Synthesis Markup Language
2041:Festival Speech Synthesis System
2030:systems are available, such as:
1435:, approach to learning reading.
1344:
1228:This section is an excerpt from
670:produced the first multi-player
196:Overview of a typical TTS system
102:Problems playing this file? See
82:
6069:Microsoft text-to-speech voices
5609:Bruno, Chelsea A (2014-03-25).
5325:from the original on 2021-01-18
5295:from the original on 2021-01-19
5255:
5230:
5203:
5158:
5113:
5056:
4833:
4812:
4791:
4769:
4696:
4676:Amiga Hardware Reference Manual
4664:
4640:
4588:
4533:
4506:
4457:
4439:
4421:
4400:
4386:
4353:
4328:
4231:
4154:10.1109/icmew46912.2020.9105991
4123:
4082:
4068:10.21437/speechprosody.2020-190
4037:
4012:
3954:
3904:
3893:from the original on 2021-01-18
3863:from the original on 2021-01-19
3759:
3734:
3623:
3558:
3538:
3516:
3496:
3480:
3464:
3439:
3407:
3380:
3363:
3338:
3325:
3312:
3299:
3283:
3250:
3218:
3175:
3150:
3131:
3109:
3098:
3078:
3069:
3063:
3038:
3007:
2994:"Fumitada Itakura Oral History"
2926:
2875:
2857:
2832:
2806:
2325:
1463:Prosodics and emotional content
327:International Phonetic Alphabet
311:Christian Gottlieb Kratzenstein
6514:Natural language understanding
5585:"AI Text to speech for videos"
5035:"Voice Cloning for the Masses"
4396:. World Wide Web Organization.
3138:Adlum, Eddie (November 1985).
2787:
2744:
2704:
2689:
2659:Computer Speech & Language
2650:
2621:
2586:
2572:. Cambridge University Press.
2557:
2349:voice output communication aid
2021:
1667:Atari ST speech synthesis demo
1143:Deep learning speech synthesis
1103:(voice source), and duration (
494:Nippon Telegraph and Telephone
13:
1:
7038:Optical character recognition
5283:Kurosawa, Yuki (2021-01-19).
5179:10.1109/BHI50953.2021.9508522
5081:10.1080/07380569.2021.1953362
4115:: CS1 maint: date and year (
3851:Kurosawa, Yuki (2021-01-19).
3675:10.21437/Interspeech.2013-161
2720:Current Trends in Linguistics
2550:
2136:Navigation units produced by
1728:that shipped in quantity was
1553:Hardware and software systems
1532:General Instrument SP0256-AL2
1273:Text normalization challenges
1267:
1137:Deep learning-based synthesis
427:in Japan. In 1961, physicist
275:Long before the invention of
224:. The front-end then assigns
6731:Multi-document summarization
4960:Drew, Harwell (2019-09-04).
4527:10.1016/j.specom.2003.05.001
4485:10.1016/j.specom.2007.10.001
4267:10.1007/978-981-16-0733-2_39
3994:Wiggers, Kyle (2023-06-20).
3720:10.1016/j.jvoice.2015.07.017
3401:10.1016/j.specom.2003.05.001
3032:10.1016/0167-6393(95)00030-R
2891:Found. Trends Signal Process
2814:"Arthur C. Clarke Biography"
2632:Progress in Speech Synthesis
2347:usually through a dedicated
1469:Emotional speech recognition
958:physical modelling synthesis
666:, also dates from 1980. The
652:), released in 1980 for the
122:, and can be implemented in
7:
7061:Latent Dirichlet allocation
7033:Natural language generation
6898:Machine-readable dictionary
6893:Linguistic Linked Open Data
6468:Natural language processing
5476:Fadulu, Lola (2023-07-06).
5041:. The Batch. Archived from
3045:Sproat, Richard W. (1997).
2842:. Bell Labs. Archived from
2519:Orca (assistive technology)
2496:
2463:, simulates the physics of
2434:natural language processing
2171:(NeurIPS) researchers from
1959:
1539:DT1050 Digitalker (Mozer –
1196:is primarily known for its
561:natural language processing
425:Electrotechnical Laboratory
10:
7249:
6813:Explicit semantic analysis
6562:Deep linguistic processing
5668:or a description from the
5375:(in Polish). April 9, 2023
3157:Szczepaniak, John (2014).
2483:
1947:
1915:
1855:
1795:
1564:
1466:
1336:Text-to-phoneme challenges
1227:
1186:of a generated line using
1140:
1122:
1033:
749:
699:, created a female voice.
697:AT&T Bell Laboratories
642:with speech synthesis was
374:1939 New York World's Fair
270:
7228:Computational linguistics
7164:
7119:
7074:
7046:
7006:
6951:
6873:
6861:
6792:
6749:
6721:
6656:Word-sense disambiguation
6532:
6509:Computational linguistics
6474:
6383:
6325:
6299:
6248:
6235:General Instrument SP0256
6197:
6122:
6031:
6020:
5990:
5936:
5927:
5859:
5836:
5798:
5761:
5718:
5033:Ng, Andrew (2020-04-01).
4682:Publishing Company, Inc.
4554:10.1109/TASL.2013.2273717
2966:"List of IEEE Milestones"
2057:
1911:
1779:
1612:game console offered the
1600:
1495:discrete cosine transform
1188:emotional contextualizers
1011:, and in the early 1980s
893:Domain-specific synthesis
875:discrete cosine transform
859:digital signal processing
830:digital signal processing
7182:Natural Language Toolkit
7106:Pronunciation assessment
7008:Automatic identification
6838:Latent semantic analysis
6794:Distributional semantics
6679:Compound-term processing
6577:Named-entity recognition
6049:Software Automatic Mouth
5069:Computers in the Schools
5015:University of Washington
4652:Amazon Web Services, Inc
4209:10.1177/1461444820925811
4044:Zhu, Jian (2020-05-25).
3261:Text-to-speech synthesis
2882:Gray, Robert M. (2010).
2534:Speech-generating device
2504:Chinese speech synthesis
2231:University of Washington
2051:Free Software Foundation
1866:desktop systems can use
1693:
1653:
1644:Software Automatic Mouth
1638:A demo of SAM on the C64
1483:University of Portsmouth
1414:-to-phoneme conversion (
906:dialects of English the
863:linear predictive coding
761:Unit selection synthesis
710:Synthesizer technologies
599:Telesensory Systems Inc.
478:Linear predictive coding
39:latest accepted revision
7086:Automated essay scoring
7056:Document classification
6723:Automatic summarization
6396:Concatenative synthesis
6281:Microsoft Speech Server
6150:NIAONiao Virtual Singer
5651:considered for deletion
5622:10.25148/etd.fi14040802
4758:. Microsoft. 2007-05-07
4255:"Deepfake: An Overview"
4197:New Media & Society
4052:. ISCA: ISCA: 930–934.
3795:10.1126/science.7233191
3582:The Empire Strikes Back
3296:IEEE TTS Workshop 2002.
3230:The Singularity is Near
3200:CadeMetz (2020-08-20).
2529:Silent speech interface
2303:(SSML), which became a
2245:web application called
1898:Microsoft Speech Server
1826:Commodore International
1642:Also released in 1982,
1477:A study in the journal
1241:artificial intelligence
1022:arcade games using the
956:and an acoustic model (
881:, that was invented by
752:Concatenative synthesis
746:Concatenation synthesis
733:concatenative synthesis
674:using voice synthesis,
226:phonetic transcriptions
144:phonetic transcriptions
6943:Universal Dependencies
6636:Terminology extraction
6619:Semantic decomposition
6614:Semantic role labeling
6604:Part-of-speech tagging
6572:Information extraction
6557:Coreference resolution
6547:Collocation extraction
6391:Articulatory synthesis
6345:Franklin Seaney Cooper
4985:Thies, Justus (2016).
4674:; et al. (1991).
3714:(5): 639.e17–639.e23.
3105:Voice Chess Challenger
2671:10.1006/csla.1994.1005
2514:List of screen readers
2461:University of Brasília
2374:
2186:Also researchers from
1939:Text-to-speech systems
1932:
1817:
1810:
1721:
1708:
1668:
1639:
1624:
1581:
1537:National Semiconductor
1522:
1497:in the source domain (
1157:
1036:Articulatory synthesis
1030:Articulatory synthesis
960:). Parameters such as
704:cost-performance ratio
668:Milton Bradley Company
640:personal computer game
591:
578:
570:
548:
447:) recreated the song "
416:
382:and his colleagues at
380:Dr. Franklin S. Cooper
197:
78:Automatic announcement
71:
6704:Sentence segmentation
6360:Wolfgang von Kempelen
6140:CeVIO Creative Studio
6099:CeVIO Creative Studio
5982:Automatik Text Reader
5828:Karplus–Strong string
3634:(2nd ed.). CRC.
3257:Taylor, Paul (2009).
3075:Gevaryahu, Jonathan,
2422:SpongeBob SquarePants
2369:
2222:in existing 2D video.
2209:Human image synthesis
2098:, and the Bebook Neo.
1976:and web pages from a
1930:
1816:
1808:
1790:Software as a Service
1719:
1706:
1666:
1637:
1579:
1520:
1473:Prosody (linguistics)
1451:Evaluation challenges
1155:
1101:fundamental frequency
1065:University of Calgary
1018:machines and in many
978:rules-based synthesis
962:fundamental frequency
811:fundamental frequency
607:Speak & Spell toy
589:
576:
569:
546:
463:2001: A Space Odyssey
410:
339:Wolfgang von Kempelen
195:
70:
7218:Assistive technology
7156:Voice user interface
6867:datasets and corpora
6808:Document-term matrix
6661:Word-sense induction
6355:Haskins Laboratories
6064:Microsoft Speech API
5879:Software synthesizer
5723:Frequency modulation
4515:Speech Communication
4473:Speech Communication
4464:Drahota, A. (2008).
4408:"Blizzard Challenge"
3389:Speech Communication
3352:on February 22, 2007
3020:Speech Communication
2820:on December 11, 1997
2357:Haskins Laboratories
2341:reading disabilities
2181:speaker verification
2163:Digital sound-alikes
2028:open-source software
2001:assistive technology
1792:in AWS (from 2017).
1681:finite state machine
1479:Speech Communication
1444:phonemic orthography
1418:is the term used by
1368:improve this section
1262:voice-authentication
1162:deep neural networks
1089:hidden Markov models
1048:in the mid-1970s by
1046:Haskins Laboratories
680:, in the same year.
502:Manfred R. Schroeder
429:John Larry Kelly, Jr
384:Haskins Laboratories
183:reading disabilities
7136:Interactive fiction
7066:Pachinko allocation
7023:Speech segmentation
6979:Google Ngram Viewer
6751:Machine translation
6741:Text simplification
6736:Sentence extraction
6624:Semantic similarity
5861:Digital synthesizer
4711:on 26 February 2012
4130:Lyu, Siwei (2020).
4050:Speech Prosody 2020
3787:1981Sci...212..947R
3614:Vindicators Part II
3525:Music and Computers
3522:Dartmouth College:
3016:Ciaramella, Alberto
2765:1987ASAJ...82..737K
2607:1981ASAJ...70..321R
2067:(Baseball) and the
1172:multi-speaker model
1083:HMM-based synthesis
887:the Bronx, New York
861:techniques such as
630:(known in Japan as
521:line spectral pairs
492:and Shuzo Saito of
29:Page version status
18:Singing synthesizer
7146:Question answering
7018:Speech recognition
6883:Corpus linguistics
6863:Language resources
6646:Textual entailment
6629:Sentiment analysis
6365:Ignatius Mattingly
5838:Analog synthesizer
5800:Physical modelling
5541:. January 30, 2023
5509:Kanetkar, Riddhi.
5482:The New York Times
5242:Anime News Network
4797:Andreas Bischoff,
4581:2012-05-28 at the
4394:"Speech synthesis"
4360:Newman, Lily Hay.
4097:. pp. 11–13.
3918:. January 23, 2023
3574:Return of the Jedi
3531:2011-06-08 at the
3206:The New York Times
3121:2011-06-15 at the
2904:10.1561/2000000036
2869:2016-03-04 at the
2800:The New York Times
2438:virtual assistants
2430:speech recognition
2375:
2305:W3C recommendation
2220:facial expressions
2218:counterfeiting of
2177:transfers learning
2092:PocketBook eReader
1933:
1876:speech recognition
1818:
1811:
1770:command-line based
1738:speech recognition
1722:
1709:
1669:
1640:
1582:
1523:
1513:Dedicated hardware
1250:speech translation
1158:
1125:Sinewave synthesis
1119:Sinewave synthesis
1113:maximum likelihood
1093:frequency spectrum
954:additive synthesis
883:Michael J. Freeman
632:Speak & Rescue
592:
579:
571:
549:
455:. Coincidentally,
431:and his colleague
417:
403:Electronic devices
347:Charles Wheatstone
211:text normalization
198:
179:visual impairments
148:speech recognition
120:speech synthesizer
72:
35:
7223:Auditory displays
7195:
7194:
7151:Virtual assistant
7076:Computer-assisted
7002:
7001:
6759:Computer-assisted
6717:
6716:
6709:Word segmentation
6671:Text segmentation
6609:Semantic analysis
6597:Syntactic parsing
6582:Ontology learning
6434:
6433:
6340:Catherine Browman
6193:
6192:
6016:
6015:
6003:Lyricos / Flinger
5887:
5886:
5874:Scanned synthesis
5813:Digital waveguide
5728:Linear arithmetic
5188:978-1-6654-0358-0
5143:978-1-6654-7083-4
4689:978-0-201-56776-2
4585:." June 14, 2001.
4548:(12): 2471–2480.
4276:978-981-16-0732-5
4163:978-1-7281-1485-9
3781:(4497): 947–949.
3641:978-0-7484-0856-6
3564:Examples include
3544:Examples include
3513:, September 1993.
3473:New York Magazine
3451:cyberneticzoo.com
3375:ICSLP Proceedings
3335:, IEEE ASRU 2011.
3305:John Kominek and
3243:978-0-14-303788-0
3226:Kurzweil, Raymond
3056:978-0-7923-8027-6
2643:978-0-387-94701-3
2579:978-0-521-30641-6
2539:Speech processing
2479:Singing synthesis
2345:speech impairment
2241:In March 2020, a
1928:
1852:Microsoft Windows
1806:
1717:
1704:
1664:
1635:
1577:
1561:Texas Instruments
1499:linear prediction
1442:Languages with a
1433:synthetic phonics
1404:
1403:
1396:
1153:
1054:Bell Laboratories
1024:TMS5220 LPC Chips
1009:Speak & Spell
1005:Texas Instruments
945:Formant synthesis
939:context-sensitive
841:Diphone synthesis
795:speech recognizer
611:Texas Instruments
587:
544:
514:Speak & Spell
490:Nagoya University
480:(LPC), a form of
294:(1198–1280), and
280:signal processing
187:operating systems
89:
47:22 September 2024
26:
16:(Redirected from
7240:
7208:Speech synthesis
7172:Formal semantics
7121:Natural language
7028:Speech synthesis
7010:and data capture
6913:Semantic network
6888:Lexical resource
6871:
6870:
6689:Lexical analysis
6667:
6666:
6592:Semantic parsing
6461:
6454:
6447:
6438:
6437:
6276:Windows Narrator
6215:Pattern playback
6165:Symphonic Choirs
6029:
6028:
5934:
5933:
5921:Speech synthesis
5914:
5907:
5900:
5891:
5890:
5808:Banded waveguide
5733:Phase distortion
5704:
5697:
5690:
5681:
5680:
5657:Speech synthesis
5654:
5627:
5626:
5624:
5606:
5600:
5599:
5597:
5595:
5581:
5575:
5574:
5572:
5570:
5556:
5550:
5549:
5547:
5546:
5531:
5525:
5524:
5522:
5521:
5515:Business Insider
5506:
5500:
5499:
5497:
5496:
5473:
5467:
5466:
5464:
5463:
5448:
5442:
5441:
5439:
5438:
5415:
5409:
5408:
5406:
5405:
5390:
5384:
5383:
5381:
5380:
5365:
5359:
5358:
5356:
5355:
5340:
5334:
5333:
5331:
5330:
5319:Denfaminicogamer
5310:
5304:
5303:
5301:
5300:
5280:
5274:
5273:
5271:
5270:
5259:
5253:
5252:
5250:
5249:
5234:
5228:
5227:
5215:
5207:
5201:
5200:
5162:
5156:
5155:
5117:
5111:
5110:
5092:
5060:
5054:
5053:
5051:
5050:
5030:
5024:
5023:
5022:
5021:
5004:
4998:
4997:
4995:
4994:
4982:
4976:
4975:
4973:
4972:
4957:
4951:
4950:
4948:
4947:
4926:
4920:
4919:
4918:
4892:
4886:
4885:
4884:
4862:
4856:
4855:
4853:
4852:
4837:
4831:
4830:
4828:
4827:
4816:
4810:
4795:
4789:
4788:
4786:
4785:
4773:
4767:
4766:
4764:
4763:
4752:
4746:
4745:
4743:
4742:
4737:on June 21, 2003
4727:
4721:
4720:
4718:
4716:
4707:. Archived from
4700:
4694:
4693:
4678:(3rd ed.).
4668:
4662:
4661:
4659:
4658:
4644:
4638:
4637:
4635:
4634:
4623:
4617:
4616:
4614:
4613:
4607:
4601:. Archived from
4600:
4592:
4586:
4572:
4566:
4565:
4537:
4531:
4530:
4510:
4504:
4503:
4501:
4495:. Archived from
4470:
4461:
4455:
4454:
4443:
4437:
4436:
4435:on May 17, 2008.
4425:
4419:
4418:
4416:
4415:
4404:
4398:
4397:
4390:
4384:
4383:
4381:
4380:
4357:
4351:
4350:
4348:
4347:
4332:
4326:
4325:
4323:
4322:
4300:
4294:
4293:
4292:
4291:
4250:
4244:
4243:
4235:
4229:
4228:
4188:
4182:
4181:
4179:
4178:
4147:
4138:. pp. 1–6.
4127:
4121:
4120:
4114:
4106:
4093:. Vol. 28.
4086:
4080:
4079:
4061:
4041:
4035:
4034:
4032:
4031:
4019:Bonk, Lawrence.
4016:
4010:
4009:
4007:
4006:
3991:
3985:
3984:
3982:
3981:
3958:
3952:
3951:
3949:
3948:
3933:
3927:
3926:
3924:
3923:
3908:
3902:
3901:
3899:
3898:
3887:Denfaminicogamer
3878:
3872:
3871:
3869:
3868:
3848:
3842:
3841:
3839:
3827:
3821:
3820:
3818:
3817:
3811:
3805:. Archived from
3772:
3763:
3757:
3756:
3754:
3753:
3738:
3732:
3731:
3708:Journal of Voice
3703:
3694:
3693:
3691:
3689:
3667:Interspeech 2013
3664:
3655:
3646:
3645:
3627:
3621:
3562:
3556:
3542:
3536:
3520:
3514:
3500:
3494:
3493:
3484:
3478:
3477:
3468:
3462:
3461:
3459:
3458:
3443:
3437:
3436:
3434:
3433:
3411:
3405:
3404:
3384:
3378:
3367:
3361:
3360:
3358:
3357:
3348:. Archived from
3342:
3336:
3329:
3323:
3316:
3310:
3303:
3297:
3287:
3281:
3280:
3264:
3254:
3248:
3247:
3222:
3216:
3215:
3213:
3212:
3197:
3191:
3190:
3179:
3173:
3172:
3154:
3148:
3147:
3135:
3129:
3113:
3107:
3102:
3096:
3094:
3093:
3089:
3084:Breslow, et al.
3082:
3076:
3073:
3067:
3061:
3060:
3042:
3036:
3035:
3011:
3005:
3004:
3002:
3001:
2990:
2981:
2980:
2978:
2976:
2962:
2953:
2952:
2950:
2939:
2930:
2924:
2923:
2921:
2906:
2888:
2879:
2873:
2861:
2855:
2854:
2852:
2851:
2836:
2830:
2829:
2827:
2825:
2816:. Archived from
2810:
2804:
2803:
2791:
2785:
2784:
2773:10.1121/1.395275
2748:
2742:
2741:
2739:
2738:
2732:
2717:
2708:
2702:
2701:
2693:
2687:
2681:
2675:
2674:
2654:
2648:
2647:
2635:
2625:
2619:
2618:
2615:10.1121/1.386780
2590:
2584:
2583:
2571:
2561:
2524:Paperless office
2448:like Elai.io or
2419:fandom, and the
2400:, including the
2394:content creation
2293:markup languages
2261:Twilight Sparkle
1929:
1902:web applications
1807:
1726:operating system
1718:
1707:MacinTalk 1 demo
1705:
1673:operating system
1665:
1636:
1578:
1399:
1392:
1388:
1385:
1379:
1348:
1340:
1330:Ulysses S. Grant
1180:nondeterministic
1154:
986:embedded systems
924:
916:
650:Shoplifting Girl
588:
545:
516:toys from 1978.
486:Fumitada Itakura
457:Arthur C. Clarke
388:Pattern playback
286:" involved Pope
112:Speech synthesis
91:
90:
69:
21:
7248:
7247:
7243:
7242:
7241:
7239:
7238:
7237:
7198:
7197:
7196:
7191:
7160:
7140:Syntax guessing
7122:
7115:
7101:Predictive text
7096:Grammar checker
7077:
7070:
7042:
7009:
6998:
6964:Bank of English
6947:
6875:
6866:
6857:
6788:
6745:
6713:
6665:
6567:Distant reading
6542:Argument mining
6528:
6524:Text processing
6470:
6465:
6435:
6430:
6379:
6327:
6321:
6295:
6244:
6189:
6118:
6059:Microsoft Agent
6023:
6012:
5986:
5923:
5918:
5888:
5883:
5869:Analog modeling
5855:
5846:Graphical sound
5832:
5794:
5757:
5714:
5711:Sound synthesis
5708:
5639:
5636:
5631:
5630:
5607:
5603:
5593:
5591:
5583:
5582:
5578:
5568:
5566:
5558:
5557:
5553:
5544:
5542:
5533:
5532:
5528:
5519:
5517:
5507:
5503:
5494:
5492:
5474:
5470:
5461:
5459:
5449:
5445:
5436:
5434:
5416:
5412:
5403:
5401:
5391:
5387:
5378:
5376:
5367:
5366:
5362:
5353:
5351:
5342:
5341:
5337:
5328:
5326:
5311:
5307:
5298:
5296:
5281:
5277:
5268:
5266:
5261:
5260:
5256:
5247:
5245:
5236:
5235:
5231:
5213:
5209:
5208:
5204:
5189:
5163:
5159:
5144:
5118:
5114:
5061:
5057:
5048:
5046:
5039:deeplearning.ai
5031:
5027:
5019:
5017:
5005:
5001:
4992:
4990:
4983:
4979:
4970:
4968:
4966:Washington Post
4958:
4954:
4945:
4943:
4928:
4927:
4923:
4893:
4889:
4863:
4859:
4850:
4848:
4839:
4838:
4834:
4825:
4823:
4818:
4817:
4813:
4796:
4792:
4783:
4781:
4774:
4770:
4761:
4759:
4754:
4753:
4749:
4740:
4738:
4729:
4728:
4724:
4714:
4712:
4701:
4697:
4690:
4669:
4665:
4656:
4654:
4646:
4645:
4641:
4632:
4630:
4625:
4624:
4620:
4611:
4609:
4605:
4598:
4594:
4593:
4589:
4583:Wayback Machine
4573:
4569:
4538:
4534:
4511:
4507:
4499:
4468:
4462:
4458:
4453:. January 2008.
4445:
4444:
4440:
4427:
4426:
4422:
4413:
4411:
4406:
4405:
4401:
4392:
4391:
4387:
4378:
4376:
4358:
4354:
4345:
4343:
4333:
4329:
4320:
4318:
4308:Washington Post
4302:
4301:
4297:
4289:
4287:
4277:
4251:
4247:
4236:
4232:
4189:
4185:
4176:
4174:
4164:
4128:
4124:
4108:
4107:
4087:
4083:
4042:
4038:
4029:
4027:
4017:
4013:
4004:
4002:
3992:
3988:
3979:
3977:
3959:
3955:
3946:
3944:
3934:
3930:
3921:
3919:
3910:
3909:
3905:
3896:
3894:
3879:
3875:
3866:
3864:
3849:
3845:
3828:
3824:
3815:
3813:
3809:
3770:
3764:
3760:
3751:
3749:
3740:
3739:
3735:
3704:
3697:
3687:
3685:
3662:
3656:
3649:
3642:
3628:
3624:
3563:
3559:
3543:
3539:
3533:Wayback Machine
3521:
3517:
3501:
3497:
3486:
3485:
3481:
3470:
3469:
3465:
3456:
3454:
3445:
3444:
3440:
3431:
3429:
3413:
3412:
3408:
3385:
3381:
3368:
3364:
3355:
3353:
3344:
3343:
3339:
3330:
3326:
3317:
3313:
3304:
3300:
3288:
3284:
3277:
3255:
3251:
3244:
3223:
3219:
3210:
3208:
3198:
3194:
3181:
3180:
3176:
3169:
3155:
3151:
3136:
3132:
3123:Wayback Machine
3114:
3110:
3103:
3099:
3091:
3085:
3083:
3079:
3074:
3070:
3064:
3057:
3043:
3039:
3012:
3008:
2999:
2997:
2992:
2991:
2984:
2974:
2972:
2964:
2963:
2956:
2948:
2937:
2931:
2927:
2919:
2886:
2880:
2876:
2871:Wayback Machine
2862:
2858:
2849:
2847:
2838:
2837:
2833:
2823:
2821:
2812:
2811:
2807:
2792:
2788:
2749:
2745:
2736:
2734:
2730:
2715:
2709:
2705:
2699:
2694:
2690:
2682:
2678:
2655:
2651:
2644:
2626:
2622:
2591:
2587:
2580:
2562:
2558:
2553:
2548:
2499:
2494:
2493:
2489:
2481:
2410:Team Fortress 2
2371:Stephen Hawking
2328:
2289:
2165:
2060:
2024:
1962:
1952:Version 1.6 of
1950:
1941:
1922:
1920:
1914:
1860:
1858:Microsoft Agent
1854:
1800:
1798:
1782:
1711:
1698:
1696:
1658:
1656:
1629:
1627:
1618:SP0256 Narrator
1603:
1571:
1569:
1563:
1555:
1515:
1475:
1465:
1453:
1400:
1389:
1383:
1380:
1365:
1349:
1338:
1314:parts of speech
1312:) to generate "
1275:
1270:
1254:
1253:
1233:
1225:
1223:Audio deepfakes
1147:
1145:
1139:
1127:
1121:
1085:
1038:
1032:
947:
925:). Likewise in
921:is realized as
895:
843:
773:, half-phones,
763:
754:
748:
721:intelligibility
712:
672:electronic game
636:Sun Electronics
581:
538:
413:Stephen Hawking
405:
292:Albertus Magnus
273:
246:text-to-phoneme
109:
108:
100:
98:
97:
96:
95:
92:
83:
80:
73:
67:
60:
55:
54:
53:
52:
51:
50:
34:
22:
15:
12:
11:
5:
7246:
7236:
7235:
7230:
7225:
7220:
7215:
7210:
7193:
7192:
7190:
7189:
7184:
7179:
7174:
7168:
7166:
7162:
7161:
7159:
7158:
7153:
7148:
7143:
7133:
7127:
7125:
7123:user interface
7117:
7116:
7114:
7113:
7108:
7103:
7098:
7093:
7088:
7082:
7080:
7072:
7071:
7069:
7068:
7063:
7058:
7052:
7050:
7044:
7043:
7041:
7040:
7035:
7030:
7025:
7020:
7014:
7012:
7004:
7003:
7000:
6999:
6997:
6996:
6991:
6986:
6981:
6976:
6971:
6966:
6961:
6955:
6953:
6949:
6948:
6946:
6945:
6940:
6935:
6930:
6925:
6920:
6915:
6910:
6905:
6900:
6895:
6890:
6885:
6879:
6877:
6868:
6859:
6858:
6856:
6855:
6850:
6848:Word embedding
6845:
6840:
6835:
6828:Language model
6825:
6820:
6815:
6810:
6805:
6799:
6797:
6790:
6789:
6787:
6786:
6781:
6779:Transfer-based
6776:
6771:
6766:
6761:
6755:
6753:
6747:
6746:
6744:
6743:
6738:
6733:
6727:
6725:
6719:
6718:
6715:
6714:
6712:
6711:
6706:
6701:
6696:
6691:
6686:
6681:
6675:
6673:
6664:
6663:
6658:
6653:
6648:
6643:
6638:
6632:
6631:
6626:
6621:
6616:
6611:
6606:
6601:
6600:
6599:
6594:
6584:
6579:
6574:
6569:
6564:
6559:
6554:
6552:Concept mining
6549:
6544:
6538:
6536:
6530:
6529:
6527:
6526:
6521:
6516:
6511:
6506:
6505:
6504:
6499:
6489:
6484:
6478:
6476:
6472:
6471:
6464:
6463:
6456:
6449:
6441:
6432:
6431:
6429:
6428:
6423:
6418:
6413:
6408:
6406:Inverse filter
6403:
6398:
6393:
6387:
6385:
6381:
6380:
6378:
6377:
6372:
6367:
6362:
6357:
6352:
6347:
6342:
6337:
6331:
6329:
6323:
6322:
6320:
6319:
6314:
6309:
6303:
6301:
6297:
6296:
6294:
6293:
6288:
6283:
6278:
6273:
6268:
6263:
6258:
6252:
6250:
6246:
6245:
6243:
6242:
6237:
6232:
6227:
6222:
6217:
6212:
6207:
6201:
6199:
6195:
6194:
6191:
6190:
6188:
6187:
6182:
6177:
6172:
6167:
6162:
6157:
6152:
6147:
6142:
6137:
6132:
6126:
6124:
6120:
6119:
6117:
6116:
6111:
6106:
6101:
6096:
6091:
6086:
6081:
6076:
6071:
6066:
6061:
6056:
6051:
6046:
6041:
6035:
6033:
6026:
6018:
6017:
6014:
6013:
6011:
6010:
6005:
6000:
5994:
5992:
5988:
5987:
5985:
5984:
5979:
5974:
5965:
5960:
5955:
5950:
5940:
5938:
5931:
5925:
5924:
5917:
5916:
5909:
5902:
5894:
5885:
5884:
5882:
5881:
5876:
5871:
5865:
5863:
5857:
5856:
5854:
5853:
5848:
5842:
5840:
5834:
5833:
5831:
5830:
5825:
5820:
5818:Direct digital
5815:
5810:
5804:
5802:
5796:
5795:
5793:
5792:
5787:
5782:
5777:
5771:
5769:
5759:
5758:
5756:
5755:
5750:
5745:
5740:
5735:
5730:
5725:
5719:
5716:
5715:
5707:
5706:
5699:
5692:
5684:
5678:
5677:
5663:
5635:
5634:External links
5632:
5629:
5628:
5601:
5576:
5551:
5526:
5501:
5468:
5451:Suciu, Peter.
5443:
5418:Knibbs, Kate.
5410:
5385:
5360:
5335:
5305:
5275:
5254:
5229:
5202:
5187:
5157:
5142:
5112:
5075:(3): 214–231.
5055:
5025:
4999:
4977:
4952:
4921:
4887:
4857:
4832:
4811:
4790:
4768:
4747:
4722:
4695:
4688:
4680:Addison-Wesley
4663:
4648:"Amazon Polly"
4639:
4629:. folklore.org
4618:
4587:
4567:
4532:
4521:(2): 143–154.
4505:
4502:on 2013-07-03.
4479:(4): 278–287.
4456:
4438:
4420:
4399:
4385:
4352:
4327:
4295:
4275:
4245:
4230:
4183:
4162:
4122:
4081:
4036:
4011:
3986:
3953:
3928:
3903:
3873:
3843:
3822:
3758:
3733:
3695:
3647:
3640:
3622:
3557:
3537:
3515:
3495:
3479:
3463:
3438:
3421:. 1974-04-01.
3406:
3395:(2): 143–154.
3379:
3362:
3337:
3324:
3311:
3298:
3282:
3275:
3249:
3242:
3217:
3192:
3174:
3168:978-0992926007
3167:
3149:
3130:
3108:
3097:
3077:
3068:
3062:
3055:
3037:
3026:(3): 263–271.
3006:
2982:
2954:
2925:
2897:(4): 203–303.
2874:
2856:
2831:
2805:
2786:
2743:
2703:
2688:
2676:
2649:
2642:
2620:
2601:(2): 321–328.
2585:
2578:
2555:
2554:
2552:
2549:
2547:
2546:
2541:
2536:
2531:
2526:
2521:
2516:
2511:
2506:
2500:
2498:
2495:
2490:
2482:
2480:
2477:
2333:screen readers
2327:
2324:
2288:
2285:
2267:from the show
2239:
2238:
2223:
2216:near real-time
2212:
2188:Baidu Research
2164:
2161:
2160:
2159:
2149:
2131:
2117:
2106:
2099:
2096:enTourage eDGe
2082:, such as the
2080:e-book readers
2076:
2059:
2056:
2055:
2054:
2044:
2038:
2023:
2020:
1982:Google Toolbar
1961:
1958:
1949:
1946:
1940:
1937:
1916:Main article:
1913:
1910:
1853:
1850:
1797:
1794:
1781:
1778:
1754:Mac OS X Tiger
1730:Apple Computer
1695:
1692:
1655:
1652:
1626:
1623:
1602:
1599:
1565:Main article:
1562:
1559:
1554:
1551:
1550:
1549:
1544:
1534:
1529:
1514:
1511:
1464:
1461:
1452:
1449:
1428:pronunciations
1402:
1401:
1352:
1350:
1343:
1337:
1334:
1274:
1271:
1269:
1266:
1246:text-to-speech
1237:audio deepfake
1234:
1230:Audio deepfake
1226:
1224:
1221:
1141:Main article:
1138:
1135:
1123:Main article:
1120:
1117:
1084:
1081:
1034:Main article:
1031:
1028:
994:microprocessor
946:
943:
910:in words like
894:
891:
842:
839:
762:
759:
750:Main article:
747:
744:
711:
708:
645:Manbiki Shoujo
498:Bishnu S. Atal
433:Louis Gerstman
404:
401:
393:Alvin Liberman
360:developed the
356:In the 1930s,
290:(d. 1003 AD),
272:
269:
265:target prosody
230:prosodic units
216:pre-processing
132:text-to-speech
99:
93:
81:
76:
75:
74:
65:
64:
63:
58:
36:
30:
27:
25:
9:
6:
4:
3:
2:
7245:
7234:
7231:
7229:
7226:
7224:
7221:
7219:
7216:
7214:
7211:
7209:
7206:
7205:
7203:
7188:
7185:
7183:
7180:
7178:
7177:Hallucination
7175:
7173:
7170:
7169:
7167:
7163:
7157:
7154:
7152:
7149:
7147:
7144:
7141:
7137:
7134:
7132:
7129:
7128:
7126:
7124:
7118:
7112:
7111:Spell checker
7109:
7107:
7104:
7102:
7099:
7097:
7094:
7092:
7089:
7087:
7084:
7083:
7081:
7079:
7073:
7067:
7064:
7062:
7059:
7057:
7054:
7053:
7051:
7049:
7045:
7039:
7036:
7034:
7031:
7029:
7026:
7024:
7021:
7019:
7016:
7015:
7013:
7011:
7005:
6995:
6992:
6990:
6987:
6985:
6982:
6980:
6977:
6975:
6972:
6970:
6967:
6965:
6962:
6960:
6957:
6956:
6954:
6950:
6944:
6941:
6939:
6936:
6934:
6931:
6929:
6926:
6924:
6923:Speech corpus
6921:
6919:
6916:
6914:
6911:
6909:
6906:
6904:
6903:Parallel text
6901:
6899:
6896:
6894:
6891:
6889:
6886:
6884:
6881:
6880:
6878:
6872:
6869:
6864:
6860:
6854:
6851:
6849:
6846:
6844:
6841:
6839:
6836:
6833:
6829:
6826:
6824:
6821:
6819:
6816:
6814:
6811:
6809:
6806:
6804:
6801:
6800:
6798:
6795:
6791:
6785:
6782:
6780:
6777:
6775:
6772:
6770:
6767:
6765:
6764:Example-based
6762:
6760:
6757:
6756:
6754:
6752:
6748:
6742:
6739:
6737:
6734:
6732:
6729:
6728:
6726:
6724:
6720:
6710:
6707:
6705:
6702:
6700:
6697:
6695:
6694:Text chunking
6692:
6690:
6687:
6685:
6684:Lemmatisation
6682:
6680:
6677:
6676:
6674:
6672:
6668:
6662:
6659:
6657:
6654:
6652:
6649:
6647:
6644:
6642:
6639:
6637:
6634:
6633:
6630:
6627:
6625:
6622:
6620:
6617:
6615:
6612:
6610:
6607:
6605:
6602:
6598:
6595:
6593:
6590:
6589:
6588:
6585:
6583:
6580:
6578:
6575:
6573:
6570:
6568:
6565:
6563:
6560:
6558:
6555:
6553:
6550:
6548:
6545:
6543:
6540:
6539:
6537:
6535:
6534:Text analysis
6531:
6525:
6522:
6520:
6517:
6515:
6512:
6510:
6507:
6503:
6500:
6498:
6495:
6494:
6493:
6490:
6488:
6485:
6483:
6480:
6479:
6477:
6475:General terms
6473:
6469:
6462:
6457:
6455:
6450:
6448:
6443:
6442:
6439:
6427:
6426:Voice cloning
6424:
6422:
6419:
6417:
6416:Phase vocoder
6414:
6412:
6409:
6407:
6404:
6402:
6399:
6397:
6394:
6392:
6389:
6388:
6386:
6382:
6376:
6373:
6371:
6368:
6366:
6363:
6361:
6358:
6356:
6353:
6351:
6348:
6346:
6343:
6341:
6338:
6336:
6335:Alan W. Black
6333:
6332:
6330:
6324:
6318:
6315:
6313:
6310:
6308:
6305:
6304:
6302:
6298:
6292:
6289:
6287:
6284:
6282:
6279:
6277:
6274:
6272:
6269:
6267:
6264:
6262:
6259:
6257:
6254:
6253:
6251:
6247:
6241:
6238:
6236:
6233:
6231:
6228:
6226:
6223:
6221:
6218:
6216:
6213:
6211:
6208:
6206:
6203:
6202:
6200:
6196:
6186:
6183:
6181:
6178:
6176:
6173:
6171:
6168:
6166:
6163:
6161:
6158:
6156:
6153:
6151:
6148:
6146:
6143:
6141:
6138:
6136:
6133:
6131:
6128:
6127:
6125:
6121:
6115:
6112:
6110:
6107:
6105:
6102:
6100:
6097:
6095:
6092:
6090:
6087:
6085:
6082:
6080:
6079:Voice browser
6077:
6075:
6072:
6070:
6067:
6065:
6062:
6060:
6057:
6055:
6052:
6050:
6047:
6045:
6042:
6040:
6037:
6036:
6034:
6030:
6027:
6025:
6019:
6009:
6006:
6004:
6001:
5999:
5996:
5995:
5993:
5989:
5983:
5980:
5978:
5975:
5973:
5969:
5966:
5964:
5961:
5959:
5956:
5954:
5951:
5949:
5945:
5942:
5941:
5939:
5935:
5932:
5930:
5929:Free software
5926:
5922:
5915:
5910:
5908:
5903:
5901:
5896:
5895:
5892:
5880:
5877:
5875:
5872:
5870:
5867:
5866:
5864:
5862:
5858:
5852:
5849:
5847:
5844:
5843:
5841:
5839:
5835:
5829:
5826:
5824:
5821:
5819:
5816:
5814:
5811:
5809:
5806:
5805:
5803:
5801:
5797:
5791:
5790:Concatenative
5788:
5786:
5783:
5781:
5778:
5776:
5773:
5772:
5770:
5768:
5764:
5760:
5754:
5751:
5749:
5746:
5744:
5741:
5739:
5736:
5734:
5731:
5729:
5726:
5724:
5721:
5720:
5717:
5712:
5705:
5700:
5698:
5693:
5691:
5686:
5685:
5682:
5675:
5671:
5667:
5664:
5662:
5658:
5652:
5648:
5647:
5643:
5638:
5637:
5623:
5618:
5614:
5613:
5605:
5590:
5586:
5580:
5565:
5561:
5555:
5540:
5536:
5530:
5516:
5512:
5505:
5491:
5487:
5483:
5479:
5472:
5458:
5454:
5447:
5433:
5429:
5425:
5421:
5414:
5400:
5396:
5389:
5374:
5370:
5364:
5349:
5345:
5339:
5324:
5320:
5316:
5309:
5294:
5290:
5286:
5279:
5264:
5258:
5243:
5239:
5233:
5225:
5221:
5220:
5212:
5206:
5198:
5194:
5190:
5184:
5180:
5176:
5172:
5168:
5161:
5153:
5149:
5145:
5139:
5135:
5131:
5127:
5123:
5116:
5108:
5104:
5100:
5096:
5091:
5086:
5082:
5078:
5074:
5070:
5066:
5059:
5045:on 2020-08-07
5044:
5040:
5036:
5029:
5016:
5012:
5011:
5003:
4988:
4981:
4967:
4963:
4956:
4941:
4937:
4936:
4931:
4925:
4917:
4912:
4908:
4904:
4903:
4898:
4891:
4883:
4878:
4875:: 4485–4495,
4874:
4870:
4869:
4861:
4847:on 2013-10-03
4846:
4842:
4836:
4821:
4815:
4808:
4807:0-7695-2932-1
4804:
4800:
4794:
4779:
4772:
4757:
4751:
4736:
4732:
4726:
4710:
4706:
4699:
4691:
4685:
4681:
4677:
4673:
4667:
4653:
4649:
4643:
4628:
4622:
4608:on 2012-03-24
4604:
4597:
4591:
4584:
4580:
4577:
4571:
4563:
4559:
4555:
4551:
4547:
4543:
4536:
4528:
4524:
4520:
4516:
4509:
4498:
4494:
4490:
4486:
4482:
4478:
4474:
4467:
4460:
4452:
4451:Science Daily
4448:
4442:
4434:
4430:
4424:
4410:. Festvox.org
4409:
4403:
4395:
4389:
4375:
4371:
4367:
4363:
4356:
4342:
4338:
4331:
4317:
4313:
4309:
4305:
4299:
4286:
4282:
4278:
4272:
4268:
4264:
4260:
4256:
4249:
4241:
4234:
4226:
4222:
4218:
4214:
4210:
4206:
4202:
4198:
4194:
4187:
4173:
4169:
4165:
4159:
4155:
4151:
4146:
4141:
4137:
4133:
4126:
4118:
4112:
4104:
4100:
4096:
4092:
4085:
4077:
4073:
4069:
4065:
4060:
4055:
4051:
4047:
4040:
4026:
4022:
4015:
4001:
3997:
3990:
3976:
3972:
3968:
3964:
3961:WIRED Staff.
3957:
3943:
3939:
3932:
3917:
3913:
3907:
3892:
3888:
3884:
3877:
3862:
3858:
3854:
3847:
3838:
3833:
3826:
3812:on 2011-12-16
3808:
3804:
3800:
3796:
3792:
3788:
3784:
3780:
3776:
3769:
3762:
3748:on 2012-02-13
3747:
3743:
3737:
3729:
3725:
3721:
3717:
3713:
3709:
3702:
3700:
3684:
3680:
3676:
3672:
3668:
3661:
3654:
3652:
3643:
3637:
3633:
3626:
3619:
3615:
3611:
3607:
3603:
3599:
3595:
3591:
3587:
3583:
3579:
3575:
3571:
3567:
3561:
3555:
3551:
3547:
3546:Astro Blaster
3541:
3534:
3530:
3527:
3526:
3519:
3512:
3508:
3504:
3499:
3491:
3490:
3483:
3475:
3474:
3467:
3452:
3448:
3442:
3428:
3424:
3420:
3416:
3410:
3402:
3398:
3394:
3390:
3383:
3376:
3372:
3366:
3351:
3347:
3341:
3334:
3328:
3321:
3318:Julia Zhang.
3315:
3308:
3307:Alan W. Black
3302:
3295:
3291:
3290:Alan W. Black
3286:
3278:
3276:9780521899277
3272:
3268:
3263:
3262:
3253:
3245:
3239:
3235:
3234:Penguin Books
3231:
3227:
3221:
3207:
3203:
3196:
3188:
3184:
3178:
3170:
3164:
3160:
3153:
3145:
3141:
3134:
3128:
3124:
3120:
3117:
3112:
3106:
3101:
3088:
3081:
3072:
3066:
3058:
3052:
3048:
3041:
3033:
3029:
3025:
3021:
3017:
3010:
2995:
2989:
2987:
2971:
2967:
2961:
2959:
2947:
2944:(3): 1123–6.
2943:
2936:
2929:
2918:
2914:
2910:
2905:
2900:
2896:
2892:
2885:
2878:
2872:
2868:
2865:
2860:
2846:on 2000-04-07
2845:
2841:
2835:
2819:
2815:
2809:
2801:
2797:
2790:
2782:
2778:
2774:
2770:
2766:
2762:
2759:(3): 737–93.
2758:
2754:
2747:
2733:on 2013-05-12
2729:
2725:
2721:
2714:
2707:
2697:
2692:
2685:
2680:
2672:
2668:
2665:(2): 95–128.
2664:
2660:
2653:
2645:
2639:
2634:
2633:
2624:
2616:
2612:
2608:
2604:
2600:
2596:
2589:
2581:
2575:
2570:
2569:
2560:
2556:
2545:
2542:
2540:
2537:
2535:
2532:
2530:
2527:
2525:
2522:
2520:
2517:
2515:
2512:
2510:
2507:
2505:
2502:
2501:
2487:
2476:
2474:
2470:
2466:
2462:
2458:
2457:voice quality
2453:
2451:
2445:
2441:
2439:
2435:
2431:
2426:
2424:
2423:
2418:
2417:
2412:
2411:
2406:
2404:
2399:
2395:
2391:
2390:
2385:
2381:
2372:
2368:
2364:
2362:
2358:
2354:
2350:
2346:
2342:
2338:
2334:
2323:
2321:
2316:
2314:
2310:
2306:
2302:
2298:
2294:
2284:
2282:
2281:
2276:
2272:
2271:
2266:
2262:
2258:
2257:
2252:
2248:
2244:
2236:
2232:
2228:
2224:
2221:
2217:
2213:
2210:
2207:
2206:
2205:
2202:
2200:
2195:
2193:
2192:voice cloning
2189:
2184:
2182:
2178:
2174:
2170:
2157:
2153:
2150:
2147:
2143:
2139:
2135:
2132:
2129:
2125:
2121:
2118:
2115:
2111:
2107:
2104:
2100:
2097:
2093:
2089:
2085:
2084:Amazon Kindle
2081:
2077:
2074:
2070:
2066:
2062:
2061:
2052:
2048:
2045:
2042:
2039:
2036:
2033:
2032:
2031:
2029:
2019:
2017:
2012:
2010:
2006:
2002:
1997:
1995:
1991:
1987:
1983:
1979:
1975:
1974:e-mail client
1971:
1967:
1957:
1955:
1945:
1936:
1919:
1909:
1907:
1903:
1899:
1895:
1893:
1889:
1885:
1881:
1877:
1873:
1869:
1865:
1859:
1849:
1845:
1843:
1842:Speak Handler
1839:
1835:
1831:
1827:
1823:
1815:
1793:
1791:
1787:
1777:
1775:
1771:
1767:
1763:
1759:
1755:
1751:
1747:
1743:
1739:
1735:
1731:
1727:
1691:
1689:
1684:
1682:
1678:
1677:1400XL/1450XL
1674:
1651:
1649:
1645:
1622:
1619:
1615:
1611:
1610:Intellivision
1608:
1598:
1596:
1595:
1590:
1589:
1568:
1558:
1548:
1545:
1542:
1541:Forrest Mozer
1538:
1535:
1533:
1530:
1528:
1525:
1524:
1519:
1510:
1508:
1504:
1500:
1496:
1492:
1491:pitch contour
1488:
1484:
1480:
1474:
1470:
1460:
1457:
1448:
1445:
1440:
1436:
1434:
1429:
1425:
1421:
1417:
1413:
1409:
1398:
1395:
1387:
1377:
1373:
1369:
1363:
1362:
1358:
1353:This section
1351:
1347:
1342:
1341:
1333:
1331:
1325:
1321:
1319:
1315:
1311:
1306:
1304:
1300:
1296:
1291:
1288:
1287:abbreviations
1284:
1280:
1265:
1263:
1259:
1251:
1247:
1242:
1238:
1231:
1220:
1218:
1213:
1209:
1207:
1203:
1202:vocal emotion
1199:
1198:browser-based
1195:
1191:
1189:
1185:
1181:
1177:
1176:deep learning
1173:
1169:
1165:
1163:
1144:
1134:
1132:
1126:
1116:
1114:
1110:
1106:
1102:
1098:
1094:
1090:
1080:
1076:
1074:
1070:
1066:
1062:
1057:
1055:
1051:
1047:
1043:
1037:
1027:
1025:
1021:
1017:
1014:
1010:
1006:
1001:
999:
995:
991:
987:
983:
982:screen reader
979:
975:
971:
967:
963:
959:
955:
951:
942:
940:
936:
932:
928:
920:
913:
909:
905:
899:
890:
888:
884:
880:
876:
872:
868:
864:
860:
856:
852:
848:
838:
835:
831:
826:
824:
823:decision tree
820:
816:
812:
808:
804:
800:
796:
792:
788:
784:
780:
776:
772:
768:
758:
753:
743:
741:
739:
734:
729:
726:
723:
722:
717:
707:
705:
700:
698:
694:
689:
685:
681:
679:
678:
673:
669:
665:
664:
659:
655:
651:
647:
646:
641:
637:
633:
629:
628:
623:
620:
616:
612:
608:
604:
600:
596:
575:
568:
564:
562:
558:
554:
536:
534:
529:
524:
522:
517:
515:
511:
507:
503:
499:
495:
491:
487:
483:
482:speech coding
479:
475:
473:
469:
465:
464:
458:
454:
450:
446:
442:
438:
434:
430:
426:
422:
414:
409:
400:
398:
394:
389:
385:
381:
377:
375:
371:
367:
363:
359:
354:
352:
348:
344:
340:
336:
332:
328:
324:
320:
316:
312:
308:
304:
301:In 1779, the
299:
298:(1214–1294).
297:
293:
289:
285:
281:
278:
268:
266:
262:
258:
254:
252:
247:
243:
239:
235:
231:
227:
223:
222:
217:
213:
212:
207:
203:
194:
190:
188:
184:
180:
175:
173:
168:
164:
160:
156:
155:concatenating
151:
149:
145:
141:
137:
133:
129:
125:
121:
117:
113:
107:
105:
79:
62:
48:
44:
40:
33:
28:
19:
7091:Concordancer
7027:
6487:Bag-of-words
6421:Self-voicing
6370:Philip Rubin
6249:Applications
6210:Mockingboard
6039:Amazon Polly
6022:Proprietary
5920:
5763:Sample-based
5644:
5611:
5604:
5592:. Retrieved
5589:synthesia.io
5588:
5579:
5567:. Retrieved
5563:
5554:
5543:. Retrieved
5539:www.vice.com
5538:
5529:
5518:. Retrieved
5514:
5504:
5493:. Retrieved
5481:
5471:
5460:. Retrieved
5456:
5446:
5435:. Retrieved
5423:
5413:
5402:. Retrieved
5398:
5388:
5377:. Retrieved
5372:
5363:
5352:. Retrieved
5350:. 2023-06-20
5347:
5338:
5327:. Retrieved
5318:
5308:
5297:. Retrieved
5288:
5278:
5267:. Retrieved
5257:
5246:. Retrieved
5244:. 2007-05-02
5241:
5232:
5223:
5217:
5205:
5170:
5160:
5125:
5115:
5090:11244/316759
5072:
5068:
5058:
5047:. Retrieved
5043:the original
5038:
5028:
5018:, retrieved
5009:
5002:
4991:. Retrieved
4980:
4969:. Retrieved
4965:
4955:
4944:. Retrieved
4942:. 2019-07-08
4933:
4924:
4906:
4900:
4890:
4872:
4866:
4860:
4849:. Retrieved
4845:the original
4835:
4824:. Retrieved
4814:
4793:
4782:. Retrieved
4771:
4760:. Retrieved
4750:
4739:. Retrieved
4735:the original
4725:
4713:. Retrieved
4709:the original
4698:
4675:
4666:
4655:. Retrieved
4651:
4642:
4631:. Retrieved
4621:
4610:. Retrieved
4603:the original
4590:
4570:
4545:
4541:
4535:
4518:
4514:
4508:
4497:the original
4476:
4472:
4459:
4450:
4441:
4433:the original
4423:
4412:. Retrieved
4402:
4388:
4377:. Retrieved
4365:
4355:
4344:. Retrieved
4340:
4330:
4319:. Retrieved
4307:
4298:
4288:, retrieved
4258:
4248:
4242:. Bloomberg.
4233:
4200:
4196:
4186:
4175:. Retrieved
4135:
4125:
4090:
4084:
4049:
4039:
4028:. Retrieved
4024:
4014:
4003:. Retrieved
3999:
3989:
3978:. Retrieved
3966:
3956:
3945:. Retrieved
3941:
3931:
3920:. Retrieved
3915:
3906:
3895:. Retrieved
3886:
3876:
3865:. Retrieved
3856:
3846:
3825:
3814:. Retrieved
3807:the original
3778:
3774:
3761:
3750:. Retrieved
3746:the original
3736:
3711:
3707:
3686:. Retrieved
3666:
3631:
3625:
3610:RoadBlasters
3560:
3540:
3524:
3518:
3510:
3498:
3489:The Futurist
3488:
3482:
3472:
3466:
3455:. Retrieved
3453:. 2010-09-13
3450:
3441:
3430:. Retrieved
3418:
3409:
3392:
3388:
3382:
3374:
3365:
3354:. Retrieved
3350:the original
3340:
3327:
3314:
3301:
3285:
3260:
3252:
3229:
3220:
3209:. Retrieved
3205:
3195:
3186:
3177:
3158:
3152:
3143:
3133:
3111:
3100:
3080:
3071:
3065:
3049:. Springer.
3046:
3040:
3023:
3019:
3009:
2998:. Retrieved
2973:. Retrieved
2941:
2928:
2894:
2890:
2877:
2859:
2848:. Retrieved
2844:the original
2834:
2822:. Retrieved
2818:the original
2808:
2799:
2789:
2756:
2752:
2746:
2735:. Retrieved
2728:the original
2723:
2719:
2706:
2695:
2691:
2679:
2662:
2658:
2652:
2636:. Springer.
2631:
2623:
2598:
2594:
2588:
2567:
2559:
2454:
2446:
2442:
2427:
2420:
2414:
2413:fandom, the
2408:
2402:
2387:
2376:
2329:
2326:Applications
2317:
2291:A number of
2290:
2278:
2275:Tenth Doctor
2268:
2254:
2240:
2203:
2196:
2190:presented a
2185:
2167:At the 2018
2166:
2128:IBM ViaVoice
2025:
2013:
1998:
1966:applications
1963:
1951:
1942:
1934:
1906:call centers
1896:
1888:Windows 2000
1861:
1846:
1819:
1783:
1762:Snow Leopard
1723:
1685:
1670:
1641:
1614:Intellivoice
1604:
1592:
1586:
1583:
1556:
1478:
1476:
1458:
1454:
1441:
1437:
1405:
1390:
1381:
1366:Please help
1354:
1326:
1322:
1307:
1292:
1276:
1255:
1214:
1210:
1192:
1187:
1171:
1166:
1159:
1128:
1086:
1077:
1058:
1050:Philip Rubin
1039:
1002:
977:
948:
923:/ˌklɪəɹˈʌʊt/
918:
911:
907:
900:
896:
851:phonotactics
844:
827:
764:
755:
736:
732:
730:
725:
719:
715:
713:
701:
690:
686:
682:
675:
661:
657:
649:
643:
638:. The first
631:
625:
619:shoot 'em up
609:produced by
602:
593:
557:Dennis Klatt
550:
525:
518:
512:used in the
476:
466:, where the
461:
420:
418:
378:
366:Homer Dudley
355:
300:
288:Silvester II
284:Brazen Heads
274:
264:
260:
249:
245:
221:tokenization
219:
215:
209:
199:
176:
152:
135:
131:
130:products. A
119:
111:
110:
101:
61:
46:
37:This is the
31:
7048:Topic model
6928:Text corpus
6774:Statistical
6641:Text mining
6482:AI-complete
6350:Gunnar Fant
6328:Researchers
6326:Developers/
6266:Dr. Sbaitso
6074:Readspeaker
5953:Gnopernicus
5743:Subtractive
5348:VentureBeat
4820:"gnuspeech"
4574:EE Times. "
3598:Gauntlet II
3578:Road Runner
2700:(in German)
2396:in various
2156:Yamaha FS1R
2124:OS/2 Warp 4
2022:Open source
2005:Readspeaker
1978:web browser
1774:AppleScript
1248:as well as
1217:tone sandhi
1115:criterion.
1097:vocal tract
1042:vocal tract
1020:Atari, Inc.
998:intonations
935:alternation
919:"clear out"
803:spectrogram
716:naturalness
622:arcade game
472:Dave Bowman
453:Max Mathews
333:-operated "
325:sounds (in
319:vocal tract
296:Roger Bacon
261:synthesizer
253:-to-phoneme
172:vocal tract
7202:Categories
6769:Rule-based
6651:Truecasing
6519:Stop words
6291:Voice font
6256:AOLbyPhone
6155:PPG Phonem
6145:Chipspeech
6084:CoolSpeech
5753:Distortion
5594:12 October
5545:2023-02-03
5520:2023-07-25
5495:2023-07-25
5462:2023-07-25
5437:2023-07-25
5404:2023-04-25
5379:2023-04-25
5354:2023-07-25
5329:2021-01-18
5299:2021-01-19
5269:2010-02-17
5248:2010-02-17
5226:(1). 1984.
5049:2020-04-02
5020:2018-03-02
4993:2016-06-18
4971:2019-09-08
4946:2019-09-11
4916:1802.06006
4882:1806.04558
4851:2010-02-17
4826:2010-02-17
4784:2010-02-17
4762:2010-02-17
4741:2011-01-29
4672:Miner, Jay
4657:2020-04-28
4633:2013-03-24
4612:2012-02-22
4414:2012-02-22
4379:2023-07-25
4346:2022-07-01
4341:PEOPLE.com
4321:2022-06-29
4290:2022-06-29
4177:2022-06-29
4145:2003.09234
4059:1912.10915
4030:2023-07-25
4005:2023-07-25
4000:TechCrunch
3980:2023-07-25
3947:2023-04-25
3922:2023-02-03
3897:2021-01-18
3867:2021-01-19
3837:1910.11997
3816:2011-12-14
3752:2012-02-22
3550:Space Fury
3503:L.F. Lamel
3457:2019-05-23
3432:2019-05-28
3356:2008-05-28
3211:2020-08-23
3127:GamesRadar
3087:US 4326710
3000:2009-07-21
2850:2010-02-17
2824:5 December
2737:2011-12-13
2551:References
2339:and other
2280:Doctor Who
2273:, and the
2265:Fluttershy
2069:Atari 2600
2065:Atari 5200
1884:Windows 98
1880:Windows 95
1856:See also:
1467:See also:
1384:April 2023
1303:homographs
1279:heteronyms
1268:Challenges
1206:intonation
1194:ElevenLabs
1069:Steve Jobs
904:non-rhotic
693:Ann Syrdal
658:zero cross
615:video game
533:a cappella
449:Daisy Bell
386:built the
309:scientist
277:electronic
104:media help
7078:reviewing
6876:standards
6874:Types and
6300:Protocols
6286:PlainTalk
6130:Alter/Ego
6109:LaLaVoice
6104:Voiceroid
5998:eCantorix
5958:Gnuspeech
5775:Wavetable
5649:is being
5569:10 August
5490:0362-4331
5432:1059-1028
5289:AUTOMATON
5197:236982893
5152:250118756
5107:243101945
5099:0738-0569
4822:. Gnu.org
4374:1059-1028
4316:0190-8286
4285:236666289
4225:226196422
4217:1461-4448
4172:214605906
4111:cite book
4103:2209-9689
4076:209444942
3975:1059-1028
3857:AUTOMATON
3566:Star Wars
3427:0040-781X
2913:1932-8346
2473:dysphonic
2465:phonation
2450:Synthesia
2103:BBC Micro
2047:gnuspeech
2009:Pediaphon
1986:RSS-feeds
1836:'s audio
1830:MacinTalk
1750:VoiceOver
1746:PlainTalk
1742:Macintosh
1734:MacInTalk
1648:Macintalk
1420:linguists
1355:does not
1299:heuristic
1256:In 2023,
1252:services.
1109:waveforms
1073:gnuspeech
834:gigabytes
791:sentences
779:morphemes
775:syllables
740:synthesis
627:Stratovox
563:methods.
535:" style.
526:In 1975,
506:Bell Labs
441:Bell Labs
370:The Voder
358:Bell Labs
343:Pressburg
242:sentences
202:front-end
6994:Wikidata
6974:FrameNet
6959:BabelNet
6938:Treebank
6908:PropBank
6853:Word2vec
6818:fastText
6699:Stemming
6317:VoiceXML
6261:DialogOS
6180:Vocaloid
6175:Vocalina
6160:Realivox
6094:CereProc
6054:Talk It!
6032:Speaking
6024:software
5948:eSpeakNG
5937:Speaking
5780:Granular
5748:Additive
5642:template
5373:Press.pl
5323:Archived
5293:Archived
4579:Archived
4562:10491251
4493:46693018
4025:Lifewire
3891:Archived
3861:Archived
3728:26337775
3683:17451802
3606:Paperboy
3594:Gauntlet
3529:Archived
3228:(2005).
3119:Archived
2946:Archived
2917:Archived
2867:Archived
2497:See also
2425:fandom.
2337:dyslexia
2320:VoiceXML
2243:freeware
2235:lip sync
2227:SIGGRAPH
2199:Symantec
2142:Magellan
1990:podcasts
1960:Internet
1892:Narrator
1784:Used in
1688:Atari ST
1527:Icophone
1424:language
1412:grapheme
1408:spelling
1295:semantic
1264:system.
1131:formants
988:, where
974:waveform
847:diphones
819:run time
799:waveform
771:diphones
654:PET 2001
634:), from
595:Handheld
468:HAL 9000
435:used an
397:phonetic
351:Euphonia
251:grapheme
206:back-end
167:diphones
159:database
128:hardware
124:software
43:reviewed
7165:Related
7131:Chatbot
6989:WordNet
6969:DBpedia
6843:Seq2seq
6587:Parsing
6502:Trigram
6384:Process
6205:Echo II
6198:Machine
6185:Xiaoice
6123:Singing
6044:DECtalk
5991:Singing
5977:FreeTTS
5851:Modular
5823:Formant
5767:Sampler
5738:Scanned
5564:elai.io
4935:bbc.com
4715:9 April
3803:7233191
3783:Bibcode
3775:Science
3688:Aug 27,
3570:Firefox
3535:, 1993.
3377:, 1996.
2975:15 July
2781:2958525
2761:Bibcode
2603:Bibcode
2398:fandoms
2386:series
2380:Biglobe
2088:Samsung
2073:Quadrun
1994:podcast
1970:plugins
1954:Android
1948:Android
1864:Windows
1862:Modern
1838:chipset
1822:AmigaOS
1796:AmigaOS
1788:and as
1758:Leopard
1588:Alpiner
1503:plosion
1416:phoneme
1376:removed
1361:sources
1318:corpora
1283:numbers
1184:emotion
1170:uses a
1105:prosody
966:voicing
950:Formant
933:. This
931:liaison
915:/ˈklɪə/
912:"clear"
879:Leachim
855:prosody
787:phrases
738:formant
663:Berzerk
603:Speech+
553:DECtalk
445:vocoder
437:IBM 704
415:in 1999
362:vocoder
331:bellows
271:History
257:prosody
238:clauses
234:phrases
232:, like
7138:(c.f.
6796:models
6784:Neural
6497:Bigram
6492:n-gram
6401:Currah
6375:Yamaha
6271:MBROLA
6220:Phasor
6135:Cantor
5944:eSpeak
5785:Vector
5661:Curlie
5655:
5646:Curlie
5488:
5457:Forbes
5430:
5195:
5185:
5150:
5140:
5105:
5097:
4809:, 2007
4805:
4686:
4560:
4491:
4372:
4314:
4283:
4273:
4223:
4215:
4170:
4160:
4101:
4074:
3973:
3916:Sifted
3801:
3726:
3681:
3638:
3602:A.P.B.
3552:, and
3425:
3273:
3240:
3165:
3144:RePlay
3092:
3053:
2911:
2779:
2640:
2576:
2469:timbre
2416:Portal
2407:, the
2405:fandom
2361:Votrax
2311:) and
2256:Portal
2251:GLaDOS
2173:Google
2152:Yamaha
2146:TomTom
2138:Garmin
2058:Others
2035:eSpeak
1918:Votrax
1912:Votrax
1890:added
1872:SAPI 5
1868:SAPI 4
1780:Amazon
1607:Mattel
1601:Mattel
1594:Parsec
1507:voiced
1285:, and
1016:arcade
990:memory
968:, and
927:French
871:MBROLA
789:, and
767:phones
677:Milton
601:(TSI)
421:et al.
307:Danish
303:German
240:, and
204:and a
163:phones
116:speech
7187:spaCy
6832:large
6823:GloVe
6411:PSOLA
6312:SABLE
6240:TuVox
6114:15.ai
6089:IVONA
6008:Sinsy
5972:Flite
5713:types
5640:‹The
5424:Wired
5399:Wired
5214:(PDF)
5193:S2CID
5148:S2CID
5103:S2CID
4911:arXiv
4877:arXiv
4606:(PDF)
4599:(PDF)
4558:S2CID
4500:(PDF)
4489:S2CID
4469:(PDF)
4366:Wired
4281:S2CID
4221:S2CID
4168:S2CID
4140:arXiv
4072:S2CID
4054:arXiv
3967:Wired
3942:Wired
3832:arXiv
3810:(PDF)
3771:(PDF)
3679:S2CID
3663:(PDF)
2949:(PDF)
2938:(PDF)
2920:(PDF)
2887:(PDF)
2731:(PDF)
2716:(PDF)
2384:anime
2313:SABLE
2277:from
2253:from
2247:15.ai
2179:from
2114:codec
2094:Pro,
2078:Some
2026:Some
1834:Amiga
1786:Alexa
1694:Apple
1654:Atari
1310:above
1168:15.ai
970:noise
867:PSOLA
815:pitch
807:index
805:. An
783:words
695:, at
337:" of
323:vowel
218:, or
142:like
6952:Data
6803:BERT
6225:RIAS
6170:UTAU
5963:Orca
5596:2023
5571:2022
5486:ISSN
5428:ISSN
5183:ISBN
5138:ISBN
5095:ISSN
4803:ISBN
4717:2013
4684:ISBN
4370:ISSN
4312:ISSN
4271:ISBN
4213:ISSN
4158:ISBN
4117:link
4099:ISSN
3971:ISSN
3799:PMID
3724:PMID
3690:2015
3636:ISBN
3590:720°
3423:ISSN
3419:Time
3271:ISBN
3238:ISBN
3163:ISBN
3051:ISBN
2977:2019
2970:IEEE
2909:ISSN
2826:2017
2777:PMID
2638:ISBN
2574:ISBN
2309:JSML
2263:and
2101:The
2090:E6,
1904:and
1882:and
1870:and
1768:, a
1686:The
1605:The
1591:and
1471:and
1359:any
1357:cite
1258:VICE
1204:and
1061:NeXT
1013:Sega
1007:toy
992:and
801:and
735:and
718:and
528:MUSA
500:and
6984:UBY
5765:or
5672:on
5670:BBC
5659:at
5617:doi
5175:doi
5130:doi
5085:hdl
5077:doi
4940:BBC
4550:doi
4523:doi
4481:doi
4263:doi
4205:doi
4150:doi
4064:doi
3791:doi
3779:212
3716:doi
3671:doi
3612:,
3397:doi
3028:doi
2899:doi
2769:doi
2667:doi
2611:doi
2471:of
2297:XML
2225:In
2134:GPS
2122:'s
2120:IBM
2016:W3C
1980:or
1766:say
1732:'s
1625:SAM
1370:by
1235:An
1099:),
908:"r"
869:or
504:at
488:of
341:of
248:or
181:or
165:or
136:TTS
126:or
45:on
7204::
5653:.›
5587:.
5562:.
5537:.
5513:.
5484:.
5480:.
5455:.
5426:.
5422:.
5397:.
5371:.
5346:.
5321:.
5317:.
5291:.
5287:.
5240:.
5224:21
5222:.
5216:.
5191:.
5181:.
5169:.
5146:.
5136:.
5124:.
5101:.
5093:.
5083:.
5073:38
5071:.
5067:.
5037:.
5013:,
4964:.
4938:.
4932:.
4909:,
4907:31
4905:,
4899:,
4873:31
4871:,
4650:.
4556:.
4546:21
4544:.
4519:42
4517:.
4487:.
4477:50
4475:.
4471:.
4449:.
4368:.
4364:.
4339:.
4310:.
4306:.
4279:,
4269:,
4257:,
4219:.
4211:.
4201:23
4199:.
4195:.
4166:.
4156:.
4148:.
4134:.
4113:}}
4109:{{
4070:.
4062:.
4048:.
4023:.
3998:.
3969:.
3965:.
3940:.
3914:.
3889:.
3885:.
3859:.
3855:.
3797:.
3789:.
3777:.
3773:.
3722:.
3712:30
3710:.
3698:^
3677:.
3665:.
3650:^
3616:,
3608:,
3604:,
3600:,
3596:,
3592:,
3588:,
3584:,
3580:,
3576:,
3572:,
3568:,
3548:,
3509:,
3449:.
3417:.
3393:42
3391:.
3373:.
3292:,
3269:.
3236:.
3232:.
3204:.
3185:.
3142:.
3125:,
3024:17
3022:.
2985:^
2968:.
2957:^
2940:.
2915:.
2907:.
2893:.
2889:.
2798:.
2775:.
2767:.
2757:82
2755:.
2724:12
2722:.
2718:.
2661:.
2609:.
2599:70
2597:.
2363:.
2283:.
2259:,
2144:,
2140:,
2086:,
1968:,
1908:.
1886:.
1487:UK
1485:,
1281:,
964:,
941:.
889:.
865:,
825:.
785:,
781:,
777:,
769:,
624:,
376:.
236:,
214:,
150:.
41:,
7142:)
6865:,
6834:)
6830:(
6460:e
6453:t
6446:v
5970:/
5946:/
5913:e
5906:t
5899:v
5703:e
5696:t
5689:v
5676:.
5625:.
5619::
5598:.
5573:.
5548:.
5523:.
5498:.
5465:.
5440:.
5407:.
5382:.
5357:.
5332:.
5302:.
5272:.
5251:.
5199:.
5177::
5154:.
5132::
5109:.
5087::
5079::
5052:.
4996:.
4974:.
4949:.
4913::
4879::
4854:.
4829:.
4787:.
4765:.
4744:.
4719:.
4692:.
4660:.
4636:.
4615:.
4564:.
4552::
4529:.
4525::
4483::
4417:.
4382:.
4349:.
4324:.
4265::
4227:.
4207::
4180:.
4152::
4142::
4119:)
4105:.
4078:.
4066::
4056::
4033:.
4008:.
3983:.
3950:.
3925:.
3900:.
3870:.
3840:.
3834::
3819:.
3793::
3785::
3755:.
3730:.
3718::
3692:.
3673::
3644:.
3620:.
3460:.
3435:.
3403:.
3399::
3359:.
3279:.
3267:3
3246:.
3214:.
3189:.
3171:.
3059:.
3034:.
3030::
3003:.
2979:.
2901::
2895:3
2853:.
2828:.
2802:.
2783:.
2771::
2763::
2740:.
2673:.
2669::
2663:8
2646:.
2617:.
2613::
2605::
2582:.
2488:.
2130:.
2071:(
2053:.
1543:)
1397:)
1391:(
1386:)
1382:(
1378:.
1364:.
1232:.
1095:(
813:(
724:.
648:(
531:"
305:-
134:(
106:.
49:.
20:)
Text is available under the Creative Commons Attribution-ShareAlike License. Additional terms may apply.