<?xml version="1.0" encoding="utf-8"?>
<feed xmlns="http://www.w3.org/2005/Atom">
	<title type="html"><![CDATA[ChipMusic.org - Sega SPEECH synthesis CSM Sounds]]></title>
	<link rel="self" href="https://chipmusic.org:80/forums/feed/atom/topic/11028/"/>
	<updated>2014-07-16T19:39:31Z</updated>
	<generator>PunBB</generator>
	<id>https://chipmusic.org/forums/topic/11028/sega-speech-synthesis-csm-sounds/</id>
		<entry>
			<title type="html"><![CDATA[Re: Sega SPEECH synthesis CSM Sounds]]></title>
			<link rel="alternate" href="https://chipmusic.org/forums/post/213367/#p213367"/>
			<content type="html"><![CDATA[<p>This thread was declared dead almost 1 year ago!</p>]]></content>
			<author>
				<name><![CDATA[Delek]]></name>
				<uri>https://chipmusic.org/Delek</uri>
			</author>
			<updated>2014-07-16T19:39:31Z</updated>
			<id>https://chipmusic.org/forums/post/213367/#p213367</id>
		</entry>
		<entry>
			<title type="html"><![CDATA[Re: Sega SPEECH synthesis CSM Sounds]]></title>
			<link rel="alternate" href="https://chipmusic.org/forums/post/213335/#p213335"/>
			<content type="html"><![CDATA[<p>Awesome!</p>]]></content>
			<author>
				<name><![CDATA[Esopus-dragon]]></name>
				<uri>https://chipmusic.org/Esopus-dragon</uri>
			</author>
			<updated>2014-07-16T15:01:09Z</updated>
			<id>https://chipmusic.org/forums/post/213335/#p213335</id>
		</entry>
		<entry>
			<title type="html"><![CDATA[Re: Sega SPEECH synthesis CSM Sounds]]></title>
			<link rel="alternate" href="https://chipmusic.org/forums/post/180928/#p180928"/>
			<content type="html"><![CDATA[<div class="quotebox"><cite>akira^8GB wrote:</cite><blockquote><p>Just noticed you posted the ROM. Thanks! I&#039;ll give it a play this weekend and hopefully sample some fear-inducing sounds <img src="https://chipmusic.org/forums/img/smilies/big_smile.png" width="15" height="15" alt="big_smile" /></p></blockquote></div><p>Yep ^_^ let me hear those samples when done</p>]]></content>
			<author>
				<name><![CDATA[Aly James]]></name>
				<uri>https://chipmusic.org/Aly+James</uri>
			</author>
			<updated>2013-08-16T11:52:21Z</updated>
			<id>https://chipmusic.org/forums/post/180928/#p180928</id>
		</entry>
		<entry>
			<title type="html"><![CDATA[Re: Sega SPEECH synthesis CSM Sounds]]></title>
			<link rel="alternate" href="https://chipmusic.org/forums/post/180489/#p180489"/>
			<content type="html"><![CDATA[<p>Just noticed you posted the ROM. Thanks! I&#039;ll give it a play this weekend and hopefully sample some fear-inducing sounds <img src="https://chipmusic.org/forums/img/smilies/big_smile.png" width="15" height="15" alt="big_smile" /></p>]]></content>
			<author>
				<name><![CDATA[akira^8GB]]></name>
				<uri>https://chipmusic.org/akira%5E8GB</uri>
			</author>
			<updated>2013-08-13T09:22:42Z</updated>
			<id>https://chipmusic.org/forums/post/180489/#p180489</id>
		</entry>
		<entry>
			<title type="html"><![CDATA[Re: Sega SPEECH synthesis CSM Sounds]]></title>
			<link rel="alternate" href="https://chipmusic.org/forums/post/170350/#p170350"/>
			<content type="html"><![CDATA[<p>So here is some example of how to use the very unique CSM speech.<br />First a reminder :<br />Example of CSM speech in the MSX game Zeilard :<br /><a href="http://www.alyjameslab.com/tutorials/MSX_CSM.wav" target="_blank">http://www.alyjameslab.com/tutorials/MSX_CSM.wav</a><br />Example of CSM random speech in FMDrive vsti<br /><a href="http://www.alyjameslab.com/tutorials/FMDrive_CSM.wav" target="_blank">http://www.alyjameslab.com/tutorials/FMDrive_CSM.wav</a></p><p>Sine waves artifacts are noticeable in both of these examples because there is no decay on the envelope generator.</p><p>Now lets decay very quickly and the artifact are less noticeable at the end.<br />Example of an attempt at saying &quot; HELLO&quot; with fast decay<br />first MIDI data are played very quickly then slowed down.<br /><a href="http://www.alyjameslab.com/tutorials/hellocsm.wav" target="_blank">http://www.alyjameslab.com/tutorials/hellocsm.wav</a></p><p>and for a visual explanation better than words..<br />Here is one instance of FMDrive in Cubase with automation lane and MIDI channels for the special mode..<br />Notice the similarity between the spectrogram and the automation data.</p><p>what you see is power of spectrum and variation of frequency in the time domain.<br />Sum of sine waves at different frequencies gives us what we call vocal formants.<br /><a class="postimg" href="http://www.alyjameslab.com/tutorials/hellocsm.png" title="http://www.alyjameslab.com/tutorials/hellocsm.png" id="forum_image_35593155"><img src="http://www.alyjameslab.com/tutorials/hellocsm.png" /></a></p>]]></content>
			<author>
				<name><![CDATA[Aly James]]></name>
				<uri>https://chipmusic.org/Aly+James</uri>
			</author>
			<updated>2013-06-01T00:34:38Z</updated>
			<id>https://chipmusic.org/forums/post/170350/#p170350</id>
		</entry>
		<entry>
			<title type="html"><![CDATA[Re: Sega SPEECH synthesis CSM Sounds]]></title>
			<link rel="alternate" href="https://chipmusic.org/forums/post/169688/#p169688"/>
			<content type="html"><![CDATA[<p>Keep in mind that is it is one of my test ROM not intended for public release and not specially user friendly <img src="https://chipmusic.org/forums/img/smilies/smile.png" width="15" height="15" alt="smile" /></p><p><span style="color: #00FFDA">CSM MODE ROM:<br />-------------------------------------------------------------------------------------<br />Test mode for FMDrive Vsti dev.<br />Works on a real MD1 and Regen Emulator.<br />Use at your own risk <img src="https://chipmusic.org/forums/img/smilies/smile.png" width="15" height="15" alt="smile" /><br />-------------------------------------------------------------------------------------<br />USE 2 OPERATORS ON CH3: OP2&gt;OP4<br />The ROM starts in NORMAL mode until BUTTON C is pressed<br />(then it will be either in CSM or SPECIAL until ROM reset)<br />A key on to CH3 is set on startup and basic registers set.</span></p><p><span style="color: #FF0330">COMMANDS: on PAD 1 <br />(there is also a command on PAD2 that controls the TL of OP2..<br />cannot remember witch one ^^)<br />-------------------------------------------------------------------------------------<br />START : KEY on/off (OP2 + OP4)</span></p><p><span style="color: #FF0330">A: Pressed Set AR of OP2 to 1F, depressed Set AR to 00<br />So if you want to have OP2 modulating OP4 keep it pressed</span></p><p><span style="color: #FF0330">B: Pressed Key on OP2 and Key off OP4</span></p><p><span style="color: #FF0330">C: Pressed CSM mode (auto key on/off at Timer A speed)<br />Depressed Special Mode (independent FRQ set by RIGHT) </span></p><p><span style="color: #FF0330">LEFT : ALGO change from 0 to 7 then wrap.</span></p><p><span style="color: #FF0330">RIGHT: FRQ change for OP2 (change block. down then wrap)</span></p><p><span style="color: #FF0330">DOWN :FRQ change for OP4 (change block. down then wrap)</span></p><p><span style="color: #FF0330">UP: Timer A period (down then wrap)</span></p><p>DOWNLOAD: <a href="http://www.alyjameslab.com/tutorials/FMDrive_CSM_test03.bin" target="_blank">http://www.alyjameslab.com/tutorials/FM &#133; test03.bin</a></p>]]></content>
			<author>
				<name><![CDATA[Aly James]]></name>
				<uri>https://chipmusic.org/Aly+James</uri>
			</author>
			<updated>2013-05-26T18:25:31Z</updated>
			<id>https://chipmusic.org/forums/post/169688/#p169688</id>
		</entry>
		<entry>
			<title type="html"><![CDATA[Re: Sega SPEECH synthesis CSM Sounds]]></title>
			<link rel="alternate" href="https://chipmusic.org/forums/post/169679/#p169679"/>
			<content type="html"><![CDATA[<div class="quotebox"><cite>boomlinde wrote:</cite><blockquote><p>I did some encoding experiments that would fit this audio mode perfectly -- condensing a sample to its N most prominent sine components over a window of M microseconds. Then you just have to encode the partial number and its amplitude. For 4 components at 25 Hz, you&#039;d probably be able to get it down to 800 bps. Here&#039;s an example of 8 partials at 25 Hz: <a href="https://dl.dropboxusercontent.com/u/5016785/chariots.mp3" target="_blank">https://dl.dropboxusercontent.com/u/501 &#133; ariots.mp3</a>. This has no amplitude quantization, though.</p><p>For speech synthesis, you could severely limit the spectrum with a pre-filter, but I&#039;m not sure what would produce the best overall result for speech.</p></blockquote></div><p>Great work !<br />a frequency range of 100Hz-5000Hz should be enough for speech analysis.<br />I would need something like that to make a nice tool for FMDrive<br />// Wav vocal sample &gt;&gt; FFT analysis &gt;&gt; Formant Freqs + amplitude &gt;&gt; MIDI<br />4 main formants in the speech&gt;&gt; FRQ to operators Fnumber &gt;&gt; midi pitch or midi notes<br />4 levels for the power of spectrum &gt;&gt; Db to operators TL &gt;&gt; midi volume or TL(cc)</p><p>These should output 4 midi files for OP4,3,2 and 1</p><p>can you make something like that ? that would be also cool as a wav sample to midi using each sine waves signals for each most prominent sine...</p>]]></content>
			<author>
				<name><![CDATA[Aly James]]></name>
				<uri>https://chipmusic.org/Aly+James</uri>
			</author>
			<updated>2013-05-26T16:45:53Z</updated>
			<id>https://chipmusic.org/forums/post/169679/#p169679</id>
		</entry>
		<entry>
			<title type="html"><![CDATA[Re: Sega SPEECH synthesis CSM Sounds]]></title>
			<link rel="alternate" href="https://chipmusic.org/forums/post/169678/#p169678"/>
			<content type="html"><![CDATA[<div class="quotebox"><cite>akira^8GB wrote:</cite><blockquote><p>Are you going to release the ROM?</p></blockquote></div><p>thx 4 reminding me, I have forgotten to post it..the ROM should b posted here today <img src="https://chipmusic.org/forums/img/smilies/smile.png" width="15" height="15" alt="smile" /></p>]]></content>
			<author>
				<name><![CDATA[Aly James]]></name>
				<uri>https://chipmusic.org/Aly+James</uri>
			</author>
			<updated>2013-05-26T16:36:08Z</updated>
			<id>https://chipmusic.org/forums/post/169678/#p169678</id>
		</entry>
		<entry>
			<title type="html"><![CDATA[Re: Sega SPEECH synthesis CSM Sounds]]></title>
			<link rel="alternate" href="https://chipmusic.org/forums/post/169651/#p169651"/>
			<content type="html"><![CDATA[<p>I did some encoding experiments that would fit this audio mode perfectly -- condensing a sample to its N most prominent sine components over a window of M microseconds. Then you just have to encode the partial number and its amplitude. For 4 components at 25 Hz, you&#039;d probably be able to get it down to 800 bps. Here&#039;s an example of 8 partials at 25 Hz: <a href="https://dl.dropboxusercontent.com/u/5016785/chariots.mp3" target="_blank">https://dl.dropboxusercontent.com/u/501 &#133; ariots.mp3</a>. This has no amplitude quantization, though.</p><p>For speech synthesis, you could severely limit the spectrum with a pre-filter, but I&#039;m not sure what would produce the best overall result for speech.</p>]]></content>
			<author>
				<name><![CDATA[boomlinde]]></name>
				<uri>https://chipmusic.org/boomlinde</uri>
			</author>
			<updated>2013-05-26T11:39:32Z</updated>
			<id>https://chipmusic.org/forums/post/169651/#p169651</id>
		</entry>
		<entry>
			<title type="html"><![CDATA[Re: Sega SPEECH synthesis CSM Sounds]]></title>
			<link rel="alternate" href="https://chipmusic.org/forums/post/169647/#p169647"/>
			<content type="html"><![CDATA[<p>Are you going to release the ROM?</p>]]></content>
			<author>
				<name><![CDATA[akira^8GB]]></name>
				<uri>https://chipmusic.org/akira%5E8GB</uri>
			</author>
			<updated>2013-05-26T09:36:39Z</updated>
			<id>https://chipmusic.org/forums/post/169647/#p169647</id>
		</entry>
		<entry>
			<title type="html"><![CDATA[Re: Sega SPEECH synthesis CSM Sounds]]></title>
			<link rel="alternate" href="https://chipmusic.org/forums/post/167553/#p167553"/>
			<content type="html"><![CDATA[<div class="quotebox"><cite>Aly James wrote:</cite><blockquote><p>and Here:</p><p><div class="embed_video"><iframe width="560" height="340" src="https://www.youtube.com/embed/W3apkzZQa4E" frameborder="0" webkitallowfullscreen mozallowfullscreen allowfullscreen></iframe></div></p></blockquote></div><p>God damn it, that is TERRIFYING. I love it <img src="https://chipmusic.org/forums/img/smilies/big_smile.png" width="15" height="15" alt="big_smile" /><br />Looking forward to play around with that ROM!</p>]]></content>
			<author>
				<name><![CDATA[akira^8GB]]></name>
				<uri>https://chipmusic.org/akira%5E8GB</uri>
			</author>
			<updated>2013-05-09T16:54:29Z</updated>
			<id>https://chipmusic.org/forums/post/167553/#p167553</id>
		</entry>
		<entry>
			<title type="html"><![CDATA[Re: Sega SPEECH synthesis CSM Sounds]]></title>
			<link rel="alternate" href="https://chipmusic.org/forums/post/167547/#p167547"/>
			<content type="html"><![CDATA[<p>i have updated the previous post with an another example.</p>]]></content>
			<author>
				<name><![CDATA[Aly James]]></name>
				<uri>https://chipmusic.org/Aly+James</uri>
			</author>
			<updated>2013-05-09T16:21:42Z</updated>
			<id>https://chipmusic.org/forums/post/167547/#p167547</id>
		</entry>
		<entry>
			<title type="html"><![CDATA[Re: Sega SPEECH synthesis CSM Sounds]]></title>
			<link rel="alternate" href="https://chipmusic.org/forums/post/167529/#p167529"/>
			<content type="html"><![CDATA[<p>indeed <img src="https://chipmusic.org/forums/img/smilies/smile.png" width="15" height="15" alt="smile" /></p>]]></content>
			<author>
				<name><![CDATA[Aly James]]></name>
				<uri>https://chipmusic.org/Aly+James</uri>
			</author>
			<updated>2013-05-09T15:30:31Z</updated>
			<id>https://chipmusic.org/forums/post/167529/#p167529</id>
		</entry>
		<entry>
			<title type="html"><![CDATA[Re: Sega SPEECH synthesis CSM Sounds]]></title>
			<link rel="alternate" href="https://chipmusic.org/forums/post/167527/#p167527"/>
			<content type="html"><![CDATA[<p>That is super awesome!</p>]]></content>
			<author>
				<name><![CDATA[KeFF]]></name>
				<uri>https://chipmusic.org/KeFF</uri>
			</author>
			<updated>2013-05-09T15:28:20Z</updated>
			<id>https://chipmusic.org/forums/post/167527/#p167527</id>
		</entry>
		<entry>
			<title type="html"><![CDATA[Re: Sega SPEECH synthesis CSM Sounds]]></title>
			<link rel="alternate" href="https://chipmusic.org/forums/post/167524/#p167524"/>
			<content type="html"><![CDATA[<div class="quotebox"><cite>Lazerbeat wrote:</cite><blockquote><p>Ok, Im really sorry but can someone explain what the fuck is going on here? Does the YM2612 have a speech mode in it that nobody used? Is it a kind of primative formant synthesis or something?</p></blockquote></div><p>Absolutely right <img src="https://chipmusic.org/forums/img/smilies/smile.png" width="15" height="15" alt="smile" /><br />It is not really easy to program to actually produce understandable speech&nbsp; but the technology is definitely here in the YM2612...<br />I have made a few video on the FMDRIVE VSti to showcase what you can do with it.<br />I have found some rare use of CSM inside some Game Arts games for MSX:<br />The Silpheed game on PC88 MSX computer featuring a very similar chip than YM2612 wih the exact same CSM feature.<br />In use here to produce the robotic speech:</p><p><div class="embed_video"><iframe width="560" height="340" src="https://www.youtube.com/embed/8hVwAfy88NE" frameborder="0" webkitallowfullscreen mozallowfullscreen allowfullscreen></iframe></div></p><p>and Here:</p><p><div class="embed_video"><iframe width="560" height="340" src="https://www.youtube.com/embed/W3apkzZQa4E" frameborder="0" webkitallowfullscreen mozallowfullscreen allowfullscreen></iframe></div></p><p>The FM sound of YAMAHA has the ability to key-on / key off immediately (some channels) or all channels when the timer A built-in overflows.<br />It is called &quot;CSM speech synthesis mode&quot; and stands for Composite Sinusoidal Modeling.<br />A type of speech coding, CSM speech synthesis is a technique to reproduce with the combination of multiple sine wave, the original data of a vocal sample.&nbsp; </p><p>There is a theory using FFT to &quot;de compose&quot; the frequency content of a signal into a sum a different sine waves, in the time domain with different pitches and volume. <br />Based on this theory, If you play at the same time more than one sine at an appropriate TL volume and frequencies , you can reproduce the waveform similar to the original waveform.<br />YM2612 can output 4 sine with 4 different Frequency and 4 Different TL volume.<br />FMDRIVE Vst uses that with MIDI CH1 , 11, 12, 13 to control Frequency and Volume an additional CH 14 to control the timer A.<br />You can also midi learn these controls to any midi controller and you&#039;re good for some live talking shit <img src="https://chipmusic.org/forums/img/smilies/smile.png" width="15" height="15" alt="smile" /></p><p>This mode is also useful to output new type of sounds similar to having a powerful filter on board...and that is what is very interesting in addition to the speech thing.<br />My testing have shown some really cool stuff <img src="https://chipmusic.org/forums/img/smilies/smile.png" width="15" height="15" alt="smile" /></p>]]></content>
			<author>
				<name><![CDATA[Aly James]]></name>
				<uri>https://chipmusic.org/Aly+James</uri>
			</author>
			<updated>2013-05-09T15:03:07Z</updated>
			<id>https://chipmusic.org/forums/post/167524/#p167524</id>
		</entry>
</feed>
