Вы находитесь на странице: 1из 28

Your computer must be in Japanese locale first.

Go to Control Panel - Regional and Language Administrative and in "Languages for Non-Unicode Programs" change it to Japanese locale.
Restart the computer, then go to the next step.
Go to http://utau2008.xrea.jp/ and click "v0.2.76 " which is the installer. It seems
that you have not installed UTAU and just used the zip file. The zip file is for you installed
UTAU already but have trouble upgrading their older versions and such. Then just install
normally.
Now to apply an English patch to UTAU, first go to
http://www.mediafire.com/file/mttooatmn5 and download the file. Extract those files in
another folder which already exists. Then create a new folder with the name "res". Cut and paste
the extracted files into the new folder. Then cut and paste the res folder to C:\Program Files
(x86)\UTAU or wherever you installed UTAU.
So now you have UTAU installed and in English! Don't take off the Japanese locale, though. The
locale is helpful when getting files that are named in Japanese (if you don't have the locale set it
comes out in gibberish = bad). And the English patch just covers up the menus, so you get the
idea.

Contribute

Share
WatchlistRandom pageRecent changes

UTAU User Manual - 3


Edit
Talk 0

3,437pages on
this wiki

UTAU manual TOP > 3. Audio playback

Previous: 2. Entering notes, rests and lyrics

--------------------------------------------------------------------------------

Contents
[show]

3. Audio playbackEdit

--------------------------------------------------------------------------------

3-1. Playing, pausing and stopping audioEdit

--------------------------------------------------------------------------------

UTAU User Manual - 3

UTAU Operation Tutorial

01 - Introducing UTAU 02 - Entering notes, rests and lyrics 03 - Audio playback 04 - Exporting
audio to a WAV file 05 - Saving projects to a UST file 06 - Importing and exporting MIDI/VSQ files 07
- Note properties and flags 08 - "A la carte" Tonal mark features 09 - Envelope and Vowel Blending
10 - Setting voice banks 11 - Adjusting the Pitch 12 - Mode2 features 13 - Properly using various

pitch files 14 - Extending features with Plug-ins 15 - Continuous sound sources 16 - Other useful
features

Once you have selected all the notes you want to playback (in pink), press the Play button (1) (or the
F5 key). A command prompt window opens, and the WAV file is generated by an external program
(resampler.exe). After generation, the command prompt window closes automatically, and the audio
starts playing. (It takes some time before playing. Please wait for the command prompt window,
without tampering with it.)

'Note': The non-selected notes (in blue) are not played when pressing the Play button. Make sure to
press the Play button only after selecting all the note range you want to play. (You can select all the
notes with "Ctrl" + "A")

For the method to save the played voice to a WAV file, please refer to 4. Exporting audio to a WAV
file.

Starting from UTAU version 0.2.61, you may use resampler.dll to generate the WAV file. When using
resampler.dll, a progress bar indicating the generation progress is displayed while generating the WAV,
instead of a command prompt screen.
How to select resampler.dll -> Using resampler.dll instead of resampler.exe.
Press Shift + F5 if you want to replay an already played part. (You can play immediately because this
uses cached files that were created when playing the first time).
If you want to temporarily stop the playback, press the Pause button (2) (or the F6 key). Press the Play
button again to restart playing from where you paused.
To stop playback, press the Stop button (3) (or the F7 key). Press the Play button again to restart
playing from the beginning.

--------------------------------------------------------------------------------

3-2. Fixing sound dropsEdit

--------------------------------------------------------------------------------

If you can not play at all, please refer to 3-3. Troubleshooting: cannot play at all.

If the sound of some notes can't be heard during playback, please consider the following possible
causes.

1. There is no primary sound corresponding to the lyrics entered with the note.

2. The specified higher/lower-pitched tone doesn't exist in prefix.map.

3. The lines of the envelope are crossing (this occurs in Ver.0.2.32 and before).

1. There is no primary sound corresponding to the lyrics entered with the note.Edit

The types of primary sounds included in a voice library may differ. In particular, infrequently used
sounds like e.g. "vu" or "ti" are recorded only in a limited number of voice libraries.
(Be careful that there are also voice banks where the more frequently used sound "wo" is
absent, too.)

You can check the types of primary sounds that are recorded in a voice bank by displaying their list with
"View" -> "Voices List" . If a sound is not listed, please substitute a
sound having a similar pronunciation.

2. The specified higher/lower-pitched tone doesn't exist in prefix.map.Edit

In UTAU, there is a functionality to automatically use a higher/lower-pitched primary WAV sound file
(WAV files named e.g. "a+" or "a-"), depending on the tune height, according to the
settings of the prefix.map file. However, depending on the voice bank, there are cases where the
higher/lower-pitched WAV files are totally absent from the voice library. Thus, when prefix.map
references a non-existing primary sound file, this may result in voice drop.
To protect against voice drops caused by the prefix.map file referencing non-existing primary sound
files, select "Options" from "Tools" , check "Check voice file existance on
rendering to prevent voice drop (slower rendering)"
in the "General" tab of
the Options screen, then press "OK".

Reference: 13-1. Automatically using different tone files (prefix.map functionality)

3. Envelope lines are crossingEdit

09/02/22 postscript: this has been fixed in Ver.0.2.33, now the sound rendering is forced even if the
envelope lines are crossing.

Download the latest version at http://utau2008.xrea.jp/index.html

Click the "~" button at the bottom left of the main screen to switch to the mode in which the envelope
of the notes (straight lines showing changes in the volume fade-in/fade-out) are displayed. In this screen
state, if a "!" mark is displayed on a note, like in the image (1) below, it indicates that no sound can be

produced because the envelope lines are crossed like in the character "", due to an error in versions
up to Ver.0.2.32.

Select the problematic note, select "Envelope" from the right-click menu like in the
image (2): the envelope management screen appears, similar to the image below. The situation in which
the envelope lines become crossed like in the character "", appears when a note with a mountainshaped envelope with large fade-in/fade-out parts and a narrow constant volume part, is very
shortened.

Starting from Ver.0.2.36 of UTAU, you can correct the mountain-shaped envelope by pressing the
"Normalize" button at the bottom of the envelope screen.

For the Envelope Screen user guide, please refer to -> 9. Envelope and vowel blending

How to manually fix an envelope in Ver.0.2.35 and earlier versions.

First, temporarily enlarge the problematic note, and re-open the Envelope window. To fix the mountainshaped envelope, like in the image (4) below, drag the red dots laterally, fix to a trapeze-shaped
envelope where the fade-in/fade-out time is short like in the image (5) below, and the constant volume
part (the part with the parallel line) is large, then shorten the note again.

Back to Top

--------------------------------------------------------------------------------

3-3. Troubleshooting: cannot play at allEdit

--------------------------------------------------------------------------------

1. Did you select the range of notes you want to play before pressing the play button?

2. In the Project Settings, did you set a voice synthesis engine other than the standard (generic) version
(e.g. resampler5.exe, resampler7.exe, resampler8.exe)?

3. Are you using Virus Buster?

4. Not playing after upgrading to Ver.0.2.30

1. Did you select the range of notes you want to play before pressing the play button?Edit

In UTAU, only the selected notes are rendered and played, thus all the unselected (blue) notes will not
be played when pressing Play. Make sure to press the Play button only after selecting all the note range
you want to play. (You can select all the notes with "Ctrl" + "A")

For reference -> 3-1. Playing, pausing and stopping audio

2. In the Project Settings, did you set a voice synthesis engine other than the standard (generic) version
(e.g. resampler5.exe, resampler7.exe, resampler8.exe)?Edit

UTAU provides the standard generic version of UTAU's voice synthesis engine, resampler.exe, but the
older generic version resampler5.exe and the development versions resampler7.exe and resampler8.exe
are still available. If the old generic version or the development version of the engine is specified in
"Tools 2 (resample)" resample in the program settings
screen, and if the corresponding engine is no present in the UTAU folder containing resampler.exe, the
playback is not possible. (Be especially careful when using UST files created by others persons)

To download the old generic engine or the development engines (as mentioned in the "old version
download page") -> http://utau2008.xrea.jp/oldversions.html

What to do if you want to use the generic version of the engine

Press the "Initialize Tools" button in the lower right of the Project Settings screen,
and set the generic version resampler.exe in "Tools 2 (resample)" resample.

In addition, you can change the type of engine that is setup when pressing the "Initialize Tools"
button of the Project Properties screen. It is in the field "Tools 2 (resample)"
resample of the "Tools" -> "Options" -> "Path" tab.

3. Are you using Virus Buster?

Edit

Because of improper changes in the monitoring functions of security softwares like Virus Buster,
the execution of the external programs temp.bat and temp_helper.bat can be restricted, thus
making impossible the WAV generation.
Remedy 1. Use resampler.dll instead of using resampler.exe. (Implemented starting from Ver0.2.61)
Edit

Starting from Ver0.2.61, you may now select resampler.dll instead of resampler.exe, which is
started as an external program.
By using resampler.dll, the WAV generation can be done without being affected by security
softwares.
Setup method 1. Select the menu "Project" -> "Project Properties"
to open the "Project Settings" screen, then
check "Use resampler.dll for this project" resampler.dll.
Setup method 2. Select the menu "Tools" -> "Options" , open the
"General" tab of the "Options" screen, then check "Use
resampler.dll for rendering" resampler.dll.

Note 1. Even if it is set in the "Project Setup" screen, this is not


saved in the project. However, it is saved if the similar but UTAU appli-wide setting is set
in the "Options" screen.
Note 2. In Ver0.2.61, executing resampler.dll can be unstable, thus we recommend that you
use Ver0.2.70 or better.
Reference -> About UTAU UTAU Ver0.2.60 now 4. Not using batch files (using resampler.dll)

Remedy 2. Option for not using batch files when generating WAV files.

Edit

Select "Options" from the "Tools" menu, open the "General"


tab of the Options screen, check "No batch file for rendering (not recommended)"
WAV then press the "OK" button to allow for
playback. In addition, just after starting UTAU, there may be cases where you can not play even
if "No batch file for rendering (not recommended)" WAV
is checked. Uncheck "No batch file for rendering (not recommended)" WAV
, enter the desired notes, select the notes, then press
the Play button (the command prompt window appears briefly, but there is no playback at this
time.) Check it again after that, and you can play again.
Note When using batches, different WAV files are potentially generated, thus it is not
recommended to use this option unless absolutely necessary.

Ultimate remedy
In order to be able to play while using the default batch, you will need to proceed as described
below. It is not fixed by just uninstalling Virus Buster.
If you are using WindowsXP Professional
In "Control Panel" -> "Administrative Tools" -> "Local security policy" -> "Security Settings" ->
"Software restriction policy" -> "Additional rules", remove the restriction by setting "temp.bat"
and "temp_helper.bat" as not restricted.
Reference -> http://itpro.nikkeibp.co.jp/article/COLUMN/20080226/294814/
If you are using WindowsXP Home Edition

As you can't set "local security policy" in WindowsXP Home Edition, please perform a clean
installation of the OS. (Caution: as the clean install will erase all your data, please be sure to
backup it all. Also, after the clean installation, we recommend that you use a security software
other than Virus Buster.)
Reference -> http://www5.plala.or.jp/papa_mama_pclife/winXP/winxp_3_2.html
4. Not playing after upgrading to Ver.0.2.30

Edit

Please refer to -> http://utau2008.blog47.fc2.com/blog-date-200901.html

Select the note range on which you want to apply the "a la carte" tonal marks, the select the
menu "Tools" -> "Built-in tools" -> "A la carte"
to open the Settings screen.

Note: You can't open the Settings screen when no note is selected. Furthermore, when you want
to apply to all the notes, you can use the "All" button in the "A la carte" settings
screen: select at least one suitable note then open the configuration screen.
8-3. Performing various tone marks in the "A la carte" configuration screenEdit

--------------------------------------------------------------------------------

"A la carte" setting screen (for explanation purpose, each tonal mark feature is surrounded by a
numbered red frame)

1). Blending Vowels

Edit

Vowel blending is a tonal mark technique that smoothes the connection of sounds by overlapping
the end portion of consonant notes like e.g. "ka, sa" with the head portion of a
vowel note like e.g. "a, i, u, e, o, n" that comes immediately
after.
Press the vowels you want to blend among the 6 buttons "a, i, u, e, o, n"
and set the checkbox on the left side of "Connect vowels smoothly to previous note!"
to apply this feature. (It is recommended to
apply this feature to all the vowels. In addition, you can extend the type of involved vowels by
entering other vowels in the "Others" text box. You should add "wo" here.)
You can also choose the blending strength from the three levels "Tightly" ,
"Medium" , "Slightly" .
Tightly: this should be individually applied when combining vowels of the same type like e.g.
"ka" + "a" .
Medium: there is generally no problem with this choice.

Slightly: this is recommended when the tempo is fast, or with a high-speed song with numerous
short notes.
More precise settings are possible with the dedicated vowel blending tool -> 9-4. Blending
vowels

(2). Portamento

Edit

Portamento is a tonal mark technique that smoothes the connection of sounds by extending the
time for changing the pitch (the height of the musical interval) between two notes.
Select the checkboxes on the left side of "Rising Note" and "Falling
Note" to apply this feature. It is generally better to apply this feature to
both the raising and falling pitch, but if you are concerned with e.g. a high-speed song, apply
only "Rising Note" .
There are three adjustable factors: "Change shape" , "Timing"
and "Change speed" .
Change shape : select either of the "Straight" (linear)
and "Curve" (S-curved) radio buttons. "Curve" is recommended because it is
smoother.
Timing : with "Fast" , the pitch change ends at the boundary between
both notes. With "Medium", the time to change the pitch is equally distributed around the
boundary between both notes. With "Slow", the pitch change starts at the boundary between both
notes. Select "Medium" in the general case, especially when you want to link facial expresions
(feelings) to the voice, and select "Fast" or "Slow" for particular cases.
Change speed : you should use "Fast" for a high-speed
song with a fast tempo, and "Medium" for quiet tunes like e.g. ballads. In particular, set this up
individually when you want to link facial expresions (feelings) to the voice. Furthermore, the
setup for this factor is common for both the raising and falling pitch.
More precise settings are possible with the dedicated portamento adjustment tool -> 11-2.
Adjusting the portamento 12-3. Adjusting the portamento in Mode2

(3). Vibrato

Edit

The vibrato is a tonal mark technique that vibrates nicely, especially the second half part of the
pronunciation, by rippling the pitch curve.
Select the checkbox on the left side of "Add Vibrato!" to apply this
feature.
There are three adjustable factors: "Depth" , "Frequency" and
"Duration" .
Depth : the depth of the pitch curve amplitude can be adjusted with four levels. The
actual magnitude of the pitch curve are "Shallow" = 10cent (1/10 of semitone),
"Little" = 20cent (1/5 of semitone), "Medium" = 50cent (1/2 of
semitone), "Deep" = 70cent (7/10 of semitone). As the sound becomes rather
mechanical when vibrato is applied uniformly, it is better to set "Little" on all the notes, and to
deepen it individually only on end of words and more especially on the notes you want to
highlight.
"Frequency" : the period of the pitch curve can be adjusted with four levels. It is
better to set it to "Medium" in the general case, to "Fast" on the
portions where the vibrato is short, and to "Slow" or "Very slow"
on the long parts.
"Duration" : the proportion of the note length during which the vibrato applies can
be adjusted with five levels. The actual percentages are "A bit" = 30%, "Medium" = 50%,
"Much" = 65%, "More" = 75%, and "Mostly" = 90%. It is best to select "A bit", "Medium" and
"Much" for short notes up to a half-note, and to select "More" and "Mostly" for notes longer than
a whole note.
More precise settings are possible with the dedicated vibrato adjustment tool -> 11-3.
Adjusting the vibrato 12-5. Adjusting the vibrato in Mode2

(4). Modulation

Edit

The modulation is the feature that adjusts the extent of the change of pitch at the beginning of the
pronunciation conveyed by the primary sounds. (This corresponds to setting the depth of the
bend in VOCALOID2.)
Left clicking on the "Modulation" area on the lower left of the settings
screen increases it by 10%, while right-clicking decreases it by 10%. Set it generally to 0%,

because a high modulation value makes it quite off-key. (With voice banks whose pitch is stable
like e.g. Nagone Mako, 10% is fine.)
In addition, it can also be configured in 1% increments in the "Notes Properties" screen. -> 7-2.
The settings of the "Notes Properties" screen

(5). Saving the Settings -> Please refer to 8-5. Saving the settings.

Edit

Back to Top

8-4. Applying the configured tonal marks

1. If you want to apply the settings only to the notes that were selected when opening the
configuration screen, press the "Inside Region" button.
2. If you want to apply the settings to all the notes, press the "All" button. In the popup screen depicted below that appears, press "Yes" to apply to all to the notes.

3. To restore each setting to its original state, press the "Revert" button.
However, with this button you can not revert the changes applied to notes when pressing the
"Inside Region" or "All" buttons. In that case, select "Undo A la
carte" from the "Edit" menu to revert the notes to the
state they had before the "A la carte" tonal mark changes.

4. If you want to cancel the "A la carte" tonal marks, press the "Cancel" button
to close the window.

In the "Save settings" text box in the upper right of the "A la carte" settings
screen, type an appropriate save name then press the "+" button to save the current configuration
in the field below.
To recall a saved configuration, click on the save name in the field below.
To remove a saved configuration, click on the save name in the field below, then press the "-"
button.
13-1. Automatically using the proper pitch file (Prefixmap)
When changing the pitch of the primary sounds to the scale of each note, UTAU adjusts the
formants so as to reduce as much as possible the degradation of sound quality, but there are
limits to the pitch heights to which it can change while minimizing the degradation of the sound
quality. Therefore, in order to maintain the degree of pitch change within a defined range,
numerous published voice banks are compiled with multiple pitches (2 to 6 pitches).
To distinguish pitch differences between primary sounds, a method is to attach to the WAV
filename a symbol (Suffix) indicating the type of pitch, behind the lyrics, like e.g. "a+.wav"
.wav or "a-.wav" .wav. Another method is to use separate folders for each pitch, like
e.g. "F4\a.wav" F4.wav or "F3\a.wav" F3.wav. But as properly chosing and
appending the suffix or the folder name to each manually entered lyric would take a lot of time,
UTAU comes with the Prefix.map function, that sets the type of WAV file to use for each pitch

value. You just need to enter the lyrics only, and Prefixmap automatically selects among the
various pitch files.
Opening the Prefixmap Editor Screen
Select "Edit prefixmap" prefixmap from the "Tools" menu.

Prefixmap Editor registration procedure

From the table (the part (1) enclosed in a red frame) displaying the Prefix/Suffix assigned to each
note, click and select the notes to set, enter the prefix and suffix in the input areas "Prefix"
Prefix (3) and "Suffix" Surffix (4), then press the "Set" button
(5). Finally, press the "OK" button (9) to close the "Prefixmap Editor" screen and save the
settings.
Prefixmap Editor table view
If a symbol (suffix) is attached to a primary sound file name, like e.g. "a+.wav" .wav or
"a-.wav" .wav and differentiates the pitch height, it is displayed in the rightmost column

of the table. If primary sounds are separated one pitch per folder, the various keys folder paths
(for example, if the primary sound's folder is F4, "F4\") are displayed in the central column.
However, it stays blank if the distributor of the primary sound file did not set it up in advance.
How to select the pitch to assign
If you want to select contiguous notes, click on both ends of the note range you want to select
while pressing the Shift key. (Example: Clicking A3 and B3 while pressing the Shift key selects
A3 .. B3). It is also possible to select separated notes simultaneously, like e.g. G3, by clicking
while pressing Ctrl at the same time. If you want to select all the notes, press the "Select All"
button (7).
Click the notes you want to assign in the table: they become selected (dark blue). The note scale
is displayed in the "Key" field (2), but only if you selected just one note.
Remarks

It you want to cancel the settings already done, press the "Clear" button (6).

When you press the "Set" button, settings are applied and displayed in the
table but, please note that settings are not saved if you don't press the "OK" button
(9).

If you want to cancel the data that was set and resume settings, press the "Reload"
button (8).

If you want to cancel the data that was set and terminate settings, press the "Cancel"
button (10) to close the Prefixmap Editor screen.

For the Prefixmap registration method for voicebanks organized with one folder per
pitch, please refer to About Prefix.map in the UTAU voice synthesis@wiki.

If a symbol (Suffix) is entered in a note with the "?" mark, the Prefixmap assignment
becomes invalid. Please refer to 13-2. Manually using the proper pitch file (SuffixBroker
function) below for details.

13-2. Manually using the proper pitch file (SuffixBroker)

Edit

If you want to attach to an individual note the symbol (Suffix) of a different pitch than the one
specified by Prefixmap, or if you want to select a primary sound file based not on the pitch but
e.g. on the strength of the sound, use the SuffixBroker functionality to easily enter a symbol
(Suffix) to the note and change the primary sound file.

Opening the SuffixBroker settings screen


Select one or more of the notes you want to set, then select "Built-in Tools"
-> "SuffixBroker" from the "Tools" menu.

Basic usage
For example, suppose you want to specify "i+.wav" .wav while in Prefix.map the
unlabeled "i.wav" .wav or a symbol other than "+" (like e.g. "i-.wav" .wav
) is specified. Select "+" in the pull-down menu on the left side of the SuffixBroker
Settings screen, then press "OK" to close the Settings screen: a "+" is now entered in the
selected "i" note, which is now sounding the "i+.wav" .wav sound file.

If you want to select a symbol not present in the pull-down menu, like e.g. "++" , you
can append and register it at the bottom of the pull-down menu by directly entering the symbol in

the pull-down menu and pressing the "+" button. Also, selecting in the pull-down menu a symbol
no longer in use and pressing the "-" button makes it disappear from the pull-down menu.

If you want to delete the symbol from a note to which a symbol like e.g. "+" or "-"
is attached, and validate Prefixmap's assignment, select the corresponding note, open the
SuffixBroker Settings screen, check "Remove Prefix" then press "OK" to close the screen. In the
SuffixBroker of older versions of UTAU there is no "Remove Prefix", but you can select "none"
or a blank space from the pull-down menu instead.

6-1. Importing MIDI/VSQ files to UTAU


UTAU, like VOCALOID2, has the ability to enter notes imported from standard MIDI files or
VSQ files (VOCALOID2 project files). Entering notes is easier with a MIDI sequencer software
like e.g. Domino, thus this method is recommended for entering the rough notes.

1. Use a MIDI sequencer software like Domino and create a standard MIDI (*.mid) file (also
named SMF file), or use the VOCALOID2 editor and create a VSQ file.

Please refer to the following website for Domino download and User Manual.

Domino Download page -> MIDI music editor software download "Domino" TAKABO SOFT
(new window)

Domino operating course and introductory definition files -> MIDI course MimiCopy for
beginners (new window)

2. In UTAU main screen's menu, select "File" -> "Import" .

3. In the Import screen, select the MIDI file or VSQ file you want to import, in the file type
select "SMF file format" SMF or "VSQ file format" VSQ
, then press "OK".
For Standard MIDI files

For VSQ files

4. Select the track containing the data you want to import, then press "OK".

5. If you import standard MIDI files like in the image below, "a" will automatically be entered for
all the notes, thus enter the lyrics using the Lyrics Replace function.

Also, if you import a VSQ file, lyrics are entered as-is. However, lyrics in VSQ files are not only in hiragana
but in katakana and Roman letters too, and may also use english or phonetic symbols, thus please
correct the lyrics if need be.
Reference -> 2-10. Changing the lyrics

Note: If you import a standard MIDI file or a VSQ file, the tempo, velocity and pitch bend information
too are applied, but by design, UTAU cannot change the tempo in the middle of a note, therefore
information for tempo change in the middle of a note is ignored. Tempo changes in the middle of a rest,
however, are applied by automatically splitting the rest. In addition, because the velocity of VSQ files is
being used to set the length of the consonants and not the volume, it is not applied to the volume. (It is
not applied to consonant speed () either. However, it is applied in older versions of UTAU, and
importing may result in variations of the volume. In this case, please align once the volume of all the
notes.)
Reference -> 2-11. Changing the volume

Pitch bend in VSQ files is applied to the pitch data of UTAU's Mode1, but as this produces a lot of
discrepancies, it seems better to fix and render in Mode2. (Because pitch data is not applied to Mode2
pitch data.)
Reference -> 12. Mode2 features

Вам также может понравиться