Skip site navigation (1)Skip section navigation (2)

FreeBSD Manual Pages


home | help

ADINTOOL(1)							   ADINTOOL(1)

	  - a tool to record / split / send / receive audio streams

       adintool	{-in inputdev} {-out outputdev}	[options...]

       adintool	analyzes speech	input, finds speech segments skipping silence,
       and records the detected	segments in various ways. It performs speech
       detection based on zerocross number and power (level), and records the
       detected	parts to files or other	output devices sucessively.

       adintool	is a upper version of adinrec with various functions.
       Supported input device are: microphone input, a speech file, standard
       tty input, and network socket (called adin-net server mode). Julius
       plugin can be also used.	Detected speech	segments will be saved to
       output devices: speech files, standard tty output, and network socket
       (called adin-net	client mode). For example, you can split the incoming
       speech to segments and send them	to Julius to be	recognized.

       Output format is	WAV, 16bit (signed short), monoral. If the file
       already exist, it will be overridden.

       All Julius options can be set. Only audio input related options are
       treated and others are silently skipped.	Below is a list	of options.

   adintool specific options
	-freq  Hz
	  Set sampling rate in Hz. (default: 16,000)

	-in  inputdev
	  Audio	input device. "mic" to capture via microphone input, "file"
	  for audio file input,	and "stdin" to read raw	data from
	  standard-input. For file input, file name prompt will	appear after
	  startup. Use "adinnet" to make adintool as "adinnet server",
	  receiving data from client via network socket. Default port number
	  is 5530, which can be	altered	by option "-inport".

	  Alternatively, input device can be set by "-input" option, in	which
	  case you can use plugin input.

	-out  outputdev
	  Audio	output device store the	data. Specify "file" to	save to	file,
	  in which the output filename should be given by "-filename". Use
	  "stdout" to standard out. "adinnet" will make	adintool to be an
	  adinnet client, sending speech data to a server via tcp/ip socket.
	  When using "adinnet" output, the server name to send data should be
	  specified by "-server". The default port number is 5530, which can
	  be changed by	"-port"	option.

	-inport	 num
	  When adintool	becomes	adinnet	server to receive data (-in adinnet),
	  set the port number to listen. (default: 5530)

	-server	 [host]	[,host...]
	  When output to adinnet server	(-out adinnet),	set the	hostname. You
	  can send to multiple hosts by	specifying their hostnames as
	  comma-delimited list like "host1,host2,host3".

	-port  [num] [,num...]
	  When adintool	send a data to adinnet server (-out adinnet), set the
	  port number to connect. (default: 5530) For multiple servers,
	  specify port numbers for all servers like "5530,5530,5531".

	-filename  file
	  When output to file (-out file), set the output filename. The	actual
	  file name will be as "file.0000.wav" , "file.0001.wav" and so	on,
	  where	the four digit number increases	as speech segment detected.
	  The initial number will be set to 0 by default, which	can be changed
	  by "-startid"	option.	When using "-oneshot" option to	save only the
	  first	segment, the input will	be saved as "file".

	-startid  number
	  At file output, set the initial file number. (default: 0)

	  Exit after the end of	first speech segment.

	  Do not perform speech	detection for input, just treat	all the	input
	  as a single valid segment.

	  Output as RAW	file (no header).

	  When output to adinnet server, adintool enter	pause state at every
	  end of speech	segment. It will restart when the destination adinnet
	  server sends it a resume signal.

	  When output to multiple adinnet server, not to do strict
	  synchronization for restart. By default, when	adintool has entered
	  pause	state, it will not restart until resume	commands are received
	  from all servers. This option	will allow restart at least one
	  restart command has arrived.

	-rewind	 msec
	  When input is	a live microphone device, and there has	been some
	  continuing input at the moment adintool resumes, it start recording
	  backtracking by the specified	milliseconds.

   Concerning Julius options
	-input	{mic|rawfile|adinnet|stdin|netaudio|esd|alsa|oss}
	  Choose speech	input source. Specify 'file' or	'rawfile' for waveform
	  file.	On file	input, users will be prompted to enter the file	name
	  from stdin.

	  'mic'	is to get audio	input from a default live microphone device,
	  and 'adinnet'	means receiving	waveform data via tcpip	network	from
	  an adinnet client. 'netaudio'	is from	DatLink/NetAudio input,	and
	  'stdin' means	data input from	standard input.

	  At Linux, you	can choose API at run time by specifying alsa, oss and

	-lv  thres
	  Level	threshold for speech input detection. Values should be in
	  range	from 0 to 32767. (default: 2000)

	-zc  thres
	  Zero crossing	threshold per second. Only input that goes over	the
	  level	threshold (-lv)	will be	counted. (default: 60)

	-headmargin  msec
	  Silence margin at the	start of speech	segment	in milliseconds.
	  (default: 300)

	-tailmargin  msec
	  Silence margin at the	end of speech segment in milliseconds.
	  (default: 400)

	  This option enables DC offset	removal.

	-smpFreq  Hz
	  Set sampling rate in Hz. (default: 16,000)

	  Record input with 48kHz sampling, and	down-sample it to 16kHz
	  on-the-fly. This option is valid for 16kHz model only. The
	  down-sampling	routine	was ported from	sptk. (Rev. 4.0)

	-NA  devicename
	  Host name for	DatLink	server input (-input netaudio).

	-adport	 port_number
	  With -input adinnet, specify adinnet port number to listen.
	  (default: 5530)

	  Julius by default removes successive zero samples in input speech
	  data.	This option stop it.

	-C  jconffile
	  Load a jconf file at here. The content of the	jconffile will be
	  expanded at this point.

	-plugindir  dirlist
	  Specify which	directories to load plugin. If several direcotries
	  exist, specify them by colon-separated list.

	  (using mic input with	alsa device) specify a capture device name. If
	  not specified, "default" will	be used.

	  (using mic input with	oss device) specify a capture device path. If
	  not specified, "/dev/dsp" will be used.

	  Try to set input latency of microphone input in milliseconds.
	  Smaller value	will shorten latency but sometimes make	process
	  unstable. Default value will depend on the running OS.

       Record microphone input to files: "data.0000.wav", "data.0001.wav" and
       so on:
       Split a long speech file	"foobar.raw" into "foobar.1500.wav",
       "foobar.1501.wav" ...:
       Copy an entire audio file via network socket.
       Detect speech segment, send to Julius via network and recognize it:

	julius ( 1 ) ,
	adinrec	( 1 )

       Copyright (c) 1997-2000 Information-technology Promotion	Agency,	Japan

       Copyright (c) 1991-2008 Kawahara	Lab., Kyoto University

       Copyright (c) 2000-2005 Shikano Lab., Nara Institute of Science and

       Copyright (c) 2005-2008 Julius project team, Nagoya Institute of

       The same	as Julius.

				  10/02/2008			   ADINTOOL(1)


Want to link to this manual page? Use this URL:

home | help