Skip to content
David Wood edited this page Aug 27, 2022 · 1 revision

label

Classifies sounds using a model to produce a metadata-formatted labeling of the
sounds on stdout.  While the -clipLen option is not strictly required, it is 
typically used to classify sub-segments of the input sound(s).  Segments from
the same file that are adjacent in time and have the same label name as 
produced by the model will be merged.
  -file file : specifies the file containing the model to load.
  [list of wav files] : specifies 1 or more .wav files without labeling.
     File names are space-separated.  The can not be used with the
     -sounds option.
  -sounds csv list of (dir|metadata.csv) : specifies 1 or more metadata.csv
     files referencing sounds or directories containing a metadata.csv.
     This is an alternative to a list of wav files, but adds labels from 
     metadata files.
  -metadata (all|some) : require that all files listed in the metadata file 
      to be present. Only used with -sounds option. 
      Default does not require all files.
  -clipLen int: splits sound recordings up into clips of the given
      number of milliseconds. Set to 0 to turn off.
      Defaults to 0.
  -pad (no|zero|duplicate): when clips are shorter than the requests clip
      padding can be added to make all clips the same length. Some models may
      require this.  Zero padding sets the added samples to zero.  Duplicate
      reuses the sound as many times a necessary to set the added samples.
      No padding removes clips shorter than the requested clip length.
      Default is no padding.
Label examples: 
  ... -file myclassifier.cfr -clipLen 1000 number1.wav number2.wav
  ... -file myclassifier.cfr -clipLen 1000 -pad duplicate number1.wav
  ... -file myclassifier.cfr -clipLen 1000 -pad duplicate -sounds metadata.csv

Clone this wiki locally