label
David Wood edited this page Aug 27, 2022
·
1 revision
Classifies sounds using a model and writes a metadata-formatted labeling of the sounds to stdout. The -clipLen option is not strictly required, but it is typically used to classify sub-segments of the input sound(s). Segments from the same file that are adjacent in time and have the same label name, as produced by the model, are merged.

-file file : specifies the file containing the model to load.

[list of wav files] : specifies 1 or more .wav files without labeling. File names are space-separated. These cannot be used with the -sounds option.

-sounds csv list of (dir|metadata.csv) : specifies 1 or more metadata.csv files referencing sounds, or directories containing a metadata.csv. This is an alternative to a list of wav files, but adds labels from the metadata files.

-metadata (all|some) : controls whether all files listed in the metadata file must be present. Only used with the -sounds option. The default does not require all files.

-clipLen int : splits sound recordings into clips of the given number of milliseconds. Set to 0 to turn off. Defaults to 0.

-pad (no|zero|duplicate) : when clips are shorter than the requested clip length, padding can be added to make all clips the same length. Some models may require this. Zero padding sets the added samples to zero. Duplicate padding reuses the sound as many times as necessary to fill the added samples. No padding removes clips shorter than the requested clip length. Default is no padding.

Label examples:

... -file myclassifier.cfr -clipLen 1000 number1.wav number2.wav

... -file myclassifier.cfr -clipLen 1000 -pad duplicate number1.wav

... -file myclassifier.cfr -clipLen 1000 -pad duplicate -sounds metadata.csv
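
The clipping, padding, and merging behavior described above can be illustrated with a small Python sketch. This is not the tool's implementation: the function names `split_and_pad` and `merge_adjacent` are hypothetical, the clip length is given in samples rather than milliseconds to keep the example short, and "adjacent in time" is assumed to mean that one segment ends exactly where the next begins.

```python
from typing import List, Tuple

def split_and_pad(samples: List[float], clip_len: int, pad: str = "no") -> List[List[float]]:
    """Split samples into clips of clip_len samples (assumption: samples, not ms).

    pad mirrors the three -pad modes: "no", "zero", "duplicate".
    """
    if clip_len <= 0:
        return [list(samples)]  # clipping turned off, as with -clipLen 0
    clips = []
    for start in range(0, len(samples), clip_len):
        clip = list(samples[start:start + clip_len])
        if len(clip) < clip_len:
            if pad == "no":
                continue  # short clips are removed
            elif pad == "zero":
                clip += [0.0] * (clip_len - len(clip))  # fill with zeros
            elif pad == "duplicate":
                original = list(clip)
                while len(clip) < clip_len:
                    # reuse the short clip as many times as necessary
                    clip += original[:clip_len - len(clip)]
            else:
                raise ValueError(f"unknown pad mode: {pad}")
        clips.append(clip)
    return clips

# A labeled segment: (start_ms, end_ms, label)
Segment = Tuple[int, int, str]

def merge_adjacent(segments: List[Segment]) -> List[Segment]:
    """Merge time-adjacent segments of one file that carry the same label."""
    merged: List[Segment] = []
    for start, end, label in sorted(segments):
        if merged and merged[-1][2] == label and merged[-1][1] == start:
            merged[-1] = (merged[-1][0], end, label)  # extend the previous segment
        else:
            merged.append((start, end, label))
    return merged

if __name__ == "__main__":
    print(split_and_pad([1, 2, 3, 4, 5], 2, "duplicate"))
    print(merge_adjacent([(0, 1000, "one"), (1000, 2000, "one"), (2000, 3000, "two")]))
```

With a 5-sample input and a clip length of 2, the final 1-sample clip is dropped under "no", zero-filled under "zero", and repeated under "duplicate". Whether the actual tool handles boundary cases in exactly this way is not stated on this page.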