Changelog¶
v0.1.0a¶
v0.1.0a3¶
2021-04-09
- Bug fixes
- no longer use Librosa for feature extraction: allow easier implementation of augmentations, especially during training. 
- soundpy.feats.plotnow uses parameter- subprocessto allow for different backends to be applied, depending on when funciton is called. For example, if plotting from within a Generator while training,- subprocessshould be set to True, and the ‘Agg’ backend will be applied. Otherwise, ‘TkAgg’ backend is used. Fixes issues with multi-threading.
- Fixed generator and Tensorflow issue: with Tensorflow 2.2.0+ the models in - soundpy.models.builtinthat were trained via generator failed. Use- tensorflow.data.Dataset.from_generatorto feed generator data to models.
- Improved clip_at_zero. 
 
- Features
- Python 3.8 can now be used. 
- throw depreciation warning for parameters context_window or frames_per_sample as these “features” will be removed from feature extraction. Rather the features can be reshaped post feature extraction. 
- added timestep, axis_timestep, context_window, axis_context_window and combine_axes_0_1 paremeters to - soundpy.models.Generator: allow more control over shape of the features.
- can run - soundpy.models.builtin.envclassifier_extract_trainto run with pre-extracted val and test features.
- soundpy.feats.plotsound,- soundpy.feats.plot_vadand- soundpy.feats.plot_dom_freqall can plot stereo sound: for each channel in a stereo signal, a plot is either generated or saved. If a filename already exists, a date stamp is added to filename to avoid overwriting images.
- allow grayscale2color to be applied to 2D data. 
 
- Breaking changes
- soundpy.models.Generatoruses parameter normalize instaed of normalized. Found this to be more intuitive. If normalize is set to True, data will be normalized. Before, if normalized was set to True, data would not be normalized.
- removed add_tensor_last and add_tensor_first: require adding of tensors (for keras) to be included in parameter desired_input_shape. 
 
- Other changes
- CPU soundpy can use Tensorflow 2.1.0, 2.2.0 and 2.3.0. Dockerfile still uses Tensorflow 2.1.0 as it is still compatible with updated code. 
- soundpy.models.builtin.implement_denoiser raises warning if cleaned features cannot be converted to raw audio samples. 
 
v0.1.0a2¶
2020-08-13
- Bug fixes
- added use_beg_ms parameter in - soundpy.dsp.vad: improved VAD recognition of silences post speech.
 
- Features
- added GPU option: provide instructions and Docker image for running SoundPy with GPU 
- added extend_window_ms paremeter to - soundpy.feats.get_vad_samplesand- soundpy.feats.get_vad_stft: can extend VAD window if desired. Useful in higher SNR environments.
- added - soundpy.feats.get_samples_clippedand- soundpy.feats.get_stft_clippedto clip off beginning and ending silences.
- added beg_end_clipped parameter to - soundpy.feats.plot_vadto visualize VAD by clipping the beginning and ending silences (if True) or VAD instances throughout the signal (if False).
- added - soundpy.models.dataprep.GeneratorFeatExtractionclass for extracting and augmenting features during training (still experimental).
- added - soundpy.models.builtin.envclassifier_extract_trainas an example of extracting and augmenting features during training (still experimental).
- added - soundpy.dsp.clip_at_zeroto enable smoother concatenations of signals and enables removal of clicks at beginning and ending of signals.
- added - soundpy.dsp.remove_dc_biasto enable smoother concatenations of signals
- added and set remove_dc parameter to True in - soundpy.files.loadsoundand- soundpy.files.savesoundto ensure signals all have mean zero.
- added mirror_sound option to - soundpy.dsp.apply_sample_lengthas a way to extend sound.
- added - soundpy.dsp.ismonoto check if samples were mono or stereo.
- added - soundpy.dsp.average_channelsto average sample amplitudes across channels, e.g. to identify where high energy begins / ends in the signal without disregarding additional channels (if stereo sound).
- added - soundpy.dsp.add_channelsfor adding additional channels if needed (e.g. for applying a ‘hann’ or ‘hamming’ window to stereo sound)
- added stereo sound functionality to - soundpy.dsp.add_backgroundsound,- soundpy.dsp.clip_at_zero,- soundpy.dsp.calc_fft,- soundpy.feats.get_stft,- soundpy.feats.get_vad_stft
 
- Other changes
- name change: from pysoundtool to soundpy: simpler 
- updated dependencies to newest versions still compatible with Tensorflow 2.1.0 
- moved soundpy.dsp.get_vad_samples to - soundpy.feats.get_vad_samples
- moved soundpy.dsp.get_vad_stft to - soundpy.feats.get_vad_stft
- name change: allow - soundpy.feats.normalizeto be used as- soundpy.normalize
- removed pysoundtool_online and mybinder button as maintaining the online version was not easily done. Aim to reimplement at some point. 
 
v0.1.0a1¶
Initial public alpha release.