Fix pitch data overhangs in freq domain #7

anthonio9 · 2024-01-27T22:03:31Z

Just noticed that the data pitch labels are wrong. Many beginnings and endings of the pitch seem to overhang like in the picture below. This is totally unacceptable and very possibly the cause of the poor raw pitch accuracy of around 80%.

Fix that!

Check:

labels in the original dataset
data preprocessing when creating the gset dataset

anthonio9 · 2024-01-27T23:08:19Z

After the initial review of the original data sets my conclusion is that the dataset is really bad or at least the pitch values are. Every ending is indeed seen noted as falling down in pitch, even though such fall isn't audible in the recordings (not even those with the separated strings). This means that the dataset needs to have those endings cut manually or have all the pitch tracking redone with a better method, perhaps with the original penn?

anthonio9 · 2024-01-29T22:40:26Z

There must be a problem with the data processing, the nodes that are so much below the other ones do not exist in the labeled data, below is the file of 05_BN3-119-G_solo_mic.wav

anthonio9 · 2024-01-29T23:12:21Z

Some data seems to be missing, this is indeed missing as it is present in the audio. The next step is to write a program printing the original data labels over the STFT image. I don't see any other way than comparing what was there originally with what I got after preprocessing. Still analyzing 05_BN3-119-G_solo_mic.wav

Plot data as it is in the dataset, without any processing. Raw pitch data is plotted with plotly, next steps are midi and spectrogram. related to: #7

anthonio9 · 2024-02-01T22:57:56Z

This is already slightly better, got rid of putting 'zeros' into the almost raw data, it seems that all the overhangs are still there, but at least not the most ridiculous ones, again file 05_BN3-119-G_solo_mic.wav.

anthonio9 · 2024-02-01T23:04:23Z

It could be possible to slighly help with deleting the unwanted values with plotly!
https://plotly.com/python/v3/selection-events/?_gl=1*1fq1w1w*_ga*MTI0MDkzODAyNy4xNzA1NDM5MTky*_ga_6G7EE0JNSC*MTcwNjgyNTcxNi4zLjEuMTcwNjgyODM2NC41MS4wLjA.

anthonio9 · 2024-02-02T14:23:43Z

This may also be helpful for learning more about the FigureWidget
https://medium.com/@jacky.kaub/build-custom-widgets-with-ipywidgets-and-plotly-a454cb3b2b4f

related to: #7

anthonio9 · 2024-02-02T23:14:09Z

I was really hoping that midi would help with seeing where the proper parts of the node end, sadly that is not the case.
Anyway, midi is now printed on top of the spectrogram and pitch data. The next step is to try to fix the endings of the pitch notes with some kind of a simple algorithm.

This is what the plot looks like now:

Introduce penn.convert.midi_to_frequency related to: #7

anthonio9 · 2024-02-03T14:38:53Z

Another small break through. Resampling was obviously the issue! To get rid of unwanted fields in strange spots that were not in the original data it was enough to disable resampling for sampling rates that divide the GuitarSet native sampling rate of 44100 Hz without a remainder - that leaves the 11025, 22050 sampling rates without the need to resample.

Sampling rates 11025 Hz and 22050 Hz are safe and do not need resampling. This simple fix gets rid of many ugly data points that were not present in the original data annotations. related to: #7

Had to cut the labels, probably around two from each string at the end to fit the audio data better. Really do not know why exactly. related to: #7

The jam track is separated into notes ordered by strings, each of those contains multiple pitches. related to: #7

remove_overhangs attempts to remove the overhangs from the last 20% of a note if they are below the average of the other 80% of the note. related to: #7

And add new config options to enable and manipulate the overhangs removal: REMOVE_OVERHANGS - set True to enable the removal, REMOVE_OVERHANGS_DIVIDER - (int) set to manipulate the length of the overhang, REMOVE_OVERHANGS_THRESHOLD - (int) set the threshold in cents related to: #7

anthonio9 added the bug Something isn't working label Jan 27, 2024

anthonio9 self-assigned this Jan 27, 2024

anthonio9 added a commit that referenced this issue Feb 1, 2024

Plot raw pitch data

3783966

Plot data as it is in the dataset, without any processing. Raw pitch data is plotted with plotly, next steps are midi and spectrogram. related to: #7

anthonio9 added a commit that referenced this issue Feb 2, 2024

Plot raw pitch data with stft

d432958

related to: #7

anthonio9 added a commit that referenced this issue Feb 2, 2024

Plot midi and pitch over stft

2cf80fd

Introduce penn.convert.midi_to_frequency related to: #7

anthonio9 added a commit that referenced this issue Feb 3, 2024

Cut labels to fit the audio data

ed56c54

Had to cut the labels, probably around two from each string at the end to fit the audio data better. Really do not know why exactly. related to: #7

anthonio9 added a commit that referenced this issue Feb 5, 2024

Cut labels to fit the audio data

bfaa0fb

Had to cut the labels, probably around two from each string at the end to fit the audio data better. Really do not know why exactly. related to: #7

anthonio9 added a commit that referenced this issue Feb 5, 2024

Convert jam track to notes

457055b

The jam track is separated into notes ordered by strings, each of those contains multiple pitches. related to: #7

anthonio9 added a commit that referenced this issue Feb 7, 2024

Convert notes dict to pitch dict and remove overhangs

fe1bc91

remove_overhangs attempts to remove the overhangs from the last 20% of a note if they are below the average of the other 80% of the note. related to: #7

anthonio9 added a commit that referenced this issue Feb 7, 2024

Convert notes dict to pitch dict and remove overhangs

a21026d

remove_overhangs attempts to remove the overhangs from the last 20% of a note if they are below the average of the other 80% of the note. related to: #7

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix pitch data overhangs in freq domain #7

Fix pitch data overhangs in freq domain #7

anthonio9 commented Jan 27, 2024

anthonio9 commented Jan 27, 2024

anthonio9 commented Jan 29, 2024 •

edited

Loading

anthonio9 commented Jan 29, 2024 •

edited

Loading

anthonio9 commented Feb 1, 2024

anthonio9 commented Feb 1, 2024

anthonio9 commented Feb 2, 2024

anthonio9 commented Feb 2, 2024

anthonio9 commented Feb 3, 2024

Fix pitch data overhangs in freq domain #7

Fix pitch data overhangs in freq domain #7

Comments

anthonio9 commented Jan 27, 2024

anthonio9 commented Jan 27, 2024

anthonio9 commented Jan 29, 2024 • edited Loading

anthonio9 commented Jan 29, 2024 • edited Loading

anthonio9 commented Feb 1, 2024

anthonio9 commented Feb 1, 2024

anthonio9 commented Feb 2, 2024

anthonio9 commented Feb 2, 2024

anthonio9 commented Feb 3, 2024

anthonio9 commented Jan 29, 2024 •

edited

Loading

anthonio9 commented Jan 29, 2024 •

edited

Loading