Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Creating own dataset #8

Open
Wild1234 opened this issue Apr 3, 2018 · 4 comments
Open

Creating own dataset #8

Wild1234 opened this issue Apr 3, 2018 · 4 comments

Comments

@Wild1234
Copy link

Wild1234 commented Apr 3, 2018

Good night.

I want to create my own dataset with my own labels. is it possible for this repository?

Thanks

@igor-panteleev
Copy link
Contributor

Hi,
Yes, you can use your own dataset.

You need to extract features from raw data.
To do this take a look to vggish lib here.

Also you can find how i did this on the fly.

Hope it will help.

@Lelo123
Copy link

Lelo123 commented Apr 6, 2018

Thanks!

But isn't necessary to train a model with youtube 8M?

@igor-panteleev
Copy link
Contributor

Short answer - yes.
Two models have been used here.
vggish - as feature extractor.
youtube8m - as classifier.

So if you want to use different labels you need to extract features using vggish and then train youtube 8m model with these features.

@hei9gag
Copy link

hei9gag commented Nov 29, 2018

Hi,

I still have some questions about creating own data set.

The training script provided by youtube8m is using .tfrecord extension. Do you know how to generate this format for audio?

Also, how do I add my custom labels using the youtube8m training model? Many thanks.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants