Train experts on a custom dataset #153

Open
vietanh125 opened this issue Jan 3, 2025 · 2 comments

Comments

@vietanh125

Hi,

I want to train several experts on my own QA dataset. How can I extend the code to work with such a custom dataset?

Thank you.

@sordonia
Member

sordonia commented Jan 6, 2025

Hi,

Try following the mbc_arrow_tutorial notebook.

Instead of calling get_datamodule(), you can create your own datamodule (with train_dataloader(), val_dataloader(), etc.) and pass it to the train_model() function.
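
As a rough illustration of the suggestion above, here is a minimal, dependency-free sketch of a datamodule that exposes train_dataloader() and val_dataloader(). Everything in it is an assumption made for illustration: the names QAExample, SimpleLoader and CustomQADataModule are hypothetical, the batch keys "sources" and "targets" are placeholders, and the tiny loader stands in for a real torch.utils.data.DataLoader with tokenization and collation. The exact interface that train_model() expects should be checked against the notebook and the repository code.

```python
import random
from dataclasses import dataclass
from typing import Dict, Iterator, List


@dataclass
class QAExample:
    """One question/answer pair (hypothetical container)."""
    question: str
    answer: str


class SimpleLoader:
    """Tiny stand-in for a real DataLoader: yields batches as dicts of lists."""

    def __init__(self, examples: List[QAExample], batch_size: int,
                 shuffle: bool = False, seed: int = 0):
        self.examples = examples
        self.batch_size = batch_size
        self.shuffle = shuffle
        self.seed = seed

    def __len__(self) -> int:
        # Number of batches, counting a final partial batch.
        return (len(self.examples) + self.batch_size - 1) // self.batch_size

    def __iter__(self) -> Iterator[Dict[str, List[str]]]:
        order = list(range(len(self.examples)))
        if self.shuffle:
            random.Random(self.seed).shuffle(order)
        for start in range(0, len(order), self.batch_size):
            chunk = [self.examples[i] for i in order[start:start + self.batch_size]]
            # The key names below are assumptions, not the library's schema.
            yield {
                "sources": [ex.question for ex in chunk],
                "targets": [ex.answer for ex in chunk],
            }


class CustomQADataModule:
    """Holds a train/validation split and hands out loaders for each."""

    def __init__(self, examples: List[QAExample], batch_size: int = 2,
                 val_fraction: float = 0.25):
        # Keep the last `val_fraction` of the examples for validation.
        n_val = max(1, int(len(examples) * val_fraction))
        self.train_examples = examples[:-n_val]
        self.val_examples = examples[-n_val:]
        self.batch_size = batch_size

    def train_dataloader(self) -> SimpleLoader:
        return SimpleLoader(self.train_examples, self.batch_size, shuffle=True)

    def val_dataloader(self) -> SimpleLoader:
        return SimpleLoader(self.val_examples, self.batch_size)
```

In a real setup the loaders would typically tokenize the text and return tensors; the sketch only shows the shape of the object that gets passed along in place of the result of get_datamodule().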

@vietanh125
Author

Thank you for your response. I have looked at the code, and my plan is:

  • Implementing CustomDataModule extending DataModule
  • Implementing CustomConfig extending DatasetConfig
  • Implementing CustomEvaluator extending EMEvaluator (I evaluate based on exact match)

Do you think it's enough? If not, what else should I do? Thank you!
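
For reference, the exact-match scoring mentioned in the plan above could look roughly like the sketch below. It is a standalone illustration, not the repository's EMEvaluator: the function names normalize_answer and exact_match are hypothetical, and the normalization (lowercasing, stripping punctuation and English articles, collapsing whitespace) is an assumption modeled on common QA evaluation practice that may need adjusting for a given dataset.

```python
import re
import string
from typing import List


def normalize_answer(text: str) -> str:
    """Lowercase, drop punctuation and articles, and collapse whitespace."""
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def exact_match(predictions: List[str], references: List[str]) -> float:
    """Fraction of predictions equal to their reference after normalization."""
    if len(predictions) != len(references):
        raise ValueError("predictions and references must have the same length")
    if not predictions:
        return 0.0
    hits = sum(
        normalize_answer(pred) == normalize_answer(ref)
        for pred, ref in zip(predictions, references)
    )
    return hits / len(predictions)
```

A custom evaluator would generate predictions with the trained expert and then call something like exact_match(predictions, references) on the validation or test split.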

2 participants