✨ MOVE update #89

Open. Wants to merge 173 commits into base: developer

@ri-heme (Collaborator) commented on Dec 13, 2023

No description provided.

@ri-heme changed the title from "Developer move 2.0" to "✨ MOVE update" on Dec 13, 2023
- Add MOVE task
- Add log function to sub-tasks
- Add sub-task class with CSV writer
- Make training loop CSV writer sub-task
- Set up mixins for i/o directories
- Make subtask a mixin
- Make CSV writing a mixin
- Place mixin definitions first to ensure they take precedence
- Add perturbation class and property to MOVE dataset
- Add checks to ensure MOVE datasets have unique names
- Add one-hot encoding function to discrete datasets
- Add checks to ensure named datasets have unique features
- Because MOVE dataset returns a tuple, update batch loops to take first element
- Add type hints to MOVE dataloader
- Make data subset size configurable (training, test, validation size) in data config
- Add option to standardize data without applying base-2 log
- During encoding, generate split indices and save
- When re-loading a dataset, use split indices to load a specific subset of data
- Make the default subset "all" (so that tasks that do not require a train/test split are unaffected)
- Read "./." genotypes as NA
- Reindex dataframe (include NA rows if missing samples)
- Handle data loader config internally, only require users to input batch size
- Keep dimensions if NaNs are present
- Remove NaNs before drawing the boxplot, which would otherwise fail
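
The commit about placing mixin definitions first relies on Python's method resolution order: when two bases define the same method, the one listed first in the class statement wins. A minimal sketch of that behaviour follows. The class names (`Task`, `LoggingMixin`, `MixinFirst`, `MixinLast`) are hypothetical and are not taken from the MOVE code base.

```python
class Task:
    """Hypothetical base task with a default log method."""

    def log(self, message: str) -> str:
        return f"[task] {message}"


class LoggingMixin:
    """Hypothetical mixin that overrides log()."""

    def log(self, message: str) -> str:
        return f"[mixin] {message}"


class MixinFirst(LoggingMixin, Task):
    """Mixin listed first: its log() is found first in the MRO."""


class MixinLast(Task, LoggingMixin):
    """Mixin listed last: Task.log() shadows the mixin's version."""


mixin_first_output = MixinFirst().log("start")
mixin_last_output = MixinLast().log("start")
```

Inspecting `MixinFirst.__mro__` shows the lookup order directly, which can help when several mixins are combined.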
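
The commits on split indices describe three steps: generate indices during encoding, save them, and use them on reload to select a subset, with "all" as the default. The sketch below illustrates that flow with the standard library only. The function names, the JSON file format, the file name `split_indices.json`, and the fraction parameters are all assumptions made for illustration; the PR does not state how MOVE stores or names these.

```python
import json
import random
import tempfile
from pathlib import Path


def make_split_indices(n_samples, test_frac, valid_frac, seed=0):
    """Shuffle sample positions and partition them into three subsets."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    n_test = int(n_samples * test_frac)
    n_valid = int(n_samples * valid_frac)
    return {
        "test": sorted(indices[:n_test]),
        "valid": sorted(indices[n_test:n_test + n_valid]),
        "train": sorted(indices[n_test + n_valid:]),
    }


def select_subset(rows, splits, split="all"):
    """Return the rows of one subset; "all" ignores the split indices."""
    if split == "all":
        return list(rows)
    return [rows[i] for i in splits[split]]


rows = [f"sample_{i}" for i in range(10)]
splits = make_split_indices(len(rows), test_frac=0.2, valid_frac=0.1)

# Save the indices once (hypothetical JSON format), then reload them.
with tempfile.TemporaryDirectory() as tmp:
    path = Path(tmp) / "split_indices.json"
    path.write_text(json.dumps(splits))
    reloaded = json.loads(path.read_text())

train_rows = select_subset(rows, reloaded, "train")
all_rows = select_subset(rows, reloaded)  # default: no subsetting
```

Persisting the indices rather than re-drawing them keeps the train, test, and validation subsets identical across runs, and the "all" default leaves tasks that never split the data unaffected.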
2 participants