drcxcruz
Contributor

@drcxcruz drcxcruz commented Jun 8, 2020

A new tutorial showcasing MLJ by C. Cruz

@tlienart
Collaborator

This is ongoing work on /cruz2; it will take me some time to go through the full tutorial and adjust a few things, sorry.

@drcxcruz
Contributor Author

drcxcruz commented Jun 20, 2020 via email

@tlienart
Collaborator

Actually Clarman, after now reading 2/3 of your tutorial and fixing a few things, I'm a bit uncomfortable with the fact that it uses synthetic data. I think a tutorial with this kind of depth would work better with real data, because people could relate to the data, do further analysis, and uncover things that may match their expectations or surprise them. Synthetic data is great for small tutorials where you show one thing, but here it's a bit awkward: the explanations go into quite some depth to give context, yet ultimately the data is generated.

What do you think?

@drcxcruz
Contributor Author

drcxcruz commented Jun 22, 2020 via email

@tlienart
Collaborator

tlienart commented Jun 23, 2020

Thanks a lot, this is much appreciated!

For good data sources, try https://datasetsearch.research.google.com and also UCI (https://archive.ics.uci.edu/ml/datasets.php?format=&task=&att=&area=&numAtt=&numIns=&type=&sort=dateDown&view=table). For UCI, I'd suggest taking anything that's more recent than 2010 and seems interesting to you.

@ablaom
Member

ablaom commented Jun 24, 2020

If you find one at OpenML, you can load it directly from MLJ using OpenML.load.
