Path: blob/main/L4assets/DSandMLOpsAssets/HandsOn/Notebooks/DS Splitting modeling records.ipynb
1928 views
Kernel: Python 3.10
Splitting modeling records
CPDaaS: Make sure to first insert a "project token"
Click on the three vertical dots icon in the uper right of the screen, then click on Insert project token
Once inserted, execute the cell.
A project token is only available if you followed the prerequesite instructions to create on in your project.
Get the modeling data
In [ ]:
Split the records randomly 80/20
In some cases you would want to split 60/20/20 for training, testing, and validation.
When using SPSS modeler or AutoAI, the training/testing split is done during the processing.
For this reason, we simply want some validation records that were'nt use in training or testing for later work.
In [ ]:
Write the dataset to the project
In [ ]:
In [ ]:
Author
Jacques Roy is a member of the IBM Enablement for Data and AI
Copyright © 2023. This notebook and its source code are released under the terms of the MIT License.