I would be grateful for some help. I am a radiation oncologist trying to get au fait with Machine Learning. I have data! That’s the first hurdle I suppose.
My data exists in three CSV files - ClinicalData.CSV (in which resides the target variable for grouping; between 19 and 300 columns, many contain missing data), DVH.CSV (Dose-Volume Histogram data describing the radiation dose received by various ROIs dawn on a CT; ~700 columns), and PyRadiomics.CSV (feature manipulation of the same ROIs drawn on a CT; ~107 columns).
All CSV files contain the same patient ID. The DVH and PyRadiomics files have the same number of rows (each ROI for each patient gives one DVH row and one PyRadiomics row).
I would like to know how can I ‘combine’ these three files automatically for use in one ML instance.
Thanks for the assistance, both now and in the past. The community is a truly a benefit.