I have a file with two columns. The first contains the objects, while the second contains one or multiple pieces of information (comma-separated).
I need to generate a matrix in which the first column contains the objects and the first row lists all the possible information. For each object, a cell gets the score 1 if the object matches that information in the input file; otherwise the cell is left empty.
An example:
Input:
input.txt (170 Bytes)
Substarte_name            Enzymes
Diazepam                  GSH
CHEMBL407009              ADH5, AKR1B10, AKR1C1, ALDH1A3, CBR1, CBR3, HSD11B1, NQO1
Valsartan                 GSH
Donezepil                 GSH
brenda_212032_CID_667436  UGT
Expected output matrix:
out_matrix.txt (196 Bytes)
                          GSH  ADH5  AKR1B10  AKR1C1  ALDH1A3  CBR1  CBR3  HSD11B1  NQO1  UGT
Diazepam                  1
CHEMBL407009                   1     1        1       1        1     1     1        1
Valsartan                 1
Donezepil                 1
brenda_212032_CID_667436                                                                  1
Could someone suggest a flow to do that?
Thanks in advance!
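For reference, the transformation described above could be sketched in Python roughly as follows. This is only a minimal sketch under assumptions that the post does not confirm: the input is tab-separated, its first line is a header, and the columns of the matrix should appear in order of first occurrence. The function name `build_matrix` and the `example` string are made up for illustration.

```python
import csv
import io


def build_matrix(text, delimiter="\t"):
    """Build presence-matrix rows from 'object<delimiter>item, item, ...' text.

    Assumes (not confirmed by the post) a delimited two-column input
    whose first line is a header.
    """
    reader = csv.reader(io.StringIO(text), delimiter=delimiter)
    next(reader, None)  # skip the header line

    objects = []   # (object name, set of its items), in input order
    columns = []   # every distinct item, in order of first appearance
    seen = set()

    for row in reader:
        if len(row) < 2:
            continue  # ignore blank or malformed lines
        name = row[0].strip()
        items = [item.strip() for item in row[1].split(",") if item.strip()]
        for item in items:
            if item not in seen:
                seen.add(item)
                columns.append(item)
        objects.append((name, set(items)))

    # First row: empty corner cell followed by all item names.
    matrix = [[""] + columns]
    # One row per object: "1" where the object has the item, "" otherwise.
    for name, items in objects:
        matrix.append([name] + ["1" if col in items else "" for col in columns])
    return matrix


# Example data copied from the post, written here as tab-separated text.
example = (
    "Substarte_name\tEnzymes\n"
    "Diazepam\tGSH\n"
    "CHEMBL407009\tADH5, AKR1B10, AKR1C1, ALDH1A3, CBR1, CBR3, HSD11B1, NQO1\n"
    "Valsartan\tGSH\n"
    "Donezepil\tGSH\n"
    "brenda_212032_CID_667436\tUGT\n"
)

for row in build_matrix(example):
    print("\t".join(row))
```

To use this with real files, the contents of input.txt could be read with `open("input.txt").read()` and each row written back out joined by tabs. If the file uses a different separator, the `delimiter` argument would need to change accordingly.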