Hi, I am new to KNIME. I would be very thankful if someone could help me or give me some advice on a question related to bioinformatics. I have a spreadsheet which contains the names of 1000 genes, and I would like to collect information from the UniProt database for each gene. I can download the UniProt database in several different formats, for instance FASTA or text, and I can read the data into an Excel sheet (I currently store the relevant part of the database, around 500,000 rows, in an Excel sheet).
So I have an input spreadsheet with gene names, for instance BCR1, and I would like to search the "Gene Name" column in the UniProt spreadsheet (or the FASTA or text file) to find the row which contains the information for each gene, and then store the matching rows in a separate file.
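To make the task concrete, here is a rough sketch of the lookup I have in mind, written in plain Python rather than as a KNIME workflow. Only the "Gene Name" column and the gene BCR1 come from my data; the function name `filter_by_gene`, the other columns, and the three sample rows are made up for illustration, and I am assuming that matching should ignore upper/lower case.

```python
import csv
import io


def filter_by_gene(gene_names, table_rows, column="Gene Name"):
    """Return the table rows whose gene-name column matches a wanted gene.

    Matching ignores case and surrounding whitespace (an assumption).
    """
    wanted = {name.strip().upper() for name in gene_names if name.strip()}
    return [
        row for row in table_rows
        if row.get(column, "").strip().upper() in wanted
    ]


# Made-up, tab-separated stand-in for the large database table.
table_text = (
    "Entry\tGene Name\tOrganism\n"
    "X00001\tBCR1\tExample organism A\n"
    "X00002\tABC2\tExample organism B\n"
    "X00003\tXYZ3\tExample organism C\n"
)
reader = csv.DictReader(io.StringIO(table_text), delimiter="\t")
rows = list(reader)

# Stand-in for the input spreadsheet with the gene names of interest.
matches = filter_by_gene(["BCR1", "xyz3"], rows)

# Write the matching rows out; a real run would use open("result.tsv", "w", newline="").
out = io.StringIO()
writer = csv.DictWriter(out, fieldnames=reader.fieldnames, delimiter="\t")
writer.writeheader()
writer.writerows(matches)
print(out.getvalue())
```

With 1000 gene names and around 500,000 rows, putting the names in a set means the big table is only read through once, so something like this should stay fast.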
Is there a ready-made workflow for such a task, or is there a simple way to learn how to create one?
I would be very grateful for any sort of help on this matter! Also, sorry if I have posted this in the wrong forum!
Best regards,
Bobby