Hello,
I am new to Knime but currently learning quick
I have a table with web visitor instances almost 100,000 rows . Each instance has a session ID, this unique ID is assigned to a user so that to understand user behavior. The user may visit the website in different times but the ID will be kept the same. We have a data and hour and minute of a specific user visiting a site at different times. Please keep in mind that the user may visits the website several times in one hour and still have same ID ( ID X instances)
I have another table (Weather table) that shows the weather condition of a city. The unique ID for this table is the date and hour (a weather condition at a specific day and specific hour)
My task is to understand the influence of weather on web visitors (visitor beaver) so I wanted to merge this two tables using the date and hour column. But my problem is that a single visitor may have several instances (rows) in a single hour While the weather table can only have one single value per hour.
Example Visitor Table
Session ID |
Date and Time |
Hour |
Minute |
Page path |
Distance from platform |
2324 |
2015-11-23 12:00 |
12 |
00 |
xyz.com/products/electronics |
locall |
2324 |
2015-11-23 12:30 |
12 |
30 |
xyz.com/products/books |
locall |
4547 |
2015-11-23 13:00 |
13 |
00 |
Xyz.com/products/pets |
distance |
6784 |
2015-11-23 14:00 |
14 |
00 |
Xyz.com/products/tv |
locall |
6784 |
2015-11-23 14:30 |
14 |
30 |
Xyz.com/products/electronics |
locall |
6784 |
2015-11-23 14:55 |
14 |
55 |
Xyz.com/products/computers |
locall |
Example of Weahter table
Hour of the Day |
Houre |
City |
Temprature |
Wind |
Rain |
Season |
Weather category |
2015-11-23 |
12 |
ABC |
12.6 |
4.2 |
1.2 |
Winter |
moderate |
2015-11-23 |
13 |
ABC |
3.6 |
3.6 |
5.2 |
Winter |
hgjgjh |
2015-11-23 |
14 |
ABC |
6.8 |
4.2 |
2.9 |
Winter |
hgjgjh |
2015-11-23 |
15 |
ABC |
17.4 |
0.4 |
0 |
Winter |
hgjgjh |
2015-11-23 |
16 |
ABC |
17.2 |
0.5 |
0 |
Winter |
hgjgjh |
2015-11-23 |
17 |
ABC |
16.9 |
2.6 |
1.1 |
Winter |
hgjgjh |
So far have merged the date and hour columns in to one and then formatted the dates on the two tables using the “String to Date“ node and after that I try to merge them but the number of rows after merge are way less than I have expected and I suspect in the visitor table, the existence of multiple lines in a same hour might have affected .
Can you please advise on how to merge this two tables. And any further recommendations appreciated.
Thank you