Can anyone explain how to use get request to obtain data from the web? I am trying to obtain local postal codes by street names. I have two different large data sets (over a million rows). One data set already has a column by the postal code, the other by street names.
- How do i use get request to find the postal code for street names in a column?
- Is it smart to break up the data to smaller size through sampling or partitioning? Or can i get the information by for the entire dataset and how quickly can it be processed?
- Should i use any specific request/response headers?
Below is the workflow that i am currently working through.
Link to data: