diff --git a/scrape/Readme.md b/scrape/Readme.md index bb9ef9b3..65272b3d 100644 --- a/scrape/Readme.md +++ b/scrape/Readme.md @@ -21,4 +21,14 @@ Update the ``URL`` parameter to use your filtered url from Funda. URL = "https://www.funda.nl/zoeken/koop?selected_area=%5B%22utrecht,15km%22%5D&price=%22-400000%22&object_type=%5B%22house%22%5D" ``` -Next you should be able to scrape the data from funda. See the RTD for more docs on how to setup OSRM and use the results. \ No newline at end of file +Next you should be able to scrape the data from funda. See the RTD for more docs on how to setup OSRM and use the results. + +## Panda +To just interact with the panda dataframe: +```python +data = pickle.load(open('panda_dump.bin', 'rb')) +type(data) + +data.descrip.get(0) +"Aan de rand van de populaire woonwijk 'De Hagen' te Vianen staat deze fijne tussenwoning met groenstrook en water voor de deur. De buurt straalt een gemoedelijke sfeer uit en[..]" +``` \ No newline at end of file diff --git a/scrape/panda_dump.bin b/scrape/panda_dump.bin new file mode 100644 index 00000000..4ce53a13 Binary files /dev/null and b/scrape/panda_dump.bin differ