Hi all,
I wrote a little scraper with selenium and pyvirtualdisplay. It ran about five to six minutes and scraped some data into a csv that came to about 11kb.
However, while it was running, my file storage jumped from like 88% full to about 95% full.
I am making diligent search but I don't think I had any other scrapers downloading anything else at that time and I'm not finding any huge files created during that time.
I'm not super clear on Selenum and virtual display -- is there any chance those processes are creating files somewhere?
FWIW, python 2.7, emulating Firefox browser, and the code is organized like:
with Display:
try:
driver = webdriver.Firefox()
// stuff
finally:
driver.quit()