{"componentChunkName":"component---src-templates-project-post-js","path":"/projects/2019-05-06-python-web-crawler-w-regex/","result":{"data":{"markdownRemark":{"id":"fce7be92-4004-5036-9961-6281fcba4460","html":"<p>Python script which accepts a urls text file (urls.txt) in the root directory, of which contains a line seperated list of URLS to webcrawl/scan. More specifically, scans each HTML page served at the URL then parses the results for links to which will output a CSV file containing the results.</p>","frontmatter":{"date":"May 07, 2019","title":"Python Web Crawler w/ Regex","github":"https://github.com/daylennguyen/PythonWebCrawler","description":"Python script which accepts a urls text file (urls.txt) in the root directory, of which contains a line seperated list of URLS to webcrawl/scan. More specifically, scans each HTML page served at the URL then parses the results for links to which will output a CSV file containing the results.","tags":["python","tkinter","webcrawler","web","crawl","python3","link parsing","regex","regular-expression"],"featimage":"https://ucarecdn.com/4584d685-0feb-45f2-93b7-621915a15a56/"}}},"pageContext":{"id":"fce7be92-4004-5036-9961-6281fcba4460"}},"staticQueryHashes":["3020398965"]}