Develop
a crawler that collects the email addresses in the visited web pages. You can write
a function emails() that takes a document (as a string) as
input and returns the distinct
email addresses appearing in it [remove duplicate email addresses]. You should use a regular
expression to find the email addresses in the document.
1.
Use
your WebCrawler to check email links in Lab3_Webpage.html (2 points)
2.
Use
your WebCrawler to check email links in any URL. (2 points)
3.
Explain
the code and how it works. (1 point)
Deliverables:
·
Python program (.py file)
·
Screenshot(s) of your WebCrawler output in your
python IDE
·
Explanation of work (word file)
Sun | Mon | Tue | Wed | Thu | Fri | Sat |
---|---|---|---|---|---|---|
27 | 28 | 29 | 30 | 1 | 2 | 3 |
4 | 5 | 6 | 7 | 8 | 9 | 10 |
11 | 12 | 13 | 14 | 15 | 16 | 17 |
18 | 19 | 20 | 21 | 22 | 23 | 24 |
25 | 26 | 27 | 28 | 29 | 30 | 31 |